Text Models
| Model Name | Model ID | Price (in/out) | Context Limit | Capabilities | Traits |
|---|
| GLM 4.6 | zai-org-glm-4.6 | $0.85 / $2.75 | 202,752 | Function Calling | — |
| Venice Uncensored 1.1 | venice-uncensored | $0.20 / $0.90 | 32,768 | — | most_uncensored |
| Venice Small | qwen3-4b | $0.05 / $0.15 | 32,768 | Function Calling, Reasoning | — |
| Venice Medium (3.1) | mistral-31-24b | $0.50 / $2.00 | 131,072 | Function Calling, Vision | default_vision |
| Venice Large 1.1 (D) | qwen3-235b | $0.45 / $3.50 | 131,072 | Function Calling, Reasoning | — |
| Qwen 3 235B A22B Thinking 2507 | qwen3-235b-a22b-thinking-2507 | $0.45 / $3.50 | 131,072 | Function Calling, Reasoning | — |
| Qwen 3 235B A22B Instruct 2507 | qwen3-235b-a22b-instruct-2507 | $0.15 / $0.75 | 131,072 | Function Calling | — |
| Llama 3.2 3B | llama-3.2-3b | $0.15 / $0.60 | 131,072 | Function Calling | fastest |
| Llama 3.3 70B | llama-3.3-70b | $0.70 / $2.80 | 131,072 | Function Calling | default, function_calling_default |
| Qwen 3 Coder 480B | qwen3-coder-480b-a35b-instruct | $0.75 / $3.00 | 262,144 | Function Calling | default_code |
Pricing is per 1M tokens (input / output). Additional usage-based pricing applies when using enable_web_search or enable_web_scraping, see search pricing details.
Model Change Notice: Starting December 14, 2025, qwen3-235b will be deprecated and calls will automatically route to qwen3-235b-a22b-thinking-2507.The disable_thinking parameter will be ignored. For non-thinking behavior, use qwen3-235b-a22b-instruct-2507 directly. Learn more about model changes.
Beta Models
| Model Name | Model ID | Price (in/out) | Context Limit | Capabilities | Traits |
|---|
| OpenAI GPT OSS 120B | openai-gpt-oss-120b | $0.07 / $0.30 | 131,072 | Function Calling | — |
| Google Gemma 3 27B Instruct | google-gemma-3-27b-it | $0.12 / $0.20 | 202,752 | Function Calling, Vision | — |
| Qwen 3 Next 80B | qwen3-next-80b | $0.35 / $1.90 | 262,144 | Function Calling | — |
| DeepSeek R1 | deepseek-ai-DeepSeek-R1 | $0.85 / $2.75 | 131,072 | Function Calling | — |
| Hermes 3 Llama 3.1 405B | hermes-3-llama-3.1-405b | $1.10 / $3.00 | 131,072 | — | — |
Beta models are experimental and not recommended for production use. These models may be changed, removed, or replaced at any time without notice. Use them for testing and evaluation purposes only. For production applications, use the stable models listed above.
Image Models
| Model Name | Model ID | Price | Model Source | Traits |
|---|
| Nano Banana Pro | nano-banana-pro | $0.18 | Nano Banana Pro | — |
| Venice SD35 | venice-sd35 | $0.01 | Stable Diffusion 3.5 Large | default, eliza-default |
| HiDream | hidream | $0.01 | HiDream I1 Dev | — |
| Qwen Image | qwen-image | $0.01 | Qwen Image | — |
| Lustify SDXL | lustify-sdxl | $0.01 | Lustify SDXL | — |
| Lustify v7 | lustify-v7 | $0.01 | Lustify v7 | — |
| Anime (WAI) | wai-Illustrious | $0.01 | WAI-Illustrious | — |
Audio Models
Text-to-Speech Models
tts-kokoro Kokoro TTS - 60+ multilingual voices for natural speech
| Model Name | Model ID | Price | Voices Available | Model Source |
|---|
| Kokoro Text to Speech | tts-kokoro | $3.50 per 1M chars | 60+ voices | Kokoro-82M |
The tts-kokoro model supports a wide range of multilingual and stylistic voices (including af_nova, am_liam, bf_emma, zf_xiaobei, and jm_kumo). Voice is selected using the voice parameter in the request payload.
Embedding Models
text-embedding-bge-m3 BGE-M3 - Versatile embedding model for text similarity
| Model Name | Model ID | Price | Model Source |
|---|
| BGE-M3 | text-embedding-bge-m3 | $0.15 / $0.60 per 1M tokens | KimChen/bge-m3-GGUF |
Image Processing Models
upscaler Image Upscaler - Enhance image resolution up to 4x
qwen-image Qwen Image - Multimodal image editing model
Image Upscaler
| Model Name | Model ID | Price | Upscale Options |
|---|
| Upscaler | upscaler | $0.01 | 2x ($0.02), 4x ($0.08) |
Image Editing (Inpaint)
| Model Name | Model ID | Price | Model Source | Traits |
|---|
| Qwen Image | qwen-image | $0.04 | Qwen Image | specialized_editing |
Model Features
- Vision: Ability to process and understand images
- Reasoning: Advanced logical reasoning capabilities
- Function Calling: Support for calling external functions and tools
- Traits: Special characteristics or optimizations (e.g., fastest, most_intelligent, most_uncensored)
Usage Notes
- Input pricing refers to tokens sent to the model
- Output pricing refers to tokens generated by the model
- Context limits define the maximum number of tokens the model can process in a single request
- (D) Scheduled for deprecation. For timelines and migration guidance, see the Deprecation Tracker.