Compare AI model APIs by modality, provider, price, and context window
Browse 4,357 model API records across 116 providers. Each section below groups models by input and output modality, so you can compare text, image, audio, video, PDF, and multimodal APIs in one place.
Filter by input and output modality
Choose an input type and an output type to quickly narrow the directory. Add useful rows to the browser compare list, then open Compare to start with those models selected.
Compare text -> text model APIs by price, context, and capabilities.
Compare text + image -> text model APIs by price, context, and capabilities.
Compare text + image + pdf -> text model APIs by price, context, and capabilities.
Compare multimodal -> text model APIs by price, context, and capabilities.
Compare text + image + video -> text model APIs by price, context, and capabilities.
Compare image + pdf + text -> text model APIs by price, context, and capabilities.
Compare image + text -> text model APIs by price, context, and capabilities.
Compare text + pdf -> text model APIs by price, context, and capabilities.
Compare text -> image model APIs by price, context, and capabilities.
Compare image + text + video -> text model APIs by price, context, and capabilities.
Compare text + image -> image model APIs by price, context, and capabilities.
Compare text + image -> text + image model APIs by price, context, and capabilities.
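The modality filter described above can be sketched in a few lines of Python. This is only an illustration: the `ModelRecord` shape, the `filter_by_modality` helper, and the two `example-*` rows are hypothetical, since the directory's actual schema and filtering code are not shown here. The sketch compares modalities as sets, so `text + image` and `image + text` match the same records, whereas the groupings listed above treat those orderings as separate entries.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List


@dataclass(frozen=True)
class ModelRecord:
    """Hypothetical record shape; the directory's real schema is not shown."""
    provider: str
    model: str
    inputs: FrozenSet[str]
    outputs: FrozenSet[str]


def filter_by_modality(
    records: Iterable[ModelRecord],
    inputs: Iterable[str],
    outputs: Iterable[str],
) -> List[ModelRecord]:
    """Keep records whose input and output modality sets match exactly."""
    want_in = frozenset(inputs)
    want_out = frozenset(outputs)
    return [r for r in records if r.inputs == want_in and r.outputs == want_out]


# Illustrative rows only; the two "example-*" entries are made up.
RECORDS = [
    ModelRecord("alibaba-cn", "qwen-long", frozenset({"text"}), frozenset({"text"})),
    ModelRecord("example-provider", "example-vision",
                frozenset({"text", "image"}), frozenset({"text"})),
    ModelRecord("example-provider", "example-draw",
                frozenset({"text"}), frozenset({"image"})),
]

matches = filter_by_modality(RECORDS, ["image", "text"], ["text"])
print([r.model for r in matches])  # prints ['example-vision']
```

If order-sensitive groups are needed, swapping the frozensets for tuples would reproduce the separate `text + image` and `image + text` groupings.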
AI model API comparison tables by modality
Within each modality group, rows are sorted by context window (largest first), then by provider and model name. Prices are normalized to a cost per 1M units (for example, $0.072 / 1M) where applicable; rows without a recorded price show Unknown.
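As a rough sketch of the ordering and the per-1M pricing described above, the Python below sorts a handful of rows and estimates the cost of one request. The `Row` type, `sort_rows`, and `estimate_cost` are hypothetical names introduced for illustration. The sketch assumes that the priced unit is a token, that ties are broken case-insensitively on the provider and model names, and that an Unknown price is stored as `None`. The four sample rows are copied from the text -> text table.

```python
from typing import List, NamedTuple, Optional


class Row(NamedTuple):
    """Hypothetical row shape holding only the fields used here."""
    provider: str
    model: str
    context: int
    input_price_per_1m: Optional[float]   # None stands for "Unknown"
    output_price_per_1m: Optional[float]  # None stands for "Unknown"


def sort_rows(rows: List[Row]) -> List[Row]:
    """Largest context window first, then provider, then model name."""
    return sorted(
        rows,
        key=lambda r: (-r.context, r.provider.lower(), r.model.lower()),
    )


def estimate_cost(row: Row, input_tokens: int, output_tokens: int) -> Optional[float]:
    """Estimated cost in dollars for one request, or None if a price is unknown."""
    if row.input_price_per_1m is None or row.output_price_per_1m is None:
        return None
    total = (input_tokens * row.input_price_per_1m
             + output_tokens * row.output_price_per_1m)
    return total / 1_000_000


ROWS = [
    Row("Requesty", "Grok 4 Fast", 2_000_000, 0.2, 0.5),
    Row("DeepSeek", "DeepSeek Chat", 1_000_000, 0.14, 0.28),
    Row("Qiniu", "x-AI/Grok-4.1-Fast", 2_000_000, None, None),
    Row("Alibaba (China)", "Qwen Long", 10_000_000, 0.072, 0.287),
]

for row in sort_rows(ROWS):
    cost = estimate_cost(row, input_tokens=100_000, output_tokens=10_000)
    label = "unknown" if cost is None else f"${cost:.4f}"
    print(f"{row.provider} | {row.model} | {row.context:,} | {label}")
```

For the Requesty row, 100,000 input tokens at $0.2 / 1M plus 10,000 output tokens at $0.5 / 1M comes to roughly $0.025. Rows with an Unknown price are kept in the ordering but return no estimate, so they never appear to be free.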
2,330 model API records in this modality group.
| Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|
Alibaba (China) alibaba-cn | Qwen Long qwen-long | qwen | text | text | 10,000,000 | 8,192 | $0.072 / 1M | $0.287 / 1M | No | Yes | Jan 25, 2025 | |
AIHubMix aihubmix | Qwen3 Coder Plus qwen3-coder-next | qwen | text | text | 2,000,000 | 64,000 | $0.137 / 1M | $0.548 / 1M | No | Yes | Jul 23, 2025 | |
NanoGPT nano-gpt | Gemini 1.5 Flash google/gemini-flash-1.5 | gemini-flash | text | text | 2,000,000 | 8,192 | $0.0748 / 1M | $0.306 / 1M | No | No | May 14, 2024 | |
Qiniu qiniu-ai | x-AI/Grok-4.1-Fast x-ai/grok-4.1-fast | Unknown | text | text | 2,000,000 | 2,000,000 | Unknown | Unknown | Yes | Yes | Nov 20, 2025 | |
Requesty requesty | Grok 4 Fast xai/grok-4-fast | grok | text | text | 2,000,000 | 64,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 19, 2025 | |
Vercel AI Gateway vercel | Grok 4 Fast Reasoning xai/grok-4-fast-reasoning | grok | text | text | 2,000,000 | 256,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Jul 9, 2025 | |
Vercel AI Gateway vercel | Grok 4.1 Fast Non-Reasoning xai/grok-4.1-fast-non-reasoning | grok | text | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Jul 9, 2025 | |
Vercel AI Gateway vercel | Grok 4.1 Fast Reasoning xai/grok-4.1-fast-reasoning | grok | text | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Jul 9, 2025 | |
Vercel AI Gateway vercel | Grok 4.20 Multi Agent Beta xai/grok-4.20-multi-agent-beta | grok | text | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 13, 2026 | |
Vercel AI Gateway vercel | Grok 4.20 Multi-Agent xai/grok-4.20-multi-agent | grok | text | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 23, 2026 | |
Alibaba (China) alibaba-cn | Qwen3 Coder Plus qwen3-coder-plus | qwen | text | text | 1,048,576 | 65,536 | $1 / 1M | $5 / 1M | No | Yes | Jul 23, 2025 | |
Alibaba alibaba | Qwen3 Coder Plus qwen3-coder-plus | qwen | text | text | 1,048,576 | 65,536 | $1 / 1M | $5 / 1M | No | Yes | Jul 23, 2025 | |
Hugging Face huggingface | DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro | deepseek-thinking | text | text | 1,048,576 | 393,216 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
Kilo Gateway kilo | Xiaomi: MiMo V2.5 Pro xiaomi/mimo-v2.5-pro | Unknown | text | text | 1,048,576 | 128,000 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 22, 2026 | |
Kilo Gateway kilo | Xiaomi: MiMo-V2-Pro xiaomi/mimo-v2-pro | Unknown | text | text | 1,048,576 | 131,072 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Xiaomi: MiMo-V2.5 xiaomi/mimo-v2.5 | Unknown | text | text | 1,048,576 | 131,072 | $0.4 / 1M | $2 / 1M | Yes | Yes | Apr 22, 2026 | |
LLM Gateway llmgateway | Qwen3 Coder Plus qwen3-coder-plus | qwen | text | text | 1,048,576 | 65,536 | $1 / 1M | $5 / 1M | No | Yes | Jul 23, 2025 | |
NovitaAI novita-ai | DeepSeek V4 Flash deepseek/deepseek-v4-flash | deepseek-flash | text | text | 1,048,576 | 393,216 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
NovitaAI novita-ai | DeepSeek V4 Pro deepseek/deepseek-v4-pro | deepseek-thinking | text | text | 1,048,576 | 393,216 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
Nvidia nvidia | DeepSeek V4 Flash deepseek-ai/deepseek-v4-flash | deepseek-flash | text | text | 1,048,576 | 393,216 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
Nvidia nvidia | DeepSeek V4 Pro deepseek-ai/deepseek-v4-pro | deepseek-thinking | text | text | 1,048,576 | 393,216 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
Ollama Cloud ollama-cloud | deepseek-v4-flash deepseek-v4-flash | deepseek-flash | text | text | 1,048,576 | 1,048,576 | Unknown | Unknown | Yes | Yes | Apr 24, 2026 | |
Ollama Cloud ollama-cloud | deepseek-v4-pro deepseek-v4-pro | deepseek-thinking | text | text | 1,048,576 | 1,048,576 | Unknown | Unknown | Yes | Yes | Apr 24, 2026 | |
Ollama Cloud ollama-cloud | nemotron-3-nano:30b nemotron-3-nano:30b | nemotron | text | text | 1,048,576 | 131,072 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
OpenCode Go opencode-go | MiMo V2 Pro mimo-v2-pro | mimo-v2-pro | text | text | 1,048,576 | 128,000 | $1 / 1M | $3 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenCode Go opencode-go | MiMo V2.5 Pro mimo-v2.5-pro | mimo-v2.5-pro | text | text | 1,048,576 | 128,000 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 22, 2026 | |
OpenCode Zen opencode | MiMo V2 Pro Free mimo-v2-pro-free | mimo-pro-free | text | text | 1,048,576 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenCode Zen opencode | Qwen3.6 Plus Free qwen3.6-plus-free | qwen-free | text | text | 1,048,576 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 30, 2026 | |
OpenRouter openrouter | DeepSeek V4 Flash deepseek/deepseek-v4-flash | deepseek-flash | text | text | 1,048,576 | 393,216 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
OpenRouter openrouter | DeepSeek V4 Pro deepseek/deepseek-v4-pro | deepseek-thinking | text | text | 1,048,576 | 393,216 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
OpenRouter openrouter | MiMo-V2-Pro xiaomi/mimo-v2-pro | mimo | text | text | 1,048,576 | 65,536 | $1 / 1M | $3 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenRouter openrouter | MiMo-V2.5 xiaomi/mimo-v2.5 | mimo | text | text | 1,048,576 | 131,072 | $0.4 / 1M | $2 / 1M | Yes | Yes | Apr 22, 2026 | |
Xiaomi xiaomi | MiMo-V2.5 mimo-v2.5 | mimo | text | text | 1,048,576 | 131,072 | $0.4 / 1M | $2 / 1M | Yes | Yes | Apr 22, 2026 | |
ZenMux zenmux | MiMo-V2.5 xiaomi/mimo-v2.5 | mimo | text | text | 1,048,576 | 131,072 | $0.4 / 1M | $2 / 1M | Yes | Yes | Apr 22, 2026 | |
Amazon Bedrock amazon-bedrock | Palmyra X5 writer.palmyra-x5-v1:0 | palmyra | text | text | 1,040,000 | 8,192 | $0.6 / 1M | $6 / 1M | Yes | Yes | Apr 28, 2025 | |
Kilo Gateway kilo | Writer: Palmyra X5 writer/palmyra-x5 | Unknown | text | text | 1,040,000 | 8,192 | $0.6 / 1M | $6 / 1M | No | No | Apr 28, 2025 | |
302.AI 302ai | MiniMax-M1 MiniMax-M1 | minimax | text | text | 1,000,000 | 128,000 | $0.132 / 1M | $1.254 / 1M | No | Yes | Jun 16, 2025 | |
302.AI 302ai | MiniMax-M2 MiniMax-M2 | Unknown | text | text | 1,000,000 | 128,000 | $0.33 / 1M | $1.32 / 1M | No | Yes | Oct 26, 2025 | |
302.AI 302ai | MiniMax-M2.1 MiniMax-M2.1 | Unknown | text | text | 1,000,000 | 131,072 | $0.3 / 1M | $1.2 / 1M | No | Yes | Dec 19, 2025 | |
302.AI 302ai | Qwen-Flash qwen-flash | Unknown | text | text | 1,000,000 | 32,768 | $0.022 / 1M | $0.22 / 1M | No | Yes | Jul 28, 2025 | |
302.AI 302ai | Qwen-Plus qwen-plus | qwen | text | text | 1,000,000 | 32,768 | $0.12 / 1M | $1.2 / 1M | No | Yes | Jul 23, 2024 | |
AIHubMix aihubmix | DeepSeek V4 Flash deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 384,000 | $0.154 / 1M | $0.308 / 1M | Yes | Yes | Apr 24, 2026 | |
AIHubMix aihubmix | DeepSeek V4 Flash Think deepseek-v4-flash-think | deepseek | text | text | 1,000,000 | 384,000 | $0.154 / 1M | $0.308 / 1M | Yes | Yes | Apr 24, 2026 | |
AIHubMix aihubmix | DeepSeek V4 Pro deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $0.478 / 1M | $0.956 / 1M | Yes | Yes | Apr 24, 2026 | |
Alibaba (China) alibaba-cn | DeepSeek V4 Flash deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
Alibaba (China) alibaba-cn | DeepSeek V4 Pro deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
Alibaba (China) alibaba-cn | Qwen Deep Research qwen-deep-research | qwen | text | text | 1,000,000 | 32,768 | $7.742 / 1M | $23.367 / 1M | No | Yes | Jan 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen Flash qwen-flash | qwen | text | text | 1,000,000 | 32,768 | $0.022 / 1M | $0.216 / 1M | Yes | Yes | Jul 28, 2025 | |
Alibaba (China) alibaba-cn | Qwen Plus qwen-plus | qwen | text | text | 1,000,000 | 32,768 | $0.115 / 1M | $0.287 / 1M | Yes | Yes | Sep 11, 2025 | |
Alibaba (China) alibaba-cn | Qwen Turbo qwen-turbo | qwen | text | text | 1,000,000 | 16,384 | $0.044 / 1M | $0.087 / 1M | Yes | Yes | Jul 15, 2025 | |
Alibaba (China) alibaba-cn | Qwen3 Coder Flash qwen3-coder-flash | qwen | text | text | 1,000,000 | 65,536 | $0.144 / 1M | $0.574 / 1M | No | Yes | Jul 28, 2025 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | Qwen3 Coder Plus qwen3-coder-plus | qwen | text | text | 1,000,000 | 65,536 | $0 / 1M | $0 / 1M | No | Yes | Jul 23, 2025 | |
Alibaba Coding Plan alibaba-coding-plan | Qwen3 Coder Plus qwen3-coder-plus | qwen | text | text | 1,000,000 | 65,536 | $0 / 1M | $0 / 1M | No | Yes | Jul 23, 2025 | |
Alibaba alibaba | Qwen Flash qwen-flash | qwen | text | text | 1,000,000 | 32,768 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Jul 28, 2025 | |
Alibaba alibaba | Qwen Plus qwen-plus | qwen | text | text | 1,000,000 | 32,768 | $0.4 / 1M | $1.2 / 1M | Yes | Yes | Sep 11, 2025 | |
Alibaba alibaba | Qwen Turbo qwen-turbo | qwen | text | text | 1,000,000 | 16,384 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Apr 28, 2025 | |
Alibaba alibaba | Qwen3 Coder Flash qwen3-coder-flash | qwen | text | text | 1,000,000 | 65,536 | $0.3 / 1M | $1.5 / 1M | No | Yes | Jul 28, 2025 | |
Baseten baseten | DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
DeepSeek deepseek | DeepSeek Chat deepseek-chat | deepseek | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | No | Yes | Feb 28, 2026 | |
DeepSeek deepseek | DeepSeek Reasoner deepseek-reasoner | deepseek-thinking | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Feb 28, 2026 | |
DeepSeek deepseek | DeepSeek V4 Flash deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
DeepSeek deepseek | DeepSeek V4 Pro deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
Fireworks AI fireworks-ai | DeepSeek V4 Pro accounts/fireworks/models/deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
HPC-AI hpc-ai | MiniMax M2.5 minimax/minimax-m2.5 | minimax-m2.5 | text | text | 1,000,000 | 131,072 | $0.14 / 1M | $0.56 / 1M | Yes | Yes | Mar 25, 2026 | |
Jiekou.AI jiekou | MiniMax M1 minimaxai/minimax-m1-80k | minimax | text | text | 1,000,000 | 40,000 | $0.55 / 1M | $2.2 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | MiniMax: MiniMax M1 minimax/minimax-m1 | Unknown | text | text | 1,000,000 | 40,000 | $0.4 / 1M | $2.2 / 1M | Yes | Yes | Jun 17, 2025 | |
Kilo Gateway kilo | Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28 | Unknown | text | text | 1,000,000 | 32,768 | $0.26 / 1M | $0.78 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking | Unknown | text | text | 1,000,000 | 32,768 | $0.26 / 1M | $0.78 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen-Plus qwen/qwen-plus | Unknown | text | text | 1,000,000 | 32,768 | $0.4 / 1M | $1.2 / 1M | No | Yes | Sep 11, 2025 | |
Kilo Gateway kilo | Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash | Unknown | text | text | 1,000,000 | 65,536 | $0.195 / 1M | $0.975 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus | Unknown | text | text | 1,000,000 | 65,536 | $0.65 / 1M | $3.25 / 1M | No | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | DeepSeek V4 Flash deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
LLM Gateway llmgateway | DeepSeek V4 Pro deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
LLM Gateway llmgateway | Gemma 3 1B IT gemma-3-1b-it | gemma | text | text | 1,000,000 | 16,384 | $0.08 / 1M | $0.3 / 1M | No | No | Mar 12, 2025 | |
LLM Gateway llmgateway | MiniMax Text 01 minimax-text-01 | minimax | text | text | 1,000,000 | 131,072 | $0.2 / 1M | $1.1 / 1M | Yes | No | Jan 15, 2025 | |
LLM Gateway llmgateway | Qwen Flash qwen-flash | qwen | text | text | 1,000,000 | 32,768 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | Qwen Plus qwen-plus | qwen | text | text | 1,000,000 | 32,768 | $0.4 / 1M | $1.2 / 1M | Yes | Yes | Sep 11, 2025 | |
LLM Gateway llmgateway | Qwen Turbo qwen-turbo | qwen | text | text | 1,000,000 | 16,384 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Apr 28, 2025 | |
LLM Gateway llmgateway | Qwen3 Coder Flash qwen3-coder-flash | qwen | text | text | 1,000,000 | 65,536 | $0.3 / 1M | $1.5 / 1M | No | Yes | Jul 28, 2025 | |
NanoGPT nano-gpt | Amazon Nova 2 Lite amazon/nova-2-lite-v1 | nova | text | text | 1,000,000 | 65,535 | $0.51 / 1M | $4.25 / 1M | No | No | Dec 3, 2024 | |
NanoGPT nano-gpt | Auto model auto-model | Unknown | text | text | 1,000,000 | 1,000,000 | $0 / 1M | $0 / 1M | No | No | Jun 1, 2024 | |
NanoGPT nano-gpt | Auto model (Basic) auto-model-basic | Unknown | text | text | 1,000,000 | 1,000,000 | $9.996 / 1M | $19.992 / 1M | No | No | Jun 1, 2024 | |
NanoGPT nano-gpt | Auto model (Premium) auto-model-premium | Unknown | text | text | 1,000,000 | 1,000,000 | $9.996 / 1M | $19.992 / 1M | No | No | Jun 1, 2024 | |
NanoGPT nano-gpt | Auto model (Standard) auto-model-standard | Unknown | text | text | 1,000,000 | 1,000,000 | $9.996 / 1M | $19.992 / 1M | No | No | Jun 1, 2024 | |
NanoGPT nano-gpt | GLM-4 Long glm-4-long | Unknown | text | text | 1,000,000 | 4,096 | $0.2006 / 1M | $0.2006 / 1M | No | No | Aug 1, 2024 | |
NanoGPT nano-gpt | GPT-5.1 (2025-11-13) openai/gpt-5.1-2025-11-13 | gpt | text | text | 1,000,000 | 32,768 | $1.25 / 1M | $10 / 1M | No | No | Nov 13, 2025 | |
NanoGPT nano-gpt | MiniMax M1 MiniMax-M1 | Unknown | text | text | 1,000,000 | 131,072 | $0.1394 / 1M | $1.3328 / 1M | No | No | Jun 16, 2025 | |
NanoGPT nano-gpt | MiniMax M1 80K MiniMaxAI/MiniMax-M1-80k | minimax | text | text | 1,000,000 | 131,072 | $0.6052 / 1M | $2.4225 / 1M | No | No | Jun 16, 2025 | |
NanoGPT nano-gpt | Qwen Turbo qwen-turbo | Unknown | text | text | 1,000,000 | 8,192 | $0.04998 / 1M | $0.2006 / 1M | No | No | Nov 1, 2024 | |
NanoGPT nano-gpt | v0 1.5 LG v0-1.5-lg | Unknown | text | text | 1,000,000 | 64,000 | $15 / 1M | $75 / 1M | No | No | Jul 4, 2025 | |
NovitaAI novita-ai | MiniMax M1 minimaxai/minimax-m1-80k | minimax | text | text | 1,000,000 | 40,000 | $0.55 / 1M | $2.2 / 1M | Yes | Yes | Jun 17, 2025 | |
OpenCode Go opencode-go | DeepSeek V4 Flash deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
OpenCode Go opencode-go | DeepSeek V4 Pro deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
OpenRouter openrouter | MiniMax M1 minimax/minimax-m1 | minimax | text | text | 1,000,000 | 40,000 | $0.4 / 1M | $2.2 / 1M | Yes | Yes | Jun 17, 2025 | |
Perplexity Agent perplexity-agent | Nemotron 3 Super 120B nvidia/nemotron-3-super-120b-a12b | nemotron | text | text | 1,000,000 | 32,000 | $0.25 / 1M | $2.5 / 1M | Yes | Yes | Mar 11, 2026 | |
Qiniu qiniu-ai | MiniMax M1 MiniMax-M1 | Unknown | text | text | 1,000,000 | 80,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen-Turbo qwen-turbo | Unknown | text | text | 1,000,000 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
The Grid AI the-grid-ai | Text Max text-max | Unknown | text | text | 1,000,000 | 128,000 | Unknown | Unknown | Yes | Yes | Mar 24, 2026 | |
Venice AI venice | DeepSeek V4 Flash deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 32,768 | $0.175 / 1M | $0.35 / 1M | Yes | Yes | Apr 25, 2026 | |
Venice AI venice | DeepSeek V4 Pro deepseek-v4-pro | deepseek | text | text | 1,000,000 | 32,768 | $2.175 / 1M | $4.35 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | DeepSeek V4 Flash deepseek/deepseek-v4-flash | deepseek | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | DeepSeek V4 Pro deepseek/deepseek-v4-pro | deepseek | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | MiMo V2 Pro xiaomi/mimo-v2-pro | mimo | text | text | 1,000,000 | 128,000 | $1 / 1M | $3 / 1M | Yes | Yes | Mar 20, 2026 | |
Vercel AI Gateway vercel | Qwen3 Coder Plus alibaba/qwen3-coder-plus | qwen | text | text | 1,000,000 | 1,000,000 | $1 / 1M | $5 / 1M | No | Yes | Jul 23, 2025 | |
Xiaomi xiaomi | MiMo-V2-Pro mimo-v2-pro | mimo | text | text | 1,000,000 | 128,000 | $1 / 1M | $3 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (China) xiaomi-token-plan-cn | MiMo-V2-Pro mimo-v2-pro | mimo | text | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (China) xiaomi-token-plan-cn | MiMo-V2.5-Pro mimo-v2.5-pro | mimo-v2.5-pro | text | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
Xiaomi Token Plan (Europe) xiaomi-token-plan-ams | MiMo-V2-Pro mimo-v2-pro | mimo | text | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (Europe) xiaomi-token-plan-ams | MiMo-V2.5-Pro mimo-v2.5-pro | mimo-v2.5-pro | text | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
Xiaomi Token Plan (Singapore) xiaomi-token-plan-sgp | MiMo-V2-Pro mimo-v2-pro | mimo | text | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (Singapore) xiaomi-token-plan-sgp | MiMo-V2.5-Pro mimo-v2.5-pro | mimo-v2.5-pro | text | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
ZenMux zenmux | DeepSeek V4 Flash deepseek/deepseek-v4-flash | deepseek-flash | text | text | 1,000,000 | 384,000 | $0.14 / 1M | $0.28 / 1M | Yes | Yes | Apr 24, 2026 | |
ZenMux zenmux | DeepSeek V4 Pro deepseek/deepseek-v4-pro | deepseek-thinking | text | text | 1,000,000 | 384,000 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
ZenMux zenmux | Qwen3-Coder-Plus qwen/qwen3-coder-plus | Unknown | text | text | 1,000,000 | 64,000 | $1 / 1M | $5 / 1M | No | Yes | Jul 23, 2025 | |
ZenMux zenmux | Qwen3.6-Plus qwen/qwen3.6-plus | Unknown | text | text | 1,000,000 | 64,000 | $0.5 / 1M | $3 / 1M | Yes | Yes | Mar 30, 2026 | |
NanoGPT nano-gpt | Qwen Plus qwen-plus | Unknown | text | text | 995,904 | 32,768 | $0.3995 / 1M | $1.2002 / 1M | Yes | No | Jan 25, 2024 | |
Together AI togetherai | DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro | deepseek | text | text | 512,000 | 384,000 | $2.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 24, 2026 | |
Cortecs cortecs | MiniMax-M2 minimax-m2 | minimax | text | text | 400,000 | 400,000 | $0.39 / 1M | $1.57 / 1M | Yes | Yes | Oct 27, 2025 | |
Helicone helicone | OpenAI: GPT-5 Codex gpt-5-codex | gpt-codex | text | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | No | Yes | Jan 1, 2025 | |
Poe poe | GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | gpt-codex | text | text | 400,000 | 128,000 | $0.22 / 1M | $1.8 / 1M | Yes | Yes | Nov 12, 2025 | |
Qiniu qiniu-ai | OpenAI/GPT-5 openai/gpt-5 | Unknown | text | text | 400,000 | 128,000 | Unknown | Unknown | No | Yes | Sep 19, 2025 | |
ZenMux zenmux | GPT-5.3 Codex openai/gpt-5.3-codex | Unknown | text | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Mar 20, 2026 | |
ZenMux zenmux | GPT-5.4 Mini openai/gpt-5.4-mini | Unknown | text | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | No | Yes | Mar 20, 2026 | |
ZenMux zenmux | GPT-5.4 Nano openai/gpt-5.4-nano | Unknown | text | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | No | Yes | Mar 20, 2026 | |
NanoGPT nano-gpt | Amazon Nova Lite 1.0 amazon/nova-lite-v1 | nova-lite | text | text | 300,000 | 5,120 | $0.0595 / 1M | $0.238 / 1M | No | No | Dec 3, 2024 | |
NanoGPT nano-gpt | Amazon Nova Pro 1.0 amazon/nova-pro-v1 | nova-pro | text | text | 300,000 | 32,000 | $0.799 / 1M | $3.196 / 1M | No | No | Dec 3, 2024 | |
302.AI 302ai | kimi-k2-0905-preview kimi-k2-0905-preview | Unknown | text | text | 262,144 | 262,144 | $0.632 / 1M | $2.53 / 1M | No | Yes | Sep 5, 2025 | |
302.AI 302ai | kimi-k2-thinking kimi-k2-thinking | Unknown | text | text | 262,144 | 262,144 | $0.575 / 1M | $2.3 / 1M | Yes | Yes | Sep 5, 2025 | |
302.AI 302ai | kimi-k2-thinking-turbo kimi-k2-thinking-turbo | Unknown | text | text | 262,144 | 262,144 | $1.265 / 1M | $9.119 / 1M | Yes | Yes | Sep 5, 2025 | |
302.AI 302ai | qwen3-coder-480b-a35b-instruct qwen3-coder-480b-a35b-instruct | Unknown | text | text | 262,144 | 65,536 | $0.86 / 1M | $3.43 / 1M | No | Yes | Jul 23, 2025 | |
Abacus abacus | Qwen3 235B A22B Instruct Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,144 | 8,192 | $0.13 / 1M | $0.6 / 1M | Yes | Yes | Jul 1, 2025 | |
Abacus abacus | Qwen3 Coder 480B A35B Instruct Qwen/qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 65,536 | $0.29 / 1M | $1.2 / 1M | Yes | Yes | Jul 22, 2025 | |
Alibaba (China) alibaba-cn | Moonshot Kimi K2 Thinking kimi-k2-thinking | kimi | text | text | 262,144 | 16,384 | $0.574 / 1M | $2.294 / 1M | Yes | Yes | Nov 6, 2025 | |
Alibaba (China) alibaba-cn | Qwen3 Max qwen3-max | qwen | text | text | 262,144 | 65,536 | $0.861 / 1M | $3.441 / 1M | No | Yes | Sep 23, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct | qwen | text | text | 262,144 | 65,536 | $0.216 / 1M | $0.861 / 1M | No | Yes | Apr 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 65,536 | $0.861 / 1M | $3.441 / 1M | No | Yes | Apr 1, 2025 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | Qwen3 Coder Next qwen3-coder-next | qwen | text | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | No | Yes | Feb 3, 2026 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | Qwen3 Max qwen3-max-2026-01-23 | qwen | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | No | Yes | Jan 23, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | Qwen3 Coder Next qwen3-coder-next | qwen | text | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | No | Yes | Feb 3, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | Qwen3 Max qwen3-max-2026-01-23 | qwen | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | No | Yes | Jan 23, 2026 | |
Alibaba alibaba | Qwen3 Max qwen3-max | qwen | text | text | 262,144 | 65,536 | $1.2 / 1M | $6 / 1M | No | Yes | Sep 23, 2025 | |
Alibaba alibaba | Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct | qwen | text | text | 262,144 | 65,536 | $0.45 / 1M | $2.25 / 1M | No | Yes | Apr 1, 2025 | |
Alibaba alibaba | Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 65,536 | $1.5 / 1M | $7.5 / 1M | No | Yes | Apr 1, 2025 | |
Amazon Bedrock amazon-bedrock | NVIDIA Nemotron 3 Super 120B A12B nvidia.nemotron-super-3-120b | nemotron | text | text | 262,144 | 131,072 | $0.15 / 1M | $0.65 / 1M | Yes | Yes | Mar 11, 2026 | |
Amazon Bedrock amazon-bedrock | Qwen3 235B A22B 2507 qwen.qwen3-235b-a22b-2507-v1:0 | qwen | text | text | 262,144 | 131,072 | $0.22 / 1M | $0.88 / 1M | No | Yes | Sep 18, 2025 | |
Amazon Bedrock amazon-bedrock | Qwen3 Coder 30B A3B Instruct qwen.qwen3-coder-30b-a3b-v1:0 | qwen | text | text | 262,144 | 131,072 | $0.15 / 1M | $0.6 / 1M | No | Yes | Sep 18, 2025 | |
Azure Cognitive Services azure-cognitive-services | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Dec 2, 2025 | |
Azure azure | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Dec 2, 2025 | |
Baseten baseten | Kimi K2 Instruct 0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | No | Yes | Mar 6, 2026 | |
Baseten baseten | Kimi K2 Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Mar 6, 2026 | |
Baseten baseten | Kimi K2.5 moonshotai/Kimi-K2.5 | kimi | text | text | 262,144 | 8,192 | $0.6 / 1M | $3 / 1M | Yes | Yes | Feb 12, 2026 | |
Baseten baseten | Nemotron 3 Super nvidia/Nemotron-120B-A12B | nemotron | text | text | 262,144 | 32,678 | $0.3 / 1M | $0.75 / 1M | Yes | Yes | Mar 11, 2026 | |
Chutes chutes | Devstral 2 123B Instruct 2512 TEE mistralai/Devstral-2-123B-Instruct-2512-TEE | Unknown | text | text | 262,144 | 65,536 | $0.05 / 1M | $0.22 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Kimi K2 Instruct 0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,144 | 262,144 | $0.39 / 1M | $1.9 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Kimi K2 Thinking TEE moonshotai/Kimi-K2-Thinking-TEE | kimi-thinking | text | text | 262,144 | 65,535 | $0.4 / 1M | $1.75 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | MiMo V2 Flash XiaomiMiMo/MiMo-V2-Flash | mimo | text | text | 262,144 | 32,000 | $0.09 / 1M | $0.29 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | MiroThinker V1.5 235B miromind-ai/MiroThinker-v1.5-235B | Unknown | text | text | 262,144 | 8,192 | $0.3 / 1M | $1.2 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | NVIDIA Nemotron 3 Nano 30B A3B BF16 nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 | nemotron | text | text | 262,144 | 262,144 | $0.06 / 1M | $0.24 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 235B A22B Instruct 2507 TEE Qwen/Qwen3-235B-A22B-Instruct-2507-TEE | qwen | text | text | 262,144 | 65,536 | $0.08 / 1M | $0.55 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 262,144 | $0.11 / 1M | $0.6 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | qwen | text | text | 262,144 | 262,144 | $0.08 / 1M | $0.33 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 Coder 480B A35B Instruct FP8 TEE Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8-TEE | qwen | text | text | 262,144 | 262,144 | $0.22 / 1M | $0.95 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 Coder Next Qwen/Qwen3-Coder-Next | qwen | text | text | 262,144 | 65,536 | $0.07 / 1M | $0.3 / 1M | No | Yes | Feb 5, 2026 | |
Chutes chutes | Qwen3 Next 80B A3B Instruct Qwen/Qwen3-Next-80B-A3B-Instruct | qwen | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.8 / 1M | No | Yes | Jan 10, 2026 | |
Clarifai clarifai | Qwen3 30B A3B Instruct 2507 qwen/qwenLM/models/Qwen3-30B-A3B-Instruct-2507 | qwen | text | text | 262,144 | 262,144 | $0.3 / 1M | $0.5 / 1M | No | Yes | Feb 25, 2026 | |
Clarifai clarifai | Qwen3 30B A3B Thinking 2507 qwen/qwenLM/models/Qwen3-30B-A3B-Thinking-2507 | qwen | text | text | 262,144 | 131,072 | $0.36 / 1M | $1.3 / 1M | Yes | Yes | Feb 25, 2026 | |
Clarifai clarifai | Qwen3 Coder 30B A3B Instruct qwen/qwenCoder/models/Qwen3-Coder-30B-A3B-Instruct | qwen | text | text | 262,144 | 65,536 | $0.11458 / 1M | $0.74812 / 1M | No | Yes | Feb 12, 2026 | |
Deep Infra deepinfra | Kimi K2 0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,144 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Sep 5, 2025 | |
Deep Infra deepinfra | MiniMax M2 MiniMaxAI/MiniMax-M2 | minimax | text | text | 262,144 | 32,768 | $0.254 / 1M | $1.02 / 1M | Yes | Yes | Nov 13, 2025 | |
Deep Infra deepinfra | Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 262,144 | 66,536 | $0.4 / 1M | $1.6 / 1M | No | Yes | Jul 23, 2025 | |
Deep Infra deepinfra | Qwen3 Coder 480B A35B Instruct Turbo Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | qwen | text | text | 262,144 | 66,536 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jul 23, 2025 | |
DigitalOcean digitalocean | Kimi K2.5 kimi-k2.5 | kimi | text | text | 262,144 | 32,768 | $0.5 / 1M | $2.7 / 1M | Yes | Yes | Apr 16, 2026 | |
FastRouter fastrouter | Qwen3 Coder qwen/qwen3-coder | qwen | text | text | 262,144 | 66,536 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jul 23, 2025 | |
Friendli friendli | Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,144 | 262,144 | $0.2 / 1M | $0.8 / 1M | No | Yes | Jan 29, 2026 | |
Groq groq | Kimi K2 Instruct 0905 moonshotai/kimi-k2-instruct-0905 | kimi | text | text | 262,144 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Sep 5, 2025 | |
Helicone helicone | Kimi K2 (09/05) kimi-k2-0905 | kimi | text | text | 262,144 | 16,384 | $0.5 / 1M | $2 / 1M | No | Yes | Sep 5, 2025 | |
Helicone helicone | Qwen3 Coder 30B A3B Instruct qwen3-coder-30b-a3b-instruct | qwen | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jul 31, 2025 | |
Hugging Face huggingface | Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,144 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Sep 4, 2025 | |
Hugging Face huggingface | Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
Hugging Face huggingface | MiMo-V2-Flash XiaomiMiMo/MiMo-V2-Flash | mimo | text | text | 262,144 | 4,096 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Dec 16, 2025 | |
Hugging Face huggingface | Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 131,072 | $0.3 / 1M | $3 / 1M | Yes | Yes | Jul 25, 2025 | |
Hugging Face huggingface | Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 262,144 | 66,536 | $2 / 1M | $2 / 1M | No | Yes | Jul 23, 2025 | |
Hugging Face huggingface | Qwen3-Coder-Next Qwen/Qwen3-Coder-Next | qwen | text | text | 262,144 | 65,536 | $0.2 / 1M | $1.5 / 1M | No | Yes | Feb 3, 2026 | |
Hugging Face huggingface | Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct | qwen | text | text | 262,144 | 66,536 | $0.25 / 1M | $1 / 1M | No | Yes | Sep 11, 2025 | |
Hugging Face huggingface | Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking | qwen | text | text | 262,144 | 131,072 | $0.3 / 1M | $2 / 1M | No | Yes | Sep 11, 2025 | |
IO.NET io-net | Qwen 3 235B Thinking Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 4,096 | $0.11 / 1M | $0.6 / 1M | Yes | Yes | Jul 1, 2025 | |
IO.NET io-net | Qwen 3 Next 80B Instruct Qwen/Qwen3-Next-80B-A3B-Instruct | qwen | text | text | 262,144 | 4,096 | $0.1 / 1M | $0.8 / 1M | No | Yes | Jan 10, 2025 | |
Jiekou.AI jiekou | Kimi K2 0905 moonshotai/kimi-k2-0905 | kimi | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | qwen/qwen3-coder-next qwen/qwen3-coder-next | qwen | text | text | 262,144 | 65,536 | $0.2 / 1M | $1.5 / 1M | No | Yes | Feb 1, 2026 | |
Jiekou.AI jiekou | Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 65,536 | $0.29 / 1M | $1.2 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | XiaomiMiMo/MiMo-V2-Flash xiaomimimo/mimo-v2-flash | mimo | text | text | 262,144 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Arcee AI: Trinity Large Thinking arcee-ai/trinity-large-thinking | Unknown | text | text | 262,144 | 262,144 | $0.22 / 1M | $0.85 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Arcee AI: Trinity Large Thinking (free) arcee-ai/trinity-large-thinking:free | Unknown | text | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Elephant (new) openrouter/elephant-alpha | Unknown | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 13, 2026 | |
Kilo Gateway kilo | Mistral: Devstral 2 2512 mistralai/devstral-2512 | Unknown | text | text | 262,144 | 65,536 | $0.4 / 1M | $2 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Morph: Morph V3 Large morph/morph-v3-large | Unknown | text | text | 262,144 | 131,072 | $0.9 / 1M | $1.9 / 1M | No | No | Aug 15, 2024 | |
Kilo Gateway kilo | NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b | Unknown | text | text | 262,144 | 52,429 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Feb 4, 2026 | |
Kilo Gateway kilo | NVIDIA: Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b | Unknown | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.5 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | NVIDIA: Nemotron 3 Super (free) nvidia/nemotron-3-super-120b-a12b:free | Unknown | text | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | Unknown | text | text | 262,144 | 52,429 | $0.071 / 1M | $0.1 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | Unknown | text | text | 262,144 | 262,144 | $0.11 / 1M | $0.6 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | Unknown | text | text | 262,144 | 262,144 | $0.09 / 1M | $0.3 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder | Unknown | text | text | 262,144 | 52,429 | $0.22 / 1M | $1 / 1M | No | Yes | Jul 23, 2025 | |
Kilo Gateway kilo | Qwen: Qwen3 Coder Next qwen/qwen3-coder-next | Unknown | text | text | 262,144 | 65,536 | $0.12 / 1M | $0.75 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 Max qwen/qwen3-max | Unknown | text | text | 262,144 | 32,768 | $1.2 / 1M | $6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 Max Thinking qwen/qwen3-max-thinking | Unknown | text | text | 262,144 | 32,768 | $0.78 / 1M | $3.9 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Xiaomi: MiMo-V2-Flash xiaomi/mimo-v2-flash | Unknown | text | text | 262,144 | 65,536 | $0.09 / 1M | $0.29 / 1M | Yes | Yes | Mar 15, 2026 | |
Kimi For Coding kimi-for-coding | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 1, 2025 | |
LLM Gateway llmgateway | Devstral 2 devstral-2512 | devstral | text | text | 262,144 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Dec 9, 2025 | |
LLM Gateway llmgateway | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
LLM Gateway llmgateway | Kimi K2 Thinking Turbo kimi-k2-thinking-turbo | kimi-thinking | text | text | 262,144 | 262,144 | $1.15 / 1M | $8 / 1M | Yes | Yes | Nov 6, 2025 | |
LLM Gateway llmgateway | Qwen3 Coder Next qwen3-coder-next | qwen | text | text | 262,144 | 65,536 | $0.8 / 1M | $4 / 1M | Yes | Yes | Oct 15, 2025 | |
LLM Gateway llmgateway | Qwen3 Max qwen3-max | qwen | text | text | 262,144 | 65,536 | $1.2 / 1M | $6 / 1M | No | Yes | Sep 23, 2025 | |
LLM Gateway llmgateway | Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct | qwen | text | text | 262,144 | 65,536 | $0.45 / 1M | $2.25 / 1M | No | Yes | Apr 1, 2025 | |
LLM Gateway llmgateway | Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 65,536 | $1.5 / 1M | $7.5 / 1M | No | Yes | Apr 1, 2025 | |
LMStudio lmstudio | Qwen3 30B A3B 2507 qwen/qwen3-30b-a3b-2507 | qwen | text | text | 262,144 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Jul 30, 2025 | |
LMStudio lmstudio | Qwen3 Coder 30B qwen/qwen3-coder-30b | qwen | text | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | No | Yes | Jul 23, 2025 | |
Meganova meganova | Kimi K2 Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.6 / 1M | Yes | Yes | Nov 6, 2025 | |
Meganova meganova | MiMo V2 Flash XiaomiMiMo/MiMo-V2-Flash | mimo | text | text | 262,144 | 32,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Dec 17, 2025 | |
Mistral mistral | Devstral 2 devstral-2512 | devstral | text | text | 262,144 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Dec 9, 2025 | |
Mistral mistral | Devstral 2 (latest) devstral-medium-latest | devstral | text | text | 262,144 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Dec 2, 2025 | |
Mixlayer mixlayer | Qwen3.5 122B A10B qwen/qwen3.5-122b-a10b | qwen | text | text | 262,144 | 262,144 | $0.4 / 1M | $3.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Mixlayer mixlayer | Qwen3.5 27B qwen/qwen3.5-27b | qwen | text | text | 262,144 | 262,144 | $0.3 / 1M | $2.4 / 1M | Yes | Yes | Mar 18, 2026 | |
Mixlayer mixlayer | Qwen3.5 35B A3B qwen/qwen3.5-35b-a3b | qwen | text | text | 262,144 | 262,144 | $0.25 / 1M | $1.3 / 1M | Yes | Yes | Mar 18, 2026 | |
Mixlayer mixlayer | Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b | qwen | text | text | 262,144 | 262,144 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Mar 18, 2026 | |
Mixlayer mixlayer | Qwen3.5 9B qwen/qwen3.5-9b | qwen | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Mar 18, 2026 | |
ModelScope modelscope | Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,144 | 131,072 | $0 / 1M | $0 / 1M | No | Yes | Jul 21, 2025 | |
ModelScope modelscope | Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | qwen | text | text | 262,144 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Jul 30, 2025 | |
ModelScope modelscope | Qwen3 30B A3B Thinking 2507 Qwen/Qwen3-30B-A3B-Thinking-2507 | qwen | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 30, 2025 | |
ModelScope modelscope | Qwen3 Coder 30B A3B Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct | qwen | text | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | No | Yes | Jul 31, 2025 | |
ModelScope modelscope | Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 25, 2025 | |
Moonshot AI (China) moonshotai-cn | Kimi K2 0905 kimi-k2-0905-preview | kimi | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | No | Yes | Sep 5, 2025 | |
Moonshot AI (China) moonshotai-cn | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
Moonshot AI (China) moonshotai-cn | Kimi K2 Thinking Turbo kimi-k2-thinking-turbo | kimi-thinking | text | text | 262,144 | 262,144 | $1.15 / 1M | $8 / 1M | Yes | Yes | Nov 6, 2025 | |
Moonshot AI (China) moonshotai-cn | Kimi K2 Turbo kimi-k2-turbo-preview | kimi | text | text | 262,144 | 262,144 | $2.4 / 1M | $10 / 1M | No | Yes | Sep 5, 2025 | |
Moonshot AI moonshotai | Kimi K2 0905 kimi-k2-0905-preview | kimi | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | No | Yes | Sep 5, 2025 | |
Moonshot AI moonshotai | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
Moonshot AI moonshotai | Kimi K2 Thinking Turbo kimi-k2-thinking-turbo | kimi-thinking | text | text | 262,144 | 262,144 | $1.15 / 1M | $8 / 1M | Yes | Yes | Nov 6, 2025 | |
Moonshot AI moonshotai | Kimi K2 Turbo kimi-k2-turbo-preview | kimi | text | text | 262,144 | 262,144 | $2.4 / 1M | $10 / 1M | No | Yes | Sep 5, 2025 | |
NanoGPT nano-gpt | Devstral 2 123B mistralai/devstral-2-123b-instruct-2512 | devstral | text | text | 262,144 | 65,536 | $0.4 / 1M | $1.4 / 1M | No | No | Dec 9, 2025 | |
NanoGPT nano-gpt | Ministral 14B mistralai/ministral-14b-2512 | ministral | text | text | 262,144 | 32,768 | $0.2 / 1M | $0.2 / 1M | No | No | Dec 4, 2025 | |
NanoGPT nano-gpt | Ministral 8B mistralai/ministral-8b-2512 | ministral | text | text | 262,144 | 32,768 | $0.15 / 1M | $0.15 / 1M | No | No | Dec 4, 2025 | |
Nebius Token Factory nebius | Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,144 | 8,192 | $0.2 / 1M | $0.6 / 1M | Yes | Yes | Oct 4, 2025 | |
Nebius Token Factory nebius | Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 8,192 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Oct 4, 2025 | |
Nebius Token Factory nebius | Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 262,144 | 66,536 | $0.4 / 1M | $1.8 / 1M | No | Yes | Oct 4, 2025 | |
NovitaAI novita-ai | Kimi K2 0905 moonshotai/kimi-k2-0905 | kimi | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | No | Yes | Sep 5, 2025 | |
NovitaAI novita-ai | Kimi K2 Thinking moonshotai/kimi-k2-thinking | kimi | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 7, 2025 | |
NovitaAI novita-ai | Ling-2.6-1T inclusionai/ling-2.6-1t | ling | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | No | Yes | Apr 23, 2026 | |
NovitaAI novita-ai | Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 65,536 | $0.3 / 1M | $1.3 / 1M | No | Yes | Jul 23, 2025 | |
NovitaAI novita-ai | Qwen3 Coder Next qwen/qwen3-coder-next | qwen | text | text | 262,144 | 65,536 | $0.2 / 1M | $1.5 / 1M | No | Yes | Feb 3, 2026 | |
NovitaAI novita-ai | Qwen3 Max qwen/qwen3-max | qwen | text | text | 262,144 | 65,536 | $2.11 / 1M | $8.45 / 1M | No | Yes | Sep 24, 2025 | |
NovitaAI novita-ai | XiaomiMiMo/MiMo-V2-Flash xiaomimimo/mimo-v2-flash | mimo | text | text | 262,144 | 32,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Dec 19, 2025 | |
Nvidia nvidia | Devstral-2-123B-Instruct-2512 mistralai/devstral-2-123b-instruct-2512 | devstral | text | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 9, 2025 | |
Nvidia nvidia | Kimi K2 0905 moonshotai/kimi-k2-instruct-0905 | kimi | text | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | No | Yes | Sep 5, 2025 | |
Nvidia nvidia | Kimi K2 Thinking moonshotai/kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 1, 2025 | |
Nvidia nvidia | Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b | nemotron | text | text | 262,144 | 262,144 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Mar 11, 2026 | |
Nvidia nvidia | Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,144 | 66,536 | $0 / 1M | $0 / 1M | No | Yes | Jul 23, 2025 | |
Nvidia nvidia | Qwen3-Next-80B-A3B-Instruct qwen/qwen3-next-80b-a3b-instruct | qwen | text | text | 262,144 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Sep 5, 2025 | |
Nvidia nvidia | Qwen3-Next-80B-A3B-Thinking qwen/qwen3-next-80b-a3b-thinking | qwen | text | text | 262,144 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 5, 2025 | |
Ollama Cloud ollama-cloud | devstral-2:123b devstral-2:123b | devstral | text | text | 262,144 | 262,144 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | kimi-k2-thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | kimi-k2:1t kimi-k2:1t | kimi | text | text | 262,144 | 262,144 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | nemotron-3-super nemotron-3-super | nemotron | text | text | 262,144 | 65,536 | Unknown | Unknown | Yes | Yes | Mar 12, 2026 | |
Ollama Cloud ollama-cloud | qwen3-coder-next qwen3-coder-next | qwen | text | text | 262,144 | 65,536 | Unknown | Unknown | No | Yes | Feb 8, 2026 | |
Ollama Cloud ollama-cloud | qwen3-coder:480b qwen3-coder:480b | qwen | text | text | 262,144 | 65,536 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | qwen3-next:80b qwen3-next:80b | qwen | text | text | 262,144 | 32,768 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
OpenCode Zen opencode | Kimi K2 kimi-k2 | kimi | text | text | 262,144 | 262,144 | $0.4 / 1M | $2.5 / 1M | No | Yes | Sep 5, 2025 | |
OpenCode Zen opencode | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.4 / 1M | $2.5 / 1M | Yes | Yes | Sep 5, 2025 | |
OpenCode Zen opencode | MiMo V2 Flash Free mimo-v2-flash-free | mimo-flash-free | text | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 16, 2025 | |
OpenCode Zen opencode | Qwen3 Coder qwen3-coder | qwen | text | text | 262,144 | 65,536 | $0.45 / 1M | $1.8 / 1M | No | Yes | Jul 23, 2025 | |
OpenRouter openrouter | Devstral 2 2512 mistralai/devstral-2512 | devstral | text | text | 262,144 | 262,144 | $0.15 / 1M | $0.6 / 1M | No | Yes | Sep 12, 2025 | |
OpenRouter openrouter | Elephant (free) openrouter/elephant-alpha | elephant | text | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 13, 2026 | |
OpenRouter openrouter | Kimi K2 Instruct 0905 moonshotai/kimi-k2-0905 | kimi | text | text | 262,144 | 16,384 | $0.6 / 1M | $2.5 / 1M | No | Yes | Sep 5, 2025 | |
OpenRouter openrouter | Kimi K2 Instruct 0905 (exacto) moonshotai/kimi-k2-0905:exacto | kimi | text | text | 262,144 | 16,384 | $0.6 / 1M | $2.5 / 1M | No | Yes | Sep 5, 2025 | |
OpenRouter openrouter | Kimi K2 Thinking moonshotai/kimi-k2-thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
OpenRouter openrouter | MiMo-V2-Flash xiaomi/mimo-v2-flash | mimo | text | text | 262,144 | 65,536 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Dec 14, 2025 | |
OpenRouter openrouter | Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b | nemotron | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.5 / 1M | Yes | Yes | Mar 11, 2026 | |
OpenRouter openrouter | Nemotron 3 Super (free) nvidia/nemotron-3-super-120b-a12b:free | nemotron | text | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 11, 2026 | |
OpenRouter openrouter | Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-07-25 | qwen | text | text | 262,144 | 131,072 | $0.15 / 1M | $0.85 / 1M | No | Yes | Jul 21, 2025 | |
OpenRouter openrouter | Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | qwen | text | text | 262,144 | 81,920 | $0.078 / 1M | $0.312 / 1M | Yes | Yes | Jul 25, 2025 | |
OpenRouter openrouter | Qwen3 Coder qwen/qwen3-coder | qwen | text | text | 262,144 | 66,536 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jul 23, 2025 | |
OpenRouter openrouter | Qwen3 Max qwen/qwen3-max | qwen | text | text | 262,144 | 32,768 | $1.2 / 1M | $6 / 1M | Yes | Yes | Sep 5, 2025 | |
OpenRouter openrouter | Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | qwen | text | text | 262,144 | 262,144 | $0.14 / 1M | $1.4 / 1M | No | Yes | Sep 11, 2025 | |
OpenRouter openrouter | Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | qwen | text | text | 262,144 | 262,144 | $0.14 / 1M | $1.4 / 1M | Yes | Yes | Sep 11, 2025 | |
OpenRouter openrouter | Trinity Large Thinking arcee-ai/trinity-large-thinking | trinity | text | text | 262,144 | 80,000 | $0.22 / 1M | $0.85 / 1M | Yes | Yes | Apr 3, 2026 | |
OVHcloud AI Endpoints ovhcloud | Qwen3-Coder-30B-A3B-Instruct qwen3-coder-30b-a3b-instruct | Unknown | text | text | 262,144 | 262,144 | $0.07 / 1M | $0.26 / 1M | No | Yes | Oct 28, 2025 | |
Qiniu qiniu-ai | Qwen3 235b A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 | Unknown | text | text | 262,144 | 64,000 | Unknown | Unknown | No | Yes | Aug 12, 2025 | |
Qiniu qiniu-ai | Qwen3 235B A22B Thinking 2507 qwen3-235b-a22b-thinking-2507 | Unknown | text | text | 262,144 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 12, 2025 | |
Qiniu qiniu-ai | Qwen3 Max qwen3-max | Unknown | text | text | 262,144 | 65,536 | Unknown | Unknown | No | Yes | Sep 24, 2025 | |
Regolo AI regolo-ai | Qwen3-Coder-Next qwen3-coder-next | qwen | text | text | 262,144 | 16,384 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 1, 2026 | |
submodel submodel | Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,144 | 131,072 | $0.2 / 1M | $0.3 / 1M | No | Yes | Aug 23, 2025 | |
submodel submodel | Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 131,072 | $0.2 / 1M | $0.6 / 1M | Yes | Yes | Aug 23, 2025 | |
submodel submodel | Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | qwen | text | text | 262,144 | 262,144 | $0.2 / 1M | $0.8 / 1M | No | Yes | Aug 23, 2025 | |
Synthetic synthetic | Kimi K2 0905 hf:moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,144 | 32,768 | $1.2 / 1M | $1.2 / 1M | No | Yes | Sep 5, 2025 | |
Synthetic synthetic | Kimi K2 Thinking hf:moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,144 | 262,144 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Nov 7, 2025 | |
Synthetic synthetic | Nemotron 3 Super 120B hf:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 | nemotron | text | text | 262,144 | 65,536 | $0.3 / 1M | $1 / 1M | Yes | Yes | Apr 3, 2026 | |
Together AI togetherai | Qwen3 235B A22B Instruct 2507 FP8 Qwen/Qwen3-235B-A22B-Instruct-2507-tput | qwen | text | text | 262,144 | 262,144 | $0.2 / 1M | $0.6 / 1M | Yes | Yes | Jul 25, 2025 | |
Together AI togetherai | Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | qwen | text | text | 262,144 | 262,144 | $2 / 1M | $2 / 1M | No | Yes | Jul 23, 2025 | |
Together AI togetherai | Qwen3 Coder Next FP8 Qwen/Qwen3-Coder-Next-FP8 | qwen | text | text | 262,144 | 262,144 | $0.5 / 1M | $1.2 / 1M | Yes | Yes | Feb 3, 2026 | |
Vercel AI Gateway vercel | MiMo V2 Flash xiaomi/mimo-v2-flash | mimo | text | text | 262,144 | 32,000 | $0.1 / 1M | $0.29 / 1M | Yes | Yes | Dec 17, 2025 | |
Vercel AI Gateway vercel | Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b | nemotron | text | text | 262,144 | 262,144 | $0.06 / 1M | $0.24 / 1M | Yes | No | Dec 1, 2024 | |
Vercel AI Gateway vercel | Qwen3 Coder 480B A35B Instruct alibaba/qwen3-coder | qwen | text | text | 262,144 | 66,536 | $0.38 / 1M | $1.53 / 1M | No | Yes | Apr 1, 2025 | |
Vercel AI Gateway vercel | Qwen3 Max alibaba/qwen3-max | qwen | text | text | 262,144 | 32,768 | $1.2 / 1M | $6 / 1M | No | Yes | Sep 23, 2025 | |
Vercel AI Gateway vercel | Qwen3 Max Preview alibaba/qwen3-max-preview | qwen | text | text | 262,144 | 32,768 | $1.2 / 1M | $6 / 1M | No | Yes | Sep 23, 2025 | |
Vercel AI Gateway vercel | Qwen3 Next 80B A3B Instruct alibaba/qwen3-next-80b-a3b-instruct | qwen | text | text | 262,144 | 32,768 | $0.09 / 1M | $1.1 / 1M | No | Yes | Sep 12, 2025 | |
Vertex google-vertex | Kimi K2 Thinking moonshotai/kimi-k2-thinking-maas | kimi-thinking | text | text | 262,144 | 262,144 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 13, 2025 | |
Vertex google-vertex | Qwen3 235B A22B Instruct qwen/qwen3-235b-a22b-instruct-2507-maas | qwen | text | text | 262,144 | 16,384 | $0.22 / 1M | $0.88 / 1M | Yes | Yes | Aug 13, 2025 | |
Weights & Biases wandb | NVIDIA Nemotron 3 Super 120B nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 | nemotron | text | text | 262,144 | 262,144 | $0.2 / 1M | $0.8 / 1M | No | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.1 / 1M | No | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | qwen | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.3 / 1M | No | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,144 | 262,144 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 262,144 | 262,144 | $1 / 1M | $1.5 / 1M | No | Yes | Mar 12, 2026 | |
Vercel AI Gateway vercel | Kimi K2 Thinking Turbo moonshotai/kimi-k2-thinking-turbo | kimi-thinking | text | text | 262,114 | 262,114 | $1.15 / 1M | $8 / 1M | Yes | Yes | Nov 6, 2025 | |
Vercel AI Gateway vercel | MiniMax M2 minimax/minimax-m2 | minimax | text | text | 262,114 | 262,114 | $0.27 / 1M | $1.15 / 1M | Yes | Yes | Oct 27, 2025 | |
OpenCode Zen opencode | Ling 2.6 Flash Free ling-2.6-flash-free | ling-flash-free | text | text | 262,100 | 32,800 | $0 / 1M | $0 / 1M | No | Yes | Apr 21, 2026 | |
Vercel AI Gateway vercel | Trinity Large Thinking arcee-ai/trinity-large-thinking | trinity | text | text | 262,100 | 80,000 | $0.25 / 1M | $0.9 / 1M | Yes | Yes | Apr 3, 2026 | |
Amazon Bedrock amazon-bedrock | Qwen/Qwen3-Next-80B-A3B-Instruct qwen.qwen3-next-80b-a3b | qwen | text | text | 262,000 | 262,000 | $0.14 / 1M | $1.4 / 1M | No | Yes | Nov 25, 2025 | |
Azure azure | Grok 4.20 (Non-Reasoning) grok-4-20-non-reasoning | grok | text | text | 262,000 | 8,192 | $2 / 1M | $6 / 1M | No | Yes | Apr 8, 2026 | |
Azure azure | Grok 4.20 (Reasoning) grok-4-20-reasoning | grok | text | text | 262,000 | 8,192 | $2 / 1M | $6 / 1M | Yes | Yes | Apr 8, 2026 | |
Cortecs cortecs | Devstral 2 2512 devstral-2512 | Unknown | text | text | 262,000 | 262,000 | $0 / 1M | $0 / 1M | No | Yes | Dec 9, 2025 | |
Cortecs cortecs | Kimi K2 Thinking kimi-k2-thinking | Unknown | text | text | 262,000 | 262,000 | $0.656 / 1M | $2.731 / 1M | Yes | Yes | Dec 8, 2025 | |
Cortecs cortecs | Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct | qwen | text | text | 262,000 | 262,000 | $0.441 / 1M | $1.984 / 1M | No | Yes | Jul 25, 2025 | |
Meganova meganova | Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,000 | 262,000 | $0.09 / 1M | $0.6 / 1M | No | Yes | Jul 23, 2025 | |
NanoGPT nano-gpt | Qwen3 30B A3B Instruct 2507 TEE TEE/qwen3-30b-a3b-instruct-2507 | qwen | text | text | 262,000 | 32,768 | $0.15 / 1M | $0.45 / 1M | No | No | Jul 29, 2025 | |
OpenRouter openrouter | Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | qwen | text | text | 262,000 | 262,000 | $0.2 / 1M | $0.8 / 1M | No | Yes | Jul 29, 2025 | |
OpenRouter openrouter | Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | qwen | text | text | 262,000 | 262,000 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Jul 29, 2025 | |
Qiniu qiniu-ai | Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct | Unknown | text | text | 262,000 | 4,096 | Unknown | Unknown | No | Yes | Aug 14, 2025 | |
SiliconFlow (China) siliconflow-cn | ByteDance-Seed/Seed-OSS-36B-Instruct ByteDance-Seed/Seed-OSS-36B-Instruct | seed | text | text | 262,000 | 262,000 | $0.21 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | moonshotai/Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,000 | 262,000 | $0.4 / 1M | $2 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | moonshotai/Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,000 | 262,000 | $0.55 / 1M | $2.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/moonshotai/Kimi-K2-Instruct-0905 Pro/moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,000 | 262,000 | $0.4 / 1M | $2 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/moonshotai/Kimi-K2-Thinking Pro/moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,000 | 262,000 | $0.55 / 1M | $2.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-235B-A22B-Instruct-2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,000 | 262,000 | $0.09 / 1M | $0.6 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,000 | 262,000 | $0.13 / 1M | $0.6 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | qwen | text | text | 262,000 | 262,000 | $0.09 / 1M | $0.3 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-30B-A3B-Thinking-2507 Qwen/Qwen3-30B-A3B-Thinking-2507 | qwen | text | text | 262,000 | 131,000 | $0.09 / 1M | $0.3 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Coder-30B-A3B-Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct | qwen | text | text | 262,000 | 262,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 262,000 | 262,000 | $0.25 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct | qwen | text | text | 262,000 | 262,000 | $0.14 / 1M | $1.4 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking | qwen | text | text | 262,000 | 262,000 | $0.14 / 1M | $0.57 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | stepfun-ai/Step-3.5-Flash stepfun-ai/Step-3.5-Flash | step | text | text | 262,000 | 262,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Feb 11, 2026 | |
SiliconFlow siliconflow | ByteDance-Seed/Seed-OSS-36B-Instruct ByteDance-Seed/Seed-OSS-36B-Instruct | seed | text | text | 262,000 | 262,000 | $0.21 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | moonshotai/Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 262,000 | 262,000 | $0.4 / 1M | $2 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | moonshotai/Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 262,000 | 262,000 | $0.55 / 1M | $2.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-235B-A22B-Instruct-2507 Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 262,000 | 262,000 | $0.09 / 1M | $0.6 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 262,000 | 262,000 | $0.13 / 1M | $0.6 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | qwen | text | text | 262,000 | 262,000 | $0.09 / 1M | $0.3 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-30B-A3B-Thinking-2507 Qwen/Qwen3-30B-A3B-Thinking-2507 | qwen | text | text | 262,000 | 131,000 | $0.09 / 1M | $0.3 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Coder-30B-A3B-Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct | qwen | text | text | 262,000 | 262,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 262,000 | 262,000 | $0.25 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct | qwen | text | text | 262,000 | 262,000 | $0.14 / 1M | $1.4 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking | qwen | text | text | 262,000 | 262,000 | $0.14 / 1M | $0.57 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | stepfun-ai/Step-3.5-Flash stepfun-ai/Step-3.5-Flash | step | text | text | 262,000 | 262,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Feb 11, 2026 | |
ZenMux zenmux | Kimi K2 0905 moonshotai/kimi-k2-0905 | Unknown | text | text | 262,000 | 64,000 | $0.6 / 1M | $2.5 / 1M | No | Yes | Sep 4, 2025 | |
ZenMux zenmux | Kimi K2 Thinking moonshotai/kimi-k2-thinking | Unknown | text | text | 262,000 | 64,000 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
ZenMux zenmux | Kimi K2 Thinking Turbo moonshotai/kimi-k2-thinking-turbo | Unknown | text | text | 262,000 | 64,000 | $1.15 / 1M | $8 / 1M | Yes | Yes | Nov 6, 2025 | |
ZenMux zenmux | MiMo-V2-Flash xiaomi/mimo-v2-flash | Unknown | text | text | 262,000 | 64,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Dec 17, 2025 | |
Scaleway scaleway | Qwen3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 | qwen | text | text | 260,000 | 16,384 | $0.75 / 1M | $2.25 / 1M | No | Yes | Mar 17, 2026 | |
302.AI 302ai | qwen3-max-2025-09-23 qwen3-max-2025-09-23 | Unknown | text | text | 258,048 | 65,536 | $0.86 / 1M | $3.43 / 1M | No | Yes | Sep 24, 2025 | |
NanoGPT nano-gpt | Qwen3.5 397B A17B TEE TEE/qwen3.5-397b-a17b | qwen | text | text | 258,048 | 65,536 | $0.6 / 1M | $3.6 / 1M | No | No | Feb 28, 2026 | |
Abacus abacus | Kimi K2 Turbo Preview kimi-k2-turbo-preview | kimi | text | text | 256,000 | 8,192 | $0.15 / 1M | $8 / 1M | No | Yes | Jul 8, 2025 | |
Amazon Bedrock amazon-bedrock | Devstral 2 123B mistral.devstral-2-123b | devstral | text | text | 256,000 | 8,192 | $0.4 / 1M | $2 / 1M | No | Yes | Feb 17, 2026 | |
Amazon Bedrock amazon-bedrock | Kimi K2 Thinking moonshot.kimi-k2-thinking | kimi-thinking | text | text | 256,000 | 256,000 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Dec 2, 2025 | |
Azure azure | Codestral 25.01 codestral-2501 | codestral | text | text | 256,000 | 256,000 | $0.3 / 1M | $0.9 / 1M | No | Yes | Jan 1, 2025 | |
Azure Cognitive Services azure-cognitive-services | Codestral 25.01 codestral-2501 | codestral | text | text | 256,000 | 256,000 | $0.3 / 1M | $0.9 / 1M | No | Yes | Jan 1, 2025 | |
Azure Cognitive Services azure-cognitive-services | Command A cohere-command-a | command-a | text | text | 256,000 | 8,000 | $2.5 / 1M | $10 / 1M | Yes | Yes | Mar 13, 2025 | |
Azure Cognitive Services azure-cognitive-services | Grok 4 grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
Azure Cognitive Services azure-cognitive-services | Grok Code Fast 1 grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 28, 2025 | |
Azure azure | Command A cohere-command-a | command-a | text | text | 256,000 | 8,000 | $2.5 / 1M | $10 / 1M | Yes | Yes | Mar 13, 2025 | |
Azure azure | Grok 4 grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
Azure azure | Grok Code Fast 1 grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 28, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Nemotron 3 Super 120B workers-ai/@cf/nvidia/nemotron-3-120b-a12b | nemotron | text | text | 256,000 | 256,000 | $0.5 / 1M | $1.5 / 1M | Yes | Yes | Mar 11, 2026 | |
Cloudflare Workers AI cloudflare-workers-ai | Nemotron 3 Super 120B @cf/nvidia/nemotron-3-120b-a12b | nemotron | text | text | 256,000 | 256,000 | $0.5 / 1M | $1.5 / 1M | Yes | Yes | Mar 11, 2026 | |
Cohere cohere | Command A command-a-03-2025 | command-a | text | text | 256,000 | 8,000 | $2.5 / 1M | $10 / 1M | No | Yes | Mar 13, 2025 | |
Cohere cohere | Command A Reasoning command-a-reasoning-08-2025 | command-a | text | text | 256,000 | 32,000 | $2.5 / 1M | $10 / 1M | Yes | Yes | Aug 21, 2025 | |
Cortecs cortecs | Qwen3 Coder Next 80B qwen3-coder-next | qwen | text | text | 256,000 | 65,536 | $0.158 / 1M | $0.84 / 1M | Yes | Yes | Feb 4, 2026 | |
DigitalOcean digitalocean | Nemotron-3-Super-120B nvidia-nemotron-3-super-120b | nemotron | text | text | 256,000 | 32,768 | $0.3 / 1M | $0.65 / 1M | Yes | Yes | Apr 16, 2026 | |
DigitalOcean digitalocean | Trinity Large Thinking arcee-trinity-large-thinking | trinity | text | text | 256,000 | 128,000 | $0.25 / 1M | $0.9 / 1M | Yes | Yes | Apr 16, 2026 | |
FastRouter fastrouter | Grok 4 x-ai/grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
Fireworks AI fireworks-ai | Kimi K2 Thinking accounts/fireworks/models/kimi-k2-thinking | kimi-thinking | text | text | 256,000 | 256,000 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Nov 6, 2025 | |
Firmware firmware | Grok 4.1 Fast (Reasoning) grok-code-fast-1 | grok | text | text | 256,000 | 128,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 28, 2025 | |
GitHub Models github-models | AI21 Jamba 1.5 Large ai21-labs/ai21-jamba-1.5-large | jamba | text | text | 256,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 29, 2024 | |
GitHub Models github-models | AI21 Jamba 1.5 Mini ai21-labs/ai21-jamba-1.5-mini | jamba | text | text | 256,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 29, 2024 | |
Helicone helicone | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 256,000 | 262,144 | $0.48 / 1M | $2 / 1M | No | Yes | Nov 6, 2025 | |
Helicone helicone | xAI Grok 4 grok-4 | grok | text | text | 256,000 | 256,000 | $3 / 1M | $15 / 1M | No | Yes | Jul 9, 2024 | |
Helicone helicone | xAI Grok Code Fast 1 grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | No | Yes | Aug 25, 2024 | |
iFlow iflowcn | Kimi-K2-0905 kimi-k2-0905 | kimi | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | No | Yes | Sep 5, 2025 | |
iFlow iflowcn | Qwen3-235B-A22B-Instruct qwen3-235b-a22b-instruct | qwen | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | No | Yes | Jul 1, 2025 | |
iFlow iflowcn | Qwen3-235B-A22B-Thinking qwen3-235b-a22b-thinking-2507 | qwen | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 1, 2025 | |
iFlow iflowcn | Qwen3-Coder-Plus qwen3-coder-plus | qwen | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | No | Yes | Jul 1, 2025 | |
iFlow iflowcn | Qwen3-Max qwen3-max | qwen | text | text | 256,000 | 32,000 | $0 / 1M | $0 / 1M | No | Yes | Jan 1, 2025 | |
iFlow iflowcn | Qwen3-Max-Preview qwen3-max-preview | qwen | text | text | 256,000 | 32,000 | $0 / 1M | $0 / 1M | No | Yes | Jan 1, 2025 | |
Kilo Gateway kilo | AI21: Jamba Large 1.7 ai21/jamba-large-1.7 | Unknown | text | text | 256,000 | 4,096 | $2 / 1M | $8 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Cohere: Command A cohere/command-a | Unknown | text | text | 256,000 | 8,192 | $2.5 / 1M | $10 / 1M | No | No | Mar 13, 2025 | |
Kilo Gateway kilo | Kwaipilot: KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2 | Unknown | text | text | 256,000 | 80,000 | $0.3 / 1M | $1.2 / 1M | No | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Mistral: Codestral 2508 mistralai/codestral-2508 | Unknown | text | text | 256,000 | 51,200 | $0.3 / 1M | $0.9 / 1M | No | Yes | Aug 1, 2025 | |
Kilo Gateway kilo | Relace: Relace Apply 3 relace/relace-apply-3 | Unknown | text | text | 256,000 | 128,000 | $0.85 / 1M | $1.25 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Relace: Relace Search relace/relace-search | Unknown | text | text | 256,000 | 128,000 | $1 / 1M | $3 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | StepFun: Step 3.5 Flash stepfun/step-3.5-flash | Unknown | text | text | 256,000 | 256,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Jan 29, 2026 | |
Kilo Gateway kilo | xAI: Grok Code Fast 1 x-ai/grok-code-fast-1 | Unknown | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 26, 2025 | |
Kilo Gateway kilo | xAI: Grok Code Fast 1 Optimized (experimental, free) x-ai/grok-code-fast-1:optimized:free | Unknown | text | text | 256,000 | 10,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | Codestral codestral-2508 | mistral | text | text | 256,000 | 16,384 | $0.3 / 1M | $0.9 / 1M | No | No | Jul 30, 2025 | |
LLM Gateway llmgateway | Grok 4 grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
LLM Gateway llmgateway | Grok 4 (0709) grok-4-0709 | grok | text | text | 256,000 | 256,000 | $3 / 1M | $15 / 1M | No | Yes | Jul 9, 2025 | |
LLM Gateway llmgateway | Grok Code Fast 1 grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 28, 2025 | |
LLM Gateway llmgateway | MiMo-V2-Flash mimo-v2-flash | mimo | text | text | 256,000 | 64,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Feb 4, 2026 | |
Mistral mistral | Codestral (latest) codestral-latest | codestral | text | text | 256,000 | 4,096 | $0.3 / 1M | $0.9 / 1M | No | Yes | Jan 4, 2025 | |
NanoGPT nano-gpt | Codestral 2508 mistralai/codestral-2508 | codestral | text | text | 256,000 | 32,768 | $0.3 / 1M | $0.9 / 1M | No | No | Aug 1, 2025 | |
NanoGPT nano-gpt | Cohere Command A (08/2025) command-a-reasoning-08-2025 | Unknown | text | text | 256,000 | 8,192 | $2.5 / 1M | $10 / 1M | No | No | Aug 22, 2025 | |
NanoGPT nano-gpt | Doubao 1.5 Pro 256k doubao-1.5-pro-256k | Unknown | text | text | 256,000 | 16,384 | $0.799 / 1M | $1.445 / 1M | No | No | Mar 12, 2025 | |
NanoGPT nano-gpt | Doubao Seed 1.6 doubao-seed-1-6-250615 | Unknown | text | text | 256,000 | 16,384 | $0.204 / 1M | $0.51 / 1M | No | No | Jun 15, 2025 | |
NanoGPT nano-gpt | Doubao Seed 1.6 Flash doubao-seed-1-6-flash-250615 | Unknown | text | text | 256,000 | 16,384 | $0.0374 / 1M | $0.374 / 1M | No | No | Jun 15, 2025 | |
NanoGPT nano-gpt | Doubao Seed 1.6 Thinking doubao-seed-1-6-thinking-250615 | Unknown | text | text | 256,000 | 16,384 | $0.204 / 1M | $2.04 / 1M | No | No | Jun 15, 2025 | |
NanoGPT nano-gpt | Doubao Seed 2.0 Code Preview doubao-seed-2-0-code-preview-260215 | Unknown | text | text | 256,000 | 128,000 | $0.782 / 1M | $3.893 / 1M | No | No | Feb 14, 2026 | |
NanoGPT nano-gpt | Doubao Seed 2.0 Lite doubao-seed-2-0-lite-260215 | Unknown | text | text | 256,000 | 32,000 | $0.1462 / 1M | $0.8738 / 1M | No | No | Feb 14, 2026 | |
NanoGPT nano-gpt | Doubao Seed 2.0 Mini doubao-seed-2-0-mini-260215 | Unknown | text | text | 256,000 | 32,000 | $0.0493 / 1M | $0.4845 / 1M | No | No | Feb 14, 2026 | |
NanoGPT nano-gpt | Doubao Seed 2.0 Pro doubao-seed-2-0-pro-260215 | Unknown | text | text | 256,000 | 128,000 | $0.782 / 1M | $3.876 / 1M | No | No | Feb 14, 2026 | |
NanoGPT nano-gpt | Doubao Seed Code Preview doubao-seed-code-preview-latest | Unknown | text | text | 256,000 | 16,384 | $0.1 / 1M | $0.4 / 1M | Yes | No | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT-5 Codex openai/gpt-5-codex | gpt-codex | text | text | 256,000 | 32,768 | $9.996 / 1M | $19.992 / 1M | No | No | Sep 15, 2025 | |
NanoGPT nano-gpt | Grok Code Fast 1 x-ai/grok-code-fast-1 | grok | text | text | 256,000 | 131,072 | $0.2 / 1M | $1.5 / 1M | Yes | No | Aug 28, 2025 | |
NanoGPT nano-gpt | Hunyuan T1 hunyuan-t1-latest | Unknown | text | text | 256,000 | 16,384 | $0.17 / 1M | $0.66 / 1M | No | No | Mar 22, 2025 | |
NanoGPT nano-gpt | Jamba Large jamba-large | Unknown | text | text | 256,000 | 4,096 | $1.989 / 1M | $7.99 / 1M | No | No | Jul 9, 2025 | |
NanoGPT nano-gpt | Jamba Large 1.6 jamba-large-1.6 | Unknown | text | text | 256,000 | 4,096 | $1.989 / 1M | $7.99 / 1M | No | No | Mar 12, 2025 | |
NanoGPT nano-gpt | Jamba Large 1.7 jamba-large-1.7 | Unknown | text | text | 256,000 | 4,096 | $1.989 / 1M | $7.99 / 1M | No | No | Jul 9, 2025 | |
NanoGPT nano-gpt | Jamba Mini jamba-mini | Unknown | text | text | 256,000 | 4,096 | $0.1989 / 1M | $0.408 / 1M | No | No | Jul 9, 2025 | |
NanoGPT nano-gpt | Jamba Mini 1.6 jamba-mini-1.6 | Unknown | text | text | 256,000 | 4,096 | $0.1989 / 1M | $0.408 / 1M | No | No | Mar 1, 2025 | |
NanoGPT nano-gpt | Jamba Mini 1.7 jamba-mini-1.7 | Unknown | text | text | 256,000 | 4,096 | $0.1989 / 1M | $0.408 / 1M | No | No | Jul 9, 2025 | |
NanoGPT nano-gpt | KAT Coder Pro V1 KAT-Coder-Pro-V1 | Unknown | text | text | 256,000 | 32,768 | $1.5 / 1M | $6 / 1M | No | No | Oct 28, 2025 | |
NanoGPT nano-gpt | Kimi K2 0905 moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 256,000 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Sep 25, 2025 | |
NanoGPT nano-gpt | Kimi K2 Instruct moonshotai/kimi-k2-instruct | kimi | text | text | 256,000 | 8,192 | $0.1 / 1M | $2 / 1M | No | Yes | Jul 1, 2025 | |
NanoGPT nano-gpt | Kimi K2 Thinking moonshotai/kimi-k2-thinking | kimi-thinking | text | text | 256,000 | 262,144 | $0.3 / 1M | $1.2 / 1M | No | Yes | Nov 6, 2025 | |
NanoGPT nano-gpt | Kimi K2 Thinking Original moonshotai/kimi-k2-thinking-original | kimi-thinking | text | text | 256,000 | 16,384 | $0.6 / 1M | $2.5 / 1M | Yes | No | Nov 6, 2025 | |
NanoGPT nano-gpt | Kimi K2 Thinking Turbo Original moonshotai/kimi-k2-thinking-turbo-original | kimi-thinking | text | text | 256,000 | 16,384 | $1.15 / 1M | $8 / 1M | Yes | No | Nov 6, 2025 | |
NanoGPT nano-gpt | MiMo V2 Flash xiaomi/mimo-v2-flash | mimo | text | text | 256,000 | 32,768 | $0.102 / 1M | $0.306 / 1M | No | No | Dec 17, 2025 | |
NanoGPT nano-gpt | MiMo V2 Flash (Thinking) xiaomi/mimo-v2-flash-thinking | mimo | text | text | 256,000 | 32,768 | $0.102 / 1M | $0.306 / 1M | No | No | Dec 17, 2025 | |
NanoGPT nano-gpt | MiMo V2 Flash (Thinking) Original xiaomi/mimo-v2-flash-thinking-original | mimo | text | text | 256,000 | 32,768 | $0.102 / 1M | $0.306 / 1M | No | No | Dec 17, 2025 | |
NanoGPT nano-gpt | MiMo V2 Flash Original xiaomi/mimo-v2-flash-original | mimo | text | text | 256,000 | 32,768 | $0.102 / 1M | $0.306 / 1M | No | No | Dec 17, 2025 | |
NanoGPT nano-gpt | Nvidia Nemotron 3 Nano 30B nvidia/nemotron-3-nano-30b-a3b | nemotron | text | text | 256,000 | 262,144 | $0.17 / 1M | $0.68 / 1M | No | No | Dec 15, 2025 | |
NanoGPT nano-gpt | Qwen3 30B A3B Instruct 2507 qwen3-30b-a3b-instruct-2507 | Unknown | text | text | 256,000 | 32,768 | $0.2 / 1M | $0.5 / 1M | No | No | Feb 20, 2025 | |
NanoGPT nano-gpt | Qwen3 Max 2026-01-23 qwen3-max-2026-01-23 | Unknown | text | text | 256,000 | 32,768 | $1.2002 / 1M | $6.001 / 1M | No | No | Jan 26, 2026 | |
NanoGPT nano-gpt | Step 3.5 Flash stepfun-ai/step-3.5-flash | step | text | text | 256,000 | 256,000 | $0.2 / 1M | $0.5 / 1M | Yes | No | Feb 2, 2026 | |
NanoGPT nano-gpt | Step 3.5 Flash Thinking stepfun-ai/step-3.5-flash:thinking | step | text | text | 256,000 | 256,000 | $0.2 / 1M | $0.5 / 1M | Yes | No | Feb 2, 2026 | |
Nebius Token Factory nebius | Nemotron-3-Super-120B-A12B nvidia/nemotron-3-super-120b-a12b | Unknown | text | text | 256,000 | 32,768 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Mar 12, 2026 | |
NovitaAI novita-ai | Kat Coder Pro kwaipilot/kat-coder-pro | Unknown | text | text | 256,000 | 128,000 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jan 5, 2026 | |
Nvidia nvidia | Step 3.5 Flash stepfun-ai/step-3.5-flash | Unknown | text | text | 256,000 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 2, 2026 | |
OpenCode Zen opencode | Grok Code Fast 1 grok-code | grok | text | text | 256,000 | 256,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 20, 2025 | |
OpenCode Zen opencode | Hy3 preview Free hy3-preview-free | hy3-free | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 20, 2026 | |
OpenRouter openrouter | Codestral 2508 mistralai/codestral-2508 | codestral | text | text | 256,000 | 256,000 | $0.3 / 1M | $0.9 / 1M | No | Yes | Aug 1, 2025 | |
OpenRouter openrouter | Grok 4 x-ai/grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
OpenRouter openrouter | Grok Code Fast 1 x-ai/grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 26, 2025 | |
OpenRouter openrouter | Nemotron 3 Nano 30B A3B (free) nvidia/nemotron-3-nano-30b-a3b:free | nemotron | text | text | 256,000 | 256,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 31, 2026 | |
OpenRouter openrouter | Step 3.5 Flash stepfun/step-3.5-flash | step | text | text | 256,000 | 256,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Jan 29, 2026 | |
Poe poe | Grok Code Fast 1 xai/grok-code-fast-1 | grok | text | text | 256,000 | 128,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 22, 2025 | |
Poe poe | kimi-k2-thinking novita/kimi-k2-thinking | kimi | text | text | 256,000 | Unknown | Unknown | Unknown | Yes | Yes | Nov 7, 2025 | |
Qiniu qiniu-ai | Kimi K2 0905 moonshotai/kimi-k2-0905 | Unknown | text | text | 256,000 | 100,000 | Unknown | Unknown | No | Yes | Sep 8, 2025 | |
Qiniu qiniu-ai | Kimi K2 Thinking moonshotai/kimi-k2-thinking | Unknown | text | text | 256,000 | 100,000 | Unknown | Unknown | No | Yes | Nov 7, 2025 | |
Qiniu qiniu-ai | Meituan/Longcat-Flash-Lite meituan/longcat-flash-lite | Unknown | text | text | 256,000 | 320,000 | Unknown | Unknown | No | Yes | Feb 6, 2026 | |
Qiniu qiniu-ai | Mimo-V2-Flash mimo-v2-flash | Unknown | text | text | 256,000 | 256,000 | Unknown | Unknown | Yes | Yes | Dec 17, 2025 | |
Qiniu qiniu-ai | Qwen3 Max Preview qwen3-max-preview | Unknown | text | text | 256,000 | 64,000 | Unknown | Unknown | No | Yes | Sep 6, 2025 | |
Qiniu qiniu-ai | x-AI/Grok-Code-Fast 1 x-ai/grok-code-fast-1 | Unknown | text | text | 256,000 | 10,000 | Unknown | Unknown | Yes | Yes | Sep 2, 2025 | |
Qiniu qiniu-ai | Xiaomi/Mimo-V2-Flash xiaomi/mimo-v2-flash | Unknown | text | text | 256,000 | 256,000 | Unknown | Unknown | Yes | No | Dec 26, 2025 | |
Scaleway scaleway | Devstral 2 123B Instruct (2512) devstral-2-123b-instruct-2512 | devstral | text | text | 256,000 | 16,384 | $0.4 / 1M | $2 / 1M | No | Yes | Mar 17, 2026 | |
StepFun stepfun | Step 3.5 Flash step-3.5-flash | Unknown | text | text | 256,000 | 256,000 | $0.096 / 1M | $0.288 / 1M | Yes | Yes | Feb 13, 2026 | |
StepFun stepfun | Step 3.5 Flash 2603 step-3.5-flash-2603 | Unknown | text | text | 256,000 | 256,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Apr 2, 2026 | |
Synthetic synthetic | Qwen 3 235B Instruct hf:Qwen/Qwen3-235B-A22B-Instruct-2507 | qwen | text | text | 256,000 | 32,000 | $0.2 / 1M | $0.6 / 1M | No | Yes | Jul 21, 2025 | |
Synthetic synthetic | Qwen 3 Coder 480B hf:Qwen/Qwen3-Coder-480B-A35B-Instruct | qwen | text | text | 256,000 | 32,000 | $2 / 1M | $2 / 1M | No | Yes | Jul 23, 2025 | |
Synthetic synthetic | Qwen3 235B A22B Thinking 2507 hf:Qwen/Qwen3-235B-A22B-Thinking-2507 | qwen | text | text | 256,000 | 32,000 | $0.65 / 1M | $3 / 1M | Yes | Yes | Jul 25, 2025 | |
Tencent TokenHub tencent-tokenhub | Hy3 preview hy3-preview | Hy | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 20, 2026 | |
Venice AI venice | GPT-5.2 openai-gpt-52 | gpt | text | text | 256,000 | 65,536 | $2.19 / 1M | $17.5 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | Kimi K2 Thinking kimi-k2-thinking | kimi-thinking | text | text | 256,000 | 65,536 | $0.75 / 1M | $3.2 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | Mistral Small 3.2 24B Instruct mistral-small-3-2-24b-instruct | mistral-small | text | text | 256,000 | 16,384 | $0.09375 / 1M | $0.25 / 1M | No | Yes | Mar 16, 2026 | |
Venice AI venice | Nemotron Cascade 2 30B A3B nvidia-nemotron-cascade-2-30b-a3b | nemotron | text | text | 256,000 | 32,768 | $0.14 / 1M | $0.8 / 1M | Yes | Yes | Apr 9, 2026 | |
Venice AI venice | Qwen 3 Coder 480b qwen3-coder-480b-a35b-instruct | qwen | text | text | 256,000 | 65,536 | $0.75 / 1M | $3 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | Qwen 3 Coder 480B Turbo qwen3-coder-480b-a35b-instruct-turbo | qwen | text | text | 256,000 | 65,536 | $0.35 / 1M | $1.5 / 1M | No | Yes | Feb 26, 2026 | |
Venice AI venice | Qwen 3 Next 80b qwen3-next-80b | qwen | text | text | 256,000 | 16,384 | $0.35 / 1M | $1.9 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | Trinity Large Thinking arcee-trinity-large-thinking | trinity | text | text | 256,000 | 65,536 | $0.3125 / 1M | $1.125 / 1M | Yes | Yes | Apr 4, 2026 | |
Vercel AI Gateway vercel | Codestral (latest) mistral/codestral | codestral | text | text | 256,000 | 4,096 | $0.3 / 1M | $0.9 / 1M | No | Yes | Jan 4, 2025 | |
Vercel AI Gateway vercel | Command A cohere/command-a | command | text | text | 256,000 | 8,000 | $2.5 / 1M | $10 / 1M | No | Yes | Mar 13, 2025 | |
Vercel AI Gateway vercel | Devstral 2 mistral/devstral-2 | devstral | text | text | 256,000 | 256,000 | Unknown | Unknown | No | Yes | Dec 9, 2025 | |
Vercel AI Gateway vercel | Devstral Small 2 mistral/devstral-small-2 | devstral | text | text | 256,000 | 256,000 | Unknown | Unknown | No | Yes | May 7, 2025 | |
Vercel AI Gateway vercel | Grok 4 xai/grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
Vercel AI Gateway vercel | Grok Code Fast 1 xai/grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 28, 2025 | |
Vercel AI Gateway vercel | Kat Coder Pro V2 kwaipilot/kat-coder-pro-v2 | kat-coder | text | text | 256,000 | 256,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 30, 2026 | |
Vercel AI Gateway vercel | KAT-Coder-Pro V1 kwaipilot/kat-coder-pro-v1 | kat-coder | text | text | 256,000 | 32,000 | Unknown | Unknown | Yes | No | Oct 24, 2025 | |
Vercel AI Gateway vercel | Kimi K2 Turbo moonshotai/kimi-k2-turbo | kimi | text | text | 256,000 | 16,384 | $2.4 / 1M | $10 / 1M | No | Yes | Sep 5, 2025 | |
Vercel AI Gateway vercel | NVIDIA Nemotron 3 Super 120B A12B nvidia/nemotron-3-super-120b-a12b | nemotron | text | text | 256,000 | 32,000 | $0.15 / 1M | $0.65 / 1M | No | No | Mar 30, 2026 | |
Vercel AI Gateway vercel | Qwen 3 Max Thinking alibaba/qwen3-max-thinking | qwen | text | text | 256,000 | 65,536 | $1.2 / 1M | $6 / 1M | Yes | Yes | Jan 1, 2025 | |
Vercel AI Gateway vercel | Qwen3 Coder Next alibaba/qwen3-coder-next | qwen | text | text | 256,000 | 256,000 | $0.5 / 1M | $1.2 / 1M | Yes | Yes | Feb 19, 2026 | |
Vercel AI Gateway vercel | Seed 1.6 bytedance/seed-1.6 | seed | text | text | 256,000 | 32,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Sep 1, 2025 | |
xAI xai | Grok 4 grok-4 | grok | text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
xAI xai | Grok Code Fast 1 grok-code-fast-1 | grok | text | text | 256,000 | 10,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 28, 2025 | |
Xiaomi xiaomi | MiMo-V2-Flash mimo-v2-flash | mimo | text | text | 256,000 | 64,000 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Feb 4, 2026 | |
ZenMux zenmux | Agnes 1.5 Pro sapiens-ai/agnes-1.5-pro | Unknown | text | text | 256,000 | 256,000 | $0.16 / 1M | $0.8 / 1M | Yes | Yes | Mar 21, 2026 | |
ZenMux zenmux | Doubao Seed 2.0 Code volcengine/doubao-seed-2.0-code | Unknown | text | text | 256,000 | 32,000 | $0.9 / 1M | $4.48 / 1M | No | Yes | Mar 20, 2026 | |
ZenMux zenmux | Grok Code Fast 1 x-ai/grok-code-fast-1 | Unknown | text | text | 256,000 | 64,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Aug 26, 2025 | |
ZenMux zenmux | Hy3 preview tencent/hy3-preview | Hy | text | text | 256,000 | 64,000 | $0.172 / 1M | $0.572 / 1M | Yes | Yes | Apr 20, 2026 | |
ZenMux zenmux | KAT-Coder-Pro-V2 kuaishou/kat-coder-pro-v2 | Unknown | text | text | 256,000 | 80,000 | $0.3 / 1M | $1.2 / 1M | No | Yes | Mar 30, 2026 | |
ZenMux zenmux | Qwen3-Max-Thinking qwen/qwen3-max | Unknown | text | text | 256,000 | 64,000 | $1.2 / 1M | $6 / 1M | Yes | Yes | Jan 23, 2026 | |
ZenMux zenmux | Step 3.5 Flash stepfun/step-3.5-flash | Unknown | text | text | 256,000 | 64,000 | $0.1 / 1M | $0.3 / 1M | No | Yes | Feb 2, 2026 | |
ZenMux zenmux | Step 3.5 Flash (Free) stepfun/step-3.5-flash-free | Unknown | text | text | 256,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 2, 2026 | |
Vultr vultr | Kimi K2 Instruct Kimi-K2.5 | kimi | text | text | 254,000 | 32,768 | $0.55 / 1M | $2.75 / 1M | No | Yes | Jan 27, 2026 | |
AIHubMix aihubmix | Qwen3 Max qwen3-max-2026-01-23 | qwen | text | text | 252,000 | 32,000 | $0.34246 / 1M | $1.36984 / 1M | No | Yes | Sep 23, 2025 | |
LucidQuery AI lucidquery | LucidQuery Nexus Coder lucidquery-nexus-coder | lucid | text | text | 250,000 | 60,000 | $2 / 1M | $5 / 1M | Yes | Yes | Sep 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3.6 Max Preview qwen3.6-max-preview | qwen | text | text | 245,800 | 65,536 | $1.32 / 1M | $7.9 / 1M | Yes | Yes | Apr 21, 2026 | |
NanoGPT nano-gpt | Qwen3.6 Max Preview qwen3.6-max-preview | qwen3.6 | text | text | 245,800 | 65,536 | $1.3 / 1M | $7.8 / 1M | No | No | Apr 21, 2026 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted Steam GLM-4.5-Air-Derestricted-Steam | Unknown | text | text | 220,600 | 65,536 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 28, 2025 | |
Vercel AI Gateway vercel | Kimi K2 Thinking moonshotai/kimi-k2-thinking | kimi-thinking | text | text | 216,144 | 216,144 | $0.47 / 1M | $2 / 1M | Yes | Yes | Nov 6, 2025 | |
Poe poe | glm-4.7 novita/glm-4.7 | Unknown | text | text | 205,000 | 131,072 | Unknown | Unknown | Yes | Yes | Dec 22, 2025 | |
Poe poe | glm-4.7-n novita/glm-4.7-n | Unknown | text | text | 205,000 | 131,072 | Unknown | Unknown | Yes | Yes | Dec 22, 2025 | |
Poe poe | GLM-5 novita/glm-5 | Unknown | text | text | 205,000 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 15, 2026 | |
Poe poe | minimax-m2.1 novita/minimax-m2.1 | Unknown | text | text | 205,000 | 131,072 | Unknown | Unknown | Yes | Yes | Dec 26, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/zai-org/GLM-4.7 Pro/zai-org/GLM-4.7 | glm | text | text | 205,000 | 205,000 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/zai-org/GLM-5 Pro/zai-org/GLM-5 | glm | text | text | 205,000 | 205,000 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 12, 2026 | |
SiliconFlow (China) siliconflow-cn | Pro/zai-org/GLM-5.1 Pro/zai-org/GLM-5.1 | glm | text | text | 205,000 | 205,000 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 8, 2026 | |
SiliconFlow (China) siliconflow-cn | zai-org/GLM-4.6 zai-org/GLM-4.6 | glm | text | text | 205,000 | 205,000 | $0.5 / 1M | $1.9 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-4.6 zai-org/GLM-4.6 | glm | text | text | 205,000 | 205,000 | $0.5 / 1M | $1.9 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-4.7 zai-org/GLM-4.7 | glm | text | text | 205,000 | 205,000 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-5 zai-org/GLM-5 | glm | text | text | 205,000 | 205,000 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 12, 2026 | |
SiliconFlow siliconflow | zai-org/GLM-5.1 zai-org/GLM-5.1 | glm | text | text | 205,000 | 205,000 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 8, 2026 | |
302.AI 302ai | MiniMax-M2.7 MiniMax-M2.7 | Unknown | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | No | Yes | Mar 19, 2026 | |
302.AI 302ai | MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed | Unknown | text | text | 204,800 | 131,072 | $0.6 / 1M | $4.8 / 1M | No | Yes | Mar 19, 2026 | |
Abacus abacus | GLM-5 zai-org/glm-5 | glm | text | text | 204,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
AIHubMix aihubmix | Coding-GLM-5-Free coding-glm-5-free | glm | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 11, 2026 | |
AIHubMix aihubmix | Coding-MiniMax-M2.7-Free coding-minimax-m2.7-free | minimax | text | text | 204,800 | 13,100 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
AIHubMix aihubmix | MiniMax-M2.1 minimax-m2.1 | minimax | text | text | 204,800 | 192,000 | $0.288 / 1M | $1.152 / 1M | Yes | Yes | Dec 23, 2025 | |
Alibaba (China) alibaba-cn | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Alibaba (China) alibaba-cn | MiniMax-M2.7 MiniMax/MiniMax-M2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Amazon Bedrock amazon-bedrock | GLM-4.7 zai.glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Amazon Bedrock amazon-bedrock | MiniMax M2.1 minimax.minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
Baseten baseten | GLM-4.7 zai-org/GLM-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Clarifai clarifai | MiniMax-M2.5 High Throughput minimaxai/chat-completion/models/MiniMax-M2_5-high-throughput | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 25, 2026 | |
Cortecs cortecs | GLM-5.1 glm-5.1 | glm | text | text | 204,800 | 131,072 | $1.31 / 1M | $4.1 / 1M | Yes | Yes | Apr 14, 2026 | |
D.Run (China) drun | MiniMax M2.5 public/minimax-m25 | minimax | text | text | 204,800 | 131,072 | $0.29 / 1M | $1.16 / 1M | Yes | Yes | Mar 1, 2025 | |
Deep Infra deepinfra | GLM-4.6 zai-org/GLM-4.6 | glm | text | text | 204,800 | 131,072 | $0.43 / 1M | $1.74 / 1M | Yes | Yes | Sep 30, 2025 | |
Deep Infra deepinfra | MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.27 / 1M | $0.95 / 1M | Yes | Yes | Feb 12, 2026 | |
DigitalOcean digitalocean | MiniMax M2.5 minimax-m2.5 | minimax-m2.5 | text | text | 204,800 | 128,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Apr 16, 2026 | |
FastRouter fastrouter | GLM-5 z-ai/glm-5 | glm | text | text | 204,800 | 131,072 | $0.95 / 1M | $3.15 / 1M | Yes | Yes | Feb 11, 2026 | |
Helicone helicone | Zai GLM-4.6 glm-4.6 | glm | text | text | 204,800 | 131,072 | $0.45 / 1M | $1.5 / 1M | Yes | Yes | Jul 18, 2024 | |
Hugging Face huggingface | GLM-4.7 zai-org/GLM-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Hugging Face huggingface | MiniMax-M2.1 MiniMaxAI/MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
Hugging Face huggingface | MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Hugging Face huggingface | MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Jiekou.AI jiekou | GLM-4.7 zai-org/glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | Minimax M2.1 minimax/minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Kilo Auto Balanced kilo-auto/balanced | Unknown | text | text | 204,800 | 131,072 | $0.6 / 1M | $3 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Kilo Auto Free kilo-auto/free | Unknown | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | MiniMax: MiniMax M2.7 minimax/minimax-m2.7 | minimax-m2.7 | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 4.6 z-ai/glm-4.6 | Unknown | text | text | 204,800 | 204,800 | $0.39 / 1M | $1.9 / 1M | Yes | Yes | Mar 15, 2026 | |
KUAE Cloud Coding Plan kuae-cloud-coding-plan | GLM-4.7 GLM-4.7 | glm | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
LLM Gateway llmgateway | GLM-4.6 glm-4.6 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Sep 30, 2025 | |
LLM Gateway llmgateway | GLM-4.7 glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
LLM Gateway llmgateway | GLM-5 glm-5 | glm | text | text | 204,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
LLM Gateway llmgateway | MiniMax-M2.1 minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
LLM Gateway llmgateway | MiniMax-M2.5 minimax-m2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
LLM Gateway llmgateway | MiniMax-M2.5-highspeed minimax-m2.5-highspeed | minimax | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Feb 13, 2026 | |
LLM Gateway llmgateway | MiniMax-M2.7 minimax-m2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
LLM Gateway llmgateway | MiniMax-M2.7-highspeed minimax-m2.7-highspeed | minimax | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Mar 18, 2026 | |
Meganova meganova | MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
MiniMax (minimax.io) minimax | MiniMax-M2.1 MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
MiniMax (minimax.io) minimax | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
MiniMax (minimax.io) minimax | MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed | minimax | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Feb 13, 2026 | |
MiniMax (minimax.io) minimax | MiniMax-M2.7 MiniMax-M2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax (minimax.io) minimax | MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed | minimax | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax (minimaxi.com) minimax-cn | MiniMax-M2.1 MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
MiniMax (minimaxi.com) minimax-cn | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
MiniMax (minimaxi.com) minimax-cn | MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed | minimax | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Feb 13, 2026 | |
MiniMax (minimaxi.com) minimax-cn | MiniMax-M2.7 MiniMax-M2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax (minimaxi.com) minimax-cn | MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed | minimax | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax Coding Plan (minimax.io) minimax-coding-plan | MiniMax-M2.1 MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 23, 2025 | |
MiniMax Coding Plan (minimax.io) minimax-coding-plan | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
MiniMax Coding Plan (minimax.io) minimax-coding-plan | MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 13, 2026 | |
MiniMax Coding Plan (minimax.io) minimax-coding-plan | MiniMax-M2.7 MiniMax-M2.7 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax Coding Plan (minimax.io) minimax-coding-plan | MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax Coding Plan (minimaxi.com) minimax-cn-coding-plan | MiniMax-M2.1 MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 23, 2025 | |
MiniMax Coding Plan (minimaxi.com) minimax-cn-coding-plan | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
MiniMax Coding Plan (minimaxi.com) minimax-cn-coding-plan | MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 13, 2026 | |
MiniMax Coding Plan (minimaxi.com) minimax-cn-coding-plan | MiniMax-M2.7 MiniMax-M2.7 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
MiniMax Coding Plan (minimaxi.com) minimax-cn-coding-plan | MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
Moark moark | GLM-4.7 GLM-4.7 | glm | text | text | 204,800 | 131,072 | $3.5 / 1M | $14 / 1M | Yes | Yes | Dec 22, 2025 | |
Moark moark | MiniMax-M2.1 MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $2.1 / 1M | $8.4 / 1M | Yes | Yes | Dec 23, 2025 | |
NanoGPT nano-gpt | MiniMax M2.5 minimax/minimax-m2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
NanoGPT nano-gpt | MiniMax M2.7 minimax/minimax-m2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
NovitaAI novita-ai | GLM 4.6 zai-org/glm-4.6 | glm | text | text | 204,800 | 131,072 | $0.55 / 1M | $2.2 / 1M | Yes | Yes | Sep 30, 2025 | |
NovitaAI novita-ai | GLM-4.7 zai-org/glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
NovitaAI novita-ai | GLM-5.1 zai-org/glm-5.1 | glm | text | text | 204,800 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Mar 27, 2026 | |
NovitaAI novita-ai | Minimax M2.1 minimax/minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | No | Yes | Dec 23, 2025 | |
NovitaAI novita-ai | MiniMax M2.5 minimax/minimax-m2.5 | minimax | text | text | 204,800 | 131,100 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
NovitaAI novita-ai | MiniMax M2.5 Highspeed minimax/minimax-m2.5-highspeed | minimax-m2.5 | text | text | 204,800 | 131,100 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Feb 12, 2026 | |
NovitaAI novita-ai | MiniMax M2.7 minimax/minimax-m2.7 | minimax-m2.7 | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
NovitaAI novita-ai | MiniMax-M2 minimax/minimax-m2 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
Nvidia nvidia | GLM-4.7 z-ai/glm4.7 | glm | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
Nvidia nvidia | MiniMax-M2.1 minimaxai/minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 23, 2025 | |
Nvidia nvidia | MiniMax-M2.5 minimaxai/minimax-m2.5 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
Nvidia nvidia | MiniMax-M2.7 minimaxai/minimax-m2.7 | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 11, 2026 | |
Ollama Cloud ollama-cloud | minimax-m2 minimax-m2 | minimax | text | text | 204,800 | 128,000 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | minimax-m2.1 minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | minimax-m2.5 minimax-m2.5 | minimax | text | text | 204,800 | 131,072 | Unknown | Unknown | Yes | Yes | Feb 12, 2026 | |
Ollama Cloud ollama-cloud | minimax-m2.7 minimax-m2.7 | minimax | text | text | 204,800 | 131,072 | Unknown | Unknown | Yes | Yes | Mar 18, 2026 | |
OpenCode Go opencode-go | MiniMax M2.5 minimax-m2.5 | minimax-m2.5 | text | text | 204,800 | 65,536 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenCode Go opencode-go | MiniMax M2.7 minimax-m2.7 | minimax-m2.7 | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenCode Zen opencode | GLM-4.6 glm-4.6 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Sep 30, 2025 | |
OpenCode Zen opencode | GLM-4.7 glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
OpenCode Zen opencode | GLM-4.7 Free glm-4.7-free | glm-free | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
OpenCode Zen opencode | GLM-5 glm-5 | glm | text | text | 204,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
OpenCode Zen opencode | GLM-5 Free glm-5-free | glm-free | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 11, 2026 | |
OpenCode Zen opencode | GLM-5.1 glm-5.1 | glm | text | text | 204,800 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 7, 2026 | |
OpenCode Zen opencode | MiniMax M2.1 minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
OpenCode Zen opencode | MiniMax M2.1 Free minimax-m2.1-free | minimax-free | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 23, 2025 | |
OpenCode Zen opencode | MiniMax M2.5 minimax-m2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenCode Zen opencode | MiniMax M2.5 Free minimax-m2.5-free | minimax-free | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenCode Zen opencode | MiniMax M2.7 minimax-m2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenCode Zen opencode | Nemotron 3 Super Free nemotron-3-super-free | nemotron-free | text | text | 204,800 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 11, 2026 | |
OpenRouter openrouter | GLM-4.7 z-ai/glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
OpenRouter openrouter | MiniMax M2.1 minimax/minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
OpenRouter openrouter | MiniMax M2.5 minimax/minimax-m2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenRouter openrouter | MiniMax M2.5 (free) minimax/minimax-m2.5:free | minimax | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenRouter openrouter | MiniMax M2.7 minimax/minimax-m2.7 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Qiniu qiniu-ai | Minimax/Minimax-M2.1 minimax/minimax-m2.1 | Unknown | text | text | 204,800 | 128,000 | Unknown | Unknown | Yes | Yes | Dec 23, 2025 | |
Qiniu qiniu-ai | Minimax/Minimax-M2.5 minimax/minimax-m2.5 | Unknown | text | text | 204,800 | 128,000 | Unknown | Unknown | Yes | Yes | Feb 12, 2026 | |
Qiniu qiniu-ai | Minimax/Minimax-M2.5 Highspeed minimax/minimax-m2.5-highspeed | Unknown | text | text | 204,800 | 128,000 | Unknown | Unknown | Yes | Yes | Feb 14, 2026 | |
Synthetic synthetic | MiniMax-M2.1 hf:MiniMaxAI/MiniMax-M2.1 | minimax | text | text | 204,800 | 131,072 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Dec 23, 2025 | |
Tencent Coding Plan (China) tencent-coding-plan | MiniMax-M2.5 minimax-m2.5 | minimax | text | text | 204,800 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
Together AI togetherai | MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Vercel AI Gateway vercel | MiniMax M2.1 minimax/minimax-m2.1 | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
Vercel AI Gateway vercel | MiniMax M2.1 Lightning minimax/minimax-m2.1-lightning | minimax | text | text | 204,800 | 131,072 | $0.3 / 1M | $2.4 / 1M | Yes | Yes | Oct 27, 2025 | |
Vercel AI Gateway vercel | MiniMax M2.5 minimax/minimax-m2.5 | minimax | text | text | 204,800 | 131,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 19, 2026 | |
Z.AI Coding Plan zai-coding-plan | GLM-4.7 glm-4.7 | glm | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
Z.AI zai | GLM-4.6 glm-4.6 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Sep 30, 2025 | |
Z.AI zai | GLM-4.7 glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Z.AI zai | GLM-5 glm-5 | glm | text | text | 204,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
ZenMux zenmux | MiniMax M2.5 minimax/minimax-m2.5 | Unknown | text | text | 204,800 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 13, 2026 | |
ZenMux zenmux | MiniMax M2.5 highspeed minimax/minimax-m2.5-lightning | Unknown | text | text | 204,800 | 131,072 | $0.6 / 1M | $4.8 / 1M | Yes | Yes | Feb 13, 2026 | |
ZenMux zenmux | MiniMax M2.7 minimax/minimax-m2.7 | Unknown | text | text | 204,800 | 131,070 | $0.3055 / 1M | $1.2219 / 1M | Yes | Yes | Mar 20, 2026 | |
ZenMux zenmux | MiniMax M2.7 highspeed minimax/minimax-m2.7-highspeed | Unknown | text | text | 204,800 | 131,070 | $0.611 / 1M | $2.4439 / 1M | Yes | Yes | Mar 20, 2026 | |
Zhipu AI Coding Plan zhipuai-coding-plan | GLM-4.7 glm-4.7 | glm | text | text | 204,800 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
Zhipu AI zhipuai | GLM-4.6 glm-4.6 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Sep 30, 2025 | |
Zhipu AI zhipuai | GLM-4.7 glm-4.7 | glm | text | text | 204,800 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Zhipu AI zhipuai | GLM-5 glm-5 | glm | text | text | 204,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
Amazon Bedrock amazon-bedrock | MiniMax M2 minimax.minimax-m2 | minimax | text | text | 204,608 | 128,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
Baseten baseten | MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 204,000 | 204,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
ZenMux zenmux | MiniMax M2 minimax/minimax-m2 | Unknown | text | text | 204,000 | 64,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
ZenMux zenmux | MiniMax M2.1 minimax/minimax-m2.1 | Unknown | text | text | 204,000 | 64,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Cortecs cortecs | GLM-4.7-Flash glm-4.7-flash | glm | text | text | 203,000 | 203,000 | $0.09 / 1M | $0.53 / 1M | Yes | Yes | Aug 8, 2025 | |
NanoGPT nano-gpt | GLM 4.6 TEE TEE/glm-4.6 | glm | text | text | 203,000 | 65,535 | $0.75 / 1M | $2 / 1M | No | No | Sep 30, 2025 | |
NanoGPT nano-gpt | GLM 4.7 Flash TEE TEE/glm-4.7-flash | glm-flash | text | text | 203,000 | 65,535 | $0.15 / 1M | $0.5 / 1M | No | No | Jan 19, 2026 | |
NanoGPT nano-gpt | GLM 5 TEE TEE/glm-5 | glm | text | text | 203,000 | 65,535 | $1.2 / 1M | $3.5 / 1M | No | No | Feb 11, 2026 | |
Fireworks AI fireworks-ai | GLM 5.1 accounts/fireworks/models/glm-5p1 | glm | text | text | 202,800 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 1, 2026 | |
NovitaAI novita-ai | GLM-5 zai-org/glm-5 | glm | text | text | 202,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Vercel AI Gateway vercel | GLM 5 Turbo zai/glm-5-turbo | glm | text | text | 202,800 | 131,100 | $1.2 / 1M | $4 / 1M | Yes | Yes | Mar 17, 2026 | |
Vercel AI Gateway vercel | GLM-5 zai/glm-5 | glm | text | text | 202,800 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 19, 2026 | |
AIHubMix aihubmix | GLM-5 glm-5 | glm | text | text | 202,752 | Unknown | $0.88 / 1M | $2.816 / 1M | Yes | Yes | Feb 11, 2026 | |
Alibaba (China) alibaba-cn | GLM-5 glm-5 | glm | text | text | 202,752 | 16,384 | $0.86 / 1M | $3.15 / 1M | Yes | Yes | Feb 11, 2026 | |
Alibaba (China) alibaba-cn | GLM-5.1 glm-5.1 | glm | text | text | 202,752 | 128,000 | $0.87 / 1M | $3.48 / 1M | Yes | Yes | Apr 14, 2026 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | GLM-4.7 glm-4.7 | glm | text | text | 202,752 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | GLM-5 glm-5 | glm | text | text | 202,752 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 11, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | GLM-4.7 glm-4.7 | glm | text | text | 202,752 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 22, 2025 | |
Alibaba Coding Plan alibaba-coding-plan | GLM-5 glm-5 | glm | text | text | 202,752 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 11, 2026 | |
Amazon Bedrock amazon-bedrock | GLM-5 zai.glm-5 | glm | text | text | 202,752 | 101,376 | $1 / 1M | $3.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Baseten baseten | GLM-5 zai-org/GLM-5 | glm | text | text | 202,752 | 131,072 | $0.95 / 1M | $3.15 / 1M | Yes | Yes | Feb 12, 2026 | |
Chutes chutes | GLM 4.6 FP8 zai-org/GLM-4.6-FP8 | Unknown | text | text | 202,752 | 65,535 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | GLM 4.6 TEE zai-org/GLM-4.6-TEE | glm | text | text | 202,752 | 65,536 | $0.4 / 1M | $1.7 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | GLM 4.7 Flash zai-org/GLM-4.7-Flash | Unknown | text | text | 202,752 | 65,535 | $0.06 / 1M | $0.35 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | GLM 4.7 FP8 zai-org/GLM-4.7-FP8 | Unknown | text | text | 202,752 | 65,535 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | GLM 4.7 TEE zai-org/GLM-4.7-TEE | glm | text | text | 202,752 | 65,535 | $0.4 / 1M | $1.5 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | GLM 5 TEE zai-org/GLM-5-TEE | glm | text | text | 202,752 | 65,535 | $0.95 / 1M | $3.15 / 1M | Yes | Yes | Feb 14, 2026 | |
Chutes chutes | GLM 5 Turbo zai-org/GLM-5-Turbo | glm | text | text | 202,752 | 65,535 | $0.49 / 1M | $1.96 / 1M | Yes | Yes | Mar 11, 2026 | |
Chutes chutes | GLM 5.1 TEE zai-org/GLM-5.1-TEE | glm | text | text | 202,752 | 65,535 | $0.95 / 1M | $3.15 / 1M | Yes | Yes | Apr 8, 2026 | |
Cortecs cortecs | GLM 5 glm-5 | glm | text | text | 202,752 | 202,752 | $1.08 / 1M | $3.44 / 1M | Yes | Yes | Feb 11, 2026 | |
Cortecs cortecs | MiniMax-M2.7 minimax-M2.7 | minimax | text | text | 202,752 | 131,072 | $0.47 / 1M | $1.4 / 1M | Yes | Yes | Mar 18, 2026 | |
Deep Infra deepinfra | GLM-4.7 zai-org/GLM-4.7 | glm | text | text | 202,752 | 16,384 | $0.43 / 1M | $1.75 / 1M | Yes | Yes | Dec 22, 2025 | |
Deep Infra deepinfra | GLM-4.7-Flash zai-org/GLM-4.7-Flash | glm-flash | text | text | 202,752 | 16,384 | $0.06 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Deep Infra deepinfra | GLM-5 zai-org/GLM-5 | glm | text | text | 202,752 | 16,384 | $0.8 / 1M | $2.56 / 1M | Yes | Yes | Feb 12, 2026 | |
Deep Infra deepinfra | GLM-5.1 zai-org/GLM-5.1 | glm | text | text | 202,752 | 16,384 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 7, 2026 | |
DigitalOcean digitalocean | GLM 5 glm-5 | glm | text | text | 202,752 | 128,000 | $1 / 1M | $3.2 / 1M | Yes | Yes | Apr 16, 2026 | |
Fireworks AI fireworks-ai | GLM 5 accounts/fireworks/models/glm-5 | glm | text | text | 202,752 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
Friendli friendli | GLM-5 zai-org/GLM-5 | glm | text | text | 202,752 | 202,752 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Friendli friendli | GLM-5.1 zai-org/GLM-5.1 | glm | text | text | 202,752 | 202,752 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 7, 2026 | |
Hugging Face huggingface | GLM-5 zai-org/GLM-5 | glm | text | text | 202,752 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
Hugging Face huggingface | GLM-5.1 zai-org/GLM-5.1 | glm | text | text | 202,752 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Apr 3, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 4.7 z-ai/glm-4.7 | Unknown | text | text | 202,752 | 65,535 | $0.38 / 1M | $1.98 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash | Unknown | text | text | 202,752 | 40,551 | $0.06 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 5 z-ai/glm-5 | Unknown | text | text | 202,752 | 131,072 | $0.72 / 1M | $2.3 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 5 Turbo z-ai/glm-5-turbo | Unknown | text | text | 202,752 | 131,072 | $1.2 / 1M | $4 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 5.1 z-ai/glm-5.1 | Unknown | text | text | 202,752 | 131,072 | $1.26 / 1M | $3.96 / 1M | Yes | Yes | Mar 27, 2026 | |
Meganova meganova | GLM-4.6 zai-org/GLM-4.6 | glm | text | text | 202,752 | 131,072 | $0.45 / 1M | $1.9 / 1M | Yes | Yes | Sep 30, 2025 | |
Meganova meganova | GLM-4.7 zai-org/GLM-4.7 | glm | text | text | 202,752 | 131,072 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Dec 22, 2025 | |
Meganova meganova | GLM-5 zai-org/GLM-5 | glm | text | text | 202,752 | 131,072 | $0.8 / 1M | $2.56 / 1M | Yes | Yes | Feb 11, 2026 | |
ModelScope modelscope | GLM-4.6 ZhipuAI/GLM-4.6 | glm | text | text | 202,752 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 30, 2025 | |
Nvidia nvidia | GLM5 z-ai/glm5 | glm | text | text | 202,752 | 131,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
Ollama Cloud ollama-cloud | glm-4.6 glm-4.6 | glm | text | text | 202,752 | 131,072 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | glm-4.7 glm-4.7 | glm | text | text | 202,752 | 131,072 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | glm-5 glm-5 | glm | text | text | 202,752 | 131,072 | Unknown | Unknown | Yes | Yes | Feb 11, 2026 | |
Ollama Cloud ollama-cloud | glm-5.1 glm-5.1 | glm | text | text | 202,752 | 131,072 | Unknown | Unknown | Yes | Yes | Apr 7, 2026 | |
OpenCode Go opencode-go | GLM-5 glm-5 | glm | text | text | 202,752 | 32,768 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
OpenCode Go opencode-go | GLM-5.1 glm-5.1 | glm | text | text | 202,752 | 32,768 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 7, 2026 | |
OpenRouter openrouter | GLM-5 z-ai/glm-5 | glm | text | text | 202,752 | 131,000 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenRouter openrouter | GLM-5-Turbo z-ai/glm-5-turbo | glm | text | text | 202,752 | 131,072 | $0.96 / 1M | $3.2 / 1M | Yes | Yes | Mar 16, 2026 | |
OpenRouter openrouter | GLM-5.1 z-ai/glm-5.1 | glm | text | text | 202,752 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 7, 2026 | |
Tencent Coding Plan (China) tencent-coding-plan | GLM-5 glm-5 | glm | text | text | 202,752 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 11, 2026 | |
Together AI togetherai | GLM-5.1 zai-org/GLM-5.1 | glm | text | text | 202,752 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 7, 2026 | |
Together AI togetherai | MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 | minimax | text | text | 202,752 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Vercel AI Gateway vercel | GLM 4.7 zai/glm-4.7 | glm | text | text | 202,752 | 120,000 | $0.43 / 1M | $1.75 / 1M | Yes | Yes | Dec 22, 2025 | |
Vertex google-vertex | GLM-5 zai-org/glm-5-maas | glm | text | text | 202,752 | 131,072 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 11, 2026 | |
Vivgrid vivgrid | GLM-5 glm-5 | glm | text | text | 202,752 | 131,000 | $1 / 1M | $3.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Wafer wafer.ai | GLM-5.1 GLM-5.1 | glm | text | text | 202,752 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 7, 2026 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted GLM-4.5-Air-Derestricted | Unknown | text | text | 202,600 | 98,304 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 28, 2025 | |
HPC-AI hpc-ai | GLM 5.1 zai-org/glm-5.1 | glm | text | text | 202,000 | 202,000 | $0.66 / 1M | $2 / 1M | Yes | Yes | Apr 8, 2026 | |
302.AI 302ai | glm-4.6 glm-4.6 | Unknown | text | text | 200,000 | 131,072 | $0.286 / 1M | $1.142 / 1M | Yes | Yes | Sep 30, 2025 | |
302.AI 302ai | glm-4.7 glm-4.7 | Unknown | text | text | 200,000 | 131,072 | $0.286 / 1M | $1.142 / 1M | Yes | Yes | Dec 22, 2025 | |
302.AI 302ai | glm-4.7-flashx glm-4.7-flashx | Unknown | text | text | 200,000 | 131,072 | $0.0715 / 1M | $0.429 / 1M | Yes | Yes | Jan 20, 2026 | |
302.AI 302ai | glm-5 glm-5 | Unknown | text | text | 200,000 | 131,072 | $0.6 / 1M | $2.6 / 1M | Yes | Yes | Feb 12, 2026 | |
302.AI 302ai | glm-5-turbo glm-5-turbo | Unknown | text | text | 200,000 | 131,072 | $0.72 / 1M | $3.2 / 1M | Yes | Yes | Mar 16, 2026 | |
302.AI 302ai | glm-5.1 glm-5.1 | Unknown | text | text | 200,000 | 131,072 | $0.86 / 1M | $3.5 / 1M | Yes | Yes | Apr 10, 2026 | |
302.AI 302ai | glm-for-coding glm-for-coding | Unknown | text | text | 200,000 | 131,072 | $0.086 / 1M | $0.343 / 1M | Yes | Yes | Sep 30, 2025 | |
Abacus abacus | o3-mini o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
AIHubMix aihubmix | Coding-GLM-5.1 coding-glm-5.1 | glm | text | text | 200,000 | 128,000 | $0.06 / 1M | $0.22 / 1M | Yes | Yes | Apr 11, 2026 | |
AIHubMix aihubmix | GLM-5.1 glm-5.1 | glm | text | text | 200,000 | 128,000 | $0.845 / 1M | $3.38 / 1M | Yes | Yes | Mar 27, 2026 | |
AIHubMix aihubmix | MiniMax-M2.7 minimax-m2.7 | minimax | text | text | 200,000 | 128,000 | $0.2958 / 1M | $1.1832 / 1M | Yes | Yes | Mar 18, 2026 | |
Amazon Bedrock amazon-bedrock | GLM-4.7-Flash zai.glm-4.7-flash | glm-flash | text | text | 200,000 | 131,072 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Azure azure | Codex Mini codex-mini | gpt-codex-mini | text | text | 200,000 | 100,000 | $1.5 / 1M | $6 / 1M | Yes | Yes | May 16, 2025 | |
Azure Cognitive Services azure-cognitive-services | Codex Mini codex-mini | gpt-codex-mini | text | text | 200,000 | 100,000 | $1.5 / 1M | $6 / 1M | Yes | Yes | May 16, 2025 | |
Azure Cognitive Services azure-cognitive-services | o3-mini o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
Azure azure | o3-mini o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
Baseten baseten | GLM 4.6 zai-org/GLM-4.6 | glm | text | text | 200,000 | 200,000 | $0.6 / 1M | $2.2 / 1M | No | Yes | Sep 16, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | o3-mini openai/o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
DigitalOcean digitalocean | o3-mini openai-o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
DInference dinference | GLM-4.7 glm-4.7 | glm | text | text | 200,000 | 128,000 | $0.45 / 1M | $1.65 / 1M | Yes | Yes | Dec 1, 2025 | |
DInference dinference | GLM-5 glm-5 | glm | text | text | 200,000 | 128,000 | $0.75 / 1M | $2.4 / 1M | Yes | Yes | Feb 1, 2026 | |
Fireworks AI fireworks-ai | MiniMax-M2.1 accounts/fireworks/models/minimax-m2p1 | minimax | text | text | 200,000 | 200,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
GitHub Models github-models | OpenAI o3-mini openai/o3-mini | o-mini | text | text | 200,000 | 100,000 | $0 / 1M | $0 / 1M | Yes | No | Jan 31, 2025 | |
Helicone helicone | OpenAI o3 Mini o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | No | Yes | Oct 1, 2023 | |
Helicone helicone | OpenAI: o1 o1 | o | text | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | No | No | Jan 1, 2025 | |
Helicone helicone | Perplexity Sonar Pro sonar-pro | sonar-pro | text | text | 200,000 | 4,096 | $3 / 1M | $15 / 1M | No | No | Jan 27, 2025 | |
Hugging Face huggingface | GLM-4.7-Flash zai-org/GLM-4.7-Flash | glm | text | text | 200,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 8, 2025 | |
iFlow iflowcn | GLM-4.6 glm-4.6 | glm | text | text | 200,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 13, 2025 | |
IO.NET io-net | GLM 4.6 zai-org/GLM-4.6 | glm | text | text | 200,000 | 4,096 | $0.4 / 1M | $1.75 / 1M | No | Yes | Nov 15, 2024 | |
Jiekou.AI jiekou | GLM-4.7-Flash zai-org/glm-4.7-flash | glm | text | text | 200,000 | 128,000 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 1, 2026 | |
LLM Gateway llmgateway | Claude 3.5 Haiku claude-3-5-haiku | claude | text | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
LLM Gateway llmgateway | Claude 3.7 Sonnet claude-3-7-sonnet | claude | text | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 24, 2025 | |
LLM Gateway llmgateway | GLM-4.7-Flash glm-4.7-flash | glm-flash | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 19, 2026 | |
LLM Gateway llmgateway | GLM-4.7-FlashX glm-4.7-flashx | glm-flash | text | text | 200,000 | 131,072 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
LLM Gateway llmgateway | GLM-5.1 glm-5.1 | glm | text | text | 200,000 | 131,072 | $6 / 1M | $24 / 1M | Yes | Yes | Mar 27, 2026 | |
LLM Gateway llmgateway | o3-mini o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
NanoGPT nano-gpt | Azure o1 azure-o1 | Unknown | text | text | 200,000 | 100,000 | $14.994 / 1M | $59.993 / 1M | No | No | Dec 17, 2024 | |
NanoGPT nano-gpt | Azure o3-mini azure-o3-mini | Unknown | text | text | 200,000 | 65,536 | $1.088 / 1M | $4.3996 / 1M | No | No | Jan 31, 2025 | |
NanoGPT nano-gpt | GLM 4.6 z-ai/glm-4.6 | glm | text | text | 200,000 | 65,535 | $0.4 / 1M | $1.5 / 1M | Yes | Yes | Sep 30, 2025 | |
NanoGPT nano-gpt | GLM 4.6 Thinking z-ai/glm-4.6:thinking | glm | text | text | 200,000 | 65,535 | $0.4 / 1M | $1.5 / 1M | Yes | Yes | Sep 29, 2025 | |
NanoGPT nano-gpt | GLM 4.7 zai-org/glm-4.7 | glm | text | text | 200,000 | 128,000 | $0.15 / 1M | $0.8 / 1M | Yes | Yes | Jan 29, 2026 | |
NanoGPT nano-gpt | GLM 4.7 Flash zai-org/glm-4.7-flash | glm-flash | text | text | 200,000 | 128,000 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
NanoGPT nano-gpt | GLM 5 zai-org/glm-5 | glm | text | text | 200,000 | 128,000 | $0.3 / 1M | $2.55 / 1M | Yes | Yes | Feb 11, 2026 | |
NanoGPT nano-gpt | GLM 5 Thinking zai-org/glm-5:thinking | glm | text | text | 200,000 | 128,000 | $0.3 / 1M | $2.55 / 1M | Yes | Yes | Feb 11, 2026 | |
NanoGPT nano-gpt | GLM 5.1 zai-org/glm-5.1 | glm | text | text | 200,000 | 131,072 | $0.3 / 1M | $2.55 / 1M | Yes | Yes | Mar 27, 2026 | |
NanoGPT nano-gpt | GLM 5.1 Thinking zai-org/glm-5.1:thinking | glm | text | text | 200,000 | 131,072 | $0.3 / 1M | $2.55 / 1M | Yes | Yes | Mar 27, 2026 | |
NanoGPT nano-gpt | MiniMax M2 MiniMax-M2 | Unknown | text | text | 200,000 | 131,072 | $0.17 / 1M | $1.53 / 1M | Yes | No | Oct 25, 2025 | |
NanoGPT nano-gpt | MiniMax M2.1 minimax/minimax-m2.1 | minimax | text | text | 200,000 | 131,072 | $0.33 / 1M | $1.32 / 1M | Yes | Yes | Dec 19, 2025 | |
NanoGPT nano-gpt | MiniMax M2.1 TEE TEE/minimax-m2.1 | minimax | text | text | 200,000 | 131,072 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
NanoGPT nano-gpt | OpenAI o1 openai/o1 | o | text | text | 200,000 | 100,000 | $14.994 / 1M | $59.993 / 1M | Yes | No | Dec 17, 2024 | |
NanoGPT nano-gpt | OpenAI o3 openai/o3 | o | text | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | No | No | Apr 16, 2025 | |
NanoGPT nano-gpt | OpenAI o3 Deep Research openai/o3-deep-research | o | text | text | 200,000 | 100,000 | $9.996 / 1M | $19.992 / 1M | Yes | No | Apr 16, 2025 | |
NanoGPT nano-gpt | OpenAI o3-mini openai/o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 31, 2025 | |
NanoGPT nano-gpt | OpenAI o3-mini (High) openai/o3-mini-high | o-mini | text | text | 200,000 | 100,000 | $0.64 / 1M | $2.588 / 1M | Yes | Yes | Jan 31, 2025 | |
NanoGPT nano-gpt | OpenAI o3-mini (Low) openai/o3-mini-low | o-mini | text | text | 200,000 | 100,000 | $9.996 / 1M | $19.992 / 1M | Yes | Yes | Jan 31, 2025 | |
NanoGPT nano-gpt | OpenAI o3-pro (2025-06-10) openai/o3-pro-2025-06-10 | o-pro | text | text | 200,000 | 100,000 | $9.996 / 1M | $19.992 / 1M | Yes | Yes | Jun 10, 2025 | |
NanoGPT nano-gpt | OpenAI o4-mini openai/o4-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
NanoGPT nano-gpt | OpenAI o4-mini Deep Research openai/o4-mini-deep-research | o-mini | text | text | 200,000 | 100,000 | $9.996 / 1M | $19.992 / 1M | Yes | No | Apr 16, 2025 | |
NanoGPT nano-gpt | OpenAI o4-mini high openai/o4-mini-high | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
NanoGPT nano-gpt | Perplexity Pro sonar-pro | Unknown | text | text | 200,000 | 128,000 | $2.992 / 1M | $14.994 / 1M | No | No | Feb 19, 2025 | |
NanoGPT nano-gpt | v0 1.0 MD v0-1.0-md | Unknown | text | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | No | No | Jul 4, 2025 | |
NanoGPT nano-gpt | v0 1.5 MD v0-1.5-md | Unknown | text | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | No | No | Jul 4, 2025 | |
NanoGPT nano-gpt | Yi Medium 200k yi-medium-200k | Unknown | text | text | 200,000 | 4,096 | $2.499 / 1M | $2.499 / 1M | No | No | Mar 1, 2024 | |
Nebius Token Factory nebius | GLM-5 zai-org/GLM-5 | Unknown | text | text | 200,000 | 16,384 | $1 / 1M | $3.2 / 1M | Yes | Yes | Mar 10, 2026 | |
NovitaAI novita-ai | GLM-4.7-Flash zai-org/glm-4.7-flash | glm | text | text | 200,000 | 128,000 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
OpenAI openai | o3-mini o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
OpenCode Zen opencode | Big Pickle big-pickle | big-pickle | text | text | 200,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Oct 17, 2025 | |
OpenRouter openrouter | GLM 4.6 z-ai/glm-4.6 | glm | text | text | 200,000 | 128,000 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Sep 30, 2025 | |
OpenRouter openrouter | GLM 4.6 (exacto) z-ai/glm-4.6:exacto | glm | text | text | 200,000 | 128,000 | $0.6 / 1M | $1.9 / 1M | Yes | Yes | Sep 30, 2025 | |
OpenRouter openrouter | GLM-4.7-Flash z-ai/glm-4.7-flash | glm | text | text | 200,000 | 65,535 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
OpenRouter openrouter | Pareto Code Router openrouter/pareto-code | Unknown | text | text | 200,000 | 200,000 | Unknown | Unknown | Yes | Yes | Apr 21, 2026 | |
Poe poe | glm-4.7-flash novita/glm-4.7-flash | Unknown | text | text | 200,000 | 65,500 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Poe poe | o3-deep-research openai/o3-deep-research | o | text | text | 200,000 | 100,000 | $9 / 1M | $36 / 1M | Yes | Yes | Jun 27, 2025 | |
Poe poe | o4-mini-deep-research openai/o4-mini-deep-research | o-mini | text | text | 200,000 | 100,000 | $1.8 / 1M | $7.2 / 1M | Yes | Yes | Jun 27, 2025 | |
Qiniu qiniu-ai | Minimax/Minimax-M2 minimax/minimax-m2 | Unknown | text | text | 200,000 | 128,000 | Unknown | Unknown | Yes | Yes | Oct 28, 2025 | |
Qiniu qiniu-ai | Z-AI/GLM 4.6 z-ai/glm-4.6 | Unknown | text | text | 200,000 | 200,000 | Unknown | Unknown | No | Yes | Oct 11, 2025 | |
Qiniu qiniu-ai | Z-Ai/GLM 4.7 z-ai/glm-4.7 | Unknown | text | text | 200,000 | 200,000 | Unknown | Unknown | Yes | Yes | Dec 23, 2025 | |
Qiniu qiniu-ai | Z-Ai/GLM 5 z-ai/glm-5 | Unknown | text | text | 200,000 | 128,000 | Unknown | Unknown | Yes | Yes | Feb 12, 2026 | |
Synthetic synthetic | GLM 4.6 hf:zai-org/GLM-4.6 | glm | text | text | 200,000 | 64,000 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Sep 30, 2025 | |
Synthetic synthetic | GLM 4.7 hf:zai-org/GLM-4.7 | glm | text | text | 200,000 | 64,000 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Dec 22, 2025 | |
Venice AI venice | GLM 4.7 Flash Heretic olafangensan-glm-4.7-flash-heretic | glm-flash | text | text | 200,000 | 24,000 | $0.14 / 1M | $0.8 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | GLM 5 Turbo z-ai-glm-5-turbo | glm | text | text | 200,000 | 32,768 | $1.2 / 1M | $4 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | GLM 5.1 zai-org-glm-5-1 | glm | text | text | 200,000 | 24,000 | $1.75 / 1M | $5.5 / 1M | Yes | Yes | Apr 8, 2026 | |
Vercel AI Gateway vercel | GLM 4.6 zai/glm-4.6 | glm | text | text | 200,000 | 96,000 | $0.45 / 1M | $1.8 / 1M | Yes | Yes | Sep 30, 2025 | |
Vercel AI Gateway vercel | GLM 4.7 Flash zai/glm-4.7-flash | glm | text | text | 200,000 | 131,000 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Mar 13, 2026 | |
Vercel AI Gateway vercel | GLM 4.7 FlashX zai/glm-4.7-flashx | glm-flash | text | text | 200,000 | 128,000 | $0.06 / 1M | $0.4 / 1M | Yes | Yes | Jan 1, 2025 | |
Vercel AI Gateway vercel | o3-mini openai/o3-mini | o-mini | text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Jan 29, 2025 | |
Vultr vultr | GLM 5 FP8 GLM-5-FP8 | glm | text | text | 200,000 | 131,072 | $0.85 / 1M | $3.1 / 1M | No | Yes | Feb 11, 2026 | |
Weights & Biases wandb | GLM 5 zai-org/GLM-5-FP8 | glm | text | text | 200,000 | 200,000 | $1 / 1M | $3.2 / 1M | No | Yes | Mar 12, 2026 | |
Weights & Biases wandb | GLM-5.1 zai-org/GLM-5.1 | glm | text | text | 200,000 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Mar 27, 2026 | |
Z.AI Coding Plan zai-coding-plan | GLM-5-Turbo glm-5-turbo | glm | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 16, 2026 | |
Z.AI Coding Plan zai-coding-plan | GLM-5.1 glm-5.1 | glm | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 27, 2026 | |
Z.AI zai | GLM-4.7-Flash glm-4.7-flash | glm-flash | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 19, 2026 | |
Z.AI zai | GLM-4.7-FlashX glm-4.7-flashx | glm-flash | text | text | 200,000 | 131,072 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Z.AI zai | GLM-5-Turbo glm-5-turbo | glm | text | text | 200,000 | 131,072 | $1.2 / 1M | $4 / 1M | Yes | Yes | Mar 16, 2026 | |
Z.AI zai | GLM-5.1 glm-5.1 | glm | text | text | 200,000 | 131,072 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Mar 27, 2026 | |
ZenMux zenmux | GLM 4.6 z-ai/glm-4.6 | Unknown | text | text | 200,000 | 64,000 | $0.35 / 1M | $1.54 / 1M | Yes | Yes | Sep 30, 2025 | |
ZenMux zenmux | GLM 4.7 z-ai/glm-4.7 | Unknown | text | text | 200,000 | 64,000 | $0.28 / 1M | $1.14 / 1M | Yes | Yes | Dec 23, 2025 | |
ZenMux zenmux | GLM 4.7 Flash (Free) z-ai/glm-4.7-flash-free | Unknown | text | text | 200,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 19, 2026 | |
ZenMux zenmux | GLM 4.7 FlashX z-ai/glm-4.7-flashx | Unknown | text | text | 200,000 | 64,000 | $0.07 / 1M | $0.42 / 1M | Yes | Yes | Jan 19, 2026 | |
ZenMux zenmux | GLM 5 z-ai/glm-5 | Unknown | text | text | 200,000 | 128,000 | $0.58 / 1M | $2.6 / 1M | Yes | Yes | Feb 12, 2026 | |
ZenMux zenmux | GLM 5 Turbo z-ai/glm-5-turbo | Unknown | text | text | 200,000 | 128,000 | $0.88 / 1M | $3.48 / 1M | Yes | Yes | Mar 20, 2026 | |
ZenMux zenmux | GLM-5.1 z-ai/glm-5.1 | Unknown | text | text | 200,000 | 131,072 | $0.8781 / 1M | $3.5126 / 1M | Yes | Yes | Apr 3, 2026 | |
Zhipu AI Coding Plan zhipuai-coding-plan | GLM-5-Turbo glm-5-turbo | glm | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 16, 2026 | |
Zhipu AI Coding Plan zhipuai-coding-plan | GLM-5.1 glm-5.1 | glm | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 27, 2026 | |
Zhipu AI zhipuai | GLM-4.7-Flash glm-4.7-flash | glm-flash | text | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 19, 2026 | |
Zhipu AI zhipuai | GLM-4.7-FlashX glm-4.7-flashx | glm-flash | text | text | 200,000 | 131,072 | $0.07 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Zhipu AI zhipuai | GLM-5.1 glm-5.1 | glm | text | text | 200,000 | 131,072 | $6 / 1M | $24 / 1M | Yes | Yes | Mar 27, 2026 | |
Cortecs cortecs | GLM 4.7 glm-4.7 | glm | text | text | 198,000 | 198,000 | $0.45 / 1M | $2.23 / 1M | Yes | Yes | Dec 22, 2025 | |
Fireworks AI fireworks-ai | GLM 4.7 accounts/fireworks/models/glm-4p7 | glm | text | text | 198,000 | 198,000 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Dec 22, 2025 | |
Firmware firmware | Z.AI GLM-5.1 zai-glm-5-1 | glm | text | text | 198,000 | 8,192 | $1.4 / 1M | $4.4 / 1M | No | Yes | Feb 22, 2025 | |
Venice AI venice | GLM 4.6 zai-org-glm-4.6 | glm | text | text | 198,000 | 16,384 | $0.85 / 1M | $2.75 / 1M | Yes | Yes | Apr 4, 2026 | |
Venice AI venice | GLM 4.7 zai-org-glm-4.7 | glm | text | text | 198,000 | 16,384 | $0.55 / 1M | $2.65 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | GLM 5 zai-org-glm-5 | glm | text | text | 198,000 | 32,000 | $1 / 1M | $3.2 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | MiniMax M2.5 minimax-m25 | minimax | text | text | 198,000 | 32,768 | $0.34 / 1M | $1.19 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | MiniMax M2.7 minimax-m27 | minimax | text | text | 198,000 | 32,768 | $0.375 / 1M | $1.5 / 1M | Yes | Yes | Apr 12, 2026 | |
SiliconFlow (China) siliconflow-cn | Pro/MiniMaxAI/MiniMax-M2.1 Pro/MiniMaxAI/MiniMax-M2.1 | minimax | text | text | 197,000 | 131,000 | $0.3 / 1M | $1.2 / 1M | No | Yes | Dec 23, 2025 | |
SiliconFlow siliconflow | MiniMaxAI/MiniMax-M2.1 MiniMaxAI/MiniMax-M2.1 | minimax | text | text | 197,000 | 131,000 | $0.3 / 1M | $1.2 / 1M | No | Yes | Dec 23, 2025 | |
SiliconFlow siliconflow | MiniMaxAI/MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 197,000 | 131,000 | $0.3 / 1M | $1.2 / 1M | No | Yes | Feb 15, 2026 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 196,608 | 24,576 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | MiniMax-M2.5 MiniMax-M2.5 | minimax | text | text | 196,608 | 24,576 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 12, 2026 | |
Amazon Bedrock amazon-bedrock | MiniMax M2.5 minimax.minimax-m2.5 | minimax | text | text | 196,608 | 98,304 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Chutes chutes | MiniMax M2.1 TEE MiniMaxAI/MiniMax-M2.1-TEE | minimax | text | text | 196,608 | 65,536 | $0.27 / 1M | $1.12 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | MiniMax M2.5 TEE MiniMaxAI/MiniMax-M2.5-TEE | minimax | text | text | 196,608 | 65,536 | $0.3 / 1M | $1.1 / 1M | Yes | Yes | Feb 15, 2026 | |
Cortecs cortecs | MiniMax-M2.5 minimax-m2.5 | minimax | text | text | 196,608 | 196,608 | $0.32 / 1M | $1.18 / 1M | Yes | Yes | Feb 12, 2026 | |
Deep Infra deepinfra | MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 | Unknown | text | text | 196,608 | 196,608 | $0.28 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
Fireworks AI fireworks-ai | MiniMax-M2.5 accounts/fireworks/models/minimax-m2p5 | minimax | text | text | 196,608 | 196,608 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Fireworks AI fireworks-ai | MiniMax-M2.7 accounts/fireworks/models/minimax-m2p7 | minimax | text | text | 196,608 | 196,608 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Apr 12, 2026 | |
Friendli friendli | MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 196,608 | 196,608 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 12, 2026 | |
Kilo Gateway kilo | MiniMax: MiniMax M2 minimax/minimax-m2 | Unknown | text | text | 196,608 | 196,608 | $0.255 / 1M | $1 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | MiniMax: MiniMax M2.1 minimax/minimax-m2.1 | Unknown | text | text | 196,608 | 39,322 | $0.27 / 1M | $0.95 / 1M | Yes | Yes | Dec 23, 2025 | |
Kilo Gateway kilo | MiniMax: MiniMax M2.5 minimax/minimax-m2.5 | Unknown | text | text | 196,608 | 196,608 | $0.25 / 1M | $1.2 / 1M | Yes | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | MiniMax M2.1 Lightning minimax-m2.1-lightning | minimax | text | text | 196,608 | 131,072 | $0.12 / 1M | $0.48 / 1M | Yes | No | Dec 23, 2025 | |
LLM Gateway llmgateway | MiniMax-M2 minimax-m2 | minimax | text | text | 196,608 | 128,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
Meganova meganova | MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 | minimax | text | text | 196,608 | 131,072 | $0.28 / 1M | $1.2 / 1M | Yes | Yes | Dec 23, 2025 | |
MiniMax (minimax.io) minimax | MiniMax-M2 MiniMax-M2 | minimax | text | text | 196,608 | 128,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
MiniMax (minimaxi.com) minimax-cn | MiniMax-M2 MiniMax-M2 | minimax | text | text | 196,608 | 128,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Oct 27, 2025 | |
MiniMax Coding Plan (minimax.io) minimax-coding-plan | MiniMax-M2 MiniMax-M2 | minimax | text | text | 196,608 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Oct 27, 2025 | |
MiniMax Coding Plan (minimaxi.com) minimax-cn-coding-plan | MiniMax-M2 MiniMax-M2 | minimax | text | text | 196,608 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Oct 27, 2025 | |
Synthetic synthetic | GLM 5.1 hf:zai-org/GLM-5.1 | glm | text | text | 196,608 | 65,536 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 12, 2026 | |
Synthetic synthetic | GLM-4.7-Flash hf:zai-org/GLM-4.7-Flash | glm | text | text | 196,608 | 65,536 | $0.06 / 1M | $0.4 / 1M | Yes | Yes | Jan 18, 2026 | |
Synthetic synthetic | GLM-5 hf:zai-org/GLM-5 | glm | text | text | 196,608 | 65,536 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 8, 2026 | |
Synthetic synthetic | MiniMax-M2 hf:MiniMaxAI/MiniMax-M2 | minimax | text | text | 196,608 | 131,000 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Oct 27, 2025 | |
Weights & Biases wandb | MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 196,608 | 196,608 | $0.3 / 1M | $1.2 / 1M | No | Yes | Mar 12, 2026 | |
OpenRouter openrouter | MiniMax M2 minimax/minimax-m2 | minimax | text | text | 196,600 | 118,000 | $0.28 / 1M | $1.15 / 1M | Yes | Yes | Oct 23, 2025 | |
CloudFerro Sherlock cloudferro-sherlock | MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 196,000 | 196,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 5, 2026 | |
Cortecs cortecs | MiniMax-M2.1 minimax-m2.1 | minimax | text | text | 196,000 | 196,000 | $0.34 / 1M | $1.34 / 1M | Yes | Yes | Dec 23, 2025 | |
Vultr vultr | MiniMax M2.5 MiniMax-M2.5 | minimax | text | text | 194,000 | 4,096 | $0.3 / 1M | $1.2 / 1M | No | Yes | Feb 11, 2025 | |
Firmware firmware | MiniMax-M2.5 minimax-m2-5 | minimax | text | text | 192,000 | 8,192 | $0.3 / 1M | $1.2 / 1M | No | Yes | Feb 22, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/MiniMaxAI/MiniMax-M2.5 Pro/MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 192,000 | 131,000 | $0.3 / 1M | $1.22 / 1M | No | Yes | Feb 13, 2026 | |
Synthetic synthetic | MiniMax-M2.5 hf:MiniMaxAI/MiniMax-M2.5 | minimax | text | text | 191,488 | 65,536 | $0.6 / 1M | $3 / 1M | Yes | Yes | Feb 7, 2026 | |
Regolo AI regolo-ai | MiniMax 2.5 minimax-m2.5 | minimax | text | text | 190,000 | 64,000 | $0.8 / 1M | $3.5 / 1M | Yes | Yes | Mar 10, 2026 | |
Baseten baseten | DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 | deepseek | text | text | 164,000 | 131,000 | $0.77 / 1M | $0.77 / 1M | No | Yes | Mar 24, 2025 | |
Baseten baseten | DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 | deepseek | text | text | 164,000 | 131,000 | $0.5 / 1M | $1.5 / 1M | Yes | Yes | Aug 25, 2025 | |
Meganova meganova | DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $1 / 1M | No | Yes | Aug 25, 2025 | |
Meganova meganova | DeepSeek V3.2 deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 164,000 | 164,000 | $0.26 / 1M | $0.38 / 1M | No | Yes | Dec 3, 2025 | |
Meganova meganova | DeepSeek V3.2 Exp deepseek-ai/DeepSeek-V3.2-Exp | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $0.4 / 1M | No | Yes | Oct 10, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.1 TEE TEE/deepseek-v3.1 | deepseek | text | text | 164,000 | 8,192 | $1 / 1M | $2.5 / 1M | No | No | Aug 21, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.2 TEE TEE/deepseek-v3.2 | deepseek | text | text | 164,000 | 65,536 | $0.5 / 1M | $1 / 1M | No | No | Dec 1, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-R1 deepseek-ai/DeepSeek-R1 | deepseek-thinking | text | text | 164,000 | 164,000 | $0.5 / 1M | $2.18 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-V3 deepseek-ai/DeepSeek-V3 | deepseek | text | text | 164,000 | 164,000 | $0.25 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-V3.1-Terminus deepseek-ai/DeepSeek-V3.1-Terminus | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $0.42 / 1M | Yes | Yes | Dec 3, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/deepseek-ai/DeepSeek-R1 Pro/deepseek-ai/DeepSeek-R1 | deepseek-thinking | text | text | 164,000 | 164,000 | $0.5 / 1M | $2.18 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/deepseek-ai/DeepSeek-V3 Pro/deepseek-ai/DeepSeek-V3 | deepseek | text | text | 164,000 | 164,000 | $0.25 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/deepseek-ai/DeepSeek-V3.1-Terminus Pro/deepseek-ai/DeepSeek-V3.1-Terminus | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Pro/deepseek-ai/DeepSeek-V3.2 Pro/deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $0.42 / 1M | Yes | Yes | Dec 3, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-R1 deepseek-ai/DeepSeek-R1 | deepseek-thinking | text | text | 164,000 | 164,000 | $0.5 / 1M | $2.18 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-V3 deepseek-ai/DeepSeek-V3 | deepseek | text | text | 164,000 | 164,000 | $0.25 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-V3.1-Terminus deepseek-ai/DeepSeek-V3.1-Terminus | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $0.42 / 1M | Yes | Yes | Dec 3, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-V3.2-Exp deepseek-ai/DeepSeek-V3.2-Exp | deepseek | text | text | 164,000 | 164,000 | $0.27 / 1M | $0.41 / 1M | Yes | Yes | Nov 25, 2025 | |
Vercel AI Gateway vercel | DeepSeek V3.2 deepseek/deepseek-v3.2 | deepseek | text | text | 163,842 | 8,000 | $0.27 / 1M | $0.4 / 1M | No | No | Dec 1, 2025 | |
Alibaba (China) alibaba-cn | siliconflow/deepseek-r1-0528 siliconflow/deepseek-r1-0528 | deepseek-thinking | text | text | 163,840 | 32,768 | $0.5 / 1M | $2.18 / 1M | Yes | Yes | Nov 25, 2025 | |
Alibaba (China) alibaba-cn | siliconflow/deepseek-v3-0324 siliconflow/deepseek-v3-0324 | deepseek | text | text | 163,840 | 163,840 | $0.25 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
Alibaba (China) alibaba-cn | siliconflow/deepseek-v3.1-terminus siliconflow/deepseek-v3.1-terminus | deepseek | text | text | 163,840 | 65,536 | $0.27 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
Alibaba (China) alibaba-cn | siliconflow/deepseek-v3.2 siliconflow/deepseek-v3.2 | deepseek | text | text | 163,840 | 65,536 | $0.27 / 1M | $0.42 / 1M | Yes | Yes | Dec 3, 2025 | |
Amazon Bedrock amazon-bedrock | DeepSeek-V3.1 deepseek.v3-v1:0 | deepseek | text | text | 163,840 | 81,920 | $0.58 / 1M | $1.68 / 1M | Yes | Yes | Sep 18, 2025 | |
Amazon Bedrock amazon-bedrock | DeepSeek-V3.2 deepseek.v3.2 | deepseek | text | text | 163,840 | 81,920 | $0.62 / 1M | $1.85 / 1M | Yes | Yes | Feb 6, 2026 | |
Azure Cognitive Services azure-cognitive-services | DeepSeek-R1 deepseek-r1 | deepseek-thinking | text | text | 163,840 | 163,840 | $1.35 / 1M | $5.4 / 1M | Yes | No | Jan 20, 2025 | |
Azure Cognitive Services azure-cognitive-services | DeepSeek-R1-0528 deepseek-r1-0528 | deepseek-thinking | text | text | 163,840 | 163,840 | $1.35 / 1M | $5.4 / 1M | Yes | Yes | May 28, 2025 | |
Azure azure | DeepSeek-R1 deepseek-r1 | deepseek-thinking | text | text | 163,840 | 163,840 | $1.35 / 1M | $5.4 / 1M | Yes | No | Jan 20, 2025 | |
Azure azure | DeepSeek-R1-0528 deepseek-r1-0528 | deepseek-thinking | text | text | 163,840 | 163,840 | $1.35 / 1M | $5.4 / 1M | Yes | Yes | May 28, 2025 | |
Chutes chutes | DeepSeek R1 0528 TEE deepseek-ai/DeepSeek-R1-0528-TEE | deepseek-thinking | text | text | 163,840 | 65,536 | $0.4 / 1M | $1.75 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek R1 TEE deepseek-ai/DeepSeek-R1-TEE | deepseek-thinking | text | text | 163,840 | 163,840 | $0.3 / 1M | $1.2 / 1M | Yes | No | Jan 10, 2026 | |
Chutes chutes | DeepSeek R1T Chimera tngtech/DeepSeek-R1T-Chimera | tngtech | text | text | 163,840 | 163,840 | $0.3 / 1M | $1.2 / 1M | Yes | No | Jan 10, 2026 | |
Chutes chutes | DeepSeek TNG R1T2 Chimera tngtech/DeepSeek-TNG-R1T2-Chimera | tngtech | text | text | 163,840 | 163,840 | $0.25 / 1M | $0.85 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek V3 deepseek-ai/DeepSeek-V3 | deepseek | text | text | 163,840 | 163,840 | $0.3 / 1M | $1.2 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | DeepSeek V3 0324 TEE deepseek-ai/DeepSeek-V3-0324-TEE | deepseek | text | text | 163,840 | 65,536 | $0.19 / 1M | $0.87 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek V3.1 TEE deepseek-ai/DeepSeek-V3.1-TEE | deepseek | text | text | 163,840 | 65,536 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek V3.1 Terminus TEE deepseek-ai/DeepSeek-V3.1-Terminus-TEE | deepseek | text | text | 163,840 | 65,536 | $0.23 / 1M | $0.9 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek V3.2 Speciale TEE deepseek-ai/DeepSeek-V3.2-Speciale-TEE | deepseek | text | text | 163,840 | 65,536 | $0.27 / 1M | $0.41 / 1M | Yes | No | Jan 10, 2026 | |
Chutes chutes | TNG R1T Chimera TEE tngtech/TNG-R1T-Chimera-TEE | tngtech | text | text | 163,840 | 65,536 | $0.25 / 1M | $0.85 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | TNG R1T Chimera Turbo tngtech/TNG-R1T-Chimera-Turbo | Unknown | text | text | 163,840 | 65,536 | $0.22 / 1M | $0.6 / 1M | Yes | Yes | Jan 27, 2026 | |
Deep Infra deepinfra | DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 | Unknown | text | text | 163,840 | 64,000 | $0.5 / 1M | $2.15 / 1M | Yes | No | May 28, 2025 | |
Deep Infra deepinfra | DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 | Unknown | text | text | 163,840 | 64,000 | $0.26 / 1M | $0.38 / 1M | Yes | Yes | Dec 2, 2025 | |
Fireworks AI fireworks-ai | DeepSeek V3.1 accounts/fireworks/models/deepseek-v3p1 | deepseek | text | text | 163,840 | 163,840 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Aug 21, 2025 | |
Helicone helicone | DeepSeek V3.2 deepseek-v3.2 | deepseek | text | text | 163,840 | 65,536 | $0.27 / 1M | $0.41 / 1M | No | Yes | Sep 22, 2025 | |
Hugging Face huggingface | DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 | deepseek-thinking | text | text | 163,840 | 163,840 | $3 / 1M | $5 / 1M | Yes | Yes | May 28, 2025 | |
Hugging Face huggingface | DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 163,840 | 65,536 | $0.28 / 1M | $0.4 / 1M | Yes | Yes | Dec 1, 2025 | |
Jiekou.AI jiekou | DeepSeek R1 0528 deepseek/deepseek-r1-0528 | deepseek-thinking | text | text | 163,840 | 32,768 | $0.7 / 1M | $2.5 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | DeepSeek V3 0324 deepseek/deepseek-v3-0324 | deepseek | text | text | 163,840 | 163,840 | $0.28 / 1M | $1.14 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | DeepSeek V3.1 deepseek/deepseek-v3.1 | deepseek | text | text | 163,840 | 32,768 | $0.27 / 1M | $1 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3 deepseek/deepseek-chat | Unknown | text | text | 163,840 | 163,840 | $0.32 / 1M | $0.89 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | Unknown | text | text | 163,840 | 65,536 | $0.2 / 1M | $0.77 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | Unknown | text | text | 163,840 | 32,768 | $0.21 / 1M | $0.79 / 1M | Yes | Yes | Sep 22, 2025 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2 | Unknown | text | text | 163,840 | 65,536 | $0.26 / 1M | $0.38 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp | Unknown | text | text | 163,840 | 65,536 | $0.27 / 1M | $0.41 / 1M | Yes | Yes | Sep 29, 2025 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale | Unknown | text | text | 163,840 | 163,840 | $0.4 / 1M | $1.2 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | DeepSeek: R1 0528 deepseek/deepseek-r1-0528 | Unknown | text | text | 163,840 | 65,536 | $0.45 / 1M | $2.15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | TNG: DeepSeek R1T2 Chimera tngtech/deepseek-r1t2-chimera | Unknown | text | text | 163,840 | 163,840 | $0.25 / 1M | $0.85 / 1M | Yes | Yes | Jul 8, 2025 | |
Meganova meganova | DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 | deepseek-thinking | text | text | 163,840 | 64,000 | $0.5 / 1M | $2.15 / 1M | Yes | No | May 28, 2025 | |
Meganova meganova | DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 | deepseek | text | text | 163,840 | 163,840 | $0.25 / 1M | $0.88 / 1M | No | Yes | Mar 24, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.2 Exp deepseek-ai/deepseek-v3.2-exp | deepseek | text | text | 163,840 | 65,536 | $0.28 / 1M | $0.42 / 1M | No | No | Sep 29, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.2 Exp Thinking deepseek-ai/deepseek-v3.2-exp-thinking | deepseek-thinking | text | text | 163,840 | 65,536 | $0.28 / 1M | $0.42 / 1M | Yes | No | Sep 29, 2025 | |
NovitaAI novita-ai | DeepSeek R1 0528 deepseek/deepseek-r1-0528 | deepseek-thinking | text | text | 163,840 | 32,768 | $0.7 / 1M | $2.5 / 1M | Yes | Yes | May 28, 2025 | |
NovitaAI novita-ai | DeepSeek V3 0324 deepseek/deepseek-v3-0324 | deepseek | text | text | 163,840 | 163,840 | $0.27 / 1M | $1.12 / 1M | No | Yes | Mar 25, 2025 | |
NovitaAI novita-ai | Deepseek V3.2 deepseek/deepseek-v3.2 | deepseek | text | text | 163,840 | 65,536 | $0.269 / 1M | $0.4 / 1M | Yes | Yes | Dec 1, 2025 | |
NovitaAI novita-ai | Deepseek V3.2 Exp deepseek/deepseek-v3.2-exp | Unknown | text | text | 163,840 | 65,536 | $0.27 / 1M | $0.41 / 1M | Yes | Yes | Sep 29, 2025 | |
Nvidia nvidia | DeepSeek V3.2 deepseek-ai/deepseek-v3.2 | deepseek | text | text | 163,840 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 1, 2025 | |
Ollama Cloud ollama-cloud | cogito-2.1:671b cogito-2.1:671b | cogito | text | text | 163,840 | 32,000 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | deepseek-v3.1:671b deepseek-v3.1:671b | deepseek | text | text | 163,840 | 163,840 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | deepseek-v3.2 deepseek-v3.2 | deepseek | text | text | 163,840 | 65,536 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
OpenRouter openrouter | DeepSeek V3.2 deepseek/deepseek-v3.2 | deepseek | text | text | 163,840 | 65,536 | $0.28 / 1M | $0.4 / 1M | Yes | Yes | Dec 1, 2025 | |
OpenRouter openrouter | DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale | deepseek | text | text | 163,840 | 65,536 | $0.27 / 1M | $0.41 / 1M | Yes | Yes | Dec 1, 2025 | |
OpenRouter openrouter | DeepSeek-V3.1 deepseek/deepseek-chat-v3.1 | deepseek | text | text | 163,840 | 163,840 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Aug 21, 2025 | |
Vercel AI Gateway vercel | DeepSeek V3 0324 deepseek/deepseek-v3 | deepseek | text | text | 163,840 | 16,384 | $0.77 / 1M | $0.77 / 1M | No | Yes | Dec 26, 2024 | |
Vercel AI Gateway vercel | DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp | deepseek | text | text | 163,840 | 163,840 | $0.27 / 1M | $0.4 / 1M | Yes | Yes | Sep 29, 2025 | |
Vercel AI Gateway vercel | DeepSeek-V3.1 deepseek/deepseek-v3.1 | deepseek | text | text | 163,840 | 128,000 | $0.3 / 1M | $1 / 1M | Yes | Yes | Aug 21, 2025 | |
Together AI togetherai | DeepSeek R1 deepseek-ai/DeepSeek-R1 | deepseek-thinking | text | text | 163,839 | 163,839 | $3 / 1M | $7 / 1M | Yes | No | Mar 24, 2025 | |
Baseten baseten | DeepSeek V3.2 deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 163,800 | 131,100 | $0.3 / 1M | $0.45 / 1M | Yes | Yes | Mar 6, 2026 | |
Nebius Token Factory nebius | DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 | Unknown | text | text | 163,000 | 16,384 | $0.3 / 1M | $0.45 / 1M | Yes | Yes | Feb 4, 2026 | |
ZenMux zenmux | DeepSeek-V3.2-Exp deepseek/deepseek-v3.2-exp | Unknown | text | text | 163,000 | 64,000 | $0.22 / 1M | $0.33 / 1M | Yes | Yes | Sep 29, 2025 | |
Synthetic synthetic | DeepSeek V3.2 hf:deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 162,816 | 8,000 | $0.27 / 1M | $0.4 / 1M | Yes | Yes | Dec 1, 2025 | |
Weights & Biases wandb | DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 | deepseek | text | text | 161,000 | 161,000 | $0.55 / 1M | $1.65 / 1M | No | Yes | Mar 12, 2026 | |
Fireworks AI fireworks-ai | DeepSeek V3.2 accounts/fireworks/models/deepseek-v3p2 | deepseek | text | text | 160,000 | 160,000 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Dec 1, 2025 | |
Kilo Gateway kilo | Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | Unknown | text | text | 160,000 | 32,768 | $0.07 / 1M | $0.27 / 1M | No | Yes | Jul 31, 2025 | |
NanoGPT nano-gpt | DeepSeek Prover v2 671B deepseek/deepseek-prover-v2-671b | deepseek | text | text | 160,000 | 16,384 | $1 / 1M | $2.5 / 1M | No | No | Apr 30, 2025 | |
NovitaAI novita-ai | Deepseek Prover V2 671B deepseek/deepseek-prover-v2-671b | Unknown | text | text | 160,000 | 160,000 | $0.7 / 1M | $2.5 / 1M | No | No | Apr 30, 2025 | |
NovitaAI novita-ai | Qwen3 Coder 30b A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | Unknown | text | text | 160,000 | 32,768 | $0.07 / 1M | $0.27 / 1M | No | Yes | Oct 9, 2025 | |
OpenRouter openrouter | Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | qwen | text | text | 160,000 | 65,536 | $0.07 / 1M | $0.27 / 1M | No | Yes | Jul 31, 2025 | |
Qiniu qiniu-ai | Deepseek/Deepseek-Math-V2 deepseek/deepseek-math-v2 | Unknown | text | text | 160,000 | 160,000 | Unknown | Unknown | Yes | No | Dec 4, 2025 | |
Venice AI venice | DeepSeek V3.2 deepseek-v3.2 | deepseek | text | text | 160,000 | 32,768 | $0.33 / 1M | $0.48 / 1M | Yes | Yes | Mar 24, 2026 | |
Vercel AI Gateway vercel | Qwen 3 Coder 30B A3B Instruct alibaba/qwen3-coder-30b-a3b | qwen | text | text | 160,000 | 32,768 | $0.07 / 1M | $0.27 / 1M | Yes | Yes | Apr 1, 2025 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted Iceblink v2 GLM-4.5-Air-Derestricted-Iceblink-v2 | Unknown | text | text | 158,600 | 65,536 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 28, 2025 | |
302.AI 302ai | Qwen-Max-Latest qwen-max-latest | qwen | text | text | 131,072 | 8,192 | $0.343 / 1M | $1.372 / 1M | No | Yes | Jan 25, 2025 | |
Abacus abacus | Qwen3 Max qwen3-max | qwen | text | text | 131,072 | 16,384 | $1.2 / 1M | $6 / 1M | Yes | Yes | May 28, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 deepseek-r1 | deepseek-thinking | text | text | 131,072 | 16,384 | $0.574 / 1M | $2.294 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 0528 deepseek-r1-0528 | deepseek-thinking | text | text | 131,072 | 16,384 | $0.574 / 1M | $2.294 / 1M | Yes | Yes | May 28, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek V3.1 deepseek-v3-1 | deepseek | text | text | 131,072 | 65,536 | $0.574 / 1M | $1.721 / 1M | No | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek V3.2 Exp deepseek-v3-2-exp | deepseek | text | text | 131,072 | 65,536 | $0.287 / 1M | $0.431 / 1M | No | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | Moonshot Kimi K2 Instruct moonshot-kimi-k2-instruct | kimi | text | text | 131,072 | 8,192 | $0.574 / 1M | $2.294 / 1M | No | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen Doc Turbo qwen-doc-turbo | qwen | text | text | 131,072 | 8,192 | $0.087 / 1M | $0.144 / 1M | No | Yes | Jan 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen Max qwen-max | qwen | text | text | 131,072 | 8,192 | $0.345 / 1M | $1.377 / 1M | No | Yes | Jan 25, 2025 | |
Alibaba (China) alibaba-cn | Qwen2.5 14B Instruct qwen2-5-14b-instruct | qwen | text | text | 131,072 | 8,192 | $0.144 / 1M | $0.431 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5 32B Instruct qwen2-5-32b-instruct | qwen | text | text | 131,072 | 8,192 | $0.287 / 1M | $0.861 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5 72B Instruct qwen2-5-72b-instruct | qwen | text | text | 131,072 | 8,192 | $0.574 / 1M | $1.721 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5 7B Instruct qwen2-5-7b-instruct | qwen | text | text | 131,072 | 8,192 | $0.072 / 1M | $0.144 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5-Coder 32B Instruct qwen2-5-coder-32b-instruct | qwen | text | text | 131,072 | 8,192 | $0.287 / 1M | $0.861 / 1M | No | Yes | Nov 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5-Coder 7B Instruct qwen2-5-coder-7b-instruct | qwen | text | text | 131,072 | 8,192 | $0.144 / 1M | $0.287 / 1M | No | Yes | Nov 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen3 14B qwen3-14b | qwen | text | text | 131,072 | 8,192 | $0.144 / 1M | $0.574 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3 235B-A22B qwen3-235b-a22b | qwen | text | text | 131,072 | 16,384 | $0.287 / 1M | $1.147 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3 32B qwen3-32b | qwen | text | text | 131,072 | 16,384 | $0.287 / 1M | $1.147 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3 8B qwen3-8b | qwen | text | text | 131,072 | 8,192 | $0.072 / 1M | $0.287 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking | qwen | text | text | 131,072 | 32,768 | $0.144 / 1M | $1.434 / 1M | Yes | Yes | Sep 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct | qwen | text | text | 131,072 | 32,768 | $0.144 / 1M | $0.574 / 1M | No | Yes | Sep 1, 2025 | |
Alibaba (China) alibaba-cn | QwQ 32B qwq-32b | qwen | text | text | 131,072 | 8,192 | $0.287 / 1M | $0.861 / 1M | Yes | Yes | Dec 1, 2024 | |
Alibaba (China) alibaba-cn | QwQ Plus qwq-plus | qwen | text | text | 131,072 | 8,192 | $0.23 / 1M | $0.574 / 1M | Yes | Yes | Mar 5, 2025 | |
Alibaba alibaba | Qwen2.5 14B Instruct qwen2-5-14b-instruct | qwen | text | text | 131,072 | 8,192 | $0.35 / 1M | $1.4 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba alibaba | Qwen2.5 32B Instruct qwen2-5-32b-instruct | qwen | text | text | 131,072 | 8,192 | $0.7 / 1M | $2.8 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba alibaba | Qwen2.5 72B Instruct qwen2-5-72b-instruct | qwen | text | text | 131,072 | 8,192 | $1.4 / 1M | $5.6 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba alibaba | Qwen2.5 7B Instruct qwen2-5-7b-instruct | qwen | text | text | 131,072 | 8,192 | $0.175 / 1M | $0.7 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba alibaba | Qwen3 14B qwen3-14b | qwen | text | text | 131,072 | 8,192 | $0.35 / 1M | $1.4 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba alibaba | Qwen3 235B-A22B qwen3-235b-a22b | qwen | text | text | 131,072 | 16,384 | $0.7 / 1M | $2.8 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba alibaba | Qwen3 32B qwen3-32b | qwen | text | text | 131,072 | 16,384 | $0.7 / 1M | $2.8 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba alibaba | Qwen3 8B qwen3-8b | qwen | text | text | 131,072 | 8,192 | $0.18 / 1M | $0.7 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba alibaba | Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking | qwen | text | text | 131,072 | 32,768 | $0.5 / 1M | $6 / 1M | Yes | Yes | Sep 1, 2025 | |
Alibaba alibaba | Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct | qwen | text | text | 131,072 | 32,768 | $0.5 / 1M | $2 / 1M | No | Yes | Sep 1, 2025 | |
Alibaba alibaba | QwQ Plus qwq-plus | qwen | text | text | 131,072 | 8,192 | $0.8 / 1M | $2.4 / 1M | Yes | Yes | Mar 5, 2025 | |
Amazon Bedrock amazon-bedrock | Qwen3 Coder 480B A35B Instruct qwen.qwen3-coder-480b-a35b-v1:0 | qwen | text | text | 131,072 | 65,536 | $0.22 / 1M | $1.8 / 1M | No | Yes | Sep 18, 2025 | |
Amazon Bedrock amazon-bedrock | Qwen3 Coder Next qwen.qwen3-coder-next | qwen | text | text | 131,072 | 65,536 | $0.22 / 1M | $1.8 / 1M | Yes | Yes | Feb 6, 2026 | |
Azure Cognitive Services azure-cognitive-services | DeepSeek-V3-0324 deepseek-v3-0324 | deepseek | text | text | 131,072 | 131,072 | $1.14 / 1M | $4.56 / 1M | No | Yes | Mar 24, 2025 | |
Azure Cognitive Services azure-cognitive-services | DeepSeek-V3.1 deepseek-v3.1 | deepseek | text | text | 131,072 | 131,072 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Aug 21, 2025 | |
Azure Cognitive Services azure-cognitive-services | Grok 3 grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
Azure Cognitive Services azure-cognitive-services | Grok 3 Mini grok-3-mini | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
Azure azure | DeepSeek-V3-0324 deepseek-v3-0324 | deepseek | text | text | 131,072 | 131,072 | $1.14 / 1M | $4.56 / 1M | No | Yes | Mar 24, 2025 | |
Azure azure | DeepSeek-V3.1 deepseek-v3.1 | deepseek | text | text | 131,072 | 131,072 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Aug 21, 2025 | |
Azure azure | Grok 3 grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
Azure azure | Grok 3 Mini grok-3-mini | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
Cerebras cerebras | GPT OSS 120B gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,768 | $0.25 / 1M | $0.69 / 1M | Yes | Yes | Aug 5, 2025 | |
Cerebras cerebras | Z.AI GLM-4.7 zai-glm-4.7 | Unknown | text | text | 131,072 | 40,000 | $2.25 / 1M | $2.75 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B | deepseek-thinking | text | text | 131,072 | 131,072 | $0.03 / 1M | $0.11 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | DeepSeek V3.2 TEE deepseek-ai/DeepSeek-V3.2-TEE | deepseek | text | text | 131,072 | 65,536 | $0.28 / 1M | $0.42 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | GLM 4.5 Air zai-org/GLM-4.5-Air | glm | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.22 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | GLM 4.5 FP8 zai-org/GLM-4.5-FP8 | Unknown | text | text | 131,072 | 65,536 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | GLM 4.5 TEE zai-org/GLM-4.5-TEE | glm | text | text | 131,072 | 65,536 | $0.35 / 1M | $1.55 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | gpt oss 120b TEE openai/gpt-oss-120b-TEE | gpt-oss | text | text | 131,072 | 65,536 | $0.04 / 1M | $0.18 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | gpt oss 20b openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 131,072 | $0.02 / 1M | $0.1 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Hermes 4 405B FP8 TEE NousResearch/Hermes-4-405B-FP8-TEE | nousresearch | text | text | 131,072 | 65,536 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Hermes 4 70B NousResearch/Hermes-4-70B | nousresearch | text | text | 131,072 | 131,072 | $0.11 / 1M | $0.38 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Mistral Nemo Instruct 2407 unsloth/Mistral-Nemo-Instruct-2407 | unsloth | text | text | 131,072 | 131,072 | $0.02 / 1M | $0.04 / 1M | No | No | Jan 10, 2026 | |
Clarifai clarifai | GPT OSS 120B High Throughput openai/chat-completion/models/gpt-oss-120b-high-throughput | gpt-oss | text | text | 131,072 | 16,384 | $0.09 / 1M | $0.36 / 1M | Yes | Yes | Feb 25, 2026 | |
Clarifai clarifai | GPT OSS 20B openai/chat-completion/models/gpt-oss-20b | gpt-oss | text | text | 131,072 | 16,384 | $0.045 / 1M | $0.18 / 1M | Yes | Yes | Dec 12, 2025 | |
Clarifai clarifai | Trinity Mini arcee_ai/AFM/models/trinity-mini | trinity-mini | text | text | 131,072 | 131,072 | $0.045 / 1M | $0.15 / 1M | Yes | Yes | Feb 25, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GLM-4.7-Flash workers-ai/@cf/zai-org/glm-4.7-flash | glm-flash | text | text | 131,072 | 131,072 | $0.06 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Cloudflare Workers AI cloudflare-workers-ai | GLM-4.7-Flash @cf/zai-org/glm-4.7-flash | glm-flash | text | text | 131,072 | 131,072 | $0.06 / 1M | $0.4 / 1M | Yes | Yes | Jan 19, 2026 | |
Cortecs cortecs | GLM 4.5 glm-4.5 | glm | text | text | 131,072 | 131,072 | $0.67 / 1M | $2.46 / 1M | Yes | Yes | Jul 29, 2025 | |
Cortecs cortecs | GLM 4.5 Air glm-4.5-air | glm-air | text | text | 131,072 | 131,072 | $0.22 / 1M | $1.34 / 1M | Yes | Yes | Aug 1, 2025 | |
D.Run (China) drun | DeepSeek R1 public/deepseek-r1 | deepseek-thinking | text | text | 131,072 | 32,000 | $0.55 / 1M | $2.2 / 1M | Yes | Yes | Jan 20, 2025 | |
D.Run (China) drun | DeepSeek V3 public/deepseek-v3 | deepseek | text | text | 131,072 | 8,192 | $0.28 / 1M | $1.1 / 1M | No | Yes | Dec 26, 2024 | |
Deep Infra deepinfra | GLM-4.5 zai-org/GLM-4.5 | glm | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | No | Yes | Jul 28, 2025 | |
Deep Infra deepinfra | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 16,384 | $0.05 / 1M | $0.24 / 1M | Yes | Yes | Aug 5, 2025 | |
Deep Infra deepinfra | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 16,384 | $0.03 / 1M | $0.14 / 1M | Yes | Yes | Aug 5, 2025 | |
Deep Infra deepinfra | Kimi K2 moonshotai/Kimi-K2-Instruct | kimi | text | text | 131,072 | 32,768 | $0.5 / 1M | $2 / 1M | No | Yes | Jul 11, 2025 | |
Deep Infra deepinfra | Kimi K2 Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 131,072 | 32,768 | $0.47 / 1M | $2 / 1M | Yes | Yes | Nov 7, 2025 | |
Deep Infra deepinfra | Llama 3.1 70B meta-llama/Llama-3.1-70B-Instruct | llama | text | text | 131,072 | 16,384 | $0.4 / 1M | $0.4 / 1M | No | Yes | Jul 23, 2024 | |
Deep Infra deepinfra | Llama 3.1 70B Turbo meta-llama/Llama-3.1-70B-Instruct-Turbo | llama | text | text | 131,072 | 16,384 | $0.4 / 1M | $0.4 / 1M | No | Yes | Jul 23, 2024 | |
Deep Infra deepinfra | Llama 3.1 8B meta-llama/Llama-3.1-8B-Instruct | llama | text | text | 131,072 | 16,384 | $0.02 / 1M | $0.05 / 1M | No | Yes | Jul 23, 2024 | |
Deep Infra deepinfra | Llama 3.1 8B Turbo meta-llama/Llama-3.1-8B-Instruct-Turbo | llama | text | text | 131,072 | 16,384 | $0.02 / 1M | $0.03 / 1M | No | Yes | Jul 23, 2024 | |
Deep Infra deepinfra | Llama 3.3 70B Turbo meta-llama/Llama-3.3-70B-Instruct-Turbo | llama | text | text | 131,072 | 16,384 | $0.1 / 1M | $0.32 / 1M | No | Yes | Dec 6, 2024 | |
DigitalOcean digitalocean | DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 131,072 | 32,768 | $0.99 / 1M | $0.99 / 1M | Yes | Yes | Jan 30, 2025 | |
DigitalOcean digitalocean | gpt-oss-120b openai-gpt-oss-120b | gpt-oss | text | text | 131,072 | 131,072 | $0.1 / 1M | $0.7 / 1M | Yes | Yes | Apr 16, 2026 | |
DigitalOcean digitalocean | gpt-oss-20b openai-gpt-oss-20b | gpt-oss | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.45 / 1M | Yes | Yes | Apr 16, 2026 | |
DInference dinference | GPT OSS 120B gpt-oss-120b | Unknown | text | text | 131,072 | 32,768 | $0.0675 / 1M | $0.27 / 1M | No | Yes | Aug 1, 2025 | |
evroc evroc | Llama 3.3 70B nvidia/Llama-3.3-70B-Instruct-FP8 | llama | text | text | 131,072 | 32,768 | $1.18 / 1M | $1.18 / 1M | No | No | Dec 1, 2024 | |
evroc evroc | Magistral Small 1.2 24B mistralai/Magistral-Small-2509 | magistral-small | text | text | 131,072 | 131,072 | $0.59 / 1M | $2.36 / 1M | No | No | Jun 1, 2025 | |
FastRouter fastrouter | DeepSeek R1 Distill Llama 70B deepseek-ai/deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 131,072 | 131,072 | $0.03 / 1M | $0.14 / 1M | Yes | No | Jan 23, 2025 | |
FastRouter fastrouter | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,768 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Aug 5, 2025 | |
FastRouter fastrouter | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 65,536 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Aug 5, 2025 | |
FastRouter fastrouter | Kimi K2 moonshotai/kimi-k2 | kimi | text | text | 131,072 | 32,768 | $0.55 / 1M | $2.2 / 1M | No | Yes | Jul 11, 2025 | |
Fireworks AI fireworks-ai | GLM 4.5 accounts/fireworks/models/glm-4p5 | glm | text | text | 131,072 | 131,072 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Jul 29, 2025 | |
Fireworks AI fireworks-ai | GLM 4.5 Air accounts/fireworks/models/glm-4p5-air | glm-air | text | text | 131,072 | 131,072 | $0.22 / 1M | $0.88 / 1M | Yes | Yes | Aug 1, 2025 | |
Fireworks AI fireworks-ai | GPT OSS 120B accounts/fireworks/models/gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,768 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Aug 5, 2025 | |
Fireworks AI fireworks-ai | GPT OSS 20B accounts/fireworks/models/gpt-oss-20b | gpt-oss | text | text | 131,072 | 32,768 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Aug 5, 2025 | |
Firmware firmware | GPT OSS 120B gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,768 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Unknown | |
Firmware firmware | GPT OSS 20B gpt-oss-20b | gpt-oss | text | text | 131,072 | 32,768 | $0.07 / 1M | $0.2 / 1M | Yes | Yes | Unknown | |
Friendli friendli | Llama 3.1 8B Instruct meta-llama/Llama-3.1-8B-Instruct | llama | text | text | 131,072 | 8,000 | $0.1 / 1M | $0.1 / 1M | No | Yes | Dec 23, 2025 | |
Friendli friendli | Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 131,072 | 131,072 | $0.6 / 1M | $0.6 / 1M | No | Yes | Dec 23, 2025 | |
Groq groq | Compound groq/compound | groq | text | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 4, 2025 | |
Groq groq | Compound Mini groq/compound-mini | groq | text | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 4, 2025 | |
Groq groq | DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 131,072 | 8,192 | $0.75 / 1M | $0.99 / 1M | Yes | Yes | Jan 20, 2025 | |
Groq groq | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Aug 5, 2025 | |
Groq groq | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 65,536 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Aug 5, 2025 | |
Groq groq | Kimi K2 Instruct moonshotai/kimi-k2-instruct | kimi | text | text | 131,072 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Jul 14, 2025 | |
Groq groq | Llama 3.1 8B Instant llama-3.1-8b-instant | llama | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.08 / 1M | No | Yes | Jul 23, 2024 | |
Groq groq | Llama 3.3 70B Versatile llama-3.3-70b-versatile | llama | text | text | 131,072 | 32,768 | $0.59 / 1M | $0.79 / 1M | No | Yes | Dec 6, 2024 | |
Groq groq | Qwen QwQ 32B qwen-qwq-32b | qwen | text | text | 131,072 | 16,384 | $0.29 / 1M | $0.39 / 1M | Yes | Yes | Nov 27, 2024 | |
Groq groq | Qwen3 32B qwen/qwen3-32b | qwen | text | text | 131,072 | 40,960 | $0.29 / 1M | $0.59 / 1M | Yes | Yes | Dec 23, 2024 | |
Groq groq | Safety GPT OSS 20B openai/gpt-oss-safeguard-20b | gpt-oss | text | text | 131,072 | 65,536 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Mar 5, 2025 | |
Helicone helicone | Hermes 2 Pro Llama 3 8B hermes-2-pro-llama-3-8b | llama | text | text | 131,072 | 131,072 | $0.14 / 1M | $0.14 / 1M | No | Yes | May 27, 2024 | |
Helicone helicone | Kimi K2 (07/11) kimi-k2-0711 | kimi | text | text | 131,072 | 16,384 | $0.57 / 1M | $2.3 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | Meta Llama 3.1 8B Instant llama-3.1-8b-instant | llama | text | text | 131,072 | 32,678 | $0.05 / 1M | $0.08 / 1M | No | Yes | Jul 1, 2024 | |
Helicone helicone | Meta Llama 3.3 70B Versatile llama-3.3-70b-versatile | llama | text | text | 131,072 | 32,678 | $0.59 / 1M | $0.79 / 1M | No | Yes | Dec 6, 2024 | |
Helicone helicone | OpenAI GPT-OSS 120b gpt-oss-120b | gpt-oss | text | text | 131,072 | 131,072 | $0.04 / 1M | $0.16 / 1M | Yes | Yes | Jun 1, 2024 | |
Helicone helicone | OpenAI GPT-OSS 20b gpt-oss-20b | gpt-oss | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Jun 1, 2024 | |
Helicone helicone | Qwen3 32B qwen3-32b | qwen | text | text | 131,072 | 40,960 | $0.29 / 1M | $0.59 / 1M | Yes | Yes | Apr 28, 2025 | |
Helicone helicone | xAI Grok 3 grok-3 | grok | text | text | 131,072 | 131,072 | $3 / 1M | $15 / 1M | No | Yes | Jun 1, 2024 | |
Helicone helicone | xAI Grok 3 Mini grok-3-mini | grok | text | text | 131,072 | 131,072 | $0.3 / 1M | $0.5 / 1M | No | Yes | Jun 1, 2024 | |
Hugging Face huggingface | Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct | kimi | text | text | 131,072 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Jul 14, 2025 | |
IO.NET io-net | GPT-OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 4,096 | $0.04 / 1M | $0.4 / 1M | No | Yes | Dec 1, 2024 | |
Jiekou.AI jiekou | GLM-4.5 zai-org/glm-4.5 | glm | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | Kimi K2 Instruct moonshotai/kimi-k2-instruct | kimi | text | text | 131,072 | 131,072 | $0.57 / 1M | $2.3 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-instruct-2507 | qwen | text | text | 131,072 | 16,384 | $0.15 / 1M | $0.8 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | Qwen3 235B A22b Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | qwen | text | text | 131,072 | 131,072 | $0.3 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | AionLabs: Aion-1.0 aion-labs/aion-1.0 | Unknown | text | text | 131,072 | 32,768 | $4 / 1M | $8 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini | Unknown | text | text | 131,072 | 32,768 | $0.7 / 1M | $1.4 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | AionLabs: Aion-2.0 aion-labs/aion-2.0 | Unknown | text | text | 131,072 | 32,768 | $0.8 / 1M | $1.6 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning | Unknown | text | text | 131,072 | 32,000 | $0.9 / 1M | $3.3 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Arcee AI: Trinity Mini arcee-ai/trinity-mini | Unknown | text | text | 131,072 | 131,072 | $0.045 / 1M | $0.15 / 1M | Yes | Yes | Jan 28, 2026 | |
Kilo Gateway kilo | Arcee AI: Virtuoso Large arcee-ai/virtuoso-large | Unknown | text | text | 131,072 | 64,000 | $0.75 / 1M | $1.2 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Baidu: ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking | Unknown | text | text | 131,072 | 65,536 | $0.07 / 1M | $0.28 / 1M | Yes | No | Sep 19, 2025 | |
Kilo Gateway kilo | DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | Unknown | text | text | 131,072 | 16,384 | $0.7 / 1M | $0.8 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | Llama Guard 3 8B meta-llama/llama-guard-3-8b | Unknown | text | text | 131,072 | 26,215 | $0.02 / 1M | $0.06 / 1M | No | No | Feb 4, 2026 | |
Kilo Gateway kilo | Meituan: LongCat Flash Chat meituan/longcat-flash-chat | Unknown | text | text | 131,072 | 131,072 | $0.2 / 1M | $0.8 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | Unknown | text | text | 131,072 | 26,215 | $0.4 / 1M | $0.4 / 1M | No | Yes | Jul 23, 2024 | |
Kilo Gateway kilo | Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | Unknown | text | text | 131,072 | 16,384 | $0.1 / 1M | $0.32 / 1M | No | Yes | Feb 4, 2026 | |
Kilo Gateway kilo | Mistral Large 2407 mistralai/mistral-large-2407 | Unknown | text | text | 131,072 | 32,768 | $2 / 1M | $6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral Large 2411 mistralai/mistral-large-2411 | Unknown | text | text | 131,072 | 26,215 | $2 / 1M | $6 / 1M | No | Yes | Nov 4, 2024 | |
Kilo Gateway kilo | Mistral: Devstral Medium mistralai/devstral-medium | Unknown | text | text | 131,072 | 26,215 | $0.4 / 1M | $2 / 1M | No | Yes | Jul 10, 2025 | |
Kilo Gateway kilo | Mistral: Devstral Small 1.1 mistralai/devstral-small | Unknown | text | text | 131,072 | 26,215 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jul 10, 2025 | |
Kilo Gateway kilo | Mistral: Mistral Nemo mistralai/mistral-nemo | Unknown | text | text | 131,072 | 16,384 | $0.02 / 1M | $0.04 / 1M | No | Yes | Jul 30, 2024 | |
Kilo Gateway kilo | MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | Unknown | text | text | 131,072 | 26,215 | $0.4 / 1M | $2 / 1M | No | Yes | Sep 5, 2025 | |
Kilo Gateway kilo | MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking | Unknown | text | text | 131,072 | 65,535 | $0.47 / 1M | $2 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Nex AGI: DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 | Unknown | text | text | 131,072 | 163,840 | $0.27 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
Kilo Gateway kilo | Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b | Unknown | text | text | 131,072 | 16,384 | $1 / 1M | $1 / 1M | No | No | Aug 16, 2024 | |
Kilo Gateway kilo | Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b | Unknown | text | text | 131,072 | 32,768 | $0.3 / 1M | $0.3 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Nous: Hermes 4 405B nousresearch/hermes-4-405b | Unknown | text | text | 131,072 | 26,215 | $1 / 1M | $3 / 1M | Yes | No | Aug 25, 2025 | |
Kilo Gateway kilo | Nous: Hermes 4 70B nousresearch/hermes-4-70b | Unknown | text | text | 131,072 | 131,072 | $0.13 / 1M | $0.4 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | NVIDIA: Llama 3.1 Nemotron 70B Instruct nvidia/llama-3.1-nemotron-70b-instruct | Unknown | text | text | 131,072 | 16,384 | $1.2 / 1M | $1.2 / 1M | No | Yes | Oct 12, 2024 | |
Kilo Gateway kilo | NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 | Unknown | text | text | 131,072 | 131,072 | $0.6 / 1M | $1.8 / 1M | Yes | No | Apr 11, 2026 | |
Kilo Gateway kilo | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 | Unknown | text | text | 131,072 | 26,215 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Mar 16, 2025 | |
Kilo Gateway kilo | NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 | Unknown | text | text | 131,072 | 26,215 | $0.04 / 1M | $0.16 / 1M | Yes | Yes | Aug 18, 2025 | |
Kilo Gateway kilo | OpenAI: gpt-oss-120b openai/gpt-oss-120b | Unknown | text | text | 131,072 | 26,215 | $0.039 / 1M | $0.19 / 1M | Yes | Yes | Aug 5, 2025 | |
Kilo Gateway kilo | OpenAI: gpt-oss-20b openai/gpt-oss-20b | Unknown | text | text | 131,072 | 26,215 | $0.03 / 1M | $0.14 / 1M | Yes | Yes | Aug 5, 2025 | |
Kilo Gateway kilo | OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b | Unknown | text | text | 131,072 | 65,536 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Oct 29, 2025 | |
Kilo Gateway kilo | Prime Intellect: INTELLECT-3 prime-intellect/intellect-3 | Unknown | text | text | 131,072 | 131,072 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Feb 4, 2026 | |
Kilo Gateway kilo | Qwen: Qwen-Turbo qwen/qwen-turbo | Unknown | text | text | 131,072 | 8,192 | $0.0325 / 1M | $0.13 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b | Unknown | text | text | 131,072 | 8,192 | $0.455 / 1M | $1.82 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | Unknown | text | text | 131,072 | 52,429 | $0.09 / 1M | $1.1 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | Unknown | text | text | 131,072 | 32,768 | $0.0975 / 1M | $0.78 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | Unknown | text | text | 131,072 | 16,384 | $0.85 / 1M | $0.85 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b | Unknown | text | text | 131,072 | 16,384 | $0.65 / 1M | $0.75 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Switchpoint Router switchpoint/router | Unknown | text | text | 131,072 | 32,768 | $0.85 / 1M | $3.4 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct | Unknown | text | text | 131,072 | 131,072 | $0.14 / 1M | $0.57 / 1M | Yes | No | Nov 25, 2025 | |
Kilo Gateway kilo | TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 | Unknown | text | text | 131,072 | 131,072 | $0.3 / 1M | $0.5 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Tongyi DeepResearch 30B A3B alibaba/tongyi-deepresearch-30b-a3b | Unknown | text | text | 131,072 | 131,072 | $0.09 / 1M | $0.45 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | xAI: Grok 3 x-ai/grok-3 | Unknown | text | text | 131,072 | 26,215 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
Kilo Gateway kilo | xAI: Grok 3 Beta x-ai/grok-3-beta | Unknown | text | text | 131,072 | 26,215 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
Kilo Gateway kilo | xAI: Grok 3 Mini x-ai/grok-3-mini | Unknown | text | text | 131,072 | 26,215 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
Kilo Gateway kilo | xAI: Grok 3 Mini Beta x-ai/grok-3-mini-beta | Unknown | text | text | 131,072 | 26,215 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
Kilo Gateway kilo | Z.ai: GLM 4.5 z-ai/glm-4.5 | Unknown | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 4.5 Air z-ai/glm-4.5-air | Unknown | text | text | 131,072 | 98,304 | $0.13 / 1M | $0.85 / 1M | Yes | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | GLM-4.5 glm-4.5 | glm | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | GLM-4.5-Air glm-4.5-air | glm-air | text | text | 131,072 | 98,304 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | GLM-4.5-Flash glm-4.5-flash | glm-flash | text | text | 131,072 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | GPT OSS 120B gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,766 | $0.15 / 1M | $0.75 / 1M | Yes | Yes | Aug 5, 2025 | |
LLM Gateway llmgateway | GPT OSS 20B gpt-oss-20b | gpt-oss | text | text | 131,072 | 32,766 | $0.1 / 1M | $0.5 / 1M | Yes | Yes | Aug 5, 2025 | |
LLM Gateway llmgateway | Grok 3 grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
LLM Gateway llmgateway | Kimi K2 kimi-k2 | kimi | text | text | 131,072 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Jul 11, 2025 | |
LLM Gateway llmgateway | Qwen Coder Plus qwen-coder-plus | qwen | text | text | 131,072 | 8,192 | $0.5 / 1M | $1 / 1M | No | Yes | Sep 18, 2024 | |
LLM Gateway llmgateway | Qwen2.5 Coder 7B qwen25-coder-7b | qwen | text | text | 131,072 | 8,192 | $0.05 / 1M | $0.05 / 1M | No | No | Sep 19, 2024 | |
LLM Gateway llmgateway | Qwen3 235B A22B FP8 qwen3-235b-a22b-fp8 | qwen | text | text | 131,072 | 8,192 | $0.5 / 1M | $2.5 / 1M | Yes | Yes | Apr 28, 2025 | |
LLM Gateway llmgateway | Qwen3 235B A22B Instruct (2507) qwen3-235b-a22b-instruct-2507 | qwen | text | text | 131,072 | 8,192 | $0.8 / 1M | $2.4 / 1M | No | Yes | Jul 8, 2025 | |
LLM Gateway llmgateway | Qwen3 235B A22B Thinking (2507) qwen3-235b-a22b-thinking-2507 | qwen | text | text | 131,072 | 8,192 | $0.8 / 1M | $2.4 / 1M | Yes | Yes | Jul 8, 2025 | |
LLM Gateway llmgateway | Qwen3 30B A3B FP8 qwen3-30b-a3b-fp8 | qwen | text | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Apr 28, 2025 | |
LLM Gateway llmgateway | Qwen3 30B A3B Instruct (2507) qwen3-30b-a3b-instruct-2507 | qwen | text | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | Yes | Jul 8, 2025 | |
LLM Gateway llmgateway | Qwen3 30B A3B Thinking (2507) qwen3-30b-a3b-thinking-2507 | qwen | text | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Jul 8, 2025 | |
LLM Gateway llmgateway | Qwen3 32B qwen3-32b | qwen | text | text | 131,072 | 16,384 | $0.7 / 1M | $2.8 / 1M | Yes | Yes | Apr 1, 2025 | |
LLM Gateway llmgateway | Qwen3 32B FP8 qwen3-32b-fp8 | qwen | text | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Apr 28, 2025 | |
LLM Gateway llmgateway | Qwen3 4B FP8 qwen3-4b-fp8 | qwen | text | text | 131,072 | 8,192 | $0.03 / 1M | $0.05 / 1M | Yes | Yes | Apr 28, 2025 | |
LLM Gateway llmgateway | Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking | qwen | text | text | 131,072 | 32,768 | $0.5 / 1M | $6 / 1M | Yes | Yes | Sep 1, 2025 | |
LLM Gateway llmgateway | Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct | qwen | text | text | 131,072 | 32,768 | $0.5 / 1M | $2 / 1M | No | Yes | Sep 1, 2025 | |
LLM Gateway llmgateway | QwQ Plus qwq-plus | qwen | text | text | 131,072 | 8,192 | $0.8 / 1M | $2.4 / 1M | Yes | Yes | Mar 5, 2025 | |
LMStudio lmstudio | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 5, 2025 | |
Meganova meganova | Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 131,072 | 16,384 | $0.1 / 1M | $0.3 / 1M | No | Yes | Dec 6, 2024 | |
Meganova meganova | Mistral Nemo Instruct 2407 mistralai/Mistral-Nemo-Instruct-2407 | mistral | text | text | 131,072 | 65,536 | $0.02 / 1M | $0.04 / 1M | No | Yes | Jul 18, 2024 | |
Mistral mistral | Mistral Large 2.1 mistral-large-2411 | mistral-large | text | text | 131,072 | 16,384 | $2 / 1M | $6 / 1M | No | Yes | Nov 4, 2024 | |
ModelScope modelscope | GLM-4.5 ZhipuAI/GLM-4.5 | glm | text | text | 131,072 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 28, 2025 | |
Moonshot AI (China) moonshotai-cn | Kimi K2 0711 kimi-k2-0711-preview | kimi | text | text | 131,072 | 16,384 | $0.6 / 1M | $2.5 / 1M | No | Yes | Jul 14, 2025 | |
Moonshot AI moonshotai | Kimi K2 0711 kimi-k2-0711-preview | kimi | text | text | 131,072 | 16,384 | $0.6 / 1M | $2.5 / 1M | No | Yes | Jul 14, 2025 | |
NanoGPT nano-gpt | Aion 1.0 mini (DeepSeek) aion-labs/aion-1.0-mini | deepseek | text | text | 131,072 | 8,192 | $0.799 / 1M | $1.394 / 1M | No | No | Feb 20, 2025 | |
NanoGPT nano-gpt | Anubis 70B v1.1 TheDrummer 2/Anubis-70B-v1.1 | llama | text | text | 131,072 | 16,384 | $0.31 / 1M | $0.31 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | ERNIE 4.5 300B baidu/ernie-4.5-300b-a47b | ernie | text | text | 131,072 | 16,384 | $0.35 / 1M | $1.15 / 1M | No | No | Jun 30, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B TEE TEE/gemma-3-27b-it | gemma | text | text | 131,072 | 8,192 | $0.2 / 1M | $0.8 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted Iceblink GLM-4.5-Air-Derestricted-Iceblink | Unknown | text | text | 131,072 | 98,304 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 28, 2025 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted Iceblink ReExtract GLM-4.5-Air-Derestricted-Iceblink-ReExtract | Unknown | text | text | 131,072 | 98,304 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 12, 2025 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted Iceblink v2 ReExtract GLM-4.5-Air-Derestricted-Iceblink-v2-ReExtract | Unknown | text | text | 131,072 | 65,536 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 12, 2025 | |
NanoGPT nano-gpt | GLM 4.5 Air Derestricted Steam ReExtract GLM-4.5-Air-Derestricted-Steam-ReExtract | Unknown | text | text | 131,072 | 65,536 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 12, 2025 | |
NanoGPT nano-gpt | GLM 4.6 Derestricted v5 GLM-4.6-Derestricted-v5 | Unknown | text | text | 131,072 | 8,192 | $0.4 / 1M | $1.5 / 1M | No | No | Dec 23, 2025 | |
NanoGPT nano-gpt | GPT-OSS 120B TEE TEE/gpt-oss-120b | gpt-oss | text | text | 131,072 | 16,384 | $2 / 1M | $2 / 1M | No | No | Aug 5, 2025 | |
NanoGPT nano-gpt | GPT-OSS 20B TEE TEE/gpt-oss-20b | gpt-oss | text | text | 131,072 | 8,192 | $0.2 / 1M | $0.8 / 1M | No | No | Aug 5, 2025 | |
NanoGPT nano-gpt | Grok 3 Beta grok-3-beta | Unknown | text | text | 131,072 | 131,072 | $3 / 1M | $15 / 1M | No | No | Sep 29, 2025 | |
NanoGPT nano-gpt | Grok 3 Mini Beta grok-3-mini-beta | Unknown | text | text | 131,072 | 131,072 | $0.3 / 1M | $0.5 / 1M | No | No | Feb 17, 2025 | |
NanoGPT nano-gpt | Grok 3 Mini Fast Beta grok-3-mini-fast-beta | Unknown | text | text | 131,072 | 131,072 | $0.6 / 1M | $4 / 1M | No | No | Feb 17, 2025 | |
NanoGPT nano-gpt | Llama 3.1 8b Instruct meta-llama/llama-3.1-8b-instruct | llama | text | text | 131,072 | 16,384 | $0.0544 / 1M | $0.0544 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Llama 3.2 Medium meta-llama/llama-3.2-90b-vision-instruct | llama | text | text | 131,072 | 16,384 | $0.901 / 1M | $0.901 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | Llama 3.3 70b Instruct meta-llama/llama-3.3-70b-instruct | llama | text | text | 131,072 | 16,384 | $0.05 / 1M | $0.23 / 1M | No | Yes | Feb 27, 2025 | |
NanoGPT nano-gpt | Ministral 3B mistralai/ministral-3b-2512 | ministral | text | text | 131,072 | 32,768 | $0.1 / 1M | $0.1 / 1M | No | No | Dec 4, 2025 | |
NanoGPT nano-gpt | Mistral Medium 3.1 mistralai/mistral-medium-3.1 | mistral-medium | text | text | 131,072 | 32,768 | $0.4 / 1M | $2 / 1M | No | No | Sep 5, 2025 | |
NanoGPT nano-gpt | Trinity Large arcee-ai/trinity-large | trinity | text | text | 131,072 | 8,192 | $0.25 / 1M | $1 / 1M | No | No | Dec 1, 2025 | |
NanoGPT nano-gpt | Trinity Mini arcee-ai/trinity-mini | trinity-mini | text | text | 131,072 | 8,192 | $0.045 / 1M | $0.15 / 1M | No | No | Dec 1, 2025 | |
Nebius Token Factory nebius | DeepSeek R1 0528 Fast deepseek-ai/DeepSeek-R1-0528-fast | deepseek | text | text | 131,072 | 8,192 | $2 / 1M | $6 / 1M | Yes | Yes | Feb 4, 2025 | |
NovitaAI novita-ai | baichuan-m2-32b baichuan/baichuan-m2-32b | baichuan | text | text | 131,072 | 131,072 | $0.07 / 1M | $0.07 / 1M | No | No | Aug 13, 2025 | |
NovitaAI novita-ai | DeepSeek V3.1 deepseek/deepseek-v3.1 | deepseek | text | text | 131,072 | 32,768 | $0.27 / 1M | $1 / 1M | Yes | Yes | Aug 21, 2025 | |
NovitaAI novita-ai | Deepseek V3.1 Terminus deepseek/deepseek-v3.1-terminus | deepseek | text | text | 131,072 | 32,768 | $0.27 / 1M | $1 / 1M | Yes | Yes | Sep 22, 2025 | |
NovitaAI novita-ai | ERNIE-4.5-21B-A3B-Thinking baidu/ernie-4.5-21B-a3b-thinking | ernie | text | text | 131,072 | 65,536 | $0.07 / 1M | $0.28 / 1M | Yes | No | Sep 19, 2025 | |
NovitaAI novita-ai | GLM 4.5 Air zai-org/glm-4.5-air | glm-air | text | text | 131,072 | 98,304 | $0.13 / 1M | $0.85 / 1M | Yes | Yes | Oct 13, 2025 | |
NovitaAI novita-ai | GLM-4.5 zai-org/glm-4.5 | glm | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
NovitaAI novita-ai | Kimi K2 Instruct moonshotai/kimi-k2-instruct | Unknown | text | text | 131,072 | 131,072 | $0.57 / 1M | $2.3 / 1M | No | Yes | Jul 11, 2025 | |
NovitaAI novita-ai | Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | llama | text | text | 131,072 | 120,000 | $0.135 / 1M | $0.4 / 1M | No | Yes | Dec 7, 2024 | |
NovitaAI novita-ai | Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-instruct-2507 | qwen | text | text | 131,072 | 16,384 | $0.09 / 1M | $0.58 / 1M | No | Yes | Jul 22, 2025 | |
NovitaAI novita-ai | Qwen3 235B A22b Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | qwen | text | text | 131,072 | 32,768 | $0.3 / 1M | $3 / 1M | Yes | Yes | Jul 25, 2025 | |
NovitaAI novita-ai | Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | Unknown | text | text | 131,072 | 32,768 | $0.15 / 1M | $1.5 / 1M | No | Yes | Sep 10, 2025 | |
NovitaAI novita-ai | Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | Unknown | text | text | 131,072 | 32,768 | $0.15 / 1M | $1.5 / 1M | Yes | Yes | Sep 10, 2025 | |
Nvidia nvidia | GLM-5.1 z-ai/glm-5.1 | glm | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 27, 2026 | |
Nvidia nvidia | Llama-3.1-Nemotron-Ultra-253B-v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 | llama | text | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 5, 2025 | |
Nvidia nvidia | nemotron-3-nano-30b-a3b nvidia/nemotron-3-nano-30b-a3b | nemotron | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 1, 2024 | |
Nvidia nvidia | nvidia-nemotron-nano-9b-v2 nvidia/nvidia-nemotron-nano-9b-v2 | nemotron | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 18, 2025 | |
Nvidia nvidia | Qwen3-235B-A22B qwen/qwen3-235b-a22b | qwen | text | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 5, 2025 | |
Ollama Cloud ollama-cloud | gpt-oss:120b gpt-oss:120b | gpt-oss | text | text | 131,072 | 32,768 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | gpt-oss:20b gpt-oss:20b | gpt-oss | text | text | 131,072 | 32,768 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
OpenCode Zen opencode | Trinity Large Preview trinity-large-preview-free | trinity | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | No | Yes | Jan 28, 2026 | |
OpenRouter openrouter | DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | deepseek | text | text | 131,072 | 65,536 | $0.27 / 1M | $1 / 1M | Yes | Yes | Sep 22, 2025 | |
OpenRouter openrouter | DeepSeek V3.1 Terminus (exacto) deepseek/deepseek-v3.1-terminus:exacto | deepseek | text | text | 131,072 | 65,536 | $0.27 / 1M | $1 / 1M | Yes | Yes | Sep 22, 2025 | |
OpenRouter openrouter | Devstral Medium mistralai/devstral-medium-2507 | devstral | text | text | 131,072 | 131,072 | $0.4 / 1M | $2 / 1M | No | Yes | Jul 10, 2025 | |
OpenRouter openrouter | Devstral Small 1.1 mistralai/devstral-small-2507 | devstral | text | text | 131,072 | 131,072 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jul 10, 2025 | |
OpenRouter openrouter | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,768 | $0.072 / 1M | $0.28 / 1M | Yes | Yes | Aug 5, 2025 | |
OpenRouter openrouter | GPT OSS 120B (exacto) openai/gpt-oss-120b:exacto | gpt-oss | text | text | 131,072 | 32,768 | $0.05 / 1M | $0.24 / 1M | Yes | Yes | Aug 5, 2025 | |
OpenRouter openrouter | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 32,768 | $0.05 / 1M | $0.2 / 1M | Yes | Yes | Aug 5, 2025 | |
OpenRouter openrouter | GPT OSS Safeguard 20B openai/gpt-oss-safeguard-20b | gpt-oss | text | text | 131,072 | 65,536 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Oct 29, 2025 | |
OpenRouter openrouter | gpt-oss-120b (free) openai/gpt-oss-120b:free | gpt-oss | text | text | 131,072 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 5, 2025 | |
OpenRouter openrouter | gpt-oss-20b (free) openai/gpt-oss-20b:free | gpt-oss | text | text | 131,072 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 31, 2026 | |
OpenRouter openrouter | Grok 3 x-ai/grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
OpenRouter openrouter | Grok 3 Beta x-ai/grok-3-beta | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
OpenRouter openrouter | Grok 3 Mini x-ai/grok-3-mini | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
OpenRouter openrouter | Grok 3 Mini Beta x-ai/grok-3-mini-beta | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
OpenRouter openrouter | Hermes 3 405B Instruct (free) nousresearch/hermes-3-llama-3.1-405b:free | hermes | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | Yes | No | Aug 16, 2024 | |
OpenRouter openrouter | Hermes 4 405B nousresearch/hermes-4-405b | hermes | text | text | 131,072 | 131,072 | $1 / 1M | $3 / 1M | Yes | Yes | Aug 25, 2025 | |
OpenRouter openrouter | Hermes 4 70B nousresearch/hermes-4-70b | hermes | text | text | 131,072 | 131,072 | $0.13 / 1M | $0.4 / 1M | Yes | Yes | Aug 25, 2025 | |
OpenRouter openrouter | Intellect 3 prime-intellect/intellect-3 | glm | text | text | 131,072 | 8,192 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jan 15, 2025 | |
OpenRouter openrouter | Kimi K2 moonshotai/kimi-k2 | kimi | text | text | 131,072 | 32,768 | $0.55 / 1M | $2.2 / 1M | No | Yes | Jul 11, 2025 | |
OpenRouter openrouter | LFM2.5-1.2B-Instruct (free) liquid/lfm-2.5-1.2b-instruct:free | liquid | text | text | 131,072 | 32,768 | $0 / 1M | $0 / 1M | No | No | Jan 28, 2026 | |
OpenRouter openrouter | LFM2.5-1.2B-Thinking (free) liquid/lfm-2.5-1.2b-thinking:free | liquid | text | text | 131,072 | 32,768 | $0 / 1M | $0 / 1M | Yes | No | Jan 28, 2026 | |
OpenRouter openrouter | Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free | llama | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | No | Yes | Dec 6, 2024 | |
OpenRouter openrouter | nvidia-nemotron-nano-9b-v2 nvidia/nemotron-nano-9b-v2 | nemotron | text | text | 131,072 | 131,072 | $0.04 / 1M | $0.16 / 1M | Yes | Yes | Aug 18, 2025 | |
OpenRouter openrouter | Qwen3 Coder (exacto) qwen/qwen3-coder:exacto | qwen | text | text | 131,072 | 32,768 | $0.38 / 1M | $1.53 / 1M | No | Yes | Jul 23, 2025 | |
OpenRouter openrouter | Trinity Large Preview arcee-ai/trinity-large-preview:free | trinity | text | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | No | Yes | Jan 28, 2026 | |
OVHcloud AI Endpoints ovhcloud | gpt-oss-120b gpt-oss-120b | Unknown | text | text | 131,072 | 131,072 | $0.09 / 1M | $0.47 / 1M | Yes | Yes | Aug 28, 2025 | |
OVHcloud AI Endpoints ovhcloud | gpt-oss-20b gpt-oss-20b | Unknown | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.18 / 1M | Yes | Yes | Aug 28, 2025 | |
OVHcloud AI Endpoints ovhcloud | Llama-3.1-8B-Instruct llama-3.1-8b-instruct | Unknown | text | text | 131,072 | 131,072 | $0.11 / 1M | $0.11 / 1M | No | Yes | Jun 11, 2025 | |
OVHcloud AI Endpoints ovhcloud | Meta-Llama-3_3-70B-Instruct meta-llama-3_3-70b-instruct | Unknown | text | text | 131,072 | 131,072 | $0.74 / 1M | $0.74 / 1M | No | Yes | Apr 1, 2025 | |
Poe poe | Grok 3 xai/grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Apr 11, 2025 | |
Poe poe | Grok 3 Mini xai/grok-3-mini | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Apr 11, 2025 | |
Qiniu qiniu-ai | GLM 4.5 glm-4.5 | Unknown | text | text | 131,072 | 98,304 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Meituan/Longcat-Flash-Chat meituan/longcat-flash-chat | Unknown | text | text | 131,072 | 131,072 | Unknown | Unknown | No | No | Nov 5, 2025 | |
Qiniu qiniu-ai | Qwen3 Next 80B A3B Instruct qwen3-next-80b-a3b-instruct | Unknown | text | text | 131,072 | 32,768 | Unknown | Unknown | No | Yes | Sep 12, 2025 | |
Qiniu qiniu-ai | Qwen3 Next 80B A3B Thinking qwen3-next-80b-a3b-thinking | Unknown | text | text | 131,072 | 32,768 | Unknown | Unknown | Yes | Yes | Sep 12, 2025 | |
submodel submodel | GLM 4.5 Air zai-org/GLM-4.5-Air | glm-air | text | text | 131,072 | 131,072 | $0.1 / 1M | $0.5 / 1M | No | Yes | Jul 28, 2025 | |
submodel submodel | GLM 4.5 FP8 zai-org/GLM-4.5-FP8 | glm | text | text | 131,072 | 131,072 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Jul 28, 2025 | |
submodel submodel | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 32,768 | $0.1 / 1M | $0.5 / 1M | Yes | Yes | Aug 23, 2025 | |
Tencent Coding Plan (China) tencent-coding-plan | Auto tc-code-latest | auto | text | text | 131,072 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Mar 8, 2026 | |
Tencent Coding Plan (China) tencent-coding-plan | Hunyuan-T1 hunyuan-t1 | hunyuan | text | text | 131,072 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 8, 2026 | |
Tencent Coding Plan (China) tencent-coding-plan | Hunyuan-TurboS hunyuan-turbos | hunyuan | text | text | 131,072 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Mar 8, 2026 | |
Tencent Coding Plan (China) tencent-coding-plan | Tencent HY 2.0 Instruct hunyuan-2.0-instruct | hunyuan | text | text | 131,072 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Mar 8, 2026 | |
Tencent Coding Plan (China) tencent-coding-plan | Tencent HY 2.0 Think hunyuan-2.0-thinking | hunyuan | text | text | 131,072 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 8, 2026 | |
Together AI togetherai | DeepSeek V3 deepseek-ai/DeepSeek-V3 | deepseek | text | text | 131,072 | 131,072 | $1.25 / 1M | $1.25 / 1M | Yes | Yes | May 29, 2025 | |
Together AI togetherai | DeepSeek V3.1 deepseek-ai/DeepSeek-V3-1 | deepseek | text | text | 131,072 | 131,072 | $0.6 / 1M | $1.7 / 1M | Yes | Yes | Aug 21, 2025 | |
Together AI togetherai | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 131,072 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Aug 5, 2025 | |
Together AI togetherai | Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct-Turbo | llama | text | text | 131,072 | 131,072 | $0.88 / 1M | $0.88 / 1M | No | Yes | Dec 6, 2024 | |
Upstage upstage | solar-pro3 solar-pro3 | solar-pro | text | text | 131,072 | 8,192 | $0.25 / 1M | $0.25 / 1M | Yes | Yes | Jan 1, 2026 | |
Vercel AI Gateway vercel | DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | deepseek | text | text | 131,072 | 65,536 | $0.27 / 1M | $1 / 1M | Yes | Yes | Sep 22, 2025 | |
Vercel AI Gateway vercel | GLM 4.5 zai/glm-4.5 | glm | text | text | 131,072 | 131,072 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
Vercel AI Gateway vercel | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 131,072 | $0.1 / 1M | $0.5 / 1M | Yes | Yes | Aug 5, 2025 | |
Vercel AI Gateway vercel | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 32,768 | $0.07 / 1M | $0.3 / 1M | Yes | Yes | Aug 5, 2025 | |
Vercel AI Gateway vercel | gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b | gpt-oss | text | text | 131,072 | 65,536 | $0.08 / 1M | $0.3 / 1M | Yes | Yes | Dec 1, 2024 | |
Vercel AI Gateway vercel | Grok 3 xai/grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
Vercel AI Gateway vercel | Grok 3 Fast xai/grok-3-fast | grok | text | text | 131,072 | 8,192 | $5 / 1M | $25 / 1M | No | Yes | Feb 17, 2025 | |
Vercel AI Gateway vercel | Grok 3 Mini xai/grok-3-mini | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
Vercel AI Gateway vercel | Grok 3 Mini Fast xai/grok-3-mini-fast | grok | text | text | 131,072 | 8,192 | $0.6 / 1M | $4 / 1M | Yes | Yes | Feb 17, 2025 | |
Vercel AI Gateway vercel | INTELLECT 3 prime-intellect/intellect-3 | intellect | text | text | 131,072 | 131,072 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Nov 26, 2025 | |
Vercel AI Gateway vercel | Kimi K2 0905 moonshotai/kimi-k2-0905 | kimi | text | text | 131,072 | 16,384 | $0.6 / 1M | $2.5 / 1M | No | No | Sep 5, 2025 | |
Vercel AI Gateway vercel | Kimi K2 Instruct moonshotai/kimi-k2 | kimi | text | text | 131,072 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Jul 14, 2025 | |
Vercel AI Gateway vercel | Llama 3.1 70B Instruct meta/llama-3.1-70b | llama | text | text | 131,072 | 16,384 | $0.4 / 1M | $0.4 / 1M | No | Yes | Jul 23, 2024 | |
Vercel AI Gateway vercel | Llama 3.1 8B Instruct meta/llama-3.1-8b | llama | text | text | 131,072 | 16,384 | $0.03 / 1M | $0.05 / 1M | No | Yes | Jul 23, 2024 | |
Vercel AI Gateway vercel | Nvidia Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 | nemotron | text | text | 131,072 | 131,072 | $0.04 / 1M | $0.16 / 1M | Yes | Yes | Aug 18, 2025 | |
Vercel AI Gateway vercel | Qwen3 Next 80B A3B Thinking alibaba/qwen3-next-80b-a3b-thinking | qwen | text | text | 131,072 | 65,536 | $0.15 / 1M | $1.5 / 1M | Yes | Yes | Sep 12, 2025 | |
Vercel AI Gateway vercel | Trinity Mini arcee-ai/trinity-mini | trinity | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.15 / 1M | No | No | Dec 1, 2025 | |
Vertex google-vertex | GPT OSS 120B openai/gpt-oss-120b-maas | gpt-oss | text | text | 131,072 | 32,768 | $0.09 / 1M | $0.36 / 1M | Yes | Yes | Aug 5, 2025 | |
Vertex google-vertex | GPT OSS 20B openai/gpt-oss-20b-maas | gpt-oss | text | text | 131,072 | 32,768 | $0.07 / 1M | $0.25 / 1M | Yes | Yes | Aug 5, 2025 | |
Weights & Biases wandb | gpt-oss-120b openai/gpt-oss-120b | gpt-oss | text | text | 131,072 | 131,072 | $0.15 / 1M | $0.6 / 1M | No | Yes | Mar 12, 2026 | |
Weights & Biases wandb | gpt-oss-20b openai/gpt-oss-20b | gpt-oss | text | text | 131,072 | 131,072 | $0.05 / 1M | $0.2 / 1M | No | Yes | Mar 12, 2026 | |
xAI xai | Grok 2 grok-2 | grok | text | text | 131,072 | 8,192 | $2 / 1M | $10 / 1M | No | Yes | Aug 20, 2024 | |
xAI xai | Grok 2 (1212) grok-2-1212 | grok | text | text | 131,072 | 8,192 | $2 / 1M | $10 / 1M | No | Yes | Dec 12, 2024 | |
xAI xai | Grok 2 Latest grok-2-latest | grok | text | text | 131,072 | 8,192 | $2 / 1M | $10 / 1M | No | Yes | Dec 12, 2024 | |
xAI xai | Grok 3 grok-3 | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Fast grok-3-fast | grok | text | text | 131,072 | 8,192 | $5 / 1M | $25 / 1M | No | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Fast Latest grok-3-fast-latest | grok | text | text | 131,072 | 8,192 | $5 / 1M | $25 / 1M | No | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Latest grok-3-latest | grok | text | text | 131,072 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Mini grok-3-mini | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Mini Fast grok-3-mini-fast | grok | text | text | 131,072 | 8,192 | $0.6 / 1M | $4 / 1M | Yes | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Mini Fast Latest grok-3-mini-fast-latest | grok | text | text | 131,072 | 8,192 | $0.6 / 1M | $4 / 1M | Yes | Yes | Feb 17, 2025 | |
xAI xai | Grok 3 Mini Latest grok-3-mini-latest | grok | text | text | 131,072 | 8,192 | $0.3 / 1M | $0.5 / 1M | Yes | Yes | Feb 17, 2025 | |
xAI xai | Grok Beta grok-beta | grok-beta | text | text | 131,072 | 4,096 | $5 / 1M | $15 / 1M | No | Yes | Nov 1, 2024 | |
Z.AI Coding Plan zai-coding-plan | GLM-4.5-Air glm-4.5-air | glm-air | text | text | 131,072 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 28, 2025 | |
Z.AI zai | GLM-4.5 glm-4.5 | glm | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
Z.AI zai | GLM-4.5-Air glm-4.5-air | glm-air | text | text | 131,072 | 98,304 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jul 28, 2025 | |
Z.AI zai | GLM-4.5-Flash glm-4.5-flash | glm-flash | text | text | 131,072 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 28, 2025 | |
Zhipu AI Coding Plan zhipuai-coding-plan | GLM-4.5-Air glm-4.5-air | glm-air | text | text | 131,072 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 28, 2025 | |
Zhipu AI zhipuai | GLM-4.5 glm-4.5 | glm | text | text | 131,072 | 98,304 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
Zhipu AI zhipuai | GLM-4.5-Air glm-4.5-air | glm-air | text | text | 131,072 | 98,304 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jul 28, 2025 | |
Zhipu AI zhipuai | GLM-4.5-Flash glm-4.5-flash | glm-flash | text | text | 131,072 | 98,304 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 28, 2025 | |
Amazon Bedrock amazon-bedrock | Llama 3.2 1B Instruct meta.llama3-2-1b-instruct-v1:0 | llama | text | text | 131,000 | 4,096 | $0.1 / 1M | $0.1 / 1M | No | Yes | Sep 25, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.2 3B Instruct meta.llama3-2-3b-instruct-v1:0 | llama | text | text | 131,000 | 4,096 | $0.15 / 1M | $0.15 / 1M | No | Yes | Sep 25, 2024 | |
Cerebras cerebras | Qwen 3 235B Instruct qwen-3-235b-a22b-instruct-2507 | qwen | text | text | 131,000 | 32,000 | $0.6 / 1M | $1.2 / 1M | No | Yes | Jul 22, 2025 | |
CloudFerro Sherlock cloudferro-sherlock | OpenAI GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 131,000 | 131,000 | $2.92 / 1M | $2.92 / 1M | Yes | Yes | Aug 28, 2025 | |
Cortecs cortecs | Kimi K2 Instruct kimi-k2-instruct | kimi | text | text | 131,000 | 131,000 | $0.551 / 1M | $2.646 / 1M | No | Yes | Sep 5, 2025 | |
DigitalOcean digitalocean | Qwen3-32B alibaba-qwen3-32b | qwen | text | text | 131,000 | 40,960 | $0.25 / 1M | $0.55 / 1M | Yes | Yes | Apr 16, 2026 | |
Kilo Gateway kilo | IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro | Unknown | text | text | 131,000 | 32,768 | $0.017 / 1M | $0.11 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | Unknown | text | text | 131,000 | 26,215 | $0.55 / 1M | $2.2 / 1M | No | Yes | Mar 15, 2026 | |
NanoGPT nano-gpt | GLM 4.7 TEE TEE/glm-4.7 | glm | text | text | 131,000 | 65,535 | $0.85 / 1M | $3.3 / 1M | No | No | Jan 29, 2026 | |
Qiniu qiniu-ai | GLM 4.5 Air glm-4.5-air | Unknown | text | text | 131,000 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
SiliconFlow (China) siliconflow-cn | baidu/ERNIE-4.5-300B-A47B baidu/ERNIE-4.5-300B-A47B | ernie | text | text | 131,000 | 131,000 | $0.28 / 1M | $1.1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | qwen | text | text | 131,000 | 131,000 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-R1-Distill-Qwen-32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | qwen | text | text | 131,000 | 131,000 | $0.18 / 1M | $0.18 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | inclusionAI/Ling-flash-2.0 inclusionAI/Ling-flash-2.0 | ling | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | inclusionAI/Ling-mini-2.0 inclusionAI/Ling-mini-2.0 | ling | text | text | 131,000 | 131,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | inclusionAI/Ring-flash-2.0 inclusionAI/Ring-flash-2.0 | ring | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-72B-Instruct-128K Qwen/Qwen2.5-72B-Instruct-128K | qwen | text | text | 131,000 | 4,000 | $0.59 / 1M | $0.59 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-14B Qwen/Qwen3-14B | qwen | text | text | 131,000 | 131,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-32B Qwen/Qwen3-32B | qwen | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-8B Qwen/Qwen3-8B | qwen | text | text | 131,000 | 131,000 | $0.06 / 1M | $0.06 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/QwQ-32B Qwen/QwQ-32B | qwen | text | text | 131,000 | 131,000 | $0.15 / 1M | $0.58 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | tencent/Hunyuan-A13B-Instruct tencent/Hunyuan-A13B-Instruct | hunyuan | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | THUDM/GLM-Z1-32B-0414 THUDM/GLM-Z1-32B-0414 | glm-z | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | THUDM/GLM-Z1-9B-0414 THUDM/GLM-Z1-9B-0414 | glm-z | text | text | 131,000 | 131,000 | $0.086 / 1M | $0.086 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | zai-org/GLM-4.5-Air zai-org/GLM-4.5-Air | glm-air | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.86 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | baidu/ERNIE-4.5-300B-A47B baidu/ERNIE-4.5-300B-A47B | ernie | text | text | 131,000 | 131,000 | $0.28 / 1M | $1.1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | qwen | text | text | 131,000 | 131,000 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/DeepSeek-R1-Distill-Qwen-32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | qwen | text | text | 131,000 | 131,000 | $0.18 / 1M | $0.18 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | inclusionAI/Ling-flash-2.0 inclusionAI/Ling-flash-2.0 | ling | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | inclusionAI/Ling-mini-2.0 inclusionAI/Ling-mini-2.0 | ling | text | text | 131,000 | 131,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | inclusionAI/Ring-flash-2.0 inclusionAI/Ring-flash-2.0 | ring | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | moonshotai/Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct | kimi | text | text | 131,000 | 131,000 | $0.58 / 1M | $2.29 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | nex-agi/DeepSeek-V3.1-Nex-N1 nex-agi/DeepSeek-V3.1-Nex-N1 | deepseek | text | text | 131,000 | 131,000 | $0.5 / 1M | $2 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | openai/gpt-oss-120b openai/gpt-oss-120b | gpt-oss | text | text | 131,000 | 8,000 | $0.05 / 1M | $0.45 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | openai/gpt-oss-20b openai/gpt-oss-20b | gpt-oss | text | text | 131,000 | 8,000 | $0.04 / 1M | $0.18 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-72B-Instruct-128K Qwen/Qwen2.5-72B-Instruct-128K | qwen | text | text | 131,000 | 4,000 | $0.59 / 1M | $0.59 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-14B Qwen/Qwen3-14B | qwen | text | text | 131,000 | 131,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-235B-A22B Qwen/Qwen3-235B-A22B | qwen | text | text | 131,000 | 131,000 | $0.35 / 1M | $1.42 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-32B Qwen/Qwen3-32B | qwen | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-8B Qwen/Qwen3-8B | qwen | text | text | 131,000 | 131,000 | $0.06 / 1M | $0.06 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/QwQ-32B Qwen/QwQ-32B | qwen | text | text | 131,000 | 131,000 | $0.15 / 1M | $0.58 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | tencent/Hunyuan-A13B-Instruct tencent/Hunyuan-A13B-Instruct | hunyuan | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | THUDM/GLM-Z1-32B-0414 THUDM/GLM-Z1-32B-0414 | glm-z | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.57 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | THUDM/GLM-Z1-9B-0414 THUDM/GLM-Z1-9B-0414 | glm-z | text | text | 131,000 | 131,000 | $0.086 / 1M | $0.086 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-4.5 zai-org/GLM-4.5 | glm | text | text | 131,000 | 131,000 | $0.4 / 1M | $2 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-4.5-Air zai-org/GLM-4.5-Air | glm-air | text | text | 131,000 | 131,000 | $0.14 / 1M | $0.86 / 1M | No | Yes | Nov 25, 2025 | |
STACKIT stackit | GPT-OSS 120B openai/gpt-oss-120b | gpt | text | text | 131,000 | 8,192 | $0.49 / 1M | $0.71 / 1M | Yes | Yes | Aug 5, 2025 | |
Vercel AI Gateway vercel | Trinity Large Preview arcee-ai/trinity-large-preview | trinity | text | text | 131,000 | 131,000 | $0.25 / 1M | $1 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | DeepSeek TNG R1T2 Chimera deepseek-tng-r1t2-chimera | deepseek-thinking | text | text | 130,000 | 163,840 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jul 2, 2025 | |
Vultr vultr | GPT OSS 120B gpt-oss-120b | gpt-oss | text | text | 129,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | Yes | Aug 5, 2025 | |
302.AI 302ai | Deepseek-Chat deepseek-chat | deepseek | text | text | 128,000 | 8,192 | $0.29 / 1M | $0.43 / 1M | No | Yes | Nov 29, 2024 | |
302.AI 302ai | Deepseek-Reasoner deepseek-reasoner | deepseek-thinking | text | text | 128,000 | 128,000 | $0.29 / 1M | $0.43 / 1M | Yes | Yes | Jan 20, 2025 | |
302.AI 302ai | deepseek-v3.2 deepseek-v3.2 | Unknown | text | text | 128,000 | 8,192 | $0.29 / 1M | $0.43 / 1M | No | Yes | Dec 1, 2025 | |
302.AI 302ai | DeepSeek-V3.2-Thinking deepseek-v3.2-thinking | Unknown | text | text | 128,000 | 128,000 | $0.29 / 1M | $0.43 / 1M | Yes | Yes | Dec 1, 2025 | |
302.AI 302ai | GLM-4.5 glm-4.5 | Unknown | text | text | 128,000 | 98,304 | $0.286 / 1M | $1.142 / 1M | No | Yes | Jul 29, 2025 | |
302.AI 302ai | glm-4.5-air glm-4.5-air | Unknown | text | text | 128,000 | 98,304 | $0.1143 / 1M | $0.286 / 1M | No | Yes | Jul 29, 2025 | |
302.AI 302ai | glm-4.5-airx glm-4.5-airx | Unknown | text | text | 128,000 | 16,384 | $0.572 / 1M | $1.714 / 1M | No | Yes | Jul 29, 2025 | |
302.AI 302ai | glm-4.5-x glm-4.5-x | Unknown | text | text | 128,000 | 16,384 | $1.143 / 1M | $2.29 / 1M | No | Yes | Jul 29, 2025 | |
302.AI 302ai | Qwen3-235B-A22B qwen3-235b-a22b | qwen | text | text | 128,000 | 16,384 | $0.29 / 1M | $2.86 / 1M | No | Yes | Apr 29, 2025 | |
302.AI 302ai | qwen3-235b-a22b-instruct-2507 qwen3-235b-a22b-instruct-2507 | Unknown | text | text | 128,000 | 65,536 | $0.29 / 1M | $1.143 / 1M | No | Yes | Jul 30, 2025 | |
302.AI 302ai | Qwen3-30B-A3B qwen3-30b-a3b | qwen | text | text | 128,000 | 8,192 | $0.11 / 1M | $1.08 / 1M | No | Yes | Apr 29, 2025 | |
Abacus abacus | DeepSeek R1 deepseek-ai/DeepSeek-R1 | deepseek-thinking | text | text | 128,000 | 8,192 | $3 / 1M | $7 / 1M | Yes | Yes | Jan 20, 2025 | |
Abacus abacus | DeepSeek V3.1 deepseek/deepseek-v3.1 | deepseek | text | text | 128,000 | 8,192 | $0.55 / 1M | $1.66 / 1M | Yes | Yes | Jan 20, 2025 | |
Abacus abacus | DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus | deepseek | text | text | 128,000 | 8,192 | $0.27 / 1M | $1 / 1M | Yes | Yes | Jun 1, 2025 | |
Abacus abacus | DeepSeek V3.2 deepseek-ai/DeepSeek-V3.2 | deepseek | text | text | 128,000 | 8,192 | $0.27 / 1M | $0.4 / 1M | Yes | Yes | Jun 15, 2025 | |
Abacus abacus | GLM-4.5 zai-org/glm-4.5 | glm | text | text | 128,000 | 8,192 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
Abacus abacus | GLM-4.6 zai-org/glm-4.6 | glm | text | text | 128,000 | 8,192 | $0.6 / 1M | $2.2 / 1M | No | Yes | Mar 1, 2025 | |
Abacus abacus | GLM-4.7 zai-org/glm-4.7 | glm | text | text | 128,000 | 8,192 | $0.6 / 1M | $2.2 / 1M | No | Yes | Jun 1, 2025 | |
Abacus abacus | Llama 3.1 405B Instruct Turbo meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | llama | text | text | 128,000 | 4,096 | $3.5 / 1M | $3.5 / 1M | No | Yes | Jul 23, 2024 | |
Abacus abacus | Llama 3.1 8B Instruct meta-llama/Meta-Llama-3.1-8B-Instruct | llama | text | text | 128,000 | 4,096 | $0.02 / 1M | $0.05 / 1M | No | Yes | Jul 23, 2024 | |
Abacus abacus | Llama 3.3 70B Versatile llama-3.3-70b-versatile | llama | text | text | 128,000 | 32,768 | $0.59 / 1M | $0.79 / 1M | No | Yes | Dec 6, 2024 | |
Abacus abacus | Qwen 2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct | qwen | text | text | 128,000 | 8,192 | $0.11 / 1M | $0.38 / 1M | No | Yes | Sep 19, 2024 | |
Abacus abacus | Qwen 2.5 Coder 32B qwen-2.5-coder-32b | qwen | text | text | 128,000 | 8,192 | $0.79 / 1M | $0.79 / 1M | No | Yes | Nov 11, 2024 | |
Abacus abacus | Qwen3 32B Qwen/Qwen3-32B | qwen | text | text | 128,000 | 8,192 | $0.09 / 1M | $0.29 / 1M | Yes | Yes | Apr 29, 2025 | |
Amazon Bedrock amazon-bedrock | DeepSeek-R1 deepseek.r1-v1:0 | deepseek-thinking | text | text | 128,000 | 32,768 | $1.35 / 1M | $5.4 / 1M | Yes | Yes | May 29, 2025 | |
Amazon Bedrock amazon-bedrock | GPT OSS Safeguard 120B openai.gpt-oss-safeguard-120b | gpt-oss | text | text | 128,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | GPT OSS Safeguard 20B openai.gpt-oss-safeguard-20b | gpt-oss | text | text | 128,000 | 4,096 | $0.07 / 1M | $0.2 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | gpt-oss-120b openai.gpt-oss-120b-1:0 | gpt-oss | text | text | 128,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | gpt-oss-20b openai.gpt-oss-20b-1:0 | gpt-oss | text | text | 128,000 | 4,096 | $0.07 / 1M | $0.3 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.1 405B Instruct meta.llama3-1-405b-instruct-v1:0 | llama | text | text | 128,000 | 4,096 | $2.4 / 1M | $2.4 / 1M | No | Yes | Jul 23, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.1 70B Instruct meta.llama3-1-70b-instruct-v1:0 | llama | text | text | 128,000 | 4,096 | $0.72 / 1M | $0.72 / 1M | No | Yes | Jul 23, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.1 8B Instruct meta.llama3-1-8b-instruct-v1:0 | llama | text | text | 128,000 | 4,096 | $0.22 / 1M | $0.22 / 1M | No | Yes | Jul 23, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.3 70B Instruct meta.llama3-3-70b-instruct-v1:0 | llama | text | text | 128,000 | 4,096 | $0.72 / 1M | $0.72 / 1M | No | Yes | Dec 6, 2024 | |
Amazon Bedrock amazon-bedrock | Ministral 14B 3.0 mistral.ministral-3-14b-instruct | ministral | text | text | 128,000 | 4,096 | $0.2 / 1M | $0.2 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | Ministral 3 8B mistral.ministral-3-8b-instruct | ministral | text | text | 128,000 | 4,096 | $0.15 / 1M | $0.15 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | Nova Micro amazon.nova-micro-v1:0 | nova-micro | text | text | 128,000 | 8,192 | $0.035 / 1M | $0.14 / 1M | No | Yes | Dec 3, 2024 | |
Amazon Bedrock amazon-bedrock | NVIDIA Nemotron Nano 3 30B nvidia.nemotron-nano-3-30b | nemotron | text | text | 128,000 | 4,096 | $0.06 / 1M | $0.24 / 1M | Yes | Yes | Dec 23, 2025 | |
Amazon Bedrock amazon-bedrock | NVIDIA Nemotron Nano 9B v2 nvidia.nemotron-nano-9b-v2 | nemotron | text | text | 128,000 | 4,096 | $0.06 / 1M | $0.23 / 1M | No | Yes | Dec 1, 2024 | |
Azure Cognitive Services azure-cognitive-services | Command R cohere-command-r-08-2024 | command-r | text | text | 128,000 | 4,000 | $0.15 / 1M | $0.6 / 1M | No | Yes | Aug 30, 2024 | |
Azure Cognitive Services azure-cognitive-services | Command R+ cohere-command-r-plus-08-2024 | command-r | text | text | 128,000 | 4,000 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 30, 2024 | |
Azure Cognitive Services azure-cognitive-services | DeepSeek-V3.2 deepseek-v3.2 | deepseek | text | text | 128,000 | 128,000 | $0.58 / 1M | $1.68 / 1M | Yes | Yes | Dec 1, 2025 | |
Azure Cognitive Services azure-cognitive-services | DeepSeek-V3.2-Speciale deepseek-v3.2-speciale | deepseek | text | text | 128,000 | 128,000 | $0.58 / 1M | $1.68 / 1M | Yes | No | Dec 1, 2025 | |
Azure Cognitive Services azure-cognitive-services | Llama-3.3-70B-Instruct llama-3.3-70b-instruct | llama | text | text | 128,000 | 32,768 | $0.71 / 1M | $0.71 / 1M | No | Yes | Dec 6, 2024 | |
Azure Cognitive Services azure-cognitive-services | MAI-DS-R1 mai-ds-r1 | mai | text | text | 128,000 | 8,192 | $1.35 / 1M | $5.4 / 1M | Yes | No | Jan 20, 2025 | |
Azure Cognitive Services azure-cognitive-services | Meta-Llama-3.1-405B-Instruct meta-llama-3.1-405b-instruct | llama | text | text | 128,000 | 32,768 | $5.33 / 1M | $16 / 1M | No | Yes | Jul 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Meta-Llama-3.1-70B-Instruct meta-llama-3.1-70b-instruct | llama | text | text | 128,000 | 32,768 | $2.68 / 1M | $3.54 / 1M | No | Yes | Jul 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Meta-Llama-3.1-8B-Instruct meta-llama-3.1-8b-instruct | llama | text | text | 128,000 | 32,768 | $0.3 / 1M | $0.61 / 1M | No | Yes | Jul 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Ministral 3B ministral-3b | ministral | text | text | 128,000 | 8,192 | $0.04 / 1M | $0.04 / 1M | No | Yes | Oct 22, 2024 | |
Azure Cognitive Services azure-cognitive-services | Mistral Large 24.11 mistral-large-2411 | mistral-large | text | text | 128,000 | 32,768 | $2 / 1M | $6 / 1M | No | Yes | Nov 1, 2024 | |
Azure Cognitive Services azure-cognitive-services | Mistral Nemo mistral-nemo | mistral-nemo | text | text | 128,000 | 128,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Jul 18, 2024 | |
Azure Cognitive Services azure-cognitive-services | o1-mini o1-mini | o-mini | text | text | 128,000 | 65,536 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Sep 12, 2024 | |
Azure Cognitive Services azure-cognitive-services | o1-preview o1-preview | o | text | text | 128,000 | 32,768 | $16.5 / 1M | $66 / 1M | Yes | Yes | Sep 12, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3-medium-instruct (128k) phi-3-medium-128k-instruct | phi | text | text | 128,000 | 4,096 | $0.17 / 1M | $0.68 / 1M | No | No | Apr 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3-mini-instruct (128k) phi-3-mini-128k-instruct | phi | text | text | 128,000 | 4,096 | $0.13 / 1M | $0.52 / 1M | No | No | Apr 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3-small-instruct (128k) phi-3-small-128k-instruct | phi | text | text | 128,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | No | Apr 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3.5-mini-instruct phi-3.5-mini-instruct | phi | text | text | 128,000 | 4,096 | $0.13 / 1M | $0.52 / 1M | No | No | Aug 20, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3.5-MoE-instruct phi-3.5-moe-instruct | phi | text | text | 128,000 | 4,096 | $0.16 / 1M | $0.64 / 1M | No | No | Aug 20, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-4 phi-4 | phi | text | text | 128,000 | 4,096 | $0.125 / 1M | $0.5 / 1M | No | No | Dec 11, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-4-mini phi-4-mini | phi | text | text | 128,000 | 4,096 | $0.075 / 1M | $0.3 / 1M | No | Yes | Dec 11, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-4-mini-reasoning phi-4-mini-reasoning | phi | text | text | 128,000 | 4,096 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Dec 11, 2024 | |
Azure azure | Command R cohere-command-r-08-2024 | command-r | text | text | 128,000 | 4,000 | $0.15 / 1M | $0.6 / 1M | No | Yes | Aug 30, 2024 | |
Azure azure | Command R+ cohere-command-r-plus-08-2024 | command-r | text | text | 128,000 | 4,000 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 30, 2024 | |
Azure azure | DeepSeek-V3.2 deepseek-v3.2 | deepseek | text | text | 128,000 | 128,000 | $0.58 / 1M | $1.68 / 1M | Yes | Yes | Dec 1, 2025 | |
Azure azure | DeepSeek-V3.2-Speciale deepseek-v3.2-speciale | deepseek | text | text | 128,000 | 128,000 | $0.58 / 1M | $1.68 / 1M | Yes | No | Dec 1, 2025 | |
Azure azure | Llama-3.3-70B-Instruct llama-3.3-70b-instruct | llama | text | text | 128,000 | 32,768 | $0.71 / 1M | $0.71 / 1M | No | Yes | Dec 6, 2024 | |
Azure azure | MAI-DS-R1 mai-ds-r1 | mai | text | text | 128,000 | 8,192 | $1.35 / 1M | $5.4 / 1M | Yes | No | Jan 20, 2025 | |
Azure azure | Meta-Llama-3.1-405B-Instruct meta-llama-3.1-405b-instruct | llama | text | text | 128,000 | 32,768 | $5.33 / 1M | $16 / 1M | No | Yes | Jul 23, 2024 | |
Azure azure | Meta-Llama-3.1-70B-Instruct meta-llama-3.1-70b-instruct | llama | text | text | 128,000 | 32,768 | $2.68 / 1M | $3.54 / 1M | No | Yes | Jul 23, 2024 | |
Azure azure | Meta-Llama-3.1-8B-Instruct meta-llama-3.1-8b-instruct | llama | text | text | 128,000 | 32,768 | $0.3 / 1M | $0.61 / 1M | No | Yes | Jul 23, 2024 | |
Azure azure | Ministral 3B ministral-3b | ministral | text | text | 128,000 | 8,192 | $0.04 / 1M | $0.04 / 1M | No | Yes | Oct 22, 2024 | |
Azure azure | Mistral Large 24.11 mistral-large-2411 | mistral-large | text | text | 128,000 | 32,768 | $2 / 1M | $6 / 1M | No | Yes | Nov 1, 2024 | |
Azure azure | Mistral Nemo mistral-nemo | mistral-nemo | text | text | 128,000 | 128,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Jul 18, 2024 | |
Azure azure | o1-mini o1-mini | o-mini | text | text | 128,000 | 65,536 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Sep 12, 2024 | |
Azure azure | o1-preview o1-preview | o | text | text | 128,000 | 32,768 | $16.5 / 1M | $66 / 1M | Yes | Yes | Sep 12, 2024 | |
Azure azure | Phi-3-medium-instruct (128k) phi-3-medium-128k-instruct | phi | text | text | 128,000 | 4,096 | $0.17 / 1M | $0.68 / 1M | No | No | Apr 23, 2024 | |
Azure azure | Phi-3-mini-instruct (128k) phi-3-mini-128k-instruct | phi | text | text | 128,000 | 4,096 | $0.13 / 1M | $0.52 / 1M | No | No | Apr 23, 2024 | |
Azure azure | Phi-3-small-instruct (128k) phi-3-small-128k-instruct | phi | text | text | 128,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | No | Apr 23, 2024 | |
Azure azure | Phi-3.5-mini-instruct phi-3.5-mini-instruct | phi | text | text | 128,000 | 4,096 | $0.13 / 1M | $0.52 / 1M | No | No | Aug 20, 2024 | |
Azure azure | Phi-3.5-MoE-instruct phi-3.5-moe-instruct | phi | text | text | 128,000 | 4,096 | $0.16 / 1M | $0.64 / 1M | No | No | Aug 20, 2024 | |
Azure azure | Phi-4 phi-4 | phi | text | text | 128,000 | 4,096 | $0.125 / 1M | $0.5 / 1M | No | No | Dec 11, 2024 | |
Azure azure | Phi-4-mini phi-4-mini | phi | text | text | 128,000 | 4,096 | $0.075 / 1M | $0.3 / 1M | No | Yes | Dec 11, 2024 | |
Azure azure | Phi-4-mini-reasoning phi-4-mini-reasoning | phi | text | text | 128,000 | 4,096 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Dec 11, 2024 | |
Bailing bailing | Ling-1T Ling-1T | ling | text | text | 128,000 | 32,000 | $0.57 / 1M | $2.29 / 1M | No | Yes | Oct 1, 2025 | |
Bailing bailing | Ring-1T Ring-1T | ring | text | text | 128,000 | 32,000 | $0.57 / 1M | $2.29 / 1M | Yes | No | Oct 1, 2025 | |
Baseten baseten | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 128,000 | 128,000 | $0.1 / 1M | $0.5 / 1M | Yes | Yes | Aug 5, 2025 | |
Berget.AI berget | GLM 4.7 zai-org/GLM-4.7 | glm | text | text | 128,000 | 8,192 | $0.77 / 1M | $2.75 / 1M | Yes | Yes | Jan 19, 2026 | |
Berget.AI berget | GPT-OSS-120B openai/gpt-oss-120b | gpt-oss | text | text | 128,000 | 8,192 | $0.44 / 1M | $0.99 / 1M | Yes | Yes | Aug 5, 2025 | |
Berget.AI berget | Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 128,000 | 8,192 | $0.99 / 1M | $0.99 / 1M | Yes | Yes | Apr 27, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | BART Large CNN workers-ai/@cf/facebook/bart-large-cnn | bart | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | No | Apr 9, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | BGE Base EN v1.5 workers-ai/@cf/baai/bge-base-en-v1.5 | bge | text | text | 128,000 | 16,384 | $0.067 / 1M | $0 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | BGE Large EN v1.5 workers-ai/@cf/baai/bge-large-en-v1.5 | bge | text | text | 128,000 | 16,384 | $0.2 / 1M | $0 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | BGE M3 workers-ai/@cf/baai/bge-m3 | bge | text | text | 128,000 | 16,384 | $0.012 / 1M | $0 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | BGE Reranker Base workers-ai/@cf/baai/bge-reranker-base | bge | text | text | 128,000 | 16,384 | $0.0031 / 1M | $0 / 1M | No | No | Apr 9, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | BGE Small EN v1.5 workers-ai/@cf/baai/bge-small-en-v1.5 | bge | text | text | 128,000 | 16,384 | $0.02 / 1M | $0 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Deepgram Aura 2 (EN) workers-ai/@cf/deepgram/aura-2-en | aura | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Deepgram Aura 2 (ES) workers-ai/@cf/deepgram/aura-2-es | aura | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Deepgram Nova 3 workers-ai/@cf/deepgram/nova-3 | nova | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | DeepSeek R1 Distill Qwen 32B workers-ai/@cf/deepseek-ai/deepseek-r1-distill-qwen-32b | deepseek-thinking | text | text | 128,000 | 16,384 | $0.5 / 1M | $4.88 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | DistilBERT SST-2 INT8 workers-ai/@cf/huggingface/distilbert-sst-2-int8 | distilbert | text | text | 128,000 | 16,384 | $0.026 / 1M | $0 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Gemma 3 12B IT workers-ai/@cf/google/gemma-3-12b-it | gemma | text | text | 128,000 | 16,384 | $0.35 / 1M | $0.56 / 1M | No | No | Apr 11, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Gemma SEA-LION v4 27B IT workers-ai/@cf/aisingapore/gemma-sea-lion-v4-27b-it | gemma | text | text | 128,000 | 16,384 | $0.35 / 1M | $0.56 / 1M | No | No | Sep 25, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT OSS 120B workers-ai/@cf/openai/gpt-oss-120b | Unknown | text | text | 128,000 | 16,384 | $0.35 / 1M | $0.75 / 1M | No | No | Aug 5, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT OSS 20B workers-ai/@cf/openai/gpt-oss-20b | Unknown | text | text | 128,000 | 16,384 | $0.2 / 1M | $0.3 / 1M | No | No | Aug 5, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | IBM Granite 4.0 H Micro workers-ai/@cf/ibm-granite/granite-4.0-h-micro | granite | text | text | 128,000 | 16,384 | $0.017 / 1M | $0.11 / 1M | No | No | Oct 15, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | IndicTrans2 EN-Indic 1B workers-ai/@cf/ai4bharat/indictrans2-en-indic-1B | indictrans | text | text | 128,000 | 16,384 | $0.34 / 1M | $0.34 / 1M | No | No | Sep 25, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 2 7B Chat FP16 workers-ai/@cf/meta/llama-2-7b-chat-fp16 | llama | text | text | 128,000 | 16,384 | $0.56 / 1M | $6.67 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3 8B Instruct workers-ai/@cf/meta/llama-3-8b-instruct | llama | text | text | 128,000 | 16,384 | $0.28 / 1M | $0.83 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3 8B Instruct AWQ workers-ai/@cf/meta/llama-3-8b-instruct-awq | llama | text | text | 128,000 | 16,384 | $0.12 / 1M | $0.27 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.1 8B Instruct workers-ai/@cf/meta/llama-3.1-8b-instruct | llama | text | text | 128,000 | 16,384 | $0.28 / 1M | $0.83 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.1 8B Instruct AWQ workers-ai/@cf/meta/llama-3.1-8b-instruct-awq | llama | text | text | 128,000 | 16,384 | $0.12 / 1M | $0.27 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.1 8B Instruct FP8 workers-ai/@cf/meta/llama-3.1-8b-instruct-fp8 | llama | text | text | 128,000 | 16,384 | $0.15 / 1M | $0.29 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.2 11B Vision Instruct workers-ai/@cf/meta/llama-3.2-11b-vision-instruct | llama | text | text | 128,000 | 16,384 | $0.049 / 1M | $0.68 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.2 1B Instruct workers-ai/@cf/meta/llama-3.2-1b-instruct | llama | text | text | 128,000 | 16,384 | $0.027 / 1M | $0.2 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.2 3B Instruct workers-ai/@cf/meta/llama-3.2-3b-instruct | llama | text | text | 128,000 | 16,384 | $0.051 / 1M | $0.34 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 3.3 70B Instruct FP8 Fast workers-ai/@cf/meta/llama-3.3-70b-instruct-fp8-fast | llama | text | text | 128,000 | 16,384 | $0.29 / 1M | $2.25 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama 4 Scout 17B 16E Instruct workers-ai/@cf/meta/llama-4-scout-17b-16e-instruct | llama | text | text | 128,000 | 16,384 | $0.27 / 1M | $0.85 / 1M | No | No | Apr 16, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Llama Guard 3 8B workers-ai/@cf/meta/llama-guard-3-8b | llama | text | text | 128,000 | 16,384 | $0.48 / 1M | $0.03 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | M2M100 1.2B workers-ai/@cf/meta/m2m100-1.2b | m2m | text | text | 128,000 | 16,384 | $0.34 / 1M | $0.34 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Mistral 7B Instruct v0.1 workers-ai/@cf/mistral/mistral-7b-instruct-v0.1 | mistral | text | text | 128,000 | 16,384 | $0.11 / 1M | $0.19 / 1M | No | No | Apr 3, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Mistral Small 3.1 24B Instruct workers-ai/@cf/mistralai/mistral-small-3.1-24b-instruct | mistral-small | text | text | 128,000 | 16,384 | $0.35 / 1M | $0.56 / 1M | No | No | Apr 11, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | MyShell MeloTTS workers-ai/@cf/myshell-ai/melotts | melotts | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Pipecat Smart Turn v2 workers-ai/@cf/pipecat-ai/smart-turn-v2 | smart-turn | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | PLaMo Embedding 1B workers-ai/@cf/pfnet/plamo-embedding-1b | plamo | text | text | 128,000 | 16,384 | $0.019 / 1M | $0 / 1M | No | No | Sep 25, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Qwen 2.5 Coder 32B Instruct workers-ai/@cf/qwen/qwen2.5-coder-32b-instruct | qwen | text | text | 128,000 | 16,384 | $0.66 / 1M | $1 / 1M | No | No | Apr 11, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Qwen3 30B A3B FP8 workers-ai/@cf/qwen/qwen3-30b-a3b-fp8 | qwen | text | text | 128,000 | 16,384 | $0.051 / 1M | $0.34 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Qwen3 Embedding 0.6B workers-ai/@cf/qwen/qwen3-embedding-0.6b | qwen | text | text | 128,000 | 16,384 | $0.012 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | QwQ 32B workers-ai/@cf/qwen/qwq-32b | qwen | text | text | 128,000 | 16,384 | $0.66 / 1M | $1 / 1M | No | No | Apr 11, 2025 | |
Cloudflare Workers AI cloudflare-workers-ai | GPT OSS 120B @cf/openai/gpt-oss-120b | Unknown | text | text | 128,000 | 16,384 | $0.35 / 1M | $0.75 / 1M | Yes | Yes | Aug 5, 2025 | |
Cloudflare Workers AI cloudflare-workers-ai | GPT OSS 20B @cf/openai/gpt-oss-20b | Unknown | text | text | 128,000 | 16,384 | $0.2 / 1M | $0.3 / 1M | Yes | Yes | Aug 5, 2025 | |
Cohere cohere | Aya Expanse 32B c4ai-aya-expanse-32b | Unknown | text | text | 128,000 | 4,000 | Unknown | Unknown | No | No | Oct 24, 2024 | |
Cohere cohere | Command R command-r-08-2024 | command-r | text | text | 128,000 | 4,000 | $0.15 / 1M | $0.6 / 1M | No | Yes | Aug 30, 2024 | |
Cohere cohere | Command R+ command-r-plus-08-2024 | command-r | text | text | 128,000 | 4,000 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 30, 2024 | |
Cohere cohere | Command R7B command-r7b-12-2024 | command-r | text | text | 128,000 | 4,000 | $0.0375 / 1M | $0.15 / 1M | No | Yes | Feb 27, 2024 | |
Cohere cohere | Command R7B Arabic command-r7b-arabic-02-2025 | command-r | text | text | 128,000 | 4,000 | $0.0375 / 1M | $0.15 / 1M | No | Yes | Feb 27, 2025 | |
Cortecs cortecs | DeepSeek V3 0324 deepseek-v3-0324 | deepseek | text | text | 128,000 | 128,000 | $0.551 / 1M | $1.654 / 1M | No | Yes | Mar 24, 2025 | |
Cortecs cortecs | GPT Oss 120b gpt-oss-120b | gpt-oss | text | text | 128,000 | 128,000 | $0 / 1M | $0 / 1M | No | Yes | Aug 5, 2025 | |
Cortecs cortecs | INTELLECT 3 intellect-3 | Unknown | text | text | 128,000 | 128,000 | $0.219 / 1M | $1.202 / 1M | Yes | Yes | Nov 26, 2025 | |
Cortecs cortecs | Llama 3.1 405B Instruct llama-3.1-405b-instruct | llama | text | text | 128,000 | 128,000 | $0 / 1M | $0 / 1M | No | Yes | Jul 23, 2024 | |
Cortecs cortecs | Qwen3 Next 80B A3B Thinking qwen3-next-80b-a3b-thinking | Unknown | text | text | 128,000 | 128,000 | $0.164 / 1M | $1.311 / 1M | Yes | Yes | Sep 11, 2025 | |
DigitalOcean digitalocean | Llama 3.3 Instruct 70B llama3.3-70b-instruct | llama | text | text | 128,000 | 128,000 | $0.65 / 1M | $0.65 / 1M | No | Yes | Dec 6, 2024 | |
Fireworks AI fireworks-ai | Kimi K2 Instruct accounts/fireworks/models/kimi-k2-instruct | kimi | text | text | 128,000 | 16,384 | $1 / 1M | $3 / 1M | No | Yes | Jul 11, 2025 | |
Firmware firmware | DeepSeek v3.2 deepseek-v3-2 | deepseek | text | text | 128,000 | 8,192 | $0.58 / 1M | $1.68 / 1M | No | Yes | Sep 29, 2025 | |
GitHub Copilot github-copilot | Grok Code Fast 1 grok-code-fast-1 | grok | text | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 27, 2025 | |
GitHub Models github-models | Cohere Command A cohere/cohere-command-a | command-a | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 1, 2024 | |
GitHub Models github-models | Cohere Command R cohere/cohere-command-r | command-r | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 1, 2024 | |
GitHub Models github-models | Cohere Command R 08-2024 cohere/cohere-command-r-08-2024 | command-r | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Aug 1, 2024 | |
GitHub Models github-models | Cohere Command R+ cohere/cohere-command-r-plus | command-r | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Aug 1, 2024 | |
GitHub Models github-models | Cohere Command R+ 08-2024 cohere/cohere-command-r-plus-08-2024 | command-r | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Aug 1, 2024 | |
GitHub Models github-models | DeepSeek-V3-0324 deepseek/deepseek-v3-0324 | deepseek | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 24, 2025 | |
GitHub Models github-models | Grok 3 xai/grok-3 | grok | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 9, 2024 | |
GitHub Models github-models | Grok 3 Mini xai/grok-3-mini | grok | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 9, 2024 | |
GitHub Models github-models | Llama-3.3-70B-Instruct meta/llama-3.3-70b-instruct | llama | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 6, 2024 | |
GitHub Models github-models | Meta-Llama-3.1-405B-Instruct meta/meta-llama-3.1-405b-instruct | llama | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 23, 2024 | |
GitHub Models github-models | Meta-Llama-3.1-70B-Instruct meta/meta-llama-3.1-70b-instruct | llama | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 23, 2024 | |
GitHub Models github-models | Meta-Llama-3.1-8B-Instruct meta/meta-llama-3.1-8b-instruct | llama | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 23, 2024 | |
GitHub Models github-models | Ministral 3B mistral-ai/ministral-3b | ministral | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Oct 22, 2024 | |
GitHub Models github-models | Mistral Large 24.11 mistral-ai/mistral-large-2411 | mistral-large | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 1, 2024 | |
GitHub Models github-models | Mistral Nemo mistral-ai/mistral-nemo | mistral-nemo | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Jul 18, 2024 | |
GitHub Models github-models | OpenAI o1-mini openai/o1-mini | o-mini | text | text | 128,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | No | Dec 17, 2024 | |
GitHub Models github-models | OpenAI o1-preview openai/o1-preview | o | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | No | Sep 12, 2024 | |
GitHub Models github-models | Phi-3-medium instruct (128k) microsoft/phi-3-medium-128k-instruct | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 23, 2024 | |
GitHub Models github-models | Phi-3-mini instruct (128k) microsoft/phi-3-mini-128k-instruct | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 23, 2024 | |
GitHub Models github-models | Phi-3-small instruct (128k) microsoft/phi-3-small-128k-instruct | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 23, 2024 | |
GitHub Models github-models | Phi-3.5-mini instruct (128k) microsoft/phi-3.5-mini-instruct | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 20, 2024 | |
GitHub Models github-models | Phi-3.5-MoE instruct (128k) microsoft/phi-3.5-moe-instruct | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 20, 2024 | |
GitHub Models github-models | Phi-4-mini-instruct microsoft/phi-4-mini-instruct | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2024 | |
GitHub Models github-models | Phi-4-mini-reasoning microsoft/phi-4-mini-reasoning | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2024 | |
GitHub Models github-models | Phi-4-Reasoning microsoft/phi-4-reasoning | phi | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2024 | |
Helicone helicone | Baidu Ernie 4.5 21B A3B Thinking ernie-4.5-21b-a3b-thinking | ernie | text | text | 128,000 | 8,000 | $0.07 / 1M | $0.28 / 1M | Yes | No | Mar 16, 2025 | |
Helicone helicone | DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 128,000 | 4,096 | $0.03 / 1M | $0.13 / 1M | Yes | Yes | Jan 20, 2025 | |
Helicone helicone | DeepSeek Reasoner deepseek-reasoner | deepseek-thinking | text | text | 128,000 | 64,000 | $0.56 / 1M | $1.68 / 1M | No | No | Jan 20, 2025 | |
Helicone helicone | DeepSeek V3 deepseek-v3 | deepseek | text | text | 128,000 | 8,192 | $0.56 / 1M | $1.68 / 1M | No | Yes | Dec 26, 2024 | |
Helicone helicone | DeepSeek V3.1 Terminus deepseek-v3.1-terminus | deepseek | text | text | 128,000 | 16,384 | $0.27 / 1M | $1 / 1M | Yes | Yes | Sep 22, 2025 | |
Helicone helicone | Meta Llama 3.1 8B Instruct Turbo llama-3.1-8b-instruct-turbo | llama | text | text | 128,000 | 128,000 | $0.02 / 1M | $0.03 / 1M | No | Yes | Jul 23, 2024 | |
Helicone helicone | Meta Llama 3.3 70B Instruct llama-3.3-70b-instruct | llama | text | text | 128,000 | 16,400 | $0.13 / 1M | $0.39 / 1M | No | Yes | Dec 6, 2024 | |
Helicone helicone | Mistral-Large mistral-large-2411 | mistral-large | text | text | 128,000 | 32,768 | $2 / 1M | $6 / 1M | No | Yes | Jul 24, 2024 | |
Helicone helicone | OpenAI: GPT-5 Pro gpt-5-pro | gpt-pro | text | text | 128,000 | 32,768 | $15 / 1M | $120 / 1M | No | No | Jan 1, 2025 | |
Helicone helicone | OpenAI: o1-mini o1-mini | o-mini | text | text | 128,000 | 65,536 | $1.1 / 1M | $4.4 / 1M | No | No | Jan 1, 2025 | |
iFlow iflowcn | DeepSeek-R1 deepseek-r1 | deepseek-thinking | text | text | 128,000 | 32,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 20, 2025 | |
iFlow iflowcn | DeepSeek-V3 deepseek-v3 | deepseek | text | text | 128,000 | 32,000 | $0 / 1M | $0 / 1M | No | Yes | Dec 26, 2024 | |
iFlow iflowcn | DeepSeek-V3.2-Exp deepseek-v3.2 | deepseek | text | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | No | Yes | Jan 1, 2025 | |
iFlow iflowcn | Kimi-K2 kimi-k2 | kimi | text | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | No | Yes | Dec 1, 2024 | |
iFlow iflowcn | Qwen3-235B-A22B qwen3-235b | qwen | text | text | 128,000 | 32,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 1, 2024 | |
iFlow iflowcn | Qwen3-32B qwen3-32b | qwen | text | text | 128,000 | 32,000 | $0 / 1M | $0 / 1M | No | Yes | Dec 1, 2024 | |
Inception inception | Mercury 2 mercury-2 | mercury | text | text | 128,000 | 50,000 | $0.25 / 1M | $0.75 / 1M | Yes | Yes | Feb 24, 2026 | |
Inception inception | Mercury Edit 2 mercury-edit-2 | Unknown | text | text | 128,000 | 8,192 | $0.25 / 1M | $0.75 / 1M | Yes | No | Mar 30, 2026 | |
IO.NET io-net | DeepSeek R1 deepseek-ai/DeepSeek-R1-0528 | deepseek-thinking | text | text | 128,000 | 4,096 | $2 / 1M | $8.75 / 1M | Yes | Yes | May 28, 2025 | |
IO.NET io-net | Devstral Small 2505 mistralai/Devstral-Small-2505 | devstral | text | text | 128,000 | 4,096 | $0.05 / 1M | $0.22 / 1M | No | Yes | May 1, 2025 | |
IO.NET io-net | Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 128,000 | 4,096 | $0.13 / 1M | $0.38 / 1M | No | Yes | Dec 6, 2024 | |
IO.NET io-net | Magistral Small 2506 mistralai/Magistral-Small-2506 | magistral-small | text | text | 128,000 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | Yes | Jun 1, 2025 | |
IO.NET io-net | Mistral Nemo Instruct 2407 mistralai/Mistral-Nemo-Instruct-2407 | mistral-nemo | text | text | 128,000 | 4,096 | $0.02 / 1M | $0.04 / 1M | No | Yes | Jul 1, 2024 | |
Kilo Gateway kilo | AllenAI: Olmo 2 32B Instruct allenai/olmo-2-0325-32b-instruct | Unknown | text | text | 128,000 | 32,768 | $0.05 / 1M | $0.2 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Amazon: Nova Micro 1.0 amazon/nova-micro-v1 | Unknown | text | text | 128,000 | 5,120 | $0.035 / 1M | $0.14 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Body Builder (beta) openrouter/bodybuilder | Unknown | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Cohere: Command R (08-2024) cohere/command-r-08-2024 | Unknown | text | text | 128,000 | 4,000 | $0.15 / 1M | $0.6 / 1M | No | Yes | Aug 30, 2024 | |
Kilo Gateway kilo | Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 | Unknown | text | text | 128,000 | 4,000 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 30, 2024 | |
Kilo Gateway kilo | Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024 | Unknown | text | text | 128,000 | 4,000 | $0.0375 / 1M | $0.15 / 1M | No | Yes | Feb 27, 2024 | |
Kilo Gateway kilo | Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b | Unknown | text | text | 128,000 | 32,768 | $1.25 / 1M | $1.25 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | Inception: Mercury 2 inception/mercury-2 | Unknown | text | text | 128,000 | 50,000 | $0.25 / 1M | $0.75 / 1M | Yes | Yes | Feb 24, 2026 | |
Kilo Gateway kilo | Mercury Edit 2 inception/mercury-edit-2 | Unknown | text | text | 128,000 | 8,192 | $0.25 / 1M | $0.75 / 1M | Yes | No | Mar 30, 2026 | |
Kilo Gateway kilo | Mistral Large mistralai/mistral-large | Unknown | text | text | 128,000 | 25,600 | $2 / 1M | $6 / 1M | No | Yes | Dec 2, 2025 | |
Kilo Gateway kilo | OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | Unknown | text | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview | Unknown | text | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview | Unknown | text | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview | Unknown | text | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | No | Jan 1, 2025 | |
Kilo Gateway kilo | Perplexity: Sonar Deep Research perplexity/sonar-deep-research | Unknown | text | text | 128,000 | 25,600 | $2 / 1M | $8 / 1M | Yes | No | Jan 27, 2025 | |
Kilo Gateway kilo | Upstage: Solar Pro 3 upstage/solar-pro-3 | Unknown | text | text | 128,000 | 32,768 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 4 32B z-ai/glm-4-32b | Unknown | text | text | 128,000 | 32,768 | $0.1 / 1M | $0.1 / 1M | No | Yes | Mar 15, 2026 | |
Llama llama | Cerebras-Llama-4-Maverick-17B-128E-Instruct cerebras-llama-4-maverick-17b-128e-instruct | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
Llama llama | Cerebras-Llama-4-Scout-17B-16E-Instruct cerebras-llama-4-scout-17b-16e-instruct | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
Llama llama | Groq-Llama-4-Maverick-17B-128E-Instruct groq-llama-4-maverick-17b-128e-instruct | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
Llama llama | Llama-3.3-70B-Instruct llama-3.3-70b-instruct | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Dec 6, 2024 | |
Llama llama | Llama-3.3-8B-Instruct llama-3.3-8b-instruct | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Dec 6, 2024 | |
LLM Gateway llmgateway | Devstral Small devstral-small-2507 | devstral | text | text | 128,000 | 128,000 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jul 10, 2025 | |
LLM Gateway llmgateway | GLM-4 32B (0414-128k) glm-4-32b-0414-128k | glm | text | text | 128,000 | 16,384 | $0.1 / 1M | $0.1 / 1M | No | Yes | Apr 14, 2025 | |
LLM Gateway llmgateway | GLM-4.5 AirX glm-4.5-airx | glm | text | text | 128,000 | 16,384 | $1.1 / 1M | $4.5 / 1M | No | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | GLM-4.5 X glm-4.5-x | glm | text | text | 128,000 | 16,384 | $2.2 / 1M | $8.9 / 1M | Yes | Yes | Jul 28, 2025 | |
LLM Gateway llmgateway | Llama 3.1 70B Instruct llama-3.1-70b-instruct | llama | text | text | 128,000 | 2,048 | $0.72 / 1M | $0.72 / 1M | No | No | Jul 23, 2024 | |
LLM Gateway llmgateway | Llama 3.1 8B Instruct llama-3.1-8b-instruct | llama | text | text | 128,000 | 2,048 | $0.22 / 1M | $0.22 / 1M | No | Yes | Jul 23, 2024 | |
LLM Gateway llmgateway | Llama 3.1 Nemotron Ultra 253B llama-3.1-nemotron-ultra-253b | llama | text | text | 128,000 | 8,192 | $0.6 / 1M | $1.8 / 1M | No | No | Apr 7, 2025 | |
LLM Gateway llmgateway | Llama 3.2 11B Instruct llama-3.2-11b-instruct | llama | text | text | 128,000 | 8,192 | $0.07 / 1M | $0.33 / 1M | No | No | Sep 25, 2024 | |
LLM Gateway llmgateway | Llama-3.3-70B-Instruct llama-3.3-70b-instruct | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Dec 6, 2024 | |
LLM Gateway llmgateway | Sonar sonar | sonar | text | text | 128,000 | 4,096 | $1 / 1M | $1 / 1M | No | No | Sep 1, 2025 | |
Mistral mistral | Devstral Medium devstral-medium-2507 | devstral | text | text | 128,000 | 128,000 | $0.4 / 1M | $2 / 1M | No | Yes | Jul 10, 2025 | |
Mistral mistral | Devstral Small devstral-small-2507 | devstral | text | text | 128,000 | 128,000 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jul 10, 2025 | |
Mistral mistral | Devstral Small 2505 devstral-small-2505 | devstral | text | text | 128,000 | 128,000 | $0.1 / 1M | $0.3 / 1M | No | Yes | May 7, 2025 | |
Mistral mistral | Magistral Medium (latest) magistral-medium-latest | magistral-medium | text | text | 128,000 | 16,384 | $2 / 1M | $5 / 1M | Yes | Yes | Mar 20, 2025 | |
Mistral mistral | Magistral Small magistral-small | magistral-small | text | text | 128,000 | 128,000 | $0.5 / 1M | $1.5 / 1M | Yes | Yes | Mar 17, 2025 | |
Mistral mistral | Ministral 3B (latest) ministral-3b-latest | ministral | text | text | 128,000 | 128,000 | $0.04 / 1M | $0.04 / 1M | No | Yes | Oct 4, 2024 | |
Mistral mistral | Ministral 8B (latest) ministral-8b-latest | ministral | text | text | 128,000 | 128,000 | $0.1 / 1M | $0.1 / 1M | No | Yes | Oct 4, 2024 | |
Mistral mistral | Mistral Nemo mistral-nemo | mistral-nemo | text | text | 128,000 | 128,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Jul 1, 2024 | |
NanoGPT nano-gpt | Amazon Nova Micro 1.0 amazon/nova-micro-v1 | nova-micro | text | text | 128,000 | 5,120 | $0.0357 / 1M | $0.1394 / 1M | No | No | Dec 3, 2024 | |
NanoGPT nano-gpt | Azure gpt-4-turbo azure-gpt-4-turbo | Unknown | text | text | 128,000 | 4,096 | $9.996 / 1M | $30.005 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | Baichuan 4 Turbo Baichuan4-Turbo | Unknown | text | text | 128,000 | 32,768 | $2.42 / 1M | $2.42 / 1M | No | No | Aug 19, 2025 | |
NanoGPT nano-gpt | Cogito v1 Preview Qwen 32B deepcogito/cogito-v1-preview-qwen-32B | qwen | text | text | 128,000 | 32,768 | $1.8 / 1M | $1.8 / 1M | No | No | May 10, 2025 | |
NanoGPT nano-gpt | Cogito v2.1 671B MoE deepcogito/cogito-v2.1-671b | cogito | text | text | 128,000 | 16,384 | $1.25 / 1M | $1.25 / 1M | Yes | No | Nov 19, 2025 | |
NanoGPT nano-gpt | Cohere: Command R cohere/command-r | command-r | text | text | 128,000 | 4,096 | $0.476 / 1M | $1.428 / 1M | No | No | Mar 11, 2024 | |
NanoGPT nano-gpt | Cohere: Command R+ cohere/command-r-plus-08-2024 | command-r | text | text | 128,000 | 4,096 | $2.856 / 1M | $14.246 / 1M | No | Yes | Aug 30, 2024 | |
NanoGPT nano-gpt | DeepHermes-3 Mistral 24B (Preview) NousResearch 2/DeepHermes-3-Mistral-24B-Preview | nousresearch | text | text | 128,000 | 32,768 | $0.3 / 1M | $0.3 / 1M | No | No | May 10, 2025 | |
NanoGPT nano-gpt | DeepSeek Chat 0324 deepseek-v3-0324 | Unknown | text | text | 128,000 | 8,192 | $0.25 / 1M | $0.7 / 1M | No | Yes | Mar 24, 2025 | |
NanoGPT nano-gpt | DeepSeek Math V2 deepseek-math-v2 | Unknown | text | text | 128,000 | 65,536 | $0.6 / 1M | $2.2 / 1M | No | No | Dec 3, 2025 | |
NanoGPT nano-gpt | DeepSeek R1 deepseek-r1 | Unknown | text | text | 128,000 | 8,192 | $0.4 / 1M | $1.7 / 1M | Yes | No | Jan 20, 2025 | |
NanoGPT nano-gpt | DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 | deepseek | text | text | 128,000 | 163,840 | $0.4 / 1M | $1.7 / 1M | Yes | No | May 28, 2025 | |
NanoGPT nano-gpt | DeepSeek R1 0528 TEE TEE/deepseek-r1-0528 | deepseek | text | text | 128,000 | 65,536 | $2 / 1M | $2 / 1M | No | No | May 28, 2025 | |
NanoGPT nano-gpt | Deepseek R1 Cheaper deepseek-reasoner-cheaper | Unknown | text | text | 128,000 | 65,536 | $0.4 / 1M | $1.7 / 1M | No | No | Jan 20, 2025 | |
NanoGPT nano-gpt | DeepSeek R1 Fast deepseek-r1-sambanova | Unknown | text | text | 128,000 | 4,096 | $4.998 / 1M | $6.987 / 1M | No | No | Feb 20, 2025 | |
NanoGPT nano-gpt | DeepSeek TNG R1T2 Chimera tngtech/DeepSeek-TNG-R1T2-Chimera | tngtech | text | text | 128,000 | 8,192 | $0.31 / 1M | $0.31 / 1M | No | No | Sep 5, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 | deepseek | text | text | 128,000 | 8,192 | $0.28 / 1M | $0.42 / 1M | No | No | Dec 10, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus | deepseek | text | text | 128,000 | 65,536 | $0.25 / 1M | $0.7 / 1M | No | Yes | Aug 2, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.1 Terminus (Thinking) deepseek-ai/DeepSeek-V3.1-Terminus:thinking | deepseek-thinking | text | text | 128,000 | 65,536 | $0.25 / 1M | $0.7 / 1M | No | Yes | Sep 22, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.1 Thinking deepseek-ai/DeepSeek-V3.1:thinking | deepseek-thinking | text | text | 128,000 | 65,536 | $0.2 / 1M | $0.7 / 1M | No | No | Aug 21, 2025 | |
NanoGPT nano-gpt | Doubao Seed 1.8 doubao-seed-1-8-251215 | Unknown | text | text | 128,000 | 8,192 | $0.612 / 1M | $6.12 / 1M | No | No | Dec 15, 2025 | |
NanoGPT nano-gpt | Gemini 2.0 Pro Reasoner gemini-2.0-pro-reasoner | Unknown | text | text | 128,000 | 65,536 | $1.292 / 1M | $4.998 / 1M | No | No | Feb 5, 2025 | |
NanoGPT nano-gpt | Gemma 3 1B IT unsloth/gemma-3-1b-it | unsloth | text | text | 128,000 | 8,192 | $0.1003 / 1M | $0.1003 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | GLM 4 32B 0414 THUDM/GLM-4-32B-0414 | glm | text | text | 128,000 | 65,536 | $0.2 / 1M | $0.2 / 1M | No | No | Apr 14, 2025 | |
NanoGPT nano-gpt | GLM 4 Air 0111 glm-4-air-0111 | Unknown | text | text | 128,000 | 4,096 | $0.1394 / 1M | $0.1394 / 1M | No | No | Jan 11, 2025 | |
NanoGPT nano-gpt | GLM 4 Plus 0111 glm-4-plus-0111 | Unknown | text | text | 128,000 | 4,096 | $9.996 / 1M | $9.996 / 1M | No | No | Feb 19, 2025 | |
NanoGPT nano-gpt | GLM Z1 32B 0414 THUDM/GLM-Z1-32B-0414 | glm-z | text | text | 128,000 | 65,536 | $0.2 / 1M | $0.2 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | GLM-4 glm-4 | Unknown | text | text | 128,000 | 4,096 | $14.994 / 1M | $14.994 / 1M | No | No | Jan 16, 2024 | |
NanoGPT nano-gpt | GLM-4 Air glm-4-air | Unknown | text | text | 128,000 | 4,096 | $0.2006 / 1M | $0.2006 / 1M | No | No | Jun 5, 2024 | |
NanoGPT nano-gpt | GLM-4 Flash glm-4-flash | Unknown | text | text | 128,000 | 4,096 | $0.1003 / 1M | $0.1003 / 1M | No | No | Aug 1, 2024 | |
NanoGPT nano-gpt | GLM-4 Plus glm-4-plus | Unknown | text | text | 128,000 | 4,096 | $7.497 / 1M | $7.497 / 1M | No | No | Aug 1, 2024 | |
NanoGPT nano-gpt | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 128,000 | 16,384 | $0.05 / 1M | $0.25 / 1M | Yes | Yes | Aug 5, 2025 | |
NanoGPT nano-gpt | GPT OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 128,000 | 16,384 | $0.04 / 1M | $0.15 / 1M | Yes | No | Aug 5, 2025 | |
NanoGPT nano-gpt | GPT OSS Safeguard 20B openai/gpt-oss-safeguard-20b | gpt-oss | text | text | 128,000 | 16,384 | $0.075 / 1M | $0.3 / 1M | Yes | No | Oct 29, 2025 | |
NanoGPT nano-gpt | GPT-4 Turbo Preview openai/gpt-4-turbo-preview | gpt | text | text | 128,000 | 4,096 | $9.996 / 1M | $30.005 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | GPT-4o mini Search Preview openai/gpt-4o-mini-search-preview | gpt-mini | text | text | 128,000 | 16,384 | $0.088 / 1M | $0.35 / 1M | No | No | Jul 18, 2024 | |
NanoGPT nano-gpt | Hermes 4 (Thinking) NousResearch 2/Hermes-4-70B:thinking | nousresearch | text | text | 128,000 | 8,192 | $0.2006 / 1M | $0.3995 / 1M | No | No | Sep 17, 2025 | |
NanoGPT nano-gpt | Hermes 4 Large NousResearch 2/hermes-4-405b | nousresearch | text | text | 128,000 | 8,192 | $0.3 / 1M | $1.2 / 1M | No | Yes | Aug 26, 2025 | |
NanoGPT nano-gpt | Hermes 4 Large (Thinking) NousResearch 2/hermes-4-405b:thinking | nousresearch | text | text | 128,000 | 8,192 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jan 1, 2025 | |
NanoGPT nano-gpt | Hermes 4 Medium NousResearch 2/hermes-4-70b | nousresearch | text | text | 128,000 | 8,192 | $0.2006 / 1M | $0.3995 / 1M | No | No | Jul 3, 2025 | |
NanoGPT nano-gpt | K2-Think LLM360/K2-Think | kimi-thinking | text | text | 128,000 | 32,768 | $0.17 / 1M | $0.68 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | KAT Coder Air V1 KAT-Coder-Air-V1 | Unknown | text | text | 128,000 | 32,768 | $0.1 / 1M | $0.2 / 1M | No | No | Oct 28, 2025 | |
NanoGPT nano-gpt | KAT Coder Exp 72B 1010 KAT-Coder-Exp-72B-1010 | Unknown | text | text | 128,000 | 32,768 | $0.1 / 1M | $0.2 / 1M | No | No | Oct 28, 2025 | |
NanoGPT nano-gpt | Kimi K2 0711 moonshotai/kimi-k2-instruct-0711 | kimi | text | text | 128,000 | 8,192 | $0.1 / 1M | $2 / 1M | No | Yes | Jul 11, 2025 | |
NanoGPT nano-gpt | Kimi K2 0711 Instruct FP4 baseten/Kimi-K2-Instruct-FP4 | kimi | text | text | 128,000 | 131,072 | $0.1 / 1M | $2 / 1M | No | No | Jul 11, 2025 | |
NanoGPT nano-gpt | Kimi K2 Thinking TEE TEE/kimi-k2-thinking | kimi-thinking | text | text | 128,000 | 65,535 | $2 / 1M | $2 / 1M | No | No | Nov 6, 2025 | |
NanoGPT nano-gpt | Kimi K2.5 TEE TEE/kimi-k2.5 | kimi | text | text | 128,000 | 65,535 | $0.3 / 1M | $1.9 / 1M | No | No | Jan 29, 2026 | |
NanoGPT nano-gpt | Kimi K2.5 Thinking TEE TEE/kimi-k2.5-thinking | kimi-thinking | text | text | 128,000 | 65,535 | $0.3 / 1M | $1.9 / 1M | Yes | No | Jan 29, 2026 | |
NanoGPT nano-gpt | Llama 3.1 8B (decentralized) Meta-Llama-3-1-8B-Instruct-FP8 | Unknown | text | text | 128,000 | 16,384 | $0.02 / 1M | $0.03 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B TEE/llama3-3-70b | llama | text | text | 128,000 | 16,384 | $2 / 1M | $2 / 1M | No | No | Jul 3, 2025 | |
NanoGPT nano-gpt | Llama-xLAM-2 70B fc-r Salesforce/Llama-xLAM-2-70b-fc-r | llama | text | text | 128,000 | 16,384 | $2.5 / 1M | $2.5 / 1M | No | No | Apr 13, 2025 | |
NanoGPT nano-gpt | LongCat Flash meituan-longcat/LongCat-Flash-Chat-FP8 | longcat | text | text | 128,000 | 32,768 | $0.15 / 1M | $0.7 / 1M | No | Yes | Aug 31, 2025 | |
NanoGPT nano-gpt | Mistral Large 2411 mistralai/mistral-large | mistral-large | text | text | 128,000 | 256,000 | $2.006 / 1M | $6.001 / 1M | No | No | Feb 26, 2024 | |
NanoGPT nano-gpt | Mistral Small 3.2 24b Instruct chutesai/Mistral-Small-3.2-24B-Instruct-2506 | chutesai | text | text | 128,000 | 131,072 | $0.2 / 1M | $0.4 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Nvidia Nemotron Nano 9B v2 nvidia/nvidia-nemotron-nano-9b-v2 | nemotron | text | text | 128,000 | 16,384 | $0.17 / 1M | $0.68 / 1M | No | No | Aug 18, 2025 | |
NanoGPT nano-gpt | Nvidia Nemotron Super 49B nvidia/Llama-3.3-Nemotron-Super-49B-v1 | nemotron | text | text | 128,000 | 16,384 | $0.15 / 1M | $0.15 / 1M | No | No | Aug 8, 2025 | |
NanoGPT nano-gpt | Nvidia Nemotron Super 49B v1.5 nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 | nemotron | text | text | 128,000 | 16,384 | $0.05 / 1M | $0.25 / 1M | No | No | Aug 8, 2025 | |
NanoGPT nano-gpt | Nvidia Nemotron Ultra 253B nvidia/Llama-3.1-Nemotron-Ultra-253B-v1 | nemotron | text | text | 128,000 | 16,384 | $0.4 / 1M | $0.8 / 1M | No | No | Jul 3, 2025 | |
NanoGPT nano-gpt | Olmo 3 32B Think allenai/olmo-3-32b-think | allenai | text | text | 128,000 | 8,192 | $0.3 / 1M | $0.45 / 1M | Yes | No | Nov 1, 2025 | |
NanoGPT nano-gpt | OpenAI o1-preview openai/o1-preview | o | text | text | 128,000 | 32,768 | $14.994 / 1M | $59.993 / 1M | Yes | No | Sep 12, 2024 | |
NanoGPT nano-gpt | Phi 4 Mini phi-4-mini-instruct | Unknown | text | text | 128,000 | 16,384 | $0.17 / 1M | $0.68 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Phi 4 Multimodal phi-4-multimodal-instruct | Unknown | text | text | 128,000 | 16,384 | $0.07 / 1M | $0.11 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Qwen: QwQ 32B qwq-32b | Unknown | text | text | 128,000 | 32,768 | $0.256 / 1M | $0.305 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Qwen3 Coder 30B A3B Instruct qwen3-coder-30b-a3b-instruct | Unknown | text | text | 128,000 | 65,536 | $0.1 / 1M | $0.4 / 1M | No | Yes | Aug 5, 2025 | |
NanoGPT nano-gpt | Qwen3 Coder 480B TEE TEE/qwen3-coder | qwen | text | text | 128,000 | 32,768 | $1.5 / 1M | $2 / 1M | No | No | Jul 23, 2025 | |
NanoGPT nano-gpt | QwenLong L1 32B Tongyi-Zhiwen/QwenLong-L1-32B | qwen | text | text | 128,000 | 40,960 | $0.14 / 1M | $0.6 / 1M | No | No | Jan 25, 2025 | |
NanoGPT nano-gpt | RNJ-1 Instruct 8B essentialai/rnj-1-instruct | rnj | text | text | 128,000 | 8,192 | $0.15 / 1M | $0.15 / 1M | No | No | Dec 13, 2025 | |
NanoGPT nano-gpt | Sarvam Medium sarvan-medium | Unknown | text | text | 128,000 | 16,384 | $0.25 / 1M | $0.75 / 1M | No | No | Jan 1, 2025 | |
NanoGPT nano-gpt | Shisa V2 Llama 3.3 70B shisa-ai/shisa-v2-llama3.3-70b | llama | text | text | 128,000 | 16,384 | $0.5 / 1M | $0.5 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Step R1 V Mini step-r1-v-mini | Unknown | text | text | 128,000 | 65,536 | $2.5 / 1M | $11 / 1M | No | No | Apr 8, 2025 | |
NanoGPT nano-gpt | TNG R1T Chimera tngtech/tng-r1t-chimera | tngtech | text | text | 128,000 | 65,536 | $0.3 / 1M | $1.2 / 1M | No | No | Nov 26, 2025 | |
NanoGPT nano-gpt | Tongyi DeepResearch 30B A3B Alibaba-NLP/Tongyi-DeepResearch-30B-A3B | yi | text | text | 128,000 | 65,536 | $0.08 / 1M | $0.24 / 1M | No | No | Aug 26, 2025 | |
NanoGPT nano-gpt | Venice Uncensored venice-uncensored | Unknown | text | text | 128,000 | 16,384 | $0.4 / 1M | $0.4 / 1M | No | No | Feb 24, 2025 | |
Nebius Token Factory nebius | DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 | Unknown | text | text | 128,000 | 32,768 | $0.8 / 1M | $2.4 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | DeepSeek-V3-0324 deepseek-ai/DeepSeek-V3-0324 | Unknown | text | text | 128,000 | 8,192 | $0.5 / 1M | $1.5 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | DeepSeek-V3-0324 (Fast) deepseek-ai/DeepSeek-V3-0324-fast | Unknown | text | text | 128,000 | 8,192 | $0.75 / 1M | $2.25 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | GLM-4.5 zai-org/GLM-4.5 | Unknown | text | text | 128,000 | 4,096 | $0.6 / 1M | $2.2 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | GLM-4.5-Air zai-org/GLM-4.5-Air | Unknown | text | text | 128,000 | 4,096 | $0.2 / 1M | $1.2 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | GLM-4.7 (FP8) zai-org/GLM-4.7-FP8 | Unknown | text | text | 128,000 | 4,096 | $0.4 / 1M | $2 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | gpt-oss-120b openai/gpt-oss-120b | Unknown | text | text | 128,000 | 8,192 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | gpt-oss-20b openai/gpt-oss-20b | Unknown | text | text | 128,000 | 4,096 | $0.05 / 1M | $0.2 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Hermes-4-405B NousResearch/Hermes-4-405B | Unknown | text | text | 128,000 | 8,192 | $1 / 1M | $3 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Hermes-4-70B NousResearch/Hermes-4-70B | Unknown | text | text | 128,000 | 8,192 | $0.13 / 1M | $0.4 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | INTELLECT-3 PrimeIntellect/INTELLECT-3 | Unknown | text | text | 128,000 | 8,192 | $0.2 / 1M | $1.1 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking | Unknown | text | text | 128,000 | 16,384 | $0.6 / 1M | $2.5 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Llama-3.1-Nemotron-Ultra-253B-v1 nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 | Unknown | text | text | 128,000 | 4,096 | $0.6 / 1M | $1.8 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct | Unknown | text | text | 128,000 | 8,192 | $0.13 / 1M | $0.4 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Llama-3.3-70B-Instruct (Fast) meta-llama/Llama-3.3-70B-Instruct-fast | Unknown | text | text | 128,000 | 8,192 | $0.25 / 1M | $0.75 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Meta-Llama-3.1-8B-Instruct meta-llama/Meta-Llama-3.1-8B-Instruct | Unknown | text | text | 128,000 | 4,096 | $0.02 / 1M | $0.06 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Meta-Llama-3.1-8B-Instruct (Fast) meta-llama/Meta-Llama-3.1-8B-Instruct-fast | Unknown | text | text | 128,000 | 4,096 | $0.03 / 1M | $0.09 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | MiniMax-M2.1 MiniMaxAI/MiniMax-M2.1 | Unknown | text | text | 128,000 | 8,192 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen2.5-Coder-7B (Fast) Qwen/Qwen2.5-Coder-7B-fast | Unknown | text | text | 128,000 | 8,192 | $0.03 / 1M | $0.09 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 | Unknown | text | text | 128,000 | 8,192 | $0.1 / 1M | $0.3 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-30B-A3B-Thinking-2507 Qwen/Qwen3-30B-A3B-Thinking-2507 | Unknown | text | text | 128,000 | 16,384 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-32B Qwen/Qwen3-32B | Unknown | text | text | 128,000 | 8,192 | $0.1 / 1M | $0.3 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-32B (Fast) Qwen/Qwen3-32B-fast | Unknown | text | text | 128,000 | 8,192 | $0.2 / 1M | $0.6 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-Coder-30B-A3B-Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct | Unknown | text | text | 128,000 | 8,192 | $0.1 / 1M | $0.3 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking | Unknown | text | text | 128,000 | 16,384 | $0.15 / 1M | $1.2 / 1M | Yes | Yes | Feb 4, 2026 | |
NovitaAI novita-ai | DeepSeek R1 0528 Qwen3 8B deepseek/deepseek-r1-0528-qwen3-8b | Unknown | text | text | 128,000 | 32,000 | $0.06 / 1M | $0.09 / 1M | Yes | No | May 29, 2025 | |
NovitaAI novita-ai | Qwen3 4B qwen/qwen3-4b-fp8 | Unknown | text | text | 128,000 | 20,000 | $0.03 / 1M | $0.03 / 1M | Yes | No | Apr 29, 2025 | |
NovitaAI novita-ai | Qwen3 8B qwen/qwen3-8b-fp8 | Unknown | text | text | 128,000 | 20,000 | $0.035 / 1M | $0.138 / 1M | Yes | No | Apr 29, 2025 | |
Nvidia nvidia | Codegemma 1.1 7b google/codegemma-1.1-7b | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | No | Apr 30, 2024 | |
Nvidia nvidia | Codegemma 7b google/codegemma-7b | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | No | Mar 21, 2024 | |
Nvidia nvidia | Codellama 70b meta/codellama-70b | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | No | Jan 29, 2024 | |
Nvidia nvidia | Codestral 22b Instruct V0.1 mistralai/codestral-22b-instruct-v0.1 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 29, 2024 | |
Nvidia nvidia | Deepseek Coder 6.7b Instruct deepseek-ai/deepseek-coder-6.7b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Oct 29, 2023 | |
Nvidia nvidia | Deepseek R1 deepseek-ai/deepseek-r1 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | No | Jan 20, 2025 | |
Nvidia nvidia | Deepseek R1 0528 deepseek-ai/deepseek-r1-0528 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | May 28, 2025 | |
Nvidia nvidia | DeepSeek V3.1 deepseek-ai/deepseek-v3.1 | deepseek | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 26, 2025 | |
Nvidia nvidia | DeepSeek V3.1 Terminus deepseek-ai/deepseek-v3.1-terminus | deepseek | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 22, 2025 | |
Nvidia nvidia | Gemma 2 27b It google/gemma-2-27b-it | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jun 24, 2024 | |
Nvidia nvidia | Gemma 2 2b It google/gemma-2-2b-it | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jul 16, 2024 | |
Nvidia nvidia | Gemma 3 12b It google/gemma-3-12b-it | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Mar 1, 2025 | |
Nvidia nvidia | GPT-OSS-120B openai/gpt-oss-120b | gpt-oss | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | No | Aug 14, 2025 | |
Nvidia nvidia | Kimi K2 Instruct moonshotai/kimi-k2-instruct | kimi | text | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 5, 2025 | |
Nvidia nvidia | Llama 3.1 405b Instruct meta/llama-3.1-405b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jul 16, 2024 | |
Nvidia nvidia | Llama 3.1 70b Instruct meta/llama-3.1-70b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jul 16, 2024 | |
Nvidia nvidia | Llama 3.1 Nemotron 51b Instruct nvidia/llama-3.1-nemotron-51b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Sep 22, 2024 | |
Nvidia nvidia | Llama 3.1 Nemotron 70b Instruct nvidia/llama-3.1-nemotron-70b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Oct 12, 2024 | |
Nvidia nvidia | Llama 3.2 1b Instruct meta/llama-3.2-1b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Sep 18, 2024 | |
Nvidia nvidia | Llama 3.3 70b Instruct meta/llama-3.3-70b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Nov 26, 2024 | |
Nvidia nvidia | Llama 3.3 Nemotron Super 49b V1 nvidia/llama-3.3-nemotron-super-49b-v1 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | No | Mar 16, 2025 | |
Nvidia nvidia | Llama 3.3 Nemotron Super 49b V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | No | Mar 16, 2025 | |
Nvidia nvidia | Llama3 70b Instruct meta/llama3-70b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 17, 2024 | |
Nvidia nvidia | Llama3 8b Instruct meta/llama3-8b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 17, 2024 | |
Nvidia nvidia | Llama3 Chatqa 1.5 70b nvidia/llama3-chatqa-1.5-70b | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 28, 2024 | |
Nvidia nvidia | Mamba Codestral 7b V0.1 mistralai/mamba-codestral-7b-v0.1 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | No | Jul 16, 2024 | |
Nvidia nvidia | Mistral Large 2 Instruct mistralai/mistral-large-2-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jul 24, 2024 | |
Nvidia nvidia | Mistral Small 3.1 24b Instruct 2503 mistralai/mistral-small-3.1-24b-instruct-2503 | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Mar 11, 2025 | |
Nvidia nvidia | Nemotron 4 340b Instruct nvidia/nemotron-4-340b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jun 13, 2024 | |
Nvidia nvidia | Phi 3.5 Moe Instruct microsoft/phi-3.5-moe-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Aug 17, 2024 | |
Nvidia nvidia | Qwen2.5 Coder 32b Instruct qwen/qwen2.5-coder-32b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Nov 6, 2024 | |
Nvidia nvidia | Qwen2.5 Coder 7b Instruct qwen/qwen2.5-coder-7b-instruct | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Sep 17, 2024 | |
Nvidia nvidia | Qwq 32b qwen/qwq-32b | Unknown | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | No | Mar 5, 2025 | |
OpenAI openai | o1-mini o1-mini | o-mini | text | text | 128,000 | 65,536 | $1.1 / 1M | $4.4 / 1M | Yes | No | Sep 12, 2024 | |
OpenAI openai | o1-preview o1-preview | o | text | text | 128,000 | 32,768 | $15 / 1M | $60 / 1M | Yes | No | Sep 12, 2024 | |
OpenCode Zen opencode | GPT-5.3 Codex Spark gpt-5.3-codex-spark | gpt-codex-spark | text | text | 128,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 12, 2026 | |
OpenRouter openrouter | Devstral Small mistralai/devstral-small-2505 | devstral | text | text | 128,000 | 128,000 | $0.06 / 1M | $0.12 / 1M | No | Yes | May 7, 2025 | |
OpenRouter openrouter | GLM 4.5 z-ai/glm-4.5 | glm | text | text | 128,000 | 96,000 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jul 28, 2025 | |
OpenRouter openrouter | GLM 4.5 Air z-ai/glm-4.5-air | glm-air | text | text | 128,000 | 96,000 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jul 28, 2025 | |
OpenRouter openrouter | GLM 4.5 Air (free) z-ai/glm-4.5-air:free | glm-air | text | text | 128,000 | 96,000 | $0 / 1M | $0 / 1M | Yes | No | Jul 28, 2025 | |
OpenRouter openrouter | Mercury 2 inception/mercury-2 | mercury | text | text | 128,000 | 50,000 | $0.25 / 1M | $0.75 / 1M | Yes | Yes | Mar 4, 2026 | |
OpenRouter openrouter | Mercury Edit 2 inception/mercury-edit-2 | Unknown | text | text | 128,000 | 8,192 | $0.25 / 1M | $0.75 / 1M | Yes | No | Mar 30, 2026 | |
OpenRouter openrouter | Nemotron Nano 9B V2 (free) nvidia/nemotron-nano-9b-v2:free | nemotron | text | text | 128,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 18, 2025 | |
OpenRouter openrouter | Qwen3 Coder Flash qwen/qwen3-coder-flash | qwen | text | text | 128,000 | 66,536 | $0.3 / 1M | $1.5 / 1M | No | Yes | Jul 23, 2025 | |
Perplexity Agent perplexity-agent | Sonar perplexity/sonar | sonar | text | text | 128,000 | 8,192 | $0.25 / 1M | $2.5 / 1M | No | Yes | Sep 1, 2025 | |
Perplexity perplexity | Perplexity Sonar Deep Research sonar-deep-research | Unknown | text | text | 128,000 | 32,768 | $2 / 1M | $8 / 1M | Yes | No | Sep 1, 2025 | |
Perplexity perplexity | Sonar sonar | sonar | text | text | 128,000 | 4,096 | $1 / 1M | $1 / 1M | No | No | Sep 1, 2025 | |
Poe poe | DeepSeek-V3.2 novita/deepseek-v3.2 | Unknown | text | text | 128,000 | Unknown | $0.27 / 1M | $0.4 / 1M | Yes | Yes | Dec 1, 2025 | |
Poe poe | GPT-4o-mini-Search openai/gpt-4o-mini-search | gpt-mini | text | text | 128,000 | 8,192 | $0.14 / 1M | $0.54 / 1M | No | Yes | Mar 11, 2025 | |
Poe poe | GPT-4o-Search openai/gpt-4o-search | gpt | text | text | 128,000 | 8,192 | $2.2 / 1M | $9 / 1M | No | Yes | Mar 11, 2025 | |
Poe poe | GPT-5.3-Codex-Spark openai/gpt-5.3-codex-spark | Unknown | text | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 4, 2026 | |
Poe poe | GPT-OSS-120B-CS cerebras/gpt-oss-120b-cs | Unknown | text | text | 128,000 | Unknown | $0.35 / 1M | $0.75 / 1M | Yes | Yes | Aug 6, 2025 | |
Poe poe | Llama-3.1-8B-CS cerebras/llama-3.1-8b-cs | Unknown | text | text | 128,000 | Unknown | $0.1 / 1M | $0.1 / 1M | No | Yes | May 13, 2025 | |
Privatemode AI privatemode-ai | gpt-oss-120b gpt-oss-120b | gpt-oss | text | text | 128,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 14, 2025 | |
Privatemode AI privatemode-ai | Qwen3-Coder 30B-A3B qwen3-coder-30b-a3b | qwen | text | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | No | Yes | Apr 1, 2025 | |
Qiniu qiniu-ai | DeepSeek-R1 deepseek-r1 | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | DeepSeek-R1-0528 deepseek-r1-0528 | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | DeepSeek-V3 deepseek-v3 | Unknown | text | text | 128,000 | 16,000 | Unknown | Unknown | No | No | Aug 13, 2025 | |
Qiniu qiniu-ai | DeepSeek-V3-0324 deepseek-v3-0324 | Unknown | text | text | 128,000 | 16,000 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | DeepSeek-V3.1 deepseek-v3.1 | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 19, 2025 | |
Qiniu qiniu-ai | DeepSeek/DeepSeek-V3.1-Terminus deepseek/deepseek-v3.1-terminus | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | No | Yes | Sep 22, 2025 | |
Qiniu qiniu-ai | DeepSeek/DeepSeek-V3.1-Terminus-Thinking deepseek/deepseek-v3.1-terminus-thinking | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | Yes | No | Sep 22, 2025 | |
Qiniu qiniu-ai | Deepseek/DeepSeek-V3.2 deepseek/deepseek-v3.2-251201 | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | Yes | Yes | Dec 1, 2025 | |
Qiniu qiniu-ai | DeepSeek/DeepSeek-V3.2-Exp deepseek/deepseek-v3.2-exp | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | No | Yes | Sep 29, 2025 | |
Qiniu qiniu-ai | DeepSeek/DeepSeek-V3.2-Exp-Thinking deepseek/deepseek-v3.2-exp-thinking | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | Yes | No | Sep 29, 2025 | |
Qiniu qiniu-ai | Doubao 1.5 Pro 32k doubao-1.5-pro-32k | Unknown | text | text | 128,000 | 12,000 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Doubao 1.5 Thinking Pro doubao-1.5-thinking-pro | Unknown | text | text | 128,000 | 16,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | gpt-oss-120b gpt-oss-120b | Unknown | text | text | 128,000 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 6, 2025 | |
Qiniu qiniu-ai | gpt-oss-20b gpt-oss-20b | Unknown | text | text | 128,000 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 6, 2025 | |
Qiniu qiniu-ai | Kimi K2 kimi-k2 | Unknown | text | text | 128,000 | 128,000 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen 3 235B A22B qwen3-235b-a22b | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen2.5-Max-2025-01-25 qwen-max-2025-01-25 | Unknown | text | text | 128,000 | 4,096 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen3 30b A3b Instruct 2507 qwen3-30b-a3b-instruct-2507 | Unknown | text | text | 128,000 | 32,000 | Unknown | Unknown | No | Yes | Feb 4, 2026 | |
Regolo AI regolo-ai | GPT-OSS-120B gpt-oss-120b | gpt-oss | text | text | 128,000 | 16,384 | $1 / 1M | $4.2 / 1M | Yes | Yes | Aug 5, 2025 | |
Regolo AI regolo-ai | GPT-OSS-20B gpt-oss-20b | gpt-oss | text | text | 128,000 | 16,384 | $0.4 / 1M | $1.8 / 1M | Yes | Yes | Mar 1, 2026 | |
Regolo AI regolo-ai | Llama 3.3 70B Instruct llama-3.3-70b-instruct | llama | text | text | 128,000 | 16,384 | $0.6 / 1M | $2.7 / 1M | No | Yes | Apr 28, 2025 | |
SAP AI Core sap-ai-core | sonar sonar | sonar | text | text | 128,000 | 4,096 | $1 / 1M | $1 / 1M | No | No | Sep 1, 2025 | |
SAP AI Core sap-ai-core | sonar-deep-research sonar-deep-research | sonar-deep-research | text | text | 128,000 | 32,768 | $2 / 1M | $8 / 1M | Yes | No | Sep 1, 2025 | |
Scaleway scaleway | GPT-OSS 120B gpt-oss-120b | gpt-oss | text | text | 128,000 | 32,768 | $0.15 / 1M | $0.6 / 1M | No | Yes | Mar 17, 2026 | |
Scaleway scaleway | Llama 3.1 8B Instruct llama-3.1-8b-instruct | llama | text | text | 128,000 | 16,384 | $0.2 / 1M | $0.2 / 1M | No | Yes | Mar 17, 2026 | |
Scaleway scaleway | Mistral Nemo Instruct 2407 mistral-nemo-instruct-2407 | mistral-nemo | text | text | 128,000 | 8,192 | $0.2 / 1M | $0.2 / 1M | No | Yes | Mar 17, 2026 | |
Scaleway scaleway | Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct | qwen | text | text | 128,000 | 32,768 | $0.2 / 1M | $0.8 / 1M | No | Yes | Mar 17, 2026 | |
SiliconFlow (China) siliconflow-cn | ascend-tribe/pangu-pro-moe ascend-tribe/pangu-pro-moe | pangu | text | text | 128,000 | 128,000 | $0.2 / 1M | $0.6 / 1M | Yes | No | Jan 16, 2026 | |
SiliconFlow (China) siliconflow-cn | Kwaipilot/KAT-Dev Kwaipilot/KAT-Dev | kat-coder | text | text | 128,000 | 128,000 | $0.2 / 1M | $0.6 / 1M | No | Yes | Jan 16, 2026 | |
STACKIT stackit | Llama 3.1 8B neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8 | llama | text | text | 128,000 | 8,192 | $0.16 / 1M | $0.27 / 1M | No | Yes | Jul 23, 2024 | |
STACKIT stackit | Llama 3.3 70B cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic | llama | text | text | 128,000 | 8,192 | $0.49 / 1M | $0.71 / 1M | No | Yes | Dec 5, 2024 | |
STACKIT stackit | Mistral Nemo neuralmagic/Mistral-Nemo-Instruct-2407-FP8 | mistral | text | text | 128,000 | 8,192 | $0.49 / 1M | $0.71 / 1M | No | Yes | Jul 1, 2024 | |
Synthetic synthetic | DeepSeek R1 hf:deepseek-ai/DeepSeek-R1 | deepseek-thinking | text | text | 128,000 | 128,000 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Jan 20, 2025 | |
Synthetic synthetic | DeepSeek R1 (0528) hf:deepseek-ai/DeepSeek-R1-0528 | deepseek-thinking | text | text | 128,000 | 128,000 | $3 / 1M | $8 / 1M | Yes | Yes | Aug 1, 2025 | |
Synthetic synthetic | DeepSeek V3 hf:deepseek-ai/DeepSeek-V3 | deepseek | text | text | 128,000 | 128,000 | $1.25 / 1M | $1.25 / 1M | Yes | Yes | May 29, 2025 | |
Synthetic synthetic | DeepSeek V3 (0324) hf:deepseek-ai/DeepSeek-V3-0324 | deepseek | text | text | 128,000 | 128,000 | $1.2 / 1M | $1.2 / 1M | No | Yes | Aug 1, 2025 | |
Synthetic synthetic | DeepSeek V3.1 hf:deepseek-ai/DeepSeek-V3.1 | deepseek | text | text | 128,000 | 128,000 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Aug 21, 2025 | |
Synthetic synthetic | DeepSeek V3.1 Terminus hf:deepseek-ai/DeepSeek-V3.1-Terminus | deepseek | text | text | 128,000 | 128,000 | $1.2 / 1M | $1.2 / 1M | Yes | Yes | Sep 25, 2025 | |
Synthetic synthetic | GPT OSS 120B hf:openai/gpt-oss-120b | gpt-oss | text | text | 128,000 | 32,768 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Aug 5, 2025 | |
Synthetic synthetic | Llama-3.1-405B-Instruct hf:meta-llama/Llama-3.1-405B-Instruct | llama | text | text | 128,000 | 32,768 | $3 / 1M | $3 / 1M | Yes | Yes | Jul 23, 2024 | |
Synthetic synthetic | Llama-3.1-70B-Instruct hf:meta-llama/Llama-3.1-70B-Instruct | llama | text | text | 128,000 | 32,768 | $0.9 / 1M | $0.9 / 1M | Yes | Yes | Jul 23, 2024 | |
Synthetic synthetic | Llama-3.1-8B-Instruct hf:meta-llama/Llama-3.1-8B-Instruct | llama | text | text | 128,000 | 32,768 | $0.2 / 1M | $0.2 / 1M | Yes | Yes | Jul 23, 2024 | |
Synthetic synthetic | Llama-3.3-70B-Instruct hf:meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 128,000 | 32,768 | $0.9 / 1M | $0.9 / 1M | Yes | Yes | Dec 6, 2024 | |
The Grid AI the-grid-ai | Text Prime text-prime | Unknown | text | text | 128,000 | 30,000 | Unknown | Unknown | Yes | Yes | Feb 26, 2026 | |
The Grid AI the-grid-ai | Text Standard text-standard | Unknown | text | text | 128,000 | 16,000 | Unknown | Unknown | Yes | Yes | Feb 26, 2026 | |
Venice AI venice | Aion 2.0 aion-labs-aion-2-0 | o | text | text | 128,000 | 32,768 | $1 / 1M | $2 / 1M | Yes | No | Apr 12, 2026 | |
Venice AI venice | GLM 4.7 Flash zai-org-glm-4.7-flash | glm-flash | text | text | 128,000 | 16,384 | $0.125 / 1M | $0.5 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | Hermes 3 Llama 3.1 405b hermes-3-llama-3.1-405b | hermes | text | text | 128,000 | 16,384 | $1.1 / 1M | $3 / 1M | No | No | Mar 12, 2026 | |
Venice AI venice | Llama 3.2 3B llama-3.2-3b | llama | text | text | 128,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | Llama 3.3 70B llama-3.3-70b | llama | text | text | 128,000 | 4,096 | $0.7 / 1M | $2.8 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | Mercury 2 mercury-2 | mercury | text | text | 128,000 | 50,000 | $0.3125 / 1M | $0.9375 / 1M | Yes | Yes | Apr 9, 2026 | |
Venice AI venice | NVIDIA Nemotron 3 Nano 30B nvidia-nemotron-3-nano-30b-a3b | nemotron | text | text | 128,000 | 16,384 | $0.075 / 1M | $0.3 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | OpenAI GPT OSS 120B openai-gpt-oss-120b | gpt-oss | text | text | 128,000 | 16,384 | $0.07 / 1M | $0.3 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | Qwen 3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 | qwen | text | text | 128,000 | 16,384 | $0.15 / 1M | $0.75 / 1M | No | Yes | Mar 12, 2026 | |
Venice AI venice | Qwen 3 235B A22B Thinking 2507 qwen3-235b-a22b-thinking-2507 | qwen | text | text | 128,000 | 16,384 | $0.45 / 1M | $3.5 / 1M | Yes | Yes | Mar 12, 2026 | |
Vercel AI Gateway vercel | DeepSeek V3.2 Thinking deepseek/deepseek-v3.2-thinking | deepseek-thinking | text | text | 128,000 | 64,000 | $0.28 / 1M | $0.42 / 1M | Yes | Yes | Dec 1, 2025 | |
Vercel AI Gateway vercel | DeepSeek-R1 deepseek/deepseek-r1 | deepseek-thinking | text | text | 128,000 | 32,768 | $1.35 / 1M | $5.4 / 1M | Yes | Yes | May 29, 2025 | |
Vercel AI Gateway vercel | Devstral Small 1.1 mistral/devstral-small | devstral | text | text | 128,000 | 64,000 | $0.1 / 1M | $0.3 / 1M | No | Yes | May 7, 2025 | |
Vercel AI Gateway vercel | GLM 4.5 Air zai/glm-4.5-air | glm-air | text | text | 128,000 | 96,000 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jul 28, 2025 | |
Vercel AI Gateway vercel | GPT 4o Mini Search Preview openai/gpt-4o-mini-search-preview | gpt-mini | text | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | No | Jan 1, 2025 | |
Vercel AI Gateway vercel | Llama 3.2 1B Instruct meta/llama-3.2-1b | llama | text | text | 128,000 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | No | Sep 18, 2024 | |
Vercel AI Gateway vercel | Llama 3.2 3B Instruct meta/llama-3.2-3b | llama | text | text | 128,000 | 8,192 | $0.15 / 1M | $0.15 / 1M | No | No | Sep 18, 2024 | |
Vercel AI Gateway vercel | Llama-3.3-70B-Instruct meta/llama-3.3-70b | llama | text | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Dec 6, 2024 | |
Vercel AI Gateway vercel | LongCat Flash Chat meituan/longcat-flash-chat | longcat | text | text | 128,000 | 8,192 | Unknown | Unknown | No | Yes | Aug 30, 2025 | |
Vercel AI Gateway vercel | LongCat Flash Thinking meituan/longcat-flash-thinking | longcat | text | text | 128,000 | 8,192 | $0.15 / 1M | $1.5 / 1M | Yes | Yes | Sep 23, 2025 | |
Vercel AI Gateway vercel | Magistral Medium (latest) mistral/magistral-medium | magistral-medium | text | text | 128,000 | 16,384 | $2 / 1M | $5 / 1M | Yes | Yes | Mar 20, 2025 | |
Vercel AI Gateway vercel | Magistral Small mistral/magistral-small | magistral-small | text | text | 128,000 | 128,000 | $0.5 / 1M | $1.5 / 1M | Yes | Yes | Mar 17, 2025 | |
Vercel AI Gateway vercel | Mercury 2 inception/mercury-2 | mercury | text | text | 128,000 | 128,000 | $0.25 / 1M | $0.75 / 1M | Yes | Yes | Mar 6, 2026 | |
Vercel AI Gateway vercel | Mercury Edit 2 inception/mercury-edit-2 | Unknown | text | text | 128,000 | 8,192 | $0.25 / 1M | $0.75 / 1M | Yes | No | Mar 30, 2026 | |
Vercel AI Gateway vercel | Ministral 3B (latest) mistral/ministral-3b | ministral | text | text | 128,000 | 128,000 | $0.04 / 1M | $0.04 / 1M | No | Yes | Oct 4, 2024 | |
Vercel AI Gateway vercel | Ministral 8B (latest) mistral/ministral-8b | ministral | text | text | 128,000 | 128,000 | $0.1 / 1M | $0.1 / 1M | No | Yes | Oct 4, 2024 | |
Vercel AI Gateway vercel | Nova Micro amazon/nova-micro | nova-micro | text | text | 128,000 | 8,192 | $0.035 / 1M | $0.14 / 1M | No | Yes | Dec 3, 2024 | |
Vertex google-vertex | Llama 3.3 70B Instruct meta/llama-3.3-70b-instruct-maas | llama | text | text | 128,000 | 8,192 | $0.72 / 1M | $0.72 / 1M | No | Yes | Apr 29, 2025 | |
Vivgrid vivgrid | DeepSeek-V3.2 deepseek-v3.2 | deepseek | text | text | 128,000 | 128,000 | $0.28 / 1M | $0.42 / 1M | Yes | Yes | Dec 1, 2025 | |
Weights & Biases wandb | Llama 3.1 70B meta-llama/Llama-3.1-70B-Instruct | llama | text | text | 128,000 | 128,000 | $0.8 / 1M | $0.8 / 1M | No | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 128,000 | 128,000 | $0.71 / 1M | $0.71 / 1M | Yes | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Meta-Llama-3.1-8B-Instruct meta-llama/Llama-3.1-8B-Instruct | llama | text | text | 128,000 | 128,000 | $0.22 / 1M | $0.22 / 1M | Yes | Yes | Mar 12, 2026 | |
Weights & Biases wandb | Phi-4-mini-instruct microsoft/Phi-4-mini-instruct | phi | text | text | 128,000 | 128,000 | $0.08 / 1M | $0.35 / 1M | Yes | Yes | Mar 12, 2026 | |
ZenMux zenmux | DeepSeek V3.2 deepseek/deepseek-v3.2 | Unknown | text | text | 128,000 | 64,000 | $0.28 / 1M | $0.43 / 1M | Yes | Yes | Dec 5, 2025 | |
ZenMux zenmux | DeepSeek-V3.2 (Non-thinking Mode) deepseek/deepseek-chat | Unknown | text | text | 128,000 | 64,000 | $0.28 / 1M | $0.42 / 1M | No | Yes | Dec 1, 2025 | |
ZenMux zenmux | GLM 4.5 z-ai/glm-4.5 | Unknown | text | text | 128,000 | 64,000 | $0.35 / 1M | $1.54 / 1M | Yes | Yes | Jul 25, 2025 | |
ZenMux zenmux | GLM 4.5 Air z-ai/glm-4.5-air | Unknown | text | text | 128,000 | 64,000 | $0.11 / 1M | $0.56 / 1M | Yes | Yes | Jul 25, 2025 | |
ZenMux zenmux | GPT-5.3 Chat openai/gpt-5.3-chat | Unknown | text | text | 128,000 | 16,380 | $1.75 / 1M | $14 / 1M | No | Yes | Mar 20, 2026 | |
ZenMux zenmux | Ling-1T inclusionai/ling-1t | Unknown | text | text | 128,000 | 64,000 | $0.56 / 1M | $2.24 / 1M | No | Yes | Oct 9, 2025 | |
ZenMux zenmux | Ring-1T inclusionai/ring-1t | Unknown | text | text | 128,000 | 64,000 | $0.56 / 1M | $2.24 / 1M | Yes | Yes | Oct 12, 2025 | |
Helicone helicone | Perplexity Sonar sonar | sonar | text | text | 127,000 | 4,096 | $1 / 1M | $1 / 1M | No | No | Jan 27, 2025 | |
Helicone helicone | Perplexity Sonar Deep Research sonar-deep-research | sonar-deep-research | text | text | 127,000 | 4,096 | $2 / 1M | $8 / 1M | Yes | No | Jan 27, 2025 | |
Helicone helicone | Perplexity Sonar Reasoning sonar-reasoning | sonar-reasoning | text | text | 127,000 | 4,096 | $1 / 1M | $5 / 1M | Yes | No | Jan 27, 2025 | |
Helicone helicone | Perplexity Sonar Reasoning Pro sonar-reasoning-pro | sonar-reasoning | text | text | 127,000 | 4,096 | $2 / 1M | $8 / 1M | Yes | No | Jan 27, 2025 | |
NanoGPT nano-gpt | Perplexity Reasoning Pro sonar-reasoning-pro | Unknown | text | text | 127,000 | 128,000 | $2.006 / 1M | $7.9985 / 1M | Yes | No | Feb 19, 2025 | |
NanoGPT nano-gpt | Perplexity Simple sonar | Unknown | text | text | 127,000 | 128,000 | $1.003 / 1M | $1.003 / 1M | No | No | Feb 19, 2025 | |
Vercel AI Gateway vercel | Sonar Reasoning perplexity/sonar-reasoning | sonar-reasoning | text | text | 127,000 | 8,000 | $1 / 1M | $5 / 1M | Yes | No | Feb 19, 2025 | |
Vercel AI Gateway vercel | Sonar Reasoning Pro perplexity/sonar-reasoning-pro | sonar-reasoning | text | text | 127,000 | 8,000 | $2 / 1M | $8 / 1M | Yes | No | Feb 19, 2025 | |
Vultr vultr | DeepSeek V3.2 DeepSeek-V3.2 | deepseek | text | text | 127,000 | 4,096 | $0.55 / 1M | $1.65 / 1M | No | Yes | Dec 1, 2025 | |
Qiniu qiniu-ai | Qwen3 30b A3b Thinking 2507 qwen3-30b-a3b-thinking-2507 | Unknown | text | text | 126,000 | 32,000 | Unknown | Unknown | Yes | Yes | Feb 4, 2026 | |
Jiekou.AI jiekou | ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b-paddle | ernie | text | text | 123,000 | 12,000 | $0.28 / 1M | $1.1 / 1M | No | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b | Unknown | text | text | 123,000 | 12,000 | $0.28 / 1M | $1.1 / 1M | No | No | Jan 1, 2026 | |
NovitaAI novita-ai | ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b-paddle | Unknown | text | text | 123,000 | 12,000 | $0.28 / 1M | $1.1 / 1M | No | No | Jun 30, 2025 | |
Amazon Bedrock amazon-bedrock | Palmyra X4 writer.palmyra-x4-v1:0 | palmyra | text | text | 122,880 | 8,192 | $2.5 / 1M | $10 / 1M | Yes | Yes | Apr 28, 2025 | |
Kilo Gateway kilo | Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | Unknown | text | text | 120,000 | 8,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Jun 30, 2025 | |
LucidQuery AI lucidquery | LucidNova RF1 100B lucidnova-rf1-100b | nova | text | text | 120,000 | 8,000 | $2 / 1M | $5 / 1M | Yes | Yes | Sep 10, 2025 | |
NovitaAI novita-ai | ERNIE 4.5 21B A3B baidu/ernie-4.5-21B-a3b | ernie | text | text | 120,000 | 8,000 | $0.07 / 1M | $0.28 / 1M | No | Yes | Jun 30, 2025 | |
Regolo AI regolo-ai | Llama 3.1 8B Instruct llama-3.1-8b-instruct | llama | text | text | 120,000 | 120,000 | $0.05 / 1M | $0.25 / 1M | No | Yes | Apr 7, 2025 | |
Regolo AI regolo-ai | Mistral Small 3.2 mistral-small3.2 | mistral-small | text | text | 120,000 | 120,000 | $0.5 / 1M | $2.2 / 1M | Yes | Yes | Jan 31, 2025 | |
IO.NET io-net | Qwen 3 Coder 480B Intel/Qwen3-Coder-480B-A35B-Instruct-int4-mixed-ar | qwen | text | text | 106,000 | 4,096 | $0.22 / 1M | $0.95 / 1M | No | Yes | Jan 15, 2025 | |
Scaleway scaleway | Llama-3.3-70B-Instruct llama-3.3-70b-instruct | llama | text | text | 100,000 | 16,384 | $0.9 / 1M | $0.9 / 1M | No | Yes | Mar 17, 2026 | |
Kilo Gateway kilo | Morph: Morph V3 Fast morph/morph-v3-fast | Unknown | text | text | 81,920 | 38,000 | $0.8 / 1M | $1.2 / 1M | No | No | Aug 15, 2024 | |
Kilo Gateway kilo | Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct | Unknown | text | text | 80,000 | 16,384 | $0.051 / 1M | $0.34 / 1M | No | No | Mar 15, 2026 | |
NanoGPT nano-gpt | Venice Uncensored Web venice-uncensored:web | Unknown | text | text | 80,000 | 16,384 | $0.4 / 1M | $0.4 / 1M | No | No | May 1, 2024 | |
submodel submodel | DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 | deepseek-thinking | text | text | 75,000 | 163,840 | $0.5 / 1M | $2.15 / 1M | Yes | Yes | Aug 23, 2025 | |
submodel submodel | DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 | deepseek | text | text | 75,000 | 163,840 | $0.2 / 1M | $0.8 / 1M | No | Yes | Aug 23, 2025 | |
submodel submodel | DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 | deepseek | text | text | 75,000 | 163,840 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Aug 23, 2025 | |
CloudFerro Sherlock cloudferro-sherlock | Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct | llama | text | text | 70,000 | 70,000 | $2.92 / 1M | $2.92 / 1M | No | Yes | Dec 6, 2024 | |
Alibaba (China) alibaba-cn | DeepSeek V3 deepseek-v3 | deepseek | text | text | 65,536 | 8,192 | $0.287 / 1M | $1.147 / 1M | No | Yes | Dec 1, 2024 | |
Deep Infra deepinfra | DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro | deepseek-thinking | text | text | 65,536 | 65,536 | $1.74 / 1M | $3.48 / 1M | Yes | Yes | Apr 24, 2026 | |
evroc evroc | GPT OSS 120B openai/gpt-oss-120b | gpt-oss | text | text | 65,536 | 65,536 | $0.24 / 1M | $0.94 / 1M | Yes | Yes | Aug 5, 2025 | |
GitHub Models github-models | DeepSeek-R1 deepseek/deepseek-r1 | deepseek-thinking | text | text | 65,536 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 20, 2025 | |
GitHub Models github-models | DeepSeek-R1-0528 deepseek/deepseek-r1-0528 | deepseek-thinking | text | text | 65,536 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | May 28, 2025 | |
GitHub Models github-models | MAI-DS-R1 microsoft/mai-ds-r1 | mai | text | text | 65,536 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 20, 2025 | |
Jiekou.AI jiekou | Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | qwen | text | text | 65,536 | 65,536 | $0.15 / 1M | $1.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | qwen | text | text | 65,536 | 65,536 | $0.15 / 1M | $1.5 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think | Unknown | text | text | 65,536 | 65,536 | $0.15 / 1M | $0.5 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | AllenAI: Olmo 3.1 32B Instruct allenai/olmo-3.1-32b-instruct | Unknown | text | text | 65,536 | 32,768 | $0.2 / 1M | $0.6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | MiniMax: MiniMax M2-her minimax/minimax-m2-her | Unknown | text | text | 65,536 | 2,048 | $0.3 / 1M | $1.2 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | Unknown | text | text | 65,536 | 13,108 | $2 / 1M | $6 / 1M | No | Yes | Apr 17, 2024 | |
Kilo Gateway kilo | Reka Flash 3 rekaai/reka-flash-3 | Unknown | text | text | 65,536 | 65,536 | $0.1 / 1M | $0.2 / 1M | Yes | No | Apr 11, 2026 | |
NanoGPT nano-gpt | Aion 1.0 aion-labs/aion-1.0 | llama | text | text | 65,536 | 8,192 | $3.995 / 1M | $7.99 / 1M | No | No | Feb 1, 2025 | |
NanoGPT nano-gpt | Anubis 70B v1 TheDrummer 2/Anubis-70B-v1 | llama | text | text | 65,536 | 16,384 | $0.31 / 1M | $0.31 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Hermes 3 70B NousResearch 2/hermes-3-llama-3.1-70b | nousresearch | text | text | 65,536 | 8,192 | $0.408 / 1M | $0.408 / 1M | No | No | Jan 7, 2026 | |
NanoGPT nano-gpt | Mixtral 8x22B mistralai/mixtral-8x22b-instruct-v0.1 | mixtral | text | text | 65,536 | 32,768 | $0.9 / 1M | $0.9 / 1M | No | No | Dec 11, 2025 | |
NanoGPT nano-gpt | Olmo 3.1 32B Instruct allenai/olmo-3.1-32b-instruct | allenai | text | text | 65,536 | 8,192 | $0.2 / 1M | $0.6 / 1M | No | No | Jan 25, 2026 | |
NanoGPT nano-gpt | Olmo 3.1 32B Think allenai/olmo-3.1-32b-think | allenai | text | text | 65,536 | 8,192 | $0.15 / 1M | $0.5 / 1M | Yes | No | Jan 25, 2026 | |
OVHcloud AI Endpoints ovhcloud | Mistral-7B-Instruct-v0.3 mistral-7b-instruct-v0.3 | Unknown | text | text | 65,536 | 65,536 | $0.11 / 1M | $0.11 / 1M | No | Yes | Apr 1, 2025 | |
OVHcloud AI Endpoints ovhcloud | Mistral-Nemo-Instruct-2407 mistral-nemo-instruct-2407 | Unknown | text | text | 65,536 | 65,536 | $0.14 / 1M | $0.14 / 1M | No | Yes | Nov 20, 2024 | |
Upstage upstage | solar-pro2 solar-pro2 | solar-pro | text | text | 65,536 | 8,192 | $0.25 / 1M | $0.25 / 1M | Yes | Yes | May 20, 2025 | |
Kilo Gateway kilo | WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | Unknown | text | text | 65,535 | 8,000 | $0.62 / 1M | $0.62 / 1M | No | No | Apr 24, 2024 | |
NovitaAI novita-ai | Wizardlm 2 8x22B microsoft/wizardlm-2-8x22b | Unknown | text | text | 65,535 | 8,000 | $0.62 / 1M | $0.62 / 1M | No | No | Apr 24, 2024 | |
NanoGPT nano-gpt | MiniMax M2-her minimax/minimax-m2-her | minimax | text | text | 65,532 | 2,048 | $0.302 / 1M | $1.207 / 1M | No | No | Jan 24, 2026 | |
evroc evroc | Qwen3 30B 2507 Qwen/Qwen3-30B-A3B-Instruct-2507-FP8 | qwen | text | text | 64,000 | 64,000 | $0.35 / 1M | $1.42 / 1M | No | Yes | Jul 30, 2025 | |
IO.NET io-net | GPT-OSS 20B openai/gpt-oss-20b | gpt-oss | text | text | 64,000 | 4,096 | $0.03 / 1M | $0.14 / 1M | No | Yes | Dec 1, 2024 | |
Kilo Gateway kilo | DeepSeek: R1 deepseek/deepseek-r1 | Unknown | text | text | 64,000 | 16,000 | $0.7 / 1M | $2.5 / 1M | Yes | Yes | Jan 20, 2025 | |
LLM Gateway llmgateway | DeepSeek R1 (0528) deepseek-r1-0528 | deepseek | text | text | 64,000 | 16,384 | $0.8 / 1M | $2.4 / 1M | Yes | No | May 28, 2025 | |
Mistral mistral | Mixtral 8x22B open-mixtral-8x22b | mixtral | text | text | 64,000 | 64,000 | $2 / 1M | $6 / 1M | No | Yes | Apr 17, 2024 | |
NanoGPT nano-gpt | DeepSeek Reasoner deepseek-reasoner | Unknown | text | text | 64,000 | 65,536 | $0.4 / 1M | $1.7 / 1M | No | No | Jan 20, 2025 | |
NovitaAI novita-ai | DeepSeek R1 (Turbo) deepseek/deepseek-r1-turbo | Unknown | text | text | 64,000 | 16,000 | $0.7 / 1M | $2.5 / 1M | Yes | Yes | Mar 5, 2025 | |
NovitaAI novita-ai | DeepSeek R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | deepseek-thinking | text | text | 64,000 | 32,000 | $0.3 / 1M | $0.3 / 1M | Yes | No | Jan 20, 2025 | |
NovitaAI novita-ai | DeepSeek V3 (Turbo) deepseek/deepseek-v3-turbo | Unknown | text | text | 64,000 | 16,000 | $0.4 / 1M | $1.3 / 1M | No | Yes | Mar 5, 2025 | |
OpenRouter openrouter | DeepSeek: R1 deepseek/deepseek-r1 | deepseek-thinking | text | text | 64,000 | 16,000 | $0.7 / 1M | $2.5 / 1M | Yes | Yes | Jan 20, 2025 | |
Vercel AI Gateway vercel | Mixtral 8x22B mistral/mixtral-8x22b-instruct | mixtral | text | text | 64,000 | 64,000 | $2 / 1M | $6 / 1M | No | Yes | Apr 17, 2024 | |
NovitaAI novita-ai | Mistral Nemo mistralai/mistral-nemo | mistral-nemo | text | text | 60,288 | 16,000 | $0.04 / 1M | $0.17 / 1M | No | No | Jul 30, 2024 | |
Vercel AI Gateway vercel | Mistral Nemo mistral/mistral-nemo | mistral-nemo | text | text | 60,288 | 16,000 | $0.04 / 1M | $0.17 / 1M | No | Yes | Jul 1, 2024 | |
Kilo Gateway kilo | Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct | Unknown | text | text | 60,000 | 12,000 | $0.027 / 1M | $0.2 / 1M | No | No | Jan 27, 2026 | |
NanoGPT nano-gpt | Perplexity Deep Research sonar-deep-research | Unknown | text | text | 60,000 | 128,000 | $3.4 / 1M | $13.6 / 1M | No | No | Feb 25, 2025 | |
Chutes chutes | Hermes 4 14B NousResearch/Hermes-4-14B | nousresearch | text | text | 40,960 | 40,960 | $0.01 / 1M | $0.05 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 14B Qwen/Qwen3-14B | qwen | text | text | 40,960 | 40,960 | $0.05 / 1M | $0.22 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 235B A22B Qwen/Qwen3-235B-A22B | qwen | text | text | 40,960 | 40,960 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 30B A3B Qwen/Qwen3-30B-A3B | qwen | text | text | 40,960 | 40,960 | $0.06 / 1M | $0.22 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3 32B Qwen/Qwen3-32B | qwen | text | text | 40,960 | 40,960 | $0.08 / 1M | $0.24 / 1M | Yes | Yes | Jan 10, 2026 | |
evroc evroc | Qwen3 Embedding 8B Qwen/Qwen3-Embedding-8B | text-embedding | text | text | 40,960 | 40,960 | $0.12 / 1M | $0.12 / 1M | No | No | Jul 30, 2025 | |
Jiekou.AI jiekou | Qwen3 235B A22B qwen/qwen3-235b-a22b-fp8 | qwen | text | text | 40,960 | 20,000 | $0.2 / 1M | $0.8 / 1M | Yes | No | Jan 1, 2026 | |
Jiekou.AI jiekou | Qwen3 30B A3B qwen/qwen3-30b-a3b-fp8 | qwen | text | text | 40,960 | 20,000 | $0.09 / 1M | $0.45 / 1M | Yes | No | Jan 1, 2026 | |
Jiekou.AI jiekou | Qwen3 32B qwen/qwen3-32b-fp8 | qwen | text | text | 40,960 | 20,000 | $0.1 / 1M | $0.45 / 1M | Yes | No | Jan 1, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 14B qwen/qwen3-14b | Unknown | text | text | 40,960 | 40,960 | $0.06 / 1M | $0.24 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b | Unknown | text | text | 40,960 | 40,960 | $0.08 / 1M | $0.28 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 32B qwen/qwen3-32b | Unknown | text | text | 40,960 | 40,960 | $0.08 / 1M | $0.24 / 1M | Yes | Yes | Feb 4, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 8B qwen/qwen3-8b | Unknown | text | text | 40,960 | 8,192 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
NovitaAI novita-ai | Qwen3 235B A22B qwen/qwen3-235b-a22b-fp8 | Unknown | text | text | 40,960 | 20,000 | $0.2 / 1M | $0.8 / 1M | Yes | No | Apr 29, 2025 | |
NovitaAI novita-ai | Qwen3 30B A3B qwen/qwen3-30b-a3b-fp8 | Unknown | text | text | 40,960 | 20,000 | $0.09 / 1M | $0.45 / 1M | Yes | No | Apr 29, 2025 | |
NovitaAI novita-ai | Qwen3 32B qwen/qwen3-32b-fp8 | Unknown | text | text | 40,960 | 20,000 | $0.1 / 1M | $0.45 / 1M | Yes | No | Apr 29, 2025 | |
Vercel AI Gateway vercel | Qwen 3 32B alibaba/qwen-3-32b | qwen | text | text | 40,960 | 16,384 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Apr 1, 2025 | |
Vercel AI Gateway vercel | Qwen3 235B A22B Instruct 2507 alibaba/qwen-3-235b | qwen | text | text | 40,960 | 16,384 | $0.13 / 1M | $0.6 / 1M | No | Yes | Apr 1, 2025 | |
Vercel AI Gateway vercel | Qwen3-14B alibaba/qwen-3-14b | qwen | text | text | 40,960 | 16,384 | $0.06 / 1M | $0.24 / 1M | Yes | Yes | Apr 1, 2025 | |
Vercel AI Gateway vercel | Qwen3-30B-A3B alibaba/qwen-3-30b | qwen | text | text | 40,960 | 16,384 | $0.08 / 1M | $0.29 / 1M | Yes | Yes | Apr 1, 2025 | |
Qiniu qiniu-ai | Qwen3 30B A3B qwen3-30b-a3b | Unknown | text | text | 40,000 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen3 32B qwen3-32b | Unknown | text | text | 40,000 | 4,096 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-14B-Instruct Qwen/Qwen2.5-14B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.1 / 1M | $0.1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-32B-Instruct Qwen/Qwen2.5-32B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.18 / 1M | $0.18 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-72B-Instruct Qwen/Qwen2.5-72B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.59 / 1M | $0.59 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-7B-Instruct Qwen/Qwen2.5-7B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.05 / 1M | $0.05 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-Coder-32B-Instruct Qwen/Qwen2.5-Coder-32B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.18 / 1M | $0.18 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | tencent/Hunyuan-MT-7B tencent/Hunyuan-MT-7B | hunyuan | text | text | 33,000 | 33,000 | $0 / 1M | $0 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | THUDM/GLM-4-32B-0414 THUDM/GLM-4-32B-0414 | glm | text | text | 33,000 | 33,000 | $0.27 / 1M | $0.27 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | THUDM/GLM-4-9B-0414 THUDM/GLM-4-9B-0414 | glm | text | text | 33,000 | 33,000 | $0.086 / 1M | $0.086 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | meta-llama/Meta-Llama-3.1-8B-Instruct meta-llama/Meta-Llama-3.1-8B-Instruct | llama | text | text | 33,000 | 4,000 | $0.06 / 1M | $0.06 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-14B-Instruct Qwen/Qwen2.5-14B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.1 / 1M | $0.1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-32B-Instruct Qwen/Qwen2.5-32B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.18 / 1M | $0.18 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-72B-Instruct Qwen/Qwen2.5-72B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.59 / 1M | $0.59 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-7B-Instruct Qwen/Qwen2.5-7B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.05 / 1M | $0.05 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-Coder-32B-Instruct Qwen/Qwen2.5-Coder-32B-Instruct | qwen | text | text | 33,000 | 4,000 | $0.18 / 1M | $0.18 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | tencent/Hunyuan-MT-7B tencent/Hunyuan-MT-7B | hunyuan | text | text | 33,000 | 33,000 | $0 / 1M | $0 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | THUDM/GLM-4-32B-0414 THUDM/GLM-4-32B-0414 | glm | text | text | 33,000 | 33,000 | $0.27 / 1M | $0.27 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | THUDM/GLM-4-9B-0414 THUDM/GLM-4-9B-0414 | glm | text | text | 33,000 | 33,000 | $0.086 / 1M | $0.086 / 1M | No | Yes | Nov 25, 2025 | |
Abacus abacus | QwQ 32B Qwen/QwQ-32B | qwen | text | text | 32,768 | 32,768 | $0.4 / 1M | $0.4 / 1M | Yes | Yes | Nov 28, 2024 | |
Alibaba (China) alibaba-cn | DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 32,768 | 16,384 | $0.287 / 1M | $0.861 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 Distill Llama 8B deepseek-r1-distill-llama-8b | deepseek-thinking | text | text | 32,768 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 Distill Qwen 1.5B deepseek-r1-distill-qwen-1-5b | qwen | text | text | 32,768 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 Distill Qwen 14B deepseek-r1-distill-qwen-14b | qwen | text | text | 32,768 | 16,384 | $0.144 / 1M | $0.431 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 Distill Qwen 32B deepseek-r1-distill-qwen-32b | qwen | text | text | 32,768 | 16,384 | $0.287 / 1M | $0.861 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | DeepSeek R1 Distill Qwen 7B deepseek-r1-distill-qwen-7b | qwen | text | text | 32,768 | 16,384 | $0.072 / 1M | $0.144 / 1M | Yes | Yes | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen Plus Character qwen-plus-character | qwen | text | text | 32,768 | 4,096 | $0.115 / 1M | $0.287 / 1M | No | Yes | Jan 1, 2024 | |
Alibaba alibaba | Qwen Max qwen-max | qwen | text | text | 32,768 | 8,192 | $1.6 / 1M | $6.4 / 1M | No | Yes | Jan 25, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-4 32K gpt-4-32k | gpt | text | text | 32,768 | 32,768 | $60 / 1M | $120 / 1M | No | Yes | Mar 14, 2023 | |
Azure azure | GPT-4 32K gpt-4-32k | gpt | text | text | 32,768 | 32,768 | $60 / 1M | $120 / 1M | No | Yes | Mar 14, 2023 | |
Chutes chutes | DeepHermes 3 Mistral 24B Preview NousResearch/DeepHermes-3-Mistral-24B-Preview | nousresearch | text | text | 32,768 | 32,768 | $0.02 / 1M | $0.1 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Hermes 4.3 36B NousResearch/Hermes-4.3-36B | nousresearch | text | text | 32,768 | 8,192 | $0.1 / 1M | $0.39 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | Llama 3.2 1B Instruct unsloth/Llama-3.2-1B-Instruct | Unknown | text | text | 32,768 | 8,192 | $0.01 / 1M | $0.01 / 1M | No | No | Jan 27, 2026 | |
Chutes chutes | Qwen2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct | qwen | text | text | 32,768 | 32,768 | $0.13 / 1M | $0.52 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen2.5 Coder 32B Instruct Qwen/Qwen2.5-Coder-32B-Instruct | qwen | text | text | 32,768 | 32,768 | $0.03 / 1M | $0.11 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | Qwen3Guard Gen 0.6B Qwen/Qwen3Guard-Gen-0.6B | qwen | text | text | 32,768 | 8,192 | $0.01 / 1M | $0.01 / 1M | No | No | Jan 10, 2026 | |
evroc evroc | Devstral Small 2 24B Instruct 2512 mistralai/devstral-small-2-24b-instruct-2512 | devstral | text | text | 32,768 | 32,768 | $0.12 / 1M | $0.47 / 1M | No | Yes | Dec 1, 2025 | |
Groq groq | Mistral Saba 24B mistral-saba-24b | mistral | text | text | 32,768 | 32,768 | $0.79 / 1M | $0.79 / 1M | No | Yes | Feb 6, 2025 | |
IO.NET io-net | Kimi K2 Instruct moonshotai/Kimi-K2-Instruct-0905 | kimi | text | text | 32,768 | 4,096 | $0.39 / 1M | $1.9 / 1M | No | Yes | Sep 5, 2024 | |
IO.NET io-net | Kimi K2 Thinking moonshotai/Kimi-K2-Thinking | kimi-thinking | text | text | 32,768 | 4,096 | $0.55 / 1M | $2.25 / 1M | Yes | Yes | Nov 1, 2024 | |
Kilo Gateway kilo | AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b | Unknown | text | text | 32,768 | 32,768 | $0.8 / 1M | $1.6 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Arcee AI: Coder Large arcee-ai/coder-large | Unknown | text | text | 32,768 | 32,768 | $0.5 / 1M | $0.8 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 | Unknown | text | text | 32,768 | 7,168 | $0.15 / 1M | $0.75 / 1M | Yes | Yes | Aug 21, 2025 | |
Kilo Gateway kilo | DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | Unknown | text | text | 32,768 | 32,768 | $0.29 / 1M | $0.29 / 1M | Yes | No | Nov 25, 2025 | |
Kilo Gateway kilo | EssentialAI: Rnj 1 Instruct essentialai/rnj-1-instruct | Unknown | text | text | 32,768 | 6,554 | $0.15 / 1M | $0.15 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemma 3n 4B google/gemma-3n-e4b-it | Unknown | text | text | 32,768 | 6,554 | $0.02 / 1M | $0.04 / 1M | No | No | May 20, 2025 | |
Kilo Gateway kilo | LiquidAI: LFM2-24B-A2B liquid/lfm-2-24b-a2b | Unknown | text | text | 32,768 | 32,768 | $0.03 / 1M | $0.12 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 | Unknown | text | text | 32,768 | 16,384 | $0.05 / 1M | $0.08 / 1M | No | Yes | Jan 10, 2026 | |
Kilo Gateway kilo | Mistral: Mistral Small Creative mistralai/mistral-small-creative | Unknown | text | text | 32,768 | 32,768 | $0.1 / 1M | $0.3 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct | Unknown | text | text | 32,768 | 16,384 | $0.54 / 1M | $0.54 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Saba mistralai/mistral-saba | Unknown | text | text | 32,768 | 32,768 | $0.2 / 1M | $0.6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen-Max qwen/qwen-max | Unknown | text | text | 32,768 | 8,192 | $1.04 / 1M | $4.16 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | Unknown | text | text | 32,768 | 6,554 | $0.04 / 1M | $0.1 / 1M | No | Yes | Apr 16, 2025 | |
Kilo Gateway kilo | Qwen: Qwen2.5 Coder 7B Instruct qwen/qwen2.5-coder-7b-instruct | Unknown | text | text | 32,768 | 6,554 | $0.03 / 1M | $0.09 / 1M | No | No | Nov 1, 2024 | |
Kilo Gateway kilo | Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | Unknown | text | text | 32,768 | 6,554 | $0.051 / 1M | $0.34 / 1M | Yes | Yes | Jul 29, 2025 | |
Kilo Gateway kilo | Qwen: QwQ 32B qwen/qwq-32b | Unknown | text | text | 32,768 | 32,768 | $0.15 / 1M | $0.4 / 1M | Yes | Yes | Apr 11, 2025 | |
Kilo Gateway kilo | Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | Unknown | text | text | 32,768 | 16,384 | $0.12 / 1M | $0.39 / 1M | No | Yes | Jan 10, 2026 | |
Kilo Gateway kilo | Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | Unknown | text | text | 32,768 | 8,192 | $0.2 / 1M | $0.2 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | TheDrummer: Rocinante 12B thedrummer/rocinante-12b | Unknown | text | text | 32,768 | 32,768 | $0.17 / 1M | $0.43 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2 | Unknown | text | text | 32,768 | 32,768 | $0.55 / 1M | $0.8 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b | Unknown | text | text | 32,768 | 32,768 | $0.4 / 1M | $0.4 / 1M | No | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | Llama 3.2 3B Instruct llama-3.2-3b-instruct | llama | text | text | 32,768 | 32,000 | $0.03 / 1M | $0.05 / 1M | No | No | Sep 18, 2024 | |
LLM Gateway llmgateway | Llama 4 Scout llama-4-scout | llama | text | text | 32,768 | 16,384 | $0.18 / 1M | $0.59 / 1M | No | Yes | Apr 5, 2025 | |
LLM Gateway llmgateway | Qwen Max qwen-max | qwen | text | text | 32,768 | 8,192 | $1.6 / 1M | $6.4 / 1M | No | Yes | Jan 25, 2025 | |
NanoGPT nano-gpt | Amoral Gemma3 27B v2 soob3123/amoral-gemma3-27B-v2 | gemma | text | text | 32,768 | 8,192 | $0.3 / 1M | $0.3 / 1M | No | No | May 23, 2025 | |
NanoGPT nano-gpt | Baichuan 4 Air Baichuan4-Air | Unknown | text | text | 32,768 | 32,768 | $0.157 / 1M | $0.157 / 1M | No | No | Aug 19, 2025 | |
NanoGPT nano-gpt | Baichuan M2 32B Medical Baichuan-M2 | Unknown | text | text | 32,768 | 32,768 | $15.73 / 1M | $15.73 / 1M | No | No | Aug 19, 2025 | |
NanoGPT nano-gpt | Damascus R1 Llama-3.3-70B-Damascus-R1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | DMind-1 dmind/dmind-1 | gpt | text | text | 32,768 | 8,192 | $0.3 / 1M | $0.6 / 1M | No | No | Jun 1, 2025 | |
NanoGPT nano-gpt | DMind-1-Mini dmind/dmind-1-mini | gpt | text | text | 32,768 | 8,192 | $0.2 / 1M | $0.4 / 1M | No | No | Jun 1, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B Big Tiger v3 Gemma-3-27B-Big-Tiger-v3 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Aug 8, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B CardProjector v4 Gemma-3-27B-CardProjector-v4 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B Glitter Gemma-3-27B-Glitter | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B IT Gemma-3-27B-it | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B IT Abliterated Gemma-3-27B-it-Abliterated | Unknown | text | text | 32,768 | 96,000 | $0.42 / 1M | $0.42 / 1M | No | No | Jul 3, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B Nidum Uncensored Gemma-3-27B-Nidum-Uncensored | Unknown | text | text | 32,768 | 96,000 | $0.306 / 1M | $0.306 / 1M | No | No | Aug 8, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B RPMax v3 Gemma-3-27B-ArliAI-RPMax-v3 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 3, 2025 | |
NanoGPT nano-gpt | Llama 3.1 8b (uncensored) aion-labs/aion-rp-llama-3.1-8b | llama | text | text | 32,768 | 16,384 | $0.2006 / 1M | $0.2006 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Anthrobomination Llama-3.3-70B-Anthrobomination | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Argunaut 1 SFT Llama-3.3-70B-Argunaut-1-SFT | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B ArliAI RPMax v2 Llama-3.3-70B-ArliAI-RPMax-v2 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Aug 8, 2025 | |
NanoGPT nano-gpt | Llama 3.3 70B ArliAI RPMax v3 Llama-3.3-70B-ArliAI-RPMax-v3 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Aurora Borealis Llama-3.3-70B-Aurora-Borealis | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Bigger Body Llama-3.3-70B-Bigger-Body | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Cirrus x1 Llama-3.3-70B-Cirrus-x1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Cu Mai R1 Llama-3.3-70B-Cu-Mai-R1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Dark Ages v0.1 Llama-3.3-70B-Dark-Ages-v0.1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Electra R1 Llama-3.3-70B-Electra-R1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Electranova v1.0 Llama-3.3-70B-Electranova-v1.0 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Fallen R1 v1 Llama-3.3-70B-Fallen-R1-v1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Fallen v1 Llama-3.3-70B-Fallen-v1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Forgotten Abomination v5.0 Llama-3.3-70B-Forgotten-Abomination-v5.0 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Forgotten Safeword 3.6 Llama-3.3-70B-Forgotten-Safeword-3.6 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B GeneticLemonade Opus Llama-3.3-70B-GeneticLemonade-Opus | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B GeneticLemonade Unleashed v3 Llama-3.3-70B-GeneticLemonade-Unleashed-v3 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Ignition v0.1 Llama-3.3-70B-Ignition-v0.1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Incandescent Malevolence Llama-3.3-70B-Incandescent-Malevolence | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Legion V2.1 Llama-3.3-70B-Legion-V2.1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Magnum v4 SE Llama-3.3-70B-Magnum-v4-SE | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Magnum v4 SE Cirrus x1 SLERP Llama-3.3-70B-Magnum-v4-SE-Cirrus-x1-SLERP | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Llama 3.3 70B Mhnnn x1 Llama-3.3-70B-Mhnnn-x1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70b Mirai Fanfare Llama-3.3-70B-MiraiFanfare | Unknown | text | text | 32,768 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Llama 3.3 70B Mokume Gane R1 Llama-3.3-70B-Mokume-Gane-R1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B MS Nevoria Llama-3.3-70B-MS-Nevoria | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Nova Llama-3.3-70B-Nova | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Omega Directive Unslop v2.0 Llama-3.3-70B-The-Omega-Directive-Unslop-v2.0 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Omega Directive Unslop v2.1 Llama-3.3-70B-The-Omega-Directive-Unslop-v2.1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Predatorial Extasy Llama-3.3-70B-Predatorial-Extasy | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Progenitor V3.3 Llama-3.3-70B-Progenitor-V3.3 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Llama 3.3 70B RAWMAW Llama-3.3-70B-RAWMAW | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B RPMax v1.4 Llama-3.3-70B-ArliAI-RPMax-v1.4 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Sapphira 0.1 Llama-3.3-70B-Sapphira-0.1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Sapphira 0.2 Llama-3.3-70B-Sapphira-0.2 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Shakudo Llama-3.3-70B-Shakudo | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B StrawberryLemonade v1.0 Llama-3.3-70B-StrawberryLemonade-v1.0 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B StrawberryLemonade v1.2 Llama-3.3-70B-Strawberrylemonade-v1.2 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Vulpecula R1 Llama-3.3-70B-Vulpecula-R1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3+ 70B Hanami x1 Llama-3.3+(3.1v3.3)-70B-Hanami-x1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3+ 70B New Dawn v1.1 Llama-3.3+(3.1v3.3)-70B-New-Dawn-v1.1 | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3+ 70B TenyxChat DaybreakStorywriter Llama-3.3+(3v3.3)-70B-TenyxChat-DaybreakStorywriter | Unknown | text | text | 32,768 | 16,384 | $0.306 / 1M | $0.306 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Magistral Small 2506 Magistral-Small-2506 | Unknown | text | text | 32,768 | 32,768 | $0.4 / 1M | $1.4 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | Manta Pro 1.0 meganova-ai/manta-pro-1.0 | nova | text | text | 32,768 | 32,768 | $0.06 / 1M | $0.5 / 1M | No | No | Dec 20, 2025 | |
NanoGPT nano-gpt | MiroThinker v1.5 235B miromind-ai/mirothinker-v1.5-235b | gpt | text | text | 32,768 | 4,000 | $0.3 / 1M | $1.2 / 1M | No | No | Jan 7, 2026 | |
NanoGPT nano-gpt | Mistral Devstral Small 2505 mistralai/Devstral-Small-2505 | devstral | text | text | 32,768 | 8,192 | $0.06 / 1M | $0.06 / 1M | No | No | Aug 2, 2025 | |
NanoGPT nano-gpt | Mistral Small Creative mistralai/mistral-small-creative | mistral-small | text | text | 32,768 | 32,768 | $0.1 / 1M | $0.3 / 1M | No | Yes | Dec 16, 2025 | |
NanoGPT nano-gpt | Mixtral 8x7B mistralai/mixtral-8x7b-instruct-v0.1 | mixtral | text | text | 32,768 | 32,768 | $0.27 / 1M | $0.27 / 1M | No | No | Dec 11, 2025 | |
NanoGPT nano-gpt | NemoMix 12B Unleashed MarinaraSpaghetti/NemoMix-Unleashed-12B | mistral-nemo | text | text | 32,768 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | OpenReasoning Nemotron 32B pamanseau/OpenReasoning-Nemotron-32B | nemotron | text | text | 32,768 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | No | Aug 21, 2025 | |
NanoGPT nano-gpt | Qwen 2.5 32B Abliterated huihui-ai/Qwen2.5-32B-Instruct-abliterated | qwen | text | text | 32,768 | 8,192 | $0.7 / 1M | $0.7 / 1M | No | No | Jan 6, 2025 | |
NanoGPT nano-gpt | QwQ 32b Arli V1 QwQ-32B-ArliAI-RpR-v1 | Unknown | text | text | 32,768 | 32,768 | $0.2 / 1M | $0.2 / 1M | No | No | Feb 17, 2025 | |
NanoGPT nano-gpt | Shisa V2.1 Llama 3.3 70B shisa-ai/shisa-v2.1-llama3.3-70b | llama | text | text | 32,768 | 4,096 | $0.5 / 1M | $0.5 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | The Drummer Cydonia 24B v4.3 TheDrummer 2/Cydonia-24B-v4.3 | llama | text | text | 32,768 | 32,768 | $0.1003 / 1M | $0.1207 / 1M | No | No | Dec 25, 2025 | |
NanoGPT nano-gpt | The Drummer Magidonia 24B v4.3 TheDrummer 2/Magidonia-24B-v4.3 | llama | text | text | 32,768 | 32,768 | $0.1003 / 1M | $0.1207 / 1M | No | No | Dec 25, 2025 | |
NanoGPT nano-gpt | Universal Summarizer universal-summarizer | Unknown | text | text | 32,768 | 32,768 | $30 / 1M | $30 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | Veiled Calla 12B soob3123/Veiled-Calla-12B | llama | text | text | 32,768 | 8,192 | $0.3 / 1M | $0.3 / 1M | No | No | Apr 13, 2025 | |
NanoGPT nano-gpt | Web Answer fastgpt | Unknown | text | text | 32,768 | 32,768 | $7.5 / 1M | $7.5 / 1M | No | No | Jan 1, 2024 | |
Nebius Token Factory nebius | BGE-ICL BAAI/bge-en-icl | text-embedding | text | text | 32,768 | Unknown | $0.01 / 1M | $0 / 1M | No | No | Feb 4, 2026 | |
Nebius Token Factory nebius | e5-mistral-7b-instruct intfloat/e5-mistral-7b-instruct | text-embedding | text | text | 32,768 | Unknown | $0.01 / 1M | $0 / 1M | No | No | Feb 4, 2026 | |
Nebius Token Factory nebius | Qwen3-Embedding-8B Qwen/Qwen3-Embedding-8B | text-embedding | text | text | 32,768 | Unknown | $0.01 / 1M | $0 / 1M | No | No | Feb 4, 2026 | |
NovitaAI novita-ai | DeepSeek R1 Distill Qwen 14B deepseek/deepseek-r1-distill-qwen-14b | deepseek-thinking | text | text | 32,768 | 16,384 | $0.15 / 1M | $0.15 / 1M | Yes | No | Jan 20, 2025 | |
NovitaAI novita-ai | Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct | llama | text | text | 32,768 | 32,000 | $0.03 / 1M | $0.05 / 1M | No | No | Sep 18, 2024 | |
Nvidia nvidia | Llama Embed Nemotron 8B nvidia/llama-embed-nemotron-8b | llama | text | text | 32,768 | 2,048 | $0 / 1M | $0 / 1M | No | No | Mar 18, 2025 | |
Ollama Cloud ollama-cloud | rnj-1:8b rnj-1:8b | rnj | text | text | 32,768 | 4,096 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
OpenRouter openrouter | Gemma 3n 4B google/gemma-3n-e4b-it | gemma | text | text | 32,768 | 32,768 | $0.02 / 1M | $0.04 / 1M | No | No | May 20, 2025 | |
OpenRouter openrouter | Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | qwen | text | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Nov 11, 2024 | |
OpenRouter openrouter | Uncensored (free) cognitivecomputations/dolphin-mistral-24b-venice-edition:free | mistral | text | text | 32,768 | 32,768 | $0 / 1M | $0 / 1M | No | No | Jan 31, 2026 | |
OVHcloud AI Endpoints ovhcloud | Qwen3-32B qwen3-32b | Unknown | text | text | 32,768 | 32,768 | $0.09 / 1M | $0.25 / 1M | Yes | Yes | Jul 16, 2025 | |
Regolo AI regolo-ai | Qwen3-Embedding-8B qwen3-embedding-8b | qwen | text | text | 32,768 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | No | Feb 1, 2026 | |
Regolo AI regolo-ai | Qwen3-Reranker-4B qwen3-reranker-4b | qwen | text | text | 32,768 | 8,192 | $0.12 / 1M | $0.12 / 1M | No | No | Feb 1, 2026 | |
Scaleway scaleway | Qwen3 Embedding 8B qwen3-embedding-8b | qwen | text | text | 32,768 | 4,096 | $0.1 / 1M | $0 / 1M | No | No | Mar 17, 2026 | |
StepFun stepfun | Step 1 (32K) step-1-32k | Unknown | text | text | 32,768 | 32,768 | $2.05 / 1M | $9.59 / 1M | Yes | Yes | Feb 13, 2026 | |
Synthetic synthetic | Qwen2.5-Coder-32B-Instruct hf:Qwen/Qwen2.5-Coder-32B-Instruct | qwen | text | text | 32,768 | 32,768 | $0.8 / 1M | $0.8 / 1M | No | No | Nov 11, 2024 | |
Together AI togetherai | Rnj-1 Instruct essentialai/Rnj-1-Instruct | rnj | text | text | 32,768 | 32,768 | $0.15 / 1M | $0.15 / 1M | No | Yes | Dec 5, 2025 | |
Upstage upstage | solar-mini solar-mini | solar-mini | text | text | 32,768 | 4,096 | $0.15 / 1M | $0.15 / 1M | No | Yes | Apr 22, 2025 | |
Vercel AI Gateway vercel | LongCat Flash Thinking 2601 meituan/longcat-flash-thinking-2601 | longcat | text | text | 32,768 | 32,768 | Unknown | Unknown | Yes | No | Mar 13, 2026 | |
Vercel AI Gateway vercel | Qwen3 Embedding 0.6B alibaba/qwen3-embedding-0.6b | qwen | text | text | 32,768 | 32,768 | $0.01 / 1M | $0 / 1M | No | No | Nov 14, 2025 | |
Vercel AI Gateway vercel | Qwen3 Embedding 4B alibaba/qwen3-embedding-4b | qwen | text | text | 32,768 | 32,768 | $0.02 / 1M | $0 / 1M | No | No | Jun 5, 2025 | |
Vercel AI Gateway vercel | Qwen3 Embedding 8B alibaba/qwen3-embedding-8b | qwen | text | text | 32,768 | 32,768 | $0.05 / 1M | $0 / 1M | No | No | Jun 5, 2025 | |
Weights & Biases wandb | OpenPipe Qwen3 14B Instruct OpenPipe/Qwen3-14B-Instruct | qwen | text | text | 32,768 | 32,768 | $0.05 / 1M | $0.22 / 1M | No | Yes | Mar 12, 2026 | |
NanoGPT nano-gpt | Gemini 2.0 Flash Thinking 1219 gemini-2.0-flash-thinking-exp-1219 | Unknown | text | text | 32,767 | 8,192 | $0.1003 / 1M | $0.408 / 1M | No | No | Dec 19, 2024 | |
NanoGPT nano-gpt | Gemini LearnLM Experimental learnlm-1.5-pro-experimental | Unknown | text | text | 32,767 | 8,192 | $3.502 / 1M | $10.506 / 1M | No | No | May 14, 2024 | |
NanoGPT nano-gpt | Gemini Text + Image gemini-2.0-flash-exp-image-generation | Unknown | text | text | 32,767 | 8,192 | $0.2 / 1M | $0.8 / 1M | No | No | Feb 19, 2025 | |
Azure Cognitive Services azure-cognitive-services | Phi-4-reasoning phi-4-reasoning | phi | text | text | 32,000 | 4,096 | $0.125 / 1M | $0.5 / 1M | Yes | No | Dec 11, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-4-reasoning-plus phi-4-reasoning-plus | phi | text | text | 32,000 | 4,096 | $0.125 / 1M | $0.5 / 1M | Yes | No | Dec 11, 2024 | |
Azure azure | Phi-4-reasoning phi-4-reasoning | phi | text | text | 32,000 | 4,096 | $0.125 / 1M | $0.5 / 1M | Yes | No | Dec 11, 2024 | |
Azure azure | Phi-4-reasoning-plus phi-4-reasoning-plus | phi | text | text | 32,000 | 4,096 | $0.125 / 1M | $0.5 / 1M | Yes | No | Dec 11, 2024 | |
Berget.AI berget | Mistral Small 3.2 24B Instruct 2506 mistralai/Mistral-Small-3.2-24B-Instruct-2506 | mistral-small | text | text | 32,000 | 8,192 | $0.33 / 1M | $0.33 / 1M | Yes | Yes | Oct 1, 2025 | |
Cerebras cerebras | Llama 3.1 8B llama3.1-8b | llama | text | text | 32,000 | 8,000 | $0.1 / 1M | $0.1 / 1M | No | Yes | Jan 1, 2025 | |
CloudFerro Sherlock cloudferro-sherlock | Bielik 11B v2.6 Instruct speakleash/Bielik-11B-v2.6-Instruct | Unknown | text | text | 32,000 | 32,000 | $0.67 / 1M | $0.67 / 1M | No | Yes | Mar 13, 2025 | |
CloudFerro Sherlock cloudferro-sherlock | Bielik 11B v3.0 Instruct speakleash/Bielik-11B-v3.0-Instruct | Unknown | text | text | 32,000 | 32,000 | $0.67 / 1M | $0.67 / 1M | No | Yes | Mar 13, 2025 | |
GitHub Models github-models | Codestral 25.01 mistral-ai/codestral-2501 | codestral | text | text | 32,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 1, 2025 | |
Helicone helicone | Qwen2.5 Coder 7B fast qwen2.5-coder-7b-fast | qwen | text | text | 32,000 | 8,192 | $0.03 / 1M | $0.09 / 1M | No | No | Sep 15, 2024 | |
Hugging Face huggingface | Qwen 3 Embedding 4B Qwen/Qwen3-Embedding-4B | qwen | text | text | 32,000 | 2,048 | $0.01 / 1M | $0 / 1M | No | No | Jan 1, 2025 | |
Hugging Face huggingface | Qwen 3 Embedding 8B Qwen/Qwen3-Embedding-8B | qwen | text | text | 32,000 | 4,096 | $0.01 / 1M | $0 / 1M | No | No | Jan 1, 2025 | |
Inference inference | Qwen 3 Embedding 4B qwen/qwen3-embedding-4b | qwen | text | text | 32,000 | 2,048 | $0.01 / 1M | $0 / 1M | No | No | Jan 1, 2025 | |
Mistral mistral | Mixtral 8x7B open-mixtral-8x7b | mixtral | text | text | 32,000 | 32,000 | $0.7 / 1M | $0.7 / 1M | No | Yes | Dec 11, 2023 | |
Morph morph | Auto auto | auto | text | text | 32,000 | 32,000 | $0.85 / 1M | $1.55 / 1M | No | No | Jun 1, 2024 | |
Morph morph | Morph v3 Large morph-v3-large | morph | text | text | 32,000 | 32,000 | $0.9 / 1M | $1.9 / 1M | No | No | Aug 15, 2024 | |
NanoGPT nano-gpt | Doubao 1.5 Pro 32k doubao-1.5-pro-32k | Unknown | text | text | 32,000 | 8,192 | $0.1343 / 1M | $0.3349 / 1M | No | No | Jan 22, 2025 | |
NanoGPT nano-gpt | Ernie X1 32k ernie-x1-32k-preview | Unknown | text | text | 32,000 | 16,384 | $0.33 / 1M | $1.32 / 1M | No | No | Apr 3, 2025 | |
NanoGPT nano-gpt | GLM 4 9B 0414 THUDM/GLM-4-9B-0414 | glm | text | text | 32,000 | 8,000 | $0.2 / 1M | $0.2 / 1M | No | No | Apr 14, 2025 | |
NanoGPT nano-gpt | GLM Z1 9B 0414 THUDM/GLM-Z1-9B-0414 | glm-z | text | text | 32,000 | 8,000 | $0.2 / 1M | $0.2 / 1M | No | No | Apr 14, 2025 | |
NanoGPT nano-gpt | GLM Z1 Air glm-z1-air | Unknown | text | text | 32,000 | 16,384 | $0.07 / 1M | $0.07 / 1M | No | Yes | Apr 15, 2025 | |
NanoGPT nano-gpt | GLM Z1 AirX glm-z1-airx | Unknown | text | text | 32,000 | 16,384 | $0.7 / 1M | $0.7 / 1M | No | Yes | Apr 15, 2025 | |
NanoGPT nano-gpt | GLM Z1 Rumination 32B 0414 THUDM/GLM-Z1-Rumination-32B-0414 | glm-z | text | text | 32,000 | 65,536 | $0.2 / 1M | $0.2 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Mistral Saba mistralai/mistral-saba | mistral | text | text | 32,000 | 32,768 | $0.1989 / 1M | $0.595 / 1M | No | No | Feb 17, 2025 | |
NanoGPT nano-gpt | Mistral Tiny mistralai/mistral-tiny | mistral | text | text | 32,000 | 8,192 | $0.255 / 1M | $0.255 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | Qwen 2.5 Max qwen-max | Unknown | text | text | 32,000 | 8,192 | $1.5997 / 1M | $6.392 / 1M | No | No | Apr 3, 2024 | |
NanoGPT nano-gpt | Qwerky 72B featherless-ai/Qwerky-72B | qwerky | text | text | 32,000 | 8,192 | $0.5 / 1M | $0.5 / 1M | No | No | Mar 20, 2025 | |
NanoGPT nano-gpt | Yi Large yi-large | Unknown | text | text | 32,000 | 4,096 | $3.196 / 1M | $3.196 / 1M | No | No | May 13, 2024 | |
Nebius Token Factory nebius | Nemotron-3-Nano-30B-A3B nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B | Unknown | text | text | 32,000 | 4,096 | $0.06 / 1M | $0.24 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Nemotron-Nano-V2-12b nvidia/Nemotron-Nano-V2-12b | Unknown | text | text | 32,000 | 4,096 | $0.07 / 1M | $0.2 / 1M | No | Yes | Feb 4, 2026 | |
NovitaAI novita-ai | Qwen 2.5 72B Instruct qwen/qwen-2.5-72b-instruct | qwen | text | text | 32,000 | 8,192 | $0.38 / 1M | $0.4 / 1M | No | Yes | Oct 15, 2024 | |
NovitaAI novita-ai | Qwen2.5 7B Instruct qwen/qwen2.5-7b-instruct | Unknown | text | text | 32,000 | 32,000 | $0.07 / 1M | $0.07 / 1M | No | Yes | Apr 16, 2025 | |
Privatemode AI privatemode-ai | Qwen3-Embedding 4B qwen3-embedding-4b | qwen | text | text | 32,000 | 2,560 | $0 / 1M | $0 / 1M | No | No | Jun 6, 2025 | |
Scaleway scaleway | DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 32,000 | 8,196 | $0.9 / 1M | $0.9 / 1M | Yes | Yes | Mar 17, 2026 | |
Venice AI venice | Venice Uncensored 1.1 venice-uncensored | venice | text | text | 32,000 | 8,192 | $0.2 / 1M | $0.9 / 1M | No | No | Mar 12, 2026 | |
Vercel AI Gateway vercel | Mercury Coder Small Beta inception/mercury-coder-small | mercury | text | text | 32,000 | 16,384 | $0.25 / 1M | $1 / 1M | No | Yes | Apr 16, 2026 | |
Vercel AI Gateway vercel | Morph v3 Large morph/morph-v3-large | morph | text | text | 32,000 | 32,000 | $0.9 / 1M | $1.9 / 1M | No | No | Aug 15, 2024 | |
Vercel AI Gateway vercel | voyage-4 voyage/voyage-4 | voyage | text | text | 32,000 | Unknown | Unknown | Unknown | No | No | Mar 6, 2026 | |
Vercel AI Gateway vercel | voyage-4-large voyage/voyage-4-large | voyage | text | text | 32,000 | Unknown | Unknown | Unknown | No | No | Mar 6, 2026 | |
Vercel AI Gateway vercel | voyage-4-lite voyage/voyage-4-lite | voyage | text | text | 32,000 | Unknown | Unknown | Unknown | No | No | Mar 6, 2026 | |
NanoGPT nano-gpt | Qwen 2.5 32b EVA Qwen2.5-32B-EVA-v0.2 | Unknown | text | text | 24,576 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Sep 1, 2024 | |
NanoGPT nano-gpt | Hunyuan Turbo S hunyuan-turbos-20250226 | Unknown | text | text | 24,000 | 8,192 | $0.187 / 1M | $0.374 / 1M | No | No | Feb 27, 2025 | |
NanoGPT nano-gpt | Llama 3.1 70B Euryale Sao10K/L3.1-70B-Euryale-v2.2 | llama | text | text | 20,480 | 16,384 | $0.306 / 1M | $0.357 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Euryale Sao10K/L3.3-70B-Euryale-v2.3 | llama | text | text | 20,480 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 6, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-3.5-turbo openai/gpt-3.5-turbo | gpt | text | text | 16,385 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | No | Nov 6, 2023 | |
Kilo Gateway kilo | OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo | Unknown | text | text | 16,385 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | Yes | Nov 6, 2023 | |
Kilo Gateway kilo | OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | Unknown | text | text | 16,385 | 4,096 | $3 / 1M | $4 / 1M | No | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | GPT-3.5-turbo gpt-3.5-turbo | gpt | text | text | 16,385 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | No | Nov 6, 2023 | |
NanoGPT nano-gpt | GPT-3.5 Turbo openai/gpt-3.5-turbo | gpt | text | text | 16,385 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | No | Jan 1, 2024 | |
OpenAI openai | GPT-3.5-turbo gpt-3.5-turbo | gpt | text | text | 16,385 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | No | Nov 6, 2023 | |
Vercel AI Gateway vercel | GPT-3.5 Turbo openai/gpt-3.5-turbo | gpt | text | text | 16,385 | 4,096 | $0.5 / 1M | $1.5 / 1M | No | No | Mar 1, 2023 | |
Alibaba (China) alibaba-cn | Qwen-MT Plus qwen-mt-plus | qwen | text | text | 16,384 | 8,192 | $0.259 / 1M | $0.775 / 1M | No | No | Jan 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen-MT Turbo qwen-mt-turbo | qwen | text | text | 16,384 | 8,192 | $0.101 / 1M | $0.28 / 1M | No | No | Jan 1, 2025 | |
Alibaba alibaba | Qwen-MT Plus qwen-mt-plus | qwen | text | text | 16,384 | 8,192 | $2.46 / 1M | $7.37 / 1M | No | No | Jan 1, 2025 | |
Alibaba alibaba | Qwen-MT Turbo qwen-mt-turbo | qwen | text | text | 16,384 | 8,192 | $0.16 / 1M | $0.49 / 1M | No | No | Jan 1, 2025 | |
Amazon Bedrock amazon-bedrock | Qwen3 32B (dense) qwen.qwen3-32b-v1:0 | qwen | text | text | 16,384 | 16,384 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Sep 18, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-3.5 Turbo 0125 gpt-3.5-turbo-0125 | gpt | text | text | 16,384 | 16,384 | $0.5 / 1M | $1.5 / 1M | No | No | Jan 25, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-3.5 Turbo 0613 gpt-3.5-turbo-0613 | gpt | text | text | 16,384 | 16,384 | $3 / 1M | $4 / 1M | No | No | Jun 13, 2023 | |
Azure Cognitive Services azure-cognitive-services | GPT-3.5 Turbo 1106 gpt-3.5-turbo-1106 | gpt | text | text | 16,384 | 16,384 | $1 / 1M | $2 / 1M | No | No | Nov 6, 2023 | |
Azure azure | GPT-3.5 Turbo 0125 gpt-3.5-turbo-0125 | gpt | text | text | 16,384 | 16,384 | $0.5 / 1M | $1.5 / 1M | No | No | Jan 25, 2024 | |
Azure azure | GPT-3.5 Turbo 0613 gpt-3.5-turbo-0613 | gpt | text | text | 16,384 | 16,384 | $3 / 1M | $4 / 1M | No | No | Jun 13, 2023 | |
Azure azure | GPT-3.5 Turbo 1106 gpt-3.5-turbo-1106 | gpt | text | text | 16,384 | 16,384 | $1 / 1M | $2 / 1M | No | No | Nov 6, 2023 | |
Chutes chutes | Llama 3.2 3B Instruct unsloth/Llama-3.2-3B-Instruct | unsloth | text | text | 16,384 | 16,384 | $0.01 / 1M | $0.01 / 1M | No | No | Feb 12, 2025 | |
Cortecs cortecs | Qwen3 32B qwen3-32b | qwen | text | text | 16,384 | 16,384 | $0.099 / 1M | $0.33 / 1M | No | Yes | Apr 29, 2025 | |
Helicone helicone | Meta Llama 3.1 8B Instruct llama-3.1-8b-instruct | llama | text | text | 16,384 | 16,384 | $0.02 / 1M | $0.05 / 1M | No | Yes | Jul 23, 2024 | |
Kilo Gateway kilo | Magnum v4 72B anthracite-org/magnum-v4-72b | Unknown | text | text | 16,384 | 2,048 | $3 / 1M | $5 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | Unknown | text | text | 16,384 | 16,384 | $0.02 / 1M | $0.05 / 1M | No | Yes | Dec 23, 2025 | |
Kilo Gateway kilo | Microsoft: Phi 4 microsoft/phi-4 | Unknown | text | text | 16,384 | 16,384 | $0.06 / 1M | $0.14 / 1M | No | No | Dec 11, 2024 | |
NanoGPT nano-gpt | Brave (Research) brave-research | Unknown | text | text | 16,384 | 16,384 | $5 / 1M | $5 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | DeepSeek R1 Llama 70B Abliterated huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated | deepseek | text | text | 16,384 | 8,192 | $0.7 / 1M | $0.7 / 1M | Yes | No | Jan 20, 2025 | |
NanoGPT nano-gpt | DeepSeek R1 Qwen Abliterated huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated | qwen | text | text | 16,384 | 8,192 | $1.4 / 1M | $1.4 / 1M | Yes | No | Jan 20, 2025 | |
NanoGPT nano-gpt | EVA Llama 3.33 70B EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0 | llama | text | text | 16,384 | 16,384 | $2.006 / 1M | $2.006 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | EVA-LLaMA-3.33-70B-v0.1 EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1 | llama | text | text | 16,384 | 16,384 | $2.006 / 1M | $2.006 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | EVA-Qwen2.5-32B-v0.2 EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2 | qwen | text | text | 16,384 | 8,192 | $0.799 / 1M | $0.799 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | EVA-Qwen2.5-72B-v0.2 EVA-UNIT-01/EVA-Qwen2.5-72B-v0.2 | qwen | text | text | 16,384 | 8,192 | $0.799 / 1M | $0.799 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | Evayale 70b Steelskull/L3.3-MS-Evayale-70B | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Exa (Research Pro) exa-research-pro | Unknown | text | text | 16,384 | 16,384 | $2.5 / 1M | $2.5 / 1M | No | No | Jun 4, 2025 | |
NanoGPT nano-gpt | Grayline Qwen3 8B soob3123/GrayLine-Qwen3-8B | qwen | text | text | 16,384 | 32,768 | $0.3 / 1M | $0.3 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | L3.3 70B Loki v2.0 CrucibleLab/L3.3-70B-Loki-V2.0 | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Jan 22, 2026 | |
NanoGPT nano-gpt | Llama 3.05 Storybreaker Ministral 70b Envoid/Llama-3.05-NT-Storybreaker-Ministral-70B | llama | text | text | 16,384 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 1, 2024 | |
NanoGPT nano-gpt | Llama 3.1 70B Celeste v0.1 nothingiisreal/L3.1-70B-Celeste-V0.1-BF16 | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Llama 3.1 70B Dracarys 2 abacusai/Dracarys-72B-Instruct | llama | text | text | 16,384 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Aug 2, 2025 | |
NanoGPT nano-gpt | Llama 3.1 70B Hanami Sao10K/L3.1-70B-Hanami-x1 | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Cu Mai Steelskull/L3.3-Cu-Mai-R1-70b | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Llama 3.3 70B Instruct abliterated huihui-ai/Llama-3.3-70B-Instruct-abliterated | llama | text | text | 16,384 | 16,384 | $0.7 / 1M | $0.7 / 1M | No | No | Aug 8, 2025 | |
NanoGPT nano-gpt | Llama 3.3 70B Wayfarer LatitudeGames/Wayfarer-Large-70B-Llama-3.3 | llama | text | text | 16,384 | 16,384 | $0.7 / 1M | $0.7 / 1M | No | No | Feb 20, 2025 | |
NanoGPT nano-gpt | Lumimaid v0.2 NeverSleep/Lumimaid-v0.2-70B | llama | text | text | 16,384 | 8,192 | $1 / 1M | $1.5 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Mag Mell R1 inflatebot/MN-12B-Mag-Mell-R1 | mistral-nemo | text | text | 16,384 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Magnum V2 72B anthracite-org/magnum-v2-72b | llama | text | text | 16,384 | 8,192 | $2.006 / 1M | $2.992 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Manta Flash 1.0 meganova-ai/manta-flash-1.0 | nova | text | text | 16,384 | 16,384 | $0.02 / 1M | $0.16 / 1M | No | No | Dec 20, 2025 | |
NanoGPT nano-gpt | Mistral Nemo mistralai/Mistral-Nemo-Instruct-2407 | mistral-nemo | text | text | 16,384 | 8,192 | $0.1003 / 1M | $0.1207 / 1M | No | No | Jul 18, 2024 | |
NanoGPT nano-gpt | Mistral Nemo Inferor 12B Infermatic/MN-12B-Inferor-v0.0 | mistral-nemo | text | text | 16,384 | 8,192 | $0.255 / 1M | $0.493 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Mistral Nemo Starcannon 12b v1 VongolaChouko/Starcannon-Unleashed-12B-v1.0 | mistral-nemo | text | text | 16,384 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | MN-LooseCannon-12B-v1 GalrionSoftworks/MN-LooseCannon-12B-v1 | mistral-nemo | text | text | 16,384 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | MS Evalebis 70b Steelskull/L3.3-MS-Evalebis-70b | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | MS3.2 24B Magnum Diamond Doctor-Shotgun/MS3.2-24B-Magnum-Diamond | mistral | text | text | 16,384 | 32,768 | $0.493 / 1M | $0.493 / 1M | No | No | Nov 24, 2025 | |
NanoGPT nano-gpt | Nemotron 3.1 70B abliterated huihui-ai/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated | nemotron | text | text | 16,384 | 16,384 | $0.7 / 1M | $0.7 / 1M | No | No | Jul 23, 2024 | |
NanoGPT nano-gpt | Nemotron Tenyxchat Storybreaker 70b Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B | nemotron | text | text | 16,384 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 1, 2024 | |
NanoGPT nano-gpt | Nvidia Nemotron 70b nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | nemotron | text | text | 16,384 | 8,192 | $0.357 / 1M | $0.408 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Omega Directive 24B Unslop v2.0 ReadyArt/MS3.2-The-Omega-Directive-24B-Unslop-v2.0 | llama | text | text | 16,384 | 32,768 | $0.5 / 1M | $0.5 / 1M | No | No | Dec 8, 2025 | |
NanoGPT nano-gpt | Rocinante 12b TheDrummer 2/Rocinante-12B-v1.1 | llama | text | text | 16,384 | 8,192 | $0.408 / 1M | $0.595 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Sao10K Stheno 8b Sao10K/L3-8B-Stheno-v3.2 | llama | text | text | 16,384 | 8,192 | $0.2006 / 1M | $0.2006 / 1M | No | No | Nov 29, 2024 | |
NanoGPT nano-gpt | Steelskull Electra R1 70b Steelskull/L3.3-Electra-R1-70b | llama | text | text | 16,384 | 16,384 | $0.69989 / 1M | $0.69989 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Steelskull Nevoria 70b Steelskull/L3.3-MS-Nevoria-70b | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | Steelskull Nevoria R1 70b Steelskull/L3.3-Nevoria-R1-70b | llama | text | text | 16,384 | 16,384 | $0.493 / 1M | $0.493 / 1M | No | No | Dec 6, 2024 | |
NanoGPT nano-gpt | The Drummer Cydonia 24B v2 TheDrummer 2/Cydonia-24B-v2 | llama | text | text | 16,384 | 32,768 | $0.1003 / 1M | $0.1207 / 1M | No | No | Feb 17, 2025 | |
NanoGPT nano-gpt | The Drummer Cydonia 24B v4 TheDrummer 2/Cydonia-24B-v4 | llama | text | text | 16,384 | 32,768 | $0.2006 / 1M | $0.2414 / 1M | No | No | Jul 22, 2025 | |
NanoGPT nano-gpt | The Drummer Cydonia 24B v4.1 TheDrummer 2/Cydonia-24B-v4.1 | llama | text | text | 16,384 | 32,768 | $0.1003 / 1M | $0.1207 / 1M | No | No | Aug 19, 2025 | |
NanoGPT nano-gpt | The Omega Abomination V1 ReadyArt/The-Omega-Abomination-L-70B-v1.0 | llama | text | text | 16,384 | 16,384 | $0.7 / 1M | $0.95 / 1M | No | No | Dec 1, 2024 | |
NovitaAI novita-ai | Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | llama | text | text | 16,384 | 16,384 | $0.02 / 1M | $0.05 / 1M | No | No | Jul 24, 2024 | |
NovitaAI novita-ai | Qwen MT Plus qwen/qwen-mt-plus | Unknown | text | text | 16,384 | 8,192 | $0.25 / 1M | $0.75 / 1M | No | No | Sep 3, 2025 | |
OpenRouter openrouter | DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | deepseek | text | text | 16,384 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 24, 2025 | |
StepFun stepfun | Step 2 (16K) step-2-16k | Unknown | text | text | 16,384 | 8,192 | $5.21 / 1M | $16.44 / 1M | Yes | Yes | Feb 13, 2026 | |
GitHub Models github-models | Phi-4 microsoft/phi-4 | phi | text | text | 16,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2024 | |
Inference inference | Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct | llama | text | text | 16,000 | 4,096 | $0.025 / 1M | $0.025 / 1M | No | Yes | Jan 1, 2025 | |
Inference inference | Llama 3.2 1B Instruct meta/llama-3.2-1b-instruct | llama | text | text | 16,000 | 4,096 | $0.01 / 1M | $0.01 / 1M | No | Yes | Jan 1, 2025 | |
Inference inference | Llama 3.2 3B Instruct meta/llama-3.2-3b-instruct | llama | text | text | 16,000 | 4,096 | $0.02 / 1M | $0.02 / 1M | No | Yes | Jan 1, 2025 | |
Inference inference | Mistral Nemo 12B Instruct mistral/mistral-nemo-12b-instruct | mistral-nemo | text | text | 16,000 | 4,096 | $0.038 / 1M | $0.1 / 1M | No | Yes | Jan 1, 2025 | |
Kilo Gateway kilo | Sao10K: Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 | Unknown | text | text | 16,000 | 16,000 | $3 / 1M | $3 / 1M | No | No | Mar 15, 2026 | |
Morph morph | Morph v3 Fast morph-v3-fast | morph | text | text | 16,000 | 16,000 | $0.8 / 1M | $1.2 / 1M | No | No | Aug 15, 2024 | |
NanoGPT nano-gpt | Step-2 16k Exp step-2-16k-exp | Unknown | text | text | 16,000 | 8,192 | $7.004 / 1M | $19.992 / 1M | No | No | Jul 5, 2024 | |
Requesty requesty | GPT-5 Nano openai/gpt-5-nano | gpt-nano | text | text | 16,000 | 4,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | Morph v3 Fast morph/morph-v3-fast | morph | text | text | 16,000 | 16,000 | $0.8 / 1M | $1.2 / 1M | No | No | Aug 15, 2024 | |
NanoGPT nano-gpt | Yi Lightning yi-lightning | Unknown | text | text | 12,000 | 4,096 | $0.2006 / 1M | $0.2006 / 1M | No | No | Oct 16, 2024 | |
Alibaba (China) alibaba-cn | Tongyi Intent Detect V3 tongyi-intent-detect-v3 | yi | text | text | 8,192 | 1,024 | $0.058 / 1M | $0.144 / 1M | No | No | Jan 1, 2024 | |
Alibaba alibaba | Qwen Plus Character (Japanese) qwen-plus-character-ja | qwen | text | text | 8,192 | 512 | $0.5 / 1M | $1.4 / 1M | No | Yes | Jan 1, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-4 gpt-4 | gpt | text | text | 8,192 | 8,192 | $60 / 1M | $120 / 1M | No | Yes | Mar 14, 2023 | |
Azure Cognitive Services azure-cognitive-services | Meta-Llama-3-70B-Instruct meta-llama-3-70b-instruct | llama | text | text | 8,192 | 2,048 | $2.68 / 1M | $3.54 / 1M | No | No | Apr 18, 2024 | |
Azure Cognitive Services azure-cognitive-services | Meta-Llama-3-8B-Instruct meta-llama-3-8b-instruct | llama | text | text | 8,192 | 2,048 | $0.3 / 1M | $0.61 / 1M | No | No | Apr 18, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3-small-instruct (8k) phi-3-small-8k-instruct | phi | text | text | 8,192 | 2,048 | $0.15 / 1M | $0.6 / 1M | No | No | Apr 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | text-embedding-ada-002 text-embedding-ada-002 | text-embedding | text | text | 8,192 | 1,536 | $0.1 / 1M | $0 / 1M | No | No | Dec 15, 2022 | |
Azure azure | GPT-4 gpt-4 | gpt | text | text | 8,192 | 8,192 | $60 / 1M | $120 / 1M | No | Yes | Mar 14, 2023 | |
Azure azure | Meta-Llama-3-70B-Instruct meta-llama-3-70b-instruct | llama | text | text | 8,192 | 2,048 | $2.68 / 1M | $3.54 / 1M | No | No | Apr 18, 2024 | |
Azure azure | Meta-Llama-3-8B-Instruct meta-llama-3-8b-instruct | llama | text | text | 8,192 | 2,048 | $0.3 / 1M | $0.61 / 1M | No | No | Apr 18, 2024 | |
Azure azure | Phi-3-small-instruct (8k) phi-3-small-8k-instruct | phi | text | text | 8,192 | 2,048 | $0.15 / 1M | $0.6 / 1M | No | No | Apr 23, 2024 | |
Azure azure | text-embedding-ada-002 text-embedding-ada-002 | text-embedding | text | text | 8,192 | 1,536 | $0.1 / 1M | $0 / 1M | No | No | Dec 15, 2022 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-4 openai/gpt-4 | gpt | text | text | 8,192 | 8,192 | $30 / 1M | $60 / 1M | No | Yes | Apr 9, 2024 | |
DigitalOcean digitalocean | GTE Large (v1.5) gte-large-en-v1.5 | text-embedding | text | text | 8,192 | 1,024 | $0.09 / 1M | $0 / 1M | No | No | Apr 16, 2026 | |
GitHub Models github-models | JAIS 30b Chat core42/jais-30b-chat | jais | text | text | 8,192 | 2,048 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 30, 2023 | |
GitHub Models github-models | Meta-Llama-3-70B-Instruct meta/meta-llama-3-70b-instruct | llama | text | text | 8,192 | 2,048 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 18, 2024 | |
GitHub Models github-models | Meta-Llama-3-8B-Instruct meta/meta-llama-3-8b-instruct | llama | text | text | 8,192 | 2,048 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 18, 2024 | |
GitHub Models github-models | Phi-3-small instruct (8k) microsoft/phi-3-small-8k-instruct | phi | text | text | 8,192 | 2,048 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 23, 2024 | |
Google google | Gemma 3n 2B gemma-3n-e2b-it | gemma | text | text | 8,192 | 2,000 | $0 / 1M | $0 / 1M | No | No | Jul 9, 2025 | |
Google google | Gemma 3n 4B gemma-3n-e4b-it | gemma | text | text | 8,192 | 2,000 | $0 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Groq groq | Gemma 2 9B gemma2-9b-it | gemma | text | text | 8,192 | 8,192 | $0.2 / 1M | $0.2 / 1M | No | Yes | Jun 27, 2024 | |
Groq groq | Llama 3 70B llama3-70b-8192 | llama | text | text | 8,192 | 8,192 | $0.59 / 1M | $0.79 / 1M | No | Yes | Apr 18, 2024 | |
Groq groq | Llama 3 8B llama3-8b-8192 | llama | text | text | 8,192 | 8,192 | $0.05 / 1M | $0.08 / 1M | No | Yes | Apr 18, 2024 | |
Groq groq | Llama Guard 3 8B llama-guard-3-8b | llama | text | text | 8,192 | 8,192 | $0.2 / 1M | $0.2 / 1M | No | No | Jul 23, 2024 | |
Helicone helicone | Google Gemma 2 gemma2-9b-it | gemma | text | text | 8,192 | 8,192 | $0.01 / 1M | $0.03 / 1M | No | No | Jun 25, 2024 | |
Kilo Gateway kilo | Google: Gemma 2 27B google/gemma-2-27b-it | Unknown | text | text | 8,192 | 2,048 | $0.65 / 1M | $0.65 / 1M | No | No | Jun 24, 2024 | |
Kilo Gateway kilo | Google: Gemma 2 9B google/gemma-2-9b-it | Unknown | text | text | 8,192 | 1,639 | $0.03 / 1M | $0.09 / 1M | No | No | Jun 28, 2024 | |
Kilo Gateway kilo | Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct | Unknown | text | text | 8,192 | 8,000 | $0.51 / 1M | $0.74 / 1M | No | No | Jul 23, 2024 | |
Kilo Gateway kilo | Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | Unknown | text | text | 8,192 | 16,384 | $0.03 / 1M | $0.04 / 1M | No | Yes | Apr 3, 2025 | |
Kilo Gateway kilo | NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b | Unknown | text | text | 8,192 | 8,192 | $0.14 / 1M | $0.14 / 1M | No | No | Jun 27, 2024 | |
Kilo Gateway kilo | Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b | Unknown | text | text | 8,192 | 8,192 | $0.04 / 1M | $0.05 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Sao10k: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | Unknown | text | text | 8,192 | 8,192 | $1.48 / 1M | $1.48 / 1M | No | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | Gemma 2 27B IT gemma-2-27b-it-together | gemma | text | text | 8,192 | 16,384 | $0.08 / 1M | $0.08 / 1M | No | No | Jun 27, 2024 | |
LLM Gateway llmgateway | Gemma 3n 2B gemma-3n-e2b-it | gemma | text | text | 8,192 | 2,000 | $0 / 1M | $0 / 1M | No | No | Jul 9, 2025 | |
LLM Gateway llmgateway | Gemma 3n 4B gemma-3n-e4b-it | gemma | text | text | 8,192 | 2,000 | $0 / 1M | $0 / 1M | No | No | May 20, 2025 | |
LLM Gateway llmgateway | GPT-4 gpt-4 | gpt | text | text | 8,192 | 8,192 | $30 / 1M | $60 / 1M | No | Yes | Apr 9, 2024 | |
LLM Gateway llmgateway | Hermes 2 Pro Llama 3 8B hermes-2-pro-llama-3-8b | hermes | text | text | 8,192 | 8,192 | $0.14 / 1M | $0.14 / 1M | No | No | May 27, 2024 | |
LLM Gateway llmgateway | Llama 3 70B Instruct llama-3-70b-instruct | llama | text | text | 8,192 | 8,000 | $0.51 / 1M | $0.74 / 1M | No | No | Apr 18, 2024 | |
LLM Gateway llmgateway | Llama 3 8B Instruct llama-3-8b-instruct | llama | text | text | 8,192 | 8,192 | $0.04 / 1M | $0.04 / 1M | No | No | Apr 3, 2025 | |
NanoGPT nano-gpt | Brave (Answers) brave | Unknown | text | text | 8,192 | 8,192 | $5 / 1M | $5 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | Brave (Pro) brave-pro | Unknown | text | text | 8,192 | 8,192 | $5 / 1M | $5 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | Dolphin 72b cognitivecomputations/dolphin-2.9.2-qwen2-72b | qwen | text | text | 8,192 | 4,096 | $0.306 / 1M | $0.306 / 1M | No | No | Feb 27, 2025 | |
NanoGPT nano-gpt | Exa (Research) exa-research | Unknown | text | text | 8,192 | 8,192 | $2.5 / 1M | $2.5 / 1M | No | No | Jun 4, 2025 | |
NanoGPT nano-gpt | Hunyuan MT 7B tencent/Hunyuan-MT-7B | hunyuan | text | text | 8,192 | 8,192 | $10 / 1M | $20 / 1M | No | No | Sep 18, 2025 | |
NanoGPT nano-gpt | Llama 3 70B abliterated failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5 | llama | text | text | 8,192 | 8,192 | $0.7 / 1M | $0.7 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | Manta Mini 1.0 meganova-ai/manta-mini-1.0 | nova | text | text | 8,192 | 8,192 | $0.02 / 1M | $0.16 / 1M | No | No | Dec 20, 2025 | |
NanoGPT nano-gpt | Neural Daredevil 8B abliterated mlabonne/NeuralDaredevil-8B-abliterated | llama | text | text | 8,192 | 8,192 | $0.44 / 1M | $0.44 / 1M | No | No | Dec 1, 2024 | |
Nebius Token Factory nebius | bge-multilingual-gemma2 BAAI/bge-multilingual-gemma2 | text-embedding | text | text | 8,192 | Unknown | $0.01 / 1M | $0 / 1M | No | No | Feb 4, 2026 | |
Nebius Token Factory nebius | Gemma-2-2b-it google/gemma-2-2b-it | Unknown | text | text | 8,192 | 4,096 | $0.02 / 1M | $0.06 / 1M | No | No | Feb 4, 2026 | |
Nebius Token Factory nebius | Gemma-2-9b-it (Fast) google/gemma-2-9b-it-fast | Unknown | text | text | 8,192 | 4,096 | $0.03 / 1M | $0.09 / 1M | No | No | Feb 4, 2026 | |
Nebius Token Factory nebius | Llama-Guard-3-8B meta-llama/Llama-Guard-3-8B | Unknown | text | text | 8,192 | 1,024 | $0.02 / 1M | $0.06 / 1M | No | No | Feb 4, 2026 | |
NovitaAI novita-ai | DeepSeek R1 Distill LLama 70B deepseek/deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 8,192 | 8,192 | $0.8 / 1M | $0.8 / 1M | Yes | No | Jan 27, 2025 | |
NovitaAI novita-ai | Hermes 2 Pro Llama 3 8B nousresearch/hermes-2-pro-llama-3-8b | Unknown | text | text | 8,192 | 8,192 | $0.14 / 1M | $0.14 / 1M | No | No | Jun 27, 2024 | |
NovitaAI novita-ai | L3 70B Euryale V2.1 sao10k/l3-70b-euryale-v2.1 | Unknown | text | text | 8,192 | 8,192 | $1.48 / 1M | $1.48 / 1M | No | Yes | Jun 18, 2024 | |
NovitaAI novita-ai | L3 8B Stheno V3.2 sao10k/L3-8B-Stheno-v3.2 | Unknown | text | text | 8,192 | 32,000 | $0.05 / 1M | $0.05 / 1M | No | Yes | Nov 29, 2024 | |
NovitaAI novita-ai | L31 70B Euryale V2.2 sao10k/l31-70b-euryale-v2.2 | Unknown | text | text | 8,192 | 8,192 | $1.48 / 1M | $1.48 / 1M | No | Yes | Sep 19, 2024 | |
NovitaAI novita-ai | Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | llama | text | text | 8,192 | 8,192 | $0.04 / 1M | $0.04 / 1M | No | No | Apr 25, 2024 | |
NovitaAI novita-ai | Llama3 70B Instruct meta-llama/llama-3-70b-instruct | llama | text | text | 8,192 | 8,000 | $0.51 / 1M | $0.74 / 1M | No | No | Apr 25, 2024 | |
NovitaAI novita-ai | Sao10k L3 8B Lunaris sao10k/l3-8b-lunaris | Unknown | text | text | 8,192 | 8,192 | $0.05 / 1M | $0.05 / 1M | No | No | Nov 28, 2024 | |
OpenAI openai | GPT-4 gpt-4 | gpt | text | text | 8,192 | 8,192 | $30 / 1M | $60 / 1M | No | Yes | Apr 9, 2024 | |
OpenAI openai | text-embedding-ada-002 text-embedding-ada-002 | text-embedding | text | text | 8,192 | 1,536 | $0.1 / 1M | $0 / 1M | No | No | Dec 15, 2022 | |
OpenRouter openrouter | DeepSeek R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | deepseek-thinking | text | text | 8,192 | 8,192 | $0 / 1M | $0 / 1M | Yes | No | Jan 23, 2025 | |
OpenRouter openrouter | Gemma 2 9B google/gemma-2-9b-it | gemma | text | text | 8,192 | 8,192 | $0.03 / 1M | $0.09 / 1M | No | No | Jun 28, 2024 | |
OpenRouter openrouter | Gemma 3n 2B (free) google/gemma-3n-e2b-it:free | gemma | text | text | 8,192 | 2,000 | $0 / 1M | $0 / 1M | No | No | Jul 9, 2025 | |
OpenRouter openrouter | Gemma 3n 4B (free) google/gemma-3n-e4b-it:free | gemma | text | text | 8,192 | 2,000 | $0 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Vercel AI Gateway vercel | Codestral Embed mistral/codestral-embed | codestral-embed | text | text | 8,192 | 1,536 | $0.15 / 1M | $0 / 1M | No | No | May 28, 2025 | |
Vercel AI Gateway vercel | Embed v4.0 cohere/embed-v4.0 | cohere-embed | text | text | 8,192 | 1,536 | $0.12 / 1M | $0 / 1M | No | No | Apr 15, 2025 | |
Vercel AI Gateway vercel | Gemini Embedding 001 google/gemini-embedding-001 | gemini-embedding | text | text | 8,192 | 1,536 | $0.15 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Vercel AI Gateway vercel | GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct | gpt | text | text | 8,192 | 4,096 | $1.5 / 1M | $2 / 1M | No | No | Mar 1, 2023 | |
Vercel AI Gateway vercel | Mistral Embed mistral/mistral-embed | mistral-embed | text | text | 8,192 | 1,536 | $0.1 / 1M | $0 / 1M | No | No | Dec 11, 2023 | |
Vercel AI Gateway vercel | Text Embedding 005 google/text-embedding-005 | text-embedding | text | text | 8,192 | 1,536 | $0.03 / 1M | $0 / 1M | No | No | Aug 1, 2024 | |
Vercel AI Gateway vercel | Text Multilingual Embedding 002 google/text-multilingual-embedding-002 | text-embedding | text | text | 8,192 | 1,536 | $0.03 / 1M | $0 / 1M | No | No | Mar 1, 2024 | |
Vercel AI Gateway vercel | text-embedding-3-large openai/text-embedding-3-large | text-embedding | text | text | 8,192 | 1,536 | $0.13 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Vercel AI Gateway vercel | text-embedding-3-small openai/text-embedding-3-small | text-embedding | text | text | 8,192 | 1,536 | $0.02 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Vercel AI Gateway vercel | text-embedding-ada-002 openai/text-embedding-ada-002 | text-embedding | text | text | 8,192 | 1,536 | $0.1 / 1M | $0 / 1M | No | No | Dec 15, 2022 | |
Vercel AI Gateway vercel | Titan Text Embeddings V2 amazon/titan-embed-text-v2 | titan-embed | text | text | 8,192 | 1,536 | $0.02 / 1M | $0 / 1M | No | No | Apr 1, 2024 | |
Vercel AI Gateway vercel | voyage-3-large voyage/voyage-3-large | voyage | text | text | 8,192 | 1,536 | $0.18 / 1M | $0 / 1M | No | No | Sep 1, 2024 | |
Vercel AI Gateway vercel | voyage-3.5 voyage/voyage-3.5 | voyage | text | text | 8,192 | 1,536 | $0.06 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Vercel AI Gateway vercel | voyage-3.5-lite voyage/voyage-3.5-lite | voyage | text | text | 8,192 | 1,536 | $0.02 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Vercel AI Gateway vercel | voyage-code-2 voyage/voyage-code-2 | voyage | text | text | 8,192 | 1,536 | $0.12 / 1M | $0 / 1M | No | No | Jan 1, 2024 | |
Vercel AI Gateway vercel | voyage-code-3 voyage/voyage-code-3 | voyage | text | text | 8,192 | 1,536 | $0.18 / 1M | $0 / 1M | No | No | Sep 1, 2024 | |
Vercel AI Gateway vercel | voyage-finance-2 voyage/voyage-finance-2 | voyage | text | text | 8,192 | 1,536 | $0.12 / 1M | $0 / 1M | No | No | Mar 1, 2024 | |
Vercel AI Gateway vercel | voyage-law-2 voyage/voyage-law-2 | voyage | text | text | 8,192 | 1,536 | $0.12 / 1M | $0 / 1M | No | No | Mar 1, 2024 | |
Azure Cognitive Services azure-cognitive-services | text-embedding-3-large text-embedding-3-large | text-embedding | text | text | 8,191 | 3,072 | $0.13 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Azure Cognitive Services azure-cognitive-services | text-embedding-3-small text-embedding-3-small | text-embedding | text | text | 8,191 | 1,536 | $0.02 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Azure azure | text-embedding-3-large text-embedding-3-large | text-embedding | text | text | 8,191 | 3,072 | $0.13 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Azure azure | text-embedding-3-small text-embedding-3-small | text-embedding | text | text | 8,191 | 1,536 | $0.02 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Kilo Gateway kilo | OpenAI: GPT-4 openai/gpt-4 | Unknown | text | text | 8,191 | 4,096 | $30 / 1M | $60 / 1M | No | Yes | Apr 9, 2024 | |
Kilo Gateway kilo | OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 | Unknown | text | text | 8,191 | 4,096 | $30 / 1M | $60 / 1M | No | Yes | Mar 15, 2026 | |
OpenAI openai | text-embedding-3-large text-embedding-3-large | text-embedding | text | text | 8,191 | 3,072 | $0.13 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
OpenAI openai | text-embedding-3-small text-embedding-3-small | text-embedding | text | text | 8,191 | 1,536 | $0.02 / 1M | $0 / 1M | No | No | Jan 25, 2024 | |
Scaleway scaleway | BGE Multilingual Gemma2 bge-multilingual-gemma2 | gemma | text | text | 8,191 | 3,072 | $0.1 / 1M | $0 / 1M | No | No | Jun 15, 2025 | |
Cohere cohere | Aya Expanse 8B c4ai-aya-expanse-8b | Unknown | text | text | 8,000 | 4,000 | Unknown | Unknown | No | No | Oct 24, 2024 | |
Cohere cohere | Command A Translate command-a-translate-08-2025 | command-a | text | text | 8,000 | 8,000 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 28, 2025 | |
DigitalOcean digitalocean | Qwen3 Embedding 0.6B qwen3-embedding-0.6b | text-embedding | text | text | 8,000 | 1,024 | $0.04 / 1M | $0 / 1M | No | No | Apr 16, 2026 | |
Kilo Gateway kilo | Inflection: Inflection 3 Pi inflection/inflection-3-pi | Unknown | text | text | 8,000 | 1,024 | $2.5 / 1M | $10 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Inflection: Inflection 3 Productivity inflection/inflection-3-productivity | Unknown | text | text | 8,000 | 1,024 | $2.5 / 1M | $10 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Mancer: Weaver (alpha) mancer/weaver | Unknown | text | text | 8,000 | 2,000 | $0.75 / 1M | $1 / 1M | No | No | Mar 15, 2026 | |
Mistral mistral | Mistral 7B open-mistral-7b | mistral | text | text | 8,000 | 8,000 | $0.25 / 1M | $0.25 / 1M | No | Yes | Sep 27, 2023 | |
Mistral mistral | Mistral Embed mistral-embed | mistral-embed | text | text | 8,000 | 3,072 | $0.1 / 1M | $0 / 1M | No | No | Dec 11, 2023 | |
NanoGPT nano-gpt | Ernie 4.5 8k Preview ernie-4.5-8k-preview | Unknown | text | text | 8,000 | 16,384 | $0.66 / 1M | $2.6 / 1M | No | No | Mar 25, 2025 | |
NanoGPT nano-gpt | GLM Zero Preview glm-zero-preview | Unknown | text | text | 8,000 | 4,096 | $1.802 / 1M | $1.802 / 1M | No | No | Dec 1, 2024 | |
NanoGPT nano-gpt | GLM-4 AirX glm-4-airx | Unknown | text | text | 8,000 | 4,096 | $2.006 / 1M | $2.006 / 1M | No | No | Jun 5, 2024 | |
NanoGPT nano-gpt | Inflection 3 Pi inflection/inflection-3-pi | gpt | text | text | 8,000 | 4,096 | $2.499 / 1M | $9.996 / 1M | No | No | Oct 11, 2024 | |
NanoGPT nano-gpt | Inflection 3 Productivity inflection/inflection-3-productivity | gpt | text | text | 8,000 | 4,096 | $2.499 / 1M | $9.996 / 1M | No | No | Oct 11, 2024 | |
NanoGPT nano-gpt | Step-2 Mini step-2-mini | Unknown | text | text | 8,000 | 4,096 | $0.2006 / 1M | $0.408 / 1M | No | No | Jul 5, 2024 | |
Kilo Gateway kilo | Goliath 120B alpindale/goliath-120b | Unknown | text | text | 6,144 | 1,024 | $3.75 / 1M | $7.5 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | ReMM SLERP 13B undi95/remm-slerp-l2-13b | Unknown | text | text | 6,144 | 4,096 | $0.45 / 1M | $0.65 / 1M | No | No | Mar 15, 2026 | |
Alibaba (China) alibaba-cn | Qwen Math Plus qwen-math-plus | qwen | text | text | 4,096 | 3,072 | $0.574 / 1M | $1.721 / 1M | No | Yes | Sep 19, 2024 | |
Alibaba (China) alibaba-cn | Qwen Math Turbo qwen-math-turbo | qwen | text | text | 4,096 | 3,072 | $0.287 / 1M | $0.861 / 1M | No | Yes | Sep 19, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5-Math 72B Instruct qwen2-5-math-72b-instruct | qwen | text | text | 4,096 | 3,072 | $0.574 / 1M | $1.721 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5-Math 7B Instruct qwen2-5-math-7b-instruct | qwen | text | text | 4,096 | 3,072 | $0.144 / 1M | $0.287 / 1M | No | Yes | Sep 1, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-3.5 Turbo 0301 gpt-3.5-turbo-0301 | gpt | text | text | 4,096 | 4,096 | $1.5 / 1M | $2 / 1M | No | No | Mar 1, 2023 | |
Azure Cognitive Services azure-cognitive-services | GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct | gpt | text | text | 4,096 | 4,096 | $1.5 / 1M | $2 / 1M | No | No | Sep 21, 2023 | |
Azure Cognitive Services azure-cognitive-services | Phi-3-medium-instruct (4k) phi-3-medium-4k-instruct | phi | text | text | 4,096 | 1,024 | $0.17 / 1M | $0.68 / 1M | No | No | Apr 23, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-3-mini-instruct (4k) phi-3-mini-4k-instruct | phi | text | text | 4,096 | 1,024 | $0.13 / 1M | $0.52 / 1M | No | No | Apr 23, 2024 | |
Azure azure | GPT-3.5 Turbo 0301 gpt-3.5-turbo-0301 | gpt | text | text | 4,096 | 4,096 | $1.5 / 1M | $2 / 1M | No | No | Mar 1, 2023 | |
Azure azure | GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct | gpt | text | text | 4,096 | 4,096 | $1.5 / 1M | $2 / 1M | No | No | Sep 21, 2023 | |
Azure azure | Phi-3-medium-instruct (4k) phi-3-medium-4k-instruct | phi | text | text | 4,096 | 1,024 | $0.17 / 1M | $0.68 / 1M | No | No | Apr 23, 2024 | |
Azure azure | Phi-3-mini-instruct (4k) phi-3-mini-4k-instruct | phi | text | text | 4,096 | 1,024 | $0.13 / 1M | $0.52 / 1M | No | No | Apr 23, 2024 | |
GitHub Models github-models | Phi-3-medium instruct (4k) microsoft/phi-3-medium-4k-instruct | phi | text | text | 4,096 | 1,024 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 23, 2024 | |
GitHub Models github-models | Phi-3-mini instruct (4k) microsoft/phi-3-mini-4k-instruct | phi | text | text | 4,096 | 1,024 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 23, 2024 | |
Groq groq | ALLaM-2-7b allam-2-7b | allam | text | text | 4,096 | 4,096 | $0 / 1M | $0 / 1M | No | No | Sep 1, 2024 | |
Kilo Gateway kilo | AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity | Unknown | text | text | 4,096 | 4,096 | $0.8 / 1M | $1.2 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | EleutherAI: Llemma 7b eleutherai/llemma_7b | Unknown | text | text | 4,096 | 4,096 | $0.8 / 1M | $1.2 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | MythoMax 13B gryphe/mythomax-l2-13b | Unknown | text | text | 4,096 | 4,096 | $0.06 / 1M | $0.06 / 1M | No | No | Apr 25, 2024 | |
NanoGPT nano-gpt | Exa (Answer) exa-answer | Unknown | text | text | 4,096 | 4,096 | $2.5 / 1M | $2.5 / 1M | No | No | Jun 4, 2025 | |
NovitaAI novita-ai | Mythomax L2 13B gryphe/mythomax-l2-13b | Unknown | text | text | 4,096 | 3,200 | $0.09 / 1M | $0.09 / 1M | No | No | Apr 25, 2024 | |
STACKIT stackit | E5 Mistral 7B intfloat/e5-mistral-7b-instruct | mistral | text | text | 4,096 | 4,096 | $0.02 / 1M | $0.02 / 1M | No | No | Dec 11, 2023 | |
Kilo Gateway kilo | OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | Unknown | text | text | 4,095 | 4,096 | $1 / 1M | $2 / 1M | No | Yes | Jun 13, 2023 | |
Kilo Gateway kilo | OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct | Unknown | text | text | 4,095 | 4,096 | $1.5 / 1M | $2 / 1M | No | No | Sep 21, 2023 | |
Inference inference | Osmosis Structure 0.6B osmosis/osmosis-structure-0.6b | osmosis | text | text | 4,000 | 2,048 | $0.1 / 1M | $0.5 / 1M | No | Yes | Jan 1, 2025 | |
NanoGPT nano-gpt | MythoMax 13B Gryphe/MythoMax-L2-13b | llama | text | text | 4,000 | 4,096 | $0.1003 / 1M | $0.1003 / 1M | No | No | Aug 8, 2025 | |
Kilo Gateway kilo | Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 | Unknown | text | text | 2,824 | 565 | $0.11 / 1M | $0.19 / 1M | No | No | Apr 3, 2025 | |
Google google | Gemini Embedding 001 gemini-embedding-001 | gemini | text | text | 2,048 | 3,072 | $0.15 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Poe poe | Tako trytako/tako | tako | text | text | 2,048 | Unknown | Unknown | Unknown | No | Yes | Aug 15, 2024 | |
Vertex google-vertex | Gemini Embedding 001 gemini-embedding-001 | gemini | text | text | 2,048 | 3,072 | $0.15 / 1M | $0 / 1M | No | No | May 20, 2025 | |
Azure Cognitive Services azure-cognitive-services | Embed v3 English cohere-embed-v3-english | cohere-embed | text | text | 512 | 1,024 | $0.1 / 1M | $0 / 1M | No | No | Nov 7, 2023 | |
Azure Cognitive Services azure-cognitive-services | Embed v3 Multilingual cohere-embed-v3-multilingual | cohere-embed | text | text | 512 | 1,024 | $0.1 / 1M | $0 / 1M | No | No | Nov 7, 2023 | |
Azure azure | Embed v3 English cohere-embed-v3-english | cohere-embed | text | text | 512 | 1,024 | $0.1 / 1M | $0 / 1M | No | No | Nov 7, 2023 | |
Azure azure | Embed v3 Multilingual cohere-embed-v3-multilingual | cohere-embed | text | text | 512 | 1,024 | $0.1 / 1M | $0 / 1M | No | No | Nov 7, 2023 | |
DigitalOcean digitalocean | Multi-QA-mpnet-base-dot-v1 multi-qa-mpnet-base-dot-v1 | text-embedding | text | text | 512 | 768 | $0.009 / 1M | $0 / 1M | No | No | Apr 16, 2026 | |
evroc evroc | E5 Multi-Lingual Large Embeddings 0.6B intfloat/multilingual-e5-large-instruct | text-embedding | text | text | 512 | 512 | $0.12 / 1M | $0.12 / 1M | No | No | Jun 1, 2024 | |
Groq groq | Llama Prompt Guard 2 22M meta-llama/llama-prompt-guard-2-22m | llama | text | text | 512 | 512 | $0.03 / 1M | $0.03 / 1M | No | No | Oct 1, 2024 | |
Groq groq | Llama Prompt Guard 2 86M meta-llama/llama-prompt-guard-2-86m | llama | text | text | 512 | 512 | $0.04 / 1M | $0.04 / 1M | No | No | Oct 1, 2024 | |
Helicone helicone | Meta Llama Prompt Guard 2 22M llama-prompt-guard-2-22m | llama | text | text | 512 | 2 | $0.01 / 1M | $0.01 / 1M | No | No | Oct 1, 2024 | |
Helicone helicone | Meta Llama Prompt Guard 2 86M llama-prompt-guard-2-86m | llama | text | text | 512 | 2 | $0.01 / 1M | $0.01 / 1M | No | No | Oct 1, 2024 | |
DigitalOcean digitalocean | All-MiniLM-L6-v2 all-mini-lm-l6-v2 | text-embedding | text | text | 256 | 384 | $0.009 / 1M | $0 / 1M | No | No | Apr 16, 2026 | |
Poe poe | claude-code poetools/claude-code | Unknown | text | text | Unknown | Unknown | Unknown | Unknown | Yes | Yes | Nov 27, 2025 | |
Poe poe | GLM-4.6 novita/glm-4.6 | glm | text | text | Unknown | Unknown | Unknown | Unknown | No | Yes | Sep 30, 2025 | |
Poe poe | llama-3.3-70b-cs cerebras/llama-3.3-70b-cs | Unknown | text | text | Unknown | Unknown | Unknown | Unknown | No | No | May 13, 2025 | |
Poe poe | qwen3-235b-2507-cs cerebras/qwen3-235b-2507-cs | Unknown | text | text | Unknown | Unknown | Unknown | Unknown | Yes | Yes | Aug 6, 2025 | |
Poe poe | qwen3-32b-cs cerebras/qwen3-32b-cs | Unknown | text | text | Unknown | Unknown | Unknown | Unknown | Yes | Yes | May 15, 2025 | |
Vercel AI Gateway vercel | Gemini Embedding 2 google/gemini-embedding-2 | gemini-embedding | text | text | Unknown | Unknown | Unknown | Unknown | No | No | Mar 23, 2026 | |
Vercel AI Gateway vercel | MiniMax M2.5 High Speed minimax/minimax-m2.5-highspeed | minimax | text | text | Unknown | Unknown | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Mar 13, 2026 |
948 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Deep Infra deepinfra | Llama 4 Scout 17B meta-llama/Llama-4-Scout-17B-16E-Instruct | llama | text, image | text | 10,000,000 | 16,384 | $0.08 / 1M | $0.3 / 1M | No | Yes | Apr 5, 2025 | |
Amazon Bedrock amazon-bedrock | Llama 4 Scout 17B Instruct meta.llama4-scout-17b-instruct-v1:0 | llama | text, image | text | 3,500,000 | 16,384 | $0.17 / 1M | $0.66 / 1M | No | Yes | Apr 5, 2025 | |
NanoGPT nano-gpt | Gemini 2.0 Pro 0205 gemini-2.0-pro-exp-02-05 | Unknown | text, image | text | 2,097,152 | 8,192 | $1.989 / 1M | $7.956 / 1M | No | No | Feb 5, 2025 | |
NanoGPT nano-gpt | Gemini 2.0 Pro 1206 gemini-exp-1206 | Unknown | text, image | text | 2,097,152 | 8,192 | $1.258 / 1M | $4.998 / 1M | No | No | Dec 6, 2024 | |
302.AI 302ai | gemini-2.0-flash-lite gemini-2.0-flash-lite | gemini-flash-lite | text, image | text | 2,000,000 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | No | Jun 16, 2025 | |
302.AI 302ai | grok-4-1-fast-non-reasoning grok-4-1-fast-non-reasoning | Unknown | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 20, 2025 | |
302.AI 302ai | grok-4-1-fast-reasoning grok-4-1-fast-reasoning | Unknown | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 20, 2025 | |
302.AI 302ai | grok-4-fast-non-reasoning grok-4-fast-non-reasoning | Unknown | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 23, 2025 | |
302.AI 302ai | grok-4-fast-reasoning grok-4-fast-reasoning | Unknown | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 23, 2025 | |
302.AI 302ai | grok-4.20-beta-0309-non-reasoning grok-4.20-beta-0309-non-reasoning | Unknown | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | No | Yes | Mar 16, 2026 | |
302.AI 302ai | grok-4.20-beta-0309-reasoning grok-4.20-beta-0309-reasoning | Unknown | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 16, 2026 | |
302.AI 302ai | grok-4.20-multi-agent-beta-0309 grok-4.20-multi-agent-beta-0309 | Unknown | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 16, 2026 | |
Abacus abacus | Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 16,384 | $0.2 / 1M | $0.5 / 1M | No | Yes | Jul 9, 2025 | |
Abacus abacus | Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 16,384 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 17, 2025 | |
Azure Cognitive Services azure-cognitive-services | Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 19, 2025 | |
Azure Cognitive Services azure-cognitive-services | Grok 4 Fast (Reasoning) grok-4-fast-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 19, 2025 | |
Azure azure | Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 19, 2025 | |
Azure azure | Grok 4 Fast (Reasoning) grok-4-fast-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 19, 2025 | |
Firmware firmware | Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 128,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 25, 2025 | |
Firmware firmware | Grok 4.1 Fast (Reasoning) grok-4-1-fast-reasoning | grok | text, image | text | 2,000,000 | 128,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 25, 2025 | |
Helicone helicone | xAI Grok 4.1 Fast Reasoning grok-4-1-fast-reasoning | grok | text, image | text | 2,000,000 | 2,000,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 17, 2025 | |
Helicone helicone | xAI: Grok 4 Fast Reasoning grok-4-fast-reasoning | grok | text, image | text | 2,000,000 | 2,000,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 1, 2025 | |
Jiekou.AI jiekou | grok-4-1-fast-non-reasoning grok-4-1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 2,000,000 | $0.18 / 1M | $0.45 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | grok-4-1-fast-reasoning grok-4-1-fast-reasoning | grok | text, image | text | 2,000,000 | 2,000,000 | $0.18 / 1M | $0.45 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | grok-4-fast-non-reasoning grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 2,000,000 | $0.18 / 1M | $0.45 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | grok-4-fast-reasoning grok-4-fast-reasoning | grok | text, image | text | 2,000,000 | 2,000,000 | $0.18 / 1M | $0.45 / 1M | No | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | xAI: Grok 4 Fast x-ai/grok-4-fast | Unknown | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Aug 19, 2025 | |
Kilo Gateway kilo | xAI: Grok 4.1 Fast x-ai/grok-4.1-fast | Unknown | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 19, 2025 | |
LLM Gateway llmgateway | Grok 4 Fast grok-4-fast | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 19, 2025 | |
LLM Gateway llmgateway | Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 19, 2025 | |
LLM Gateway llmgateway | Grok 4 Fast Reasoning grok-4-fast-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Jul 9, 2025 | |
LLM Gateway llmgateway | Grok 4.1 Fast grok-4-1-fast | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 19, 2025 | |
LLM Gateway llmgateway | Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 19, 2025 | |
LLM Gateway llmgateway | Grok 4.1 Fast Reasoning grok-4-1-fast-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 19, 2025 | |
LLM Gateway llmgateway | Grok 4.20 (Non-Reasoning) grok-4-20-beta-0309-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | No | Yes | Mar 9, 2026 | |
LLM Gateway llmgateway | Grok 4.20 (Reasoning) grok-4-20-beta-0309-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 9, 2026 | |
NanoGPT nano-gpt | Grok 4 Fast x-ai/grok-4-fast | grok | text, image | text | 2,000,000 | 131,072 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 20, 2025 | |
NanoGPT nano-gpt | Grok 4 Fast Thinking x-ai/grok-4-fast:thinking | grok | text, image | text | 2,000,000 | 131,072 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Jul 9, 2025 | |
NanoGPT nano-gpt | Grok 4.1 Fast x-ai/grok-4.1-fast | grok | text, image | text | 2,000,000 | 131,072 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 20, 2025 | |
NanoGPT nano-gpt | Grok 4.1 Fast Reasoning x-ai/grok-4.1-fast-reasoning | grok | text, image | text | 2,000,000 | 131,072 | $0.2 / 1M | $0.5 / 1M | Yes | No | Nov 20, 2025 | |
OpenRouter openrouter | Grok 4 Fast x-ai/grok-4-fast | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Aug 19, 2025 | |
OpenRouter openrouter | Grok 4.1 Fast x-ai/grok-4.1-fast | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 19, 2025 | |
OpenRouter openrouter | Grok 4.20 Beta x-ai/grok-4.20-beta | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 12, 2026 | |
OpenRouter openrouter | Grok 4.20 Multi-Agent Beta x-ai/grok-4.20-multi-agent-beta | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | No | Mar 12, 2026 | |
Perplexity Agent perplexity-agent | Grok 4.1 Fast (Non-Reasoning) xai/grok-4-1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 19, 2025 | |
Poe poe | Grok-4-Fast-Non-Reasoning xai/grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 128,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 16, 2025 | |
Poe poe | Grok-4-Fast-Reasoning xai/grok-4-fast-reasoning | grok | text, image | text | 2,000,000 | 128,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 16, 2025 | |
Poe poe | Grok-4.1-Fast-Non-Reasoning xai/grok-4.1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | Unknown | Unknown | No | Yes | Nov 19, 2025 | |
Poe poe | Grok-4.1-Fast-Reasoning xai/grok-4.1-fast-reasoning | grok | text, image | text | 2,000,000 | 30,000 | Unknown | Unknown | Yes | Yes | Nov 19, 2025 | |
Venice AI venice | Grok 4.20 grok-4-20 | grok | text, image | text | 2,000,000 | 128,000 | $2.27 / 1M | $6.8 / 1M | Yes | Yes | Apr 19, 2026 | |
Venice AI venice | Grok 4.20 Multi-Agent grok-4-20-multi-agent | grok | text, image | text | 2,000,000 | 128,000 | $2.27 / 1M | $6.8 / 1M | Yes | No | Apr 19, 2026 | |
Vercel AI Gateway vercel | Grok 4 Fast (Non-Reasoning) xai/grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 19, 2025 | |
xAI xai | Grok 4 Fast grok-4-fast | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 19, 2025 | |
xAI xai | Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 19, 2025 | |
xAI xai | Grok 4.1 Fast grok-4-1-fast | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 19, 2025 | |
xAI xai | Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 19, 2025 | |
xAI xai | Grok 4.20 (Non-Reasoning) grok-4.20-0309-non-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | No | Yes | Mar 9, 2026 | |
xAI xai | Grok 4.20 (Reasoning) grok-4.20-0309-reasoning | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 9, 2026 | |
xAI xai | Grok 4.20 Multi-Agent grok-4.20-multi-agent-0309 | grok | text, image | text | 2,000,000 | 30,000 | $2 / 1M | $6 / 1M | Yes | No | Mar 9, 2026 | |
ZenMux zenmux | Grok 4 Fast x-ai/grok-4-fast | Unknown | text, image | text | 2,000,000 | 64,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Sep 19, 2025 | |
ZenMux zenmux | Grok 4.1 Fast x-ai/grok-4.1-fast | Unknown | text, image | text | 2,000,000 | 64,000 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Nov 20, 2025 | |
ZenMux zenmux | Grok 4.1 Fast Non Reasoning x-ai/grok-4.1-fast-non-reasoning | Unknown | text, image | text | 2,000,000 | 64,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 20, 2025 | |
Abacus abacus | GPT-5.4 gpt-5.4 | gpt | text, image | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.4 Pro gpt-5.4-pro | gpt-pro | text, image | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
Azure azure | GPT-5.4 Pro gpt-5.4-pro | gpt-pro | text, image | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
LLM Gateway llmgateway | GPT-5.4 Pro gpt-5.4-pro | gpt-pro | text, image | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenAI openai | GPT-5.4 Pro gpt-5.4-pro | gpt-pro | text, image | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
Perplexity Agent perplexity-agent | GPT-5.4 openai/gpt-5.4 | gpt | text, image | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
ZenMux zenmux | GPT-5.4 openai/gpt-5.4 | Unknown | text, image | text | 1,050,000 | 128,000 | $3.75 / 1M | $18.75 / 1M | Yes | Yes | Mar 20, 2026 | |
ZenMux zenmux | GPT-5.4 Pro openai/gpt-5.4-pro | Unknown | text, image | text | 1,050,000 | 128,000 | $45 / 1M | $225 / 1M | Yes | Yes | Mar 20, 2026 | |
NanoGPT nano-gpt | Gemini 2.5 Flash gemini-2.5-flash | Unknown | text, image | text | 1,048,756 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | No | Jun 5, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Lite gemini-2.5-flash-lite | Unknown | text, image | text | 1,048,756 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | No | Jun 17, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Lite Preview gemini-2.5-flash-lite-preview-06-17 | Unknown | text, image | text | 1,048,756 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | No | Jun 17, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Preview gemini-2.5-flash-preview-04-17 | Unknown | text, image | text | 1,048,756 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | No | Apr 17, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Preview Thinking gemini-2.5-flash-preview-04-17:thinking | Unknown | text, image | text | 1,048,756 | 65,536 | $0.15 / 1M | $3.5 / 1M | Yes | No | Apr 17, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Pro gemini-2.5-pro | Unknown | text, image | text | 1,048,756 | 65,536 | $2.5 / 1M | $10 / 1M | Yes | No | Jun 5, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Pro Experimental 0325 gemini-2.5-pro-exp-03-25 | Unknown | text, image | text | 1,048,756 | 65,536 | $2.5 / 1M | $10 / 1M | Yes | No | Mar 25, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Pro Preview 0325 gemini-2.5-pro-preview-03-25 | Unknown | text, image | text | 1,048,756 | 65,536 | $2.5 / 1M | $10 / 1M | Yes | No | Mar 25, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Pro Preview 0506 gemini-2.5-pro-preview-05-06 | Unknown | text, image | text | 1,048,756 | 65,536 | $2.5 / 1M | $10 / 1M | Yes | No | May 6, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Pro Preview 0605 gemini-2.5-pro-preview-06-05 | Unknown | text, image | text | 1,048,756 | 65,536 | $2.5 / 1M | $10 / 1M | Yes | No | Jun 5, 2025 | |
NanoGPT nano-gpt | Gemini 3 Flash (Preview) google/gemini-3-flash-preview | gemini-flash | text, image | text | 1,048,756 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
NanoGPT nano-gpt | Gemini 3 Flash Thinking google/gemini-3-flash-preview-thinking | gemini-flash | text, image | text | 1,048,756 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | No | Dec 17, 2025 | |
NanoGPT nano-gpt | Gemini 3 Pro gemini-3-pro-preview | Unknown | text, image | text | 1,048,756 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
NanoGPT nano-gpt | Gemini 3 Pro Image gemini-3-pro-image-preview | Unknown | text, image | text | 1,048,756 | 65,536 | $2 / 1M | $12 / 1M | No | No | Nov 18, 2025 | |
NanoGPT nano-gpt | Gemini 3 Pro Thinking gemini-3-pro-preview-thinking | Unknown | text, image | text | 1,048,756 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
Cortecs cortecs | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image | text | 1,048,576 | 65,535 | $1.654 / 1M | $11.024 / 1M | No | Yes | Jun 17, 2025 | |
Helicone helicone | Google Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image | text | 1,048,576 | 65,535 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 17, 2025 | |
Helicone helicone | Google Gemini 2.5 Flash Lite gemini-2.5-flash-lite | gemini-flash-lite | text, image | text | 1,048,576 | 65,535 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jul 22, 2025 | |
Helicone helicone | Google Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 17, 2025 | |
Kilo Gateway kilo | Meta: Llama 4 Maverick meta-llama/llama-4-maverick | Unknown | text, image | text | 1,048,576 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Dec 24, 2025 | |
LLM Gateway llmgateway | Gemini Pro Latest gemini-pro-latest | gemini | text, image | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 27, 2026 | |
NanoGPT nano-gpt | Llama 4 Maverick meta-llama/llama-4-maverick | llama | text, image | text | 1,048,576 | 65,536 | $0.18 / 1M | $0.8 / 1M | No | Yes | Sep 5, 2025 | |
NovitaAI novita-ai | Llama 4 Maverick Instruct meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | Unknown | text, image | text | 1,048,576 | 8,192 | $0.27 / 1M | $0.85 / 1M | No | No | Apr 6, 2025 | |
Ollama Cloud ollama-cloud | gemini-3-flash-preview gemini-3-flash-preview | gemini-flash | text, image | text | 1,048,576 | 65,536 | Unknown | Unknown | Yes | Yes | Apr 8, 2026 | |
NanoGPT nano-gpt | Gemini 2.5 Flash 0520 gemini-2.5-flash-preview-05-20 | Unknown | text, image | text | 1,048,000 | 65,536 | $0.15 / 1M | $0.6 / 1M | No | No | May 20, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash 0520 Thinking gemini-2.5-flash-preview-05-20:thinking | Unknown | text, image | text | 1,048,000 | 65,536 | $0.15 / 1M | $3.5 / 1M | Yes | No | May 20, 2025 | |
Abacus abacus | GPT-4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Abacus abacus | GPT-4.1 Mini gpt-4.1-mini | gpt | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Abacus abacus | GPT-4.1 Nano gpt-4.1-nano | gpt | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
AIHubMix aihubmix | GPT-4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
AIHubMix aihubmix | GPT-4.1 mini gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-4.1 mini gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-4.1 nano gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
Azure azure | GPT-4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Azure azure | GPT-4.1 mini gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Azure azure | GPT-4.1 nano gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
Cortecs cortecs | GPT 4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2.354 / 1M | $9.417 / 1M | No | Yes | Apr 14, 2025 | |
FastRouter fastrouter | GPT-4.1 openai/gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Helicone helicone | OpenAI GPT-4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Helicone helicone | OpenAI GPT-4.1 Mini gpt-4.1-mini-2025-04-14 | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Helicone helicone | OpenAI GPT-4.1 Mini gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Helicone helicone | OpenAI GPT-4.1 Nano gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
LLM Gateway llmgateway | GPT-4.1 nano gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
NanoGPT nano-gpt | GPT 4.1 Mini openai/gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | No | Apr 14, 2025 | |
OpenAI openai | GPT-4.1 nano gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
OpenRouter openrouter | GPT-4.1 openai/gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
OpenRouter openrouter | GPT-4.1 Mini openai/gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Poe poe | GPT-4.1 openai/gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $1.8 / 1M | $7.2 / 1M | No | Yes | Apr 14, 2025 | |
Poe poe | GPT-4.1-mini openai/gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.36 / 1M | $1.4 / 1M | No | Yes | Apr 15, 2025 | |
Poe poe | GPT-4.1-nano openai/gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.09 / 1M | $0.36 / 1M | No | Yes | Apr 15, 2025 | |
Requesty requesty | GPT-4.1 openai/gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Requesty requesty | GPT-4.1 Mini openai/gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
SAP AI Core sap-ai-core | gpt-4.1 gpt-4.1 | gpt | text, image | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
SAP AI Core sap-ai-core | gpt-4.1-mini gpt-4.1-mini | gpt-mini | text, image | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Vercel AI Gateway vercel | GPT-4.1 nano openai/gpt-4.1-nano | gpt-nano | text, image | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
ZenMux zenmux | Qwen3.5 Flash qwen/qwen3.5-flash | Unknown | text, image | text | 1,020,000 | 1,020,000 | $0.1 / 1M | $0.4 / 1M | No | Yes | Mar 20, 2026 | |
Kilo Gateway kilo | MiniMax: MiniMax-01 minimax/minimax-01 | Unknown | text, image | text | 1,000,192 | 1,000,192 | $0.2 / 1M | $1.1 / 1M | No | No | Jan 15, 2025 | |
302.AI 302ai | gemini-2.5-flash gemini-2.5-flash | gemini-flash | text, image | text | 1,000,000 | 65,536 | $0.3 / 1M | $2.5 / 1M | No | Yes | Jun 17, 2025 | |
302.AI 302ai | gemini-2.5-flash-lite-preview-09-2025 gemini-2.5-flash-lite-preview-09-2025 | Unknown | text, image | text | 1,000,000 | 65,536 | $0.1 / 1M | $0.4 / 1M | No | Yes | Sep 26, 2025 | |
302.AI 302ai | gemini-2.5-flash-nothink gemini-2.5-flash-nothink | gemini-flash | text, image | text | 1,000,000 | 65,536 | $0.3 / 1M | $2.5 / 1M | No | Yes | Jun 24, 2025 | |
302.AI 302ai | gemini-2.5-flash-preview-09-2025 gemini-2.5-flash-preview-09-2025 | Unknown | text, image | text | 1,000,000 | 65,536 | $0.3 / 1M | $2.5 / 1M | No | Yes | Sep 26, 2025 | |
302.AI 302ai | gemini-2.5-pro gemini-2.5-pro | gemini-pro | text, image | text | 1,000,000 | 65,536 | $1.25 / 1M | $10 / 1M | No | Yes | Jun 17, 2025 | |
302.AI 302ai | gemini-3-flash-preview gemini-3-flash-preview | Unknown | text, image | text | 1,000,000 | 65,536 | $0.5 / 1M | $3 / 1M | No | Yes | Dec 18, 2025 | |
302.AI 302ai | gemini-3-pro-preview gemini-3-pro-preview | Unknown | text, image | text | 1,000,000 | 64,000 | $2 / 1M | $12 / 1M | No | Yes | Nov 19, 2025 | |
302.AI 302ai | gpt-4.1 gpt-4.1 | gpt | text, image | text | 1,000,000 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
302.AI 302ai | gpt-4.1-mini gpt-4.1-mini | gpt-mini | text, image | text | 1,000,000 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
302.AI 302ai | gpt-4.1-nano gpt-4.1-nano | gpt-nano | text, image | text | 1,000,000 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Apr 14, 2025 | |
Abacus abacus | Llama 4 Maverick 17B 128E Instruct FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | llama | text, image | text | 1,000,000 | 32,768 | $0.14 / 1M | $0.59 / 1M | No | Yes | Apr 5, 2025 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | Qwen3.5 Plus qwen3.5-plus | qwen | text, image | text | 1,000,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 16, 2026 | |
Amazon Bedrock amazon-bedrock | Llama 4 Maverick 17B Instruct meta.llama4-maverick-17b-instruct-v1:0 | llama | text, image | text | 1,000,000 | 16,384 | $0.24 / 1M | $0.97 / 1M | No | Yes | Apr 5, 2025 | |
Deep Infra deepinfra | Llama 4 Maverick 17B FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | llama | text, image | text | 1,000,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Apr 5, 2025 | |
Jiekou.AI jiekou | claude-opus-4-6 claude-opus-4-6 | claude-opus | text, image | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 1, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6 | Unknown | text, image | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
LLM Gateway llmgateway | Qwen3 VL Flash qwen3-vl-flash | qwen | text, image | text | 1,000,000 | 32,000 | $0.05 / 1M | $0.4 / 1M | No | Yes | Oct 9, 2025 | |
NanoGPT nano-gpt | Gemini 2.0 Flash gemini-2.0-flash-001 | Unknown | text, image | text | 1,000,000 | 8,192 | $0.1003 / 1M | $0.408 / 1M | No | Yes | Dec 11, 2024 | |
NanoGPT nano-gpt | Gemini 2.0 Flash Lite gemini-2.0-flash-lite | Unknown | text, image | text | 1,000,000 | 8,192 | $0.0748 / 1M | $0.306 / 1M | No | No | Dec 11, 2024 | |
NanoGPT nano-gpt | Gemini 2.0 Flash Thinking 0121 gemini-2.0-flash-thinking-exp-01-21 | Unknown | text, image | text | 1,000,000 | 8,192 | $0.306 / 1M | $1.003 / 1M | Yes | No | Jan 21, 2025 | |
OpenRouter openrouter | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | claude-sonnet | text, image | text | 1,000,000 | 128,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
OpenRouter openrouter | MiniMax-01 minimax/minimax-01 | minimax | text, image | text | 1,000,000 | 1,000,000 | $0.2 / 1M | $1.1 / 1M | Yes | Yes | Jan 15, 2025 | |
Requesty requesty | Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 | claude-sonnet | text, image | text | 1,000,000 | 128,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
Venice AI venice | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image | text | 1,000,000 | 128,000 | $6 / 1M | $30 / 1M | Yes | Yes | Mar 16, 2026 | |
Venice AI venice | Claude Opus 4.6 Fast claude-opus-4-6-fast | claude-opus | text, image | text | 1,000,000 | 128,000 | $36 / 1M | $180 / 1M | Yes | Yes | Apr 8, 2026 | |
Venice AI venice | Claude Opus 4.7 claude-opus-4-7 | claude-opus | text, image | text | 1,000,000 | 128,000 | $6 / 1M | $30 / 1M | Yes | Yes | Apr 16, 2026 | |
Venice AI venice | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image | text | 1,000,000 | 64,000 | $3.6 / 1M | $18 / 1M | Yes | Yes | Mar 16, 2026 | |
Venice AI venice | GPT-5.4 openai-gpt-54 | gpt | text, image | text | 1,000,000 | 131,072 | $3.13 / 1M | $18.8 / 1M | Yes | Yes | Mar 9, 2026 | |
Venice AI venice | GPT-5.4 Pro openai-gpt-54-pro | gpt-pro | text, image | text | 1,000,000 | 128,000 | $37.5 / 1M | $225 / 1M | Yes | Yes | Mar 9, 2026 | |
Venice AI venice | GPT-5.5 openai-gpt-55 | gpt | text, image | text | 1,000,000 | 131,072 | $6.25 / 1M | $37.5 / 1M | Yes | Yes | Apr 25, 2026 | |
Venice AI venice | GPT-5.5 Pro openai-gpt-55-pro | gpt-pro | text, image | text | 1,000,000 | 128,000 | $37.5 / 1M | $225 / 1M | Yes | Yes | Apr 25, 2026 | |
Venice AI venice | Grok 4.1 Fast grok-41-fast | grok | text, image | text | 1,000,000 | 30,000 | $0.23 / 1M | $0.57 / 1M | Yes | Yes | Apr 9, 2026 | |
Vercel AI Gateway vercel | Nova 2 Lite amazon/nova-2-lite | nova | text, image | text | 1,000,000 | 1,000,000 | $0.3 / 1M | $2.5 / 1M | Yes | No | Dec 1, 2024 | |
ZenMux zenmux | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | Unknown | text, image | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 18, 2026 | |
ZenMux zenmux | Qwen3.5 Plus qwen/qwen3.5-plus | Unknown | text, image | text | 1,000,000 | 64,000 | $0.8 / 1M | $4.8 / 1M | Yes | Yes | Mar 20, 2026 | |
Vertex google-vertex | Llama 4 Maverick 17B 128E Instruct meta/llama-4-maverick-17b-128e-instruct-maas | llama | text, image | text | 524,288 | 8,192 | $0.35 / 1M | $1.15 / 1M | No | Yes | Apr 29, 2025 | |
Synthetic synthetic | Llama-4-Maverick-17B-128E-Instruct-FP8 hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | llama | text, image | text | 524,000 | 4,096 | $0.22 / 1M | $0.88 / 1M | No | Yes | Apr 5, 2025 | |
v0 v0 | v0-1.5-lg v0-1.5-lg | v0 | text, image | text | 512,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Jun 9, 2025 | |
IO.NET io-net | Llama 4 Maverick 17B 128E Instruct meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | llama | text, image | text | 430,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jan 15, 2025 | |
302.AI 302ai | gpt-5 gpt-5 | Unknown | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | No | Yes | Aug 8, 2025 | |
302.AI 302ai | gpt-5-mini gpt-5-mini | Unknown | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | No | Yes | Aug 8, 2025 | |
302.AI 302ai | gpt-5-pro gpt-5-pro | Unknown | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | No | Yes | Oct 8, 2025 | |
302.AI 302ai | gpt-5-thinking gpt-5-thinking | Unknown | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 8, 2025 | |
302.AI 302ai | gpt-5.1 gpt-5.1 | Unknown | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | No | Yes | Nov 14, 2025 | |
302.AI 302ai | gpt-5.2 gpt-5.2 | Unknown | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | No | Yes | Dec 12, 2025 | |
302.AI 302ai | gpt-5.4-mini gpt-5.4-mini | Unknown | text, image | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | No | Yes | Mar 19, 2026 | |
302.AI 302ai | gpt-5.4-mini-2026-03-17 gpt-5.4-mini-2026-03-17 | Unknown | text, image | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | No | Yes | Mar 19, 2026 | |
302.AI 302ai | gpt-5.4-nano gpt-5.4-nano | Unknown | text, image | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | No | Yes | Mar 19, 2026 | |
302.AI 302ai | gpt-5.4-nano-2026-03-17 gpt-5.4-nano-2026-03-17 | Unknown | text, image | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | No | Yes | Mar 19, 2026 | |
Abacus abacus | GPT-5 gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Abacus abacus | GPT-5 Codex gpt-5-codex | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
Abacus abacus | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Abacus abacus | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Abacus abacus | GPT-5.1 gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Abacus abacus | GPT-5.1 Chat Latest gpt-5.1-chat-latest | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Abacus abacus | GPT-5.1 Codex gpt-5.1-codex | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Abacus abacus | GPT-5.1 Codex Max gpt-5.1-codex-max | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Abacus abacus | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Abacus abacus | GPT-5.2 Chat Latest gpt-5.2-chat-latest | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 1, 2026 | |
Abacus abacus | GPT-5.3 Chat Latest gpt-5.3-chat-latest | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Mar 1, 2026 | |
AIHubMix aihubmix | GPT-5.1 gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 15, 2025 | |
AIHubMix aihubmix | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
AIHubMix aihubmix | GPT-5.1 Codex mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
AIHubMix aihubmix | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
AIHubMix aihubmix | GPT-5.2-Codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
AIHubMix aihubmix | GPT-5.4 gpt-5.4 | gpt | text, image | text | 400,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 11, 2026 | |
AIHubMix aihubmix | GPT-5.4-Mini gpt-5.4-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | No | Yes | Mar 11, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5 Pro gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Oct 6, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5-Codex gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.1 Codex Mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
Azure azure | GPT-5 Pro gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Oct 6, 2025 | |
Azure azure | GPT-5-Codex gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
Azure azure | GPT-5.1 Codex Max gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Azure azure | GPT-5.1 Codex Mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure azure | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Azure azure | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
Azure azure | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.1 openai/gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.1 Codex openai/gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.2 openai/gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
DigitalOcean digitalocean | GPT-5 openai-gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
DigitalOcean digitalocean | GPT-5 mini openai-gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
DigitalOcean digitalocean | GPT-5 nano openai-gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
DigitalOcean digitalocean | GPT-5.1 Codex Max openai-gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
DigitalOcean digitalocean | GPT-5.2 openai-gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
DigitalOcean digitalocean | GPT-5.2 pro openai-gpt-5-2-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Dec 11, 2025 | |
DigitalOcean digitalocean | GPT-5.4 mini openai-gpt-5.4-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
DigitalOcean digitalocean | GPT-5.4 nano openai-gpt-5.4-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
DigitalOcean digitalocean | GPT-5.4 pro openai-gpt-5.4-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
FastRouter fastrouter | GPT-5 openai/gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
FastRouter fastrouter | GPT-5 Mini openai/gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
FastRouter fastrouter | GPT-5 Nano openai/gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Firmware firmware | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Firmware firmware | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Firmware firmware | GPT-5.3 Codex gpt-5-3-codex | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 15, 2026 | |
GitHub Copilot github-copilot | GPT-5.1-Codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 13, 2025 | |
GitHub Copilot github-copilot | GPT-5.1-Codex-max gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 4, 2025 | |
GitHub Copilot github-copilot | GPT-5.1-Codex-mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 13, 2025 | |
GitHub Copilot github-copilot | GPT-5.2-Codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2025 | |
GitHub Copilot github-copilot | GPT-5.3-Codex gpt-5.3-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 24, 2026 | |
GitHub Copilot github-copilot | GPT-5.4 gpt-5.4 | gpt | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 5, 2026 | |
GitHub Copilot github-copilot | GPT-5.4 Mini gpt-5.4-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 17, 2026 | |
GitHub Copilot github-copilot | GPT-5.5 gpt-5.5 | gpt | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5 Codex) duo-chat-gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 22, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5 Mini) duo-chat-gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 22, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.1) duo-chat-gpt-5-1 | gpt | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 22, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.2) duo-chat-gpt-5-2 | gpt | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 23, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.4 Mini) duo-chat-gpt-5-4-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 17, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.4 Nano) duo-chat-gpt-5-4-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 17, 2026 | |
Helicone helicone | OpenAI GPT-5 gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | OpenAI GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | OpenAI GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | No | Yes | Jan 1, 2025 | |
Jiekou.AI jiekou | gpt-5-chat-latest gpt-5-chat-latest | gpt | text, image | text | 400,000 | 128,000 | $1.125 / 1M | $9 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5-codex gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.125 / 1M | $9 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5-mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.225 / 1M | $1.8 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5-nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.045 / 1M | $0.36 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5-pro gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $13.5 / 1M | $108 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5.1 gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.125 / 1M | $9 / 1M | Yes | Yes | Feb 1, 2026 | |
Jiekou.AI jiekou | gpt-5.1-codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.125 / 1M | $9 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5.1-codex-max gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.125 / 1M | $9 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5.1-codex-mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.225 / 1M | $1.8 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.575 / 1M | $12.6 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5.2-codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gpt-5.2-pro gpt-5.2-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $18.9 / 1M | $151.2 / 1M | Yes | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 Codex openai/gpt-5-codex | Unknown | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
Kilo Gateway kilo | OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex | Unknown | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Kilo Gateway kilo | OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | Unknown | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Kilo Gateway kilo | OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex | Unknown | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
LLM Gateway llmgateway | GPT-5 gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
LLM Gateway llmgateway | GPT-5 Chat (latest) gpt-5-chat-latest | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
LLM Gateway llmgateway | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
LLM Gateway llmgateway | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
LLM Gateway llmgateway | GPT-5 Pro gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Oct 6, 2025 | |
LLM Gateway llmgateway | GPT-5.1 gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
LLM Gateway llmgateway | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
LLM Gateway llmgateway | GPT-5.1 Codex mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
LLM Gateway llmgateway | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
LLM Gateway llmgateway | GPT-5.2 Pro gpt-5.2-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Dec 11, 2025 | |
LLM Gateway llmgateway | GPT-5.4 mini gpt-5.4-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
LLM Gateway llmgateway | GPT-5.4 nano gpt-5.4-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
NanoGPT nano-gpt | GPT 5 Chat openai/gpt-5-chat-latest | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
NanoGPT nano-gpt | GPT 5 Mini openai/gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | No | Aug 7, 2025 | |
NanoGPT nano-gpt | GPT 5 Nano openai/gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | No | Aug 7, 2025 | |
NanoGPT nano-gpt | GPT 5 Pro openai/gpt-5-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $15 / 1M | $120 / 1M | Yes | No | Aug 7, 2025 | |
NanoGPT nano-gpt | GPT 5.1 Chat openai/gpt-5.1-chat | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT 5.1 Chat (Latest) openai/gpt-5.1-chat-latest | gpt | text, image | text | 400,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | No | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT 5.1 Codex openai/gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT 5.1 Codex Mini openai/gpt-5.1-codex-mini | gpt-codex-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | No | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT 5.2 Chat openai/gpt-5.2-chat | gpt | text, image | text | 400,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | No | Jan 1, 2026 | |
OpenAI openai | GPT-5 gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenAI openai | GPT-5 Chat (latest) gpt-5-chat-latest | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
OpenAI openai | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenAI openai | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenAI openai | GPT-5 Pro gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Oct 6, 2025 | |
OpenAI openai | GPT-5-Codex gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
OpenAI openai | GPT-5.1 gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenAI openai | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenAI openai | GPT-5.1 Codex Max gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenAI openai | GPT-5.1 Codex mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenAI openai | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenAI openai | GPT-5.2 Pro gpt-5.2-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenAI openai | GPT-5.4 mini gpt-5.4-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
OpenAI openai | GPT-5.4 nano gpt-5.4-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
OpenCode Zen opencode | GPT-5 gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.07 / 1M | $8.5 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenCode Zen opencode | GPT-5 Codex gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.07 / 1M | $8.5 / 1M | Yes | Yes | Sep 15, 2025 | |
OpenCode Zen opencode | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenCode Zen opencode | GPT-5.1 gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.07 / 1M | $8.5 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenCode Zen opencode | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.07 / 1M | $8.5 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenCode Zen opencode | GPT-5.1 Codex Max gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenCode Zen opencode | GPT-5.1 Codex Mini gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenCode Zen opencode | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenRouter openrouter | GPT-5 openai/gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenRouter openrouter | GPT-5 Chat (latest) openai/gpt-5-chat | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
OpenRouter openrouter | GPT-5 Codex openai/gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
OpenRouter openrouter | GPT-5 Mini openai/gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenRouter openrouter | GPT-5 Nano openai/gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
OpenRouter openrouter | GPT-5 Pro openai/gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Oct 6, 2025 | |
OpenRouter openrouter | GPT-5.1 openai/gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenRouter openrouter | GPT-5.1-Codex openai/gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenRouter openrouter | GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenRouter openrouter | GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 100,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenRouter openrouter | GPT-5.2 openai/gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenRouter openrouter | GPT-5.2 Pro openai/gpt-5.2-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenRouter openrouter | GPT-5.2-Codex openai/gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
Perplexity Agent perplexity-agent | GPT-5 Mini openai/gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Perplexity Agent perplexity-agent | GPT-5.1 openai/gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Perplexity Agent perplexity-agent | GPT-5.2 openai/gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Poe poe | GPT-5 openai/gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Aug 5, 2025 | |
Poe poe | GPT-5-Codex openai/gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Sep 23, 2025 | |
Poe poe | GPT-5-mini openai/gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.22 / 1M | $1.8 / 1M | Yes | Yes | Jun 25, 2025 | |
Poe poe | GPT-5-nano openai/gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.045 / 1M | $0.36 / 1M | Yes | Yes | Aug 5, 2025 | |
Poe poe | GPT-5-Pro openai/gpt-5-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $14 / 1M | $110 / 1M | Yes | Yes | Oct 6, 2025 | |
Poe poe | GPT-5.1 openai/gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Nov 12, 2025 | |
Poe poe | GPT-5.1-Codex openai/gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Nov 12, 2025 | |
Poe poe | GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | Unknown | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Dec 8, 2025 | |
Poe poe | GPT-5.2 openai/gpt-5.2 | Unknown | text, image | text | 400,000 | 128,000 | $1.6 / 1M | $13 / 1M | Yes | Yes | Dec 8, 2025 | |
Poe poe | GPT-5.2-Codex openai/gpt-5.2-codex | Unknown | text, image | text | 400,000 | 128,000 | $1.6 / 1M | $13 / 1M | Yes | Yes | Jan 14, 2026 | |
Poe poe | GPT-5.2-Pro openai/gpt-5.2-pro | Unknown | text, image | text | 400,000 | 128,000 | $19 / 1M | $150 / 1M | Yes | Yes | Dec 11, 2025 | |
Poe poe | GPT-5.3-Codex openai/gpt-5.3-codex | Unknown | text, image | text | 400,000 | 128,000 | $1.6 / 1M | $13 / 1M | Yes | Yes | Feb 10, 2026 | |
Poe poe | GPT-5.4-Mini openai/gpt-5.4-mini | Unknown | text, image | text | 400,000 | 128,000 | $0.68 / 1M | $4 / 1M | Yes | Yes | Mar 12, 2026 | |
Poe poe | GPT-5.4-Nano openai/gpt-5.4-nano | Unknown | text, image | text | 400,000 | 128,000 | $0.18 / 1M | $1.1 / 1M | Yes | Yes | Mar 11, 2026 | |
QiHang qihang-ai | GPT-5.2 gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Dec 11, 2025 | |
QiHang qihang-ai | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $0.14 / 1M | $1.14 / 1M | Yes | Yes | Dec 11, 2025 | |
Qiniu qiniu-ai | OpenAI/GPT-5.2 openai/gpt-5.2 | Unknown | text, image | text | 400,000 | 128,000 | Unknown | Unknown | Yes | Yes | Dec 11, 2025 | |
Requesty requesty | GPT-5 Chat (latest) openai/gpt-5-chat | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
Requesty requesty | GPT-5 Codex openai/gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
Requesty requesty | GPT-5 Pro openai/gpt-5-pro | gpt-pro | text, image | text | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Oct 6, 2025 | |
Requesty requesty | GPT-5.1 openai/gpt-5.1 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Requesty requesty | GPT-5.1-Codex openai/gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Requesty requesty | GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.1 / 1M | $9 / 1M | Yes | Yes | Nov 13, 2025 | |
Requesty requesty | GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | gpt-codex | text, image | text | 400,000 | 100,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
Requesty requesty | GPT-5.2 openai/gpt-5.2 | gpt | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Requesty requesty | GPT-5.2 Pro openai/gpt-5.2-pro | gpt-pro | text, image | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Dec 11, 2025 | |
Requesty requesty | GPT-5.2-Codex openai/gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
SAP AI Core sap-ai-core | gpt-5 gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
SAP AI Core sap-ai-core | gpt-5-mini gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
SAP AI Core sap-ai-core | gpt-5-nano gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Venice AI venice | GPT-5.3 Codex openai-gpt-53-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $2.19 / 1M | $17.5 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | GPT-5.4 Mini openai-gpt-54-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.9375 / 1M | $5.625 / 1M | Yes | Yes | Mar 31, 2026 | |
Vercel AI Gateway vercel | GPT-5 openai/gpt-5 | gpt | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5 Mini openai/gpt-5-mini | gpt-mini | text, image | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5 Nano openai/gpt-5-nano | gpt-nano | text, image | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5-Codex openai/gpt-5-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 15, 2025 | |
Vivgrid vivgrid | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Vivgrid vivgrid | GPT-5.1 Codex Max gpt-5.1-codex-max | gpt-codex | text, image | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Vivgrid vivgrid | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
Vivgrid vivgrid | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
ZenMux zenmux | GPT-5 Codex openai/gpt-5-codex | Unknown | text, image | text | 400,000 | 64,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Sep 23, 2025 | |
ZenMux zenmux | GPT-5.1-Codex openai/gpt-5.1-codex | Unknown | text, image | text | 400,000 | 64,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
NanoGPT nano-gpt | Llama 4 Scout meta-llama/llama-4-scout | llama | text, image | text | 328,000 | 65,536 | $0.085 / 1M | $0.46 / 1M | No | Yes | Sep 5, 2025 | |
Synthetic synthetic | Llama-4-Scout-17B-16E-Instruct hf:meta-llama/Llama-4-Scout-17B-16E-Instruct | llama | text, image | text | 328,000 | 4,096 | $0.15 / 1M | $0.6 / 1M | No | Yes | Apr 5, 2025 | |
Kilo Gateway kilo | Meta: Llama 4 Scout meta-llama/llama-4-scout | Unknown | text, image | text | 327,680 | 16,384 | $0.08 / 1M | $0.3 / 1M | No | Yes | Apr 5, 2025 | |
Cortecs cortecs | Nova Pro 1.0 nova-pro-v1 | nova-pro | text, image | text | 300,000 | 5,000 | $1.016 / 1M | $4.061 / 1M | No | Yes | Dec 3, 2024 | |
Kilo Gateway kilo | Amazon: Nova Pro 1.0 amazon/nova-pro-v1 | Unknown | text, image | text | 300,000 | 5,120 | $0.8 / 1M | $3.2 / 1M | No | Yes | Dec 3, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-5 gpt-5 | gpt | text, image | text | 272,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 272,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 272,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Azure azure | GPT-5 gpt-5 | gpt | text, image | text | 272,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Azure azure | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 272,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Azure azure | GPT-5 Nano gpt-5-nano | gpt-nano | text, image | text | 272,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Aug 7, 2025 | |
Firmware firmware | GPT-5.4 gpt-5-4 | gpt | text, image | text | 272,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Vivgrid vivgrid | GPT-5 Mini gpt-5-mini | gpt-mini | text, image | text | 272,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
GitHub Copilot github-copilot | GPT-5-mini gpt-5-mini | gpt-mini | text, image | text | 264,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 13, 2025 | |
GitHub Copilot github-copilot | GPT-5.1 gpt-5.1 | gpt | text, image | text | 264,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 13, 2025 | |
GitHub Copilot github-copilot | GPT-5.2 gpt-5.2 | gpt | text, image | text | 264,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-VL Plus qwen3-vl-plus | qwen | text, image | text | 262,144 | 32,768 | $0.143353 / 1M | $1.433525 / 1M | Yes | Yes | Sep 23, 2025 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | Kimi K2.5 kimi-k2.5 | kimi | text, image | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 27, 2026 | |
Alibaba alibaba | Qwen3-VL Plus qwen3-vl-plus | qwen | text, image | text | 262,144 | 32,768 | $0.2 / 1M | $1.6 / 1M | Yes | Yes | Sep 23, 2025 | |
Azure Cognitive Services azure-cognitive-services | Kimi K2.5 kimi-k2.5 | kimi | text, image | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Feb 6, 2026 | |
Azure azure | Kimi K2.5 kimi-k2.5 | kimi | text, image | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Feb 6, 2026 | |
Azure azure | Kimi K2.6 kimi-k2.6 | kimi | text, image | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 22, 2026 | |
Chutes chutes | Qwen3 VL 235B A22B Instruct Qwen/Qwen3-VL-235B-A22B-Instruct | qwen | text, image | text | 262,144 | 262,144 | $0.3 / 1M | $1.2 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen3.5 397B A17B TEE Qwen/Qwen3.5-397B-A17B-TEE | qwen | text, image | text | 262,144 | 65,536 | $0.39 / 1M | $2.34 / 1M | Yes | Yes | Feb 18, 2026 | |
Clarifai clarifai | Ministral 3 14B Reasoning 2512 mistralai/completion/models/Ministral-3-14B-Reasoning-2512 | ministral | text, image | text | 262,144 | 262,144 | $2.5 / 1M | $1.7 / 1M | Yes | Yes | Dec 12, 2025 | |
Clarifai clarifai | Ministral 3 3B Reasoning 2512 mistralai/completion/models/Ministral-3-3B-Reasoning-2512 | ministral | text, image | text | 262,144 | 262,144 | $1.039 / 1M | $0.54825 / 1M | Yes | Yes | Feb 25, 2026 | |
Hugging Face huggingface | Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B | qwen | text, image | text | 262,144 | 32,768 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Feb 1, 2026 | |
Kilo Gateway kilo | Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512 | Unknown | text, image | text | 262,144 | 52,429 | $0.2 / 1M | $0.2 / 1M | No | Yes | Dec 16, 2025 | |
Kilo Gateway kilo | Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512 | Unknown | text, image | text | 262,144 | 52,429 | $0.5 / 1M | $1.5 / 1M | No | Yes | Dec 16, 2025 | |
Kilo Gateway kilo | MoonshotAI: Kimi K2.6 moonshotai/kimi-k2.6 | Unknown | text, image | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 20, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | Unknown | text, image | text | 262,144 | 52,429 | $0.2 / 1M | $0.88 / 1M | No | Yes | Jan 10, 2026 | |
LLM Gateway llmgateway | Ministral 14B ministral-14b-2512 | mistral | text, image | text | 262,144 | 8,192 | $0.2 / 1M | $0.2 / 1M | No | No | Dec 2, 2025 | |
LLM Gateway llmgateway | Ministral 8B ministral-8b-2512 | mistral | text, image | text | 262,144 | 8,192 | $0.15 / 1M | $0.15 / 1M | No | No | Dec 2, 2025 | |
LLM Gateway llmgateway | Mistral Large (latest) mistral-large-latest | mistral-large | text, image | text | 262,144 | 262,144 | $0.5 / 1M | $1.5 / 1M | No | Yes | Dec 2, 2025 | |
LLM Gateway llmgateway | Mistral Large 3 mistral-large-2512 | mistral-large | text, image | text | 262,144 | 262,144 | $0.5 / 1M | $1.5 / 1M | No | Yes | Dec 2, 2025 | |
LLM Gateway llmgateway | Qwen3-VL Plus qwen3-vl-plus | qwen | text, image | text | 262,144 | 32,768 | $0.2 / 1M | $1.6 / 1M | Yes | Yes | Sep 23, 2025 | |
Meganova meganova | Kimi K2.5 moonshotai/Kimi-K2.5 | kimi | text, image | text | 262,144 | 262,144 | $0.45 / 1M | $2.8 / 1M | Yes | Yes | Jan 27, 2026 | |
Mistral mistral | Mistral Large (latest) mistral-large-latest | mistral-large | text, image | text | 262,144 | 262,144 | $0.5 / 1M | $1.5 / 1M | No | Yes | Dec 2, 2025 | |
Mistral mistral | Mistral Large 3 mistral-large-2512 | mistral-large | text, image | text | 262,144 | 262,144 | $0.5 / 1M | $1.5 / 1M | No | Yes | Dec 2, 2025 | |
Mistral mistral | Mistral Medium 3.1 mistral-medium-2508 | mistral-medium | text, image | text | 262,144 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Aug 12, 2025 | |
NanoGPT nano-gpt | Ministral 3 14B mistralai/ministral-14b-instruct-2512 | ministral | text, image | text | 262,144 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | No | Dec 2, 2025 | |
NanoGPT nano-gpt | Mistral Large 3 675B mistralai/mistral-large-3-675b-instruct-2512 | mistral-large | text, image | text | 262,144 | 256,000 | $1 / 1M | $3 / 1M | No | No | Dec 2, 2025 | |
NovitaAI novita-ai | Gemma 4 26B A4B google/gemma-4-26b-a4b-it | gemma | text, image | text | 262,144 | 131,072 | $0.13 / 1M | $0.4 / 1M | Yes | Yes | Apr 2, 2026 | |
NovitaAI novita-ai | Gemma 4 31B google/gemma-4-31b-it | gemma | text, image | text | 262,144 | 131,072 | $0.14 / 1M | $0.4 / 1M | Yes | Yes | Apr 2, 2026 | |
Nvidia nvidia | Ministral 3 14B Instruct 2512 mistralai/ministral-14b-instruct-2512 | ministral | text, image | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | No | Yes | Dec 8, 2025 | |
Nvidia nvidia | Mistral Large 3 675B Instruct 2512 mistralai/mistral-large-3-675b-instruct-2512 | mistral-large | text, image | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | No | Yes | Dec 2, 2025 | |
Nvidia nvidia | Qwen3.5-397B-A17B qwen/qwen3.5-397b-a17b | qwen | text, image | text | 262,144 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 16, 2026 | |
Ollama Cloud ollama-cloud | devstral-small-2:24b devstral-small-2:24b | devstral | text, image | text | 262,144 | 262,144 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | gemma4:31b gemma4:31b | gemma | text, image | text | 262,144 | 262,144 | Unknown | Unknown | Yes | Yes | Apr 8, 2026 | |
Ollama Cloud ollama-cloud | kimi-k2.5 kimi-k2.5 | kimi | text, image | text | 262,144 | 262,144 | Unknown | Unknown | Yes | Yes | Jan 27, 2026 | |
Ollama Cloud ollama-cloud | kimi-k2.6:cloud kimi-k2.6:cloud | kimi | text, image | text | 262,144 | 262,144 | Unknown | Unknown | Yes | Yes | Apr 20, 2026 | |
Ollama Cloud ollama-cloud | ministral-3:14b ministral-3:14b | ministral | text, image | text | 262,144 | 128,000 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | ministral-3:3b ministral-3:3b | ministral | text, image | text | 262,144 | 128,000 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | ministral-3:8b ministral-3:8b | ministral | text, image | text | 262,144 | 128,000 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | mistral-large-3:675b mistral-large-3:675b | mistral-large | text, image | text | 262,144 | 262,144 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | qwen3-vl:235b qwen3-vl:235b | qwen | text, image | text | 262,144 | 32,768 | Unknown | Unknown | Yes | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | qwen3-vl:235b-instruct qwen3-vl:235b-instruct | qwen | text, image | text | 262,144 | 131,072 | Unknown | Unknown | No | Yes | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | qwen3.5:397b qwen3.5:397b | qwen | text, image | text | 262,144 | 81,920 | Unknown | Unknown | Yes | Yes | Feb 17, 2026 | |
OpenRouter openrouter | Kimi K2.6 moonshotai/kimi-k2.6 | kimi | text, image | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 20, 2026 | |
OpenRouter openrouter | Mistral Medium 3.1 mistralai/mistral-medium-3.1 | mistral-medium | text, image | text | 262,144 | 262,144 | $0.4 / 1M | $2 / 1M | No | Yes | Aug 12, 2025 | |
OpenRouter openrouter | Mistral Small 4 mistralai/mistral-small-2603 | mistral-small | text, image | text | 262,144 | 262,144 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Mar 16, 2026 | |
Poe poe | Gemma-4-31B google/gemma-4-31b | Unknown | text, image | text | 262,144 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Apr 2, 2026 | |
Poe poe | Kimi-K2.5-FW fireworks-ai/kimi-k2.5-fw | Unknown | text, image | text | 262,144 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Jan 27, 2026 | |
Regolo AI regolo-ai | Qwen3.5-122B qwen3.5-122b | qwen | text, image | text | 262,144 | 16,384 | $0.9 / 1M | $3.6 / 1M | Yes | Yes | Feb 1, 2026 | |
Regolo AI regolo-ai | Qwen3.5-9B qwen3.5-9b | qwen | text, image | text | 262,144 | 8,192 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Feb 1, 2026 | |
Synthetic synthetic | Kimi K2.5 hf:moonshotai/Kimi-K2.5 | kimi | text, image | text | 262,144 | 65,536 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Jan 1, 2026 | |
Synthetic synthetic | Kimi K2.5 (NVFP4) hf:nvidia/Kimi-K2.5-NVFP4 | kimi | text, image | text | 262,144 | 65,536 | $0.55 / 1M | $2.19 / 1M | Yes | Yes | Jan 1, 2026 | |
Synthetic synthetic | Qwen3.5-397B-A17B hf:Qwen/Qwen3.5-397B-A17B | qwen | text, image | text | 262,144 | 65,536 | $0.6 / 1M | $3 / 1M | Yes | Yes | Feb 11, 2026 | |
Together AI togetherai | Gemma 4 31B Instruct google/gemma-4-31B-it | gemma | text, image | text | 262,144 | 131,072 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Apr 7, 2026 | |
Together AI togetherai | Kimi K2.5 moonshotai/Kimi-K2.5 | kimi | text, image | text | 262,144 | 262,144 | $0.5 / 1M | $2.8 / 1M | Yes | Yes | Jan 27, 2026 | |
Together AI togetherai | Qwen3.5 397B A17B Qwen/Qwen3.5-397B-A17B | qwen | text, image | text | 262,144 | 130,000 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Feb 16, 2026 | |
Weights & Biases wandb | Kimi K2.5 moonshotai/Kimi-K2.5 | kimi | text, image | text | 262,144 | 262,144 | $0.5 / 1M | $2.85 / 1M | Yes | Yes | Mar 12, 2026 | |
Amazon Bedrock amazon-bedrock | Qwen/Qwen3-VL-235B-A22B-Instruct qwen.qwen3-vl-235b-a22b | qwen | text, image | text | 262,000 | 262,000 | $0.3 / 1M | $1.5 / 1M | No | Yes | Nov 25, 2025 | |
Cortecs cortecs | Devstral Small 2 2512 devstral-small-2512 | Unknown | text, image | text | 262,000 | 262,000 | $0 / 1M | $0 / 1M | No | Yes | Dec 9, 2025 | |
Fireworks AI fireworks-ai | Kimi K2.6 accounts/fireworks/models/kimi-k2p6 | kimi-thinking | text, image | text | 262,000 | 262,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 17, 2026 | |
SiliconFlow (China) siliconflow-cn | Pro/moonshotai/Kimi-K2.5 Pro/moonshotai/Kimi-K2.5 | kimi | text, image | text | 262,000 | 262,000 | $0.45 / 1M | $2.25 / 1M | Yes | Yes | Jan 27, 2026 | |
SiliconFlow (China) siliconflow-cn | Pro/moonshotai/Kimi-K2.6 Pro/moonshotai/Kimi-K2.6 | kimi | text, image | text | 262,000 | 262,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-235B-A22B-Instruct Qwen/Qwen3-VL-235B-A22B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.3 / 1M | $1.5 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-235B-A22B-Thinking Qwen/Qwen3-VL-235B-A22B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.45 / 1M | $3.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.29 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.29 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-32B-Instruct Qwen/Qwen3-VL-32B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.2 / 1M | $0.6 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-32B-Thinking Qwen/Qwen3-VL-32B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-8B-Instruct Qwen/Qwen3-VL-8B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.18 / 1M | $0.68 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-VL-8B-Thinking Qwen/Qwen3-VL-8B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.18 / 1M | $2 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | moonshotai/Kimi-K2.5 moonshotai/Kimi-K2.5 | kimi | text, image | text | 262,000 | 262,000 | $0.45 / 1M | $2.25 / 1M | Yes | Yes | Jan 27, 2026 | |
SiliconFlow siliconflow | moonshotai/Kimi-K2.6 moonshotai/Kimi-K2.6 | kimi | text, image | text | 262,000 | 262,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-235B-A22B-Instruct Qwen/Qwen3-VL-235B-A22B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.3 / 1M | $1.5 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-235B-A22B-Thinking Qwen/Qwen3-VL-235B-A22B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.45 / 1M | $3.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.29 / 1M | $1 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.29 / 1M | $1 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-32B-Instruct Qwen/Qwen3-VL-32B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.2 / 1M | $0.6 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-32B-Thinking Qwen/Qwen3-VL-32B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.2 / 1M | $1.5 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-8B-Instruct Qwen/Qwen3-VL-8B-Instruct | qwen | text, image | text | 262,000 | 262,000 | $0.18 / 1M | $0.68 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-VL-8B-Thinking Qwen/Qwen3-VL-8B-Thinking | qwen | text, image | text | 262,000 | 262,000 | $0.18 / 1M | $2 / 1M | Yes | Yes | Nov 25, 2025 | |
302.AI 302ai | doubao-seed-1-6-thinking-250715 doubao-seed-1-6-thinking-250715 | Unknown | text, image | text | 256,000 | 16,000 | $0.121 / 1M | $1.21 / 1M | Yes | Yes | Jul 15, 2025 | |
302.AI 302ai | doubao-seed-1-6-vision-250815 doubao-seed-1-6-vision-250815 | Unknown | text, image | text | 256,000 | 32,000 | $0.114 / 1M | $1.143 / 1M | No | Yes | Sep 30, 2025 | |
302.AI 302ai | doubao-seed-code-preview-251028 doubao-seed-code-preview-251028 | Unknown | text, image | text | 256,000 | 32,000 | $0.17 / 1M | $1.14 / 1M | No | Yes | Nov 11, 2025 | |
Abacus abacus | Grok 4 grok-4-0709 | grok | text, image | text | 256,000 | 16,384 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
Abacus abacus | Grok Code Fast 1 grok-code-fast-1 | grok | text, image | text | 256,000 | 16,384 | $0.2 / 1M | $1.5 / 1M | No | Yes | Sep 1, 2025 | |
Amazon Bedrock amazon-bedrock | Kimi K2.5 moonshotai.kimi-k2.5 | kimi | text, image | text | 256,000 | 256,000 | $0.6 / 1M | $3 / 1M | Yes | Yes | Feb 6, 2026 | |
Amazon Bedrock amazon-bedrock | Ministral 3 3B mistral.ministral-3-3b-instruct | ministral | text, image | text | 256,000 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | Yes | Dec 2, 2025 | |
Amazon Bedrock amazon-bedrock | Mistral Large 3 mistral.mistral-large-3-675b-instruct | mistral | text, image | text | 256,000 | 8,192 | $0.5 / 1M | $1.5 / 1M | No | Yes | Dec 2, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Kimi K2.5 workers-ai/@cf/moonshotai/kimi-k2.5 | kimi | text, image | text | 256,000 | 256,000 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Kimi K2.6 workers-ai/@cf/moonshotai/kimi-k2.6 | kimi | text, image | text | 256,000 | 256,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 20, 2026 | |
Cloudflare Workers AI cloudflare-workers-ai | Gemma 4 26B A4B IT @cf/google/gemma-4-26b-a4b-it | gemma | text, image | text | 256,000 | 16,384 | $0.1 / 1M | $0.3 / 1M | Yes | Yes | Dec 15, 2025 | |
Cloudflare Workers AI cloudflare-workers-ai | Kimi K2.5 @cf/moonshotai/kimi-k2.5 | kimi | text, image | text | 256,000 | 256,000 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
Cloudflare Workers AI cloudflare-workers-ai | Kimi K2.6 @cf/moonshotai/kimi-k2.6 | kimi | text, image | text | 256,000 | 256,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 20, 2026 | |
Cortecs cortecs | Kimi K2.6 kimi-k2.6 | kimi-thinking | text, image | text | 256,000 | 256,000 | $0.81 / 1M | $3.54 / 1M | Yes | Yes | Apr 17, 2026 | |
Fireworks AI fireworks-ai | Kimi K2.5 Turbo (firepass) accounts/fireworks/routers/kimi-k2p5-turbo | kimi-thinking | text, image | text | 256,000 | 256,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 27, 2026 | |
Firmware firmware | Kimi-K2.5 kimi-k2.5 | Unknown | text, image | text | 256,000 | 128,000 | $0.6 / 1M | $3 / 1M | Yes | Yes | Unknown | |
Firmware firmware | Kimi-K2.6 kimi-k2-6 | Unknown | text, image | text | 256,000 | 128,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Unknown | |
Google google | Gemma 4 26B gemma-4-26b-a4b-it | gemma | text, image | text | 256,000 | 8,192 | Unknown | Unknown | Yes | Yes | Apr 2, 2026 | |
Google google | Gemma 4 31B gemma-4-31b-it | gemma | text, image | text | 256,000 | 8,192 | Unknown | Unknown | Yes | Yes | Apr 2, 2026 | |
iFlow iflowcn | Qwen3-VL-Plus qwen3-vl-plus | qwen | text, image | text | 256,000 | 32,000 | $0 / 1M | $0 / 1M | No | Yes | Jan 1, 2025 | |
Jiekou.AI jiekou | grok-4-0709 grok-4-0709 | grok | text, image | text | 256,000 | 8,192 | $2.7 / 1M | $13.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | grok-code-fast-1 grok-code-fast-1 | grok | text, image | text | 256,000 | 256,000 | $0.18 / 1M | $1.35 / 1M | No | Yes | Jan 1, 2026 | |
LLM Gateway llmgateway | Qwen3 Max (2026-01-23) qwen3-max-2026-01-23 | qwen | text, image | text | 256,000 | 32,800 | $3 / 1M | $15 / 1M | Yes | Yes | Jan 23, 2026 | |
LLM Gateway llmgateway | Seed 1.6 (250615) seed-1-6-250615 | seed | text, image | text | 256,000 | 8,192 | $0.25 / 1M | $2 / 1M | Yes | Yes | Jun 25, 2025 | |
LLM Gateway llmgateway | Seed 1.6 (250915) seed-1-6-250915 | seed | text, image | text | 256,000 | 8,192 | $0.25 / 1M | $2 / 1M | Yes | Yes | Sep 15, 2025 | |
LLM Gateway llmgateway | Seed 1.6 Flash (250715) seed-1-6-flash-250715 | seed | text, image | text | 256,000 | 8,192 | $0.07 / 1M | $0.3 / 1M | Yes | Yes | Jul 26, 2025 | |
LLM Gateway llmgateway | Seed 1.8 (251228) seed-1-8-251228 | seed | text, image | text | 256,000 | 8,192 | $0.25 / 1M | $2 / 1M | Yes | Yes | Dec 18, 2025 | |
Mistral mistral | Devstral Small 2 labs-devstral-small-2512 | devstral | text, image | text | 256,000 | 256,000 | $0 / 1M | $0 / 1M | No | Yes | Dec 9, 2025 | |
Mistral mistral | Mistral Small (latest) mistral-small-latest | mistral-small | text, image | text | 256,000 | 256,000 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Mar 16, 2026 | |
Mistral mistral | Mistral Small 4 mistral-small-2603 | mistral-small | text, image | text | 256,000 | 256,000 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Mar 16, 2026 | |
NanoGPT nano-gpt | Grok 4 x-ai/grok-4-07-09 | grok | text, image | text | 256,000 | 131,072 | $3 / 1M | $15 / 1M | Yes | No | Jul 9, 2025 | |
NanoGPT nano-gpt | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image | text | 256,000 | 65,536 | $0.3 / 1M | $1.9 / 1M | No | Yes | Jan 26, 2026 | |
NanoGPT nano-gpt | Kimi K2.5 Thinking moonshotai/kimi-k2.5:thinking | kimi-thinking | text, image | text | 256,000 | 65,536 | $0.3 / 1M | $1.9 / 1M | Yes | Yes | Jan 26, 2026 | |
NanoGPT nano-gpt | Kimi K2.6 moonshotai/kimi-k2.6 | kimi-k2.6 | text, image | text | 256,000 | 65,536 | $0.53 / 1M | $2.73 / 1M | No | Yes | Apr 21, 2026 | |
NanoGPT nano-gpt | Kimi K2.6 Thinking moonshotai/kimi-k2.6:thinking | kimi-thinking | text, image | text | 256,000 | 65,536 | $0.53 / 1M | $2.73 / 1M | Yes | Yes | Apr 21, 2026 | |
Nebius Token Factory nebius | Kimi-K2.5 moonshotai/Kimi-K2.5 | kimi | text, image | text | 256,000 | 8,192 | $0.5 / 1M | $2.5 / 1M | Yes | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Kimi-K2.5-fast moonshotai/Kimi-K2.5-fast | kimi | text, image | text | 256,000 | 8,192 | $0.5 / 1M | $2.5 / 1M | Yes | Yes | Feb 4, 2026 | |
Poe poe | Grok-4 xai/grok-4 | grok | text, image | text | 256,000 | 128,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 10, 2025 | |
Qiniu qiniu-ai | Qwen3.5 397B A17B qwen3.5-397b-a17b | Unknown | text, image | text | 256,000 | 64,000 | Unknown | Unknown | Yes | Yes | Feb 22, 2026 | |
Regolo AI regolo-ai | Mistral Small 4 119B mistral-small-4-119b | mistral-small | text, image | text | 256,000 | 16,384 | $0.75 / 1M | $3 / 1M | Yes | Yes | Mar 15, 2026 | |
Requesty requesty | Grok 4 xai/grok-4 | grok | text, image | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 9, 2025 | |
Venice AI venice | Gemma 4 Uncensored gemma-4-uncensored | gemma | text, image | text | 256,000 | 8,192 | $0.1625 / 1M | $0.5 / 1M | No | Yes | Apr 19, 2026 | |
Venice AI venice | GPT-5.2 Codex openai-gpt-52-codex | gpt-codex | text, image | text | 256,000 | 65,536 | $2.19 / 1M | $17.5 / 1M | Yes | Yes | Mar 12, 2026 | |
Venice AI venice | Kimi K2.5 kimi-k2-5 | kimi | text, image | text | 256,000 | 65,536 | $0.56 / 1M | $3.5 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | Kimi K2.6 kimi-k2-6 | kimi | text, image | text | 256,000 | 65,536 | $0.7448 / 1M | $4.655 / 1M | Yes | Yes | Apr 24, 2026 | |
Venice AI venice | Mistral Small 4 mistral-small-2603 | mistral-small | text, image | text | 256,000 | 65,536 | $0.1875 / 1M | $0.75 / 1M | Yes | Yes | Apr 9, 2026 | |
Venice AI venice | Qwen 3.5 9B qwen3-5-9b | qwen | text, image | text | 256,000 | 32,768 | $0.1 / 1M | $0.15 / 1M | Yes | Yes | Apr 19, 2026 | |
Venice AI venice | Qwen3 VL 235B qwen3-vl-235b-a22b | qwen | text, image | text | 256,000 | 16,384 | $0.25 / 1M | $1.5 / 1M | No | Yes | Mar 12, 2026 | |
Vercel AI Gateway vercel | Mistral Large 3 mistral/mistral-large-3 | mistral-large | text, image | text | 256,000 | 256,000 | $0.5 / 1M | $1.5 / 1M | No | No | Dec 2, 2025 | |
Vercel AI Gateway vercel | Mistral Small (latest) mistral/mistral-small | mistral-small | text, image | text | 256,000 | 256,000 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Mar 16, 2026 | |
Vercel AI Gateway vercel | Seed 1.8 bytedance/seed-1.8 | seed | text, image | text | 256,000 | 64,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Oct 1, 2025 | |
ZenMux zenmux | Agnes 1.5 Lite sapiens-ai/agnes-1.5-lite | Unknown | text, image | text | 256,000 | 256,000 | $0.12 / 1M | $0.6 / 1M | No | Yes | Mar 26, 2026 | |
ZenMux zenmux | Doubao-Seed-Code volcengine/doubao-seed-code | Unknown | text, image | text | 256,000 | 64,000 | $0.17 / 1M | $1.12 / 1M | Yes | Yes | Nov 11, 2025 | |
302.AI 302ai | doubao-seed-1-8-251215 doubao-seed-1-8-251215 | Unknown | text, image | text | 224,000 | 64,000 | $0.114 / 1M | $0.286 / 1M | No | Yes | Dec 18, 2025 | |
STACKIT stackit | Qwen3-VL 235B Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 | qwen | text, image | text | 218,000 | 8,192 | $1.64 / 1M | $1.91 / 1M | No | Yes | Nov 1, 2024 | |
GitHub Copilot github-copilot | Claude Sonnet 4 claude-sonnet-4 | claude-sonnet | text, image | text | 216,000 | 16,000 | $0 / 1M | $0 / 1M | Yes | Yes | May 22, 2025 | |
Deep Infra deepinfra | GLM-4.6V zai-org/GLM-4.6V | glm | text, image | text | 204,800 | 131,072 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Sep 30, 2025 | |
Vercel AI Gateway vercel | MiniMax M2.7 High Speed minimax/minimax-m2.7-highspeed | minimax | text, image | text | 204,800 | 131,100 | $0.6 / 1M | $2.4 / 1M | Yes | Yes | Mar 18, 2026 | |
Amazon Bedrock amazon-bedrock | Google Gemma 3 27B Instruct google.gemma-3-27b-it | gemma | text, image | text | 202,752 | 8,192 | $0.12 / 1M | $0.2 / 1M | No | Yes | Jul 27, 2025 | |
302.AI 302ai | claude-opus-4-1-20250805 claude-opus-4-1-20250805 | Unknown | text, image | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | No | Yes | Aug 5, 2025 | |
302.AI 302ai | claude-opus-4-1-20250805-thinking claude-opus-4-1-20250805-thinking | Unknown | text, image | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 27, 2025 | |
302.AI 302ai | claude-opus-4-5-20251101-thinking claude-opus-4-5-20251101-thinking | Unknown | text, image | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 25, 2025 | |
302.AI 302ai | grok-4.1 grok-4.1 | Unknown | text, image | text | 200,000 | 64,000 | $2 / 1M | $10 / 1M | No | Yes | Nov 18, 2025 | |
Abacus abacus | o3 o3 | o | text, image | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
Abacus abacus | o3-pro o3-pro | o-pro | text, image | text | 200,000 | 100,000 | $20 / 1M | $40 / 1M | Yes | Yes | Jun 10, 2025 | |
Abacus abacus | o4-mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
AIHubMix aihubmix | o4-mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
Azure Cognitive Services azure-cognitive-services | o1 o1 | o | text, image | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
Azure Cognitive Services azure-cognitive-services | o3 o3 | o | text, image | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
Azure Cognitive Services azure-cognitive-services | o4-mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
Azure azure | o1 o1 | o | text, image | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
Azure azure | o3 o3 | o | text, image | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
Azure azure | o4-mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | o1 openai/o1 | o | text, image | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | o3 openai/o3 | o | text, image | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | o3-pro openai/o3-pro | o-pro | text, image | text | 200,000 | 100,000 | $20 / 1M | $80 / 1M | Yes | Yes | Jun 10, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | o4-mini openai/o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
Deep Infra deepinfra | Claude Opus 4 anthropic/claude-4-opus | claude-opus | text, image | text | 200,000 | 32,000 | $16.5 / 1M | $82.5 / 1M | Yes | Yes | Jun 12, 2025 | |
Deep Infra deepinfra | Claude Sonnet 3.7 (Latest) anthropic/claude-3-7-sonnet-latest | claude-sonnet | text, image | text | 200,000 | 64,000 | $3.3 / 1M | $16.5 / 1M | Yes | Yes | Mar 13, 2025 | |
GitHub Copilot github-copilot | Claude Sonnet 4.6 claude-sonnet-4.6 | claude-sonnet | text, image | text | 200,000 | 32,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 17, 2026 | |
GitHub Models github-models | OpenAI o1 openai/o1 | o | text, image | text | 200,000 | 100,000 | $0 / 1M | $0 / 1M | Yes | No | Dec 17, 2024 | |
GitHub Models github-models | OpenAI o3 openai/o3 | o | text, image | text | 200,000 | 100,000 | $0 / 1M | $0 / 1M | Yes | No | Jan 31, 2025 | |
GitHub Models github-models | OpenAI o4-mini openai/o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $0 / 1M | $0 / 1M | Yes | No | Jan 31, 2025 | |
Helicone helicone | Anthropic: Claude 3 Haiku claude-3-haiku-20240307 | claude-haiku | text, image | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 7, 2024 | |
Helicone helicone | Anthropic: Claude 3.5 Haiku claude-3.5-haiku | claude-haiku | text, image | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Helicone helicone | Anthropic: Claude 3.5 Sonnet v2 claude-3.5-sonnet-v2 | claude-sonnet | text, image | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
Helicone helicone | Anthropic: Claude 3.7 Sonnet claude-3.7-sonnet | claude-sonnet | text, image | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | No | Yes | Feb 19, 2025 | |
Helicone helicone | Anthropic: Claude 4.5 Haiku claude-4.5-haiku | claude-haiku | text, image | text | 200,000 | 8,192 | $1 / 1M | $5 / 1M | No | Yes | Oct 1, 2025 | |
Helicone helicone | Anthropic: Claude 4.5 Haiku (20251001) claude-haiku-4-5-20251001 | claude-haiku | text, image | text | 200,000 | 8,192 | $1 / 1M | $5 / 1M | No | Yes | Oct 1, 2025 | |
Helicone helicone | Anthropic: Claude Opus 4 claude-opus-4 | claude-opus | text, image | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 14, 2025 | |
Helicone helicone | Anthropic: Claude Opus 4.1 claude-opus-4-1 | claude-opus | text, image | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Helicone helicone | Anthropic: Claude Opus 4.1 (20250805) claude-opus-4-1-20250805 | claude-opus | text, image | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Helicone helicone | Anthropic: Claude Opus 4.5 claude-4.5-opus | claude-opus | text, image | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
Helicone helicone | Anthropic: Claude Sonnet 4 claude-sonnet-4 | claude-sonnet | text, image | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 14, 2025 | |
Helicone helicone | Anthropic: Claude Sonnet 4.5 claude-4.5-sonnet | claude-sonnet | text, image | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Helicone helicone | Anthropic: Claude Sonnet 4.5 (20250929) claude-sonnet-4-5-20250929 | claude-sonnet | text, image | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Helicone helicone | OpenAI o3 o3 | o | text, image | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | No | Yes | Jun 1, 2024 | |
Helicone helicone | OpenAI o3 Pro o3-pro | o-pro | text, image | text | 200,000 | 100,000 | $20 / 1M | $80 / 1M | No | Yes | Jun 1, 2024 | |
Helicone helicone | OpenAI o4 Mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | No | Yes | Jun 1, 2024 | |
Jiekou.AI jiekou | claude-opus-4-1-20250805 claude-opus-4-1-20250805 | claude-opus | text, image | text | 200,000 | 32,000 | $13.5 / 1M | $67.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | claude-opus-4-20250514 claude-opus-4-20250514 | claude-opus | text, image | text | 200,000 | 32,000 | $13.5 / 1M | $67.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | claude-opus-4-5-20251101 claude-opus-4-5-20251101 | claude-opus | text, image | text | 200,000 | 65,536 | $4.5 / 1M | $22.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | claude-sonnet-4-20250514 claude-sonnet-4-20250514 | claude-sonnet | text, image | text | 200,000 | 64,000 | $2.7 / 1M | $13.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | claude-sonnet-4-5-20250929 claude-sonnet-4-5-20250929 | claude-sonnet | text, image | text | 200,000 | 64,000 | $2.7 / 1M | $13.5 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | o4-mini o4-mini | o | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | No | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Anthropic: Claude 3 Haiku anthropic/claude-3-haiku | Unknown | text, image | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 7, 2024 | |
Kilo Gateway kilo | Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku | Unknown | text, image | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Kilo Gateway kilo | Perplexity: Sonar Pro perplexity/sonar-pro | Unknown | text, image | text | 200,000 | 8,000 | $3 / 1M | $15 / 1M | No | No | Sep 1, 2025 | |
LLM Gateway llmgateway | Claude 3 Opus claude-3-opus | claude | text, image | text | 200,000 | 4,096 | $15 / 1M | $75 / 1M | No | Yes | Mar 4, 2024 | |
LLM Gateway llmgateway | o4-mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
LLM Gateway llmgateway | Sonar Pro sonar-pro | sonar-pro | text, image | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | No | Sep 1, 2025 | |
NanoGPT nano-gpt | Claude 3.5 Sonnet claude-3-5-sonnet-20241022 | Unknown | text, image | text | 200,000 | 8,192 | $2.992 / 1M | $14.994 / 1M | No | Yes | Aug 26, 2025 | |
NanoGPT nano-gpt | Study Mode study_gpt-chatgpt-4o-latest | Unknown | text, image | text | 200,000 | 16,384 | $4.998 / 1M | $14.994 / 1M | No | No | May 13, 2024 | |
Nebius Token Factory nebius | Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct | Unknown | text, image | text | 200,000 | 8,192 | $0.5 / 1M | $2.4 / 1M | No | Yes | Feb 4, 2026 | |
OpenAI openai | o1-pro o1-pro | o-pro | text, image | text | 200,000 | 100,000 | $150 / 1M | $600 / 1M | Yes | Yes | Mar 19, 2025 | |
OpenAI openai | o3-deep-research o3-deep-research | o | text, image | text | 200,000 | 100,000 | $10 / 1M | $40 / 1M | Yes | Yes | Jun 26, 2024 | |
OpenAI openai | o3-pro o3-pro | o-pro | text, image | text | 200,000 | 100,000 | $20 / 1M | $80 / 1M | Yes | Yes | Jun 10, 2025 | |
OpenAI openai | o4-mini o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
OpenAI openai | o4-mini-deep-research o4-mini-deep-research | o-mini | text, image | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Jun 26, 2024 | |
OpenRouter openrouter | Free Models Router openrouter/free | Unknown | text, image | text | 200,000 | 8,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 1, 2026 | |
OpenRouter openrouter | o4 Mini openai/o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
Perplexity perplexity | Sonar Pro sonar-pro | sonar-pro | text, image | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | No | Sep 1, 2025 | |
Poe poe | o1 openai/o1 | o | text, image | text | 200,000 | 100,000 | $14 / 1M | $54 / 1M | Yes | Yes | Dec 18, 2024 | |
Poe poe | o1-pro openai/o1-pro | o-pro | text, image | text | 200,000 | 100,000 | $140 / 1M | $540 / 1M | Yes | Yes | Mar 19, 2025 | |
Poe poe | o3 openai/o3 | o | text, image | text | 200,000 | 100,000 | $1.8 / 1M | $7.2 / 1M | Yes | Yes | Apr 16, 2025 | |
Poe poe | o3-mini openai/o3-mini | o-mini | text, image | text | 200,000 | 100,000 | $0.99 / 1M | $4 / 1M | Yes | Yes | Jan 31, 2025 | |
Poe poe | o3-mini-high openai/o3-mini-high | o-mini | text, image | text | 200,000 | 100,000 | $0.99 / 1M | $4 / 1M | Yes | Yes | Jan 31, 2025 | |
Poe poe | o3-pro openai/o3-pro | o-pro | text, image | text | 200,000 | 100,000 | $18 / 1M | $72 / 1M | Yes | Yes | Jun 10, 2025 | |
Poe poe | o4-mini openai/o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $0.99 / 1M | $4 / 1M | Yes | Yes | Apr 16, 2025 | |
QiHang qihang-ai | Claude Opus 4.5 claude-opus-4-5-20251101 | claude-opus | text, image | text | 200,000 | 32,000 | $0.71 / 1M | $3.57 / 1M | Yes | Yes | Nov 1, 2025 | |
QiHang qihang-ai | GPT-5-Mini gpt-5-mini | gpt-mini | text, image | text | 200,000 | 64,000 | $0.04 / 1M | $0.29 / 1M | Yes | Yes | Sep 15, 2025 | |
Qiniu qiniu-ai | Claude 3.5 Haiku claude-3.5-haiku | Unknown | text, image | text | 200,000 | 8,192 | Unknown | Unknown | No | Yes | Aug 26, 2025 | |
Qiniu qiniu-ai | Claude 3.5 Sonnet claude-3.5-sonnet | Unknown | text, image | text | 200,000 | 8,200 | Unknown | Unknown | Yes | Yes | Sep 9, 2025 | |
Qiniu qiniu-ai | Claude 3.7 Sonnet claude-3.7-sonnet | Unknown | text, image | text | 200,000 | 128,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Claude 4.0 Opus claude-4.0-opus | Unknown | text, image | text | 200,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Claude 4.0 Sonnet claude-4.0-sonnet | Unknown | text, image | text | 200,000 | 64,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Claude 4.1 Opus claude-4.1-opus | Unknown | text, image | text | 200,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 6, 2025 | |
Qiniu qiniu-ai | Claude 4.5 Haiku claude-4.5-haiku | Unknown | text, image | text | 200,000 | 64,000 | Unknown | Unknown | Yes | Yes | Oct 16, 2025 | |
Qiniu qiniu-ai | Claude 4.5 Opus claude-4.5-opus | Unknown | text, image | text | 200,000 | 200,000 | Unknown | Unknown | Yes | Yes | Nov 25, 2025 | |
Qiniu qiniu-ai | Claude 4.5 Sonnet claude-4.5-sonnet | Unknown | text, image | text | 200,000 | 64,000 | Unknown | Unknown | Yes | Yes | Sep 30, 2025 | |
Requesty requesty | o4 Mini openai/o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
SAP AI Core sap-ai-core | sonar-pro sonar-pro | sonar-pro | text, image | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | No | Sep 1, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-5V-Turbo zai-org/GLM-5V-Turbo | glm | text, image | text | 200,000 | 131,072 | $1.2 / 1M | $4 / 1M | Yes | Yes | Apr 1, 2026 | |
Venice AI venice | GLM 5V Turbo z-ai-glm-5v-turbo | glmv | text, image | text | 200,000 | 32,768 | $1.5 / 1M | $5 / 1M | Yes | Yes | Apr 12, 2026 | |
Vercel AI Gateway vercel | o4-mini openai/o4-mini | o-mini | text, image | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2025 | |
Vercel AI Gateway vercel | Sonar Pro perplexity/sonar-pro | sonar-pro | text, image | text | 200,000 | 8,000 | $3 / 1M | $15 / 1M | No | Yes | Feb 19, 2025 | |
ZenMux zenmux | Claude 3.5 Haiku anthropic/claude-3.5-haiku | Unknown | text, image | text | 200,000 | 64,000 | $0.8 / 1M | $4 / 1M | No | Yes | Nov 4, 2024 | |
Venice AI venice | Claude Opus 4.5 claude-opus-4-5 | claude-opus | text, image | text | 198,000 | 32,768 | $6 / 1M | $30 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | Claude Sonnet 4.5 claude-sonnet-4-5 | claude-sonnet | text, image | text | 198,000 | 64,000 | $3.75 / 1M | $18.75 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | Google Gemma 3 27B Instruct google-gemma-3-27b-it | gemma | text, image | text | 198,000 | 16,384 | $0.12 / 1M | $0.2 / 1M | No | Yes | Mar 12, 2026 | |
LLM Gateway llmgateway | DeepSeek V3.2 deepseek-v3.2 | deepseek | text, image | text | 163,840 | 16,384 | $0.28 / 1M | $0.42 / 1M | Yes | Yes | Sep 29, 2025 | |
GitHub Copilot github-copilot | Claude Opus 4.5 claude-opus-4.5 | claude-opus | text, image | text | 160,000 | 32,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 1, 2025 | |
abliteration.ai abliteration-ai | Abliterated Model abliterated-model | Unknown | text, image | text | 150,000 | 8,192 | $3 / 1M | $3 / 1M | No | Yes | Jan 6, 2026 | |
GitHub Copilot github-copilot | Claude Haiku 4.5 claude-haiku-4.5 | claude-haiku | text, image | text | 144,000 | 32,000 | $0 / 1M | $0 / 1M | Yes | Yes | Oct 15, 2025 | |
GitHub Copilot github-copilot | Claude Opus 4.6 claude-opus-4.6 | claude-opus | text, image | text | 144,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 5, 2026 | |
GitHub Copilot github-copilot | Claude Opus 4.7 claude-opus-4.7 | claude-opus | text, image | text | 144,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 16, 2026 | |
GitHub Copilot github-copilot | Claude Sonnet 4.5 claude-sonnet-4.5 | claude-sonnet | text, image | text | 144,000 | 32,000 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 29, 2025 | |
Alibaba (China) alibaba-cn | QVQ Max qvq-max | qvq | text, image | text | 131,072 | 8,192 | $1.147 / 1M | $4.588 / 1M | Yes | Yes | Mar 25, 2025 | |
Alibaba (China) alibaba-cn | Qwen-VL Max qwen-vl-max | qwen | text, image | text | 131,072 | 8,192 | $0.23 / 1M | $0.574 / 1M | No | Yes | Aug 13, 2025 | |
Alibaba (China) alibaba-cn | Qwen-VL Plus qwen-vl-plus | qwen | text, image | text | 131,072 | 8,192 | $0.115 / 1M | $0.287 / 1M | No | Yes | Aug 15, 2025 | |
Alibaba (China) alibaba-cn | Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct | qwen | text, image | text | 131,072 | 8,192 | $2.294 / 1M | $6.881 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen2.5-VL 7B Instruct qwen2-5-vl-7b-instruct | qwen | text, image | text | 131,072 | 8,192 | $0.287 / 1M | $0.717 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba (China) alibaba-cn | Qwen3-VL 235B-A22B qwen3-vl-235b-a22b | qwen | text, image | text | 131,072 | 32,768 | $0.286705 / 1M | $1.14682 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-VL 30B-A3B qwen3-vl-30b-a3b | qwen | text, image | text | 131,072 | 32,768 | $0.108 / 1M | $0.431 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba alibaba | QVQ Max qvq-max | qvq | text, image | text | 131,072 | 8,192 | $1.2 / 1M | $4.8 / 1M | Yes | Yes | Mar 25, 2025 | |
Alibaba alibaba | Qwen-VL Max qwen-vl-max | qwen | text, image | text | 131,072 | 8,192 | $0.8 / 1M | $3.2 / 1M | No | Yes | Aug 13, 2025 | |
Alibaba alibaba | Qwen-VL Plus qwen-vl-plus | qwen | text, image | text | 131,072 | 8,192 | $0.21 / 1M | $0.63 / 1M | No | Yes | Aug 15, 2025 | |
Alibaba alibaba | Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct | qwen | text, image | text | 131,072 | 8,192 | $2.8 / 1M | $8.4 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba alibaba | Qwen2.5-VL 7B Instruct qwen2-5-vl-7b-instruct | qwen | text, image | text | 131,072 | 8,192 | $0.35 / 1M | $1.05 / 1M | No | Yes | Sep 1, 2024 | |
Alibaba alibaba | Qwen3-VL 235B-A22B qwen3-vl-235b-a22b | qwen | text, image | text | 131,072 | 32,768 | $0.7 / 1M | $2.8 / 1M | Yes | Yes | Apr 1, 2025 | |
Alibaba alibaba | Qwen3-VL 30B-A3B qwen3-vl-30b-a3b | qwen | text, image | text | 131,072 | 32,768 | $0.2 / 1M | $0.8 / 1M | Yes | Yes | Apr 1, 2025 | |
Amazon Bedrock amazon-bedrock | Google Gemma 3 12B google.gemma-3-12b-it | gemma | text, image | text | 131,072 | 8,192 | $0.05 / 1M | $0.1 / 1M | No | No | Dec 1, 2024 | |
Chutes chutes | dots.ocr rednote-hilab/dots.ocr | rednote | text, image | text | 131,072 | 131,072 | $0.01 / 1M | $0.01 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | gemma 3 12b it unsloth/gemma-3-12b-it | unsloth | text, image | text | 131,072 | 131,072 | $0.03 / 1M | $0.1 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | GLM 4.6V zai-org/GLM-4.6V | glm | text, image | text | 131,072 | 65,536 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Jan 10, 2026 | |
Chutes chutes | Mistral Small 3.1 24B Instruct 2503 chutesai/Mistral-Small-3.1-24B-Instruct-2503 | chutesai | text, image | text | 131,072 | 131,072 | $0.03 / 1M | $0.11 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Mistral Small 3.2 24B Instruct 2506 chutesai/Mistral-Small-3.2-24B-Instruct-2506 | chutesai | text, image | text | 131,072 | 131,072 | $0.06 / 1M | $0.18 / 1M | No | Yes | Jan 10, 2026 | |
Google google | Gemma 3 27B gemma-3-27b-it | gemma | text, image | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Mar 12, 2025 | |
Groq groq | Llama 4 Maverick 17B meta-llama/llama-4-maverick-17b-128e-instruct | llama | text, image | text | 131,072 | 8,192 | $0.2 / 1M | $0.6 / 1M | No | Yes | Apr 5, 2025 | |
Groq groq | Llama 4 Scout 17B meta-llama/llama-4-scout-17b-16e-instruct | llama | text, image | text | 131,072 | 8,192 | $0.11 / 1M | $0.34 / 1M | No | Yes | Apr 5, 2025 | |
Groq groq | Llama Guard 4 12B meta-llama/llama-guard-4-12b | llama | text, image | text | 131,072 | 1,024 | $0.2 / 1M | $0.2 / 1M | No | No | Apr 5, 2025 | |
Helicone helicone | Google Gemma 3 12B gemma-3-12b-it | gemma | text, image | text | 131,072 | 8,192 | $0.05 / 1M | $0.1 / 1M | No | No | Dec 1, 2024 | |
Helicone helicone | Meta Llama 4 Maverick 17B 128E llama-4-maverick | llama | text, image | text | 131,072 | 8,192 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | Meta Llama 4 Scout 17B 16E llama-4-scout | llama | text, image | text | 131,072 | 8,192 | $0.08 / 1M | $0.3 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | Meta Llama Guard 4 12B llama-guard-4 | llama | text, image | text | 131,072 | 1,024 | $0.21 / 1M | $0.21 / 1M | No | No | Jan 1, 2025 | |
Jiekou.AI jiekou | o3 o3 | Unknown | text, image | text | 131,072 | 131,072 | $10 / 1M | $40 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | o3-mini o3-mini | o | text, image | text | 131,072 | 131,072 | $1.1 / 1M | $4.4 / 1M | No | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | Unknown | text, image | text | 131,072 | 16,384 | $0.049 / 1M | $0.049 / 1M | No | No | Sep 25, 2024 | |
Kilo Gateway kilo | Mistral: Mistral Medium 3 mistralai/mistral-medium-3 | Unknown | text, image | text | 131,072 | 26,215 | $0.4 / 1M | $2 / 1M | No | Yes | May 7, 2025 | |
Kilo Gateway kilo | Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 | Unknown | text, image | text | 131,072 | 26,215 | $0.4 / 1M | $2 / 1M | No | Yes | Aug 12, 2025 | |
Kilo Gateway kilo | Qwen: Qwen VL Max qwen/qwen-vl-max | Unknown | text, image | text | 131,072 | 32,768 | $0.8 / 1M | $3.2 / 1M | No | Yes | Aug 13, 2025 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct | Unknown | text, image | text | 131,072 | 32,768 | $0.13 / 1M | $0.52 / 1M | No | Yes | Nov 25, 2025 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct | Unknown | text, image | text | 131,072 | 32,768 | $0.104 / 1M | $0.416 / 1M | No | Yes | Nov 25, 2025 | |
LLM Gateway llmgateway | Ministral 3B ministral-3b-2512 | mistral | text, image | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | No | Dec 2, 2025 | |
LLM Gateway llmgateway | Qwen Plus Latest qwen-plus-latest | qwen | text, image | text | 131,072 | 8,192 | $0.3 / 1M | $0.9 / 1M | No | Yes | Jan 25, 2025 | |
LLM Gateway llmgateway | Qwen-VL Max qwen-vl-max | qwen | text, image | text | 131,072 | 8,192 | $0.8 / 1M | $3.2 / 1M | No | Yes | Aug 13, 2025 | |
LLM Gateway llmgateway | Qwen-VL Plus qwen-vl-plus | qwen | text, image | text | 131,072 | 8,192 | $0.21 / 1M | $0.63 / 1M | No | Yes | Aug 15, 2025 | |
LLM Gateway llmgateway | Qwen2.5 VL 32B Instruct qwen2-5-vl-32b-instruct | qwen | text, image | text | 131,072 | 8,192 | $0.3 / 1M | $0.3 / 1M | No | Yes | Mar 15, 2025 | |
LLM Gateway llmgateway | Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct | qwen | text, image | text | 131,072 | 8,192 | $2.8 / 1M | $8.4 / 1M | No | Yes | Sep 1, 2024 | |
LLM Gateway llmgateway | Qwen3 VL 235B A22B Instruct qwen3-vl-235b-a22b-instruct | qwen | text, image | text | 131,072 | 8,192 | $0.8 / 1M | $2.4 / 1M | No | Yes | Sep 15, 2025 | |
LLM Gateway llmgateway | Qwen3 VL 235B A22B Thinking qwen3-vl-235b-a22b-thinking | qwen | text, image | text | 131,072 | 8,192 | $0.8 / 1M | $2.4 / 1M | Yes | Yes | Sep 15, 2025 | |
LLM Gateway llmgateway | Qwen3 VL 30B A3B Instruct qwen3-vl-30b-a3b-instruct | qwen | text, image | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | Yes | Oct 2, 2025 | |
LLM Gateway llmgateway | Qwen3 VL 30B A3B Thinking qwen3-vl-30b-a3b-thinking | qwen | text, image | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | Yes | Yes | Oct 2, 2025 | |
LLM Gateway llmgateway | Qwen3 VL 8B Instruct qwen3-vl-8b-instruct | qwen | text, image | text | 131,072 | 8,192 | $0.1 / 1M | $0.1 / 1M | No | No | Aug 19, 2025 | |
Mistral mistral | Mistral Medium 3 mistral-medium-2505 | mistral-medium | text, image | text | 131,072 | 131,072 | $0.4 / 1M | $2 / 1M | No | Yes | May 7, 2025 | |
NanoGPT nano-gpt | Mistral Medium 3 mistralai/mistral-medium-3 | mistral-medium | text, image | text | 131,072 | 32,768 | $0.4 / 1M | $2 / 1M | No | No | Sep 25, 2025 | |
NovitaAI novita-ai | Gemma 3 12B google/gemma-3-12b-it | gemma | text, image | text | 131,072 | 8,192 | $0.05 / 1M | $0.1 / 1M | No | No | Mar 13, 2025 | |
NovitaAI novita-ai | Llama 4 Scout Instruct meta-llama/llama-4-scout-17b-16e-instruct | Unknown | text, image | text | 131,072 | 131,072 | $0.18 / 1M | $0.59 / 1M | No | No | Apr 6, 2025 | |
NovitaAI novita-ai | OpenAI GPT OSS 120B openai/gpt-oss-120b | Unknown | text, image | text | 131,072 | 32,768 | $0.05 / 1M | $0.25 / 1M | Yes | Yes | Aug 6, 2025 | |
NovitaAI novita-ai | OpenAI: GPT OSS 20B openai/gpt-oss-20b | Unknown | text, image | text | 131,072 | 32,768 | $0.04 / 1M | $0.15 / 1M | Yes | No | Aug 6, 2025 | |
Nvidia nvidia | Gemma-3-27B-IT google/gemma-3-27b-it | gemma | text, image | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 5, 2025 | |
Ollama Cloud ollama-cloud | gemma3:12b gemma3:12b | gemma | text, image | text | 131,072 | 131,072 | Unknown | Unknown | No | No | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | gemma3:27b gemma3:27b | gemma | text, image | text | 131,072 | 131,072 | Unknown | Unknown | No | No | Jan 19, 2026 | |
Ollama Cloud ollama-cloud | gemma3:4b gemma3:4b | gemma | text, image | text | 131,072 | 131,072 | Unknown | Unknown | No | No | Jan 19, 2026 | |
OpenRouter openrouter | Gemma 3 12B google/gemma-3-12b-it | gemma | text, image | text | 131,072 | 131,072 | $0.03 / 1M | $0.1 / 1M | No | No | Mar 13, 2025 | |
OpenRouter openrouter | Gemma 3 27B (free) google/gemma-3-27b-it:free | gemma | text, image | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Mar 12, 2025 | |
OpenRouter openrouter | Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | llama | text, image | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | No | No | Sep 25, 2024 | |
OpenRouter openrouter | Llama 3.2 3B Instruct (free) meta-llama/llama-3.2-3b-instruct:free | llama | text, image | text | 131,072 | 131,072 | $0 / 1M | $0 / 1M | No | No | Sep 25, 2024 | |
OpenRouter openrouter | Mistral Medium 3 mistralai/mistral-medium-3 | mistral-medium | text, image | text | 131,072 | 131,072 | $0.4 / 1M | $2 / 1M | No | Yes | May 7, 2025 | |
OVHcloud AI Endpoints ovhcloud | Mistral-Small-3.2-24B-Instruct-2506 mistral-small-3.2-24b-instruct-2506 | Unknown | text, image | text | 131,072 | 131,072 | $0.1 / 1M | $0.31 / 1M | No | Yes | Jul 16, 2025 | |
Vercel AI Gateway vercel | Nvidia Nemotron Nano 12B V2 VL nvidia/nemotron-nano-12b-v2-vl | nemotron | text, image | text | 131,072 | 131,072 | $0.2 / 1M | $0.6 / 1M | Yes | Yes | Dec 1, 2024 | |
Vercel AI Gateway vercel | Qwen3 VL Instruct alibaba/qwen3-vl-instruct | qwen | text, image | text | 131,072 | 129,024 | $0.7 / 1M | $2.8 / 1M | No | Yes | Sep 24, 2025 | |
Vercel AI Gateway vercel | Qwen3 VL Thinking alibaba/qwen3-vl-thinking | qwen | text, image | text | 131,072 | 129,024 | $0.7 / 1M | $8.4 / 1M | Yes | Yes | Sep 24, 2025 | |
Poe poe | glm-4.6v novita/glm-4.6v | Unknown | text, image | text | 131,000 | 32,768 | Unknown | Unknown | Yes | Yes | Dec 9, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-VL-32B-Instruct Qwen/Qwen2.5-VL-32B-Instruct | qwen | text, image | text | 131,000 | 131,000 | $0.27 / 1M | $0.27 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen2.5-VL-72B-Instruct Qwen/Qwen2.5-VL-72B-Instruct | qwen | text, image | text | 131,000 | 4,000 | $0.59 / 1M | $0.59 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | zai-org/GLM-4.6V zai-org/GLM-4.6V | glm | text, image | text | 131,000 | 131,000 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Dec 7, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-VL-32B-Instruct Qwen/Qwen2.5-VL-32B-Instruct | qwen | text, image | text | 131,000 | 131,000 | $0.27 / 1M | $0.27 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-VL-72B-Instruct Qwen/Qwen2.5-VL-72B-Instruct | qwen | text, image | text | 131,000 | 4,000 | $0.59 / 1M | $0.59 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-4.6V zai-org/GLM-4.6V | glm | text, image | text | 131,000 | 131,000 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Dec 7, 2025 | |
302.AI 302ai | chatgpt-4o-latest chatgpt-4o-latest | gpt | text, image | text | 128,000 | 16,384 | $5 / 1M | $15 / 1M | No | No | Aug 8, 2024 | |
302.AI 302ai | GLM-4.6V glm-4.6v | Unknown | text, image | text | 128,000 | 32,768 | $0.145 / 1M | $0.43 / 1M | No | Yes | Dec 8, 2025 | |
302.AI 302ai | gpt-4o gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | May 13, 2024 | |
302.AI 302ai | gpt-5.1-chat-latest gpt-5.1-chat-latest | Unknown | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | No | Yes | Nov 14, 2025 | |
302.AI 302ai | gpt-5.2-chat-latest gpt-5.2-chat-latest | Unknown | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | No | Yes | Dec 12, 2025 | |
302.AI 302ai | ministral-14b-2512 ministral-14b-2512 | Unknown | text, image | text | 128,000 | 128,000 | $0.33 / 1M | $0.33 / 1M | No | Yes | Dec 16, 2025 | |
302.AI 302ai | mistral-large-2512 mistral-large-2512 | Unknown | text, image | text | 128,000 | 262,144 | $1.1 / 1M | $3.3 / 1M | No | Yes | Dec 16, 2025 | |
Abacus abacus | GPT-4o Mini gpt-4o-mini | gpt | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Abacus abacus | GPT-OSS 120B openai/gpt-oss-120b | gpt-oss | text, image | text | 128,000 | 32,768 | $0.08 / 1M | $0.44 / 1M | Yes | Yes | Aug 5, 2025 | |
Abacus abacus | Route LLM route-llm | gpt | text, image | text | 128,000 | 16,384 | $3 / 1M | $15 / 1M | No | Yes | Jan 1, 2024 | |
Amazon Bedrock amazon-bedrock | Gemma 3 4B IT google.gemma-3-4b-it | gemma | text, image | text | 128,000 | 4,096 | $0.04 / 1M | $0.08 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.2 11B Instruct meta.llama3-2-11b-instruct-v1:0 | llama | text, image | text | 128,000 | 4,096 | $0.16 / 1M | $0.16 / 1M | No | Yes | Sep 25, 2024 | |
Amazon Bedrock amazon-bedrock | Llama 3.2 90B Instruct meta.llama3-2-90b-instruct-v1:0 | llama | text, image | text | 128,000 | 4,096 | $0.72 / 1M | $0.72 / 1M | No | Yes | Sep 25, 2024 | |
Amazon Bedrock amazon-bedrock | Magistral Small 1.2 mistral.magistral-small-2509 | magistral | text, image | text | 128,000 | 40,000 | $0.5 / 1M | $1.5 / 1M | Yes | Yes | Dec 2, 2025 | |
Amazon Bedrock amazon-bedrock | NVIDIA Nemotron Nano 12B v2 VL BF16 nvidia.nemotron-nano-12b-v2 | nemotron | text, image | text | 128,000 | 4,096 | $0.2 / 1M | $0.6 / 1M | No | Yes | Dec 1, 2024 | |
Amazon Bedrock amazon-bedrock | Pixtral Large (25.02) mistral.pixtral-large-2502-v1:0 | mistral | text, image | text | 128,000 | 8,192 | $2 / 1M | $6 / 1M | No | Yes | Apr 8, 2025 | |
Azure Cognitive Services azure-cognitive-services | Embed v4 cohere-embed-v-4-0 | cohere-embed | text, image | text | 128,000 | 1,536 | $0.12 / 1M | $0 / 1M | No | No | Apr 15, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-4 Turbo gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-4 Turbo Vision gpt-4-turbo-vision | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-4o gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-4o mini gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Azure Cognitive Services azure-cognitive-services | GPT-5 Chat gpt-5-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.2 Chat gpt-5.2-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Azure Cognitive Services azure-cognitive-services | Llama 4 Maverick 17B 128E Instruct FP8 llama-4-maverick-17b-128e-instruct-fp8 | llama | text, image | text | 128,000 | 8,192 | $0.25 / 1M | $1 / 1M | No | Yes | Apr 5, 2025 | |
Azure Cognitive Services azure-cognitive-services | Llama 4 Scout 17B 16E Instruct llama-4-scout-17b-16e-instruct | llama | text, image | text | 128,000 | 8,192 | $0.2 / 1M | $0.78 / 1M | No | Yes | Apr 5, 2025 | |
Azure Cognitive Services azure-cognitive-services | Llama-3.2-11B-Vision-Instruct llama-3.2-11b-vision-instruct | llama | text, image | text | 128,000 | 8,192 | $0.37 / 1M | $0.37 / 1M | No | Yes | Sep 25, 2024 | |
Azure Cognitive Services azure-cognitive-services | Llama-3.2-90B-Vision-Instruct llama-3.2-90b-vision-instruct | llama | text, image | text | 128,000 | 8,192 | $2.04 / 1M | $2.04 / 1M | No | Yes | Sep 25, 2024 | |
Azure Cognitive Services azure-cognitive-services | Mistral Medium 3 mistral-medium-2505 | mistral-medium | text, image | text | 128,000 | 128,000 | $0.4 / 1M | $2 / 1M | No | Yes | May 7, 2025 | |
Azure Cognitive Services azure-cognitive-services | Mistral Small 3.1 mistral-small-2503 | mistral-small | text, image | text | 128,000 | 32,768 | $0.1 / 1M | $0.3 / 1M | No | Yes | Mar 1, 2025 | |
Azure Cognitive Services azure-cognitive-services | Model Router model-router | model-router | text, image | text | 128,000 | 16,384 | $0.14 / 1M | $0 / 1M | No | Yes | Nov 18, 2025 | |
Azure azure | Embed v4 cohere-embed-v-4-0 | cohere-embed | text, image | text | 128,000 | 1,536 | $0.12 / 1M | $0 / 1M | No | No | Apr 15, 2025 | |
Azure azure | GPT-4 Turbo gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Azure azure | GPT-4 Turbo Vision gpt-4-turbo-vision | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Azure azure | GPT-4o gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
Azure azure | GPT-4o mini gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Azure azure | GPT-5 Chat gpt-5-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | No | Aug 7, 2025 | |
Azure azure | GPT-5.2 Chat gpt-5.2-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Azure azure | GPT-5.3 Chat gpt-5.3-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Mar 3, 2026 | |
Azure azure | Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning | grok | text, image | text | 128,000 | 8,192 | $0.2 / 1M | $0.5 / 1M | No | Yes | Jun 27, 2025 | |
Azure azure | Grok 4.1 Fast (Reasoning) grok-4-1-fast-reasoning | grok | text, image | text | 128,000 | 8,192 | $0.2 / 1M | $0.5 / 1M | Yes | Yes | Jun 27, 2025 | |
Azure azure | Llama 4 Maverick 17B 128E Instruct FP8 llama-4-maverick-17b-128e-instruct-fp8 | llama | text, image | text | 128,000 | 8,192 | $0.25 / 1M | $1 / 1M | No | Yes | Apr 5, 2025 | |
Azure azure | Llama 4 Scout 17B 16E Instruct llama-4-scout-17b-16e-instruct | llama | text, image | text | 128,000 | 8,192 | $0.2 / 1M | $0.78 / 1M | No | Yes | Apr 5, 2025 | |
Azure azure | Llama-3.2-11B-Vision-Instruct llama-3.2-11b-vision-instruct | llama | text, image | text | 128,000 | 8,192 | $0.37 / 1M | $0.37 / 1M | No | Yes | Sep 25, 2024 | |
Azure azure | Llama-3.2-90B-Vision-Instruct llama-3.2-90b-vision-instruct | llama | text, image | text | 128,000 | 8,192 | $2.04 / 1M | $2.04 / 1M | No | Yes | Sep 25, 2024 | |
Azure azure | Mistral Medium 3 mistral-medium-2505 | mistral-medium | text, image | text | 128,000 | 128,000 | $0.4 / 1M | $2 / 1M | No | Yes | May 7, 2025 | |
Azure azure | Mistral Small 3.1 mistral-small-2503 | mistral-small | text, image | text | 128,000 | 32,768 | $0.1 / 1M | $0.3 / 1M | No | Yes | Mar 1, 2025 | |
Azure azure | Model Router model-router | model-router | text, image | text | 128,000 | 16,384 | $0.14 / 1M | $0 / 1M | No | Yes | Nov 18, 2025 | |
Chutes chutes | gemma 3 27b it unsloth/gemma-3-27b-it | unsloth | text, image | text | 128,000 | 65,536 | $0.04 / 1M | $0.15 / 1M | No | Yes | Jan 10, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-4 Turbo openai/gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-4o openai/gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-4o mini openai/gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Cloudflare Workers AI cloudflare-workers-ai | Llama 4 Scout 17B 16E Instruct @cf/meta/llama-4-scout-17b-16e-instruct | llama | text, image | text | 128,000 | 16,384 | $0.27 / 1M | $0.85 / 1M | No | Yes | Apr 16, 2025 | |
Cohere cohere | Command A Vision command-a-vision-07-2025 | command-a | text, image | text | 128,000 | 8,000 | $2.5 / 1M | $10 / 1M | No | No | Jul 31, 2025 | |
Fireworks AI fireworks-ai | Qwen 3.6 Plus accounts/fireworks/models/qwen3p6-plus | qwen | text, image | text | 128,000 | 8,192 | $0.5 / 1M | $3 / 1M | Yes | Yes | Apr 4, 2026 | |
Firmware firmware | GPT-4o gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
GitHub Copilot github-copilot | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 19, 2026 | |
GitHub Copilot github-copilot | GPT-4.1 gpt-4.1 | gpt | text, image | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Apr 14, 2025 | |
GitHub Copilot github-copilot | GPT-4o gpt-4o | gpt | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 13, 2024 | |
GitHub Copilot github-copilot | GPT-5 gpt-5 | gpt | text, image | text | 128,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 7, 2025 | |
GitHub Models github-models | GPT-4.1 openai/gpt-4.1 | gpt | text, image | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Apr 14, 2025 | |
GitHub Models github-models | GPT-4.1-mini openai/gpt-4.1-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Apr 14, 2025 | |
GitHub Models github-models | GPT-4.1-nano openai/gpt-4.1-nano | gpt-nano | text, image | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Apr 14, 2025 | |
GitHub Models github-models | Llama 4 Maverick 17B 128E Instruct FP8 meta/llama-4-maverick-17b-128e-instruct-fp8 | llama | text, image | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 31, 2025 | |
GitHub Models github-models | Llama 4 Scout 17B 16E Instruct meta/llama-4-scout-17b-16e-instruct | llama | text, image | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 31, 2025 | |
GitHub Models github-models | Mistral Medium 3 (25.05) mistral-ai/mistral-medium-2505 | mistral-medium | text, image | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | May 1, 2025 | |
GitHub Models github-models | Mistral Small 3.1 mistral-ai/mistral-small-2503 | mistral-small | text, image | text | 128,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 1, 2025 | |
GitHub Models github-models | Phi-3.5-vision instruct (128k) microsoft/phi-3.5-vision-instruct | phi | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Aug 20, 2024 | |
Helicone helicone | Mistral Nemo mistral-nemo | mistral-nemo | text, image | text | 128,000 | 16,400 | $20 / 1M | $40 / 1M | No | No | Jul 18, 2024 | |
Helicone helicone | Mistral Small mistral-small | mistral-small | text, image | text | 128,000 | 128,000 | $75 / 1M | $200 / 1M | No | No | Feb 26, 2024 | |
Helicone helicone | OpenAI ChatGPT-4o chatgpt-4o-latest | gpt | text, image | text | 128,000 | 16,384 | $5 / 1M | $20 / 1M | No | Yes | Aug 14, 2024 | |
Helicone helicone | OpenAI GPT-4o gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | May 13, 2024 | |
Helicone helicone | OpenAI GPT-4o-mini gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Helicone helicone | OpenAI GPT-5 Chat Latest gpt-5-chat-latest | gpt-codex | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | No | Yes | Sep 30, 2024 | |
IO.NET io-net | Mistral Large Instruct 2411 mistralai/Mistral-Large-Instruct-2411 | mistral-large | text, image | text | 128,000 | 4,096 | $2 / 1M | $6 / 1M | No | Yes | Nov 1, 2024 | |
Kilo Gateway kilo | OpenAI: GPT-4 Turbo openai/gpt-4-turbo | Unknown | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Kilo Gateway kilo | Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro | Unknown | text, image | text | 128,000 | 25,600 | $2 / 1M | $8 / 1M | Yes | No | Sep 1, 2025 | |
Llama llama | Llama-4-Maverick-17B-128E-Instruct-FP8 llama-4-maverick-17b-128e-instruct-fp8 | llama | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
Llama llama | Llama-4-Scout-17B-16E-Instruct-FP8 llama-4-scout-17b-16e-instruct-fp8 | llama | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
LLM Gateway llmgateway | Auto Route auto | auto | text, image | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Jan 1, 2024 | |
LLM Gateway llmgateway | Custom Model custom | auto | text, image | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Jan 1, 2024 | |
LLM Gateway llmgateway | DeepSeek V3.1 deepseek-v3.1 | deepseek | text, image | text | 128,000 | 32,768 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Aug 21, 2025 | |
LLM Gateway llmgateway | Gemma 3 27B gemma-3-27b | gemma | text, image | text | 128,000 | 16,384 | $0.27 / 1M | $0.27 / 1M | No | No | Mar 12, 2025 | |
LLM Gateway llmgateway | GLM-4.6V Flash glm-4.6v-flash | glm | text, image | text | 128,000 | 16,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 8, 2025 | |
LLM Gateway llmgateway | GLM-4.6V FlashX glm-4.6v-flashx | glm | text, image | text | 128,000 | 16,000 | $0.04 / 1M | $0.4 / 1M | Yes | Yes | Dec 8, 2025 | |
LLM Gateway llmgateway | GPT-4 Turbo gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
LLM Gateway llmgateway | GPT-4o Mini Search Preview gpt-4o-mini-search-preview | gpt | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | No | Oct 1, 2024 | |
LLM Gateway llmgateway | GPT-4o Search Preview gpt-4o-search-preview | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | No | Oct 1, 2024 | |
LLM Gateway llmgateway | GPT-5.2 Chat gpt-5.2-chat-latest | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
LLM Gateway llmgateway | GPT-5.3 Chat (latest) gpt-5.3-chat-latest | gpt | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | No | Yes | Mar 3, 2026 | |
LLM Gateway llmgateway | Mistral Small 3.2 mistral-small-2506 | mistral-small | text, image | text | 128,000 | 16,384 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jun 20, 2025 | |
LLM Gateway llmgateway | Pixtral Large (latest) pixtral-large-latest | pixtral | text, image | text | 128,000 | 128,000 | $2 / 1M | $6 / 1M | No | Yes | Nov 4, 2024 | |
LLM Gateway llmgateway | Sonar Reasoning Pro sonar-reasoning-pro | sonar-reasoning | text, image | text | 128,000 | 4,096 | $2 / 1M | $8 / 1M | Yes | No | Sep 1, 2025 | |
Mistral mistral | Mistral Medium (latest) mistral-medium-latest | mistral-medium | text, image | text | 128,000 | 16,384 | $0.4 / 1M | $2 / 1M | No | Yes | May 10, 2025 | |
Mistral mistral | Mistral Small 3.2 mistral-small-2506 | mistral-small | text, image | text | 128,000 | 16,384 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jun 20, 2025 | |
Mistral mistral | Pixtral 12B pixtral-12b | pixtral | text, image | text | 128,000 | 128,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Sep 1, 2024 | |
Mistral mistral | Pixtral Large (latest) pixtral-large-latest | pixtral | text, image | text | 128,000 | 128,000 | $2 / 1M | $6 / 1M | No | Yes | Nov 4, 2024 | |
NanoGPT nano-gpt | Azure gpt-4o azure-gpt-4o | Unknown | text, image | text | 128,000 | 16,384 | $2.499 / 1M | $9.996 / 1M | No | Yes | May 13, 2024 | |
NanoGPT nano-gpt | Azure gpt-4o-mini azure-gpt-4o-mini | Unknown | text, image | text | 128,000 | 16,384 | $0.1496 / 1M | $0.595 / 1M | No | Yes | Jul 18, 2024 | |
NanoGPT nano-gpt | ChatGPT 4o openai/chatgpt-4o-latest | gpt | text, image | text | 128,000 | 16,384 | $4.998 / 1M | $14.994 / 1M | No | Yes | May 13, 2024 | |
NanoGPT nano-gpt | Doubao 1.5 Thinking Pro Vision doubao-1-5-thinking-pro-vision-250415 | Unknown | text, image | text | 128,000 | 16,384 | $0.6 / 1M | $2.4 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Doubao 1.5 Thinking Vision Pro doubao-1-5-thinking-vision-pro-250428 | Unknown | text, image | text | 128,000 | 16,384 | $0.55 / 1M | $1.43 / 1M | No | No | May 15, 2025 | |
NanoGPT nano-gpt | Ernie 4.5 Turbo 128k ernie-4.5-turbo-128k | Unknown | text, image | text | 128,000 | 16,384 | $0.132 / 1M | $0.55 / 1M | No | No | May 8, 2025 | |
NanoGPT nano-gpt | Ernie 5.0 Thinking ernie-5.0-thinking-latest | Unknown | text, image | text | 128,000 | 16,384 | $1.1 / 1M | $2 / 1M | Yes | No | Nov 18, 2025 | |
NanoGPT nano-gpt | Ernie 5.0 Thinking Preview ernie-5.0-thinking-preview | Unknown | text, image | text | 128,000 | 16,384 | $1.1 / 1M | $2 / 1M | Yes | No | Nov 18, 2025 | |
NanoGPT nano-gpt | GPT-4 Turbo openai/gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | GPT-4o openai/gpt-4o | gpt | text, image | text | 128,000 | 16,384 | $2.499 / 1M | $9.996 / 1M | No | No | May 13, 2024 | |
NanoGPT nano-gpt | GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | gpt | text, image | text | 128,000 | 16,384 | $2.499 / 1M | $9.996 / 1M | No | No | Aug 6, 2024 | |
NanoGPT nano-gpt | GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | No | Nov 20, 2024 | |
NanoGPT nano-gpt | GPT-4o mini openai/gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.1496 / 1M | $0.595 / 1M | No | No | Jul 18, 2024 | |
NanoGPT nano-gpt | GPT-4o Search Preview openai/gpt-4o-search-preview | gpt | text, image | text | 128,000 | 16,384 | $1.47 / 1M | $5.88 / 1M | No | No | May 13, 2024 | |
NanoGPT nano-gpt | Mistral Small 31 24b Instruct mistral-small-31-24b-instruct | Unknown | text, image | text | 128,000 | 131,072 | $0.1 / 1M | $0.3 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Qwen: QvQ Max qvq-max | Unknown | text, image | text | 128,000 | 8,192 | $1.4 / 1M | $5.3 / 1M | No | No | Mar 28, 2025 | |
Nebius Token Factory nebius | Qwen2.5-VL-72B-Instruct Qwen/Qwen2.5-VL-72B-Instruct | Unknown | text, image | text | 128,000 | 8,192 | $0.25 / 1M | $0.75 / 1M | No | Yes | Feb 4, 2026 | |
Nvidia nvidia | Gemma 3 1b It google/gemma-3-1b-it | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Mar 10, 2025 | |
Nvidia nvidia | Gemma 3n E2b It google/gemma-3n-e2b-it | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jun 12, 2025 | |
Nvidia nvidia | Gemma 3n E4b It google/gemma-3n-e4b-it | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Jun 3, 2025 | |
Nvidia nvidia | Llama 3.2 11b Vision Instruct meta/llama-3.2-11b-vision-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Sep 18, 2024 | |
Nvidia nvidia | Llama 4 Maverick 17b 128e Instruct meta/llama-4-maverick-17b-128e-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 1, 2025 | |
Nvidia nvidia | Llama 4 Scout 17b 16e Instruct meta/llama-4-scout-17b-16e-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 2, 2025 | |
Nvidia nvidia | Phi 3 Medium 128k Instruct microsoft/phi-3-medium-128k-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 7, 2024 | |
Nvidia nvidia | Phi 3 Small 128k Instruct microsoft/phi-3-small-128k-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 7, 2024 | |
Nvidia nvidia | Phi 3 Vision 128k Instruct microsoft/phi-3-vision-128k-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 19, 2024 | |
Nvidia nvidia | Phi 3.5 Vision Instruct microsoft/phi-3.5-vision-instruct | Unknown | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Aug 16, 2024 | |
OpenAI openai | GPT-4 Turbo gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
OpenAI openai | GPT-4o (2024-05-13) gpt-4o-2024-05-13 | gpt | text, image | text | 128,000 | 4,096 | $5 / 1M | $15 / 1M | No | Yes | May 13, 2024 | |
OpenAI openai | GPT-4o (2024-08-06) gpt-4o-2024-08-06 | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
OpenAI openai | GPT-4o (2024-11-20) gpt-4o-2024-11-20 | gpt | text, image | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Nov 20, 2024 | |
OpenAI openai | GPT-5.1 Chat gpt-5.1-chat-latest | gpt-codex | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenAI openai | GPT-5.2 Chat gpt-5.2-chat-latest | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenAI openai | GPT-5.3 Chat (latest) gpt-5.3-chat-latest | gpt | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | No | Yes | Mar 3, 2026 | |
OpenRouter openrouter | GPT-4o-mini openai/gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
OpenRouter openrouter | GPT-5.1 Chat openai/gpt-5.1-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
OpenRouter openrouter | GPT-5.2 Chat openai/gpt-5.2-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenRouter openrouter | Mistral Small 3.1 24B Instruct mistralai/mistral-small-3.1-24b-instruct | mistral-small | text, image | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Mar 17, 2025 | |
OpenRouter openrouter | Nemotron Nano 12B 2 VL (free) nvidia/nemotron-nano-12b-v2-vl:free | nemotron | text, image | text | 128,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 31, 2026 | |
Perplexity perplexity | Sonar Reasoning Pro sonar-reasoning-pro | sonar-reasoning | text, image | text | 128,000 | 4,096 | $2 / 1M | $8 / 1M | Yes | No | Sep 1, 2025 | |
Poe poe | ChatGPT-4o-Latest openai/chatgpt-4o-latest | gpt | text, image | text | 128,000 | 8,192 | $4.5 / 1M | $14 / 1M | No | Yes | Aug 14, 2024 | |
Poe poe | GPT-4-Turbo openai/gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $9 / 1M | $27 / 1M | No | Yes | Sep 13, 2023 | |
Poe poe | GPT-4o openai/gpt-4o | gpt | text, image | text | 128,000 | 8,192 | Unknown | Unknown | No | Yes | May 13, 2024 | |
Poe poe | GPT-4o-Aug openai/gpt-4o-aug | gpt | text, image | text | 128,000 | 8,192 | $2.2 / 1M | $9 / 1M | No | Yes | Nov 21, 2024 | |
Poe poe | GPT-5-Chat openai/gpt-5-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.1 / 1M | $9 / 1M | No | Yes | Aug 7, 2025 | |
Poe poe | GPT-5.1-Instant openai/gpt-5.1-instant | gpt | text, image | text | 128,000 | 16,384 | $1.1 / 1M | $9 / 1M | Yes | Yes | Nov 12, 2025 | |
Poe poe | GPT-5.2-Instant openai/gpt-5.2-instant | Unknown | text, image | text | 128,000 | 16,384 | $1.6 / 1M | $13 / 1M | No | Yes | Dec 11, 2025 | |
Poe poe | GPT-5.3-Instant openai/gpt-5.3-instant | Unknown | text, image | text | 128,000 | 16,384 | $1.6 / 1M | $13 / 1M | No | Yes | Mar 3, 2026 | |
Poe poe | Grok-4.20-Multi-Agent xai/grok-4.20-multi-agent | Unknown | text, image | text | 128,000 | Unknown | $2 / 1M | $6 / 1M | No | Yes | Mar 13, 2026 | |
Privatemode AI privatemode-ai | Gemma 3 27B gemma-3-27b | gemma | text, image | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Mar 12, 2025 | |
Requesty requesty | GPT-4o Mini openai/gpt-4o-mini | gpt-mini | text, image | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Requesty requesty | GPT-5 Mini openai/gpt-5-mini | gpt-mini | text, image | text | 128,000 | 32,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Aug 7, 2025 | |
Requesty requesty | GPT-5.1 Chat openai/gpt-5.1-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
Requesty requesty | GPT-5.2 Chat openai/gpt-5.2-chat | gpt-codex | text, image | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Scaleway scaleway | Mistral Small 3.2 24B Instruct (2506) mistral-small-3.2-24b-instruct-2506 | mistral-small | text, image | text | 128,000 | 32,768 | $0.15 / 1M | $0.35 / 1M | No | Yes | Mar 17, 2026 | |
Scaleway scaleway | Pixtral 12B 2409 pixtral-12b-2409 | pixtral | text, image | text | 128,000 | 4,096 | $0.2 / 1M | $0.2 / 1M | No | Yes | Mar 17, 2026 | |
v0 v0 | v0-1.0-md v0-1.0-md | v0 | text, image | text | 128,000 | 32,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
v0 v0 | v0-1.5-md v0-1.5-md | v0 | text, image | text | 128,000 | 32,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jun 9, 2025 | |
Venice AI venice | GPT-4o openai-gpt-4o-2024-11-20 | gpt | text, image | text | 128,000 | 16,384 | $3.125 / 1M | $12.5 / 1M | No | Yes | Mar 6, 2026 | |
Venice AI venice | GPT-4o Mini openai-gpt-4o-mini-2024-07-18 | gpt-mini | text, image | text | 128,000 | 16,384 | $0.1875 / 1M | $0.75 / 1M | No | Yes | Mar 6, 2026 | |
Venice AI venice | Venice Role Play Uncensored venice-uncensored-role-play | venice | text, image | text | 128,000 | 4,096 | $0.5 / 1M | $2 / 1M | No | Yes | Mar 16, 2026 | |
Venice AI venice | Venice Uncensored 1.2 venice-uncensored-1-2 | venice | text, image | text | 128,000 | 8,192 | $0.2 / 1M | $0.9 / 1M | No | Yes | Apr 19, 2026 | |
Vercel AI Gateway vercel | GPT-4 Turbo openai/gpt-4-turbo | gpt | text, image | text | 128,000 | 4,096 | $10 / 1M | $30 / 1M | No | Yes | Apr 9, 2024 | |
Vercel AI Gateway vercel | Llama 3.2 11B Vision Instruct meta/llama-3.2-11b | llama | text, image | text | 128,000 | 8,192 | $0.16 / 1M | $0.16 / 1M | No | Yes | Sep 25, 2024 | |
Vercel AI Gateway vercel | Llama 3.2 90B Vision Instruct meta/llama-3.2-90b | llama | text, image | text | 128,000 | 8,192 | $0.72 / 1M | $0.72 / 1M | No | Yes | Sep 25, 2024 | |
Vercel AI Gateway vercel | Llama-4-Maverick-17B-128E-Instruct-FP8 meta/llama-4-maverick | llama | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
Vercel AI Gateway vercel | Llama-4-Scout-17B-16E-Instruct-FP8 meta/llama-4-scout | llama | text, image | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | Apr 5, 2025 | |
Vercel AI Gateway vercel | Mistral Medium 3.1 mistral/mistral-medium | mistral-medium | text, image | text | 128,000 | 64,000 | $0.4 / 1M | $2 / 1M | No | Yes | May 7, 2025 | |
Vercel AI Gateway vercel | Pixtral 12B mistral/pixtral-12b | pixtral | text, image | text | 128,000 | 128,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Sep 1, 2024 | |
Vercel AI Gateway vercel | Pixtral Large (latest) mistral/pixtral-large | pixtral | text, image | text | 128,000 | 128,000 | $2 / 1M | $6 / 1M | No | Yes | Nov 4, 2024 | |
Vercel AI Gateway vercel | v0-1.0-md vercel/v0-1.0-md | v0 | text, image | text | 128,000 | 32,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Vercel AI Gateway vercel | v0-1.5-md vercel/v0-1.5-md | v0 | text, image | text | 128,000 | 32,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jun 9, 2025 | |
Kilo Gateway kilo | Perplexity: Sonar perplexity/sonar | Unknown | text, image | text | 127,072 | 25,415 | $1 / 1M | $1 / 1M | No | No | Sep 1, 2025 | |
Vercel AI Gateway vercel | Sonar perplexity/sonar | sonar | text, image | text | 127,000 | 8,000 | $1 / 1M | $1 / 1M | No | Yes | Feb 19, 2025 | |
Inference inference | Google Gemma 3 google/gemma-3 | gemma | text, image | text | 125,000 | 4,096 | $0.15 / 1M | $0.3 / 1M | No | Yes | Jan 1, 2025 | |
Inference inference | Qwen 2.5 7B Vision Instruct qwen/qwen-2.5-7b-vision-instruct | qwen | text, image | text | 125,000 | 4,096 | $0.2 / 1M | $0.2 / 1M | No | Yes | Jan 1, 2025 | |
Poe poe | GPT-4o-mini openai/gpt-4o-mini | gpt-mini | text, image | text | 124,096 | 4,096 | $0.14 / 1M | $0.54 / 1M | No | Yes | Jul 18, 2024 | |
Jiekou.AI jiekou | ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | ernie | text, image | text | 123,000 | 16,000 | $0.42 / 1M | $1.25 / 1M | Yes | Yes | Jan 1, 2026 | |
NovitaAI novita-ai | ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | Unknown | text, image | text | 123,000 | 16,000 | $0.42 / 1M | $1.25 / 1M | Yes | No | Jun 30, 2025 | |
Nebius Token Factory nebius | Gemma-3-27b-it google/gemma-3-27b-it | Unknown | text, image | text | 110,000 | 8,192 | $0.1 / 1M | $0.3 / 1M | No | Yes | Feb 4, 2026 | |
Nebius Token Factory nebius | Gemma-3-27b-it (Fast) google/gemma-3-27b-it-fast | Unknown | text, image | text | 110,000 | 8,192 | $0.2 / 1M | $0.6 / 1M | No | Yes | Feb 4, 2026 | |
NovitaAI novita-ai | Gemma 3 27B google/gemma-3-27b-it | gemma | text, image | text | 98,304 | 16,384 | $0.119 / 1M | $0.2 / 1M | No | No | Mar 25, 2025 | |
Chutes chutes | gemma 3 4b it unsloth/gemma-3-4b-it | unsloth | text, image | text | 96,000 | 96,000 | $0.01 / 1M | $0.03 / 1M | No | No | Jan 10, 2026 | |
OpenRouter openrouter | Gemma 3 27B google/gemma-3-27b-it | gemma | text, image | text | 96,000 | 96,000 | $0.04 / 1M | $0.15 / 1M | No | Yes | Mar 12, 2025 | |
OpenRouter openrouter | Gemma 3 4B google/gemma-3-4b-it | gemma | text, image | text | 96,000 | 96,000 | $0.01703 / 1M | $0.06815 / 1M | No | No | Mar 13, 2025 | |
OpenRouter openrouter | Mistral Small 3.2 24B Instruct mistralai/mistral-small-3.2-24b-instruct | mistral-small | text, image | text | 96,000 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Jun 20, 2025 | |
GitHub Copilot github-copilot | Claude Opus 4.1 claude-opus-41 | claude-opus | text, image | text | 80,000 | 16,000 | $0 / 1M | $0 / 1M | Yes | No | Aug 5, 2025 | |
SiliconFlow (China) siliconflow-cn | zai-org/GLM-4.5V zai-org/GLM-4.5V | glm | text, image | text | 66,000 | 66,000 | $0.14 / 1M | $0.86 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | zai-org/GLM-4.5V zai-org/GLM-4.5V | glm | text, image | text | 66,000 | 66,000 | $0.14 / 1M | $0.86 / 1M | No | Yes | Nov 25, 2025 | |
Vercel AI Gateway vercel | GLM 4.5V zai/glm-4.5v | glm | text, image | text | 66,000 | 66,000 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 | |
Kilo Gateway kilo | Z.ai: GLM 4.5V z-ai/glm-4.5v | Unknown | text, image | text | 65,536 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 | |
NanoGPT nano-gpt | Qwen2.5 VL 72B TEE TEE/qwen2.5-vl-72b-instruct | qwen | text, image | text | 65,536 | 8,192 | $0.7 / 1M | $0.7 / 1M | No | No | Feb 1, 2025 | |
NanoGPT nano-gpt | Step-3 step-3 | Unknown | text, image | text | 65,536 | 8,192 | $0.2499 / 1M | $0.6494 / 1M | No | No | Jul 31, 2025 | |
NovitaAI novita-ai | AutoGLM-Phone-9B-Multilingual zai-org/autoglm-phone-9b-multilingual | Unknown | text, image | text | 65,536 | 65,536 | $0.035 / 1M | $0.138 / 1M | No | No | Dec 10, 2025 | |
302.AI 302ai | GLM-4.5V glm-4.5v | Unknown | text, image | text | 64,000 | 16,384 | $0.29 / 1M | $0.86 / 1M | No | Yes | Aug 12, 2025 | |
NanoGPT nano-gpt | GLM 4.1V Thinking Flash glm-4.1v-thinking-flash | Unknown | text, image | text | 64,000 | 8,192 | $0.3 / 1M | $0.3 / 1M | No | No | Jul 9, 2025 | |
NanoGPT nano-gpt | GLM 4.1V Thinking FlashX glm-4.1v-thinking-flashx | Unknown | text, image | text | 64,000 | 8,192 | $0.3 / 1M | $0.3 / 1M | No | No | Jul 9, 2025 | |
NanoGPT nano-gpt | GLM 4.5V z-ai/glm-4.5v | glmv | text, image | text | 64,000 | 96,000 | $0.6 / 1M | $1.8 / 1M | Yes | No | Nov 22, 2025 | |
NanoGPT nano-gpt | GLM 4.5V Thinking z-ai/glm-4.5v:thinking | glmv | text, image | text | 64,000 | 96,000 | $0.6 / 1M | $1.8 / 1M | Yes | No | Nov 22, 2025 | |
Qiniu qiniu-ai | Stepfun/Step-3.5 Flash stepfun/step-3.5-flash | Unknown | text, image | text | 64,000 | 4,096 | Unknown | Unknown | No | No | Feb 2, 2026 | |
Weights & Biases wandb | Llama 4 Scout 17B 16E Instruct meta-llama/Llama-4-Scout-17B-16E-Instruct | llama | text, image | text | 64,000 | 64,000 | $0.17 / 1M | $0.66 / 1M | Yes | Yes | Mar 12, 2026 | |
Helicone helicone | Qwen3 30B A3B qwen3-30b-a3b | qwen | text, image | text | 41,000 | 41,000 | $0.08 / 1M | $0.29 / 1M | No | Yes | Jun 1, 2025 | |
Scaleway scaleway | Gemma-3-27B-IT gemma-3-27b-it | gemma | text, image | text | 40,000 | 8,192 | $0.25 / 1M | $0.5 / 1M | Yes | Yes | Mar 17, 2026 | |
STACKIT stackit | Gemma 3 27B google/gemma-3-27b-it | gemma | text, image | text | 37,000 | 8,192 | $0.49 / 1M | $0.71 / 1M | No | No | May 17, 2025 | |
NanoGPT nano-gpt | Molmo 2 8B allenai/molmo-2-8b | allenai | text, image | text | 36,864 | 36,864 | $0.2 / 1M | $0.2 / 1M | No | No | Feb 14, 2026 | |
Alibaba (China) alibaba-cn | Qwen-VL OCR qwen-vl-ocr | qwen | text, image | text | 34,096 | 4,096 | $0.717 / 1M | $0.717 / 1M | No | No | Apr 13, 2025 | |
Alibaba alibaba | Qwen-VL OCR qwen-vl-ocr | qwen | text, image | text | 34,096 | 4,096 | $0.72 / 1M | $0.72 / 1M | No | No | Apr 13, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen2.5-VL-7B-Instruct Qwen/Qwen2.5-VL-7B-Instruct | qwen | text, image | text | 33,000 | 4,000 | $0.05 / 1M | $0.05 / 1M | No | Yes | Nov 25, 2025 | |
302.AI 302ai | gemini-2.5-flash-image gemini-2.5-flash-image | Unknown | text, image | text | 32,768 | 32,768 | $0.3 / 1M | $30 / 1M | No | No | Oct 8, 2025 | |
302.AI 302ai | gemini-3-pro-image-preview gemini-3-pro-image-preview | Unknown | text, image | text | 32,768 | 64,000 | $2 / 1M | $120 / 1M | No | No | Nov 20, 2025 | |
Chutes chutes | InternVL3 78B TEE OpenGVLab/InternVL3-78B-TEE | opengvlab | text, image | text | 32,768 | 32,768 | $0.1 / 1M | $0.39 / 1M | No | No | Jan 10, 2026 | |
Chutes chutes | Mistral Small 24B Instruct 2501 unsloth/Mistral-Small-24B-Instruct-2501 | unsloth | text, image | text | 32,768 | 32,768 | $0.03 / 1M | $0.11 / 1M | No | Yes | Jan 10, 2026 | |
Chutes chutes | Qwen2.5 VL 72B Instruct TEE Qwen/Qwen2.5-VL-72B-Instruct-TEE | qwen | text, image | text | 32,768 | 32,768 | $0.15 / 1M | $0.6 / 1M | No | No | Jan 10, 2026 | |
Google google | Gemma 3 12B gemma-3-12b-it | gemma | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 13, 2025 | |
Google google | Gemma 3 4B gemma-3-4b-it | gemma | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 13, 2025 | |
LLM Gateway llmgateway | Gemma 3 12B gemma-3-12b-it | gemma | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 13, 2025 | |
LLM Gateway llmgateway | Gemma 3 4B gemma-3-4b-it | gemma | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 13, 2025 | |
LLM Gateway llmgateway | Qwen Max Latest qwen-max-latest | qwen | text, image | text | 32,768 | 8,192 | $1.6 / 1M | $6.4 / 1M | No | Yes | Jan 25, 2025 | |
Meganova meganova | Mistral Small 3.2 24B Instruct mistralai/Mistral-Small-3.2-24B-Instruct-2506 | mistral-small | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | Yes | Jun 20, 2025 | |
NanoGPT nano-gpt | ERNIE 4.5 VL 28B baidu/ernie-4.5-vl-28b-a3b | ernie | text, image | text | 32,768 | 16,384 | $0.14 / 1M | $0.56 / 1M | No | No | Jun 30, 2025 | |
NanoGPT nano-gpt | Qwen3 VL 235B A22B Instruct Original qwen3-vl-235b-a22b-instruct-original | Unknown | text, image | text | 32,768 | 32,768 | $0.5 / 1M | $1.2 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | Qwen3 VL 235B A22B Thinking qwen3-vl-235b-a22b-thinking | Unknown | text, image | text | 32,768 | 32,768 | $0.5 / 1M | $6 / 1M | Yes | No | Aug 26, 2025 | |
OpenRouter openrouter | Gemma 3 12B (free) google/gemma-3-12b-it:free | gemma | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 13, 2025 | |
OpenRouter openrouter | Gemma 3 4B (free) google/gemma-3-4b-it:free | gemma | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Mar 13, 2025 | |
OpenRouter openrouter | Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | qwen | text, image | text | 32,768 | 8,192 | $0 / 1M | $0 / 1M | No | No | Feb 1, 2025 | |
OVHcloud AI Endpoints ovhcloud | Qwen2.5-VL-72B-Instruct qwen2.5-vl-72b-instruct | Unknown | text, image | text | 32,768 | 32,768 | $1.01 / 1M | $1.01 / 1M | No | No | Mar 31, 2025 | |
evroc evroc | Phi-4 15B microsoft/Phi-4-multimodal-instruct | phi | text, image | text | 32,000 | 32,000 | $0.24 / 1M | $0.47 / 1M | No | No | Jan 1, 2025 | |
IO.NET io-net | Qwen 2.5 VL 32B Instruct Qwen/Qwen2.5-VL-32B-Instruct | qwen | text, image | text | 32,000 | 4,096 | $0.05 / 1M | $0.22 / 1M | No | Yes | Nov 1, 2024 | |
NanoGPT nano-gpt | Doubao 1.5 Vision Pro 32k doubao-1.5-vision-pro-32k | Unknown | text, image | text | 32,000 | 8,192 | $0.459 / 1M | $1.377 / 1M | No | No | Jan 22, 2025 | |
NanoGPT nano-gpt | Ernie 4.5 Turbo VL 32k ernie-4.5-turbo-vl-32k | Unknown | text, image | text | 32,000 | 16,384 | $0.495 / 1M | $1.43 / 1M | No | No | May 8, 2025 | |
NanoGPT nano-gpt | Ernie X1 32k ernie-x1-32k | Unknown | text, image | text | 32,000 | 16,384 | $0.33 / 1M | $1.32 / 1M | No | No | May 8, 2025 | |
NanoGPT nano-gpt | Qwen25 VL 72b qwen25-vl-72b-instruct | Unknown | text, image | text | 32,000 | 32,768 | $0.69989 / 1M | $0.69989 / 1M | No | No | May 10, 2025 | |
STACKIT stackit | Qwen3-VL Embedding 8B Qwen/Qwen3-VL-Embedding-8B | qwen | text, image | text | 32,000 | 4,096 | $0.09 / 1M | $0.09 / 1M | No | No | Feb 5, 2026 | |
Kilo Gateway kilo | Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | Unknown | text, image | text | 30,000 | 8,000 | $0.14 / 1M | $0.56 / 1M | Yes | Yes | Jun 30, 2025 | |
NovitaAI novita-ai | ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | Unknown | text, image | text | 30,000 | 8,000 | $1.4 / 1M | $5.6 / 1M | Yes | Yes | Jun 30, 2025 | |
Jiekou.AI jiekou | claude-haiku-4-5-20251001 claude-haiku-4-5-20251001 | claude-haiku | text, image | text | 20,000 | 64,000 | $0.9 / 1M | $4.5 / 1M | No | Yes | Jan 1, 2026 | |
Chutes chutes | Qwen2.5 VL 32B Instruct Qwen/Qwen2.5-VL-32B-Instruct | qwen | text, image | text | 16,384 | 16,384 | $0.05 / 1M | $0.22 / 1M | No | No | Jan 10, 2026 | |
Meganova meganova | Qwen2.5 VL 32B Instruct Qwen/Qwen2.5-VL-32B-Instruct | qwen | text, image | text | 16,384 | 16,384 | $0.2 / 1M | $0.6 / 1M | No | Yes | Mar 24, 2025 | |
NovitaAI novita-ai | PaddleOCR-VL paddlepaddle/paddleocr-vl | Unknown | text, image | text | 16,384 | 16,384 | $0.02 / 1M | $0.02 / 1M | No | No | Oct 22, 2025 | |
Poe poe | GPT-3.5-Turbo openai/gpt-3.5-turbo | gpt | text, image | text | 16,384 | 2,048 | $0.45 / 1M | $1.4 / 1M | No | Yes | Sep 13, 2023 | |
SiliconFlow (China) siliconflow-cn | PaddlePaddle/PaddleOCR-VL PaddlePaddle/PaddleOCR-VL | Unknown | text, image | text | 16,384 | 16,384 | $0 / 1M | $0 / 1M | No | No | Oct 16, 2025 | |
SiliconFlow (China) siliconflow-cn | PaddlePaddle/PaddleOCR-VL-1.5 PaddlePaddle/PaddleOCR-VL-1.5 | Unknown | text, image | text | 16,384 | 16,384 | $0 / 1M | $0 / 1M | No | No | Jan 29, 2026 | |
Cohere cohere | Aya Vision 32B c4ai-aya-vision-32b | Unknown | text, image | text | 16,000 | 4,000 | Unknown | Unknown | No | No | May 14, 2025 | |
Cohere cohere | Aya Vision 8B c4ai-aya-vision-8b | Unknown | text, image | text | 16,000 | 4,000 | Unknown | Unknown | No | No | May 14, 2025 | |
Inference inference | Llama 3.2 11B Vision Instruct meta/llama-3.2-11b-vision-instruct | llama | text, image | text | 16,000 | 4,096 | $0.055 / 1M | $0.055 / 1M | No | Yes | Jan 1, 2025 | |
IO.NET io-net | Llama 3.2 90B Vision Instruct meta-llama/Llama-3.2-90B-Vision-Instruct | llama | text, image | text | 16,000 | 4,096 | $0.35 / 1M | $0.4 / 1M | No | Yes | Sep 25, 2024 | |
Qiniu qiniu-ai | Z-Ai/Autoglm Phone 9b z-ai/autoglm-phone-9b | Unknown | text, image | text | 12,800 | 4,096 | Unknown | Unknown | No | Yes | Dec 23, 2025 | |
Clarifai clarifai | DeepSeek OCR deepseek-ai/deepseek-ocr/models/DeepSeek-OCR | deepseek | text, image | text | 8,192 | 8,192 | $0.2 / 1M | $0.7 / 1M | No | No | Feb 25, 2026 | |
LLM Gateway llmgateway | Llama 4 Maverick 17B Instruct llama-4-maverick-17b-instruct | llama | text, image | text | 8,192 | 2,048 | $0.24 / 1M | $0.97 / 1M | No | No | Apr 5, 2025 | |
LLM Gateway llmgateway | Llama 4 Scout 17B Instruct llama-4-scout-17b-instruct | llama | text, image | text | 8,192 | 2,048 | $0.17 / 1M | $0.66 / 1M | No | No | Apr 5, 2025 | |
NovitaAI novita-ai | DeepSeek-OCR deepseek/deepseek-ocr | Unknown | text, image | text | 8,192 | 8,192 | $0.03 / 1M | $0.03 / 1M | No | No | Oct 24, 2025 | |
NovitaAI novita-ai | deepseek/deepseek-ocr-2 deepseek/deepseek-ocr-2 | Unknown | text, image | text | 8,192 | 8,192 | $0.03 / 1M | $0.03 / 1M | No | No | Jan 27, 2026 | |
Poe poe | GPT-4-Classic openai/gpt-4-classic | gpt | text, image | text | 8,192 | 4,096 | $27 / 1M | $54 / 1M | No | Yes | Mar 25, 2024 | |
Poe poe | GPT-4-Classic-0314 openai/gpt-4-classic-0314 | gpt | text, image | text | 8,192 | 4,096 | $27 / 1M | $54 / 1M | No | Yes | Aug 26, 2024 | |
Qiniu qiniu-ai | Stepfun-Ai/Gelab Zero 4b Preview stepfun-ai/gelab-zero-4b-preview | Unknown | text, image | text | 8,192 | 4,096 | Unknown | Unknown | No | Yes | Dec 23, 2025 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/DeepSeek-OCR deepseek-ai/DeepSeek-OCR | Unknown | text, image | text | 8,192 | 8,192 | $0 / 1M | $0 / 1M | No | No | Oct 20, 2025 | |
Vercel AI Gateway vercel | Grok 2 Vision xai/grok-2-vision | grok | text, image | text | 8,192 | 4,096 | $2 / 1M | $10 / 1M | No | Yes | Aug 20, 2024 | |
xAI xai | Grok 2 Vision grok-2-vision | grok | text, image | text | 8,192 | 4,096 | $2 / 1M | $10 / 1M | No | Yes | Aug 20, 2024 | |
xAI xai | Grok 2 Vision (1212) grok-2-vision-1212 | grok | text, image | text | 8,192 | 4,096 | $2 / 1M | $10 / 1M | No | Yes | Dec 12, 2024 | |
xAI xai | Grok 2 Vision Latest grok-2-vision-latest | grok | text, image | text | 8,192 | 4,096 | $2 / 1M | $10 / 1M | No | Yes | Dec 12, 2024 | |
xAI xai | Grok Vision Beta grok-vision-beta | grok-vision | text, image | text | 8,192 | 4,096 | $5 / 1M | $15 / 1M | No | Yes | Nov 1, 2024 | |
Nvidia nvidia | Phi 3 Small 8k Instruct microsoft/phi-3-small-8k-instruct | Unknown | text, image | text | 8,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 7, 2024 | |
Poe poe | GPT-3.5-Turbo-Raw openai/gpt-3.5-turbo-raw | gpt | text, image | text | 4,524 | 2,048 | $0.45 / 1M | $1.4 / 1M | No | Yes | Sep 27, 2023 | |
Nvidia nvidia | Phi 3 Medium 4k Instruct microsoft/phi-3-medium-4k-instruct | Unknown | text, image | text | 4,000 | 4,096 | $0 / 1M | $0 / 1M | No | Yes | May 7, 2024 | |
SiliconFlow (China) siliconflow-cn | deepseek-ai/deepseek-vl2 deepseek-ai/deepseek-vl2 | deepseek | text, image | text | 4,000 | 4,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | deepseek-ai/deepseek-vl2 deepseek-ai/deepseek-vl2 | deepseek | text, image | text | 4,000 | 4,000 | $0.15 / 1M | $0.15 / 1M | No | Yes | Nov 25, 2025 | |
Poe poe | GPT-3.5-Turbo-Instruct openai/gpt-3.5-turbo-instruct | gpt | text, image | text | 3,500 | 1,024 | $1.4 / 1M | $1.8 / 1M | No | Yes | Sep 20, 2023 |
424 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Vercel AI Gateway vercel | Grok 4.20 Beta Non-Reasoning xai/grok-4.20-non-reasoning-beta | grok | text, image, pdf | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | No | Yes | Mar 13, 2026 | |
Vercel AI Gateway vercel | Grok 4.20 Beta Reasoning xai/grok-4.20-reasoning-beta | grok | text, image, pdf | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 13, 2026 | |
Vercel AI Gateway vercel | Grok 4.20 Non-Reasoning xai/grok-4.20-non-reasoning | grok | text, image, pdf | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | No | Yes | Mar 23, 2026 | |
Vercel AI Gateway vercel | Grok 4.20 Reasoning xai/grok-4.20-reasoning | grok | text, image, pdf | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | Yes | Yes | Mar 23, 2026 | |
AIHubMix aihubmix | GPT-5.5 gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 23, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.4 gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.5 gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 24, 2026 | |
Azure azure | GPT-5.4 gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Azure azure | GPT-5.5 gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 24, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.4 openai/gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.5 openai/gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 23, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.4) duo-chat-gpt-5-4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 5, 2026 | |
LLM Gateway llmgateway | GPT-5.4 gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenAI openai | GPT-5.4 gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenAI openai | GPT-5.5 gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 23, 2026 | |
OpenAI openai | GPT-5.5 Pro gpt-5.5-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Apr 23, 2026 | |
OpenCode Zen opencode | GPT-5.4 gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenCode Zen opencode | GPT-5.4 Pro gpt-5.4-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenCode Zen opencode | GPT-5.5 gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 23, 2026 | |
OpenCode Zen opencode | GPT-5.5 Pro gpt-5.5-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Apr 24, 2026 | |
OpenRouter openrouter | GPT-5.4 openai/gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenRouter openrouter | GPT-5.4 Pro openai/gpt-5.4-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
OpenRouter openrouter | GPT-5.5 openai/gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 23, 2026 | |
OpenRouter openrouter | GPT-5.5 Pro openai/gpt-5.5-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Apr 23, 2026 | |
Requesty requesty | GPT-5.4 openai/gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Requesty requesty | GPT-5.4 Pro openai/gpt-5.4-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 5, 2026 | |
Vercel AI Gateway vercel | GPT 5.4 openai/gpt-5.4 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 6, 2026 | |
Vercel AI Gateway vercel | GPT 5.4 Pro openai/gpt-5.4-pro | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 6, 2026 | |
ZenMux zenmux | GPT-5.5 openai/gpt-5.5 | gpt | text, image, pdf | text | 1,050,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 23, 2026 | |
ZenMux zenmux | GPT-5.5 Pro openai/gpt-5.5-pro | gpt-pro | text, image, pdf | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Apr 23, 2026 | |
NanoGPT nano-gpt | Gemini 2.5 Flash (No Thinking) gemini-2.5-flash-nothinking | Unknown | text, image, pdf | text | 1,048,756 | 65,536 | $0.3 / 1M | $2.5 / 1M | No | No | Jun 5, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Lite Preview (09/2025) gemini-2.5-flash-lite-preview-09-2025 | Unknown | text, image, pdf | text | 1,048,756 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Lite Preview (09/2025) – Thinking gemini-2.5-flash-lite-preview-09-2025-thinking | Unknown | text, image, pdf | text | 1,048,756 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Preview (09/2025) gemini-2.5-flash-preview-09-2025 | Unknown | text, image, pdf | text | 1,048,756 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
NanoGPT nano-gpt | Gemini 2.5 Flash Preview (09/2025) – Thinking gemini-2.5-flash-preview-09-2025-thinking | Unknown | text, image, pdf | text | 1,048,756 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
FastRouter fastrouter | Gemini 2.5 Flash google/gemini-2.5-flash | gemini-flash | text, image, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 17, 2025 | |
FastRouter fastrouter | Gemini 2.5 Pro google/gemini-2.5-pro | gemini-pro | text, image, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 17, 2025 | |
Poe poe | Claude-Opus-4.7 anthropic/claude-opus-4.7 | Unknown | text, image, pdf | text | 1,048,576 | 128,000 | $4.3 / 1M | $21 / 1M | Yes | Yes | Apr 15, 2026 | |
DigitalOcean digitalocean | GPT-4.1 openai-gpt-4.1 | gpt | text, image, pdf | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
LLM Gateway llmgateway | GPT-4.1 gpt-4.1 | gpt | text, image, pdf | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
LLM Gateway llmgateway | GPT-4.1 mini gpt-4.1-mini | gpt-mini | text, image, pdf | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
NanoGPT nano-gpt | GPT 4.1 openai/gpt-4.1 | gpt | text, image, pdf | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Sep 10, 2025 | |
NanoGPT nano-gpt | GPT 4.1 Nano openai/gpt-4.1-nano | gpt-nano | text, image, pdf | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | No | Apr 14, 2025 | |
OpenAI openai | GPT-4.1 gpt-4.1 | gpt | text, image, pdf | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
OpenAI openai | GPT-4.1 mini gpt-4.1-mini | gpt-mini | text, image, pdf | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
Vercel AI Gateway vercel | GPT-4.1 openai/gpt-4.1 | gpt | text, image, pdf | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Apr 14, 2025 | |
Vercel AI Gateway vercel | GPT-4.1 mini openai/gpt-4.1-mini | gpt-mini | text, image, pdf | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Apr 14, 2025 | |
302.AI 302ai | claude-opus-4-6 claude-opus-4-6 | Unknown | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
302.AI 302ai | claude-opus-4-6-thinking claude-opus-4-6-thinking | Unknown | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
302.AI 302ai | claude-sonnet-4-6 claude-sonnet-4-6 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | No | Yes | Mar 13, 2026 | |
302.AI 302ai | claude-sonnet-4-6-thinking claude-sonnet-4-6-thinking | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
AIHubMix aihubmix | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
AIHubMix aihubmix | Claude Opus 4.7 claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Amazon Bedrock amazon-bedrock | AU Anthropic Claude Opus 4.6 au.anthropic.claude-opus-4-6-v1 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $16.5 / 1M | $82.5 / 1M | Yes | Yes | Feb 5, 2026 | |
Amazon Bedrock amazon-bedrock | AU Anthropic Claude Sonnet 4.6 au.anthropic.claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 128,000 | $3.3 / 1M | $16.5 / 1M | Yes | Yes | Feb 17, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.6 anthropic.claude-opus-4-6-v1 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.6 (EU) eu.anthropic.claude-opus-4-6-v1 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.6 (Global) global.anthropic.claude-opus-4-6-v1 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.6 (US) us.anthropic.claude-opus-4-6-v1 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.7 anthropic.claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.7 (EU) eu.anthropic.claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.7 (Global) global.anthropic.claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.7 (US) us.anthropic.claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.6 anthropic.claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.6 (EU) eu.anthropic.claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.6 (Global) global.anthropic.claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.6 (US) us.anthropic.claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Anthropic anthropic | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
Anthropic anthropic | Claude Opus 4.7 claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Anthropic anthropic | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Azure azure | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Opus 4.6 (latest) anthropic/claude-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Opus 4.7 anthropic/claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
Cortecs cortecs | Claude Opus 4.6 claude-opus4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 1,000,000 | $5.98 / 1M | $29.89 / 1M | Yes | Yes | Mar 13, 2026 | |
Cortecs cortecs | Claude Opus 4.7 claude-opus4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5.6 / 1M | $27.99 / 1M | Yes | Yes | Apr 16, 2026 | |
Cortecs cortecs | Claude Sonnet 4.6 claude-4-6-sonnet | claude-sonnet | text, image, pdf | text | 1,000,000 | 1,000,000 | $3.59 / 1M | $17.92 / 1M | Yes | Yes | Mar 13, 2026 | |
DigitalOcean digitalocean | Claude Opus 4.6 anthropic-claude-opus-4.6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
DigitalOcean digitalocean | Claude Opus 4.7 anthropic-claude-opus-4.7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
DigitalOcean digitalocean | Claude Sonnet 4 anthropic-claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
DigitalOcean digitalocean | Claude Sonnet 4.5 anthropic-claude-4.5-sonnet | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
DigitalOcean digitalocean | Claude Sonnet 4.6 anthropic-claude-4.6-sonnet | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
DigitalOcean digitalocean | GPT-5.4 openai-gpt-5.4 | gpt | text, image, pdf | text | 1,000,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Firmware firmware | Qwen 3.6 Plus qwen-3-6-plus | qwen | text, image, pdf | text | 1,000,000 | 64,000 | $0.5 / 1M | $3 / 1M | Yes | Yes | Apr 3, 2026 | |
GitLab Duo gitlab | Agentic Chat (Claude Opus 4.6) duo-chat-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 5, 2026 | |
GitLab Duo gitlab | Agentic Chat (Claude Opus 4.7) duo-chat-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 16, 2026 | |
GitLab Duo gitlab | Agentic Chat (Claude Sonnet 4.6) duo-chat-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 17, 2026 | |
LLM Gateway llmgateway | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
LLM Gateway llmgateway | Claude Opus 4.7 claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
LLM Gateway llmgateway | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
NanoGPT nano-gpt | Claude 4 Sonnet Thinking claude-sonnet-4-thinking | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Feb 24, 2025 | |
NanoGPT nano-gpt | Claude 4 Sonnet Thinking (1K) claude-sonnet-4-thinking:1024 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Sonnet Thinking (32K) claude-sonnet-4-thinking:32768 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Sonnet Thinking (64K) claude-sonnet-4-thinking:64000 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Sonnet Thinking (8K) claude-sonnet-4-thinking:8192 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4.6 Opus anthropic/claude-opus-4.6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Feb 5, 2026 | |
NanoGPT nano-gpt | Claude 4.6 Opus Thinking anthropic/claude-opus-4.6:thinking | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Feb 5, 2026 | |
NanoGPT nano-gpt | Claude 4.6 Opus Thinking Low anthropic/claude-opus-4.6:thinking:low | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Feb 5, 2026 | |
NanoGPT nano-gpt | Claude 4.6 Opus Thinking Max anthropic/claude-opus-4.6:thinking:max | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Feb 5, 2026 | |
NanoGPT nano-gpt | Claude 4.6 Opus Thinking Medium anthropic/claude-opus-4.6:thinking:medium | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Feb 5, 2026 | |
NanoGPT nano-gpt | Claude Sonnet 4.5 claude-sonnet-4-5-20250929 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | No | Yes | Sep 29, 2025 | |
NanoGPT nano-gpt | Claude Sonnet 4.5 Thinking claude-sonnet-4-5-20250929-thinking | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Sep 29, 2025 | |
NanoGPT nano-gpt | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 128,000 | $2.992 / 1M | $14.994 / 1M | No | Yes | Feb 17, 2026 | |
NanoGPT nano-gpt | Claude Sonnet 4.6 Thinking anthropic/claude-sonnet-4.6:thinking | claude-sonnet | text, image, pdf | text | 1,000,000 | 128,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Feb 17, 2026 | |
OpenCode Zen opencode | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
OpenCode Zen opencode | Claude Opus 4.7 claude-opus-4-7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
OpenCode Zen opencode | Claude Sonnet 4 claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
OpenCode Zen opencode | Claude Sonnet 4.5 claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
OpenCode Zen opencode | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
OpenRouter openrouter | Claude Opus 4.6 anthropic/claude-opus-4.6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
OpenRouter openrouter | Claude Opus 4.7 anthropic/claude-opus-4.7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
OpenRouter openrouter | Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Requesty requesty | Claude Opus 4.6 anthropic/claude-opus-4-6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Requesty requesty | Claude Sonnet 4.5 anthropic/claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-4.6-opus anthropic--claude-4.6-opus | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
SAP AI Core sap-ai-core | anthropic--claude-4.6-sonnet anthropic--claude-4.6-sonnet | claude-sonnet | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 13, 2026 | |
Vercel AI Gateway vercel | Claude Opus 4.6 anthropic/claude-opus-4.6 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 1, 2026 | |
Vercel AI Gateway vercel | Claude Opus 4.7 anthropic/claude-opus-4.7 | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Vercel AI Gateway vercel | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | claude-sonnet | text, image, pdf | text | 1,000,000 | 128,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
Vercel AI Gateway vercel | Gemini 3 Flash google/gemini-3-flash | gemini-flash | text, image, pdf | text | 1,000,000 | 64,000 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Vercel AI Gateway vercel | Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview | gemini | text, image, pdf | text | 1,000,000 | 65,000 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 6, 2026 | |
Vercel AI Gateway vercel | Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | gemini | text, image, pdf | text | 1,000,000 | 64,000 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 24, 2026 | |
Vercel AI Gateway vercel | GPT 5.5 openai/gpt-5.5 | gpt | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $30 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | GPT 5.5 Pro openai/gpt-5.5-pro | gpt | text, image, pdf | text | 1,000,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | Qwen 3.5 Flash alibaba/qwen3.5-flash | qwen | text, image, pdf | text | 1,000,000 | 64,000 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Feb 24, 2026 | |
Vercel AI Gateway vercel | Qwen 3.5 Plus alibaba/qwen3.5-plus | qwen | text, image, pdf | text | 1,000,000 | 64,000 | $0.4 / 1M | $2.4 / 1M | Yes | Yes | Feb 19, 2026 | |
Vercel AI Gateway vercel | Qwen 3.6 Plus alibaba/qwen3.6-plus | qwen | text, image, pdf | text | 1,000,000 | 64,000 | $0.5 / 1M | $3 / 1M | Yes | Yes | Apr 3, 2026 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Opus 4.6 claude-opus-4-6@default | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Opus 4.7 claude-opus-4-7@default | claude-opus | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
ZenMux zenmux | Claude Opus 4.7 anthropic/claude-opus-4.7 | Unknown | text, image, pdf | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
ZenMux zenmux | Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | Unknown | text, image, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Poe poe | Claude-Opus-4.6 anthropic/claude-opus-4.6 | Unknown | text, image, pdf | text | 983,040 | 128,000 | $4.3 / 1M | $21 / 1M | Yes | Yes | Feb 4, 2026 | |
Poe poe | Claude-Sonnet-4 anthropic/claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 983,040 | 64,000 | $2.6 / 1M | $13 / 1M | Yes | Yes | May 21, 2025 | |
Poe poe | Claude-Sonnet-4.5 anthropic/claude-sonnet-4.5 | claude-sonnet | text, image, pdf | text | 983,040 | 32,768 | $2.6 / 1M | $13 / 1M | Yes | Yes | Sep 26, 2025 | |
Poe poe | Claude-Sonnet-4.6 anthropic/claude-sonnet-4.6 | Unknown | text, image, pdf | text | 983,040 | 128,000 | $2.6 / 1M | $13 / 1M | Yes | Yes | Feb 5, 2026 | |
Abacus abacus | GPT-5.2 Codex gpt-5.2-codex | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Abacus abacus | GPT-5.3 Codex gpt-5.3-codex | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
Abacus abacus | GPT-5.3 Codex XHigh gpt-5.3-codex-xhigh | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
AIHubMix aihubmix | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.4 Mini gpt-5.4-mini | gpt-mini | text, image, pdf | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.4 Nano gpt-5.4-nano | gpt-nano | text, image, pdf | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
Azure azure | GPT-5.4 Mini gpt-5.4-mini | gpt-mini | text, image, pdf | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
Azure azure | GPT-5.4 Nano gpt-5.4-nano | gpt-nano | text, image, pdf | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.2 Codex openai/gpt-5.2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | GPT-5.3 Codex openai/gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
DigitalOcean digitalocean | GPT-5.3 Codex openai-gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.2 Codex) duo-chat-gpt-5-2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 22, 2026 | |
GitLab Duo gitlab | Agentic Chat (GPT-5.3 Codex) duo-chat-gpt-5-3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 5, 2026 | |
LLM Gateway llmgateway | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
LLM Gateway llmgateway | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
NanoGPT nano-gpt | GPT 5 openai/gpt-5 | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
NanoGPT nano-gpt | GPT 5.1 openai/gpt-5.1 | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT 5.1 Codex Max openai/gpt-5.1-codex-max | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $2.5 / 1M | $20 / 1M | Yes | Yes | Nov 13, 2025 | |
NanoGPT nano-gpt | GPT 5.2 openai/gpt-5.2 | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 1, 2026 | |
NanoGPT nano-gpt | GPT 5.2 Codex openai/gpt-5.2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
NanoGPT nano-gpt | GPT 5.2 Pro openai/gpt-5.2-pro | gpt-pro | text, image, pdf | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Jan 1, 2026 | |
OpenAI openai | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
OpenAI openai | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
OpenCode Zen opencode | GPT-5.2 Codex gpt-5.2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 14, 2026 | |
OpenCode Zen opencode | GPT-5.3 Codex gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
OpenCode Zen opencode | GPT-5.4 Mini gpt-5.4-mini | gpt-mini | text, image, pdf | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
OpenCode Zen opencode | GPT-5.4 Nano gpt-5.4-nano | gpt-nano | text, image, pdf | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
OpenRouter openrouter | GPT-5.3-Codex openai/gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
OpenRouter openrouter | GPT-5.4 Mini openai/gpt-5.4-mini | gpt-mini | text, image, pdf | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
OpenRouter openrouter | GPT-5.4 Nano openai/gpt-5.4-nano | gpt-nano | text, image, pdf | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | No | Yes | Mar 17, 2026 | |
Requesty requesty | GPT-5.3-Codex openai/gpt-5.3-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
Vercel AI Gateway vercel | GPT 5.1 Codex Max openai/gpt-5.1-codex-max | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT 5.2 openai/gpt-5.2-pro | gpt | text, image, pdf | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT 5.3 Codex openai/gpt-5.3-codex | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 24, 2026 | |
Vercel AI Gateway vercel | GPT 5.4 Mini openai/gpt-5.4-mini | gpt | text, image, pdf | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
Vercel AI Gateway vercel | GPT 5.4 Nano openai/gpt-5.4-nano | gpt | text, image, pdf | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
Vercel AI Gateway vercel | GPT-5.1 Codex mini openai/gpt-5.1-codex-mini | gpt | text, image, pdf | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | May 16, 2025 | |
Vercel AI Gateway vercel | GPT-5.1-Codex openai/gpt-5.1-codex | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5.2 openai/gpt-5.2 | gpt | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5.2-Codex openai/gpt-5.2-codex | gpt-codex | text, image, pdf | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 1, 2025 | |
Vivgrid vivgrid | GPT-5.4 gpt-5.4 | gpt | text, image, pdf | text | 400,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 5, 2026 | |
Vivgrid vivgrid | GPT-5.4 Mini gpt-5.4-mini | gpt-mini | text, image, pdf | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Mar 17, 2026 | |
Vivgrid vivgrid | GPT-5.4 Nano gpt-5.4-nano | gpt-nano | text, image, pdf | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Mar 17, 2026 | |
ZenMux zenmux | GPT-5 openai/gpt-5 | Unknown | text, image, pdf | text | 400,000 | 64,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
ZenMux zenmux | GPT-5.2-Codex openai/gpt-5.2-codex | Unknown | text, image, pdf | text | 400,000 | 64,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Jan 15, 2026 | |
ZenMux zenmux | GPT-5.2-Pro openai/gpt-5.2-pro | Unknown | text, image, pdf | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Dec 11, 2025 | |
Vercel AI Gateway vercel | Gemma 4 26B A4B IT google/gemma-4-26b-a4b-it | gemma | text, image, pdf | text | 262,144 | 131,072 | $0.13 / 1M | $0.4 / 1M | Yes | Yes | Apr 3, 2026 | |
Vercel AI Gateway vercel | Gemma 4 31B IT google/gemma-4-31b-it | gemma | text, image, pdf | text | 262,144 | 131,072 | $0.14 / 1M | $0.4 / 1M | Yes | Yes | Apr 3, 2026 | |
Vercel AI Gateway vercel | Qwen3 235B A22B Thinking 2507 alibaba/qwen3-235b-a22b-thinking | qwen | text, image, pdf | text | 262,114 | 262,114 | $0.3 / 1M | $2.9 / 1M | Yes | Yes | Apr 1, 2025 | |
Vercel AI Gateway vercel | Kimi K2.6 moonshotai/kimi-k2.6 | kimi-k2.6 | text, image, pdf | text | 262,000 | 262,000 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | Ministral 14B mistral/ministral-14b | ministral | text, image, pdf | text | 256,000 | 256,000 | $0.2 / 1M | $0.2 / 1M | No | No | Dec 1, 2025 | |
Vercel AI Gateway vercel | Qwen 3.6 Max Preview alibaba/qwen-3.6-max-preview | qwen | text, image, pdf | text | 240,000 | 64,000 | $1.3 / 1M | $7.8 / 1M | Yes | Yes | Apr 24, 2026 | |
Vercel AI Gateway vercel | Minimax M2.7 minimax/minimax-m2.7 | minimax | text, image, pdf | text | 204,800 | 131,000 | $0.3 / 1M | $1.2 / 1M | Yes | Yes | Mar 18, 2026 | |
Vercel AI Gateway vercel | GLM 5.1 zai/glm-5.1 | glm | text, image, pdf | text | 202,752 | 202,752 | $1.4 / 1M | $4.4 / 1M | Yes | Yes | Apr 16, 2026 | |
302.AI 302ai | claude-3-5-haiku-20241022 claude-3-5-haiku-20241022 | Unknown | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
302.AI 302ai | claude-3-5-haiku-latest claude-3-5-haiku-latest | Unknown | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
302.AI 302ai | claude-haiku-4-5 claude-haiku-4-5 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 16, 2025 | |
302.AI 302ai | claude-haiku-4-5-20251001 claude-haiku-4-5-20251001 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 16, 2025 | |
302.AI 302ai | claude-opus-4-20250514 claude-opus-4-20250514 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | No | Yes | May 22, 2025 | |
302.AI 302ai | claude-opus-4-5 claude-opus-4-5 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 25, 2025 | |
302.AI 302ai | claude-opus-4-5-20251101 claude-opus-4-5-20251101 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 25, 2025 | |
302.AI 302ai | claude-opus-4-7 claude-opus-4-7 | Unknown | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 17, 2026 | |
302.AI 302ai | claude-sonnet-4-20250514 claude-sonnet-4-20250514 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | No | Yes | May 22, 2025 | |
302.AI 302ai | claude-sonnet-4-5 claude-sonnet-4-5 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | No | Yes | Sep 30, 2025 | |
302.AI 302ai | claude-sonnet-4-5-20250929 claude-sonnet-4-5-20250929 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | No | Yes | Sep 30, 2025 | |
302.AI 302ai | claude-sonnet-4-5-20250929-thinking claude-sonnet-4-5-20250929-thinking | Unknown | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 30, 2025 | |
Abacus abacus | Claude Haiku 4.5 claude-haiku-4-5-20251001 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Abacus abacus | Claude Opus 4 claude-opus-4-20250514 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 14, 2025 | |
Abacus abacus | Claude Opus 4.1 claude-opus-4-1-20250805 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Abacus abacus | Claude Opus 4.5 claude-opus-4-5-20251101 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 1, 2025 | |
Abacus abacus | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Abacus abacus | Claude Sonnet 3.7 claude-3-7-sonnet-20250219 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 19, 2025 | |
Abacus abacus | Claude Sonnet 4 claude-sonnet-4-20250514 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 14, 2025 | |
Abacus abacus | Claude Sonnet 4.5 claude-sonnet-4-5-20250929 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Abacus abacus | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
AIHubMix aihubmix | Claude Opus 4.6 claude-opus-4-6-think | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 13, 2026 | |
AIHubMix aihubmix | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
AIHubMix aihubmix | Claude Sonnet 4.6 Think claude-sonnet-4-6-think | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
Amazon Bedrock amazon-bedrock | Claude Haiku 3 anthropic.claude-3-haiku-20240307-v1:0 | claude-haiku | text, image, pdf | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 13, 2024 | |
Amazon Bedrock amazon-bedrock | Claude Haiku 3.5 anthropic.claude-3-5-haiku-20241022-v1:0 | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Amazon Bedrock amazon-bedrock | Claude Haiku 4.5 anthropic.claude-haiku-4-5-20251001-v1:0 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Haiku 4.5 (EU) eu.anthropic.claude-haiku-4-5-20251001-v1:0 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Haiku 4.5 (Global) global.anthropic.claude-haiku-4-5-20251001-v1:0 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Haiku 4.5 (US) us.anthropic.claude-haiku-4-5-20251001-v1:0 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4 anthropic.claude-opus-4-20250514-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4 (US) us.anthropic.claude-opus-4-20250514-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.1 anthropic.claude-opus-4-1-20250805-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.1 (US) us.anthropic.claude-opus-4-1-20250805-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.5 anthropic.claude-opus-4-5-20251101-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Aug 1, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.5 (EU) eu.anthropic.claude-opus-4-5-20251101-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Aug 1, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.5 (Global) global.anthropic.claude-opus-4-5-20251101-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Aug 1, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Opus 4.5 (US) us.anthropic.claude-opus-4-5-20251101-v1:0 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Aug 1, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 3.5 anthropic.claude-3-5-sonnet-20240620-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Jun 20, 2024 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 3.5 v2 anthropic.claude-3-5-sonnet-20241022-v2:0 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 3.7 anthropic.claude-3-7-sonnet-20250219-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Feb 19, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4 anthropic.claude-sonnet-4-20250514-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4 (EU) eu.anthropic.claude-sonnet-4-20250514-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4 (Global) global.anthropic.claude-sonnet-4-20250514-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4 (US) us.anthropic.claude-sonnet-4-20250514-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.5 anthropic.claude-sonnet-4-5-20250929-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.5 (EU) eu.anthropic.claude-sonnet-4-5-20250929-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.5 (Global) global.anthropic.claude-sonnet-4-5-20250929-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Amazon Bedrock amazon-bedrock | Claude Sonnet 4.5 (US) us.anthropic.claude-sonnet-4-5-20250929-v1:0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Anthropic anthropic | Claude Haiku 3 claude-3-haiku-20240307 | claude-haiku | text, image, pdf | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 13, 2024 | |
Anthropic anthropic | Claude Haiku 3.5 claude-3-5-haiku-20241022 | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Anthropic anthropic | Claude Haiku 3.5 (latest) claude-3-5-haiku-latest | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Anthropic anthropic | Claude Haiku 4.5 claude-haiku-4-5-20251001 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Anthropic anthropic | Claude Haiku 4.5 (latest) claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Anthropic anthropic | Claude Opus 3 claude-3-opus-20240229 | claude-opus | text, image, pdf | text | 200,000 | 4,096 | $15 / 1M | $75 / 1M | No | Yes | Feb 29, 2024 | |
Anthropic anthropic | Claude Opus 4 claude-opus-4-20250514 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Anthropic anthropic | Claude Opus 4 (latest) claude-opus-4-0 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Anthropic anthropic | Claude Opus 4.1 claude-opus-4-1-20250805 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Anthropic anthropic | Claude Opus 4.1 (latest) claude-opus-4-1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Anthropic anthropic | Claude Opus 4.5 claude-opus-4-5-20251101 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 1, 2025 | |
Anthropic anthropic | Claude Opus 4.5 (latest) claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
Anthropic anthropic | Claude Sonnet 3 claude-3-sonnet-20240229 | claude-sonnet | text, image, pdf | text | 200,000 | 4,096 | $3 / 1M | $15 / 1M | No | Yes | Mar 4, 2024 | |
Anthropic anthropic | Claude Sonnet 3.5 claude-3-5-sonnet-20240620 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Jun 20, 2024 | |
Anthropic anthropic | Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
Anthropic anthropic | Claude Sonnet 3.7 claude-3-7-sonnet-20250219 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 19, 2025 | |
Anthropic anthropic | Claude Sonnet 4 claude-sonnet-4-20250514 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Anthropic anthropic | Claude Sonnet 4 (latest) claude-sonnet-4-0 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Anthropic anthropic | Claude Sonnet 4.5 claude-sonnet-4-5-20250929 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Anthropic anthropic | Claude Sonnet 4.5 (latest) claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Azure azure | Claude Haiku 4.5 claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Nov 18, 2025 | |
Azure azure | Claude Opus 4.1 claude-opus-4-1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Nov 18, 2025 | |
Azure azure | Claude Opus 4.5 claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Aug 1, 2025 | |
Azure azure | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Azure azure | Claude Sonnet 4.5 claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Nov 18, 2025 | |
Azure Cognitive Services azure-cognitive-services | Claude Haiku 4.5 claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Nov 18, 2025 | |
Azure Cognitive Services azure-cognitive-services | Claude Opus 4.1 claude-opus-4-1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Nov 18, 2025 | |
Azure Cognitive Services azure-cognitive-services | Claude Opus 4.5 claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Aug 1, 2025 | |
Azure Cognitive Services azure-cognitive-services | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Azure Cognitive Services azure-cognitive-services | Claude Sonnet 4.5 claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Nov 18, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Haiku 3 anthropic/claude-3-haiku | claude-haiku | text, image, pdf | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 13, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Haiku 3.5 (latest) anthropic/claude-3-5-haiku | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Haiku 3.5 (latest) anthropic/claude-3.5-haiku | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Haiku 4.5 (latest) anthropic/claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Opus 3 anthropic/claude-3-opus | claude-opus | text, image, pdf | text | 200,000 | 4,096 | $15 / 1M | $75 / 1M | No | Yes | Feb 29, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Opus 4 (latest) anthropic/claude-opus-4 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Opus 4.1 (latest) anthropic/claude-opus-4-1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Opus 4.5 (latest) anthropic/claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Sonnet 3 anthropic/claude-3-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 4,096 | $3 / 1M | $15 / 1M | No | Yes | Mar 4, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Sonnet 3.5 v2 anthropic/claude-3.5-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Sonnet 4 (latest) anthropic/claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Cloudflare AI Gateway cloudflare-ai-gateway | Claude Sonnet 4.5 (latest) anthropic/claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Cortecs cortecs | Claude 4.5 Sonnet claude-4-5-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 200,000 | $3.259 / 1M | $16.296 / 1M | Yes | Yes | Sep 29, 2025 | |
Cortecs cortecs | Claude Haiku 4.5 claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 200,000 | $1.09 / 1M | $5.43 / 1M | Yes | Yes | Oct 15, 2025 | |
Cortecs cortecs | Claude Opus 4.5 claude-opus4-5 | claude-opus | text, image, pdf | text | 200,000 | 200,000 | $5.98 / 1M | $29.89 / 1M | Yes | Yes | Nov 24, 2025 | |
Cortecs cortecs | Claude Sonnet 4 claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3.307 / 1M | $16.536 / 1M | No | Yes | May 22, 2025 | |
DigitalOcean digitalocean | Claude Haiku 4.5 anthropic-claude-haiku-4.5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
DigitalOcean digitalocean | Claude Opus 4 anthropic-claude-opus-4 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
DigitalOcean digitalocean | Claude Opus 4.1 anthropic-claude-4.1-opus | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
DigitalOcean digitalocean | Claude Opus 4.5 anthropic-claude-opus-4.5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
DigitalOcean digitalocean | o1 openai-o1 | o | text, image, pdf | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
DigitalOcean digitalocean | o3 openai-o3 | o | text, image, pdf | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
FastRouter fastrouter | Claude Opus 4.1 anthropic/claude-opus-4.1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
FastRouter fastrouter | Claude Sonnet 4 anthropic/claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Firmware firmware | Claude Haiku 4.5 claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Firmware firmware | Claude Opus 4.6 claude-opus-4-6 | claude-opus | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Firmware firmware | Claude Opus 4.7 claude-opus-4-7 | claude-opus | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Apr 16, 2026 | |
Firmware firmware | Claude Sonnet 4.6 claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
GitLab Duo gitlab | Agentic Chat (Claude Haiku 4.5) duo-chat-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 8, 2026 | |
GitLab Duo gitlab | Agentic Chat (Claude Opus 4.5) duo-chat-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 8, 2026 | |
GitLab Duo gitlab | Agentic Chat (Claude Sonnet 4.5) duo-chat-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 8, 2026 | |
LLM Gateway llmgateway | Claude Haiku 4.5 claude-haiku-4-5-20251001 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
LLM Gateway llmgateway | Claude Haiku 4.5 (latest) claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
LLM Gateway llmgateway | Claude Opus 4 claude-opus-4-20250514 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
LLM Gateway llmgateway | Claude Opus 4.1 claude-opus-4-1-20250805 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
LLM Gateway llmgateway | Claude Opus 4.5 claude-opus-4-5-20251101 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 1, 2025 | |
LLM Gateway llmgateway | Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
LLM Gateway llmgateway | Claude Sonnet 3.7 claude-3-7-sonnet-20250219 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 19, 2025 | |
LLM Gateway llmgateway | Claude Sonnet 4 claude-sonnet-4-20250514 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
LLM Gateway llmgateway | Claude Sonnet 4.5 claude-sonnet-4-5-20250929 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
LLM Gateway llmgateway | Claude Sonnet 4.5 (latest) claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
LLM Gateway llmgateway | o1 o1 | o | text, image, pdf | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
LLM Gateway llmgateway | o3 o3 | o | text, image, pdf | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
NanoGPT nano-gpt | Claude 3.5 Haiku claude-3-5-haiku-20241022 | Unknown | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
NanoGPT nano-gpt | Claude 3.5 Sonnet Old claude-3-5-sonnet-20240620 | Unknown | text, image, pdf | text | 200,000 | 8,192 | $2.992 / 1M | $14.994 / 1M | No | Yes | Jun 20, 2024 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet claude-3-7-sonnet-20250219 | Unknown | text, image, pdf | text | 200,000 | 16,000 | $2.992 / 1M | $14.994 / 1M | No | Yes | Feb 19, 2025 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet Thinking claude-3-7-sonnet-thinking | Unknown | text, image, pdf | text | 200,000 | 16,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Feb 24, 2025 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet Thinking (128K) claude-3-7-sonnet-thinking:128000 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Feb 24, 2025 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet Thinking (1K) claude-3-7-sonnet-thinking:1024 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Feb 24, 2025 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet Thinking (32K) claude-3-7-sonnet-thinking:32768 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Jul 15, 2025 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet Thinking (8K) claude-3-7-sonnet-thinking:8192 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | Yes | Yes | Feb 24, 2025 | |
NanoGPT nano-gpt | Claude 4 Opus claude-opus-4-20250514 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | No | Yes | May 14, 2025 | |
NanoGPT nano-gpt | Claude 4 Opus Thinking claude-opus-4-thinking | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | Jul 15, 2025 | |
NanoGPT nano-gpt | Claude 4 Opus Thinking (1K) claude-opus-4-thinking:1024 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Opus Thinking (32K) claude-opus-4-thinking:32000 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Opus Thinking (32K) claude-opus-4-thinking:32768 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Opus Thinking (8K) claude-opus-4-thinking:8192 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4 Sonnet claude-sonnet-4-20250514 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $2.992 / 1M | $14.994 / 1M | No | Yes | Sep 29, 2025 | |
NanoGPT nano-gpt | Claude 4.1 Opus claude-opus-4-1-20250805 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | No | Yes | Aug 5, 2025 | |
NanoGPT nano-gpt | Claude 4.1 Opus Thinking claude-opus-4-1-thinking | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4.1 Opus Thinking (1K) claude-opus-4-1-thinking:1024 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4.1 Opus Thinking (32K) claude-opus-4-1-thinking:32000 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4.1 Opus Thinking (32K) claude-opus-4-1-thinking:32768 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4.1 Opus Thinking (8K) claude-opus-4-1-thinking:8192 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $14.994 / 1M | $75.004 / 1M | Yes | Yes | May 22, 2025 | |
NanoGPT nano-gpt | Claude 4.5 Opus claude-opus-4-5-20251101 | Unknown | text, image, pdf | text | 200,000 | 32,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Nov 1, 2025 | |
NanoGPT nano-gpt | Claude 4.5 Opus Thinking claude-opus-4-5-20251101:thinking | Unknown | text, image, pdf | text | 200,000 | 32,000 | $4.998 / 1M | $25.007 / 1M | Yes | Yes | Nov 1, 2025 | |
NanoGPT nano-gpt | Claude Haiku 4.5 claude-haiku-4-5-20251001 | Unknown | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | No | Yes | Oct 15, 2025 | |
NanoGPT nano-gpt | OpenAI o1 Pro openai/o1-pro | o-pro | text, image, pdf | text | 200,000 | 100,000 | $150 / 1M | $600 / 1M | No | No | Jan 25, 2025 | |
OpenAI openai | o1 o1 | o | text, image, pdf | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
OpenAI openai | o3 o3 | o | text, image, pdf | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
OpenCode Zen opencode | Claude Haiku 3.5 claude-3-5-haiku | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
OpenCode Zen opencode | Claude Haiku 4.5 claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
OpenCode Zen opencode | Claude Opus 4.1 claude-opus-4-1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
OpenCode Zen opencode | Claude Opus 4.5 claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
OpenRouter openrouter | Claude Haiku 3.5 anthropic/claude-3.5-haiku | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
OpenRouter openrouter | Claude Haiku 4.5 anthropic/claude-haiku-4.5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
OpenRouter openrouter | Claude Opus 4 anthropic/claude-opus-4 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
OpenRouter openrouter | Claude Opus 4.1 anthropic/claude-opus-4.1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
OpenRouter openrouter | Claude Opus 4.5 anthropic/claude-opus-4.5 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
OpenRouter openrouter | Claude Sonnet 3.7 anthropic/claude-3.7-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 128,000 | $15 / 1M | $75 / 1M | Yes | Yes | Feb 19, 2025 | |
OpenRouter openrouter | Claude Sonnet 4 anthropic/claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Perplexity Agent perplexity-agent | Claude Haiku 4.5 anthropic/claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Perplexity Agent perplexity-agent | Claude Opus 4.5 anthropic/claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
Perplexity Agent perplexity-agent | Claude Opus 4.6 anthropic/claude-opus-4-6 | claude-opus | text, image, pdf | text | 200,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 5, 2026 | |
Perplexity Agent perplexity-agent | Claude Sonnet 4.5 anthropic/claude-sonnet-4-5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Perplexity Agent perplexity-agent | Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
QiHang qihang-ai | Claude Haiku 4.5 claude-haiku-4-5-20251001 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $0.14 / 1M | $0.71 / 1M | Yes | Yes | Oct 1, 2025 | |
QiHang qihang-ai | Claude Sonnet 4.5 claude-sonnet-4-5-20250929 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $0.43 / 1M | $2.14 / 1M | Yes | Yes | Sep 29, 2025 | |
Requesty requesty | Claude Haiku 4.5 anthropic/claude-haiku-4-5 | claude-haiku | text, image, pdf | text | 200,000 | 62,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Requesty requesty | Claude Opus 4 anthropic/claude-opus-4 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Requesty requesty | Claude Opus 4.1 anthropic/claude-opus-4-1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Requesty requesty | Claude Opus 4.5 anthropic/claude-opus-4-5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
Requesty requesty | Claude Sonnet 3.7 anthropic/claude-3-7-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 19, 2025 | |
Requesty requesty | Claude Sonnet 4 anthropic/claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-3-haiku anthropic--claude-3-haiku | claude-haiku | text, image, pdf | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 13, 2024 | |
SAP AI Core sap-ai-core | anthropic--claude-3-opus anthropic--claude-3-opus | claude-opus | text, image, pdf | text | 200,000 | 4,096 | $15 / 1M | $75 / 1M | No | Yes | Feb 29, 2024 | |
SAP AI Core sap-ai-core | anthropic--claude-3-sonnet anthropic--claude-3-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 4,096 | $3 / 1M | $15 / 1M | No | Yes | Mar 4, 2024 | |
SAP AI Core sap-ai-core | anthropic--claude-3.5-sonnet anthropic--claude-3.5-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
SAP AI Core sap-ai-core | anthropic--claude-3.7-sonnet anthropic--claude-3.7-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 24, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-4-opus anthropic--claude-4-opus | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-4-sonnet anthropic--claude-4-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-4.5-haiku anthropic--claude-4.5-haiku | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-4.5-opus anthropic--claude-4.5-opus | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
SAP AI Core sap-ai-core | anthropic--claude-4.5-sonnet anthropic--claude-4.5-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Vercel AI Gateway vercel | Claude 3.5 Sonnet (2024-06-20) anthropic/claude-3.5-sonnet-20240620 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Jun 20, 2024 | |
Vercel AI Gateway vercel | Claude Haiku 3 anthropic/claude-3-haiku | claude-haiku | text, image, pdf | text | 200,000 | 4,096 | $0.25 / 1M | $1.25 / 1M | No | Yes | Mar 13, 2024 | |
Vercel AI Gateway vercel | Claude Haiku 3.5 anthropic/claude-3.5-haiku | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Vercel AI Gateway vercel | Claude Haiku 4.5 anthropic/claude-haiku-4.5 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Vercel AI Gateway vercel | Claude Opus 3 anthropic/claude-3-opus | claude-opus | text, image, pdf | text | 200,000 | 4,096 | $15 / 1M | $75 / 1M | No | Yes | Feb 29, 2024 | |
Vercel AI Gateway vercel | Claude Opus 4 anthropic/claude-opus-4 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Vercel AI Gateway vercel | Claude Opus 4.1 anthropic/claude-opus-4.1 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Vercel AI Gateway vercel | Claude Opus 4.5 anthropic/claude-opus-4.5 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
Vercel AI Gateway vercel | Claude Sonnet 3.5 v2 anthropic/claude-3.5-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
Vercel AI Gateway vercel | Claude Sonnet 3.7 anthropic/claude-3.7-sonnet | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 19, 2025 | |
Vercel AI Gateway vercel | Claude Sonnet 4 anthropic/claude-sonnet-4 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Vercel AI Gateway vercel | Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Vercel AI Gateway vercel | Codex Mini openai/codex-mini | gpt-codex-mini | text, image, pdf | text | 200,000 | 100,000 | $1.5 / 1M | $6 / 1M | Yes | Yes | May 16, 2025 | |
Vercel AI Gateway vercel | GLM 5V Turbo zai/glm-5v-turbo | glm | text, image, pdf | text | 200,000 | 128,000 | $1.2 / 1M | $4 / 1M | Yes | Yes | Apr 3, 2026 | |
Vercel AI Gateway vercel | o1 openai/o1 | o | text, image, pdf | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | Yes | Yes | Dec 5, 2024 | |
Vercel AI Gateway vercel | o3 openai/o3 | o | text, image, pdf | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Apr 16, 2025 | |
Vercel AI Gateway vercel | o3 Pro openai/o3-pro | o-pro | text, image, pdf | text | 200,000 | 100,000 | $20 / 1M | $80 / 1M | Yes | Yes | Apr 16, 2025 | |
Vercel AI Gateway vercel | o3-deep-research openai/o3-deep-research | o | text, image, pdf | text | 200,000 | 100,000 | $10 / 1M | $40 / 1M | Yes | Yes | Jun 26, 2024 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Haiku 3.5 claude-3-5-haiku@20241022 | claude-haiku | text, image, pdf | text | 200,000 | 8,192 | $0.8 / 1M | $4 / 1M | No | Yes | Oct 22, 2024 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Haiku 4.5 claude-haiku-4-5@20251001 | claude-haiku | text, image, pdf | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Opus 4 claude-opus-4@20250514 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Opus 4.1 claude-opus-4-1@20250805 | claude-opus | text, image, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Opus 4.5 claude-opus-4-5@20251101 | claude-opus | text, image, pdf | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 1, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Sonnet 3.5 v2 claude-3-5-sonnet@20241022 | claude-sonnet | text, image, pdf | text | 200,000 | 8,192 | $3 / 1M | $15 / 1M | No | Yes | Oct 22, 2024 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Sonnet 3.7 claude-3-7-sonnet@20250219 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 19, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Sonnet 4 claude-sonnet-4@20250514 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Sonnet 4.5 claude-sonnet-4-5@20250929 | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Sep 29, 2025 | |
Vertex (Anthropic) google-vertex-anthropic | Claude Sonnet 4.6 claude-sonnet-4-6@default | claude-sonnet | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 17, 2026 | |
ZenMux zenmux | Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | Unknown | text, image, pdf | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Feb 24, 2025 | |
Poe poe | Claude-Opus-4.1 anthropic/claude-opus-4.1 | claude-opus | text, image, pdf | text | 196,608 | 32,000 | $13 / 1M | $64 / 1M | Yes | Yes | Aug 5, 2025 | |
Poe poe | Claude-Opus-4.5 anthropic/claude-opus-4.5 | claude-opus | text, image, pdf | text | 196,608 | 64,000 | $4.3 / 1M | $21 / 1M | Yes | Yes | Nov 21, 2025 | |
Poe poe | Claude-Sonnet-3.7 anthropic/claude-sonnet-3.7 | claude-sonnet | text, image, pdf | text | 196,608 | 128,000 | $2.6 / 1M | $13 / 1M | Yes | Yes | Feb 19, 2025 | |
Poe poe | Claude-Opus-4 anthropic/claude-opus-4 | claude-opus | text, image, pdf | text | 192,512 | 28,672 | $13 / 1M | $64 / 1M | Yes | Yes | May 21, 2025 | |
Poe poe | Claude-Haiku-4.5 anthropic/claude-haiku-4.5 | claude-haiku | text, image, pdf | text | 192,000 | 64,000 | $0.85 / 1M | $4.3 / 1M | Yes | Yes | Oct 15, 2025 | |
Poe poe | Claude-Haiku-3 anthropic/claude-haiku-3 | claude-haiku | text, image, pdf | text | 189,096 | 8,192 | $0.21 / 1M | $1.1 / 1M | No | Yes | Mar 9, 2024 | |
Poe poe | Claude-Haiku-3.5 anthropic/claude-haiku-3.5 | claude-haiku | text, image, pdf | text | 189,096 | 8,192 | $0.68 / 1M | $3.4 / 1M | No | Yes | Oct 1, 2024 | |
Poe poe | Claude-Sonnet-3.5 anthropic/claude-sonnet-3.5 | claude-sonnet | text, image, pdf | text | 189,096 | 8,192 | $2.6 / 1M | $13 / 1M | No | Yes | Jun 5, 2024 | |
Poe poe | Claude-Sonnet-3.5-June anthropic/claude-sonnet-3.5-june | claude-sonnet | text, image, pdf | text | 189,096 | 8,192 | $2.6 / 1M | $13 / 1M | No | Yes | Nov 18, 2024 | |
DigitalOcean digitalocean | GPT-4o openai-gpt-4o | gpt | text, image, pdf | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
DigitalOcean digitalocean | GPT-4o mini openai-gpt-4o-mini | gpt-mini | text, image, pdf | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
LLM Gateway llmgateway | GPT-4o gpt-4o | gpt | text, image, pdf | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
LLM Gateway llmgateway | GPT-4o mini gpt-4o-mini | gpt-mini | text, image, pdf | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
OpenAI openai | GPT-4o gpt-4o | gpt | text, image, pdf | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
OpenAI openai | GPT-4o mini gpt-4o-mini | gpt-mini | text, image, pdf | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
OpenAI openai | GPT-5.3 Codex Spark gpt-5.3-codex-spark | gpt-codex-spark | text, image, pdf | text | 128,000 | 32,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Feb 5, 2026 | |
Vercel AI Gateway vercel | GLM-4.6V zai/glm-4.6v | glm | text, image, pdf | text | 128,000 | 24,000 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Sep 30, 2025 | |
Vercel AI Gateway vercel | GLM-4.6V-Flash zai/glm-4.6v-flash | glm | text, image, pdf | text | 128,000 | 24,000 | Unknown | Unknown | Yes | Yes | Sep 30, 2025 | |
Vercel AI Gateway vercel | GPT-4o openai/gpt-4o | gpt | text, image, pdf | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Aug 6, 2024 | |
Vercel AI Gateway vercel | GPT-4o mini openai/gpt-4o-mini | gpt-mini | text, image, pdf | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Jul 18, 2024 | |
Vercel AI Gateway vercel | GPT-5.2 Chat openai/gpt-5.2-chat | gpt | text, image, pdf | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5.3 Chat openai/gpt-5.3-chat | gpt | text, image, pdf | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | Yes | Yes | Mar 6, 2026 | |
NanoGPT nano-gpt | Ernie X1 Turbo 32k ernie-x1-turbo-32k | Unknown | text, image, pdf | text | 32,000 | 16,384 | $0.165 / 1M | $0.66 / 1M | No | No | May 8, 2025 | |
197 model API records in this modality group.
| Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|
Qiniu qiniu-ai | X-Ai/Grok 4.1 Fast Reasoning x-ai/grok-4.1-fast-reasoning | Unknown | text, image, audio, video | text | 20,000,000 | 2,000,000 | Unknown | Unknown | Yes | Yes | Dec 19, 2025 | |
Qiniu qiniu-ai | X-Ai/Grok 4.1 Fast Non Reasoning x-ai/grok-4.1-fast-non-reasoning | Unknown | text, image, audio, video | text | 2,000,000 | 2,000,000 | Unknown | Unknown | Yes | Yes | Dec 19, 2025 | |
Qiniu qiniu-ai | x-AI/Grok-4-Fast x-ai/grok-4-fast | Unknown | text, image, audio, video | text | 2,000,000 | 2,000,000 | Unknown | Unknown | Yes | Yes | Sep 20, 2025 | |
Qiniu qiniu-ai | X-Ai/Grok-4-Fast-Non-Reasoning x-ai/grok-4-fast-non-reasoning | Unknown | text, image, audio, video | text | 2,000,000 | 2,000,000 | Unknown | Unknown | Yes | Yes | Dec 18, 2025 | |
Qiniu qiniu-ai | X-Ai/Grok-4-Fast-Reasoning x-ai/grok-4-fast-reasoning | Unknown | text, image, audio, video | text | 2,000,000 | 2,000,000 | Unknown | Unknown | Yes | Yes | Dec 18, 2025 | |
Poe poe | Gemini-2.5-Flash google/gemini-2.5-flash | gemini-flash | text, image, video, audio | text | 1,065,535 | 65,535 | $0.21 / 1M | $1.8 / 1M | Yes | Yes | Apr 26, 2025 | |
Poe poe | Gemini-2.5-Pro google/gemini-2.5-pro | gemini-pro | text, image, video, audio | text | 1,065,535 | 65,535 | $0.87 / 1M | $7 / 1M | Yes | Yes | Feb 5, 2025 | |
OpenRouter openrouter | Gemini 3 Pro Preview google/gemini-3-pro-preview | gemini-pro | text, image, audio, video, pdf | text | 1,050,000 | 66,000 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 1, 2025 | |
ZenMux zenmux | Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview | Unknown | text, image, audio, video | text | 1,050,000 | 65,530 | $0.25 / 1M | $1.5 / 1M | No | Yes | Mar 20, 2025 | |
Abacus abacus | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 5, 2025 | |
Abacus abacus | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Mar 25, 2025 | |
Abacus abacus | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Abacus abacus | Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 1, 2026 | |
Abacus abacus | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
AIHubMix aihubmix | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.499 / 1M | Yes | Yes | Jun 5, 2025 | |
AIHubMix aihubmix | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
AIHubMix aihubmix | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
AIHubMix aihubmix | Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 3, 2026 | |
AIHubMix aihubmix | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Firmware firmware | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jul 17, 2025 | |
Firmware firmware | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Firmware firmware | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Google google | Gemini 2.0 Flash gemini-2.0-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.1 / 1M | $0.4 / 1M | No | Yes | Dec 11, 2024 | |
Google google | Gemini 2.0 Flash Lite gemini-2.0-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | Yes | Dec 11, 2024 | |
Google google | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 5, 2025 | |
Google google | Gemini 2.5 Flash Lite gemini-2.5-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
Google google | Gemini 2.5 Flash Lite Preview 06-17 gemini-2.5-flash-lite-preview-06-17 | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
Google google | Gemini 2.5 Flash Lite Preview 09-25 gemini-2.5-flash-lite-preview-09-2025 | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
Google google | Gemini 2.5 Flash Preview 04-17 gemini-2.5-flash-preview-04-17 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Apr 17, 2025 | |
Google google | Gemini 2.5 Flash Preview 05-20 gemini-2.5-flash-preview-05-20 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | May 20, 2025 | |
Google google | Gemini 2.5 Flash Preview 09-25 gemini-2.5-flash-preview-09-2025 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
Google google | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Google google | Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | May 6, 2025 | |
Google google | Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Google google | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Google google | Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | gemini-flash-lite | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 3, 2026 | |
Google google | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Google google | Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Google google | Gemini Flash Latest gemini-flash-latest | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
Google google | Gemini Flash-Lite Latest gemini-flash-lite-latest | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
Helicone helicone | Google Gemini 3 Pro Preview gemini-3-pro-preview | gemini-pro | text, image, audio, video | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
Jiekou.AI jiekou | gemini-2.5-flash gemini-2.5-flash | gemini-flash | text, image, video, audio | text | 1,048,576 | 65,535 | $0.27 / 1M | $2.25 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-2.5-flash-lite gemini-2.5-flash-lite | gemini-flash-lite | text, image, video, audio | text | 1,048,576 | 65,535 | $0.09 / 1M | $0.36 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-2.5-flash-lite-preview-06-17 gemini-2.5-flash-lite-preview-06-17 | gemini-flash-lite | text, video, image, audio | text | 1,048,576 | 65,535 | $0.09 / 1M | $0.36 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-2.5-flash-lite-preview-09-2025 gemini-2.5-flash-lite-preview-09-2025 | gemini-flash-lite | text, image, video, audio | text | 1,048,576 | 65,536 | $0.09 / 1M | $0.36 / 1M | Yes | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-2.5-flash-preview-05-20 gemini-2.5-flash-preview-05-20 | gemini-flash | text, image, video, audio | text | 1,048,576 | 200,000 | $0.135 / 1M | $3.15 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-2.5-pro gemini-2.5-pro | gemini-pro | text, image, video, audio | text | 1,048,576 | 65,535 | $1.125 / 1M | $9 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-2.5-pro-preview-06-05 gemini-2.5-pro-preview-06-05 | gemini-pro | text, image, video, audio | text | 1,048,576 | 200,000 | $1.125 / 1M | $9 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-3-flash-preview gemini-3-flash-preview | gemini-flash | text, image, video, audio | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | No | Yes | Jan 1, 2026 | |
Jiekou.AI jiekou | gemini-3-pro-preview gemini-3-pro-preview | gemini-pro | text, image, video, audio | text | 1,048,576 | 65,536 | $1.8 / 1M | $10.8 / 1M | No | Yes | Jan 1, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 8,192 | $0.1 / 1M | $0.4 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.5 Flash google/gemini-2.5-flash | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,535 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,535 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.5 Pro google/gemini-2.5-pro | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,535 | $1.25 / 1M | $10 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview | Unknown | audio, image, pdf, text | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 3 Flash Preview google/gemini-3-flash-preview | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools | Unknown | audio, image, pdf, text, video | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Mar 15, 2026 | |
LLM Gateway llmgateway | Gemini 2.0 Flash gemini-2.0-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.1 / 1M | $0.4 / 1M | No | Yes | Dec 11, 2024 | |
LLM Gateway llmgateway | Gemini 2.0 Flash Lite gemini-2.0-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | Yes | Dec 11, 2024 | |
LLM Gateway llmgateway | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 5, 2025 | |
LLM Gateway llmgateway | Gemini 2.5 Flash Lite gemini-2.5-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
LLM Gateway llmgateway | Gemini 2.5 Flash Lite Preview 09-25 gemini-2.5-flash-lite-preview-09-2025 | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
LLM Gateway llmgateway | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
LLM Gateway llmgateway | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
LLM Gateway llmgateway | Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | gemini-flash-lite | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 3, 2026 | |
LLM Gateway llmgateway | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
OpenCode Zen opencode | Gemini 3 Flash gemini-3-flash | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
OpenCode Zen opencode | Gemini 3 Pro gemini-3-pro | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
OpenCode Zen opencode | Gemini 3.1 Pro Preview gemini-3.1-pro | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
OpenRouter openrouter | Gemini 2.0 Flash google/gemini-2.0-flash-001 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.1 / 1M | $0.4 / 1M | No | Yes | Dec 11, 2024 | |
OpenRouter openrouter | Gemini 2.5 Flash google/gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jul 17, 2025 | |
OpenRouter openrouter | Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
OpenRouter openrouter | Gemini 2.5 Flash Lite Preview 09-25 google/gemini-2.5-flash-lite-preview-09-2025 | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
OpenRouter openrouter | Gemini 2.5 Flash Preview 09-25 google/gemini-2.5-flash-preview-09-2025 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
OpenRouter openrouter | Gemini 2.5 Pro google/gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
OpenRouter openrouter | Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | May 6, 2025 | |
OpenRouter openrouter | Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview-06-05 | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
OpenRouter openrouter | Gemini 3 Flash Preview google/gemini-3-flash-preview | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
OpenRouter openrouter | Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview | gemini-flash-lite | text, image, video, pdf, audio | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 3, 2026 | |
OpenRouter openrouter | Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
OpenRouter openrouter | Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
OpenRouter openrouter | MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro | mimo | text, image, audio, video, pdf | text | 1,048,576 | 131,072 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 22, 2026 | |
Perplexity Agent perplexity-agent | Gemini 2.5 Flash google/gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 5, 2025 | |
Perplexity Agent perplexity-agent | Gemini 2.5 Pro google/gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Perplexity Agent perplexity-agent | Gemini 3 Flash Preview google/gemini-3-flash-preview | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Perplexity Agent perplexity-agent | Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Poe poe | Gemini-3-Flash google/gemini-3-flash | Unknown | text, image, video, audio | text | 1,048,576 | 65,536 | $0.4 / 1M | $2.4 / 1M | Yes | Yes | Oct 7, 2025 | |
Poe poe | Gemini-3-Pro google/gemini-3-pro | gemini-pro | text, image, video, audio | text | 1,048,576 | 65,536 | $1.6 / 1M | $9.6 / 1M | Yes | Yes | Oct 22, 2025 | |
Poe poe | Gemini-3.1-Flash-Lite google/gemini-3.1-flash-lite | Unknown | text, image, video, audio | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Feb 18, 2026 | |
Poe poe | Gemini-3.1-Pro google/gemini-3.1-pro | Unknown | text, image, video, audio | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
QiHang qihang-ai | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.09 / 1M | $0.71 / 1M | Yes | Yes | Dec 17, 2025 | |
QiHang qihang-ai | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.07 / 1M | $0.43 / 1M | Yes | Yes | Dec 17, 2025 | |
Qiniu qiniu-ai | Gemini 2.0 Flash gemini-2.0-flash | Unknown | text, image, audio, video | text | 1,048,576 | 8,192 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Gemini 2.0 Flash Lite gemini-2.0-flash-lite | Unknown | text, image, audio, video | text | 1,048,576 | 8,192 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Gemini 2.5 Flash gemini-2.5-flash | Unknown | text, image, audio, video | text | 1,048,576 | 64,000 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Gemini 2.5 Flash Lite gemini-2.5-flash-lite | Unknown | text, image, audio, video | text | 1,048,576 | 64,000 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Gemini 2.5 Pro gemini-2.5-pro | Unknown | text, image, video, audio | text | 1,048,576 | 65,536 | Unknown | Unknown | Yes | Yes | Aug 5, 2025 | |
Requesty requesty | Gemini 2.5 Flash google/gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 17, 2025 | |
Requesty requesty | Gemini 2.5 Pro google/gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Requesty requesty | Gemini 3 Flash google/gemini-3-flash-preview | Unknown | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Requesty requesty | Gemini 3 Pro google/gemini-3-pro-preview | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
SAP AI Core sap-ai-core | gemini-2.5-flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 5, 2025 | |
SAP AI Core sap-ai-core | gemini-2.5-flash-lite gemini-2.5-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
SAP AI Core sap-ai-core | gemini-2.5-pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Vercel AI Gateway vercel | Gemini 2.0 Flash google/gemini-2.0-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.1 / 1M | $0.4 / 1M | No | Yes | Dec 11, 2024 | |
Vercel AI Gateway vercel | Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | Yes | Dec 11, 2024 | |
Vercel AI Gateway vercel | Gemini 2.5 Flash google/gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 5, 2025 | |
Vercel AI Gateway vercel | Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
Vercel AI Gateway vercel | Gemini 2.5 Flash Lite Preview 09-25 google/gemini-2.5-flash-lite-preview-09-2025 | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
Vercel AI Gateway vercel | Gemini 2.5 Flash Preview 09-25 google/gemini-2.5-flash-preview-09-2025 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
Vercel AI Gateway vercel | Gemini 2.5 Pro google/gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Vertex google-vertex | Gemini 2.0 Flash gemini-2.0-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.15 / 1M | $0.6 / 1M | No | Yes | Dec 11, 2024 | |
Vertex google-vertex | Gemini 2.0 Flash Lite gemini-2.0-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | Yes | Dec 11, 2024 | |
Vertex google-vertex | Gemini 2.5 Flash gemini-2.5-flash | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 17, 2025 | |
Vertex google-vertex | Gemini 2.5 Flash Lite gemini-2.5-flash-lite | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 | |
Vertex google-vertex | Gemini 2.5 Flash Lite Preview 09-25 gemini-2.5-flash-lite-preview-09-2025 | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
Vertex google-vertex | Gemini 2.5 Flash Preview 04-17 gemini-2.5-flash-preview-04-17 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Apr 17, 2025 | |
Vertex google-vertex | Gemini 2.5 Flash Preview 05-20 gemini-2.5-flash-preview-05-20 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | May 20, 2025 | |
Vertex google-vertex | Gemini 2.5 Flash Preview 09-25 gemini-2.5-flash-preview-09-2025 | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
Vertex google-vertex | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Vertex google-vertex | Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | May 6, 2025 | |
Vertex google-vertex | Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 | gemini-pro | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 5, 2025 | |
Vertex google-vertex | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
Vertex google-vertex | Gemini 3 Pro Preview gemini-3-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
Vertex google-vertex | Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | gemini-flash-lite | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 3, 2026 | |
Vertex google-vertex | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Vertex google-vertex | Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Vertex google-vertex | Gemini Flash Latest gemini-flash-latest | gemini-flash | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Sep 25, 2025 | |
Vertex google-vertex | Gemini Flash-Lite Latest gemini-flash-lite-latest | gemini-flash-lite | text, image, audio, video, pdf | text | 1,048,576 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Sep 25, 2025 | |
Vivgrid vivgrid | Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | gemini-flash-lite | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $0.25 / 1M | $1.5 / 1M | Yes | Yes | Mar 3, 2026 | |
Vivgrid vivgrid | Gemini 3.1 Pro Preview gemini-3.1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,048,576 | 65,536 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Xiaomi xiaomi | MiMo-V2.5-Pro mimo-v2.5-pro | mimo | text, image, audio, video, pdf | text | 1,048,576 | 131,072 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 22, 2026 | |
ZenMux zenmux | MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro | mimo | text, image, audio, video, pdf | text | 1,048,576 | 131,072 | $1 / 1M | $3 / 1M | Yes | Yes | Apr 22, 2026 | |
ZenMux zenmux | Gemini 2.5 Flash google/gemini-2.5-flash | Unknown | pdf, image, text, audio | text | 1,048,000 | 64,000 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Jun 17, 2025 | |
ZenMux zenmux | Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | Unknown | pdf, image, text, audio | text | 1,048,000 | 64,000 | $0.1 / 1M | $0.4 / 1M | No | Yes | Jul 22, 2025 | |
ZenMux zenmux | Gemini 2.5 Pro google/gemini-2.5-pro | Unknown | pdf, image, text, audio, video | text | 1,048,000 | 64,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Jun 17, 2025 | |
ZenMux zenmux | Gemini 3 Flash Preview google/gemini-3-flash-preview | Unknown | text, image, pdf, audio | text | 1,048,000 | 64,000 | $0.5 / 1M | $3 / 1M | Yes | Yes | Dec 17, 2025 | |
ZenMux zenmux | Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | Unknown | text, image, pdf, audio, video | text | 1,048,000 | 64,000 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 19, 2026 | |
Poe poe | Gemini-2.5-Flash-Lite google/gemini-2.5-flash-lite | gemini-flash-lite | text, image, video, audio | text | 1,024,000 | 64,000 | $0.07 / 1M | $0.28 / 1M | Yes | Yes | Jun 19, 2025 | |
Firmware firmware | Gemini 3 Pro Preview gemini-3-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,000,000 | 64,000 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
Firmware firmware | Gemini 3.1 Pro Preview gemini-3-1-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,000,000 | 64,000 | $2 / 1M | $12 / 1M | Yes | Yes | Feb 18, 2026 | |
Google google | Gemini 1.5 Flash gemini-1.5-flash | gemini-flash | text, image, audio, video | text | 1,000,000 | 8,192 | $0.075 / 1M | $0.3 / 1M | No | Yes | May 14, 2024 | |
Google google | Gemini 1.5 Flash-8B gemini-1.5-flash-8b | gemini-flash | text, image, audio, video | text | 1,000,000 | 8,192 | $0.0375 / 1M | $0.15 / 1M | No | Yes | Oct 3, 2024 | |
Google google | Gemini 1.5 Pro gemini-1.5-pro | gemini-pro | text, image, audio, video | text | 1,000,000 | 8,192 | $1.25 / 1M | $5 / 1M | No | Yes | Feb 15, 2024 | |
Google google | Gemini 3 Pro Preview gemini-3-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,000,000 | 64,000 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
Kilo Gateway kilo | Amazon: Nova 2 Lite amazon/nova-2-lite-v1 | Unknown | image, pdf, text, video | text | 1,000,000 | 65,535 | $0.3 / 1M | $2.5 / 1M | Yes | Yes | Mar 15, 2026 | |
Nova nova | Nova 2 Lite nova-2-lite-v1 | nova-lite | text, image, video, pdf | text | 1,000,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 1, 2025 | |
Nova nova | Nova 2 Pro nova-2-pro-v1 | nova-pro | text, image, video, pdf | text | 1,000,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 3, 2026 | |
OpenCode Go opencode-go | MiMo V2.5 mimo-v2.5 | mimo-v2.5 | text, image, audio, pdf | text | 1,000,000 | 128,000 | $0.4 / 1M | $2 / 1M | Yes | Yes | Apr 22, 2026 | |
QiHang qihang-ai | Gemini 3 Pro Preview gemini-3-pro-preview | gemini-pro | text, image, audio, video | text | 1,000,000 | 65,000 | $0.57 / 1M | $3.43 / 1M | Yes | Yes | Nov 19, 2025 | |
Qiniu qiniu-ai | Gemini 3.0 Flash Preview gemini-3.0-flash-preview | Unknown | text, image, audio, video, pdf | text | 1,000,000 | 64,000 | Unknown | Unknown | Yes | Yes | Dec 18, 2025 | |
Qiniu qiniu-ai | Gemini 3.0 Pro Preview gemini-3.0-pro-preview | Unknown | text, image, video, pdf, audio | text | 1,000,000 | 64,000 | Unknown | Unknown | Yes | Yes | Nov 19, 2025 | |
Venice AI venice | Gemini 3.1 Pro Preview gemini-3-1-pro-preview | gemini-pro | text, image, audio, video | text | 1,000,000 | 32,768 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 12, 2026 | |
Vercel AI Gateway vercel | Gemini 3 Pro Preview google/gemini-3-pro-preview | gemini-pro | text, image, video, audio, pdf | text | 1,000,000 | 64,000 | $2 / 1M | $12 / 1M | Yes | Yes | Nov 18, 2025 | |
Xiaomi Token Plan (China) xiaomi-token-plan-cn | MiMo-V2.5 mimo-v2.5 | mimo-v2.5 | text, image, audio, video, pdf | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
Xiaomi Token Plan (Europe) xiaomi-token-plan-ams | MiMo-V2.5 mimo-v2.5 | mimo-v2.5 | text, image, audio, video, pdf | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
Xiaomi Token Plan (Singapore) xiaomi-token-plan-sgp | MiMo-V2.5 mimo-v2.5 | mimo-v2.5 | text, image, audio, video, pdf | text | 1,000,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 22, 2026 | |
Poe poe | Gemini-2.0-Flash google/gemini-2.0-flash | gemini-flash | text, image, video, audio | text | 990,000 | 8,192 | $0.1 / 1M | $0.42 / 1M | No | Yes | Dec 11, 2024 | |
Poe poe | Gemini-2.0-Flash-Lite google/gemini-2.0-flash-lite | gemini-flash-lite | text, image, video, audio | text | 990,000 | 8,192 | $0.052 / 1M | $0.21 / 1M | No | Yes | Feb 5, 2025 | |
Alibaba alibaba | Qwen3.5 122B-A10B qwen3.5-122b-a10b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.4 / 1M | $3.2 / 1M | Yes | Yes | Feb 23, 2026 | |
Alibaba alibaba | Qwen3.5 27B qwen3.5-27b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.3 / 1M | $2.4 / 1M | Yes | Yes | Feb 23, 2026 | |
Alibaba alibaba | Qwen3.5 35B-A3B qwen3.5-35b-a3b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.25 / 1M | $2 / 1M | Yes | Yes | Feb 23, 2026 | |
Alibaba alibaba | Qwen3.5 397B-A17B qwen3.5-397b-a17b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Feb 15, 2026 | |
Alibaba alibaba | Qwen3.6 27B qwen3.6-27b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Apr 22, 2026 | |
Alibaba alibaba | Qwen3.6 35B-A3B qwen3.6-35b-a3b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.248 / 1M | $1.485 / 1M | Yes | Yes | Apr 17, 2026 | |
Helicone helicone | Qwen3 Coder 480B A35B Instruct Turbo qwen3-coder | qwen | text, image, audio, video | text | 262,144 | 16,384 | $0.22 / 1M | $0.95 / 1M | No | Yes | Jul 23, 2025 | |
Kilo Gateway kilo | Xiaomi: MiMo-V2-Omni xiaomi/mimo-v2-omni | Unknown | audio, image, text, video | text | 262,144 | 65,536 | $0.4 / 1M | $2 / 1M | Yes | Yes | Apr 11, 2026 | |
LLM Gateway llmgateway | Qwen3.5 397B-A17B qwen35-397b-a17b | qwen | text, image, video, audio | text | 262,144 | 65,536 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Feb 15, 2026 | |
OpenCode Go opencode-go | MiMo V2 Omni mimo-v2-omni | mimo-v2-omni | text, image, audio, pdf | text | 262,144 | 128,000 | $0.4 / 1M | $2 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenCode Zen opencode | MiMo V2 Omni Free mimo-v2-omni-free | mimo-omni-free | text, image, audio, pdf | text | 262,144 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
OpenRouter openrouter | MiMo-V2-Omni xiaomi/mimo-v2-omni | mimo | text, image, video, audio | text | 262,144 | 65,536 | $0.4 / 1M | $2 / 1M | Yes | Yes | Mar 18, 2026 | |
Nvidia nvidia | Nemotron 3 Nano Omni nvidia/nemotron-3-nano-omni-30b-a3b-reasoning | nemotron | text, image, video, audio | text | 256,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 28, 2026 | |
OpenRouter openrouter | Nemotron 3 Nano Omni (free) nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free | nemotron | text, image, video, audio | text | 256,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 28, 2026 | |
Venice AI venice | Gemini 3 Flash Preview gemini-3-flash-preview | gemini-flash | text, image, audio, video, pdf | text | 256,000 | 65,536 | $0.7 / 1M | $3.75 / 1M | Yes | Yes | Mar 12, 2026 | |
Xiaomi xiaomi | MiMo-V2-Omni mimo-v2-omni | mimo | text, image, audio, video, pdf | text | 256,000 | 128,000 | $0.4 / 1M | $2 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (China) xiaomi-token-plan-cn | MiMo-V2-Omni mimo-v2-omni | mimo | text, image, audio, video, pdf | text | 256,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (Europe) xiaomi-token-plan-ams | MiMo-V2-Omni mimo-v2-omni | mimo | text, image, audio, video, pdf | text | 256,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
Xiaomi Token Plan (Singapore) xiaomi-token-plan-sgp | MiMo-V2-Omni mimo-v2-omni | mimo | text, image, audio, video, pdf | text | 256,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 18, 2026 | |
302.AI 302ai | glm-5v-turbo glm-5v-turbo | Unknown | text, image, video, audio, pdf | text | 200,000 | 131,072 | $0.72 / 1M | $3.2 / 1M | Yes | Yes | Apr 2, 2026 | |
Z.AI Coding Plan zai-coding-plan | glm-5v-turbo glm-5v-turbo | glm | text, image, video, pdf | text | 200,000 | 131,072 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 1, 2026 | |
Z.AI zai | glm-5v-turbo glm-5v-turbo | glm | text, image, video, pdf | text | 200,000 | 131,072 | $1.2 / 1M | $4 / 1M | Yes | Yes | Apr 1, 2026 | |
ZenMux zenmux | GLM 5V Turbo z-ai/glm-5v-turbo | Unknown | text, image, video, pdf | text | 200,000 | 128,000 | $0.726 / 1M | $3.1946 / 1M | Yes | Yes | Apr 1, 2026 | |
Zhipu AI zhipuai | glm-5v-turbo glm-5v-turbo | glm | text, image, video, pdf | text | 200,000 | 131,072 | $5 / 1M | $22 / 1M | Yes | Yes | Apr 1, 2026 | |
Berget.AI berget | Gemma 4 31B Instruct google/gemma-4-31B-it | gemma | audio, image, text, video | text | 128,000 | 8,192 | $0.275 / 1M | $0.55 / 1M | Yes | Yes | Apr 2, 2026 | |
GitHub Copilot github-copilot | Gemini 2.5 Pro gemini-2.5-pro | gemini-pro | text, image, audio, video | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | No | Yes | Jun 5, 2025 | |
GitHub Copilot github-copilot | Gemini 3 Flash gemini-3-flash-preview | gemini-flash | text, image, audio, video | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 17, 2025 | |
GitHub Copilot github-copilot | Gemini 3 Pro Preview gemini-3-pro-preview | gemini-pro | text, image, audio, video | text | 128,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Nov 18, 2025 | |
Qiniu qiniu-ai | Qwen 2.5 VL 72B Instruct qwen2.5-vl-72b-instruct | Unknown | text, image, audio, video | text | 128,000 | 8,192 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen 2.5 VL 7B Instruct qwen2.5-vl-7b-instruct | Unknown | text, image, audio, video | text | 128,000 | 8,192 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen VL-MAX-2025-01-25 qwen-vl-max-2025-01-25 | Unknown | text, image, audio, video | text | 128,000 | 4,096 | Unknown | Unknown | No | Yes | Aug 5, 2025 | |
NovitaAI novita-ai | Qwen3 Omni 30B A3B Thinking qwen/qwen3-omni-30b-a3b-thinking | Unknown | text, audio, video, image | text | 65,536 | 16,384 | $0.25 / 1M | $0.97 / 1M | Yes | Yes | Sep 24, 2025 | |
Vertex google-vertex | Gemini 2.5 Flash Lite Preview 06-17 gemini-2.5-flash-lite-preview-06-17 | gemini-flash-lite | text, image, audio, video, pdf | text | 65,536 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Jun 17, 2025 |
143 model API records in this modality group.
| Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|
ZenMux zenmux | Grok 4.2 Fast x-ai/grok-4.2-fast | Unknown | text, image, video | text | 2,000,000 | 30,000 | $3 / 1M | $9 / 1M | Yes | Yes | Mar 20, 2026 | |
ZenMux zenmux | Grok 4.2 Fast Non Reasoning x-ai/grok-4.2-fast-non-reasoning | Unknown | text, image, video | text | 2,000,000 | 30,000 | $3 / 1M | $9 / 1M | No | Yes | Mar 20, 2026 | |
Poe poe | gemini-deep-research google/gemini-deep-research | Unknown | text, image, video | text | 1,048,576 | Unknown | $1.6 / 1M | $9.6 / 1M | Yes | Yes | Dec 11, 2025 | |
AIHubMix aihubmix | Qwen3.6 Plus qwen3.6-flash | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.169 / 1M | $1.014 / 1M | Yes | Yes | Apr 2, 2026 | |
Alibaba (China) alibaba-cn | Qwen3.5 Flash qwen3.5-flash | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.172 / 1M | $1.72 / 1M | Yes | Yes | Feb 23, 2026 | |
Alibaba (China) alibaba-cn | Qwen3.5 Plus qwen3.5-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.573 / 1M | $3.44 / 1M | Yes | Yes | Feb 16, 2026 | |
Alibaba (China) alibaba-cn | Qwen3.6 Plus qwen3.6-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.276 / 1M | $1.651 / 1M | Yes | Yes | Apr 2, 2026 | |
Alibaba Coding Plan (China) alibaba-coding-plan-cn | Qwen3.6 Plus qwen3.6-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 2, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | Qwen3.5 Plus qwen3.5-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 16, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | Qwen3.6 Plus qwen3.6-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 2, 2026 | |
Alibaba alibaba | Qwen3.5 Plus qwen3.5-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.4 / 1M | $2.4 / 1M | Yes | Yes | Feb 16, 2026 | |
Alibaba alibaba | Qwen3.6 Plus qwen3.6-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.276 / 1M | $1.651 / 1M | Yes | Yes | Apr 2, 2026 | |
Amazon Bedrock amazon-bedrock | Nova Premier amazon.nova-premier-v1:0 | nova | text, image, video | text | 1,000,000 | 16,384 | $2.5 / 1M | $12.5 / 1M | Yes | Yes | Dec 3, 2024 | |
Meganova meganova | Qwen3.5 Plus Qwen/Qwen3.5-Plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.4 / 1M | $2.4 / 1M | Yes | Yes | Feb 1, 2026 | |
OpenRouter openrouter | Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23 | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.065 / 1M | $0.26 / 1M | Yes | Yes | Feb 25, 2026 | |
OpenRouter openrouter | Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15 | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.4 / 1M | $2.4 / 1M | Yes | Yes | Feb 16, 2026 | |
OpenRouter openrouter | Qwen3.6 Plus qwen/qwen3.6-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.325 / 1M | $1.95 / 1M | Yes | Yes | Apr 2, 2026 | |
Venice AI venice | Qwen 3.6 Plus Uncensored qwen-3-6-plus | qwen | text, image, video | text | 1,000,000 | 65,536 | $0.625 / 1M | $3.75 / 1M | Yes | Yes | Apr 12, 2026 | |
ZenMux zenmux | MiMo V2 Pro xiaomi/mimo-v2-pro | Unknown | text, image, video | text | 1,000,000 | 256,000 | $1.5 / 1M | $4.5 / 1M | Yes | Yes | Mar 20, 2026 | |
NanoGPT nano-gpt | Qwen 3.6 Plus qwen-3.6-plus | qwen3.6 | text, image, video | text | 991,800 | 65,536 | $0.45 / 1M | $2.7 / 1M | No | No | Apr 2, 2026 | |
NanoGPT nano-gpt | Qwen3.6 Flash alibaba/qwen3.6-flash | qwen3.6 | text, image, video | text | 991,800 | 65,536 | $0.19 / 1M | $1.16 / 1M | No | No | Apr 17, 2026 | |
AIHubMix aihubmix | Qwen3.5 Plus qwen3.5-plus | qwen | text, image, video | text | 991,000 | 64,000 | $0.1096 / 1M | $0.6576 / 1M | Yes | Yes | Feb 16, 2026 | |
Amazon Bedrock amazon-bedrock | Nova Lite amazon.nova-lite-v1:0 | nova-lite | text, image, video | text | 300,000 | 8,192 | $0.06 / 1M | $0.24 / 1M | No | Yes | Dec 3, 2024 | |
Amazon Bedrock amazon-bedrock | Nova Pro amazon.nova-pro-v1:0 | nova-pro | text, image, video | text | 300,000 | 8,192 | $0.8 / 1M | $3.2 / 1M | No | Yes | Dec 3, 2024 | |
Vercel AI Gateway vercel | Nova Lite amazon/nova-lite | nova-lite | text, image, video | text | 300,000 | 8,192 | $0.06 / 1M | $0.24 / 1M | No | Yes | Dec 3, 2024 | |
Vercel AI Gateway vercel | Nova Pro amazon/nova-pro | nova-pro | text, image, video | text | 300,000 | 8,192 | $0.8 / 1M | $3.2 / 1M | No | Yes | Dec 3, 2024 | |
ZenMux zenmux | MiMo V2 Omni xiaomi/mimo-v2-omni | Unknown | text, image, video | text | 265,000 | 265,000 | $0.4 / 1M | $2 / 1M | No | Yes | Mar 20, 2026 | |
Abacus abacus | Kimi K2.5 kimi-k2.5 | kimi | text, image, video | text | 262,144 | 32,768 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
AIHubMix aihubmix | Kimi K2.6 kimi-k2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $3.9995 / 1M | Yes | Yes | Apr 21, 2026 | |
Alibaba (China) alibaba-cn | kimi/kimi-k2.5 kimi/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
Alibaba (China) alibaba-cn | Moonshot Kimi K2.5 kimi-k2.5 | kimi | text, image, video | text | 262,144 | 32,768 | $0.574 / 1M | $2.411 / 1M | Yes | Yes | Jan 27, 2026 | |
Alibaba (China) alibaba-cn | Moonshot Kimi K2.6 kimi-k2.6 | kimi | text, image, video | text | 262,144 | 16,384 | $0.929 / 1M | $3.858 / 1M | Yes | Yes | Apr 21, 2026 | |
Alibaba (China) alibaba-cn | Qwen3.5 397B-A17B qwen3.5-397b-a17b | qwen | text, image, video | text | 262,144 | 65,536 | $0.43 / 1M | $2.58 / 1M | Yes | Yes | Feb 16, 2026 | |
Alibaba Coding Plan alibaba-coding-plan | Kimi K2.5 kimi-k2.5 | kimi | text, image, video | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 27, 2026 | |
Baseten baseten | Kimi K2.6 moonshotai/Kimi-K2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
Chutes chutes | Kimi K2.5 TEE moonshotai/Kimi-K2.5-TEE | kimi | text, image, video | text | 262,144 | 65,535 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
Chutes chutes | Kimi K2.6 TEE moonshotai/Kimi-K2.6-TEE | kimi | text, image, video | text | 262,144 | 262,144 | $0.44 / 1M | $2 / 1M | Yes | Yes | Apr 23, 2026 | |
Deep Infra deepinfra | Kimi K2.5 moonshotai/Kimi-K2.5 | kimi | text, image, video | text | 262,144 | 32,768 | $0.5 / 1M | $2.8 / 1M | Yes | Yes | Jan 27, 2026 | |
Deep Infra deepinfra | Kimi K2.6 moonshotai/Kimi-K2.6 | kimi | text, image, video | text | 262,144 | 16,384 | $0.75 / 1M | $3.5 / 1M | Yes | Yes | Apr 21, 2026 | |
Deep Infra deepinfra | Qwen 3.5 35B A3B Qwen/Qwen3.5-35B-A3B | qwen | text, image, video | text | 262,144 | 81,920 | $0.2 / 1M | $0.95 / 1M | Yes | Yes | Apr 20, 2026 | |
Deep Infra deepinfra | Qwen 3.5 397B A17B Qwen/Qwen3.5-397B-A17B | qwen | text, image, video | text | 262,144 | 81,920 | $0.54 / 1M | $3.4 / 1M | Yes | Yes | Apr 20, 2026 | |
Deep Infra deepinfra | Qwen3.6 35B A3B Qwen/Qwen3.6-35B-A3B | qwen | text, image, video | text | 262,144 | 81,920 | $0.2 / 1M | $1 / 1M | Yes | Yes | Apr 1, 2026 | |
evroc evroc | Kimi K2.5 moonshotai/Kimi-K2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $1.47 / 1M | $5.9 / 1M | Yes | Yes | Jan 27, 2026 | |
Helicone helicone | Qwen3 235B A22B Thinking qwen3-235b-a22b-thinking | qwen | text, image, video | text | 262,144 | 81,920 | $0.3 / 1M | $2.9 / 1M | Yes | No | Jul 25, 2025 | |
HPC-AI hpc-ai | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.21 / 1M | $1 / 1M | Yes | Yes | Mar 25, 2026 | |
Hugging Face huggingface | Kimi-K2.5 moonshotai/Kimi-K2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
Hugging Face huggingface | Kimi-K2.6 moonshotai/Kimi-K2.6 | kimi | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 20, 2026 | |
Jiekou.AI jiekou | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
Kimi For Coding kimi-for-coding | Kimi K2.5 k2p5 | kimi-thinking | text, image, video | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 1, 2026 | |
Kimi For Coding kimi-for-coding | Kimi K2.6 k2p6 | kimi-thinking | text, image, video | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 1, 2026 | |
LLM Gateway llmgateway | Kimi K2.5 kimi-k2.5 | kimi-k2.5 | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
LLM Gateway llmgateway | Kimi K2.6 kimi-k2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
Moonshot AI (China) moonshotai-cn | Kimi K2.5 kimi-k2.5 | kimi-k2.5 | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
Moonshot AI (China) moonshotai-cn | Kimi K2.6 kimi-k2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
Moonshot AI moonshotai | Kimi K2.5 kimi-k2.5 | kimi-k2.5 | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
Moonshot AI moonshotai | Kimi K2.6 kimi-k2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
NanoGPT nano-gpt | Qwen3.6 35B A3B qwen/Qwen3.6-35B-A3B | qwen3.6 | text, image, video | text | 262,144 | 16,384 | $0.29 / 1M | $1.74 / 1M | No | No | Apr 21, 2026 | |
NanoGPT nano-gpt | Qwen3.6 35B A3B Thinking qwen/Qwen3.6-35B-A3B:thinking | qwen3.6 | text, image, video | text | 262,144 | 16,384 | $0.29 / 1M | $1.74 / 1M | Yes | No | Apr 21, 2026 | |
NovitaAI novita-ai | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
NovitaAI novita-ai | Kimi K2.6 moonshotai/kimi-k2.6 | kimi | text, image, video | text | 262,144 | 262,144 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
NovitaAI novita-ai | Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b | qwen | text, image, video | text | 262,144 | 65,536 | $0.4 / 1M | $3.2 / 1M | Yes | Yes | Feb 26, 2026 | |
NovitaAI novita-ai | Qwen3.5-27B qwen/qwen3.5-27b | qwen | text, image, video | text | 262,144 | 65,536 | $0.3 / 1M | $2.4 / 1M | Yes | Yes | Feb 26, 2026 | |
NovitaAI novita-ai | Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b | qwen | text, image, video | text | 262,144 | 65,536 | $0.25 / 1M | $2 / 1M | Yes | Yes | Feb 26, 2026 | |
NovitaAI novita-ai | Qwen3.5-397B-A17B qwen/qwen3.5-397b-a17b | qwen | text, image, video | text | 262,144 | 64,000 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Feb 17, 2026 | |
NovitaAI novita-ai | Qwen3.6-27B qwen/qwen3.6-27b | qwen3.6 | text, image, video | text | 262,144 | 65,536 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Apr 2, 2026 | |
Nvidia nvidia | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 27, 2026 | |
OpenCode Go opencode-go | Kimi K2.5 kimi-k2.5 | kimi-k2.5 | text, image, video | text | 262,144 | 65,536 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
OpenCode Go opencode-go | Kimi K2.6 (3x limits) kimi-k2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 65,536 | $0.32 / 1M | $1.34 / 1M | Yes | Yes | Apr 21, 2026 | |
OpenCode Go opencode-go | Qwen3.5 Plus qwen3.5-plus | qwen3.5 | text, image, video | text | 262,144 | 65,536 | $0.2 / 1M | $1.2 / 1M | Yes | Yes | Feb 16, 2026 | |
OpenCode Go opencode-go | Qwen3.6 Plus qwen3.6-plus | qwen3.6 | text, image, video | text | 262,144 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Apr 2, 2026 | |
OpenCode Zen opencode | Kimi K2.5 kimi-k2.5 | kimi | text, image, video | text | 262,144 | 65,536 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
OpenCode Zen opencode | Kimi K2.5 Free kimi-k2.5-free | kimi-free | text, image, video | text | 262,144 | 262,144 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 27, 2026 | |
OpenCode Zen opencode | Kimi K2.6 kimi-k2.6 | kimi | text, image, video | text | 262,144 | 65,536 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 21, 2026 | |
OpenCode Zen opencode | Qwen3.5 Plus qwen3.5-plus | qwen3.5 | text, image, video | text | 262,144 | 65,536 | $0.2 / 1M | $1.2 / 1M | Yes | Yes | Feb 16, 2026 | |
OpenCode Zen opencode | Qwen3.6 Plus qwen3.6-plus | qwen3.6 | text, image, video | text | 262,144 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | Yes | Apr 2, 2026 | |
OpenRouter openrouter | Gemma 4 26B A4B google/gemma-4-26b-a4b-it | gemma | text, image, video | text | 262,144 | 262,144 | $0.13 / 1M | $0.4 / 1M | Yes | Yes | Apr 3, 2026 | |
OpenRouter openrouter | Gemma 4 26B A4B (free) google/gemma-4-26b-a4b-it:free | gemma | text, image, video | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 3, 2026 | |
OpenRouter openrouter | Gemma 4 31B google/gemma-4-31b-it | gemma | text, image, video | text | 262,144 | 262,144 | $0.14 / 1M | $0.4 / 1M | Yes | Yes | Apr 2, 2026 | |
OpenRouter openrouter | Gemma 4 31B (free) google/gemma-4-31b-it:free | gemma | text, image, video | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 2, 2026 | |
OpenRouter openrouter | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
OpenRouter openrouter | Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b | qwen | text, image, video | text | 262,144 | 65,536 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Feb 16, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.5-122B-A10B Qwen/Qwen3.5-122B-A10B | qwen | text, image, video | text | 262,144 | 65,536 | $0.29 / 1M | $2.32 / 1M | Yes | Yes | Feb 26, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.5-27B Qwen/Qwen3.5-27B | qwen | text, image, video | text | 262,144 | 65,536 | $0.26 / 1M | $2.09 / 1M | Yes | Yes | Feb 25, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B | qwen | text, image, video | text | 262,144 | 65,536 | $0.23 / 1M | $1.86 / 1M | Yes | Yes | Feb 25, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B | qwen | text, image, video | text | 262,144 | 65,536 | $0.29 / 1M | $1.74 / 1M | Yes | Yes | Feb 16, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.5-4B Qwen/Qwen3.5-4B | qwen | text, image, video | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 3, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.5-9B Qwen/Qwen3.5-9B | qwen | text, image, video | text | 262,144 | 65,536 | $0.22 / 1M | $1.74 / 1M | Yes | Yes | Mar 3, 2026 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3.6-35B-A3B Qwen/Qwen3.6-35B-A3B | qwen | text, image, video | text | 262,144 | 65,536 | $0.23 / 1M | $1.86 / 1M | Yes | Yes | Apr 17, 2026 | |
Tencent Coding Plan (China) tencent-coding-plan | Kimi-K2.5 kimi-k2.5 | kimi | text, image, video | text | 262,144 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Jan 27, 2026 | |
Together AI togetherai | Kimi K2.6 moonshotai/Kimi-K2.6 | kimi-k2.6 | text, image, video | text | 262,144 | 131,000 | $1.2 / 1M | $4.5 / 1M | Yes | Yes | Apr 21, 2026 | |
Vercel AI Gateway vercel | Kimi K2.5 moonshotai/kimi-k2.5 | kimi | text, image, video | text | 262,144 | 262,144 | $0.6 / 1M | $1.2 / 1M | Yes | Yes | Jan 26, 2026 | |
Wafer wafer.ai | Qwen3.5 397B A17B Qwen3.5-397B-A17B | qwen | text, image, video | text | 262,144 | 65,536 | $0 / 1M | $0 / 1M | Yes | Yes | Feb 16, 2026 | |
ZenMux zenmux | Kimi K2.6 moonshotai/kimi-k2.6 | Unknown | text, image, video | text | 262,140 | 262,140 | $0.95 / 1M | $4 / 1M | Yes | Yes | Apr 20, 2026 | |
Helicone helicone | Qwen3 Next 80B A3B Instruct qwen3-next-80b-a3b-instruct | qwen | text, image, video | text | 262,000 | 16,384 | $0.14 / 1M | $1.4 / 1M | No | Yes | Jan 1, 2025 | |
ZenMux zenmux | Kimi K2.5 moonshotai/kimi-k2.5 | Unknown | text, image, video | text | 262,000 | 64,000 | $0.58 / 1M | $3.02 / 1M | Yes | Yes | Jan 27, 2026 | |
NanoGPT nano-gpt | Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b | qwen | text, image, video | text | 258,048 | 65,536 | $0.6 / 1M | $3.6 / 1M | No | No | Feb 16, 2026 | |
AIHubMix aihubmix | Kimi K2.5 kimi-k2.5 | kimi-k2.5 | text, image, video | text | 256,000 | Unknown | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 1, 2026 | |
Cortecs cortecs | Kimi K2.5 kimi-k2.5 | kimi-thinking | text, image, video | text | 256,000 | 256,000 | $0.55 / 1M | $2.76 / 1M | Yes | Yes | Jan 27, 2026 | |
Fireworks AI fireworks-ai | Kimi K2.5 accounts/fireworks/models/kimi-k2p5 | kimi-thinking | text, image, video | text | 256,000 | 256,000 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
Helicone helicone | Qwen3 VL 235B A22B Instruct qwen3-vl-235b-a22b-instruct | qwen | text, image, video | text | 256,000 | 16,384 | $0.3 / 1M | $1.5 / 1M | No | Yes | Sep 23, 2025 | |
Nvidia nvidia | Gemma-4-31B-IT google/gemma-4-31b-it | gemma | text, image, video | text | 256,000 | 16,384 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 2, 2026 | |
Qiniu qiniu-ai | Doubao Seed 2.0 Code doubao-seed-2.0-code | Unknown | text, image, video | text | 256,000 | 128,000 | Unknown | Unknown | Yes | Yes | Feb 14, 2026 | |
Qiniu qiniu-ai | Doubao Seed 2.0 Lite doubao-seed-2.0-lite | Unknown | text, image, video | text | 256,000 | 32,000 | Unknown | Unknown | Yes | Yes | Feb 14, 2026 | |
Qiniu qiniu-ai | Doubao Seed 2.0 Mini doubao-seed-2.0-mini | Unknown | text, image, video | text | 256,000 | 32,000 | Unknown | Unknown | Yes | Yes | Feb 14, 2026 | |
Qiniu qiniu-ai | Doubao Seed 2.0 Pro doubao-seed-2.0-pro | Unknown | text, image, video | text | 256,000 | 128,000 | Unknown | Unknown | Yes | Yes | Feb 14, 2026 | |
Qiniu qiniu-ai | Doubao-Seed 1.6 doubao-seed-1.6 | Unknown | text, image, video | text | 256,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 15, 2025 | |
Qiniu qiniu-ai | Doubao-Seed 1.6 Flash doubao-seed-1.6-flash | Unknown | text, image, video | text | 256,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 15, 2025 | |
Qiniu qiniu-ai | Moonshotai/Kimi-K2.5 moonshotai/kimi-k2.5 | Unknown | text, image, video | text | 256,000 | 256,000 | Unknown | Unknown | No | Yes | Jan 28, 2026 | |
Scaleway scaleway | Qwen3.5 397B A17B qwen3.5-397b-a17b | qwen | text, image, video | text | 256,000 | 16,384 | $0.6 / 1M | $3.6 / 1M | Yes | Yes | Mar 17, 2026 | |
Venice AI venice | Google Gemma 4 26B A4B Instruct google-gemma-4-26b-a4b-it | gemma | text, image, video | text | 256,000 | 8,192 | $0.1625 / 1M | $0.5 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | Google Gemma 4 31B Instruct google-gemma-4-31b-it | gemma | text, image, video | text | 256,000 | 8,192 | $0.175 / 1M | $0.5 / 1M | Yes | Yes | Apr 12, 2026 | |
Venice AI venice | Qwen 3.5 35B A3B qwen3-5-35b-a3b | qwen | text, image, video | text | 256,000 | 65,536 | $0.3125 / 1M | $1.25 / 1M | Yes | Yes | Apr 16, 2026 | |
Venice AI venice | Qwen3.6 27B qwen3-6-27b | qwen | text, image, video | text | 256,000 | 65,536 | $0.325 / 1M | $3.25 / 1M | Yes | Yes | Apr 25, 2026 | |
ZenMux zenmux | Doubao-Seed-1.8 volcengine/doubao-seed-1.8 | Unknown | text, image, video | text | 256,000 | 64,000 | $0.11 / 1M | $0.28 / 1M | Yes | Yes | Dec 18, 2025 | |
ZenMux zenmux | Doubao-Seed-2.0-lite volcengine/doubao-seed-2.0-lite | Unknown | text, image, video | text | 256,000 | 64,000 | $0.09 / 1M | $0.51 / 1M | Yes | Yes | Feb 14, 2026 | |
ZenMux zenmux | Doubao-Seed-2.0-mini volcengine/doubao-seed-2.0-mini | Unknown | text, image, video | text | 256,000 | 64,000 | $0.03 / 1M | $0.28 / 1M | Yes | Yes | Feb 14, 2026 | |
ZenMux zenmux | Doubao-Seed-2.0-pro volcengine/doubao-seed-2.0-pro | Unknown | text, image, video | text | 256,000 | 64,000 | $0.45 / 1M | $2.24 / 1M | Yes | Yes | Feb 14, 2026 | |
ZenMux zenmux | GLM 4.6V z-ai/glm-4.6v | Unknown | text, image, video | text | 200,000 | 64,000 | $0.14 / 1M | $0.42 / 1M | Yes | Yes | Dec 8, 2025 | |
ZenMux zenmux | GLM 4.6V Flash (Free) z-ai/glm-4.6v-flash-free | Unknown | text, image, video | text | 200,000 | 64,000 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 8, 2025 | |
ZenMux zenmux | GLM 4.6V FlashX z-ai/glm-4.6v-flash | Unknown | text, image, video | text | 200,000 | 64,000 | $0.02 / 1M | $0.21 / 1M | Yes | Yes | Dec 8, 2025 | |
NovitaAI novita-ai | ERNIE-4.5-VL-28B-A3B-Thinking baidu/ernie-4.5-vl-28b-a3b-thinking | Unknown | text, image, video | text | 131,072 | 65,536 | $0.39 / 1M | $0.39 / 1M | Yes | Yes | Nov 26, 2025 | |
NovitaAI novita-ai | qwen/qwen3-vl-30b-a3b-thinking qwen/qwen3-vl-30b-a3b-thinking | Unknown | text, image, video | text | 131,072 | 32,768 | $0.2 / 1M | $1 / 1M | No | Yes | Oct 11, 2025 | |
NovitaAI novita-ai | qwen/qwen3-vl-8b-instruct qwen/qwen3-vl-8b-instruct | Unknown | text, image, video | text | 131,072 | 32,768 | $0.08 / 1M | $0.5 / 1M | No | Yes | Oct 17, 2025 | |
NovitaAI novita-ai | Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | Unknown | text, image, video | text | 131,072 | 32,768 | $0.3 / 1M | $1.5 / 1M | No | Yes | Sep 24, 2025 | |
NovitaAI novita-ai | Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking | Unknown | text, image, video | text | 131,072 | 32,768 | $0.98 / 1M | $3.95 / 1M | Yes | No | Sep 24, 2025 | |
Nvidia nvidia | Cosmos Nemotron 34B nvidia/cosmos-nemotron-34b | nemotron | text, image, video | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | No | Sep 5, 2025 | |
Amazon Bedrock amazon-bedrock | Nova 2 Lite amazon.nova-2-lite-v1:0 | nova | text, image, video | text | 128,000 | 4,096 | $0.33 / 1M | $2.75 / 1M | No | Yes | Dec 1, 2024 | |
LLM Gateway llmgateway | GLM-4.6V glm-4.6v | glm | text, image, video | text | 128,000 | 32,768 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Dec 8, 2025 | |
Poe poe | Kimi-K2.5 novita/kimi-k2.5 | Unknown | text, image, video | text | 128,000 | 262,144 | $0.6 / 1M | $3 / 1M | Yes | Yes | Jan 27, 2026 | |
Qiniu qiniu-ai | Doubao 1.5 Vision Pro doubao-1.5-vision-pro | Unknown | text, image, video | text | 128,000 | 16,000 | Unknown | Unknown | No | No | Aug 5, 2025 | |
Qiniu qiniu-ai | Qwen3-Vl 30b A3b Thinking qwen3-vl-30b-a3b-thinking | Unknown | text, image, video | text | 128,000 | 32,000 | Unknown | Unknown | No | Yes | Feb 9, 2026 | |
Venice AI venice | Qwen 3.5 397B qwen3-5-397b-a17b | qwen | text, image, video | text | 128,000 | 32,768 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Apr 16, 2026 | |
Z.AI zai | GLM-4.6V glm-4.6v | glm | text, image, video | text | 128,000 | 32,768 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Dec 8, 2025 | |
ZenMux zenmux | ERNIE 5.0 baidu/ernie-5.0-thinking-preview | Unknown | text, image, video | text | 128,000 | 64,000 | $0.84 / 1M | $3.37 / 1M | Yes | Yes | Jan 22, 2026 | |
Zhipu AI zhipuai | GLM-4.6V glm-4.6v | glm | text, image, video | text | 128,000 | 32,768 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Dec 8, 2025 | |
evroc evroc | Qwen3 VL 30B Qwen/Qwen3-VL-30B-A3B-Instruct | qwen | text, image, video | text | 100,000 | 100,000 | $0.24 / 1M | $0.94 / 1M | No | Yes | Jul 30, 2025 | |
Jiekou.AI jiekou | GLM 4.5V zai-org/glm-4.5v | glmv | text, image, video | text | 65,536 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Jan 1, 2026 | |
LLM Gateway llmgateway | GLM-4.5V glm-4.5v | glm | text, image, video | text | 64,000 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 | |
OpenRouter openrouter | GLM 4.5V z-ai/glm-4.5v | glm | text, image, video | text | 64,000 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 | |
Z.AI zai | GLM-4.5V glm-4.5v | glm | text, image, video | text | 64,000 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 | |
Zhipu AI zhipuai | GLM-4.5V glm-4.5v | glm | text, image, video | text | 64,000 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 | |
Clarifai clarifai | MM Poly 8B clarifai/main/models/mm-poly-8b | mm-poly | text, image, video | text | 32,768 | 4,096 | $0.658 / 1M | $1.11 / 1M | No | No | Feb 25, 2026 | |
NovitaAI novita-ai | Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | qwen | text, image, video | text | 32,768 | 32,768 | $0.8 / 1M | $0.8 / 1M | No | No | Mar 25, 2025 |
42 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | xAI: Grok 4.20 x-ai/grok-4.20 | Unknown | image, pdf, text | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | xAI: Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent | Unknown | image, pdf, text | text | 2,000,000 | 2,000,000 | $2 / 1M | $6 / 1M | Yes | No | Apr 11, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.4 openai/gpt-5.4 | Unknown | image, pdf, text | text | 1,050,000 | 128,000 | $2.5 / 1M | $15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.4 Pro openai/gpt-5.4-pro | Unknown | image, pdf, text | text | 1,050,000 | 128,000 | $30 / 1M | $180 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4.1 openai/gpt-4.1 | Unknown | image, pdf, text | text | 1,047,576 | 32,768 | $2 / 1M | $8 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini | Unknown | image, pdf, text | text | 1,047,576 | 32,768 | $0.4 / 1M | $1.6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano | Unknown | image, pdf, text | text | 1,047,576 | 32,768 | $0.1 / 1M | $0.4 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | Unknown | image, pdf, text | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 openai/gpt-5 | Unknown | image, pdf, text | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 Mini openai/gpt-5-mini | Unknown | image, pdf, text | text | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 Nano openai/gpt-5-nano | Unknown | image, pdf, text | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 Pro openai/gpt-5-pro | Unknown | image, pdf, text | text | 400,000 | 128,000 | $15 / 1M | $120 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.1 openai/gpt-5.1 | Unknown | image, pdf, text | text | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.2 openai/gpt-5.2 | Unknown | image, pdf, text | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro | Unknown | image, pdf, text | text | 400,000 | 128,000 | $21 / 1M | $168 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.4 Mini openai/gpt-5.4-mini | Unknown | image, pdf, text | text | 400,000 | 128,000 | $0.75 / 1M | $4.5 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.4 Nano openai/gpt-5.4-nano | Unknown | image, pdf, text | text | 400,000 | 128,000 | $0.2 / 1M | $1.25 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Anthropic: Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | Unknown | image, pdf, text | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude 3.7 Sonnet (thinking) anthropic/claude-3.7-sonnet:thinking | Unknown | image, pdf, text | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Opus 4 anthropic/claude-opus-4 | Unknown | image, pdf, text | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 | Unknown | image, pdf, text | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5 | Unknown | image, pdf, text | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 | Unknown | image, pdf, text | text | 200,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o1 openai/o1 | Unknown | image, pdf, text | text | 200,000 | 100,000 | $15 / 1M | $60 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o1-pro openai/o1-pro | Unknown | image, pdf, text | text | 200,000 | 100,000 | $150 / 1M | $600 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o3 openai/o3 | Unknown | image, pdf, text | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o3 Deep Research openai/o3-deep-research | Unknown | image, pdf, text | text | 200,000 | 100,000 | $10 / 1M | $40 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o3 Pro openai/o3-pro | Unknown | image, pdf, text | text | 200,000 | 100,000 | $20 / 1M | $80 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o4 Mini openai/o4-mini | Unknown | image, pdf, text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research | Unknown | image, pdf, text | text | 200,000 | 100,000 | $2 / 1M | $8 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o4 Mini High openai/o4-mini-high | Unknown | image, pdf, text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o openai/gpt-4o | Unknown | image, pdf, text | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 | Unknown | image, pdf, text | text | 128,000 | 4,096 | $5 / 1M | $15 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | Unknown | image, pdf, text | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | Unknown | image, pdf, text | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o (extended) openai/gpt-4o:extended | Unknown | image, pdf, text | text | 128,000 | 64,000 | $6 / 1M | $18 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o-mini openai/gpt-4o-mini | Unknown | image, pdf, text | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | Unknown | image, pdf, text | text | 128,000 | 16,384 | $0.15 / 1M | $0.6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 Chat openai/gpt-5-chat | Unknown | image, pdf, text | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat | Unknown | image, pdf, text | text | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat | Unknown | image, pdf, text | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.3 Chat openai/gpt-5.3-chat | Unknown | image, pdf, text | text | 128,000 | 16,384 | $1.75 / 1M | $14 / 1M | No | Yes | Mar 15, 2026 |
40 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | Amazon: Nova Premier 1.0 amazon/nova-premier-v1 | Unknown | image, text | text | 1,000,000 | 32,000 | $2.5 / 1M | $12.5 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Opus 4.6 (Fast) anthropic/claude-opus-4.6-fast | Unknown | image, text | text | 1,000,000 | 128,000 | $30 / 1M | $150 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Anthropic: Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | Unknown | image, text | text | 1,000,000 | 128,000 | $3 / 1M | $15 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Kilo Auto Frontier kilo-auto/frontier | Unknown | image, text | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.6 Plus qwen/qwen3.6-plus | Unknown | image, text | text | 1,000,000 | 65,536 | $0.325 / 1M | $1.95 / 1M | Yes | Yes | Apr 11, 2026 | |
ZenMux zenmux | Claude Opus 4.6 anthropic/claude-opus-4.6 | Unknown | image, text | text | 1,000,000 | 128,000 | $5 / 1M | $25 / 1M | Yes | Yes | Feb 6, 2026 | |
Kilo Gateway kilo | Kilo Auto Small kilo-auto/small | Unknown | image, text | text | 400,000 | 128,000 | $0.05 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | Unknown | image, text | text | 400,000 | 100,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
Kilo Gateway kilo | OpenAI: GPT-5.3-Codex openai/gpt-5.3-codex | Unknown | image, text | text | 400,000 | 128,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Mar 15, 2026 | |
ZenMux zenmux | GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | Unknown | image, text | text | 400,000 | 64,000 | $0.25 / 1M | $2 / 1M | Yes | Yes | Nov 13, 2025 | |
Kilo Gateway kilo | Amazon: Nova Lite 1.0 amazon/nova-lite-v1 | Unknown | image, text | text | 300,000 | 5,120 | $0.06 / 1M | $0.24 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512 | Unknown | image, text | text | 262,144 | 32,768 | $0.15 / 1M | $0.15 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Mistral Small 4 mistralai/mistral-small-2603 | Unknown | image, text | text | 262,144 | 262,144 | $0.15 / 1M | $0.6 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5 | Unknown | image, text | text | 262,144 | 65,535 | $0.45 / 1M | $2.2 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | ByteDance Seed: Dola Seed 2.0 Pro (free) bytedance-seed/dola-seed-2.0-pro:free | Unknown | image, text | text | 256,000 | 128,000 | $0 / 1M | $0 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | xAI: Grok 4 x-ai/grok-4 | Unknown | image, text | text | 256,000 | 51,200 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
ZenMux zenmux | Grok 4 x-ai/grok-4 | Unknown | image, text | text | 256,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | Jul 9, 2025 | |
Kilo Gateway kilo | Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5 | Unknown | image, text | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | Yes | Yes | Oct 15, 2025 | |
Kilo Gateway kilo | Free Models Router openrouter/free | Unknown | image, text | text | 200,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Perplexity: Sonar Pro Search perplexity/sonar-pro-search | Unknown | image, text | text | 200,000 | 8,000 | $3 / 1M | $15 / 1M | Yes | No | Mar 15, 2026 | |
ZenMux zenmux | Claude Haiku 4.5 anthropic/claude-haiku-4.5 | Unknown | image, text | text | 200,000 | 64,000 | $1 / 1M | $5 / 1M | No | Yes | Oct 15, 2025 | |
Kilo Gateway kilo | Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b | Unknown | image, text | text | 163,840 | 32,768 | $0.18 / 1M | $0.18 / 1M | No | No | Apr 5, 2025 | |
Kilo Gateway kilo | Arcee AI: Spotlight arcee-ai/spotlight | Unknown | image, text | text | 131,072 | 65,537 | $0.18 / 1M | $0.18 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemma 3 12B google/gemma-3-12b-it | Unknown | image, text | text | 131,072 | 131,072 | $0.04 / 1M | $0.13 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemma 3 4B google/gemma-3-4b-it | Unknown | image, text | text | 131,072 | 19,200 | $0.04 / 1M | $0.08 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512 | Unknown | image, text | text | 131,072 | 32,768 | $0.1 / 1M | $0.1 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | Unknown | image, text | text | 131,072 | 131,072 | $0.06 / 1M | $0.18 / 1M | No | Yes | Jun 20, 2025 | |
Kilo Gateway kilo | Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 | Unknown | image, text | text | 131,072 | 32,768 | $2 / 1M | $6 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen VL Plus qwen/qwen-vl-plus | Unknown | image, text | text | 131,072 | 8,192 | $0.1365 / 1M | $0.4095 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking | Unknown | image, text | text | 131,072 | 32,768 | $0.26 / 1M | $2.6 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking | Unknown | image, text | text | 131,072 | 32,768 | $0.13 / 1M | $1.56 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct | Unknown | image, text | text | 131,072 | 32,768 | $0.08 / 1M | $0.5 / 1M | No | Yes | Nov 25, 2025 | |
Kilo Gateway kilo | Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking | Unknown | image, text | text | 131,072 | 32,768 | $0.117 / 1M | $1.365 / 1M | Yes | Yes | Nov 25, 2025 | |
Kilo Gateway kilo | ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b | Unknown | image, text | text | 128,000 | 2,048 | $0.1 / 1M | $0.2 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemma 3 27B google/gemma-3-27b-it | Unknown | image, text | text | 128,000 | 65,536 | $0.03 / 1M | $0.11 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct | Unknown | image, text | text | 128,000 | 131,072 | $0.35 / 1M | $0.56 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen2.5 VL 32B Instruct qwen/qwen2.5-vl-32b-instruct | Unknown | image, text | text | 128,000 | 16,384 | $0.2 / 1M | $0.6 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | Unknown | image, text | text | 123,000 | 16,000 | $0.42 / 1M | $1.25 / 1M | Yes | No | Jan 1, 2026 | |
ZenMux zenmux | Step-3 stepfun/step-3 | Unknown | image, text | text | 65,536 | 64,000 | $0.21 / 1M | $0.57 / 1M | Yes | Yes | Jul 31, 2025 | |
Kilo Gateway kilo | Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | Unknown | image, text | text | 32,768 | 32,768 | $0.8 / 1M | $0.8 / 1M | No | No | Mar 15, 2026 |
34 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
NanoGPT nano-gpt | Qwen Long 10M qwen-long | Unknown | text, pdf | text | 10,000,000 | 8,192 | $0.1003 / 1M | $0.408 / 1M | No | No | Jan 25, 2025 | |
NanoGPT nano-gpt | MiniMax 01 minimax/minimax-01 | minimax | text, pdf | text | 1,000,192 | 16,384 | $0.1394 / 1M | $1.122 / 1M | No | No | Jan 15, 2025 | |
Vertex google-vertex | GLM-4.7 zai-org/glm-4.7-maas | glm | text, pdf | text | 200,000 | 128,000 | $0.6 / 1M | $2.2 / 1M | Yes | Yes | Jan 6, 2026 | |
Vertex google-vertex | DeepSeek V3.1 deepseek-ai/deepseek-v3.1-maas | deepseek | text, pdf | text | 163,840 | 32,768 | $0.6 / 1M | $1.7 / 1M | Yes | Yes | Aug 28, 2025 | |
Vertex google-vertex | DeepSeek V3.2 deepseek-ai/deepseek-v3.2-maas | deepseek | text, pdf | text | 163,840 | 65,536 | $0.56 / 1M | $1.68 / 1M | Yes | Yes | Apr 4, 2026 | |
NanoGPT nano-gpt | DeepSeek V3.2 deepseek/deepseek-v3.2 | deepseek | text, pdf | text | 163,000 | 65,536 | $0.28 / 1M | $0.42 / 1M | No | Yes | Dec 1, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale | deepseek | text, pdf | text | 163,000 | 65,536 | $0.28 / 1M | $0.42 / 1M | Yes | No | Dec 2, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.2 Thinking deepseek/deepseek-v3.2:thinking | deepseek | text, pdf | text | 163,000 | 65,536 | $0.28 / 1M | $0.42 / 1M | Yes | Yes | Dec 1, 2025 | |
NanoGPT nano-gpt | Grok 3 Fast Beta grok-3-fast-beta | Unknown | text, pdf | text | 131,072 | 131,072 | $5 / 1M | $25 / 1M | No | No | Feb 17, 2025 | |
NanoGPT nano-gpt | Kimi K2 0711 Fast kimi-k2-instruct-fast | Unknown | text, pdf | text | 131,072 | 16,384 | $0.1 / 1M | $2 / 1M | No | No | Jul 15, 2025 | |
NanoGPT nano-gpt | Llama 3.2 3b Instruct meta-llama/llama-3.2-3b-instruct | llama | text, pdf | text | 131,072 | 8,192 | $0.0306 / 1M | $0.0493 / 1M | No | No | Sep 25, 2024 | |
NanoGPT nano-gpt | ASI1 Mini asi1-mini | Unknown | text, pdf | text | 128,000 | 16,384 | $1 / 1M | $1 / 1M | No | No | Mar 25, 2025 | |
NanoGPT nano-gpt | Claude 3.7 Sonnet Reasoner claude-3-7-sonnet-reasoner | Unknown | text, pdf | text | 128,000 | 8,192 | $3 / 1M | $15 / 1M | No | No | Mar 29, 2025 | |
NanoGPT nano-gpt | DeepClaude deepclaude | Unknown | text, pdf | text | 128,000 | 8,192 | $3 / 1M | $15 / 1M | No | No | Feb 1, 2025 | |
NanoGPT nano-gpt | DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 | deepseek | text, pdf | text | 128,000 | 65,536 | $0.2 / 1M | $0.7 / 1M | No | No | Jul 26, 2025 | |
NanoGPT nano-gpt | DeepSeek V3/Chat Cheaper deepseek-chat-cheaper | Unknown | text, pdf | text | 128,000 | 8,192 | $0.25 / 1M | $0.7 / 1M | No | Yes | Apr 15, 2025 | |
NanoGPT nano-gpt | DeepSeek V3/Deepseek Chat deepseek-chat | Unknown | text, pdf | text | 128,000 | 8,192 | $0.25 / 1M | $0.7 / 1M | No | Yes | Feb 27, 2025 | |
NanoGPT nano-gpt | Doubao 1.5 Thinking Pro doubao-1-5-thinking-pro-250415 | Unknown | text, pdf | text | 128,000 | 16,384 | $0.6 / 1M | $2.4 / 1M | No | No | Apr 17, 2025 | |
NanoGPT nano-gpt | Gemma 3 12B IT unsloth/gemma-3-12b-it | unsloth | text, pdf | text | 128,000 | 131,072 | $0.272 / 1M | $0.272 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Gemma 3 27B IT unsloth/gemma-3-27b-it | unsloth | text, pdf | text | 128,000 | 96,000 | $0.2992 / 1M | $0.2992 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Gemma 3 4B IT unsloth/gemma-3-4b-it | unsloth | text, pdf | text | 128,000 | 8,192 | $0.2006 / 1M | $0.2006 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Kimi Dev 72B moonshotai/Kimi-Dev-72B | kimi | text, pdf | text | 128,000 | 131,072 | $0.4 / 1M | $0.4 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | Kimi Thinking Preview kimi-thinking-preview | Unknown | text, pdf | text | 128,000 | 16,384 | $31.46 / 1M | $31.46 / 1M | No | No | May 7, 2025 | |
NanoGPT nano-gpt | Microsoft DeepSeek R1 microsoft/MAI-DS-R1-FP8 | deepseek | text, pdf | text | 128,000 | 8,192 | $0.3 / 1M | $0.3 / 1M | No | No | Sep 25, 2025 | |
NanoGPT nano-gpt | WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | gpt | text, pdf | text | 65,536 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Apr 15, 2025 | |
NanoGPT nano-gpt | ERNIE X1.1 ernie-x1.1-preview | Unknown | text, pdf | text | 64,000 | 8,192 | $0.15 / 1M | $0.6 / 1M | No | No | Sep 10, 2025 | |
NanoGPT nano-gpt | TheDrummer Skyfall 36B V2 TheDrummer 2/skyfall-36b-v2 | llama | text, pdf | text | 64,000 | 32,768 | $0.493 / 1M | $0.493 / 1M | No | No | Mar 10, 2025 | |
NanoGPT nano-gpt | Mistral 7B Instruct mistralai/mistral-7b-instruct | mistral | text, pdf | text | 32,768 | 8,192 | $0.0544 / 1M | $0.0544 / 1M | No | No | May 27, 2024 | |
NanoGPT nano-gpt | UnslopNemo 12b v4 TheDrummer 2/UnslopNemo-12B-v4.1 | llama | text, pdf | text | 32,768 | 8,192 | $0.493 / 1M | $0.493 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Lumimaid 70b NeverSleep/Llama-3-Lumimaid-70B-v0.1 | llama | text, pdf | text | 16,384 | 8,192 | $2.006 / 1M | $2.006 / 1M | No | No | Jul 1, 2024 | |
NanoGPT nano-gpt | Magnum v4 72B anthracite-org/magnum-v4-72b | llama | text, pdf | text | 16,384 | 8,192 | $2.006 / 1M | $2.992 / 1M | No | No | Jan 1, 2025 | |
NanoGPT nano-gpt | Mistral Nemo 12B Instruct 2407 Mistral-Nemo-12B-Instruct-2407 | Unknown | text, pdf | text | 16,384 | 16,384 | $0.01 / 1M | $0.01 / 1M | No | No | Jul 18, 2024 | |
NanoGPT nano-gpt | SorcererLM 8x22B raifle/sorcererlm-8x22b | mixtral | text, pdf | text | 16,000 | 8,192 | $4.505 / 1M | $4.505 / 1M | No | No | Jan 1, 2025 | |
NanoGPT nano-gpt | ReMM SLERP 13B undi95/remm-slerp-l2-13b | llama | text, pdf | text | 6,144 | 4,096 | $0.799 / 1M | $1.207 / 1M | No | No | Jan 1, 2025 |
28 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Regolo AI regolo-ai | Qwen-Image qwen-image | qwen | text | image | 8,192 | 4,096 | $0.5 / 1M | $2 / 1M | No | No | Mar 1, 2026 | |
Nvidia nvidia | FLUX.1-dev black-forest-labs/flux.1-dev | flux | text | image | 4,096 | Unknown | $0 / 1M | $0 / 1M | No | No | Sep 5, 2025 | |
Poe poe | DALL-E-3 openai/dall-e-3 | dall-e | text | image | 800 | Unknown | Unknown | Unknown | No | Yes | Nov 6, 2023 | |
Vercel AI Gateway vercel | FLUX.1 Fill [pro] bfl/flux-pro-1.0-fill | flux | text | image | 512 | Unknown | Unknown | Unknown | No | No | Oct 1, 2024 | |
Vercel AI Gateway vercel | FLUX.1 Kontext Max bfl/flux-kontext-max | flux | text | image | 512 | Unknown | Unknown | Unknown | No | No | Jun 1, 2025 | |
Vercel AI Gateway vercel | FLUX.1 Kontext Pro bfl/flux-kontext-pro | flux | text | image | 512 | Unknown | Unknown | Unknown | No | No | Jun 1, 2025 | |
Vercel AI Gateway vercel | FLUX1.1 [pro] bfl/flux-pro-1.1 | flux | text | image | 512 | Unknown | Unknown | Unknown | No | No | Oct 1, 2024 | |
Vercel AI Gateway vercel | FLUX1.1 [pro] Ultra bfl/flux-pro-1.1-ultra | flux | text | image | 512 | Unknown | Unknown | Unknown | No | No | Nov 1, 2024 | |
Vercel AI Gateway vercel | Recraft V2 recraft/recraft-v2 | recraft | text | image | 512 | Unknown | Unknown | Unknown | No | No | Mar 1, 2024 | |
Vercel AI Gateway vercel | Recraft V3 recraft/recraft-v3 | recraft | text | image | 512 | Unknown | Unknown | Unknown | No | No | Oct 1, 2024 | |
Poe poe | Imagen-3 google/imagen-3 | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | Yes | Oct 15, 2024 | |
Poe poe | Imagen-3-Fast google/imagen-3-fast | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | Yes | Oct 17, 2024 | |
Poe poe | Imagen-4 google/imagen-4 | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | Yes | May 22, 2025 | |
Poe poe | Imagen-4-Fast google/imagen-4-fast | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | Yes | Jun 25, 2025 | |
Poe poe | Imagen-4-Ultra google/imagen-4-ultra | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | Yes | May 24, 2025 | |
Vercel AI Gateway vercel | Imagen 4 google/imagen-4.0-generate-001 | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | No | May 22, 2025 | |
Vercel AI Gateway vercel | Imagen 4 Fast google/imagen-4.0-fast-generate-001 | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | No | Jun 1, 2025 | |
Vercel AI Gateway vercel | Imagen 4 Ultra google/imagen-4.0-ultra-generate-001 | imagen | text | image | 480 | Unknown | Unknown | Unknown | No | No | May 24, 2025 | |
Poe poe | TopazLabs topazlabs-co/topazlabs | topazlabs | text | image | 204 | Unknown | Unknown | Unknown | No | Yes | Dec 3, 2024 | |
Poe poe | Ideogram-v2a ideogramai/ideogram-v2a | ideogram | text | image | 150 | Unknown | Unknown | Unknown | No | Yes | Feb 27, 2025 | |
Poe poe | Ideogram-v2a-Turbo ideogramai/ideogram-v2a-turbo | ideogram | text | image | 150 | Unknown | Unknown | Unknown | No | Yes | Feb 27, 2025 | |
Nebius Token Factory nebius | FLUX.1-dev black-forest-labs/flux-dev | Unknown | text | image | 77 | Unknown | $0 / 1M | $0 / 1M | No | No | Feb 4, 2026 | |
Nebius Token Factory nebius | FLUX.1-schnell black-forest-labs/flux-schnell | Unknown | text | image | 77 | Unknown | $0 / 1M | $0 / 1M | No | No | Feb 4, 2026 | |
DigitalOcean digitalocean | Fast SDXL fal-ai/fast-sdxl | stable-diffusion | text | image | Unknown | Unknown | Unknown | Unknown | No | No | Apr 16, 2026 | |
DigitalOcean digitalocean | FLUX.1 [schnell] fal-ai/flux/schnell | flux | text | image | Unknown | Unknown | Unknown | Unknown | No | No | Apr 16, 2026 | |
NanoGPT nano-gpt | Chroma chroma | Unknown | text | image | Unknown | Unknown | Unknown | Unknown | No | No | Aug 12, 2025 | |
NanoGPT nano-gpt | Hidream hidream | Unknown | text | image | Unknown | Unknown | Unknown | Unknown | No | No | Jan 1, 2024 | |
NanoGPT nano-gpt | Z Image Turbo z-image-turbo | Unknown | text | image | Unknown | Unknown | Unknown | Unknown | No | No | Nov 27, 2025 |
18 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | Qwen: Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15 | Unknown | image, text, video | text | 1,000,000 | 65,536 | $0.26 / 1M | $1.56 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23 | Unknown | image, text, video | text | 1,000,000 | 65,536 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6 | Unknown | image, text, video | text | 262,144 | 32,768 | $0.25 / 1M | $2 / 1M | Yes | Yes | Sep 1, 2025 | |
Kilo Gateway kilo | ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash | Unknown | image, text, video | text | 262,144 | 32,768 | $0.075 / 1M | $0.3 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | ByteDance Seed: Seed-2.0-Lite bytedance-seed/seed-2.0-lite | Unknown | image, text, video | text | 262,144 | 131,072 | $0.25 / 1M | $2 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | ByteDance Seed: Seed-2.0-Mini bytedance-seed/seed-2.0-mini | Unknown | image, text, video | text | 262,144 | 131,072 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it | Unknown | image, text, video | text | 262,144 | 262,144 | $0.12 / 1M | $0.4 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Google: Gemma 4 31B google/gemma-4-31b-it | Unknown | image, text, video | text | 262,144 | 131,072 | $0.14 / 1M | $0.4 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b | Unknown | image, text, video | text | 262,144 | 65,536 | $0.39 / 1M | $2.34 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b | Unknown | image, text, video | text | 262,144 | 65,536 | $0.26 / 1M | $2.08 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.5-27B qwen/qwen3.5-27b | Unknown | image, text, video | text | 262,144 | 65,536 | $0.195 / 1M | $1.56 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b | Unknown | image, text, video | text | 262,144 | 65,536 | $0.1625 / 1M | $1.3 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | Qwen: Qwen3.5-9B qwen/qwen3.5-9b | Unknown | image, text, video | text | 256,000 | 32,768 | $0.05 / 1M | $0.15 / 1M | Yes | Yes | Mar 15, 2026 | |
Qiniu qiniu-ai | Doubao-Seed 1.6 Thinking doubao-seed-1.6-thinking | Unknown | image, text, video | text | 256,000 | 32,000 | Unknown | Unknown | Yes | Yes | Aug 15, 2025 | |
Kilo Gateway kilo | Z.ai: GLM 5V Turbo z-ai/glm-5v-turbo | Unknown | image, text, video | text | 202,752 | 131,072 | $1.2 / 1M | $4 / 1M | Yes | Yes | Apr 11, 2026 | |
Kilo Gateway kilo | NVIDIA: Nemotron Nano 12B 2 VL nvidia/nemotron-nano-12b-v2-vl | Unknown | image, text, video | text | 131,072 | 26,215 | $0.2 / 1M | $0.6 / 1M | Yes | No | Jan 31, 2026 | |
Kilo Gateway kilo | Z.ai: GLM 4.6V z-ai/glm-4.6v | Unknown | image, text, video | text | 131,072 | 131,072 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Jan 10, 2026 | |
Kilo Gateway kilo | Reka Edge rekaai/reka-edge | Unknown | image, text, video | text | 16,384 | 16,384 | $0.1 / 1M | $0.1 / 1M | No | Yes | Apr 11, 2026 |
17 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Poe poe | GPT-5.4-Pro openai/gpt-5.4-pro | Unknown | text, image | image | 1,050,000 | 128,000 | $27 / 1M | $160 / 1M | Yes | Yes | Mar 5, 2026 | |
Poe poe | GPT-Image-1 openai/gpt-image-1 | gpt | text, image | image | 128,000 | Unknown | Unknown | Unknown | No | Yes | Mar 31, 2025 | |
Poe poe | gpt-image-1.5 openai/gpt-image-1.5 | Unknown | text, image | image | 128,000 | Unknown | Unknown | Unknown | No | No | Dec 16, 2025 | |
Poe poe | Nano-Banana-Pro google/nano-banana-pro | nano-banana | text, image | image | 65,536 | Unknown | $2 / 1M | $12 / 1M | No | Yes | Nov 19, 2025 | |
Qiniu qiniu-ai | Gemini 2.5 Flash Image gemini-2.5-flash-image | Unknown | text, image | image | 32,768 | 8,192 | Unknown | Unknown | No | No | Oct 22, 2025 | |
OpenRouter openrouter | Riverflow V2 Fast Preview sourceful/riverflow-v2-fast-preview | sourceful | text, image | image | 8,192 | 8,192 | $0 / 1M | $0 / 1M | No | No | Jan 28, 2026 | |
OpenRouter openrouter | Riverflow V2 Max Preview sourceful/riverflow-v2-max-preview | sourceful | text, image | image | 8,192 | 8,192 | $0 / 1M | $0 / 1M | No | No | Jan 28, 2026 | |
OpenRouter openrouter | Riverflow V2 Standard Preview sourceful/riverflow-v2-standard-preview | sourceful | text, image | image | 8,192 | 8,192 | $0 / 1M | $0 / 1M | No | No | Jan 28, 2026 | |
Poe poe | StableDiffusionXL stabilityai/stablediffusionxl | stable-diffusion | text, image | image | 200 | Unknown | Unknown | Unknown | No | Yes | Jul 9, 2023 | |
Poe poe | Ideogram ideogramai/ideogram | ideogram | text, image | image | 150 | Unknown | Unknown | Unknown | No | Yes | Apr 3, 2024 | |
Poe poe | Ideogram-v2 ideogramai/ideogram-v2 | ideogram | text, image | image | 150 | Unknown | Unknown | Unknown | No | Yes | Aug 21, 2024 | |
DigitalOcean digitalocean | GPT Image 1 openai-gpt-image-1 | gpt-image | text, image | image | Unknown | Unknown | $5 / 1M | $40 / 1M | No | No | Apr 24, 2025 | |
DigitalOcean digitalocean | GPT Image 1.5 openai-gpt-image-1.5 | gpt-image | text, image | image | Unknown | Unknown | $5 / 1M | $10 / 1M | No | No | Nov 25, 2025 | |
NanoGPT nano-gpt | Qwen Image qwen-image | Unknown | text, image | image | Unknown | Unknown | Unknown | Unknown | No | No | Aug 7, 2025 | |
OpenAI openai | gpt-image-1 gpt-image-1 | gpt-image | text, image | image | Unknown | Unknown | Unknown | Unknown | No | No | Apr 24, 2025 | |
Poe poe | GPT-Image-1-Mini openai/gpt-image-1-mini | gpt | text, image | image | Unknown | Unknown | Unknown | Unknown | No | Yes | Aug 26, 2025 | |
Poe poe | GPT-Image-2 openai/gpt-image-2 | Unknown | text, image | image | Unknown | Unknown | $5.0505 / 1M | $32.3232 / 1M | No | No | Apr 21, 2026 |
16 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Helicone helicone | xAI Grok 4.1 Fast Non-Reasoning grok-4-1-fast-non-reasoning | grok | text, image | text, image | 2,000,000 | 30,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Nov 17, 2025 | |
Helicone helicone | OpenAI GPT-5.1 gpt-5.1 | gpt | text, image | text, image | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | OpenAI: GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image | text, image | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | No | Yes | Jan 1, 2025 | |
Helicone helicone | OpenAI: GPT-5.1 Codex Mini gpt-5.1-codex-mini | gpt-codex | text, image | text, image | 400,000 | 128,000 | $0.25 / 1M | $2 / 1M | No | Yes | Jan 1, 2025 | |
Poe poe | GPT-5.5 openai/gpt-5.5 | gpt | text, image | text, image | 400,000 | 128,000 | $4.5455 / 1M | $27.2727 / 1M | Yes | Yes | Apr 8, 2026 | |
Poe poe | GPT-5.5-Pro openai/gpt-5.5-pro | gpt-pro | text, image | text, image | 400,000 | 128,000 | $27.2727 / 1M | $163.6364 / 1M | Yes | Yes | Apr 8, 2026 | |
Vercel AI Gateway vercel | Gemini 3.1 Flash Image Preview (Nano Banana 2) google/gemini-3.1-flash-image-preview | gemini | text, image | text, image | 131,072 | 32,768 | $0.5 / 1M | $3 / 1M | Yes | No | Mar 6, 2026 | |
Helicone helicone | OpenAI GPT-5.1 Chat gpt-5.1-chat-latest | gpt-codex | text, image | text, image | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | No | Yes | Jan 1, 2025 | |
OpenRouter openrouter | Gemini 3.1 Flash Image Preview (Nano Banana 2) google/gemini-3.1-flash-image-preview | gemini-flash | text, image | text, image | 65,536 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | No | Feb 26, 2026 | |
Poe poe | Nano-Banana google/nano-banana | nano-banana | text, image | text, image | 65,536 | Unknown | $0.21 / 1M | $1.8 / 1M | No | Yes | Aug 21, 2025 | |
Google google | Gemini 2.5 Flash Image gemini-2.5-flash-image | gemini-flash | text, image | text, image | 32,768 | 32,768 | $0.3 / 1M | $30 / 1M | Yes | No | Aug 26, 2025 | |
Google google | Gemini 2.5 Flash Image (Preview) gemini-2.5-flash-image-preview | gemini-flash | text, image | text, image | 32,768 | 32,768 | $0.3 / 1M | $30 / 1M | Yes | No | Aug 26, 2025 | |
Qiniu qiniu-ai | Gemini 3.0 Pro Image Preview gemini-3.0-pro-image-preview | Unknown | text, image | text, image | 32,768 | 8,192 | Unknown | Unknown | No | No | Nov 20, 2025 | |
OpenAI openai | chatgpt-image-latest chatgpt-image-latest | gpt-image | text, image | text, image | Unknown | Unknown | Unknown | Unknown | No | No | Dec 16, 2025 | |
OpenAI openai | gpt-image-1-mini gpt-image-1-mini | gpt-image | text, image | text, image | Unknown | Unknown | Unknown | Unknown | No | No | Sep 26, 2025 | |
OpenAI openai | gpt-image-1.5 gpt-image-1.5 | gpt-image | text, image | text, image | Unknown | Unknown | Unknown | Unknown | No | No | Nov 25, 2025 |
14 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Helicone helicone | xAI Grok 4 Fast Non-Reasoning grok-4-fast-non-reasoning | grok | text, image, audio | text | 2,000,000 | 2,000,000 | $0.2 / 1M | $0.5 / 1M | No | Yes | Sep 19, 2025 | |
Nvidia nvidia | Phi-4-Mini microsoft/phi-4-mini-instruct | phi | text, image, audio | text | 131,072 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 5, 2025 | |
Abacus abacus | GPT-4o (2024-11-20) gpt-4o-2024-11-20 | gpt | text, image, audio | text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Nov 20, 2024 | |
Azure Cognitive Services azure-cognitive-services | Phi-4-multimodal phi-4-multimodal | phi | text, image, audio | text | 128,000 | 4,096 | $0.08 / 1M | $0.32 / 1M | No | No | Dec 11, 2024 | |
Azure azure | Phi-4-multimodal phi-4-multimodal | phi | text, image, audio | text | 128,000 | 4,096 | $0.08 / 1M | $0.32 / 1M | No | No | Dec 11, 2024 | |
GitHub Models github-models | GPT-4o openai/gpt-4o | gpt | text, image, audio | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | May 13, 2024 | |
GitHub Models github-models | GPT-4o mini openai/gpt-4o-mini | gpt-mini | text, image, audio | text | 128,000 | 16,384 | $0 / 1M | $0 / 1M | No | Yes | Jul 18, 2024 | |
GitHub Models github-models | Llama-3.2-11B-Vision-Instruct meta/llama-3.2-11b-vision-instruct | llama | text, image, audio | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 25, 2024 | |
GitHub Models github-models | Llama-3.2-90B-Vision-Instruct meta/llama-3.2-90b-vision-instruct | llama | text, image, audio | text | 128,000 | 8,192 | $0 / 1M | $0 / 1M | Yes | Yes | Sep 25, 2024 | |
GitHub Models github-models | Phi-4-multimodal-instruct microsoft/phi-4-multimodal-instruct | phi | text, image, audio | text | 128,000 | 4,096 | $0 / 1M | $0 / 1M | Yes | Yes | Dec 11, 2024 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Omni-30B-A3B-Instruct Qwen/Qwen3-Omni-30B-A3B-Instruct | qwen | text, image, audio | text | 66,000 | 66,000 | $0.1 / 1M | $0.4 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Omni-30B-A3B-Thinking Qwen/Qwen3-Omni-30B-A3B-Thinking | qwen | text, image, audio | text | 66,000 | 66,000 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Omni-30B-A3B-Instruct Qwen/Qwen3-Omni-30B-A3B-Instruct | qwen | text, image, audio | text | 66,000 | 66,000 | $0.1 / 1M | $0.4 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Omni-30B-A3B-Thinking Qwen/Qwen3-Omni-30B-A3B-Thinking | qwen | text, image, audio | text | 66,000 | 66,000 | $0.1 / 1M | $0.4 / 1M | Yes | Yes | Nov 25, 2025 |
13 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Poe poe | ElevenLabs-v2.5-Turbo elevenlabs/elevenlabs-v2.5-turbo | elevenlabs | text | audio | 128,000 | Unknown | Unknown | Unknown | No | Yes | Oct 28, 2024 | |
Poe poe | ElevenLabs-v3 elevenlabs/elevenlabs-v3 | elevenlabs | text | audio | 128,000 | Unknown | Unknown | Unknown | No | Yes | Jun 5, 2025 | |
Google google | Gemini 2.5 Flash Preview TTS gemini-2.5-flash-preview-tts | gemini-flash | text | audio | 8,000 | 16,000 | $0.5 / 1M | $10 / 1M | No | No | May 1, 2025 | |
Google google | Gemini 2.5 Pro Preview TTS gemini-2.5-pro-preview-tts | gemini-flash | text | audio | 8,000 | 16,000 | $1 / 1M | $20 / 1M | No | No | May 1, 2025 | |
Xiaomi Token Plan (China) xiaomi-token-plan-cn | MiMo-V2-TTS mimo-v2-tts | mimo | text | audio | 8,000 | 16,000 | $0 / 1M | $0 / 1M | No | No | Mar 18, 2026 | |
Xiaomi Token Plan (Europe) xiaomi-token-plan-ams | MiMo-V2-TTS mimo-v2-tts | mimo | text | audio | 8,000 | 16,000 | $0 / 1M | $0 / 1M | No | No | Mar 18, 2026 | |
Xiaomi Token Plan (Singapore) xiaomi-token-plan-sgp | MiMo-V2-TTS mimo-v2-tts | mimo | text | audio | 8,000 | 16,000 | $0 / 1M | $0 / 1M | No | No | Mar 18, 2026 | |
Groq groq | Orpheus Arabic Saudi canopylabs/orpheus-arabic-saudi | canopylabs | text | audio | 4,000 | 50,000 | $40 / 1M | $0 / 1M | No | No | Dec 16, 2025 | |
Groq groq | Orpheus V1 English canopylabs/orpheus-v1-english | canopylabs | text | audio | 4,000 | 50,000 | $0 / 1M | $0 / 1M | No | No | Dec 19, 2025 | |
Poe poe | ElevenLabs-Music elevenlabs/elevenlabs-music | elevenlabs | text | audio | 2,000 | Unknown | Unknown | Unknown | No | Yes | Aug 29, 2025 | |
DigitalOcean digitalocean | ElevenLabs Multilingual TTS v2 fal-ai/elevenlabs/tts/multilingual-v2 | elevenlabs | text | audio | Unknown | Unknown | Unknown | Unknown | No | No | Apr 16, 2026 | |
DigitalOcean digitalocean | Stable Audio 2.5 (Text-to-Audio) fal-ai/stable-audio-25/text-to-audio | Unknown | text | audio | Unknown | Unknown | Unknown | Unknown | No | No | Apr 16, 2026 | |
Poe poe | Lyria google/lyria | lyria | text | audio | Unknown | Unknown | Unknown | Unknown | No | Yes | Jun 4, 2025 |
12 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
SiliconFlow (China) siliconflow-cn | Qwen/Qwen3-Omni-30B-A3B-Captioner Qwen/Qwen3-Omni-30B-A3B-Captioner | qwen | audio | text | 66,000 | 66,000 | $0.1 / 1M | $0.4 / 1M | No | Yes | Nov 25, 2025 | |
SiliconFlow siliconflow | Qwen/Qwen3-Omni-30B-A3B-Captioner Qwen/Qwen3-Omni-30B-A3B-Captioner | qwen | audio | text | 66,000 | 66,000 | $0.1 / 1M | $0.4 / 1M | No | Yes | Nov 25, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-ASR Flash qwen3-asr-flash | qwen | audio | text | 53,248 | 4,096 | $0.032 / 1M | $0.032 / 1M | No | No | Sep 8, 2025 | |
Alibaba alibaba | Qwen3-ASR Flash qwen3-asr-flash | qwen | audio | text | 53,248 | 4,096 | $0.035 / 1M | $0.035 / 1M | No | No | Sep 8, 2025 | |
evroc evroc | KB Whisper KBLab/kb-whisper-large | whisper | audio | text | 448 | 448 | $0.00236 / 1M | $0.00236 / 1M | No | No | Oct 1, 2024 | |
evroc evroc | Whisper 3 Large openai/whisper-large-v3 | whisper | audio | text | 448 | 4,096 | $0.00236 / 1M | $0.00236 / 1M | No | No | Oct 1, 2024 | |
Groq groq | Whisper Large V3 whisper-large-v3 | whisper | audio | text | 448 | 448 | $0 / 1M | $0 / 1M | No | No | Sep 5, 2025 | |
Groq groq | Whisper Large v3 Turbo whisper-large-v3-turbo | whisper | audio | text | 448 | 448 | $0 / 1M | $0 / 1M | No | No | Oct 1, 2024 | |
Nvidia nvidia | Parakeet TDT 0.6B v2 nvidia/parakeet-tdt-0.6b-v2 | parakeet | audio | text | Unknown | 4,096 | $0 / 1M | $0 / 1M | No | No | Sep 5, 2025 | |
Nvidia nvidia | Whisper Large v3 openai/whisper-large-v3 | whisper | audio | text | Unknown | 4,096 | $0 / 1M | $0 / 1M | No | No | Sep 5, 2025 | |
Privatemode AI privatemode-ai | Whisper large-v3 whisper-large-v3 | whisper | audio | text | Unknown | 4,096 | $0 / 1M | $0 / 1M | No | No | Sep 1, 2023 | |
Scaleway scaleway | Whisper Large v3 whisper-large-v3 | whisper | audio | text | Unknown | 8,192 | $0.003 / 1M | $0 / 1M | No | No | Mar 17, 2026 |
10 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Google google | Gemini Live 2.5 Flash gemini-live-2.5-flash | gemini-flash | text, image, audio, video | text, audio | 128,000 | 8,000 | $0.5 / 1M | $2 / 1M | Yes | Yes | Sep 1, 2025 | |
Alibaba (China) alibaba-cn | Qwen3-Omni Flash qwen3-omni-flash | qwen | text, image, audio, video | text, audio | 65,536 | 16,384 | $0.058 / 1M | $0.23 / 1M | Yes | Yes | Sep 15, 2025 | |
Alibaba alibaba | Qwen3-Omni Flash qwen3-omni-flash | qwen | text, image, audio, video | text, audio | 65,536 | 16,384 | $0.43 / 1M | $1.66 / 1M | Yes | Yes | Sep 15, 2025 | |
Alibaba alibaba | Qwen3-Omni Flash Realtime qwen3-omni-flash-realtime | qwen | text, image, audio, video | text, audio | 65,536 | 16,384 | $0.52 / 1M | $1.99 / 1M | No | Yes | Sep 15, 2025 | |
Alibaba alibaba | Qwen3-LiveTranslate Flash Realtime qwen3-livetranslate-flash-realtime | qwen | text, image, audio, video | text, audio | 53,248 | 4,096 | $10 / 1M | $10 / 1M | No | No | Sep 22, 2025 | |
Alibaba (China) alibaba-cn | Qwen-Omni Turbo qwen-omni-turbo | qwen | text, image, audio, video | text, audio | 32,768 | 2,048 | $0.058 / 1M | $0.23 / 1M | No | Yes | Mar 26, 2025 | |
Alibaba (China) alibaba-cn | Qwen2.5-Omni 7B qwen2-5-omni-7b | qwen | text, image, audio, video | text, audio | 32,768 | 2,048 | $0.087 / 1M | $0.345 / 1M | No | Yes | Dec 1, 2024 | |
Alibaba alibaba | Qwen-Omni Turbo qwen-omni-turbo | qwen | text, image, audio, video | text, audio | 32,768 | 2,048 | $0.07 / 1M | $0.27 / 1M | No | Yes | Mar 26, 2025 | |
Alibaba alibaba | Qwen2.5-Omni 7B qwen2-5-omni-7b | qwen | text, image, audio, video | text, audio | 32,768 | 2,048 | $0.1 / 1M | $0.4 / 1M | No | Yes | Dec 1, 2024 | |
LLM Gateway llmgateway | Qwen-Omni Turbo qwen-omni-turbo | qwen | text, image, audio, video | text, audio | 32,768 | 2,048 | $0.07 / 1M | $0.27 / 1M | No | Yes | Mar 26, 2025 |
8 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
OpenRouter openrouter | GPT-5 Image openai/gpt-5-image | gpt | text, image, pdf | text, image | 400,000 | 128,000 | $5 / 1M | $10 / 1M | Yes | Yes | Oct 14, 2025 | |
Requesty requesty | GPT-5 Image openai/gpt-5-image | gpt | text, image, pdf | text, image | 400,000 | 128,000 | $5 / 1M | $10 / 1M | Yes | Yes | Oct 14, 2025 | |
Vercel AI Gateway vercel | GPT 5.1 Thinking openai/gpt-5.1-thinking | gpt | text, image, pdf | text, image | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5 pro openai/gpt-5-pro | gpt | text, image, pdf | text, image | 400,000 | 272,000 | $15 / 1M | $120 / 1M | Yes | Yes | Aug 7, 2025 | |
302.AI 302ai | gemini-3.1-flash-image-preview gemini-3.1-flash-image-preview | Unknown | text, image, pdf | text, image | 131,072 | 32,768 | $0.5 / 1M | $60 / 1M | No | No | Feb 27, 2026 | |
Google google | Gemini 3.1 Flash Image (Preview) gemini-3.1-flash-image-preview | gemini-flash | text, image, pdf | text, image | 131,072 | 32,768 | $0.25 / 1M | $60 / 1M | Yes | No | Feb 26, 2026 | |
Vercel AI Gateway vercel | GPT-5 Chat openai/gpt-5-chat | gpt | text, image, pdf | text, image | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 | |
Vercel AI Gateway vercel | GPT-5.1 Instant openai/gpt-5.1-instant | gpt | text, image, pdf | text, image | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 |
6 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Poe poe | Ray2 lumalabs/ray2 | ray | text, image | video | 5,000 | Unknown | Unknown | Unknown | No | Yes | Feb 20, 2025 | |
Poe poe | Veo-3.1-Fast google/veo-3.1-fast | veo | text, image | video | 480 | Unknown | Unknown | Unknown | No | Yes | Oct 15, 2025 | |
Poe poe | Runway runwayml/runway | runway | text, image | video | 256 | Unknown | Unknown | Unknown | No | Yes | Oct 11, 2024 | |
Poe poe | Runway-Gen-4-Turbo runwayml/runway-gen-4-turbo | runway | text, image | video | 256 | Unknown | Unknown | Unknown | No | Yes | May 9, 2025 | |
Poe poe | Sora-2 openai/sora-2 | sora | text, image | video | Unknown | Unknown | Unknown | Unknown | No | Yes | Oct 6, 2025 | |
Poe poe | Sora-2-Pro openai/sora-2-pro | sora | text, image | video | Unknown | Unknown | Unknown | Unknown | No | Yes | Oct 6, 2025 |
6 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Azure Cognitive Services azure-cognitive-services | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image, audio | text, image, audio | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure azure | GPT-5.1 Codex gpt-5.1-codex | gpt-codex | text, image, audio | text, image, audio | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.1 gpt-5.1 | gpt | text, image, audio | text, image, audio | 272,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure azure | GPT-5.1 gpt-5.1 | gpt | text, image, audio | text, image, audio | 272,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure Cognitive Services azure-cognitive-services | GPT-5.1 Chat gpt-5.1-chat | gpt-codex | text, image, audio | text, image, audio | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 14, 2025 | |
Azure azure | GPT-5.1 Chat gpt-5.1-chat | gpt-codex | text, image, audio | text, image, audio | 128,000 | 16,384 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 14, 2025 |
5 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
OpenRouter openrouter | FLUX.2 Flex black-forest-labs/flux.2-flex | flux | image, text | image | 67,344 | 67,344 | $0 / 1M | $0 / 1M | No | No | Jan 31, 2026 | |
OpenRouter openrouter | FLUX.2 Max black-forest-labs/flux.2-max | flux | image, text | image | 46,864 | 46,864 | $0 / 1M | $0 / 1M | No | No | Jan 31, 2026 | |
OpenRouter openrouter | FLUX.2 Pro black-forest-labs/flux.2-pro | flux | image, text | image | 46,864 | 46,864 | $0 / 1M | $0 / 1M | No | No | Jan 31, 2026 | |
OpenRouter openrouter | FLUX.2 Klein 4B black-forest-labs/flux.2-klein-4b | flux | image, text | image | 40,960 | 40,960 | $0 / 1M | $0 / 1M | No | No | Jan 31, 2026 | |
OpenRouter openrouter | Seedream 4.5 bytedance-seed/seedream-4.5 | seed | image, text | image | 4,096 | 4,096 | $0 / 1M | $0 / 1M | No | No | Jan 31, 2026 |
5 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
ZenMux zenmux | Claude Sonnet 4 anthropic/claude-sonnet-4 | Unknown | image, text, pdf | text | 1,000,000 | 64,000 | $3 / 1M | $15 / 1M | Yes | Yes | May 22, 2025 | |
ZenMux zenmux | GPT-5.1 openai/gpt-5.1 | Unknown | image, text, pdf | text | 400,000 | 64,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Nov 13, 2025 | |
ZenMux zenmux | GPT-5.2 openai/gpt-5.2 | Unknown | image, text, pdf | text | 400,000 | 64,000 | $1.75 / 1M | $14 / 1M | Yes | Yes | Dec 11, 2025 | |
ZenMux zenmux | Claude Opus 4 anthropic/claude-opus-4 | Unknown | image, text, pdf | text | 200,000 | 32,000 | $15 / 1M | $75 / 1M | Yes | Yes | May 22, 2025 | |
ZenMux zenmux | Claude Opus 4.1 anthropic/claude-opus-4.1 | Unknown | image, text, pdf | text | 200,000 | 64,000 | $15 / 1M | $75 / 1M | Yes | Yes | Aug 5, 2025 |
5 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Vercel AI Gateway vercel | Nano Banana Pro (Gemini 3 Pro Image) google/gemini-3-pro-image | gemini-pro | text | text, image | 65,536 | 32,768 | $2 / 1M | $120 / 1M | No | No | Sep 1, 2025 | |
Vercel AI Gateway vercel | Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image | gemini-flash | text | text, image | 32,768 | 32,768 | $0.3 / 1M | $2.5 / 1M | No | No | Mar 20, 2025 | |
Vercel AI Gateway vercel | Nano Banana Preview (Gemini 2.5 Flash Image Preview) google/gemini-2.5-flash-image-preview | gemini-flash | text | text, image | 32,768 | 32,768 | $0.3 / 1M | $2.5 / 1M | No | No | Mar 20, 2025 | |
Vercel AI Gateway vercel | Grok Imagine Image xai/grok-imagine-image | grok | text | text, image | Unknown | Unknown | Unknown | Unknown | No | No | Feb 19, 2026 | |
Vercel AI Gateway vercel | Grok Imagine Image Pro xai/grok-imagine-image-pro | grok | text | text, image | Unknown | Unknown | Unknown | Unknown | No | No | Feb 19, 2026 |
4 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Poe poe | Veo-2 google/veo-2 | veo | text | video | 480 | Unknown | Unknown | Unknown | No | Yes | Dec 2, 2024 | |
Poe poe | Veo-3 google/veo-3 | veo | text | video | 480 | Unknown | Unknown | Unknown | No | Yes | May 21, 2025 | |
Poe poe | Veo-3-Fast google/veo-3-fast | veo | text | video | 480 | Unknown | Unknown | Unknown | No | Yes | Oct 13, 2025 | |
Poe poe | Veo-3.1 google/veo-3.1 | veo | text | video | 480 | Unknown | Unknown | Unknown | No | Yes | Oct 15, 2025 |
3 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Alibaba (China) alibaba-cn | Qwen3-Omni Flash Realtime qwen3-omni-flash-realtime | qwen | text, image, audio | text, audio | 65,536 | 16,384 | $0.23 / 1M | $0.918 / 1M | No | Yes | Sep 15, 2025 | |
Alibaba (China) alibaba-cn | Qwen-Omni Turbo Realtime qwen-omni-turbo-realtime | qwen | text, image, audio | text, audio | 32,768 | 2,048 | $0.23 / 1M | $0.918 / 1M | No | Yes | May 8, 2025 | |
Alibaba alibaba | Qwen-Omni Turbo Realtime qwen-omni-turbo-realtime | qwen | text, image, audio | text, audio | 32,768 | 2,048 | $0.27 / 1M | $1.07 / 1M | No | Yes | May 8, 2025 |
3 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Amazon Bedrock amazon-bedrock | Voxtral Small 24B 2507 mistral.voxtral-small-24b-2507 | mistral | text, audio | text | 32,000 | 8,192 | $0.15 / 1M | $0.35 / 1M | No | Yes | Jul 1, 2025 | |
Kilo Gateway kilo | Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 | Unknown | text, audio | text | 32,000 | 6,400 | $0.1 / 1M | $0.3 / 1M | No | Yes | Jul 1, 2025 | |
Scaleway scaleway | Voxtral Small 24B 2507 voxtral-small-24b-2507 | voxtral | text, audio | text | 32,000 | 16,384 | $0.15 / 1M | $0.35 / 1M | No | Yes | Mar 17, 2026 |
3 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
NovitaAI novita-ai | GLM 4.6V zai-org/glm-4.6v | glmv | text, video, image | text | 131,072 | 32,768 | $0.3 / 1M | $0.9 / 1M | Yes | Yes | Dec 8, 2025 | |
NovitaAI novita-ai | qwen/qwen3-vl-30b-a3b-instruct qwen/qwen3-vl-30b-a3b-instruct | Unknown | text, video, image | text | 131,072 | 32,768 | $0.2 / 1M | $0.7 / 1M | No | Yes | Oct 11, 2025 | |
NovitaAI novita-ai | GLM 4.5V zai-org/glm-4.5v | glmv | text, video, image | text | 65,536 | 16,384 | $0.6 / 1M | $1.8 / 1M | Yes | Yes | Aug 11, 2025 |
3 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | OpenAI: GPT Audio openai/gpt-audio | Unknown | audio, text | audio, text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT Audio Mini openai/gpt-audio-mini | Unknown | audio, text | audio, text | 128,000 | 16,384 | $0.6 / 1M | $2.4 / 1M | No | No | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview | Unknown | audio, text | audio, text | 128,000 | 16,384 | $2.5 / 1M | $10 / 1M | No | Yes | Mar 15, 2026 |
3 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) google/gemini-3.1-flash-image-preview | Unknown | image, text | image, text | 65,536 | 65,536 | $0.5 / 1M | $3 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview | Unknown | image, text | image, text | 65,536 | 32,768 | $2 / 1M | $12 / 1M | Yes | No | Mar 15, 2026 | |
Kilo Gateway kilo | Google: Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image | Unknown | image, text | image, text | 32,768 | 32,768 | $0.3 / 1M | $2.5 / 1M | No | No | Mar 15, 2026 |
2 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
ZenMux zenmux | Claude Opus 4.5 anthropic/claude-opus-4.5 | Unknown | pdf, image, text | text | 200,000 | 64,000 | $5 / 1M | $25 / 1M | Yes | Yes | Nov 24, 2025 | |
ZenMux zenmux | GPT-5.1 Chat openai/gpt-5.1-chat | Unknown | pdf, image, text | text | 128,000 | 64,000 | $1.25 / 1M | $10 / 1M | No | Yes | Nov 13, 2025 |
2 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | OpenAI: o3 Mini openai/o3-mini | Unknown | pdf, text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | No | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: o3 Mini High openai/o3-mini-high | Unknown | pdf, text | text | 200,000 | 100,000 | $1.1 / 1M | $4.4 / 1M | No | Yes | Mar 15, 2026 |
2 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | OpenAI: GPT-5 Image openai/gpt-5-image | Unknown | image, pdf, text | image, text | 400,000 | 128,000 | $10 / 1M | $10 / 1M | Yes | Yes | Mar 15, 2026 | |
Kilo Gateway kilo | OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini | Unknown | image, pdf, text | image, text | 400,000 | 128,000 | $2.5 / 1M | $2 / 1M | Yes | Yes | Mar 15, 2026 |
2 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | Google: Lyria 3 Clip Preview google/lyria-3-clip-preview | Unknown | image, text | audio, text | 1,048,576 | 65,536 | $0 / 1M | $0 / 1M | No | No | Apr 11, 2026 | |
Kilo Gateway kilo | Google: Lyria 3 Pro Preview google/lyria-3-pro-preview | Unknown | image, text | audio, text | 1,048,576 | 65,536 | $0 / 1M | $0 / 1M | No | No | Apr 11, 2026 |
2 model API records in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Amazon Bedrock amazon-bedrock | Voxtral Mini 3B 2507 mistral.voxtral-mini-3b-2507 | mistral | audio, text | text | 128,000 | 4,096 | $0.04 / 1M | $0.04 / 1M | No | Yes | Dec 1, 2024 | |
evroc evroc | Voxtral Small 24B mistralai/Voxtral-Small-24B-2507 | voxtral | audio, text | text | 32,000 | 32,000 | $0.00236 / 1M | $0.00236 / 1M | No | No | Mar 1, 2025 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
NovitaAI novita-ai | Qwen3 Omni 30B A3B Instruct qwen/qwen3-omni-30b-a3b-instruct | qwen | text, video, audio, image | text, audio | 65,536 | 16,384 | $0.25 / 1M | $0.97 / 1M | No | Yes | Sep 24, 2025 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Qiniu qiniu-ai | Kling-V2 6 kling-v2-6 | Unknown | text, image, video | video | 99,999,999 | 99,999,999 | Unknown | Unknown | No | No | Jan 13, 2026 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Kilo Gateway kilo | Auto Router openrouter/auto | Unknown | audio, image, pdf, text, video | image, text | 2,000,000 | 32,768 | $0 / 1M | $0 / 1M | Yes | Yes | Mar 15, 2026 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Poe poe | GPT-5.4 openai/gpt-5.4 | Unknown | text, image, pdf | image | 1,050,000 | 128,000 | $2.2 / 1M | $14 / 1M | Yes | Yes | Feb 26, 2026 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Nvidia nvidia | NeMo Retriever OCR v1 nvidia/nemoretriever-ocr-v1 | nemoretriever | image | text | Unknown | 4,096 | $0 / 1M | $0 / 1M | No | No | Sep 5, 2025 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Requesty requesty | GPT-5 openai/gpt-5 | gpt | text, audio, image, video | text, audio, image | 400,000 | 128,000 | $1.25 / 1M | $10 / 1M | Yes | Yes | Aug 7, 2025 |
1 model API record in this modality group.
| Compare | Provider | Model | Family | Input | Output | Context | Max output | Input price | Output price | Reasoning | Tools | Updated |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Google google | Gemini Live 2.5 Flash Preview Native Audio gemini-live-2.5-flash-preview-native-audio | gemini-flash | text, audio, video | text, audio | 131,072 | 65,536 | $0.5 / 1M | $2 / 1M | Yes | Yes | Sep 18, 2025 |
AI Model API Directory FAQ
Answers to common questions about filtering model APIs, interpreting pricing, and saving models for comparison.
How do I find an AI model API for a specific input and output type?
Use the input and output modality filters to find models that match the workflow you need. For example, choose text input and image output to narrow the directory to text-to-image models, or choose image input and text output for vision-language models. The directory updates the visible groups immediately, so you can scan only the models that support the required combination. After filtering, add promising models to the compare list for a side-by-side check.
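The same filter can be sketched in code for anyone working from an export of these tables. This is a minimal illustration, not the directory's own logic: the `Record` class, `parse_modalities`, and `filter_by_modality` are hypothetical names, and exact set matching is an assumption. It treats modality cells as unordered sets, because the page lists `text, image` and `image, text` as separate groups even though they describe the same inputs.

```python
from dataclasses import dataclass


def parse_modalities(cell: str) -> frozenset[str]:
    """Turn a table cell such as 'image, text' into an order-independent set."""
    return frozenset(part.strip().lower() for part in cell.split(",") if part.strip())


@dataclass(frozen=True)
class Record:
    provider: str
    model_id: str
    inputs: frozenset[str]
    outputs: frozenset[str]


# A few rows copied from the tables on this page.
RECORDS = [
    Record("openrouter", "black-forest-labs/flux.2-pro",
           parse_modalities("image, text"), parse_modalities("image")),
    Record("digitalocean", "openai-gpt-image-1",
           parse_modalities("text, image"), parse_modalities("image")),
    Record("groq", "whisper-large-v3",
           parse_modalities("audio"), parse_modalities("text")),
    Record("poe", "google/veo-3",
           parse_modalities("text"), parse_modalities("video")),
]


def filter_by_modality(records, inputs: str, outputs: str):
    """Keep records whose input and output sets match the request exactly."""
    want_in, want_out = parse_modalities(inputs), parse_modalities(outputs)
    return [r for r in records if r.inputs == want_in and r.outputs == want_out]


matches = filter_by_modality(RECORDS, "text, image", "image")
print([r.model_id for r in matches])
```

With the sample rows, both the FLUX.2 Pro and GPT Image 1 records match, even though their input cells are written in a different order.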
Which cost fields should I check before choosing a model API?
Check both input cost and output cost before choosing a model API. A model with cheap input can still become expensive if your use case generates long answers, images, audio, or other high-cost outputs. Treat the directory pricing as screening data, then verify the provider billing unit, discounts, and current terms before production use. For a clearer decision, add at least two models to the comparison page and look for the lowest highlighted cost cells.
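A short worked example shows why both fields matter. The sketch below uses prices from two rows on this page and a hypothetical workload of 2,000 input units and 1,000 output units per request. It assumes both prices are billed per 1M tokens, which is exactly the kind of billing-unit detail that should be verified with the provider. `request_cost` is an illustrative helper, not part of any provider SDK.

```python
def request_cost(input_units: int, output_units: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimated cost in USD for one request, given prices per 1M units."""
    return (input_units * input_price_per_m + output_units * output_price_per_m) / 1_000_000


# Prices copied from rows on this page (USD per 1M units).
gemini_flash_image = {"input": 0.3, "output": 30.0}   # Google, gemini-2.5-flash-image
gpt_5_1 = {"input": 1.25, "output": 10.0}             # Helicone, gpt-5.1

# Hypothetical workload: 2,000 input units and 1,000 output units per request.
cost_a = request_cost(2_000, 1_000, gemini_flash_image["input"], gemini_flash_image["output"])
cost_b = request_cost(2_000, 1_000, gpt_5_1["input"], gpt_5_1["output"])

print(f"gemini-2.5-flash-image: ${cost_a:.4f}")
print(f"gpt-5.1:                ${cost_b:.4f}")
```

Under these assumptions, the record with the lower input price costs more per request, because the output price dominates the total.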
Can I compare models from this directory immediately?
Yes, use the Add button in any model row to save that model to your browser compare list. The compare list stays in your browser, so the Compare page can load those models automatically on the same device and browser. Add two or more models when you want a meaningful comparison of cost, limits, modalities, and capability flags. Remove models from the comparison page when they are no longer relevant.
Should I pick the cheapest AI model API in the directory?
Do not pick a model only because one cost field is the cheapest. Start with modality fit, then check context window, output limit, input cost, output cost, reasoning support, tool calling, and open-weights status. The cheapest model can be the wrong choice if it cannot accept your inputs, produce the required output, or handle the prompt length. Use cost as a shortlist signal, not as the final model selection rule.
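The ordering described above (hard requirements first, cost last) can be sketched as a small shortlist function. The rows come from the text, image, audio -> text table on this page, so modality fit is already settled by the choice of table. The `ModelRow` type, the `shortlist` function, and the thresholds are hypothetical, and `None` stands in for "Unknown" cells.

```python
from typing import Optional, TypedDict


class ModelRow(TypedDict):
    model_id: str
    context: Optional[int]        # None stands in for "Unknown"
    max_output: Optional[int]
    input_price: Optional[float]  # USD per 1M units
    output_price: Optional[float]
    tools: bool


# Rows copied from the text, image, audio -> text table on this page.
ROWS: list[ModelRow] = [
    {"model_id": "phi-4-multimodal", "context": 128_000, "max_output": 4_096,
     "input_price": 0.08, "output_price": 0.32, "tools": False},
    {"model_id": "gpt-4o-2024-11-20", "context": 128_000, "max_output": 16_384,
     "input_price": 2.5, "output_price": 10.0, "tools": True},
    {"model_id": "Qwen/Qwen3-Omni-30B-A3B-Instruct", "context": 66_000, "max_output": 66_000,
     "input_price": 0.1, "output_price": 0.4, "tools": True},
]


def shortlist(rows, min_context: int, min_output: int, need_tools: bool):
    """Apply hard requirements first, then order survivors by combined price."""
    def fits(row: ModelRow) -> bool:
        if row["context"] is None or row["context"] < min_context:
            return False
        if row["max_output"] is None or row["max_output"] < min_output:
            return False
        if need_tools and not row["tools"]:
            return False
        # Rows with unknown pricing cannot be ranked, so they are set aside.
        return row["input_price"] is not None and row["output_price"] is not None

    survivors = [row for row in rows if fits(row)]
    return sorted(survivors, key=lambda r: r["input_price"] + r["output_price"])


picked = shortlist(ROWS, min_context=60_000, min_output=8_000, need_tools=True)
print([row["model_id"] for row in picked])
```

The cheapest row, `phi-4-multimodal`, drops out because its output limit and tool support do not meet the requirements, which is the point of filtering before sorting by price.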
How fresh is the model API information?
The page shows an update date near the top, and each model row includes the provider's update date in the Updated column when available. Use those dates to judge whether a model record is recent enough for early screening. Model availability, names, prices, and limits can change quickly, so confirm critical details with the provider before launch. This directory is best used for discovery and comparison, not as a contract source.
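One way to apply this check to exported rows is to parse the Updated cell and flag records older than a chosen threshold. The sketch below is illustrative only: `parse_updated`, `is_stale`, and the 365-day threshold are assumptions, and the `%b` format code expects English month abbreviations, which holds under Python's default locale.

```python
from datetime import date, datetime


def parse_updated(cell: str) -> date:
    """Parse an 'Updated' cell such as 'Jul 9, 2023' (English month abbreviations)."""
    return datetime.strptime(cell, "%b %d, %Y").date()


def is_stale(updated_cell: str, today: date, max_age_days: int = 365) -> bool:
    """True when the record is older than the chosen screening threshold."""
    return (today - parse_updated(updated_cell)).days > max_age_days


# A fixed reference date keeps the example reproducible; use date.today() in practice.
reference = date(2026, 5, 1)

print(is_stale("Jul 9, 2023", reference))   # StableDiffusionXL on Poe
print(is_stale("Apr 21, 2026", reference))  # GPT-Image-2 on Poe
```

A stale date is only a prompt to re-check the provider's own documentation; it does not by itself mean the record is wrong.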
Does this directory rank AI model quality?
No, this directory does not rank model quality or reliability. It compares technical availability fields such as modality, context window, output limit, pricing, reasoning support, tool calling, and open-weights status. Benchmark scores, production reliability, license terms, and real workload quality need separate evaluation. Use this page to find candidates, then test shortlisted models against your own prompts and constraints.
Limits
Context and output limits help screen models for long documents, coding agents, and large retrieval workflows.
Pricing
Input and output costs are useful for budget screening, but provider-specific pricing should be verified before production use.
Capabilities
Reasoning, tool calling, attachment, and open-weights flags are technical availability signals, not quality scores.
- Commercial license terms are not included in the source payload.
- Run counts, popularity, benchmark scores, and reliability rankings are not included.
- Gateway pricing can differ from original vendor pricing.
- This directory compares technical availability, not model quality.
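The note on gateway pricing can be checked against the tables themselves. The sketch below groups four listings of the same model from this page and prints the spread. The `base_name` helper is a hypothetical normalization that drops a leading `vendor/` prefix. The large gap in output price may reflect different billing units rather than a true price difference, which the source data does not say, so treat the result as a reason to verify and not as a ranking.

```python
from collections import defaultdict

# (provider, model id, input price, output price), USD per 1M units, from this page.
LISTINGS = [
    ("google", "gemini-3.1-flash-image-preview", 0.25, 60.0),
    ("302ai", "gemini-3.1-flash-image-preview", 0.5, 60.0),
    ("vercel", "google/gemini-3.1-flash-image-preview", 0.5, 3.0),
    ("openrouter", "google/gemini-3.1-flash-image-preview", 0.5, 3.0),
]


def base_name(model_id: str) -> str:
    """Drop a leading 'vendor/' prefix so gateway and vendor ids line up."""
    return model_id.rsplit("/", 1)[-1]


by_model = defaultdict(list)
for provider, model_id, in_price, out_price in LISTINGS:
    by_model[base_name(model_id)].append((provider, in_price, out_price))

for name, offers in by_model.items():
    out_prices = [out for _, _, out in offers]
    print(name, "output price range:", min(out_prices), "to", max(out_prices))
    for provider, in_price, out_price in offers:
        print(f"  {provider:<10} in ${in_price} / out ${out_price}")
```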