claude-sonnet-4-20250514
200K context | Anthropic | $15 / MTokens Direct/Fusion
The hybrid reasoning model further expands the advantages of programming and agent capabilities and provides better support for regional compliance scenarios.
claude-sonnet-4-20250514
Anthropic | $15 / MTokens
The hybrid reasoning model further expands the advantages of programming and agent capabilities and provides better support for regional compliance scenarios.
ClaudeCode text Inference FIM
| Text Prompt | $3 | /MTokens |
| Text Completion | $15 | /MTokens |
ClaudeCode
claude-3-7-sonnet-20250219
coding | Anthropic | $15 / MTokens Direct/Fusion
Claude-3-7-Sonnet-20250219 is an upgraded AI model by Anthropic, released on Feb 19, 2025. It’s easy to use, gives accurate answers, writes well, and works smoothly with tools. Great for daily tasks and work
claude-3-7-sonnet-20250219
Anthropic | $15 / MTokens
Claude-3-7-Sonnet-20250219 is an upgraded AI model by Anthropic, released on Feb 19, 2025. It’s easy to use, gives accurate answers, writes well, and works smoothly with tools. Great for daily tasks and work
ClaudeCode text Inference Visual Tools FIM
| Text Prompt | $3 | /MTokens |
| Text Completion | $15 | /MTokens |
| Text Cache Prompt | $0.3 | /MTokens |
ClaudeCode
claude-opus-4-5-20251101
Anthropic | $25 / MTokens Direct/Fusion
It’s intelligent, efficient, and the best model in the world for coding, agents, and computer use. It’s also meaningfully better at everyday tasks like deep research and working with slides and spreadsheets.
claude-opus-4-5-20251101
Anthropic | $25 / MTokens
It’s intelligent, efficient, and the best model in the world for coding, agents, and computer use. It’s also meaningfully better at everyday tasks like deep research and working with slides and spreadsheets.
ClaudeCode text Tools Coder
| Text Prompt | $5 | /MTokens |
| Text Completion | $25 | /MTokens |
| Text Cache Prompt | $0.5 | /MTokens |
ClaudeCode
claude-opus-4-1-20250805
coding | Anthropic | $75 / MTokens Direct/Fusion
It scores 74.5% on SWE - bench Verified for coding. It excels in multi - file refactoring, research, and data analysis. Available on API and platforms, with the same price as Opus 4.
claude-opus-4-1-20250805
Anthropic | $75 / MTokens
It scores 74.5% on SWE - bench Verified for coding. It excels in multi - file refactoring, research, and data analysis. Available on API and platforms, with the same price as Opus 4.
ClaudeCode text image Inference Visual Tools FIM Math
| Text Prompt | $15 | /MTokens |
| Text Completion | $75 | /MTokens |
| Text Cache Prompt | $1.5 | /MTokens |
ClaudeCode
claude-opus-4-20250514
200K context | Anthropic | $75 / MTokens Direct/Fusion
Hybrid reasoning model supports the execution of tasks with higher complexity, further expanding the advantages of programming and agent capabilities; better support for regional compliance scenarios
claude-opus-4-20250514
Anthropic | $75 / MTokens
Hybrid reasoning model supports the execution of tasks with higher complexity, further expanding the advantages of programming and agent capabilities; better support for regional compliance scenarios
ClaudeCode text Inference Visual Tools
| Text Prompt | $15 | /MTokens |
| Text Completion | $75 | /MTokens |
| Text Cache Prompt | $1.5 | /MTokens |
ClaudeCode
claude-sonnet-4-5-20250929
200K context | Anthropic | $15 / MTokens Direct/Fusion
Anthropic’s top model for agents/coding, boasting 30hrs autonomous task runtime, enhanced programming & computer skills. New "Imagine with Claude" and 200K context. Cost-effective vs peers .
claude-sonnet-4-5-20250929
Anthropic | $15 / MTokens
Anthropic’s top model for agents/coding, boasting 30hrs autonomous task runtime, enhanced programming & computer skills. New "Imagine with Claude" and 200K context. Cost-effective vs peers .
Anthropic text Inference
| Text Prompt | $3 | /MTokens |
| Text Completion | $15 | /MTokens |
Anthropic
claude-haiku-4-5-20251001
Anthropic | $5 / MTokens Direct/Fusion
Claude-Haiku-4-5-20251001 is Anthropic’s latest lightweight AI model, blending cutting-edge performance with exceptional speed and cost-efficiency. It matches Claude Sonnet 4’s capabilities in coding, computer use, and agentic workflows while being over twice as fast and costing just one-third as much.
claude-haiku-4-5-20251001
Anthropic | $5 / MTokens
Claude-Haiku-4-5-20251001 is Anthropic’s latest lightweight AI model, blending cutting-edge performance with exceptional speed and cost-efficiency. It matches Claude Sonnet 4’s capabilities in coding, computer use, and agentic workflows while being over twice as fast and costing just one-third as much.
ClaudeCode text Inference Visual
| Text Prompt | $1 | /MTokens |
| Text Completion | $5 | /MTokens |
ClaudeCode
gpt-5-mini-2025-08-07
OpenAI | $2 / MTokens Direct/Fusion
GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.
gpt-5-mini-2025-08-07
OpenAI | $2 / MTokens
GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.
Openai text image Visual
| Text Prompt | $0.25 | /MTokens |
| Text Completion | $2 | /MTokens |
| Text Cache Prompt | $0.025 | /MTokens |
Openai
o4-mini-2025-04-16
OpenAI | $4.4 / MTokens Direct/Fusion
It is a lightweight version of o3, capable of handling text, images and audio. It can intelligently use and combine various tools in ChatGPT. In the AIME 2024 and 2025 math competition questions, its accuracy rates reached 93.4% and 92.7% respectively.
o4-mini-2025-04-16
OpenAI | $4.4 / MTokens
It is a lightweight version of o3, capable of handling text, images and audio. It can intelligently use and combine various tools in ChatGPT. In the AIME 2024 and 2025 math competition questions, its accuracy rates reached 93.4% and 92.7% respectively.
Openai text Inference
| Text Prompt | $1.1 | /MTokens |
| Text Completion | $4.4 | /MTokens |
| Text Cache Prompt | $0.28 | /MTokens |
Openai
gpt-4o-audio-preview-2025-06-03
OpenAI | $10 / MTokens Direct/Fusion
This is a preview release of the GPT-4o Audio models. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.
gpt-4o-audio-preview-2025-06-03
OpenAI | $10 / MTokens
This is a preview release of the GPT-4o Audio models. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.
Openai text Inference
| Text Prompt | $2.5 | /MTokens |
| Text Completion | $10 | /MTokens |
Openai
gpt-4o-2024-11-20
OpenAI | $10 / MTokens Direct/Fusion
Launched on November 20, 2024, GPT-4o from OpenAI offers 128K context window and up to 16.4K output tokens. With a knowledge cut-off in October 2023, it's better at creative writing and file analysis. Available via API
gpt-4o-2024-11-20
OpenAI | $10 / MTokens
Launched on November 20, 2024, GPT-4o from OpenAI offers 128K context window and up to 16.4K output tokens. With a knowledge cut-off in October 2023, it's better at creative writing and file analysis. Available via API
Openai text Inference
| Text Prompt | $2.5 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $1.25 | /MTokens |
Openai
gpt-5.2-chat-latest
OpenAI | $14 / MTokens Direct/Fusion
GPT-5.2 is our best general-purpose model, part of the GPT-5 flagship model family. Our most intelligent model yet for both general and agentic tasks, GPT-5.2 shows improvements over the previous GPT-5.1 in: - General intelligence - Instruction following - Accuracy and token efficiency - Multimodality—especially vision - Code generation—especially front-end UI creation - Tool calling and context management in the API - Spreadsheet understanding and creation
gpt-5.2-chat-latest
OpenAI | $14 / MTokens
GPT-5.2 is our best general-purpose model, part of the GPT-5 flagship model family. Our most intelligent model yet for both general and agentic tasks, GPT-5.2 shows improvements over the previous GPT-5.1 in: - General intelligence - Instruction following - Accuracy and token efficiency - Multimodality—especially vision - Code generation—especially front-end UI creation - Tool calling and context management in the API - Spreadsheet understanding and creation
Openai text Inference
| Text Prompt | $1.75 | /MTokens |
| Text Completion | $14 | /MTokens |
| Text Cache Prompt | $0.175 | /MTokens |
Openai
gpt-5-chat-latest
OpenAI | $10 / MTokens Direct/Fusion
GPT-5 Chat points to the GPT-5 snapshot currently used in ChatGPT. We recommend GPT-5 for most API usage, but feel free to use this GPT-5 Chat model to test our latest improvements for chat use cases.
gpt-5-chat-latest
OpenAI | $10 / MTokens
GPT-5 Chat points to the GPT-5 snapshot currently used in ChatGPT. We recommend GPT-5 for most API usage, but feel free to use this GPT-5 Chat model to test our latest improvements for chat use cases.
Openai text image Visual
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $0.125 | /MTokens |
Openai
gpt-image-1
OpenAI | $40 / MTokens Direct/Fusion
GPT - image - 1 is OpenAI's latest image - generation API, launched in April 2025. It is based on the GPT - 4o architecture, excels in high - fidelity image generation with diverse styles, and has strong text - rendering ability.
gpt-image-1
OpenAI | $40 / MTokens
GPT - image - 1 is OpenAI's latest image - generation API, launched in April 2025. It is based on the GPT - 4o architecture, excels in high - fidelity image generation with diverse styles, and has strong text - rendering ability.
Openai image Inference
| Text Prompt | $10 | /MTokens |
| Text Completion | $40 | /MTokens |
Openai
sora-2-chat
OpenAI | $0.1 / One Times Direct/Fusion
sora-2-chat
OpenAI | $0.1 / One Times
Openai text image video Inference Visual
| Times | $0.1 | /One Times |
Openai
gpt-4o
OpenAI | $10 / MTokens Direct/Fusion
GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.
gpt-4o
OpenAI | $10 / MTokens
GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.
Openai text image Visual
| Text Prompt | $2.5 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $1.25 | /MTokens |
Openai
o3-mini-2025-01-31
OpenAI | $4.4 / MTokens Direct/Fusion
a cost - effective reasoning model. It's 24% faster than its predecessors, excels in STEM, especially math, coding, and science. With adjustable reasoning levels, it's available on ChatGPT and API for various user tiers.
o3-mini-2025-01-31
OpenAI | $4.4 / MTokens
a cost - effective reasoning model. It's 24% faster than its predecessors, excels in STEM, especially math, coding, and science. With adjustable reasoning levels, it's available on ChatGPT and API for various user tiers.
Openai text Inference
| Text Prompt | $1.1 | /MTokens |
| Text Completion | $4.4 | /MTokens |
| Text Cache Prompt | $0.55 | /MTokens |
Openai
gpt-4.1-mini-2025-04-14
gpt-4.1-mini | OpenAI | $1.6 / MTokens Direct/Fusion
by OpenAI is a cost - effective, lightweight model. It uses 60% fewer parameters than GPT - 4o yet keeps 82% of its MMLU score. tasks, ideal for developers and businesses.
gpt-4.1-mini-2025-04-14
OpenAI | $1.6 / MTokens
by OpenAI is a cost - effective, lightweight model. It uses 60% fewer parameters than GPT - 4o yet keeps 82% of its MMLU score. tasks, ideal for developers and businesses.
Openai text Inference
| Text Prompt | $0.4 | /MTokens |
| Text Completion | $1.6 | /MTokens |
| Text Cache Prompt | $0.1 | /MTokens |
| Search Tool | $25 | /1KCall |
Openai
gpt-5-chat-latest
OpenAI | $10 / MTokens Direct/Fusion
GPT-5 Chat points to the GPT-5 snapshot currently used in ChatGPT. We recommend GPT-5 for most API usage, but feel free to use this GPT-5 Chat model to test our latest improvements for chat use cases.
gpt-5-chat-latest
OpenAI | $10 / MTokens
GPT-5 Chat points to the GPT-5 snapshot currently used in ChatGPT. We recommend GPT-5 for most API usage, but feel free to use this GPT-5 Chat model to test our latest improvements for chat use cases.
Azure text image Visual
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $0.125 | /MTokens |
Azure
gpt-5.1-chat-2025-11-13
OpenAI | $10 / MTokens Direct/Fusion
gpt-5.1-chat-2025-11-13 represents the latest iteration of OpenAI's foundational models. It is designed to handle complex reasoning tasks, code generation, and multimodal analysis with improved efficiency and lower latency.
gpt-5.1-chat-2025-11-13
OpenAI | $10 / MTokens
gpt-5.1-chat-2025-11-13 represents the latest iteration of OpenAI's foundational models. It is designed to handle complex reasoning tasks, code generation, and multimodal analysis with improved efficiency and lower latency.
Openai text Inference
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
Openai
gpt-4o-2024-08-06
OpenAI | $10 / MTokens Direct/Fusion
OpenAI's advanced model. It supports structured output like JSON Schema, which simplifies data generation. It's 50% cheaper for input and 33% for output tokens, making it cost - effective for various tasks.
gpt-4o-2024-08-06
OpenAI | $10 / MTokens
OpenAI's advanced model. It supports structured output like JSON Schema, which simplifies data generation. It's 50% cheaper for input and 33% for output tokens, making it cost - effective for various tasks.
Azure text Inference
| Text Prompt | $2.5 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $1.25 | /MTokens |
Azure
gpt-5.2-2025-12-11
OpenAI | $14 / MTokens Direct/Fusion
GPT-5.2 is our best general-purpose model, part of the GPT-5 flagship model family. Our most intelligent model yet for both general and agentic tasks, GPT-5.2 shows improvements over the previous GPT-5.1 in: - General intelligence - Instruction following - Accuracy and token efficiency - Multimodality—especially vision - Code generation—especially front-end UI creation - Tool calling and context management in the API - Spreadsheet understanding and creation
gpt-5.2-2025-12-11
OpenAI | $14 / MTokens
GPT-5.2 is our best general-purpose model, part of the GPT-5 flagship model family. Our most intelligent model yet for both general and agentic tasks, GPT-5.2 shows improvements over the previous GPT-5.1 in: - General intelligence - Instruction following - Accuracy and token efficiency - Multimodality—especially vision - Code generation—especially front-end UI creation - Tool calling and context management in the API - Spreadsheet understanding and creation
Openai text Inference
| Text Prompt | $1.75 | /MTokens |
| Text Completion | $14 | /MTokens |
| Text Cache Prompt | $0.175 | /MTokens |
Openai
gpt-5.2
OpenAI | $14 / MTokens Direct/Fusion
GPT-5.2 is our best general-purpose model, part of the GPT-5 flagship model family. Our most intelligent model yet for both general and agentic tasks, GPT-5.2 shows improvements over the previous GPT-5.1 in: - General intelligence - Instruction following - Accuracy and token efficiency - Multimodality—especially vision - Code generation—especially front-end UI creation - Tool calling and context management in the API - Spreadsheet understanding and creation
gpt-5.2
OpenAI | $14 / MTokens
GPT-5.2 is our best general-purpose model, part of the GPT-5 flagship model family. Our most intelligent model yet for both general and agentic tasks, GPT-5.2 shows improvements over the previous GPT-5.1 in: - General intelligence - Instruction following - Accuracy and token efficiency - Multimodality—especially vision - Code generation—especially front-end UI creation - Tool calling and context management in the API - Spreadsheet understanding and creation
Openai text Inference
| Text Prompt | $1.75 | /MTokens |
| Text Completion | $14 | /MTokens |
| Text Cache Prompt | $0.175 | /MTokens |
Openai
gpt-5-2025-08-07
OpenAI | $10 / MTokens Direct/Fusion
GPT-5 is our flagship model for coding, reasoning, and agentic tasks across domains.
gpt-5-2025-08-07
OpenAI | $10 / MTokens
GPT-5 is our flagship model for coding, reasoning, and agentic tasks across domains.
Openai text image Visual
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $0.125 | /MTokens |
Openai
gpt-4o-mini
OpenAI | $0.6 / MTokens Direct/Fusion
a cost-efficient small model by OpenAI. It scores 82% on MMLU, outperforming some models. With 128k context window, it supports text & vision in API, priced affordably for various AI applications.
gpt-4o-mini
OpenAI | $0.6 / MTokens
a cost-efficient small model by OpenAI. It scores 82% on MMLU, outperforming some models. With 128k context window, it supports text & vision in API, priced affordably for various AI applications.
Openai text image Visual
| Text Prompt | $0.15 | /MTokens |
| Text Completion | $0.6 | /MTokens |
| Text Cache Prompt | $0.075 | /MTokens |
Openai
gpt-4o-audio-preview
OpenAI | $10 / MTokens Direct/Fusion
The gpt-audio model is our first generally available audio model. It accepts audio inputs and outputs, and can be used in the Chat Completions REST API.
gpt-4o-audio-preview
OpenAI | $10 / MTokens
The gpt-audio model is our first generally available audio model. It accepts audio inputs and outputs, and can be used in the Chat Completions REST API.
Openai text Inference
| Text Prompt | $2.5 | /MTokens |
| Text Completion | $10 | /MTokens |
Openai
sora-2
OpenAI | $0.1 / One Times Direct/Fusion
OpenAI's Sora 2 elevates AI video generation with synchronized audio, enhanced physical realism, and character consistency. It accepts text/images/videos as inputs, offers precise storyboard control, and a mobile app.
sora-2
OpenAI | $0.1 / One Times
OpenAI's Sora 2 elevates AI video generation with synchronized audio, enhanced physical realism, and character consistency. It accepts text/images/videos as inputs, offers precise storyboard control, and a mobile app.
Openai video Inference Tools
| Times | $0.1 | /One Times |
Openai
gpt-5-2025-08-07
OpenAI | $10 / MTokens Direct/Fusion
GPT-5 is our flagship model for coding, reasoning, and agentic tasks across domains.
gpt-5-2025-08-07
OpenAI | $10 / MTokens
GPT-5 is our flagship model for coding, reasoning, and agentic tasks across domains.
Azure text image Visual
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
| Text Cache Prompt | $0.125 | /MTokens |
Azure
gpt-5
OpenAI | $10 / MTokens Direct/Fusion
OpenAI’s flagship 2025 model (52T params) with 400K context. Excels in math (94.6% AIME), coding (74.9% SWE-bench), and video understanding. Reduces hallucinations, offers tiered pricing, and links Google services .
gpt-5
OpenAI | $10 / MTokens
OpenAI’s flagship 2025 model (52T params) with 400K context. Excels in math (94.6% AIME), coding (74.9% SWE-bench), and video understanding. Reduces hallucinations, offers tiered pricing, and links Google services .
Openai text Inference
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
Openai
gpt-5.1
OpenAI | $10 / MTokens Direct/Fusion
GPT-5.1 is our flagship model for coding and agentic tasks with configurable reasoning and non-reasoning effort.
gpt-5.1
OpenAI | $10 / MTokens
GPT-5.1 is our flagship model for coding and agentic tasks with configurable reasoning and non-reasoning effort.
Openai text Inference
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
Openai
o1-2024-12-17
OpenAI | $60 / MTokens Direct/Fusion
Compared with the o1-preview version, it has significant improvements in accuracy, efficiency and flexibility, and also has new features such as function calling, vision capabilities and structured outputs.
o1-2024-12-17
OpenAI | $60 / MTokens
Compared with the o1-preview version, it has significant improvements in accuracy, efficiency and flexibility, and also has new features such as function calling, vision capabilities and structured outputs.
Openai text Inference
| Text Prompt | $15 | /MTokens |
| Text Completion | $60 | /MTokens |
| Text Cache Prompt | $7.5 | /MTokens |
Openai
gpt-3.5-turbo-0125
gpt | OpenAI | $1.5 / MTokens Direct/Fusion
It has a 16K context window, higher accuracy in response formatting, and fixed non - English encoding bugs. With input price cut by 50% and output by 25%, it's great for chatbots, content gen, and code doc tasks.
gpt-3.5-turbo-0125
OpenAI | $1.5 / MTokens
It has a 16K context window, higher accuracy in response formatting, and fixed non - English encoding bugs. With input price cut by 50% and output by 25%, it's great for chatbots, content gen, and code doc tasks.
Openai text Inference
| Text Prompt | $0.5 | /MTokens |
| Text Completion | $1.5 | /MTokens |
Openai
gpt-4o-mini-2024-07-18
OpenAI | $0.6 / MTokens Direct/Fusion
a cost-efficient small model by OpenAI. It scores 82% on MMLU, outperforming some models. With 128k context window, it supports text & vision in API, priced affordably for various AI applications.
gpt-4o-mini-2024-07-18
OpenAI | $0.6 / MTokens
a cost-efficient small model by OpenAI. It scores 82% on MMLU, outperforming some models. With 128k context window, it supports text & vision in API, priced affordably for various AI applications.
Openai text Inference Visual
| Text Prompt | $0.15 | /MTokens |
| Text Completion | $0.6 | /MTokens |
| Text Cache Prompt | $0.08 | /MTokens |
Openai
gpt-4.1
OpenAI | $8 / MTokens Direct/Fusion
GPT-4.1 excels at instruction following and tool calling, with broad knowledge across domains. It features a 1M token context window, and low latency without a reasoning step.
gpt-4.1
OpenAI | $8 / MTokens
GPT-4.1 excels at instruction following and tool calling, with broad knowledge across domains. It features a 1M token context window, and low latency without a reasoning step.
Openai text image Visual
| Text Prompt | $2 | /MTokens |
| Text Completion | $8 | /MTokens |
| Text Cache Prompt | $0.5 | /MTokens |
Openai
o4-mini-2025-04-16
OpenAI | $4.4 / MTokens Direct/Fusion
It is a lightweight version of o3, capable of handling text, images and audio. It can intelligently use and combine various tools in ChatGPT. In the AIME 2024 and 2025 math competition questions, its accuracy rates reached 93.4% and 92.7% respectively.
o4-mini-2025-04-16
OpenAI | $4.4 / MTokens
It is a lightweight version of o3, capable of handling text, images and audio. It can intelligently use and combine various tools in ChatGPT. In the AIME 2024 and 2025 math competition questions, its accuracy rates reached 93.4% and 92.7% respectively.
Azure text Inference
| Text Prompt | $1.1 | /MTokens |
| Text Completion | $4.4 | /MTokens |
| Text Cache Prompt | $0.28 | /MTokens |
Azure
sora-2-pro-chat
OpenAI | $1.5 / One Times Direct/Fusion
sora-2-pro-chat
OpenAI | $1.5 / One Times
Openai text image video Inference Visual Tools
| Times | $1.5 | /One Times |
Openai
gpt-5-mini
OpenAI | $2 / MTokens Direct/Fusion
GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.
gpt-5-mini
OpenAI | $2 / MTokens
GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.
Openai text image Visual
| Text Prompt | $0.25 | /MTokens |
| Text Completion | $2 | /MTokens |
| Text Cache Prompt | $0.025 | /MTokens |
Openai
gpt-4o-2024-05-13
128K context | OpenAI | $15 / MTokens Direct/Fusion
It can process text, audio, and images. With a 128K context window, it matches GPT - 4 Turbo on text and code, yet is faster and 50% cheaper on API. It excels in vision and audio understanding.
gpt-4o-2024-05-13
OpenAI | $15 / MTokens
It can process text, audio, and images. With a 128K context window, it matches GPT - 4 Turbo on text and code, yet is faster and 50% cheaper on API. It excels in vision and audio understanding.
Openai text Inference
| Text Prompt | $5 | /MTokens |
| Text Completion | $15 | /MTokens |
Openai
gemini-2.5-pro
Google | $10 / MTokens Direct/Fusion
DeepMind’s flagship, boasting 1M-token context (upgradable to 2M) and strong multimodality. Excels in math reasoning (92% AIME 2024), coding (63.8% SWE-bench), with 89.8% Global MMLU for multilingual tasks .
gemini-2.5-pro
Google | $10 / MTokens
DeepMind’s flagship, boasting 1M-token context (upgradable to 2M) and strong multimodality. Excels in math reasoning (92% AIME 2024), coding (63.8% SWE-bench), with 89.8% Global MMLU for multilingual tasks .
GoogleVertexAI text Inference
| Text Prompt | $1.25 | /MTokens |
| Text Completion | $10 | /MTokens |
GoogleVertexAI
gemini-3-pro-preview
Google | $12 / MTokens Direct/Fusion
Gemini 3 is most intelligent model family to date, built on a foundation of state-of-the-art reasoning. It is designed to bring any idea to life by mastering agentic workflows, autonomous coding, and complex multimodal tasks.
gemini-3-pro-preview
Google | $12 / MTokens
Gemini 3 is most intelligent model family to date, built on a foundation of state-of-the-art reasoning. It is designed to bring any idea to life by mastering agentic workflows, autonomous coding, and complex multimodal tasks.
GoogleVertexAI text Inference Tools
| Text Prompt | $2 | /MTokens |
| Text Completion | $12 | /MTokens |
GoogleVertexAI
gemini-3-pro-image-preview
Google | $12 / MTokens Direct/Fusion
Gemini 3 is most intelligent model family to date, built on a foundation of state-of-the-art reasoning. It is designed to bring any idea to life by mastering agentic workflows, autonomous coding, and complex multimodal tasks.
gemini-3-pro-image-preview
Google | $12 / MTokens
Gemini 3 is most intelligent model family to date, built on a foundation of state-of-the-art reasoning. It is designed to bring any idea to life by mastering agentic workflows, autonomous coding, and complex multimodal tasks.
GoogleVertexAI text image Visual
| Text Prompt | $2 | /MTokens |
| Text Completion | $12 | /MTokens |
GoogleVertexAI
gemini-2.5-flash-image
Google | $30 / MTokens Direct/Fusion
Gemini 2.5 Flash Image is optimized for image understanding and generation, and offers excellent value for money. Utilizing the speed and cost-effectiveness of Gemini 2.5 Flash, Gemini 2.5 Flash Image provides fast and efficient image generation and modification capabilities.
gemini-2.5-flash-image
Google | $30 / MTokens
Gemini 2.5 Flash Image is optimized for image understanding and generation, and offers excellent value for money. Utilizing the speed and cost-effectiveness of Gemini 2.5 Flash, Gemini 2.5 Flash Image provides fast and efficient image generation and modification capabilities.
GoogleVertexAI text image Inference Visual
| Text Prompt | $0.3 | /MTokens |
| Text Completion | $30 | /MTokens |
GoogleVertexAI
gemini-2.5-flash
Google | $2.5 / MTokens Direct/Fusion
Gemini 2.5 Flash, launched by Google, is a cost - effective model. It can reason before replying, enhancing accuracy. Ideal for large - scale, low - latency, high - data - volume tasks needing thought, it offers adaptable thinking for a balance between performance and cost.
gemini-2.5-flash
Google | $2.5 / MTokens
Gemini 2.5 Flash, launched by Google, is a cost - effective model. It can reason before replying, enhancing accuracy. Ideal for large - scale, low - latency, high - data - volume tasks needing thought, it offers adaptable thinking for a balance between performance and cost.
GoogleVertexAI text Inference
| Text Prompt | $0.3 | /MTokens |
| Text Completion | $2.5 | /MTokens |
| Text Cache Prompt | $0.08 | /MTokens |
GoogleVertexAI
veo3.1
Google | $1.5 / One Times Direct/Fusion
Veo3-Fast, a Google DeepMind AI video generator, turns text/image prompts into 720p videos with synced audio (dialogue, effects) 2x faster than standard Veo3. At ~2-3 mins per clip, it’s 5x more cost-effective, ideal for social content and rapid prototyping globally
veo3.1
Google | $1.5 / One Times
Veo3-Fast, a Google DeepMind AI video generator, turns text/image prompts into 720p videos with synced audio (dialogue, effects) 2x faster than standard Veo3. At ~2-3 mins per clip, it’s 5x more cost-effective, ideal for social content and rapid prototyping globally
GoogleVertexAI video Visual
| Times | $1.5 | /One Times |
GoogleVertexAI
gemini-3-flash-preview
Google | $3 / MTokens Direct/Fusion
gemini-3-flash-preview
Google | $3 / MTokens
GoogleVertexAI text image video Inference Visual Tools
| Text Prompt | $0.5 | /MTokens |
| Text Completion | $3 | /MTokens |
GoogleVertexAI
gemini-2.5-flash-lite
Google | $0.4 / MTokens Direct/Fusion
DeepMind’s fastest, most cost-efficient 2.5 model (preview 2025), with 1M-token context and multimodality. Excels at high-volume/latency-sensitive tasks (translation, classification), supporting search & code execution .
gemini-2.5-flash-lite
Google | $0.4 / MTokens
DeepMind’s fastest, most cost-efficient 2.5 model (preview 2025), with 1M-token context and multimodality. Excels at high-volume/latency-sensitive tasks (translation, classification), supporting search & code execution .
GoogleVertexAI text Inference
| Text Prompt | $0.1 | /MTokens |
| Text Completion | $0.4 | /MTokens |
GoogleVertexAI
gemini-2.0-flash
Google | $0.6 / MTokens Direct/Fusion
gemini-2.0-flash
gemini-2.0-flash
Google | $0.6 / MTokens
gemini-2.0-flash
text image Inference Visual Tools
| Text Prompt | $0.15 | /MTokens |
| Text Completion | $0.6 | /MTokens |
