Anthropic Claude models including Claude Opus 4.8 and 4.7, the latest frontier models for long-running, complex tasks.
Flagship models
- Claude Opus 4.8
- Claude Opus 4.7
- Claude Sonnet 4.6
OpenAI delivers state-of-the-art multimodal and reasoning models including GPT-5.5, GPT-5.5 Pro, GPT-5.4, GPT-5.4 Pro, GPT-5.4 Mini, GPT-5.4 Nano, o3-pro, o4-mini, and the broader GPT-5 family.
Flagship models
- GPT-5.5 Pro
- GPT-5.5
- GPT-5.4 Pro
- GPT-5.4
Google Gemini pairs long-context reasoning, multimodal understanding, image-native generation, and open Gemma families spanning Gemini 3.1, Gemini 2.5, and Gemma 4 tiers.
Flagship models
- Gemini 3.1 Pro
- Gemini 2.5 Pro
- Gemini 3.1 Flash Image
xAI Grok delivers high-context reasoning, fast analysis, multimodal understanding, and tool-heavy agent workflows across Grok 4.3 and the active Grok 4.20 variants.
Flagship models
- Grok 4.3
- Grok 4.20
- Grok 4.20 Fast
- Grok 4.20 Multi-Agent
Groq delivers high-throughput, low-latency inference for open-weight models like LLaMA 3.3 and Mixtral โ perfect for prototyping and production agents.
Flagship models
- LLaMA 3.3 70B
- LLaMA 3.1 70B
- Mixtral 8x7B
DeepSeek V4 and DeepSeek Reasoner provide deliberate reasoning, coding, and competitive pricing for engineering and research teams with long-context support.
Flagship models
- DeepSeek V4 Pro
- DeepSeek V4 Flash
- DeepSeek Reasoner
Mistral provides high-quality open and hosted models โ including Mistral Large 3, Devstral 2, and Mixtral โ balancing speed, accuracy, and multilingual support.
Flagship models
- Mistral Large 3 2512
- Devstral 2 2512
- Mixtral 8x22B
Perplexity Sonar blends DeepSeek reasoning with live web retrieval, returning sourced answers for analysts and customer teams.
Flagship models
- Sonar Pro
- Sonar Reasoning Pro
OpenRouter aggregates frontier and experimental models across Anthropic, OpenAI, DeepSeek, Z.ai, ByteDance Seed, and many more through a single API.
Flagship models
- Claude Opus 4.8
- GPT-5.5
- GLM 5.1
Together AI offers performant hosted open models โ including LLaMA 3.1 405B and Mixtral โ with flexible pricing and finetuning options.
Flagship models
- LLaMA 3.1 405B
- LLaMA 3.1 70B
- Mixtral 8x22B
Fireworks AI accelerates open-source models like DeepSeek and LLaMA with tuned inference, convenient endpoints, and aggressive pricing.
Flagship models
- DeepSeek R1
- Mixtral 8x22B
- LLaMA 3.1 405B
Cohere Command models focus on multilingual accuracy, enterprise guardrails, long-context tool use, and the newer reasoning, vision, and translation specialisations.
Flagship models
- Command A 2025
- Command A Reasoning
- Command A Vision
MiniMax specializes in compact, high-efficiency models optimized for agent workflows, coding, and tool use with exceptional cost-performance ratio.
Local OpenAI-compatible llama.cpp runtime for private, zero-metered chat when the local server is running.
Flagship models
- Qwen3.6 Local Fast
- Qwen3.6 Local Thinking