Providers

Every provider in one interface

Access 200+ frontier and specialist models — GPT-5, Claude Opus, Gemini 2.5, Grok 4, Mistral Large, DeepSeek V3, Groq LLaMA, Perplexity Sonar, and more — without juggling logins or invoices.

Browse models Compare plans

Provider network at a glance

Your workspace ships with every major provider ready to go. Auto Mode picks the right model for each task, and you can switch providers manually with a click.

Reasoning & General Intelligence

OpenAI
Website
Creators of GPT-5.5, GPT-5.5 Pro, GPT-5.4, and the lighter GPT-5.4 Mini/Nano family
Anthropic
Website
Claude family of models with extended thinking and safety controls
xAI
Website
Grok 4.3 and Grok 4.20 multimodal reasoning models from xAI
Google
Website
Gemini 3.1 and 2.5 multimodal models with million-token context

High-speed & cost-efficient

Groq
Website
Ultra-low latency inference for LLaMA and Mixtral models
Mistral
Website
Efficient European LLMs optimised for production workloads
DeepSeek
Website
High-reasoning models optimised for coding and analysis

Domain & specialist

Perplexity
Website
Realtime research with citations and live web search
Cohere
Website
Multilingual and enterprise-ready Command models
Fireworks AI
Website
High-performance hosted inference for open-weight models
Together AI
Website
Hosted LLaMA, Mixtral, and custom finetunes with generous limits

Provider coverage

Anthropic🤖

Visit

Anthropic Claude models including Claude Opus 4.8 and 4.7, the latest frontier models for long-running, complex tasks.

Flagship models

Claude Opus 4.8
Claude Opus 4.7
Claude Sonnet 4.6

OpenAI🔥

Visit

OpenAI delivers state-of-the-art multimodal and reasoning models including GPT-5.5, GPT-5.5 Pro, GPT-5.4, GPT-5.4 Pro, GPT-5.4 Mini, GPT-5.4 Nano, o3-pro, o4-mini, and the broader GPT-5 family.

Flagship models

GPT-5.5 Pro
GPT-5.5
GPT-5.4 Pro
GPT-5.4

Google🟢

Visit

Google Gemini pairs long-context reasoning, multimodal understanding, image-native generation, and open Gemma families spanning Gemini 3.1, Gemini 2.5, and Gemma 4 tiers.

Flagship models

Gemini 3.1 Pro
Gemini 2.5 Pro
Gemini 3.1 Flash Image

xAI𝕏

Visit

xAI Grok delivers high-context reasoning, fast analysis, multimodal understanding, and tool-heavy agent workflows across Grok 4.3 and the active Grok 4.20 variants.

Flagship models

Grok 4.3
Grok 4.20
Grok 4.20 Fast
Grok 4.20 Multi-Agent

Groq⚡

Visit

Groq delivers high-throughput, low-latency inference for open-weight models like LLaMA 3.3 and Mixtral — perfect for prototyping and production agents.

Flagship models

LLaMA 3.3 70B
LLaMA 3.1 70B
Mixtral 8x7B

DeepSeek🔍

Visit

DeepSeek V4 and DeepSeek Reasoner provide deliberate reasoning, coding, and competitive pricing for engineering and research teams with long-context support.

Flagship models

DeepSeek V4 Pro
DeepSeek V4 Flash
DeepSeek Reasoner

Mistral🌪️

Visit

Mistral provides high-quality open and hosted models — including Mistral Large 3, Devstral 2, and Mixtral — balancing speed, accuracy, and multilingual support.

Flagship models

Mistral Large 3 2512
Devstral 2 2512
Mixtral 8x22B

Perplexity🔮

Visit

Perplexity Sonar blends DeepSeek reasoning with live web retrieval, returning sourced answers for analysts and customer teams.

Flagship models

Sonar Pro
Sonar Reasoning Pro

OpenRouter🌐

Visit

OpenRouter aggregates frontier and experimental models across Anthropic, OpenAI, DeepSeek, Z.ai, ByteDance Seed, and many more through a single API.

Flagship models

Claude Opus 4.8
GPT-5.5
GLM 5.1

Together AI🤝

Visit

Together AI offers performant hosted open models — including LLaMA 3.1 405B and Mixtral — with flexible pricing and finetuning options.

Flagship models

LLaMA 3.1 405B
LLaMA 3.1 70B
Mixtral 8x22B

Fireworks AI🔥

Visit

Fireworks AI accelerates open-source models like DeepSeek and LLaMA with tuned inference, convenient endpoints, and aggressive pricing.

Flagship models

DeepSeek R1
Mixtral 8x22B
LLaMA 3.1 405B

Cohere🌊

Visit

Cohere Command models focus on multilingual accuracy, enterprise guardrails, long-context tool use, and the newer reasoning, vision, and translation specialisations.

Flagship models

Command A 2025
Command A Reasoning
Command A Vision

MiniMax⚡

Visit

MiniMax specializes in compact, high-efficiency models optimized for agent workflows, coding, and tool use with exceptional cost-performance ratio.

Flagship models

MiniMax M2
MiniMax M1

llama.cpp🖥️

Visit

Local OpenAI-compatible llama.cpp runtime for private, zero-metered chat when the local server is running.

Flagship models

Qwen3.6 Local Fast
Qwen3.6 Local Thinking

Need a provider that isn’t listed? Email [email protected] and we’ll onboard it for your workspace.