The AI Gateway maintains a catalog of known providers and models. This catalog enables automatic provider inference, model validation, and rich metadata for selection strategies.
This catalog is built into the gateway—no configuration required. Models from these providers are automatically available in passthrough mode.
How the catalog works
When you send a request with a model name like gpt-4o, the gateway:
- Looks up the model in its catalog
- Identifies the provider (OpenAI) and author (openai)
- Routes the request to the appropriate provider endpoint
- Applies any configured selection strategies
You can also explicitly specify providers using the provider:model format (for example, openai:gpt-4o).
Known providers
OpenAI
| Field | Value |
|---|
| Provider ID | openai |
| Aliases | openAI, open-ai, open-AI |
| Base URL | https://api.openai.com/v1/ |
| Website | openai.com |
OpenAI is an AI research organization that develops advanced artificial intelligence models, including GPT, for natural language understanding, generation, and other AI applications.
OpenAI models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|
gpt-5 | GPT-5 | 400,000 | 128,000 | text, image |
gpt-5.1 | GPT-5.1 | 400,000 | 128,000 | text, image |
gpt-5-mini | GPT-5 Mini | 400,000 | 128,000 | text, image |
gpt-5-nano | GPT-5 Nano | 400,000 | 128,000 | text, image |
gpt-5-chat-latest | GPT-5 Chat | 400,000 | 128,000 | text, image |
gpt-4.1 | GPT-4.1 | 1,000,000 | - | text, image |
gpt-4.1-mini | GPT-4.1 mini | 1,000,000 | - | text, image |
gpt-4.1-nano | GPT-4.1 nano | 1,000,000 | - | text, image |
gpt-4o | GPT-4o | 128,000 | 16,384 | text, image, audio |
o4-mini | O4-Mini | 200,000 | 100,000 | text |
o4-mini-deep-research | O4-Mini-Deep-Research | 200,000 | 100,000 | text |
o3-pro | O3-Pro | 128,000 | 100,000 | text |
o3 | O3 | 128,000 | 100,000 | text |
o3-deep-research | O3-Deep-Research | 200,000 | 100,000 | text |
o1-pro | O1-Pro | 200,000 | 100,000 | text |
o1 | O1 | 128,000 | 100,000 | text |
Anthropic
| Field | Value |
|---|
| Provider ID | anthropic |
| Aliases | Anthropic |
| Base URL | https://api.anthropic.com/v1/ |
| Website | anthropic.com |
Anthropic is an AI safety startup focused on developing safe, beneficial AI systems. Known for the Claude family of large language models.
Anthropic models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|
claude-3-haiku-20240307 | Claude Haiku 3 | 200,000 | 4,096 | text, image |
claude-3-5-haiku-latest | Claude Haiku 3.5 | 200,000 | 8,192 | text, image |
claude-3-7-sonnet-latest | Claude Sonnet 3.7 | 200,000 | 64,000 | text, image |
claude-sonnet-4-0 | Claude Sonnet 4 | 1,000,000 | 64,000 | text, image |
claude-opus-4-0 | Claude Opus 4 | 200,000 | 32,000 | text, image |
claude-opus-4-1 | Claude Opus 4.1 | 200,000 | 32,000 | text, image |
claude-sonnet-4-5 | Claude Sonnet 4.5 | 1,000,000 | 64,000 | text, image |
claude-opus-4-5 | Claude Opus 4.5 | 200,000 | 64,000 | text, image |
Google
| Field | Value |
|---|
| Provider ID | google |
| Aliases | Google, gemini |
| Base URL | https://generativelanguage.googleapis.com/v1beta/openai/ |
| Website | aistudio.google.com |
Google’s AI services including Gemini models for natural language understanding, generation, and multimodal AI applications.
Google models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|
gemini-2.5-pro | Gemini 2.5 Pro | 1,048,576 | 65,535 | text, image, audio, video, file |
gemini-2.5-flash | Gemini 2.5 Flash | 1,048,576 | 65,535 | text, image, audio, video, file |
gemini-2.5-flash-lite | Gemini 2.5 Flash-Lite | 1,048,576 | 65,535 | text, image, audio, video, file |
gemini-2.0-flash | Gemini 2.0 Flash | 1,048,576 | 8,192 | text, image, audio, video, file |
gemini-2.0-flash-lite | Gemini 2.0 Flash-Lite | 1,048,576 | 8,192 | text, image, audio, video, file |
gemini-3-pro-preview | Gemini 3 Pro Preview | 1,000,000 | 65,536 | text, image, audio, video, file |
DeepSeek
| Field | Value |
|---|
| Provider ID | deepseek |
| Aliases | DeepSeek, deep-seek |
| Base URL | https://api.deepseek.com |
| Website | deepseek.com |
DeepSeek is an AI company focused on advancing artificial general intelligence through cutting-edge research and development of large language models.
DeepSeek models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|
deepseek-reasoner | deepseek-reasoner | 128,000 | 64,000 | text |
deepseek-chat | deepseek-chat | 128,000 | 8,192 | text |
OpenRouter
| Field | Value |
|---|
| Provider ID | openrouter |
| Base URL | https://openrouter.ai/api/v1/ |
| Website | openrouter.ai |
OpenRouter is a unified API that provides access to multiple AI models from various providers through a single endpoint.
Hyperbolic
| Field | Value |
|---|
| Provider ID | hyperbolic |
| Base URL | https://api.hyperbolic.xyz/v1/ |
| Website | hyperbolic.xyz |
Hyperbolic provides high-performance inference for open-source models.
InceptionLabs
| Field | Value |
|---|
| Provider ID | inceptionlabs |
| Website | inceptionlabs.ai |
InceptionLabs develops diffusion-based language models for fast, efficient text generation.
Inference.net
| Field | Value |
|---|
| Provider ID | inference-net |
| Base URL | https://api.inference.net/v1/ |
| Website | inference.net |
Inference.net provides a distributed inference network for running AI models at scale.
Using models from the catalog
Simple model reference
Reference models directly by their ID:
{
"model": "gpt-4o",
"messages": [{"role": "user", "content": "Hello"}]
}
Explicit provider
Use the provider:model format for explicit routing:
{
"model": "openai:gpt-4o",
"messages": [{"role": "user", "content": "Hello"}]
}
For models hosted by third-party providers, use the author/model format:
{
"model": "openrouter:meta-llama/llama-3-70b",
"messages": [{"role": "user", "content": "Hello"}]
}
Custom providers and models
You can extend the catalog by configuring custom providers in your Traffic Policy. See Custom Providers for configuration details.
providers:
- id: "custom-ollama"
base_url: "https://ollama.internal"
models:
- id: "llama3"
- id: "mistral"
Catalog updates
The model catalog is updated periodically to include new models and providers. For immediate access to models not yet in the catalog, add them explicitly to your provider configuration.