The AI Gateway maintains a catalog of known providers and models. This catalog enables automatic provider inference, model validation, and rich metadata for selection strategies.
The catalog is built into the gateway and requires no configuration. Models from the providers listed below are automatically available in passthrough mode.

How the catalog works

When you send a request with a model name like gpt-4o, the gateway:
  1. Looks up the model in its catalog
  2. Identifies the provider (OpenAI) and author (openai)
  3. Routes the request to the appropriate provider endpoint
  4. Applies any configured selection strategies
You can also explicitly specify providers using the provider:model format (for example, openai:gpt-4o).
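
The lookup steps above can be sketched in Python. This is a minimal illustration under stated assumptions, not the gateway's implementation: the `PROVIDERS` and `MODEL_PROVIDER` dictionaries and the `resolve` function are hypothetical names, and only a few entries from this page's tables are included.

```python
# Hypothetical in-memory catalog; the values are copied from this page's tables.
PROVIDERS = {
    "openai": "https://api.openai.com/v1/",
    "anthropic": "https://api.anthropic.com/v1/",
    "deepseek": "https://api.deepseek.com",
}
MODEL_PROVIDER = {
    "gpt-4o": "openai",
    "claude-opus-4-1": "anthropic",
    "deepseek-chat": "deepseek",
}


def resolve(model: str) -> dict:
    """Map a model string to a provider ID, model ID, and base URL."""
    provider, sep, model_id = model.partition(":")
    if not sep:
        # No explicit provider: infer it from the catalog (steps 1 and 2).
        model_id = model
        provider = MODEL_PROVIDER.get(model_id)
        if provider is None:
            raise LookupError(f"model not in catalog: {model_id}")
    if provider not in PROVIDERS:
        raise LookupError(f"unknown provider: {provider}")
    # Step 3: the base URL is where the request would be routed.
    return {"provider": provider, "model": model_id, "base_url": PROVIDERS[provider]}


print(resolve("gpt-4o"))
print(resolve("openai:gpt-4o"))
```

In this sketch both calls resolve to the same provider and base URL; step 4 (selection strategies) is left out because it depends on your configuration.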

Known providers

OpenAI

| Field | Value |
| --- | --- |
| Provider ID | openai |
| Aliases | openAI, open-ai, open-AI |
| Base URL | https://api.openai.com/v1/ |
| Website | openai.com |
OpenAI is an AI research organization that develops advanced artificial intelligence models, including GPT, for natural language understanding, generation, and other AI applications.

OpenAI models

| Model ID | Display Name | Context Window | Output Tokens | Modalities |
| --- | --- | --- | --- | --- |
| gpt-5 | GPT-5 | 400,000 | 128,000 | text, image |
| gpt-5.1 | GPT-5.1 | 400,000 | 128,000 | text, image |
| gpt-5-mini | GPT-5 Mini | 400,000 | 128,000 | text, image |
| gpt-5-nano | GPT-5 Nano | 400,000 | 128,000 | text, image |
| gpt-5-chat-latest | GPT-5 Chat | 400,000 | 128,000 | text, image |
| gpt-4.1 | GPT-4.1 | 1,000,000 | - | text, image |
| gpt-4.1-mini | GPT-4.1 mini | 1,000,000 | - | text, image |
| gpt-4.1-nano | GPT-4.1 nano | 1,000,000 | - | text, image |
| gpt-4o | GPT-4o | 128,000 | 16,384 | text, image, audio |
| o4-mini | O4-Mini | 200,000 | 100,000 | text |
| o4-mini-deep-research | O4-Mini-Deep-Research | 200,000 | 100,000 | text |
| o3-pro | O3-Pro | 128,000 | 100,000 | text |
| o3 | O3 | 128,000 | 100,000 | text |
| o3-deep-research | O3-Deep-Research | 200,000 | 100,000 | text |
| o1-pro | O1-Pro | 200,000 | 100,000 | text |
| o1 | O1 | 128,000 | 100,000 | text |
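
As an illustration of how metadata like the table above could feed a selection decision, here is a minimal Python sketch. The `ModelInfo` class and `candidates` function are hypothetical and do not represent the gateway's own selection strategy syntax; the rows are a small subset of the OpenAI table.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ModelInfo:
    model_id: str
    context_window: int
    output_tokens: Optional[int]  # None where the table shows "-"
    modalities: Tuple[str, ...]


# A few rows copied from the OpenAI models table.
MODELS = [
    ModelInfo("gpt-5", 400_000, 128_000, ("text", "image")),
    ModelInfo("gpt-4.1", 1_000_000, None, ("text", "image")),
    ModelInfo("gpt-4o", 128_000, 16_384, ("text", "image", "audio")),
    ModelInfo("o3", 128_000, 100_000, ("text",)),
]


def candidates(min_context: int, modality: str) -> list:
    """Return IDs of models with enough context that accept the given modality."""
    return [
        m.model_id
        for m in MODELS
        if m.context_window >= min_context and modality in m.modalities
    ]


print(candidates(200_000, "image"))  # ['gpt-5', 'gpt-4.1']
print(candidates(100_000, "audio"))  # ['gpt-4o']
```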

Anthropic

| Field | Value |
| --- | --- |
| Provider ID | anthropic |
| Aliases | Anthropic |
| Base URL | https://api.anthropic.com/v1/ |
| Website | anthropic.com |

Anthropic is an AI safety startup focused on developing safe, beneficial AI systems. It is known for the Claude family of large language models.

Anthropic models

| Model ID | Display Name | Context Window | Output Tokens | Modalities |
| --- | --- | --- | --- | --- |
| claude-3-haiku-20240307 | Claude Haiku 3 | 200,000 | 4,096 | text, image |
| claude-3-5-haiku-latest | Claude Haiku 3.5 | 200,000 | 8,192 | text, image |
| claude-3-7-sonnet-latest | Claude Sonnet 3.7 | 200,000 | 64,000 | text, image |
| claude-sonnet-4-0 | Claude Sonnet 4 | 1,000,000 | 64,000 | text, image |
| claude-opus-4-0 | Claude Opus 4 | 200,000 | 32,000 | text, image |
| claude-opus-4-1 | Claude Opus 4.1 | 200,000 | 32,000 | text, image |
| claude-sonnet-4-5 | Claude Sonnet 4.5 | 1,000,000 | 64,000 | text, image |
| claude-opus-4-5 | Claude Opus 4.5 | 200,000 | 64,000 | text, image |

Google

| Field | Value |
| --- | --- |
| Provider ID | google |
| Aliases | Google, gemini |
| Base URL | https://generativelanguage.googleapis.com/v1beta/openai/ |
| Website | aistudio.google.com |

Google's AI services include Gemini models for natural language understanding, generation, and multimodal AI applications.

Google models

| Model ID | Display Name | Context Window | Output Tokens | Modalities |
| --- | --- | --- | --- | --- |
| gemini-2.5-pro | Gemini 2.5 Pro | 1,048,576 | 65,535 | text, image, audio, video, file |
| gemini-2.5-flash | Gemini 2.5 Flash | 1,048,576 | 65,535 | text, image, audio, video, file |
| gemini-2.5-flash-lite | Gemini 2.5 Flash-Lite | 1,048,576 | 65,535 | text, image, audio, video, file |
| gemini-2.0-flash | Gemini 2.0 Flash | 1,048,576 | 8,192 | text, image, audio, video, file |
| gemini-2.0-flash-lite | Gemini 2.0 Flash-Lite | 1,048,576 | 8,192 | text, image, audio, video, file |
| gemini-3-pro-preview | Gemini 3 Pro Preview | 1,000,000 | 65,536 | text, image, audio, video, file |

DeepSeek

| Field | Value |
| --- | --- |
| Provider ID | deepseek |
| Aliases | DeepSeek, deep-seek |
| Base URL | https://api.deepseek.com |
| Website | deepseek.com |
DeepSeek is an AI company focused on advancing artificial general intelligence through cutting-edge research and development of large language models.

DeepSeek models

| Model ID | Display Name | Context Window | Output Tokens | Modalities |
| --- | --- | --- | --- | --- |
| deepseek-reasoner | deepseek-reasoner | 128,000 | 64,000 | text |
| deepseek-chat | deepseek-chat | 128,000 | 8,192 | text |

OpenRouter

| Field | Value |
| --- | --- |
| Provider ID | openrouter |
| Base URL | https://openrouter.ai/api/v1/ |
| Website | openrouter.ai |
OpenRouter is a unified API that provides access to multiple AI models from various providers through a single endpoint.

Hyperbolic

| Field | Value |
| --- | --- |
| Provider ID | hyperbolic |
| Base URL | https://api.hyperbolic.xyz/v1/ |
| Website | hyperbolic.xyz |
Hyperbolic provides high-performance inference for open-source models.

InceptionLabs

| Field | Value |
| --- | --- |
| Provider ID | inceptionlabs |
| Website | inceptionlabs.ai |
InceptionLabs develops diffusion-based language models for fast, efficient text generation.

Inference.net

| Field | Value |
| --- | --- |
| Provider ID | inference-net |
| Base URL | https://api.inference.net/v1/ |
| Website | inference.net |
Inference.net provides a distributed inference network for running AI models at scale.

Using models from the catalog

Simple model reference

Reference models directly by their ID:
{
  "model": "gpt-4o",
  "messages": [{"role": "user", "content": "Hello"}]
}

Explicit provider

Use the provider:model format for explicit routing:
{
  "model": "openai:gpt-4o",
  "messages": [{"role": "user", "content": "Hello"}]
}

Author/model format

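For models hosted by third-party providers, use the author/model format after the provider prefix. In this example, openrouter is the provider and meta-llama is the author: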
{
  "model": "openrouter:meta-llama/llama-3-70b",
  "messages": [{"role": "user", "content": "Hello"}]
}
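
The three reference formats above can be told apart with a small parser. The following Python sketch is illustrative only: `ModelRef` and `parse_model_ref` are hypothetical names, and the rule of splitting on the first `:` and then the first `/` is an assumption based on the examples on this page, not a documented grammar.

```python
from typing import NamedTuple, Optional


class ModelRef(NamedTuple):
    provider: Optional[str]  # None when the provider must be inferred
    author: Optional[str]    # None when no author/ prefix is present
    model: str


def parse_model_ref(value: str) -> ModelRef:
    """Split a model string into provider, author, and model name."""
    provider, colon, rest = value.partition(":")
    if not colon:
        # No "provider:" prefix, so the whole string is the model part.
        provider, rest = None, value
    author, slash, name = rest.partition("/")
    if not slash:
        # No "author/" prefix.
        author, name = None, rest
    return ModelRef(provider, author, name)


for ref in ("gpt-4o", "openai:gpt-4o", "openrouter:meta-llama/llama-3-70b"):
    print(parse_model_ref(ref))
```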

Custom providers and models

You can extend the catalog by configuring custom providers in your Traffic Policy. See Custom Providers for configuration details.
providers:
  - id: "custom-ollama"
    base_url: "https://ollama.internal"
    models:
      - id: "llama3"
      - id: "mistral"
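
Conceptually, a configuration like this adds entries alongside the built-in catalog. The Python sketch below illustrates that idea with plain dictionaries that mirror the YAML above. The `extend_catalog` function is hypothetical, and the choice to let built-in entries take precedence over custom ones is an assumption for the example, not documented gateway behavior.

```python
# A small stand-in for the built-in catalog (model ID -> provider ID).
BUILT_IN = {"gpt-4o": "openai", "deepseek-chat": "deepseek"}

# Mirrors the YAML configuration shown above.
CUSTOM_PROVIDERS = [
    {
        "id": "custom-ollama",
        "base_url": "https://ollama.internal",
        "models": [{"id": "llama3"}, {"id": "mistral"}],
    }
]


def extend_catalog(built_in: dict, providers: list) -> dict:
    """Return a new model -> provider map that includes custom models."""
    merged = dict(built_in)
    for provider in providers:
        for model in provider.get("models", []):
            # Assumption: an existing built-in entry is kept as-is.
            merged.setdefault(model["id"], provider["id"])
    return merged


catalog = extend_catalog(BUILT_IN, CUSTOM_PROVIDERS)
print(catalog["llama3"])  # custom-ollama
print(catalog["gpt-4o"])  # openai
```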

Catalog updates

The model catalog is updated periodically to include new models and providers. For immediate access to models not yet in the catalog, add them explicitly to your provider configuration.