https://gateway.ngrok.ai, so you can set defaults like timeouts and token limits once instead of repeating them on every access key configuration.
Timeouts
| Setting | Default | Description |
|---|---|---|
perRequestTimeout | 60s | Max time for a single upstream attempt |
totalTimeout | 120s | Max time for a request including all failover attempts |
perRequestTimeout, the gateway moves to the next failover candidate. If total time exceeds totalTimeout, the gateway returns an error.
Token limits
| Setting | Default | Description |
|---|---|---|
maxInputTokens | none | Reject requests above estimated input tokens |
maxOutputTokens | none | Cap completion length |
Via dashboard
- Open Settings in app.ngrok.ai
- Edit Total timeout, Per-request timeout, and optional token limits
- Save
https://gateway.ngrok.ai) and lets you manage AI Gateway API keys for automation.
Via the AI Gateway API
Get current settings:null:
30s, 2m).
See the Gateway configuration API reference.
Next steps
- How It Works: Request flow and failover
- Choose how to reach providers: Credits, provider keys, and self-run models
- Access Key Configurations: Per-key scope and routing