Plans & Limits
| Standard | Pro | |
|---|---|---|
| Price | $20/mo | $60/mo |
| RPM (requests/min) | 60 | 200 |
| TPM (tokens/min) | 3,333 | 13,333 |
| Duration | 30 days | 30 days |
| All models | Yes | Yes |
Rate limits are enforced per key and reset every minute. Each API key has its own independent limits.
Available models
Section titled “Available models”All models are included in every plan. Use GET /v1/models for the live list.
Chat models
Section titled “Chat models”| Provider | Model | Input $/M tokens | Output $/M tokens |
|---|---|---|---|
| Meta | Llama 3.2 3B | $0.020 | $0.020 |
| Meta | Llama 3.1 8B | $0.020 | $0.050 |
| DeepSeek | DeepSeek V3.2 | $0.260 | $0.380 |
| DeepSeek | DeepSeek R1 | $0.500 | $2.150 |
| Gemini 2.5 Flash | $0.300 | $2.500 | |
| Gemini 2.5 Pro | $1.250 | $10.000 | |
| Anthropic | Claude 4 Sonnet | $3.300 | $16.500 |
| Anthropic | Claude 4 Opus | $16.500 | $82.500 |
Prices shown are our cost from the inference provider. With a subscription plan, you pay a flat monthly rate and get a rolling 5-hour budget window — not per-token charges.
Credits (pay-as-you-go)
Section titled “Credits (pay-as-you-go)”Don’t want a subscription? Top up credits starting at $10. Any amount accepted, no maximum.
Credit keys have 60 RPM. Budget is consumed as you use the API and never resets — top up again when depleted. See Credits guide.
Payment methods
Section titled “Payment methods”| Method | For |
|---|---|
| Card (Stripe) | Subscriptions and credits |
| USDC on Base | Subscriptions and credits |
| x402 | Pay-per-request (no account needed) |
All subscriptions last 30 days with no auto-renewal.
How rate limits work
Section titled “How rate limits work”Rate limits are enforced at the key level:
- RPM — Maximum requests per minute. Exceeding returns
429. - TPM — Maximum tokens per minute (input + output combined). Exceeding returns
429.
Limits reset every minute. Different keys do not share limits — each key has its own independent counters.