Unlimited Subscriptions
Unlimited subscriptions give you a dedicated model with guaranteed throughput — no budget caps, no token counting. You pick which time blocks you need and subscribe instantly. Your API key is live during your selected hours.
All endpoints require authentication. Use your management key (mk_):
    Authorization: Bearer mk_your_key

List available plans
GET /api/pools

Returns all active Unlimited plans.
    curl https://api.cheapestinference.com/api/pools \
      -H "Authorization: Bearer mk_your_key"

    {
      "success": true,
      "data": [
        {
          "id": "pool_uuid",
          "slug": "minimax-m3",
          "modelId": "MiniMax-M3",
          "modelName": "MiniMax M3",
          "description": "Dedicated MiniMax M3 with unlimited usage.",
          "status": "active",
          "totalSlots": 10,
          "minPricePerDay": "39.00",
          "manualProvisioning": false,
          "annualDiscount": 0.15
        }
      ]
    }

| Field | Description |
|---|---|
| status | draft (not listed), funding (accepting pledges), activating (spinning up), active (open for subscriptions), or paused |
| minPricePerDay | Lowest monthly price across all time blocks (USD, per seat) |
| manualProvisioning | If true, key is delivered manually (usually within hours, always within 24h) |
| annualDiscount | Fraction off when billed yearly (0.15 = 15% off). Applied when billingCycle: "year" is passed to subscribe. |
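As a rough illustration of reading this response, the sketch below pulls out the fields described in the table. The function name `summarize_plans` and the second and third sample plans are made up for the example; only the field names come from the response above. The endpoint is documented as returning active plans only, so the status filter is just a defensive check.

```python
from decimal import Decimal


def summarize_plans(response: dict) -> list[tuple[str, Decimal, bool]]:
    """Return (slug, minPricePerDay, manualProvisioning) for active plans, cheapest first."""
    rows = [
        (plan["slug"], Decimal(plan["minPricePerDay"]), bool(plan["manualProvisioning"]))
        for plan in response.get("data", [])
        if plan.get("status") == "active"
    ]
    # Prices arrive as strings; Decimal keeps them exact for sorting and display.
    return sorted(rows, key=lambda row: row[1])


# Sample payload: the first plan mirrors the documented response,
# the other two are hypothetical and exist only to exercise the code.
sample = {
    "success": True,
    "data": [
        {"slug": "minimax-m3", "status": "active",
         "minPricePerDay": "39.00", "manualProvisioning": False},
        {"slug": "example-model", "status": "active",
         "minPricePerDay": "19.00", "manualProvisioning": True},
        {"slug": "paused-model", "status": "paused",
         "minPricePerDay": "9.00", "manualProvisioning": False},
    ],
}

plans = summarize_plans(sample)
```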
Plan detail
GET /api/pools/:id

Returns full plan info including all hour slots. :id can be the UUID or slug.

    curl https://api.cheapestinference.com/api/pools/minimax-m3 \
      -H "Authorization: Bearer mk_your_key"

    {
      "success": true,
      "data": {
        "id": "pool_uuid",
        "slug": "minimax-m3",
        "modelId": "MiniMax-M3",
        "status": "active",
        "hourSlots": [
          { "id": "slot_uuid", "hour": 0, "slotIndex": 0, "pricePerDay": "4.88", "status": "available" },
          { "id": "slot_uuid", "hour": 1, "slotIndex": 0, "pricePerDay": "4.88", "status": "available" }
        ]
      }
    }

Each hourSlot represents one hour of one seat. Prices are monthly amounts distributed across the 8 hours in a block.
Subscribe
POST /api/pools/:id/subscribe

Subscribe using block names. Your card is charged immediately. No API key is created automatically; you mint it yourself afterward (see the note below the response).
Prerequisite: you need a saved payment method before subscribing. Add a card via POST /api/billing/setup-intent (or the Billing page in the dashboard) first; otherwise this returns 400 "No payment method found — add a card first".
Time blocks
Each day is divided into three fixed UTC blocks:
| Block | Hours (UTC) | Region |
|---|---|---|
| asia | 00:00–07:59 | Asia-Pacific |
| europe | 08:00–15:59 | Europe / Middle East |
| americas | 16:00–23:59 | Americas |
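Because the three blocks are equal 8-hour UTC windows, the `hour` values in a plan's `hourSlots` can be mapped to block names with integer division. The helper names below are illustrative and not part of the API.

```python
BLOCKS = ("asia", "europe", "americas")  # order matches the table above
HOURS_PER_BLOCK = 8


def block_for_hour(hour: int) -> str:
    """Map a UTC hour (0-23) to the block that contains it."""
    if not 0 <= hour <= 23:
        raise ValueError(f"hour must be in 0..23, got {hour}")
    return BLOCKS[hour // HOURS_PER_BLOCK]


def hours_for_block(block: str) -> list[int]:
    """List the UTC hours covered by a block name."""
    start = BLOCKS.index(block) * HOURS_PER_BLOCK  # raises ValueError on unknown names
    return list(range(start, start + HOURS_PER_BLOCK))
```

For instance, `block_for_hour(16)` falls in the americas block, and `hours_for_block("europe")` covers hours 8 through 15.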
Request body
| Parameter | Type | Required | Description |
|---|---|---|---|
| blocks | string[] | Yes | One or more: "asia", "europe", "americas" |
| quantity | integer | No | Seats per block (1–20, default 1) |
| billingCycle | string | No | "month" (default) or "year". Annual billing charges upfront with the pool’s annualDiscount applied |
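A client may want to check these constraints before sending the request. The following is a minimal sketch of such a check; `build_subscribe_body` is a hypothetical helper, and the server remains the authority on what it accepts.

```python
VALID_BLOCKS = {"asia", "europe", "americas"}
VALID_CYCLES = {"month", "year"}


def build_subscribe_body(blocks, quantity=1, billing_cycle="month") -> dict:
    """Validate the documented parameters and return the JSON body as a dict."""
    blocks = list(blocks)
    if not blocks:
        raise ValueError("at least one block is required")
    unknown = [b for b in blocks if b not in VALID_BLOCKS]
    if unknown:
        raise ValueError(f"unknown blocks: {unknown}")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(quantity, bool) or not isinstance(quantity, int) or not 1 <= quantity <= 20:
        raise ValueError("quantity must be an integer from 1 to 20")
    if billing_cycle not in VALID_CYCLES:
        raise ValueError('billingCycle must be "month" or "year"')
    return {"blocks": blocks, "quantity": quantity, "billingCycle": billing_cycle}
```

Passing the defaults explicitly, as this sketch does, is a design choice; omitting `quantity` and `billingCycle` should be equivalent according to the table.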
Example: one block, monthly

    curl -X POST https://api.cheapestinference.com/api/pools/minimax-m3/subscribe \
      -H "Authorization: Bearer mk_your_key" \
      -H "Content-Type: application/json" \
      -d '{"blocks": ["americas"]}'

Example: all blocks, 2 seats, billed annually

    curl -X POST https://api.cheapestinference.com/api/pools/minimax-m3/subscribe \
      -H "Authorization: Bearer mk_your_key" \
      -H "Content-Type: application/json" \
      -d '{"blocks": ["asia", "europe", "americas"], "quantity": 2, "billingCycle": "year"}'

Annual amount = monthlyPrice × 12 × (1 − pool.annualDiscount). The discount fraction is returned as annualDiscount on the pool response. The charge occurs upfront, and the subscription renews yearly until canceled.
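The annual formula can be worked through with the sample values from this page (a monthlyPrice of "39.00" and an annualDiscount of 0.15). This is a sketch only: rounding to cents with half-up rounding is an assumption, since the page does not say how the billed amount is rounded.

```python
from decimal import Decimal, ROUND_HALF_UP


def annual_amount(monthly_price: str, annual_discount: float) -> Decimal:
    """monthlyPrice x 12 x (1 - annualDiscount), rounded to cents (rounding mode assumed)."""
    monthly = Decimal(monthly_price)
    # Convert through str() so 0.15 becomes Decimal("0.15"), not its binary float expansion.
    discount = Decimal(str(annual_discount))
    total = monthly * 12 * (Decimal(1) - discount)
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


amount = annual_amount("39.00", 0.15)  # 39.00 x 12 = 468.00; 468.00 x 0.85 = 397.80
```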
Response
    {
      "success": true,
      "data": {
        "id": "subscription_uuid",
        "poolId": "pool_uuid",
        "status": "active",
        "monthlyPrice": "39.00",
        "hours": [
          { "hour": 0, "slotIndex": 0, "pricePerDay": "4.88" },
          { "hour": 1, "slotIndex": 0, "pricePerDay": "4.88" }
        ],
        "key": null
      }
    }

Note: No API key is created automatically. Use POST /api/keys/subscription to create your key when ready, optionally passing {"subscriptionId": "subscription_uuid"} to target a specific subscription.
Unlimited throughput — no budget caps, no token counting. Each key handles a limited number of simultaneous requests. Your key is live during your subscribed blocks.
If the plan has manual provisioning enabled, status will be pending_key and key will be null. You’ll receive an email when your key is ready.
Pricing
monthlyPrice = sum of prices for all selected slots. Each block has a fixed monthly price. With quantity: 2 and two blocks, you pay 2 × (block_a + block_b).
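The quantity rule can be illustrated with a small calculation. The block prices below are hypothetical placeholders, not real plan prices; actual values come from the plan detail response.

```python
from decimal import Decimal


def monthly_price(block_prices: dict[str, str], blocks: list[str], quantity: int = 1) -> Decimal:
    """quantity x (sum of the monthly prices of the selected blocks)."""
    return quantity * sum((Decimal(block_prices[b]) for b in blocks), Decimal("0"))


# Hypothetical per-block monthly prices in USD, for illustration only.
example_prices = {"asia": "29.00", "europe": "39.00", "americas": "39.00"}

total = monthly_price(example_prices, ["asia", "europe"], quantity=2)  # 2 x (29.00 + 39.00)
```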
View my subscriptions
GET /api/pools/:id/my-subscriptions

Returns all your active subscriptions for this pool (you can have multiple if you purchased additional blocks).

    curl https://api.cheapestinference.com/api/pools/minimax-m3/my-subscriptions \
      -H "Authorization: Bearer mk_your_key"

    {
      "success": true,
      "data": [
        {
          "id": "subscription_uuid",
          "status": "active",
          "monthlyPrice": "39.00",
          "billingCycle": "month",
          "cancelAtPeriodEnd": false,
          "currentPeriodEnd": "2026-05-14T00:00:00.000Z",
          "hours": [
            { "hour": 16, "slotIndex": 0, "pricePerDay": "4.88" }
          ],
          "key": { "id": "key_uuid", "apiKey": "sk_pool_abc123...", "isActive": true }
        }
      ]
    }

| Field | Meaning |
|---|---|
| status | active, pending_key, past_due, or canceled |
| billingCycle | "month" or "year", chosen at subscribe time |
| cancelAtPeriodEnd | true when the user has scheduled cancellation; access remains until currentPeriodEnd |
status semantics:
| status | Meaning |
|---|---|
| active | Subscription running, key is live |
| pending_key | Paid, key being set up (manual provisioning) |
| past_due | Payment failed, key suspended |
| canceled | Subscription ended |
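Combining the status table with the time blocks, a client could estimate locally whether a subscription's key should be usable at a given moment. The sketch below assumes that the `hour` values in `hours` are UTC hours and that `key.isActive` reflects suspension; neither is stated explicitly here, and `key_is_live` is a hypothetical helper.

```python
from datetime import datetime, timezone


def key_is_live(subscription: dict, now: datetime) -> bool:
    """Best-effort local check; `now` must be a timezone-aware datetime."""
    if subscription.get("status") != "active":
        return False  # pending_key, past_due, and canceled all mean no usable key
    key = subscription.get("key")
    if not key or not key.get("isActive"):
        return False
    utc_hour = now.astimezone(timezone.utc).hour
    # Assumption: each entry in "hours" names one subscribed UTC hour.
    return any(slot["hour"] == utc_hour for slot in subscription.get("hours", []))


sub = {
    "status": "active",
    "hours": [{"hour": 16, "slotIndex": 0, "pricePerDay": "4.88"}],
    "key": {"id": "key_uuid", "apiKey": "sk_pool_abc123...", "isActive": True},
}
inside = key_is_live(sub, datetime(2026, 5, 1, 16, 30, tzinfo=timezone.utc))
outside = key_is_live(sub, datetime(2026, 5, 1, 15, 59, tzinfo=timezone.utc))
```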
Get my keys
GET /api/pools/:id/my-keys

Returns all API keys from all your subscriptions on this pool.

    curl https://api.cheapestinference.com/api/pools/minimax-m3/my-keys \
      -H "Authorization: Bearer mk_your_key"

Use any apiKey as Authorization: Bearer on any /v1/* or /anthropic/* inference endpoint.
Cancel subscription
DELETE /api/pools/:id/pledge

Cancels your most recent subscription by scheduling cancellation at the end of the current billing period. Your key keeps working until currentPeriodEnd; after that Stripe stops renewing, the key is revoked, and the slots are freed. No refund is issued for unused time.
    curl -X DELETE https://api.cheapestinference.com/api/pools/minimax-m3/pledge \
      -H "Authorization: Bearer mk_your_key"

    {
      "success": true,
      "data": { "canceled": true }
    }

How Unlimited works
- Browse: pick a model and select your time blocks (Asia, Europe, Americas).
- Subscribe: your card is charged immediately. No API key is created automatically; you create it yourself.
- Create key: call POST /api/keys/subscription to generate your API key. You can have multiple keys if you have multiple subscriptions.
- Use: your key gets unlimited throughput during your selected blocks. No budget limits; each key handles a limited number of simultaneous requests.
- Renew: subscriptions recur monthly, or yearly when billed annually. Cancel anytime; access continues to the end of the paid period.
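The subscribe and create-key steps can be sketched as two request builders using only the Python standard library. Nothing is sent here; the functions only construct the requests. The helper names are hypothetical, and using the same base URL for /api/keys/subscription as for the pool endpoints is an assumption.

```python
import json
import urllib.request

BASE_URL = "https://api.cheapestinference.com"  # host taken from the curl examples


def _post(path: str, management_key: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) an authenticated JSON POST request."""
    return urllib.request.Request(
        BASE_URL + path,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {management_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def subscribe_request(pool: str, blocks: list[str], management_key: str) -> urllib.request.Request:
    """Subscribe step: POST /api/pools/:id/subscribe with the chosen blocks."""
    return _post(f"/api/pools/{pool}/subscribe", management_key, {"blocks": list(blocks)})


def create_key_request(subscription_id: str, management_key: str) -> urllib.request.Request:
    """Create-key step: POST /api/keys/subscription (base URL assumed)."""
    return _post("/api/keys/subscription", management_key, {"subscriptionId": subscription_id})
```

To run the flow for real, each request would be passed to `urllib.request.urlopen`, with the `id` from the subscribe response fed into `create_key_request`.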