Skip to content

Overview

CheapestInference is an AI inference proxy that gives you access to open-source models through a single API with flat monthly pricing. No per-token charges.

https://api.cheapestinference.com/v1

CheapestInference routes your requests to the appropriate open-source model provider (Moonshot, Zhipu / Z.ai, MiniMax).

  1. You subscribe to a pool by reserving one or more daily time blocks
  2. You create API keys from the dashboard
  3. You use those keys with the OpenAI or Anthropic SDK — just change the base URL
  4. The platform validates your key and forwards the request to the provider

Your API key works exactly like an OpenAI or Anthropic key. All routing and spend tracking is handled automatically.

EndpointDescription
POST /v1/chat/completionsOpenAI-compatible chat (all models)
POST /v1/completionsOpenAI-compatible legacy completions
POST /anthropic/v1/messagesAnthropic-compatible messages
GET /v1/modelsList available models
GET /v1/models/:model_idGet specific model details
GET /v1/usageCheck key usage and status

The response format matches the official OpenAI and Anthropic APIs exactly.

Each key handles a limited number of simultaneous requests during your reserved blocks. To run more in parallel — or to isolate clients — create additional keys (one per seat). Keys are independent, so one busy key never affects the others.

MethodHow
CardVisa, Mastercard, etc. via Stripe
USDCDirect transfer on Base L2 (MetaMask, Coinbase Wallet)
CreditsPay-as-you-go top-ups — temporarily unavailable

Subscriptions last 30 days with no auto-renewal. You renew manually when ready.

Requests without an API key receive a 402 Payment Required response with a product catalog. AI agents can subscribe or purchase credits autonomously using the x402 protocol with USDC on Base L2 — no human setup needed.

No custom SDK required. Use the official OpenAI or Anthropic SDK in any language:

  • Python: openai, anthropic
  • Node.js: openai, @anthropic-ai/sdk
  • Any OpenAI-compatible client (Go, Rust, Java, etc.)