Skip to content

MiniMax M2.5 API — unlimited & flat-rate access

MiniMax M2.5 is MiniMax’s high-value general and coding model. CheapestInference serves it through an OpenAI- and Anthropic-compatible API on flat-rate monthly plans and a truly unlimited pool — so your cost does not scale with tokens.

Quick facts


Model	MiniMax M2.5
Provider	MiniMax (served direct)
Model ID	`MiniMax-M2.5`
Context window	192K tokens
Cost basis	$0.27 / $0.95 per 1M tokens (in / out)
Endpoints	`/v1/chat/completions` (OpenAI), `/anthropic/v1/messages` (Anthropic)
Pricing	From $39/mo — reserve an 8-hour daily time block, up to full 24/7

Call MiniMax M2.5

from openai import OpenAI

client = OpenAI(
    base_url="https://api.cheapestinference.com/v1",
    api_key="sk-..."  # your subscriber key
)

response = client.chat.completions.create(
    model="MiniMax-M2.5",
    messages=[{"role": "user", "content": "Summarize this document..."}],
)

curl https://api.cheapestinference.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "MiniMax-M2.5", "messages": [{"role": "user", "content": "Hello"}]}'

Why flat-rate MiniMax M2.5

MiniMax M2.5 has the lowest cost basis of the three served models, making it a strong value pick for general-purpose and high-throughput workloads. On CheapestInference it is billed at a flat monthly rate, not per token, and is part of the frontier failover set alongside Kimi K2.6 and GLM 4.7. It works in any OpenAI-compatible client.

Common questions

Is there a MiniMax M2.5 API? Yes. Use model id MiniMax-M2.5 against https://api.cheapestinference.com/v1. The API is OpenAI- and Anthropic-SDK compatible.

How much does MiniMax M2.5 cost? From $39/month. You reserve one or more 8-hour daily time blocks (up to full 24/7) and use MiniMax M2.5 with no usage cap — billed at a flat monthly fee, not per token.

Is MiniMax M2.5 good for coding? Yes — it handles general and coding tasks, and is served alongside Kimi K2.6 and GLM 4.7 in the frontier coding pool with automatic failover.