Skip to main content
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.CHEAPESTINFERENCE_API_KEY,
  baseURL: "https://api.cheapestinference.ai/v1",
});

const completion = await client.chat.completions.create({
  model: "openai/gpt-oss-20b",
  messages: [{ role: "user", content: "What are the top 3 things to do in New York?" }],
});
console.log(completion.choices[0].message.content);

What You Can Do

Our Models

CheapestInference hosts many popular open source models, you’ll be charged based on the tokens you use and size of the model. Models are not quantized unless specified.

Chat models

DeepSeek R1

DeepSeek V3.1

GPT-OSS-120B

Llama 4 Maverick

Qwen 3 Next 80B

Kimi K2 0905

View all models →

Embedding models

Powerful embedding models for semantic search and retrieval. View all models →

Build AI apps and agents with CheapestInference