Now in private beta — 300+ engineers on the waitlist

One key.
Every model.
Honest routing.

ModelBeat is a multi-provider AI gateway that gives your team a single API key, unified billing, automatic failover, and cost-aware routing across 250+ models — without the hype about quality oracles we can't yet deliver.

250+ models available
45+ providers

One line to migrate.
Zero lock-in.

Change your base URL and API key. That's it. Your existing OpenAI or Anthropic code works unchanged — ModelBeat handles the rest.

app.py
from openai import OpenAI

client = OpenAI(
    api_key="mb-...",                        # one ModelBeat key
    base_url="https://api.modelbeat.ai/v1",  # ← one line changed
)

resp = client.chat.completions.create(
    model="auto",                            # ModelBeat routes it
    messages=[{"role": "user", "content": "Summarize this contract."}],
)

# See exactly which model handled it + cost
print(resp.model)   # → "anthropic/claude-3-haiku"
print(resp.usage)   # → tokens + cost breakdown

app.ts
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "mb-...",                         // one ModelBeat key
  baseURL: "https://api.modelbeat.ai/v1",   // ← one line changed
});

const resp = await client.chat.completions.create({
  model: "auto",                             // ModelBeat routes it
  messages: [{ role: "user", content: "Summarize this contract." }],
});

// Full transparency on routing decision
console.log(resp.model);   // → "openai/gpt-4o-mini"
console.log(resp.usage);   // → tokens + cost breakdown

terminal
curl https://api.modelbeat.ai/v1/chat/completions \
  -H "Authorization: Bearer mb-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{"role": "user", "content": "Summarize this contract."}]
  }'

# Response includes routing metadata:
# x-modelbeat-model: anthropic/claude-3-haiku
# x-modelbeat-cost: $0.000042
# x-modelbeat-routing: cost-tier-match

Power user hint: Pass extra_body={"modelbeat": {"max_cost": 0.01, "task": "code"}} to override routing for a specific request. It's optional: leave it out and routing stays fully automatic.
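
If you set overrides in more than one place, a small helper keeps the payload consistent. This is a minimal sketch, assuming the `modelbeat` block accepts exactly the two fields shown in the hint (`max_cost` and `task`); the helper name `routing_overrides` is hypothetical and not part of any SDK.

```python
from typing import Any, Optional


def routing_overrides(max_cost: Optional[float] = None,
                      task: Optional[str] = None) -> dict[str, Any]:
    """Build the extra_body payload for a per-request routing override.

    Only the fields the caller sets are included, so a call with no
    arguments returns an empty dict and leaves routing automatic.
    """
    overrides: dict[str, Any] = {}
    if max_cost is not None:
        if max_cost <= 0:
            raise ValueError("max_cost must be positive")
        overrides["max_cost"] = max_cost
    if task is not None:
        overrides["task"] = task
    return {"modelbeat": overrides} if overrides else {}


# Usage with the client from app.py (not executed here):
# resp = client.chat.completions.create(
#     model="auto",
#     messages=[{"role": "user", "content": "Refactor this function."}],
#     extra_body=routing_overrides(max_cost=0.01, task="code"),
# )
```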

We'll tell you exactly
what we can — and can't — do.

Most routing products oversell quality prediction. We ship what's proven, instrument for what's next, and earn into the research bet with real data.

SHIPPED IN V1
L1

Heuristic Routing

Rules-based cost-tier routing. Code fences → frontier model. Simple Q&A → cheap model. Image input → vision model. Delivers real cost savings on the 60% of traffic that's trivially classifiable.

Honest claim: Cost optimization. Not quality prediction.
~60% avg. cost reduction on easy traffic
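
To make the three rules concrete, here is a rough sketch of what rules-based routing of this kind can look like. It is an illustration, not ModelBeat's production classifier: the tier names, the rule priority order, and the "everything else is cheap" fallthrough are all assumptions, and messages are assumed to follow the OpenAI chat format.

```python
from typing import Any

# Hypothetical tier names; a real deployment would map these to concrete models.
FRONTIER, CHEAP, VISION = "frontier-tier", "cheap-tier", "vision-tier"

CODE_FENCE = "`" * 3  # the Markdown code-fence marker


def _parts(message: dict[str, Any]) -> list[dict[str, Any]]:
    """Normalize OpenAI-style content (str or list of parts) to a list of parts."""
    content = message.get("content") or ""
    if isinstance(content, str):
        return [{"type": "text", "text": content}]
    return list(content)


def route(messages: list[dict[str, Any]]) -> str:
    """Pick a tier using the three rules, checked in an assumed priority order."""
    parts = [p for m in messages for p in _parts(m)]
    # Rule 1: any image input must go to a vision-capable model.
    if any(p.get("type") == "image_url" for p in parts):
        return VISION
    # Rule 2: a code fence suggests a coding task, so use a frontier model.
    text = "\n".join(p.get("text", "") for p in parts if p.get("type") == "text")
    if CODE_FENCE in text:
        return FRONTIER
    # Rule 3 (simplification): treat everything else as simple traffic.
    return CHEAP
```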
PHASE 2
L2

Embedding Similarity Routing

Embed incoming prompts, compare to clusters of past prompts with logged outcomes. Route to whatever historically won that cluster. Gets smarter with your traffic — not someone else's.

Honest claim: Learns from your data. Cold-start is weak.
Weeks to build once you have traffic data
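
The idea can be sketched as nearest-centroid lookup over prompt embeddings. This is a toy version under stated assumptions: embeddings are plain lists of floats produced elsewhere, each cluster stores the model that historically won it, and the `min_similarity` threshold of 0.8 is an arbitrary placeholder. The fallback argument is where the weak cold start shows up.

```python
import math
from dataclasses import dataclass


@dataclass
class Cluster:
    centroid: list[float]   # mean embedding of past prompts in this cluster
    best_model: str         # model that historically won on this cluster


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def route_by_similarity(embedding: list[float], clusters: list[Cluster],
                        fallback: str, min_similarity: float = 0.8) -> str:
    """Return the best model of the nearest cluster, or the fallback.

    The fallback covers the cold-start case: no clusters yet, or no
    cluster close enough to trust its history.
    """
    if not clusters:
        return fallback
    nearest = max(clusters, key=lambda c: cosine(embedding, c.centroid))
    if cosine(embedding, nearest.centroid) < min_similarity:
        return fallback
    return nearest.best_model
```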
RESEARCH BET
L3

Trained Quality Router

A model trained on (query, candidate model, measured output quality) to predict the best model per input. This is what Martian and Not Diamond are building. We'll earn into it with data — not promise it on day one.

Honest claim: Months of ML + data flywheel. Not yet.
Phase 3 only if benchmarks justify it
Live routing decision (model: "auto")
Incoming request: "Summarize this contract in 3 bullets"
L1 classifier: task: summarize → cost-tier
Routed to: claude-3-haiku · $0.000038
Routing decision time: 0.8 ms
Saved vs. GPT-4o: $0.0041

Everything ops.
Nothing you don't need.

Single API Key + Credit Billing

One key per project maps to a credit balance. Top up via Stripe. Hard budget caps and alerts so a runaway loop can't drain your balance silently. Transparent margin — no hidden 20–50% markups.

mb-sk-proj_abc123... → routes to 250+ models
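
A hard cap is simple to reason about: every request is checked against the cap before it is charged. The sketch below shows the general pattern, not ModelBeat's billing code. The class and field names are hypothetical, and the 80% alert threshold is an assumed default.

```python
from dataclasses import dataclass, field


class BudgetExceeded(Exception):
    """Raised when a charge would push spend past the hard cap or balance."""


@dataclass
class ProjectBudget:
    balance: float                 # prepaid credits remaining, in USD
    hard_cap: float                # maximum spend allowed in the current period
    alert_at: float = 0.8          # alert once this fraction of the cap is spent
    spent: float = 0.0
    alerts: list[str] = field(default_factory=list)

    def charge(self, cost: float) -> None:
        """Deduct one request's cost, refusing it if cap or balance would be exceeded."""
        if self.spent + cost > self.hard_cap or cost > self.balance:
            raise BudgetExceeded(f"charge of ${cost:.6f} rejected")
        self.balance -= cost
        self.spent += cost
        # Fire the alert once, the first time spend crosses the threshold.
        if self.spent >= self.alert_at * self.hard_cap and not self.alerts:
            self.alerts.append(f"{self.alert_at:.0%} of budget cap reached")
```

A runaway loop hits `BudgetExceeded` as soon as the next charge would cross the cap, so it stops there instead of draining the balance.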

Full Audit Logs

Per-request logs: tokens, cost by type, model chosen, routing decision, latency. Queryable and exportable. Enterprises will ask — we're ready.

Automatic Failover

Provider goes down? Traffic reroutes instantly. Single providers rarely exceed 99.7% uptime — we give you 99.95%+ across the fleet.
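
The arithmetic behind fleet-level uptime: if outages were fully independent, two providers at 99.7% each would both be down about 0.0009% of the time. Real outages are often correlated, so treat that as an upper bound on the benefit. The sketch below shows a basic failover chain plus that calculation. The function names are hypothetical, and each provider is modeled as a plain callable.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the chain has errored."""


def complete_with_failover(
    prompt: str,
    providers: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each (name, call) pair in order; return (name, result) from the first success."""
    errors: list[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # any provider error triggers failover
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


def combined_availability(uptimes: Sequence[float]) -> float:
    """Availability of a chain, ASSUMING provider outages are independent."""
    downtime = 1.0
    for u in uptimes:
        downtime *= (1.0 - u)
    return 1.0 - downtime
```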

Multimodal Routing

Text, image, audio, video — ModelBeat inspects the request and routes to a capable model in the right modality automatically. No config required.

Built-in Guardrails

Prompt-injection screening, PII detection, content safety — on by default. Configurable per project: custom blocklists, vendor integrations, output schema validation.

Routing Transparency

Every response tells you which model handled it and why. "X% of traffic routed to cheaper models, $Y saved vs all-frontier." The metric that proves the cost story.
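
Both numbers in that sentence fall out of per-request logs. Here is one way to compute them, as a sketch: the `RequestLog` fields are assumptions about what a log record might carry (in particular `frontier_cost`, an estimate of what the same request would have cost on a frontier model), not ModelBeat's actual log schema.

```python
from dataclasses import dataclass


@dataclass
class RequestLog:
    model: str               # model that actually served the request
    cost: float              # what the request actually cost, in USD
    frontier_cost: float     # estimated cost had it gone to a frontier model
    is_frontier: bool        # True if the serving model is in the frontier tier


def savings_summary(logs: list[RequestLog]) -> tuple[float, float]:
    """Return (percent routed to cheaper models, dollars saved vs. all-frontier)."""
    if not logs:
        return 0.0, 0.0
    cheaper = sum(1 for r in logs if not r.is_frontier)
    saved = sum(r.frontier_cost - r.cost for r in logs)
    return 100.0 * cheaper / len(logs), saved
```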

Where we fit in
the landscape.

We're not trying to out-research Martian or out-breadth OpenRouter. We're the honest middle: production-grade gateway with real cost savings and no quality claims we can't back up.

Feature ModelBeat OpenRouter Martian Not Diamond Portkey
Single API key + credit billing ~ ~
250+ models / 45+ providers ~ ~
Automatic failover + load balance ~
Per-request cost audit logs ~ ~
Cost-tier routing (L1) ~ ~
Multimodal routing ~
Built-in guardrails
Quality prediction routing (L3) Roadmap
Transparent pricing ~

✓ = available  ·  ~ = partial / platform tier  ·  ✗ = not available  ·  Roadmap = planned

Transparent margin.
No surprises.

We charge base provider cost plus a small, visible margin. You always know what you're paying and why. No hidden 20–50% markups.

Starter
Free
For individuals and side projects
  • $10 free credits on signup
  • 50+ models
  • Basic audit logs (7-day retention)
  • L1 cost-tier routing
  • Community support
Get started free
Enterprise
Custom
For orgs with SLA requirements
  • Everything in Pro
  • Volume discounts
  • SLA guarantees
  • SSO + RBAC
  • Custom routing policies
  • Dedicated support + onboarding
  • On-prem / VPC deployment
Talk to us
The 5.5% margin is on credit purchases, not per-token. OpenRouter charges the same. We show it upfront because hidden markups are a trust risk and a churn driver once customers notice.
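
A quick worked example of what that means at checkout. This sketch assumes the margin is added on top of the credit amount, which the text above does not spell out; if it is deducted from the credits instead, the fee is the same and only the total differs. The function name is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

MARGIN = Decimal("0.055")  # the 5.5% margin stated above


def purchase_total(credits_usd: str) -> tuple[Decimal, Decimal]:
    """Return (fee, total charged) for a credit purchase, rounded to cents.

    ASSUMPTION: the margin is added on top of the credit amount.
    """
    credits = Decimal(credits_usd)
    fee = (credits * MARGIN).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)
    return fee, credits + fee
```

Under that assumption, buying $100 of credits carries a $5.50 fee, and token usage is then billed against the credits with no further markup.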

Teams using ModelBeat
cut LLM spend

compared with routing all traffic to frontier models.

Easy tasks (summarize, classify, Q&A) 62%
Medium tasks (analysis, drafting) 28%
Hard tasks (code, reasoning, math) 10%

Typical traffic distribution. Hard tasks still go to frontier models — that's the honest routing story.

Private beta — limited spots

Join the waitlist.

We're onboarding teams in batches. 300+ engineers are already on the list. Get early access, locked-in pricing, and direct input on the roadmap.

No spam. No sales calls. Just a ping when your spot opens.