OpenRouter

跨 60+ 厂商、300+ LLM 的统一 API 市场 —— "推理界的 Stripe",对厂商定价不加价。

1. 核心产品 / 服务

OpenRouter 是一个坐在每个主流 LLM 厂商前面的路由/聚合器。一个 API key、一个 OpenAI 兼容 endpoint(/v1/chat/completions),即可访问截至 2025 年的 60+ 厂商的 300+ 活跃模型 [1]。

关键能力:

OpenAI 兼容 API —— openai SDK 的直接替代;切换模型 = 改一个字符串。
自动故障转移 —— 主厂商 5xx 时,请求透明地重路由到该模型的备用厂商。
路由模式 —— 最便宜可用、最低延迟、最高吞吐,或固定厂商。
BYOK(自带 Key) —— 用户可附加自己的厂商 key;在每月前 1M 请求之后,OpenRouter 收取等效 OpenRouter 价格的 5% [2]。
额度 + 免费档 —— 现付现用额度;部分开源权重模型(DeepSeek、Llama、GLM)有免费档每日配额。
缓存 / KV 缓存透传 —— 透传厂商侧 KV 缓存折扣(例如 DeepSeek 输入缓存命中定价)。

它还发布公开的 State of AI 数据集 —— 所有路由流量的使用份额排名,已成为"人们实际在用哪些模型"的行业基准 [3]。

2. 目标用户与痛点

主要用户:不想被厂商锁定的 LLM 应用开发者。

解决的痛点:

账单碎片化 —— 取代 6+ 厂商发票(Anthropic、OpenAI、Google、DeepSeek、Moonshot、Together…)的,是一个额度余额。
厂商切换成本 —— 测试新模型从"开新账号、取 API key、集成新 SDK"变成"改一个模型字符串"。
可靠性 / 故障转移 —— 单厂商宕机不再杀死应用。
限速池化 —— OpenRouter 与每家厂商的聚合配额远高于个体能谈到的。
发现 —— 带实时定价、延迟、吞吐统计的模型卡,让开发者无需手动 benchmark 即可比价。

规模信号:月活 250 万,月活跃用户 15 万+,生命周期 >1M 开发者,>50% 使用来自美国以外 [3][4]。

3. 竞争格局

特性	OpenRouter	together-ai	portkey-ai	eden-ai
主要形态	聚合器/路由器	推理厂商(自有 GPU)+ 路由器	网关 + 可观测性	聚合器
模型数	300+	约 200(主要为开源)	50+ 透传	100+(含非 LLM AI)
自托管模型	否(纯路由器)	是(自有 H100/H200 机群)	否	否
免费档	慷慨(开源模型每日配额)	有限额度	无(付费 SaaS)	有限
对推理的加价	0%(对额度购买收 5%)	自定价(已包含 margin)	0%(按请求收费)	有加价
可观测性	基础(日志、统计)	基础	企业级(护城河)	基础
开源 SDK	OpenAI 兼容	自家 + OpenAI 兼容	多厂商 SDK	多厂商 SDK

差异化:

vs together-ai:Together 自托管 GPU,在开源模型推理价格 + 速度上竞争。OpenRouter 不跑硬件 —— 它是纯市场,包含 Together 作为其中一家厂商。
vs portkey-ai:Portkey 在企业可观测性/治理(日志、策略、PII 脱敏)上领先。OpenRouter 可观测性基础;护城河是广度 + 免费档 + 社区流量。
vs eden-ai:Eden 把非 LLM AI(视觉、OCR、语音)和 LLM 绑在一起。OpenRouter 是纯 LLM 但更深。

OpenRouter 的真正护城河:分发(每个"开源 ChatGPT 克隆"默认选 OpenRouter)+ 排名数据集(发布使用份额造就飞轮 —— 厂商竞相进入排行榜)。

4. 独立观察

Jimmy 已把 OpenRouter 作为 hermes-openrouter-models 的默认 LLM 厂商运行,并跟踪行为数月:

默认模型:deepseek/deepseek-v3.2。多数别名(qwen、qwen-flash、deepseek、glm)通过 OpenRouter 解析,而非直连厂商(local: daily_log-2026-04-04.md)。
GLM 5.1 更新:Hermes 配置跟踪 z-ai/glm-5.1,通过 OpenRouter 路由具有 202k 上下文窗口。
OpenRouter 上的 Kimi:Moonshot 的 Kimi K2 系列在 OpenRouter 和 DeepInfra 上;第三方推理有时比 Moonshot 直连便宜,因为厂商在 margin 上竞争(local: 2026-04-01-diary.md)。对编码工作负载,Jimmy 切换到通过 Moonshot 直连授权的 kimi-coding(见 hermes-openrouter-models)—— 直接厂商可以提供 prompt 缓存折扣,而 OpenRouter 透传不一定保留。
延迟税真实存在:经 OpenRouter 路由相对直连有明显延迟。在 Jimmy 的 TTFT 基准里,MiniMax M2.7 经 OpenRouter 为 1.8s,Gemini 3.1 Pro 直连为 1.1s(local: 2026-04-02-diary-claudecode.md)。对延迟敏感的 Claude Code 会话,直接厂商授权胜出。
Tempo MPP 角度:Tempo 的 Money-Per-Prompt 服务在 https://openrouter.mpp.tempo.xyz/v1/chat/completions 暴露 OpenRouter,实现无需 Stripe 账户的 crypto 原生(x402 风格)按调用付费。Jimmy 跟踪的开放问题:KV 缓存折扣是否能经过这条代理链(见 claude-code-sessions 使用笔记)。
厂商经济洞察:OpenRouter 的 0% 推理加价是战略锚点 —— 他们改为收取 5% 的额度购买费,对信任结构更友好(厂商看不到一个竞争对手在压价)。这正是 OpenRouter 成为 Cline / Continue / OpenWebUI 生态默认的原因。
**"OpenRouter 排名作为质量信号"**观察:在 agent 支付 / ai-inference-engines 设计讨论中,Jimmy 标记 OpenRouter 的逐模型延迟 + 正常运行时间统计作为推理厂商的事实声誉系统 —— 即它已经在做 agent 支付 QoS 层所需的事情(local: daily_log-2026-04-04.md)。

5. 财务 / 融资

2025 年 6 月 — Seed + A 轮合并:$40M(部分来源 $40.5M),由 Andreessen Horowitz(Seed)和 Menlo Ventures(A 轮)领投;Sequoia、Figma 及包括 Fred Ehrsam 在内的天使参与。估值报告为 约 $500M [5][6][7]。
正在洽谈(Sacra,2026):$120M 轮 / $1.3B 估值 —— 尚未确认完成 [3]。
处理的推理消费(年化运行率):
- 2024 年 10 月:$10M
- 2025 年 5 月:$100M+
- 至 2026 年:继续扩张 [3]
OpenRouter 自身营收(年化,Sacra 估算):
- 2025 年 5 月:约 $5M
- 2025 年 10 月:约 $10M
- 2026 年初:约 $50M [3]
token 吞吐:约 5T tokens/周(2025 年 4 月)→ >20T tokens/周(2026 年 4 月),同比约 4×;2025 年末超过 1T tokens/天;为 State-of-AI 报告累计数据集跨越 100T tokens [1][3]。
营收模式:推理 0% 加价;额度购买 5.5% 费用($0.80 起);加密充值 5%;BYOK 月超 1M 请求 5% [2]。

6. 团队与关系

创始人:
- Alex Atallah —— CEO。前 OpenSea(2018–2022 主导 NFT 市场)联合创始人兼 CTO。2022 年 7 月从 OpenSea 卸任"去做从 0 到 1 的事";2023 年创办 OpenRouter [6]。
- Louis Vichy —— 联合创始人 [6]。
领投人:Andreessen Horowitz(Seed 领投,也共同发表"100T token State of AI" [4])、Menlo Ventures(A 轮领投)、Sequoia Capital、Figma、Fred Ehrsam(Coinbase / Paradigm)[5][6]。
头部厂商(伙伴)——非穷举:
- deepseek —— 平台上最高量级模型之一(DeepSeek V3.2 / R1 经常排名第一)。
- [kimi]—— Kimi K2 系列经 OpenRouter 可用;有时比直连便宜。
- together-ai —— 列为开源模型的推理后端之一。
- Anthropic、OpenAI、Google(Gemini)、xAI、Mistral、Meta(Llama)、Z.ai(GLM)、MiniMax、Alibaba(Qwen)、DeepInfra、Fireworks、Groq —— 都作为可路由厂商接入。

Sources

[1] OpenRouter, "State of AI 2025: 100T Token LLM Usage Study," https://openrouter.ai/state-of-ai (2026-05-09) [2] OpenRouter Docs, "FAQ — Pricing & Fees," https://openrouter.ai/docs/faq (2026-05-09) [3] Sacra, "OpenRouter revenue, valuation & funding," https://sacra.com/c/openrouter/ (2026-05-09) [4] Andreessen Horowitz, "Investing in OpenRouter" + "State of AI: 100T Token Study," https://a16z.com/announcement/investing-in-openrouter/ (2026-05-09) [5] GlobeNewswire, "OpenRouter raises $40 million to scale up multi-model inference for enterprise," 2025-06-25, https://www.globenewswire.com/news-release/2025/06/25/3105125/0/en/OpenRouter-raises-40-million-to-scale-up-multi-model-inference-for-enterprise.html (2026-05-09) [6] The Block, "OpenSea co-founder Alex Atallah raises $40 million for AI startup OpenRouter," https://www.theblock.co/post/360093/opensea-co-founder-alex-atallah-raises-40-million-for-ai-startup-openrouter (2026-05-09) [7] Orrick, "AI Inference at Scale: OpenRouter Raises Series Seed and Series A Financing," https://www.orrick.com/en/News/2025/06/AI-Inference-at-Scale-OpenRouter-Raises-Series-Seed-and-Series-A-Financing (2026-05-09)

Local sources:

raw/2026-04-01-diary.md —— 经 OpenRouter 的 Kimi 比直连便宜;OpenRouter 的 token 计数方法(输入+输出合并、缓存感知)
raw/2026-04-02-diary-claudecode.md —— TTFT 基准 MiniMax-M2.7-经-OpenRouter(1.8s)vs 直接厂商;经 Tempo MPP 的 gpt-4o-mini 成本
raw/daily_log-2026-04-04.md + raw/diary-claudecode-2026-04-04.md —— Tempo openrouter.mpp.tempo.xyz 代理、KV 缓存透传问题、agent 支付 QoS = OpenRouter 排名类比
raw/jclaw-2026-04-04.md —— 模型路由的实体笔记