Kimi
Moonshot AI's flagship long-context LLM series; one of the leaders in China's 2026 open-source agentic coding models
1. Core Product / Service
Kimi is Moonshot AI's core LLM product line. Since its 2023 launch, long context has been the differentiated selling point (early 200k tokens, far above peers at the time).
Current flagship version (2026-05):
- Kimi K2.6 — flagship open-source model released 2026-04-20, 1T-parameter MoE architecture, 32B activated per token, native multimodal + agentic capability, 256K context, max output 16K tokens [openrouter.ai/moonshotai/kimi-k2.6, 2026-05-09]
- Kimi K2.5 Turbo / K2.5 Thinking — previous generation (still in product line); Turbo for fast response, Thinking for reasoning depth
- Kimi K2 Turbo — older flagship, still has users
Key capabilities:
- Agent Swarm system: can dispatch up to 300 domain sub-agents, autonomously executing 4000 steps in a single run
- End-to-end coding (Python / Rust / Go) + UI/UX generation
- Benchmarks: SWE-Bench Verified 80.2, Terminal-Bench 2.0 66.7, SWE-Bench Pro 58.6 (beating GPT-5.4 xhigh 57.7, Claude Opus 4.6 max 53.4)
Deployment forms: official platform.moonshot.ai direct / OpenRouter / DeepInfra / Cloudflare Workers AI / vLLM + SGLang self-deployment.
Official pricing (K2.5 Turbo, reference): input $0.60/M (cache hit $0.15/M), output $2.50/M; K2 Turbo input $1.15/M, output $8.00/M.
2. Target Users & Pain Points
Target users:
- Chinese and international developers needing long context (processing entire books, entire code repos)
- Practitioners of agentic coding workflows
- High-frequency users not wanting to pay USD prices for Claude/GPT (China official pricing + third-party OpenRouter pricing significantly below US frontier models)
Pain points solved:
- Long-document processing (256K context fits a mid-sized code repo)
- Long-running autonomous agent tasks (K2.6's strength is long-horizon agentic rather than ambiguous one-shot reasoning)
- Chinese developers going global bypassing international credit card payments via USDC and similar (see the openrouter integration discussion in hermes-openrouter-models)
3. Competitive Landscape
| Model | Camp | Strengths | Weaknesses |
|---|---|---|---|
| Kimi K2.6 | Moonshot | Long-horizon agentic, 1T MoE open source, SWE-Bench Pro leader | Open-ended ambiguous planning is weaker |
| deepseek V4 Pro | High-Flyer | BenchLM overall #1 (87 points), price/performance | Agent swarm not as strong as K2.6 |
| Qwen 3.6 Plus | Alibaba | The only 1M context, Terminal-Bench leader | Single-step reasoning slightly weaker than K2.6 |
| GLM 5.1 | Zhipu | Strongest at front-end agentic work | Overall slightly lower |
| Claude Opus 4.6 | Anthropic | Strongest overall | High price, SWE-Bench Pro overtaken by K2.6 |
| GPT-5.4 / 5.5 | OpenAI | General capability | SWE-Bench Pro overtaken by K2.6 |
Kimi's differentiation: within China's open-source camp, K2.6 is the "long-distance runner" specialist; DeepSeek is the overall champion; Qwen pushes context limits; GLM specializes in front-end. These four are the first tier of Chinese open-source LLMs in 2026.
4. Unique Observations
- Jimmy's Hermes integration: Moonshot Kimi connects to Hermes without source changes (
hermes auth add kimi-coding); K2.6 is already in the built-in model list. See hermes-openrouter-models. - OpenRouter is cheaper than official: repeatedly observed that Kimi pricing on OpenRouter / DeepInfra third parties is lower than Moonshot official, making it the preferred price/performance path.
- OpenClaw measured response time: Kimi K2.5 (Moonshot direct) averages 1.7-1.8s, on par with MiniMax M2.7 (OpenRouter), slower than Gemini 3.1 Pro direct (1.1s).
- Historical fallback chain instability: while troubleshooting OpenClaw cron on 2026-04-11, found the kimi-k2.5 fallback chain failing during certain time windows; switched back to gemini-3-pro-preview.
- Payment scenario judgment: in ai-inference-engines research, Kimi API was listed as a potential first-wave China-side adapter for Tempo / x402-style AI Agent-native payment protocols.
5. Financials (parent company Moonshot AI)
| Date | Event | Valuation |
|---|---|---|
| 2023-03 | Founded (Yang Zhilin, Zhou Xinyu, Wu Yuxin, Tsinghua alumni) | — |
| 2024-02 | Alibaba led $1B [techcrunch, 2026-05-09] | $2.5B |
| 2024-08 | Tencent, Gaorong $300M | $3.3B |
| 2025-10 | IDG led ~$600M (Tencent followed) | $3.8B pre |
| 2025-12 | IDG $150M (Alibaba, Tencent followed) | $4.3B |
| Early 2026 | $700M | $10B |
| 2026-05-07 | $2B round (open-source AI demand surge) [techcrunch, 2026-05-09] | $20B post |
Major investors: Alibaba (largest), Tencent, IDG Capital, Meituan, China Mobile, Gaorong Capital.
IPO rumors: Moonshot is reportedly considering a Hong Kong IPO, targeting a new round of $1B funding.
6. People & Relationships
- Parent company / creator: Moonshot AI (月之暗面) — Kimi is its only flagship product line; the company and product are deeply bound at the entity level
- Founder / CEO: Yang Zhilin (杨植麟, Tsinghua alumnus, ML scholar background)
- Co-founders: Zhou Xinyu, Wu Yuxin
- Major investors: Alibaba, Tencent, IDG Capital, Meituan, China Mobile
- Competitors: deepseek (same tier of Chinese open source), Qwen (same investors as Alibaba but product competition), GLM (Zhipu), Claude, GPT
- Distribution partners: openrouter (key third-party distribution channel), DeepInfra, Cloudflare Workers AI
Sources
- local: 2026-04-01-diary.md, 2026-04-02-diary-claudecode.md, daily_log-2026-04-04.md, diary-claudecode-2026-04-04.md, jclaw-2026-04-04.md, daily_log-2026-04-08.md
- https://en.wikipedia.org/wiki/Moonshot_AI (2026-05-09)
- https://openrouter.ai/moonshotai/kimi-k2.6 (2026-05-09)
- https://platform.kimi.ai/docs/guide/kimi-k2-6-quickstart (2026-05-09)
- https://techcrunch.com/2026/05/07/chinas-moonshot-ai-raises-2b-at-20b-valuation-as-demand-for-open-source-ai-skyrockets/ (2026-05-09)
- https://benchlm.ai/blog/posts/best-chinese-llm (2026-05-09)
- https://huggingface.co/moonshotai/Kimi-K2.6 (2026-05-09)