KIMI MEMBERSHIP — PRICING

Find Your Perfect Plan

Upgrade to Kimi Membership to unlock faster models, higher concurrency, and more powerful research preview capabilities.

Billing: Monthly or Annual (save 20%)
COMPARE

All Plans, Side by Side

* Estimated by assuming full monthly quota on a single feature, based on typical token usage. For reference only.

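| Group | Feature | Adagio ($0 / mo) | Moderato ($19 / mo) | Allegretto ($39 / mo) | Allegro ($99 / mo) | Vivace ($199 / mo) |
|---|---|---|---|---|---|---|
| Agent | Agent Usage | 6 | 60 | 150 | 360 | 720 |
| Agent | Concurrent Tasks | 1 | 2 tasks | 2 tasks | 4 tasks | 4 tasks |
| Agent | Priority Queue | | 4× speed | 4× speed | 4× speed | 4× speed |
| Agent Swarm | Agent Swarm Access | | | 50 uses / mo | 120 uses / mo | 240 uses / mo |
| Agent Swarm | Concurrent Subagents | | | 4 subagents | 4 subagents | 8 subagents |
| Kimi Code | Kimi Code Credits | | 1× credits | 5× credits | 15× credits | 30× credits |
| Kimi Claw | Kimi Claw Cloud | | | | | |
| Kimi Claw | Kimi Claw Android | | | | | |
| Kimi Claw | Kimi Claw PC (Mac ARM) | | | | | |
| Kimi Claw | Group Chat with Claw | | | 10 groups | 10 groups | 10 groups |
| Professional Data | Pro Data Requests | 200 | 2,000 | 5,000 | 12,000 | 24,000 |
| Tools | Word / Excel / Slides | | | | | |
| Tools | Deep Research | | | | | |
| Tools | Websites Deploy | | | | | |
| Tools | Website with Database | | | | | |
| Tools | Slides Visual Mode | | | | | |
| Tools | Research Preview | | | | | |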
COMPARISON

Kimi AI vs Other AI Tools

| Tool | Typical Paid Plan | Best For | What Stands Out |
|---|---|---|---|
| Kimi AI (Moderato) | $19/mo | Design→code + agentic tasks | Monthly quotas for Deep Research + OK Computer + Kimi Code. API fees not included. |
| ChatGPT Plus | $20/mo | All-rounder (writing, coding, images, tools) | Strong general assistant + broad feature set in one place |
| Claude Pro | $20/mo, or $17/mo billed annually | Writing + coding + long context work | Great for documents, structured writing, and project-style workflows |
| Google Gemini AI Plus | $7.99/mo | Cheaper upgrade in Google ecosystem | Often bundled with storage + Gemini features in Google apps |
| Google Gemini AI Pro | $19.99/mo | Higher limits + creator tools | More access to advanced Gemini + credits/tools depending on region |
| Perplexity Pro | $20/mo or $200/yr | Research with citations / browsing | Best "answer + sources" experience for web research |
| Microsoft 365 + Copilot | $19.99/mo | Word/Excel/PowerPoint productivity | Copilot inside Microsoft apps + Office suite bundle |
| Poe (multi-model) | from $4.99/mo | Trying many models cheaply | One subscription to access multiple model providers via points |

What you're really paying for

// 01

Kimi vs the "$20 club"

Kimi's Moderato tier is priced near common premium plans, but it's purpose-built around work quotas — Deep Research, OK Computer, Kimi Code. API fees are separate. If your workflow is design→code and agent tasks, Kimi feels more specialized than general chat plans.

// 02

Best all-in-one assistant

ChatGPT Plus ($20/mo) is typically the easiest "one subscription that does a bit of everything" choice. If you do mixed tasks — writing, coding, images, file work — it's usually the most balanced.

// 03

Heavy writing & long docs

Claude Pro ($20/mo) is often chosen when your workflow is heavy on documents, writing quality, and structured outputs.

// 04

Inside Google Workspace

Gemini/Google AI plans are great value if you already use Google storage and apps. There's a cheaper AI Plus and a higher AI Pro tier depending on your region.

// 05

Research with citations

Perplexity Pro ($20/mo) is the best deal when you care about citations, browsing, and fast research summaries.

// 06

Try many models cheaply

Poe is the budget option if your goal is to try lots of models without paying each company $20/month separately.

NEW · APR 2026

Kimi K2.6 Pricing

Kimi K2.6 is Moonshot AI's most capable open-source model, built for long-horizon coding, frontend design generation, 300-agent swarms, and native multimodal workflows. Unlike a standalone product, K2.6 is accessed through your existing Kimi membership or directly via token-based API billing — making it available at every price point from free to enterprise.

App Membership

Starts at $19/month (Moderato) and gives you K2.6 inside the Kimi chat interface, with agent credits, Deep Research, Kimi Code access, and the Slides and Websites tools included. The higher tiers, Allegretto ($39), Allegro ($99), and Vivace ($199), add Agent Swarm, more Kimi Code credits, Kimi Claw cloud deployment, and significantly larger Professional Data quotas. Note that the plan table lists 4 to 8 concurrent subagents per tier; the 300-subagent figure describes what the model itself supports, not a per-plan allowance.

API Access

Token-based and billed separately from membership. Reference pricing sits around $0.55 per million input tokens and $2.65 per million output tokens, making K2.6 one of the most cost-competitive frontier models for developers building at scale. The API is fully OpenAI-compatible — swap in model: "kimi-k2.6" and you're running the latest model in any existing workflow.

Open Weights

Available on HuggingFace under a modified MIT license, free to download and self-host with frameworks like vLLM, SGLang, or KTransformers — ideal for privacy-focused teams and AI researchers who need full infrastructure control.

Whether you're a daily user who wants smarter agent workflows, a developer building a product on top of K2.6's coding and design capabilities, or an enterprise team looking to self-host a trillion-parameter model, there's a pricing path designed for your workload.

Best for daily users
Moderato
$19/mo
K2.6 in chat + agent tasks, Deep Research, Kimi Code 1×
Best for developers
API
$0.55/1M in
Token billing · OpenAI-compatible · scales with usage
Best for self-hosting
Open Weights
Free (modified MIT)
HuggingFace · vLLM / SGLang · full infra control
LATEST MODEL — MOONSHOT AI

Kimi K2.6 is the most capable model in the Kimi lineup — built for long-horizon coding, coding-driven frontend design, 300-agent swarms, and native multimodal workflows. Access it through any Kimi membership plan above Adagio, or via the API at token-based rates.

1T Parameters
32B Active / Token
262K Context
300 Max Agents

How to Access Kimi K2.6

🌐
Via kimi.com App
Available on all paid tiers (Moderato and above). Use Kimi K2.6 directly in chat, agent mode, deep research, and coding-driven design workflows.
⚙️
Via API (Token Billing)
OpenAI-compatible API. Token-based billing — pay per input/output token. Separate from membership. Use model: "kimi-k2.6" in your API calls.
🤗
Open Weights (HuggingFace)
Open-source model weights available under a modified MIT license. Deploy locally with vLLM, SGLang, KTransformers, or TensorRT-LLM. Free to download and self-host.
🔀
Third-Party Providers
Also accessible via OpenRouter, Kilo Code, and community integrations. Provider-set pricing may differ from official Moonshot API rates.

Kimi K2.6 — Pricing by Access Method

| Access Method | Price | What's Included | Best For |
|---|---|---|---|
| App (Moderato) | $19/mo | K2.6 in chat + agent tasks, 60 agent credits, Deep Research, Kimi Code 1× credits, Slides, Websites Deploy | Daily users, creators, researchers |
| App (Allegretto) | $39/mo | All Moderato + Agent Swarm (50 uses, 4 subagents), Kimi Code 5× credits, Kimi Claw, 5,000 Pro Data req | Pro users, teams, agentic workflows |
| App (Allegro) | $99/mo | All Allegretto + Agent Swarm (120 uses, 4 subagents), Kimi Code 15× credits, 12,000 Pro Data req | Power users, heavy coders, automation |
| App (Vivace) | $199/mo | All Allegro + Agent Swarm (240 uses, 8 subagents), Kimi Code 30× credits, 24,000 Pro Data req | Agencies, enterprises, bulk automation |
| API (Input Tokens) | ~$0.55/1M | Prompt, system instructions, conversation history, retrieved docs. Cheaper because inputs are more cacheable. | Developers, builders, automation pipelines |
| API (Output Tokens) | ~$2.65/1M | Generated responses. Higher-priced: compute-intensive and harder to cache. Long outputs drive spend. | Developers, builders, automation pipelines |
| Open Weights | Free (modified MIT) | Download weights from HuggingFace. Self-host with vLLM / SGLang / KTransformers. Hardware costs apply. | AI researchers, privacy-first enterprises |

* API token prices are market reference snapshots (OpenRouter / ArtificialAnalysis). Always verify on your actual billing page. Prices change frequently.

What Makes K2.6 Worth the Upgrade

💻
Long-Horizon Coding
13-hour autonomous sessions. Overhauled an 8-year-old financial engine — 4,000+ lines, 12 strategies, 1,000+ tool calls. Result: +185% throughput.
🎨
Coding-Driven Design
Prompt → Awwwards-level interface with scroll animations, auth layers, and database ops. Full-stack from a single prompt — no designer required.
🤝
Agent Swarm (up to 300)
Dynamically spawns 300 specialized sub-agents executing 4,000 coordinated steps in parallel. Slashes execution time by up to 4.5×.
👁
Native Multimodal
Image and video input natively. Design-to-code from UI screenshots. Generates coherent visual assets, slides, and diagrams alongside code.

API Quickstart — Kimi K2.6

    # OpenAI-compatible: works with any OpenAI SDK
    from openai import OpenAI

    client = OpenAI(
        api_key="YOUR_MOONSHOT_API_KEY",
        base_url="https://api.moonshot.ai/v1",
    )

    response = client.chat.completions.create(
        model="kimi-k2.6",  # specify K2.6 explicitly
        max_tokens=4096,
        messages=[{
            "role": "user",
            "content": "Build a Next.js landing page with Tailwind and Framer Motion",
        }],
    )

    print(response.choices[0].message.content)

Note: Kimi K2.6 is the default model at kimi.com for paid tiers. For API access, get your key at platform.moonshot.cn. Open weights are available at HuggingFace under a modified MIT license.
API PRICING

Token Billing Explained

"Kimi pricing" can mean different things depending on how you use it: the consumer app (monthly membership), developer API access (token-based billing), or open-weight model usage via third-party providers. Membership and API costs are never bundled — they're always separate.

Input Tokens
$0.50 – $0.60 / 1M
Your prompt, system instructions, conversation history, retrieved documents. Cheaper because inputs can be cached more easily.
Output Tokens
$2.50 – $2.80 / 1M
The model's generated response. Priced higher because generating tokens requires more compute and is harder to cache or optimize.
Reference sources: OpenRouter listing for Kimi K2.5 shows ~$0.50/M input · $2.80/M output. ArtificialAnalysis lists Kimi K2 at $0.60/M input · $2.50/M output. These are market snapshots — always verify on the billing page you'll actually use, as prices change.
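
As a rough illustration of how the two reference ranges combine, the sketch below turns a request's token counts into a low and high cost estimate. The rates are the market snapshots quoted above, not official prices, and the function name and the 800-in / 500-out request are assumptions chosen for the example.

```python
# Reference rates in USD per 1M tokens, copied from the ranges above.
# These are market snapshots (an assumption for planning), not official prices.
INPUT_RATE_RANGE = (0.50, 0.60)
OUTPUT_RATE_RANGE = (2.50, 2.80)


def request_cost_range(input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return the (low, high) USD cost of one request at the reference rates."""
    low = (input_tokens * INPUT_RATE_RANGE[0]
           + output_tokens * OUTPUT_RATE_RANGE[0]) / 1_000_000
    high = (input_tokens * INPUT_RATE_RANGE[1]
            + output_tokens * OUTPUT_RATE_RANGE[1]) / 1_000_000
    return low, high


# Hypothetical request: 800 input tokens, 500 output tokens.
low, high = request_cost_range(input_tokens=800, output_tokens=500)
print(f"${low:.5f} to ${high:.5f} per request")
```

Even at the low end, the 500 output tokens cost roughly three times as much as the 800 input tokens, which is why output length tends to matter more than prompt length.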

How "Kimi" is accessed (and priced)

Region note: Kimi publishes RMB pricing for mainland China (e.g. ¥49/mo and ¥99/mo). International plans are quoted in USD. Don't assume prices are identical across regions — different SKUs, taxes, and promotions apply.

COST CALCULATOR

Estimate Your Monthly API Spend

Rough token rules of thumb: 1,000 tokens ≈ 700–800 English words. A normal chat answer is 200–800 output tokens. A long structured response can be 1,500–3,000+. Long conversation history makes input tokens dominate.

API Cost Estimator
Input Cost: $4.40
Output Cost: $13.25
Total / Month: $17.65
Cost per Request: $0.0018

* Uses midpoint pricing ($0.55 input / $2.65 output) as a starting estimate. Replace with your actual provider's rates. API costs can spike with long outputs — output tokens typically drive the bill.
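
The same estimate can be reproduced offline. This is a minimal sketch using the midpoint rates from the footnote; the function name is hypothetical, and the inputs (10,000 requests per month, 800 input and 500 output tokens each) are an assumption, chosen because they reproduce the sample figures shown in the estimator.

```python
def monthly_api_cost(requests, input_tokens, output_tokens,
                     input_rate=0.55, output_rate=2.65):
    """Estimate monthly API spend in USD.

    Rates are USD per 1M tokens; the defaults are the midpoint reference
    rates, so replace them with your provider's actual prices.
    """
    input_cost = requests * input_tokens * input_rate / 1_000_000
    output_cost = requests * output_tokens * output_rate / 1_000_000
    total = input_cost + output_cost
    return {
        "input_cost": input_cost,
        "output_cost": output_cost,
        "total": total,
        "per_request": total / requests,
    }


# Assumed workload: 10,000 requests/month, 800 tokens in, 500 tokens out.
estimate = monthly_api_cost(requests=10_000, input_tokens=800, output_tokens=500)
for label, value in estimate.items():
    print(f"{label}: ${value:.4f}")
```

With these assumed inputs the result is about $4.40 for input, $13.25 for output, and $17.65 in total, or roughly $0.0018 per request.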

Example workloads

// Customer Support Bot

50K chats/mo · 800 in · 250 out

Input: 40M tokens ≈ $22. Output: 12.5M tokens ≈ $33. Total: ~$55/mo at reference rates.

// Content Generator

2K articles/mo · 2,500 in · 2,000 out

Input: 5M tokens ≈ $2.75. Output: 4M tokens ≈ $10.60. Total: ~$13/mo at reference rates.

// Deep Reasoning Agent

10K tasks/mo · 4,000 in · 1,200 out

Input: 40M tokens ≈ $22. Output: 12M tokens ≈ $31.80. Total: ~$54/mo at reference rates.
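
To see which side of the bill dominates in each example, the sketch below recomputes the three workloads at the midpoint reference rates ($0.55 input, $2.65 output per 1M tokens) and reports the share of spend that comes from output tokens. The `Workload` class and its field names are illustrative, not part of any Kimi SDK.

```python
from dataclasses import dataclass

INPUT_RATE = 0.55   # USD per 1M input tokens (reference midpoint)
OUTPUT_RATE = 2.65  # USD per 1M output tokens (reference midpoint)


@dataclass
class Workload:
    name: str
    requests: int       # requests per month
    input_tokens: int   # tokens sent per request
    output_tokens: int  # tokens generated per request

    def costs(self):
        """Return (input_cost, output_cost) in USD per month."""
        input_cost = self.requests * self.input_tokens * INPUT_RATE / 1_000_000
        output_cost = self.requests * self.output_tokens * OUTPUT_RATE / 1_000_000
        return input_cost, output_cost


WORKLOADS = [
    Workload("Customer Support Bot", 50_000, 800, 250),
    Workload("Content Generator", 2_000, 2_500, 2_000),
    Workload("Deep Reasoning Agent", 10_000, 4_000, 1_200),
]

for w in WORKLOADS:
    input_cost, output_cost = w.costs()
    total = input_cost + output_cost
    share = output_cost / total
    print(f"{w.name}: ${total:.2f}/mo, output share {share:.0%}")
```

In all three cases output tokens account for more than half of the total, even when the prompt is several times longer than the response.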

DECISION GUIDE

Membership vs API: Which Is Right for You?

✦ Pick Membership if you…
  • Use the Kimi app as your main interface daily
  • Run deep research regularly (20 uses/mo on Moderato)
  • Rely on OK Computer-style agent workflows
  • Want priority queue access during peak times
  • Use Kimi Code heavily (quota explicitly listed per tier)
  • Want predictable monthly cost — you know what you'll pay
⬡ Pick API if you…
  • Are building a product or embedding Kimi in an app
  • Need routing, logging, or custom guardrails
  • Have variable or unpredictable request volume
  • Run automations or batch processing pipelines
  • Need programmatic access (custom UI, multi-model routing)
  • Want to pay exactly for usage — no quota management

The "best of both" setup

Many power users do both: membership for personal productivity and deep research inside the Kimi app, and API for their product, internal tools, or high-volume automation. Kimi's own rules page explicitly states API usage fees are not included with membership — reinforcing this as two separate billing tracks.

COST CONTROL

What Drives Kimi API Costs Up

01

Long context windows

Keeping a huge conversation history attached to every request causes input tokens to explode. Each message re-sends the entire prior conversation.

Fix: summarize older context externally, retrieve only what you need per request
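
One simple way to keep input tokens bounded is to cap the history sent with each request. The sketch below drops the oldest turns rather than summarizing them (a summarization step could be substituted), and it estimates tokens from word counts at roughly 750 words per 1,000 tokens. Both helper names are hypothetical, and a real tokenizer would give more accurate counts.

```python
def estimate_tokens(text: str) -> int:
    """Rough estimate, assuming about 750 English words per 1,000 tokens."""
    words = len(text.split())
    return max(1, round(words / 0.75))


def trim_history(messages, max_input_tokens):
    """Keep system messages plus the newest turns that fit the token budget.

    `messages` is a list of {"role": ..., "content": ...} dicts in
    chronological order, as used by OpenAI-compatible chat APIs.
    """
    system = [m for m in messages if m["role"] == "system"]
    turns = [m for m in messages if m["role"] != "system"]

    budget = max_input_tokens - sum(estimate_tokens(m["content"]) for m in system)
    kept = []
    for message in reversed(turns):  # walk from newest to oldest
        cost = estimate_tokens(message["content"])
        if cost > budget:
            break  # everything older than this is dropped
        kept.append(message)
        budget -= cost
    return system + list(reversed(kept))
```

Calling `trim_history(history, max_input_tokens=4000)` before each request keeps the per-request input cost roughly constant instead of growing with every turn.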
02

"Write a 3,000-word answer" requests

Long outputs are expensive because output tokens are priced higher and can't be cached. A 3,000-word answer can cost 4-6× more than a 500-word response.

Fix: generate in parts, request structured outlines first, expand only sections you'll publish
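
A quick worked comparison shows where the multiple comes from. This sketch assumes the midpoint reference rates, an 800-token prompt, and about 750 words per 1,000 tokens; all three are planning assumptions rather than measured values.

```python
INPUT_RATE = 0.55        # USD per 1M input tokens (reference midpoint)
OUTPUT_RATE = 2.65       # USD per 1M output tokens (reference midpoint)
WORDS_PER_TOKEN = 0.75   # assumption: ~750 English words per 1,000 tokens


def answer_cost(prompt_tokens: int, answer_words: int) -> float:
    """USD cost of one request that produces an answer of `answer_words` words."""
    output_tokens = round(answer_words / WORDS_PER_TOKEN)
    return (prompt_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE) / 1_000_000


short_cost = answer_cost(prompt_tokens=800, answer_words=500)
long_cost = answer_cost(prompt_tokens=800, answer_words=3_000)
ratio = long_cost / short_cost
print(f"short ${short_cost:.4f}, long ${long_cost:.4f}, ratio {ratio:.1f}x")
```

Under these assumptions the 3,000-word answer costs about five times as much as the 500-word one. The ratio moves toward 6× as the prompt gets shorter and falls as the prompt gets longer.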
03

Tool calling and research loops

Deep research tasks can call tools repeatedly — each iteration adds tokens and potentially additional tool-use charges depending on the platform.

Fix: use membership's capped deep research quota for this use case; avoid unbounded agent loops in API
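
If an agent loop does run through the API, a hard cap on iterations and total tokens keeps spend bounded. The sketch below is one possible shape for such a guard; `step` is a hypothetical callable standing in for a single tool-calling round, and the default limits are arbitrary.

```python
def run_bounded_loop(step, max_iterations=8, max_total_tokens=50_000):
    """Call `step` until it reports completion or a cap is reached.

    `step(iteration)` is a hypothetical callable that performs one
    tool-calling round and returns (done, tokens_used).
    Returns (reason, iterations_run, total_tokens).
    """
    total_tokens = 0
    for iteration in range(1, max_iterations + 1):
        done, tokens_used = step(iteration)
        total_tokens += tokens_used
        if done:
            return "done", iteration, total_tokens
        if total_tokens >= max_total_tokens:
            return "token_cap", iteration, total_tokens
    return "iteration_cap", max_iterations, total_tokens
```

In practice `tokens_used` could be read from the usage figures the API returns with each response, so the cap reflects billed tokens rather than an estimate.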
04

Vague prompts that cause rework

If your prompt is unclear, you'll regenerate multiple times — every retry is billed. Common with one-liner prompts without format or length constraints.

Fix: tighter prompt templates with clear tone, length, sections, and examples of "good" output
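
A prompt template can be as small as a function that forces every request to state its constraints. The field names and wording below are illustrative only; adapt them to your own workflow.

```python
def build_prompt(task, tone, max_words, sections, example=None):
    """Assemble a prompt that states tone, length, and structure explicitly."""
    lines = [
        f"Task: {task}",
        f"Tone: {tone}",
        f"Length: at most {max_words} words",
        "Sections, in this order:",
    ]
    lines += [f"{i}. {name}" for i, name in enumerate(sections, start=1)]
    if example:
        lines += ["Example of a good answer:", example]
    return "\n".join(lines)


prompt = build_prompt(
    task="Summarize the attached release notes for customers",
    tone="plain and friendly",
    max_words=200,
    sections=["What changed", "Who is affected", "What to do next"],
)
print(prompt)
```

Stating the length limit in the prompt also pairs well with a `max_tokens` cap on the request, so a misread instruction cannot produce an unexpectedly long (and expensive) response.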
USE CASES

Which Plan for Which User?

Students & Casual Users

Quick Q&A, summaries, brainstorming

  • Short writing tasks
  • Research help and explanations
  • Occasional document summarization
Free Adagio tier to start — upgrade only when you hit caps frequently
Creators & Marketers

Long-form drafting, content workflows

  • Regular blog posts and social content
  • Deep research for campaigns
  • Slide and document generation
Moderato ($19/mo) - predictable spend, research quotas fit recurring workflows
Developers & Startups

Building products with Kimi integrated

  • Custom UI, logging, routing
  • Variable or unpredictable request volume
  • Need programmatic access + guardrails
API billing - use OpenRouter/ArtificialAnalysis as planning benchmarks
Agencies & Heavy Automation

Batch content, multi-agent, large docs

  • Measure tokens per task as a COGS line item
  • Cap maximum output length per request
  • Standardize prompt templates across workflows
Allegro or Vivace + API - high-volume membership quotas plus token-level cost control

Ready to Think Bigger?

Start free, upgrade when you need more power. No commitment required.