Kimi AI Pricing 2026: Plans, Membership Cost & API Token Rates

Feature	Adagio $0 / mo	Moderato $19 / mo	Allegretto $39 / mo	Allegro $99 / mo	Vivace $199 / mo
Agent
Agent Usage	6	60	150	360	720
Concurrent Tasks	1	2 tasks	2 tasks	4 tasks	4 tasks
Priority Queue	—	4× speed	4× speed	4× speed	4× speed
Agent Swarm
Agent Swarm Access	—	—	50 uses / mo	120 uses / mo	240 uses / mo
Concurrent Subagents	—	—	4 subagents	4 subagents	8 subagents
Kimi Code
Kimi Code Credits	—	1× credits	5× credits	15× credits	30× credits
Kimi Claw
Kimi Claw Cloud	—	—	✓	✓	✓
Kimi Claw Android	—	—	✓	✓	✓
Kimi Claw PC (Mac ARM)	—	—	✓	✓	✓
Group Chat with Claw	—	—	10 groups	10 groups	10 groups
Professional Data
Pro Data Requests	200	2,000	5,000	12,000	24,000
Tools
Word / Excel / Slides	✓	✓	✓	✓	✓
Deep Research	—	✓	✓	✓	✓
Websites Deploy	—	✓	✓	✓	✓
Website with Database	—	✓	✓	✓	✓
Slides Visual Mode	—	✓	✓	✓	✓
Research Preview	—	✓	✓	✓	✓

COMPARISON

Kimi AI vs Other AI Tools

Tool	Typical Paid Plan	Best For	What Stands Out
Kimi AI (Moderato)	$19/mo	Design→code + agentic tasks	Monthly quotas for Deep Research + OK Computer + Kimi Code. API fees not included.
ChatGPT Plus	$20/mo	All-rounder (writing, coding, images, tools)	Strong general assistant + broad feature set in one place
Claude Pro	$20/mo or $17 annual	Writing + coding + long context work	Great for documents, structured writing, and project-style workflows
Google Gemini AI Plus	$7.99/mo	Cheaper upgrade in Google ecosystem	Often bundled with storage + Gemini features in Google apps
Google Gemini AI Pro	$19.99/mo	Higher limits + creator tools	More access to advanced Gemini + credits/tools depending on region
Perplexity Pro	$20/mo or $200/yr	Research with citations / browsing	Best "answer + sources" experience for web research
Microsoft 365 + Copilot	$19.99/mo	Word/Excel/PowerPoint productivity	Copilot inside Microsoft apps + Office suite bundle
Poe (multi-model)	from $4.99/mo	Trying many models cheaply	One subscription to access multiple model providers via points

What you're really paying for

// 01

Kimi vs the "$20 club"

Kimi's Moderato tier is priced near common premium plans, but it's purpose-built around work quotas — Deep Research, OK Computer, Kimi Code. API fees are separate. If your workflow is design→code and agent tasks, Kimi feels more specialized than general chat plans.

// 02

Best all-in-one assistant

ChatGPT Plus ($20/mo) is typically the easiest "one subscription that does a bit of everything" choice. If you do mixed tasks — writing, coding, images, file work — it's usually the most balanced.

// 03

Heavy writing & long docs

Claude Pro ($20/mo) is often chosen when your workflow is heavy on documents, writing quality, and structured outputs.

// 04

Inside Google Workspace

Gemini/Google AI plans are great value if you already use Google storage and apps. There's a cheaper AI Plus and a higher AI Pro tier depending on your region.

// 05

Research with citations

Perplexity Pro ($20/mo) is the best deal when you care about citations, browsing, and fast research summaries.

// 06

Try many models cheaply

Poe is the budget option if your goal is to try lots of models without paying each company $20/month separately.

NEW · APR 2026

Kimi K2.6 Pricing

Kimi K2.6 is Moonshot AI's most capable open-source model, built for long-horizon coding, frontend design generation, 300-agent swarms, and native multimodal workflows. Unlike a standalone product, K2.6 is accessed through your existing Kimi membership or directly via token-based API billing — making it available at every price point from free to enterprise.

App Membership

Starts at $19/month (Moderato) and gives you K2.6 inside the Kimi chat interface with agent credits, Deep Research, Kimi Code access, and Slides and Websites tools included. Higher tiers — Allegretto ($39), Allegro ($99), and Vivace ($199) — unlock Agent Swarm with up to 300 parallel subagents, more Kimi Code credits, Kimi Claw cloud deployment, and significantly larger Professional Data quotas.

API Access

Token-based and billed separately from membership. Reference pricing sits around $0.55 per million input tokens and $2.65 per million output tokens, making K2.6 one of the most cost-competitive frontier models for developers building at scale. The API is fully OpenAI-compatible — swap in model: "kimi-k2.6" and you're running the latest model in any existing workflow.

Open Weights

Available on HuggingFace under a modified MIT license, free to download and self-host with frameworks like vLLM, SGLang, or KTransformers — ideal for privacy-focused teams and AI researchers who need full infrastructure control.

Whether you're a daily user who wants smarter agent workflows, a developer building a product on top of K2.6's coding and design capabilities, or an enterprise team looking to self-host a trillion-parameter model, there's a pricing path designed for your workload.

Explore Plans → Learn About K2.6

Best for daily users

Moderato

$19/mo

K2.6 in chat + agent tasks, Deep Research, Kimi Code 1×

Best for agentic work

Allegretto

$39/mo

Agent Swarm + Kimi Claw + Kimi Code 5× credits

Best for developers

API

$0.55/1M in

Token billing · OpenAI-compatible · scales with usage

Best for self-hosting

Open Weights

Free MIT

HuggingFace · vLLM / SGLang · full infra control

LATEST MODEL — MOONSHOT AI

Kimi K2.6 is the most capable model in the Kimi lineup — built for long-horizon coding, coding-driven frontend design, 300-agent swarms, and native multimodal workflows. Access it through any Kimi membership plan above Adagio, or via the API at token-based rates.

1TParameters

32BActive / Token

262KContext

300Max Agents

How to Access Kimi K2.6

🌐

Via kimi.com App

Available on all paid tiers (Moderato and above). Use Kimi K2.6 directly in chat, agent mode, deep research, and coding-driven design workflows.

Open Kimi App →

⚙️

Via API (Token Billing)

OpenAI-compatible API. Token-based billing — pay per input/output token. Separate from membership. Use model: "kimi-k2.6" in your API calls.

API Docs →

🤗

Open Weights (HuggingFace)

Open-source model weights available under a modified MIT license. Deploy locally with vLLM, SGLang, KTransformers, or TensorRT-LLM. Free to download and self-host.

Weights on HuggingFace →

🔀

Third-Party Providers

Also accessible via OpenRouter, Kilo Code, and community integrations. Provider-set pricing may differ from official Moonshot API rates.

Browse OpenRouter →

Kimi K2.6 — Pricing by Access Method

Access Method	Price	What's Included	Best For
App — Moderato	$19/mo	K2.6 in chat + agent tasks, 60 agent credits, Deep Research, Kimi Code 1× credits, Slides, Websites Deploy	Daily users, creators, researchers
App — Allegretto	$39/mo	All Moderato + Agent Swarm (50 uses, 4 subagents), Kimi Code 5× credits, Kimi Claw, 5,000 Pro Data req	Pro users, teams, agentic workflows
App — Allegro	$99/mo	All Allegretto + Agent Swarm (120 uses, 4 subagents), Kimi Code 15× credits, 12,000 Pro Data req	Power users, heavy coders, automation
App — Vivace	$199/mo	All Allegro + Agent Swarm (240 uses, 8 subagents), Kimi Code 30× credits, 24,000 Pro Data req	Agencies, enterprises, bulk automation

API — Input Tokens	~$0.55/1M	Prompt, system instructions, conversation history, retrieved docs. Cheaper — more cacheable.	Developers, builders, automation pipelines
API — Output Tokens	~$2.65/1M	Generated responses. Higher-priced — compute-intensive, harder to cache. Long outputs drive spend.	Developers, builders, automation pipelines
Open Weights	Free (MIT)	Download weights from HuggingFace. Self-host with vLLM / SGLang / KTransformers. Hardware costs apply.	AI researchers, privacy-first enterprises

* API token prices are market reference snapshots (OpenRouter / ArtificialAnalysis). Always verify on your actual billing page. Prices change frequently.

What Makes K2.6 Worth the Upgrade

💻

Long-Horizon Coding

13-hour autonomous sessions. Overhauled an 8-year-old financial engine — 4,000+ lines, 12 strategies, 1,000+ tool calls. Result: +185% throughput.

🎨

Coding-Driven Design

Prompt → Awwwards-level interface with scroll animations, auth layers, and database ops. Full-stack from a single prompt — no designer required.

🤝

Agent Swarm (up to 300)

Dynamically spawns 300 specialized sub-agents executing 4,000 coordinated steps in parallel. Slashes execution time by up to 4.5×.

👁

Native Multimodal

Image and video input natively. Design-to-code from UI screenshots. Generates coherent visual assets, slides, and diagrams alongside code.

API Quickstart — Kimi K2.6

# OpenAI-compatible — works with any OpenAI SDK from openai import OpenAI client = OpenAI( api_key="YOUR_MOONSHOT_API_KEY", base_url="https://api.moonshot.ai/v1" ) response = client.chat.completions.create( model="kimi-k2.6", # ← specify K2.6 explicitly max_tokens=4096, messages=[{ "role": "user", "content": "Build a Next.js landing page with Tailwind and Framer Motion" }] ) print(response.choices[0].message.content)

Note: Kimi K2.6 is the default model at kimi.com for paid tiers. For API access, get your key at platform.moonshot.cn. Open weights are available at HuggingFace under a modified MIT license.

API PRICING

Token Billing Explained

"Kimi pricing" can mean different things depending on how you use it: the consumer app (monthly membership), developer API access (token-based billing), or open-weight model usage via third-party providers. Membership and API costs are never bundled — they're always separate.

Input Tokens

$0.50 – $0.60 / 1M

Your prompt, system instructions, conversation history, retrieved documents. Cheaper because inputs can be cached more easily.

Output Tokens

$2.50 – $2.80 / 1M

The model's generated response. Priced higher because generating tokens requires more compute and is harder to cache or optimize.

Reference sources: OpenRouter listing for Kimi K2.5 shows ~$0.50/M input · $2.80/M output. ArtificialAnalysis lists Kimi K2 at $0.60/M input · $2.50/M output. These are market snapshots — always verify on the billing page you'll actually use, as prices change.

How "Kimi" is accessed (and priced)

Consumer app — chat + tools (deep research, document helpers) at kimi.com, built by Moonshot AI. Priced as flat monthly membership.
Developer/API — token-based billing for products, agents, or custom apps. Membership ≠ API access.
Third-party providers — open-weight models via OpenRouter, Hugging Face, etc. Pricing is set by the provider, not Moonshot directly.

Region note: Kimi publishes RMB pricing for mainland China (e.g. ¥49/mo and ¥99/mo). International plans are quoted in USD. Don't assume prices are identical across regions — different SKUs, taxes, and promotions apply.

Moderato membership: what $19/mo includes

20 deep research uses per month
Up to 2 concurrent deep research tasks
20 OK Computer uses per month
Up to 2 concurrent OK Computer tasks
2,048 Kimi Code requests per week
API usage fees not included — paid separately at token rates above

COST CALCULATOR

Estimate Your Monthly API Spend

Rough token rules of thumb: 1,000 tokens ≈ 700–800 English words. A normal chat answer is 200–800 output tokens. A long structured response can be 1,500–3,000+. Long conversation history makes input tokens dominate.

API Cost Estimator — adjust values then click Calculate

Requests / month

Avg input tokens

Avg output tokens

Input price ($/1M tokens)

Output price ($/1M tokens)

Input Cost

$4.40

Output Cost

$13.25

Total / Month

$17.65

Cost per Request

$0.0018

* Uses midpoint pricing ($0.55 input / $2.65 output) as a starting estimate. Replace with your actual provider's rates. API costs can spike with long outputs — output tokens typically drive the bill.

Example workloads

// Customer Support Bot

50K chats/mo · 800 in · 250 out

Input: 40M tokens ≈ $22. Output: 12.5M tokens ≈ $33. Total: ~$55/mo at reference rates.

// Content Generator

2K articles/mo · 2,500 in · 2,000 out

Input: 5M tokens ≈ $2.75. Output: 4M tokens ≈ $10.60. Total: ~$14/mo at reference rates.

// Deep Reasoning Agent

10K tasks/mo · 4,000 in · 1,200 out

Input: 40M tokens ≈ $22. Output: 12M tokens ≈ $31.80. Total: ~$54/mo at reference rates.

DECISION GUIDE

Membership vs API: Which Is Right for You?

✦ Pick Membership if you…

Use the Kimi app as your main interface daily
Run deep research regularly (20 uses/mo on Moderato)
Rely on OK Computer-style agent workflows
Want priority queue access during peak times
Use Kimi Code heavily (quota explicitly listed per tier)
Want predictable monthly cost — you know what you'll pay

⬡ Pick API if you…

Are building a product or embedding Kimi in an app
Need routing, logging, or custom guardrails
Have variable or unpredictable request volume
Run automations or batch processing pipelines
Need programmatic access (custom UI, multi-model routing)
Want to pay exactly for usage — no quota management

⚡

The "best of both" setup

Many power users do both: membership for personal productivity and deep research inside the Kimi app, and API for their product, internal tools, or high-volume automation. Kimi's own rules page explicitly states API usage fees are not included with membership — reinforcing this as two separate billing tracks.

COST CONTROL

What Drives Kimi API Costs Up

01

Long context windows

Keeping a huge conversation history attached to every request causes input tokens to explode. Each message re-sends the entire prior conversation.

Fix: summarize older context externally, retrieve only what you need per request

02

"Write a 3,000-word answer" requests

Long outputs are expensive because output tokens are priced higher and can't be cached. A 3,000-word answer can cost 4-6× more than a 500-word response.

Fix: generate in parts, request structured outlines first, expand only sections you'll publish

03

Tool calling and research loops

Deep research tasks can call tools repeatedly — each iteration adds tokens and potentially additional tool-use charges depending on the platform.

Fix: use membership's capped deep research quota for this use case; avoid unbounded agent loops in API

04

Vague prompts that cause rework

If your prompt is unclear, you'll regenerate multiple times — every retry is billed. Common with one-liner prompts without format or length constraints.

Fix: tighter prompt templates with clear tone, length, sections, and examples of "good" output

USE CASES

Which Plan for Which User?

Students & Casual Users

Quick Q&A, summaries, brainstorming

Short writing tasks
Research help and explanations
Occasional document summarization

Free Adagio tier to start — upgrade only when you hit caps frequently

Creators & Marketers

Long-form drafting, content workflows

Regular blog posts and social content
Deep research for campaigns
Slide and document generation

Moderato ($19/mo) - predictable spend, research quotas fit recurring workflows

Developers & Startups

Building products with Kimi integrated

Custom UI, logging, routing
Variable or unpredictable request volume
Need programmatic access + guardrails

API billing - use OpenRouter/ArtificialAnalysis as planning benchmarks

Agencies & Heavy Automation

Batch content, multi-agent, large docs

Measure tokens per task as a COGS line item
Cap maximum output length per request
Standardize prompt templates across workflows

Allegro or Vivace + API - high-volume membership quotas plus token-level cost control

Find Your
Perfect Plan

All Plans, Side by Side

Kimi AI vs Other AI Tools

What you're really paying for

Kimi vs the "$20 club"

Best all-in-one assistant

Heavy writing & long docs

Inside Google Workspace

Research with citations

Try many models cheaply

Kimi K2.6 Pricing

How to Access Kimi K2.6

Kimi K2.6 — Pricing by Access Method

What Makes K2.6 Worth the Upgrade

API Quickstart — Kimi K2.6

Token Billing Explained

How "Kimi" is accessed (and priced)

Moderato membership: what $19/mo includes

Estimate Your Monthly API Spend

Example workloads

50K chats/mo · 800 in · 250 out

2K articles/mo · 2,500 in · 2,000 out

10K tasks/mo · 4,000 in · 1,200 out

Membership vs API: Which Is Right for You?

The "best of both" setup

What Drives Kimi API Costs Up

Long context windows

"Write a 3,000-word answer" requests

Tool calling and research loops

Vague prompts that cause rework

Which Plan for Which User?

Quick Q&A, summaries, brainstorming

Long-form drafting, content workflows

Building products with Kimi integrated

Batch content, multi-agent, large docs

Common Questions

Ready to
Think Bigger?

Find YourPerfect Plan

All Plans, Side by Side

Kimi AI vs Other AI Tools

What you're really paying for

Kimi vs the "$20 club"

Best all-in-one assistant

Heavy writing & long docs

Inside Google Workspace

Research with citations

Try many models cheaply

Kimi K2.6 Pricing

How to Access Kimi K2.6

Kimi K2.6 — Pricing by Access Method

What Makes K2.6 Worth the Upgrade

API Quickstart — Kimi K2.6

Token Billing Explained

How "Kimi" is accessed (and priced)

Moderato membership: what $19/mo includes

Estimate Your Monthly API Spend

Example workloads

50K chats/mo · 800 in · 250 out

2K articles/mo · 2,500 in · 2,000 out

10K tasks/mo · 4,000 in · 1,200 out

Membership vs API: Which Is Right for You?

The "best of both" setup

What Drives Kimi API Costs Up

Long context windows

"Write a 3,000-word answer" requests

Tool calling and research loops

Vague prompts that cause rework

Which Plan for Which User?

Quick Q&A, summaries, brainstorming

Long-form drafting, content workflows

Building products with Kimi integrated

Batch content, multi-agent, large docs

Common Questions

Ready toThink Bigger?

Find Your
Perfect Plan

Ready to
Think Bigger?