Long-horizon coding
Text + image + video
Thinking + tools
Officially Released · Moonshot AI

Lumen AI: Chat with Kimi K2.6 for Free

Lumen AI is a third-party interface for Kimi K2.6, Moonshot AI's latest coding agent with 1T-parameter MoE, 256K context, native multimodal, and thinking mode.

Lumen AI — Kimi K2.6

256K context · multimodal · thinking · agent swarm

Hi, I'm Lumen AI powered by Kimi K2.6. Ask me about long-horizon coding, image or video understanding, or tool-driven agent flows.
Ctrl/Cmd + Enter to send/chat
Try a prompt

Launch Coverage

What People Are Saying About Kimi K2.6

Recent YouTube reviews, launch reactions, and hands-on demos covering Kimi K2.6 after release.

Kimi K2.6 Is HERE – Is This the BEST Open Source Model Yet?

Bijan Bowen revisits Kimi K2.6 after release and asks whether it has become the strongest open-source coding model for real developer workflows.

Bijan Bowen

Meet Kimi K2.6: Advancing Open-Source Coding

Moonshot's official Kimi AI channel introduces K2.6 with a short launch clip focused on open-source coding progress.

Kimi AI

First Look at Kimi K2.6: An Open Source SOTA Model that Really Beat Opus?

Onchain AI Garage breaks down K2.6's early benchmarks and asks whether the open-source SOTA claims hold up against top proprietary models.

Onchain AI Garage

Kimi K2.6: NEW Open Source Model BEATS Claude & GPT-5.4!

Universe of AI frames K2.6 as a new open-source challenger and compares it directly with Claude and GPT-5.4 on coding-heavy claims.

Universe of AI

Kimi 2.6 + Kimi Code CLI Just Dropped and It Rivals Claude Code

Income stream surfers looks at the newer Kimi 2.6 plus Kimi Code CLI workflow and compares it with the Claude Code experience.

Income stream surfers

Kimi K2 6 Isn’t AI… It’s a Full Time Engineer Now 🔥

Codedigipt frames Kimi K2.6 as a model that behaves more like a practical full-time engineering partner than a simple coding assistant.

Codedigipt

Kimi K2.6 Key Capabilities

Long-horizon coding

Native text, image, and video

Thinking mode with tool use

Agent Swarm at scale

What Moonshot AI highlights in Kimi K2.6

Long-horizon coding
Long-horizon coding

12+ hours of continuous execution

Moonshot reports Kimi K2.6 running 4,000+ tool calls over 12 hours of continuous execution to optimize a Zig inference engine (~193 tokens/sec, 20% faster than LM Studio), and a 13-hour run that delivered a 185% median throughput lift on the exchange-core financial engine.

  • 58.6% SWE-Bench Pro
  • 66.7% Terminal-Bench 2.0
  • Stronger instruction following and self-correction
  • Partners report >50% Next.js benchmark gains (Vercel)
Architecture

Kimi K2.6 Mixture of Experts

Kimi K2.6 is a sparse Mixture-of-Experts model: 1T total parameters, 32B activated per token, 384 routed + 1 shared expert, 8 experts per token, 61 layers, MLA attention, SwiGLU, 160K vocab, and a 400M MoonViT vision encoder. OpenAI-compatible API at api.moonshot.ai/v1, open weights on HuggingFace.

K2.6
Chat
OpenAI SDK
Images
Video
Thinking
Tools
1T
Total Parameters
32B
Activated per Token
256K
Context Length
384 + 1
Experts

Open-weight under Modified MIT at moonshotai/Kimi-K2.6 on HuggingFace. Native INT4 quantization. Official vLLM, SGLang, and KTransformers deployment (Transformers >= 4.57.1).

Benchmark Results

Kimi K2.6 Benchmarks

Official numbers from Moonshot's Kimi K2.6 blog, compared against GPT-5.4 and Claude Opus 4.6.

SWE-Bench Pro

Real-world software engineering issues

Coding
Kimi K2.60%
GPT-5.457.7%
Claude Opus 4.653.4%

Terminal-Bench 2.0

Shell and terminal task completion

Coding
Kimi K2.60%
GPT-5.465.4%
Claude Opus 4.665.4%

AIME 2026

Competition-level mathematics

Mathematics
Kimi K2.60%
GPT-5.499.2%
Claude Opus 4.696.7%

GPQA-Diamond

Graduate-level science reasoning

Reasoning
Kimi K2.60%
GPT-5.492.8%
Claude Opus 4.691.3%

BrowseComp

Long-horizon web browsing agents

Agentic
Kimi K2.60%
GPT-5.482.7%
Claude Opus 4.683.7%

HLE-Full w/ tools

Humanity's Last Exam with tools

Agentic
Kimi K2.60%
GPT-5.452.1%
Claude Opus 4.653%

Headline Results

58.6%
SWE-Bench Pro
Real-world coding
96.4%
AIME 2026
Competition math
54.0%
HLE-Full w/ tools
Agentic reasoning
300
Agent Swarm
Sub-agents per run
FAQ

Kimi K2.6 — quick answers

Common questions about the Kimi K2.6 release, API, open weights, and how it compares with Kimi K2.5.