Moonshot

Kimi K2.7

Released 2026-05-01 · Context 200K · kimi-k2-7

PipelineScore

64.2 · FEEDER

Ranked #9 of 10 models · 10th percentile

An even spread: no standout, no liability (60 to 67 across all five categories). Best-fit profile: Local-first.

Category breakdown

Score per category, normalized 0–100 against the v1 anchor.

Code

60.1

Reason

67.3

Tool Use

62.7

RAG

65.4

Speed

64.8

Strengths

Reason: 67.3

RAG: 65.4

Speed: 64.8

Sample tasks

A taste of what the test pack measures. Full prompts are private and rotated daily.

Code · Difficulty 1 · code-fib-1

Fibonacci function

Write a Python `fib(n)` returning the nth Fibonacci number, O(n).
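For illustration, here is a minimal sketch of one answer that would satisfy this prompt as stated. The indexing convention (`fib(0) == 0`, `fib(1) == 1`) and the handling of negative input are assumptions; the private prompt may specify something different.

```python
def fib(n: int) -> int:
    """Return the nth Fibonacci number, assuming fib(0) == 0 and fib(1) == 1."""
    if n < 0:
        # Assumption: negative indices are rejected rather than extended.
        raise ValueError("n must be non-negative")
    a, b = 0, 1
    # Each iteration advances the pair (F(k), F(k+1)) by one step.
    for _ in range(n):
        a, b = b, a + b
    return a
```

The loop performs n additions and keeps only two running values, which is what the O(n) requirement asks for.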

Reason · Difficulty 1 · reason-math-1

Train meeting time

Two trains, opposite directions, given speeds and start times — when do they meet?

RAG · Difficulty 2 · rag-extract-1

Extract metrics to JSON

From the context, extract net sales, operating margin, and free cash flow as a JSON object. Numbers only.

Tool Use · Difficulty 2 · tool-schema-1

OpenAPI param selection

Given an OpenAPI schema with limit/offset/sort, fill JSON for 'next 50, recent first.'
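As a rough sketch of what a correct answer might look like, the snippet below builds the parameter object for "next 50, recent first." Only the parameter names `limit`, `offset`, and `sort` come from the task description. The helper `next_page_params`, the starting offset of 0, and the `-created_at` sort value are hypothetical; the real schema would define the actual field names and sort syntax.

```python
import json


def next_page_params(current_offset: int, page_size: int, sort_key: str) -> dict:
    """Build query parameters for the page that follows the current one."""
    return {
        "limit": page_size,
        # "Next" is read here as: skip everything up to the end of the current page.
        "offset": current_offset + page_size,
        # Hypothetical sort syntax: a leading "-" meaning descending order.
        "sort": sort_key,
    }


# Assumes the caller has just viewed the first page of 50 results.
params = next_page_params(current_offset=0, page_size=50, sort_key="-created_at")
print(json.dumps(params))
```

Under these assumptions the request asks for 50 items starting at offset 50, newest first.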

RAG · Difficulty 2 · rag-grounding-1

Refuses to fabricate

Context lacks the answer — does the model fabricate or correctly say it can't?

Compare with

Kimi K2.7 vs Claude Opus 4.7

Kimi K2.7 vs GPT-5.5

Kimi K2.7 vs Gemini 2.5 Pro

Kimi K2.7 vs DeepSeek V4

Kimi K2.7 vs Qwen 3.6 72B