PipelineScore
Moonshot

Kimi K2.7

Released 2026-05-01 · Context 200K · kimi-k2-7
PipelineScore
64.2 · FEEDER
Ranked #9 of 10 models · 10th percentile
An even spread: no standout, no liability (60 to 67 across all five categories). Best-fit profile: Local-first.

Category breakdown

Score per category, normalized 0–100 against the v1 anchor.

Code 60.1
Reason 67.3
Tool Use 62.7
RAG 65.4
Speed 64.8

Strengths

Reason 67.3
RAG 65.4
Speed 64.8

Sample tasks

A taste of what the test pack measures. Full prompts are private and rotated daily.

Code · Difficulty 1 · code-fib-1

Fibonacci function

Write a Python `fib(n)` returning the nth Fibonacci number, O(n).
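For illustration only, here is a minimal sketch of the kind of answer this task asks for. The full prompt is private, so the indexing convention (`fib(0) == 0`, `fib(1) == 1`) and the handling of negative input are assumptions, not the benchmark's reference solution.

```python
def fib(n: int) -> int:
    """Return the nth Fibonacci number in O(n) time and O(1) extra space.

    Assumed convention (not stated in the task summary): fib(0) == 0, fib(1) == 1.
    """
    if n < 0:
        # Assumption: negative indices are rejected rather than extended.
        raise ValueError("n must be non-negative")
    a, b = 0, 1
    # After k iterations, a holds fib(k) and b holds fib(k + 1).
    for _ in range(n):
        a, b = b, a + b
    return a
```

The loop keeps only the last two values, which is what makes the solution linear in `n` without recursion or memoization.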

Reason · Difficulty 1 · reason-math-1

Train meeting time

Two trains, opposite directions, given speeds and start times — when do they meet?
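The arithmetic behind a problem of this shape can be sketched as follows. The task summary gives no numbers, so the setup is an assumption: the trains start `distance` apart on the same line and travel toward each other at constant speeds. The function name and parameters are hypothetical.

```python
def meeting_time(distance: float,
                 speed_a: float, start_a: float,
                 speed_b: float, start_b: float) -> float:
    """Return the clock time (in hours) at which two approaching trains meet.

    Assumed setup: train A leaves one end at start_a, train B leaves the
    other end at start_b, both at constant speed, `distance` apart.
    """
    if distance < 0 or speed_a <= 0 or speed_b <= 0:
        raise ValueError("distance must be >= 0 and speeds must be > 0")
    # While both trains are moving, the gap closes at speed_a + speed_b:
    #   speed_a * (t - start_a) + speed_b * (t - start_b) = distance
    t = (distance + speed_a * start_a + speed_b * start_b) / (speed_a + speed_b)
    if t >= max(start_a, start_b):
        return t
    # Otherwise the earlier train covers the whole distance before the
    # later one departs, so they meet at the later train's station.
    if start_a <= start_b:
        return start_a + distance / speed_a
    return start_b + distance / speed_b
```

With hypothetical values of 300 km, train A at 60 km/h from hour 0, and train B at 90 km/h from hour 1, the call `meeting_time(300, 60, 0, 90, 1)` gives 2.6 hours: A has covered 156 km and B 144 km.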

RAG · Difficulty 2 · rag-extract-1

Extract metrics to JSON

From the context, extract net sales, operating margin, and free cash flow as a JSON object. Numbers only.

Tool Use · Difficulty 2 · tool-schema-1

OpenAPI param selection

Given an OpenAPI schema with limit/offset/sort, fill JSON for 'next 50, recent first.'
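As a rough sketch of what a filled-in request might look like, the helper below builds the parameter object for "next 50, recent first." The real schema is private, so several details are assumptions: the current page position is known, the sort field is called `created_at`, and a leading `-` means descending order. The function name is hypothetical.

```python
import json


def next_page_params(current_offset: int, current_limit: int,
                     page_size: int = 50,
                     sort_field: str = "created_at") -> dict:
    """Build limit/offset/sort parameters for the next page, newest first.

    Assumptions (not from the source): the sort field name and the
    "-field" convention for descending order.
    """
    if current_offset < 0 or current_limit < 0 or page_size <= 0:
        raise ValueError("offsets and limits must be non-negative, page_size positive")
    return {
        "limit": page_size,
        # "Next" means skipping everything already returned.
        "offset": current_offset + current_limit,
        "sort": f"-{sort_field}",
    }


if __name__ == "__main__":
    # Example: the first page (offset 0, limit 50) has already been fetched.
    print(json.dumps(next_page_params(0, 50)))
```

The point of the task is that "next" maps to the offset and "recent first" maps to the sort parameter; the exact value syntax depends on the schema the model is given.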

RAG · Difficulty 2 · rag-grounding-1

Refuses to fabricate

Context lacks the answer — does the model fabricate or correctly say it can't?
