← Back to Blog

Kimi K2.7 Code Lands in GitHub Copilot: First Open-Weight Model on Microsoft's Coding Platform and What It Does to Your Bill

By Eric Bush · July 3, 2026 · 8 min read

A clean office setup with monitors showing a code editor

What Actually Shipped

On July 2, 2026, Moonshot's Kimi K2.7 Code open-weight model became the first open-weight option in GitHub Copilot's model picker. The model is hosted by GitHub on Microsoft Azure and billed to users by vendor list price on a pay-as-you-go basis, following the existing "premium request" billing model that Copilot uses for GPT-5.5 and Claude tiers.

The rollout hit Copilot Pro, Pro+, and Max users first, accessible from Visual Studio Code 1.127.0. This is a notable milestone: for the first time, developers can use a fully open-weight coding model inside Microsoft's flagship developer tool, without leaving the Copilot billing envelope.

Kimi K2.7 Code by the Numbers

Kimi K2.7 Code carries the Moonshot pricing that made K2 popular: cheap input, competitive output, and strong coding-specific benchmarks. Pricing on Azure via Copilot:

Model Input Output Cache read
Kimi K2.7 Code $0.60/M $2.50/M $0.15/M
Claude Opus 4.8 (Copilot) $3.00/M $15.00/M $0.30/M
GPT-5.5 (Copilot) $1.00/M $6.00/M $0.10/M
Claude Sonnet 5 $2.00/M $10.00/M $0.20/M

On a straight cost basis, Kimi K2.7 Code is 6x cheaper than Claude Opus 4.8 and about 40% cheaper than GPT-5.5 for a mixed workload of input plus output.

Real Impact on Copilot Bills

Copilot Pro ($10/month) includes a fixed premium request quota. Pro+ ($39/month) and Max ($200/month) have progressively higher quotas. Kimi K2.7 Code arrives with a lower per-request multiplier than the frontier models — Microsoft's rate card shows 0.25 requests per Kimi call compared to 1.0 for GPT-5.5 and 3.0 for Claude Opus.

For a typical Pro+ user consuming 500 premium requests per month (their quota), switching from Claude Opus (3.0x multiplier) to Kimi K2.7 Code (0.25x multiplier) translates to 6,000 effective calls per month at the same subscription price — a 12x expansion in usable coding assist volume before overage kicks in.

When Kimi K2.7 Code Actually Wins

Cheaper is not always better. Kimi K2.7 Code excels in specific patterns:

  • Code completion and inline suggestions. The Moonshot benchmarks show K2.7 Code matches Sonnet 5 on completion quality at a fraction of the cost.
  • Repetitive refactors. Rename-across-repo, add null checks, migrate imports — the tasks that dominate day-to-day coding.
  • Codebase search and Q&A. Copilot's chat feature reading through your codebase to answer "where is the auth middleware defined" — Kimi handles this well without needing frontier-model reasoning.

Where Claude Opus 4.8 still earns its 6x premium:

  • Multi-file architectural refactors that require reasoning about entire subsystems.
  • Debugging edge cases with subtle reasoning chains.
  • Producing high-stakes production code where a single bug costs more than months of API savings.

How to Actually Save Money With This Change

Do not just switch your default. The pattern that actually captures the savings is a two-tier routing:

  1. Set Kimi K2.7 Code as your default for inline completion, chat Q&A, and single-file edits.
  2. Manually invoke Claude Opus 4.8 or GPT-5.5 for architectural decisions, multi-file refactors, and hard debugging.

A typical developer running this split hits 80% of calls through Kimi and 20% through frontier models. On a Pro+ subscription, that shifts your effective usage inside quota instead of triggering $0.30–$0.75 in overage per call.

What This Signals

Microsoft embracing an open-weight model inside Copilot is a strategic shift. Two implications for the market:

  • Frontier model pricing gets more pressure. If 80% of Copilot workloads route to Kimi at $0.60 input, Anthropic and OpenAI have less pricing power on their premium tiers.
  • Other open-weight models will follow quickly. Expect DeepSeek V4 and GLM 5.2 in the Copilot picker within 60 days.
  • Copilot's subscription pricing may compress. When users can get 6,000 effective calls on Kimi for $39/month, Cursor's $20/month plan is under renewed pressure and Microsoft may respond with a lower Pro tier.

Recommendation

  • Switch your Copilot default to Kimi K2.7 Code today for inline completion and chat.
  • Keep Claude Opus 4.8 for hard problems — manually invoke it when needed.
  • Track per-task success rate for the first two weeks. If Kimi's quality dips on your specific workflows, you can revert — the cost of the experiment is trivial.

Want to calculate exact costs for your project?

Frequently Asked Questions

What is Kimi K2.7 Code?

An open-weight coding model from Moonshot AI. On July 2, 2026 it became the first open-weight model available in GitHub Copilot's model picker, hosted by GitHub on Microsoft Azure and billed by vendor list price.

How much cheaper is Kimi K2.7 Code than Claude Opus 4.8 in Copilot?

About 6x cheaper on blended input plus output pricing ($0.60/$2.50 vs $3.00/$15.00). Kimi also has a 0.25 premium request multiplier vs Claude Opus's 3.0, meaning 12x more effective calls per Copilot Pro+ subscription quota.

Which Copilot plans get Kimi K2.7 Code?

Pro, Pro+, and Max plans. Rollout is progressive and requires Visual Studio Code 1.127.0.

Should I switch my default Copilot model to Kimi?

For inline completion, chat Q&A, and single-file edits, yes. For multi-file architectural refactors and hard debugging, keep Claude Opus 4.8 as a manual invoke. A two-tier routing pattern captures most of the savings while preserving quality.

Will Anthropic and OpenAI models get cheaper in Copilot as a result?

There is now clear pricing pressure. When 80% of Copilot workloads can route to Kimi at a fraction of Opus pricing, frontier vendors lose leverage. Expect Copilot subscription tiers and possibly frontier model per-token rates to trend down through late 2026.