Model Library/Kimi K2 Instruct 0905

Kimi K2 Instruct 0905

LLM

Reasoning

Open-source trillion-parameter MoE AI model

On-Demand Dedicated 8xH200

Details

Modalities

text

Version

905

Recommended Hardware

8xH200

Estimated Price

Provider

Moonshot AI

Family

Kimi K2

Parameters

1000B

Context

256000 tokens

License

MIT (Modified)

Kimi K2 Instruct: Trillion-Parameter MoE Model

Kimi K2 Instruct is a Mixture-of-Experts language model developed by Moonshot AI featuring advanced agentic capabilities and specialized coding expertise. With an extended context window and strong tool-calling abilities, this model excels at autonomous software development tasks and complex multi-turn interactions.

This template defaults to 32k context for wider compatibility in search

Key Features

Agentic Intelligence - Excels at autonomous decision-making and tool utilization with strong real-time function invocation capabilities
Coding Excellence - Specialized in software engineering tasks with particular strength in frontend development and agent-based coding
Extended Context - Operates with 256K token context window, doubled from previous version for longer documents and conversations
Tool Integration - Native tool-calling capabilities enabling real-time function execution based on user requests
Modified MIT License - Open source with commercial use permissions

Benchmark Performance

Software Engineering:

SWE-Bench Verified: 69.2% accuracy
SWE-Bench Multilingual: 55.9% accuracy
Terminal-Bench: 44.5% accuracy

Results represent mean accuracy over five independent full-test-set runs with controlled evaluation conditions.

Use Cases

Autonomous code generation and debugging
Frontend development with focus on aesthetics and practicality
Agent-based software development workflows
Complex multi-turn technical conversations
Long-document analysis and retrieval
Real-time tool integration for development tasks
Multi-step coding projects requiring planning and execution
Technical documentation generation and analysis

Agentic Architecture

Kimi K2 Instruct's primary strength lies in its agentic capabilities—the ability to autonomously make decisions and utilize tools to accomplish complex tasks. The model can invoke functions in real-time based on user requests, enabling sophisticated workflows where the model independently selects and executes appropriate tools.

This agentic intelligence makes the model particularly effective for software development tasks that require multiple steps, tool integration, and autonomous problem-solving.

Extended Context Processing

The model's 256K token context window—doubled from the previous 128K version—enables handling of extensive codebases, lengthy technical documents, and complex multi-turn conversations. This extended context is crucial for software development tasks that require understanding large amounts of code or maintaining coherence across long interactions.

Mixture-of-Experts Architecture

Kimi K2 Instruct employs a Mixture-of-Experts architecture with 61 layers, 384 expert modules, and Modified Linear Attention (MLA) mechanism. This architecture enables efficient processing while maintaining high performance across diverse tasks.

Deploy Kimi K2 Instruct on Vast.ai to leverage advanced agentic coding capabilities with extended context processing for autonomous software development and complex technical tasks.

Quick Start Guide

Choose a model and click 'Deploy' above to find available GPUs recommended for this model.

Rent your dedicated instance preconfigured with the model you've selected.

Start sending requests to your model instance and getting responses right now.

Related Models

textvision

Kimi K2.5

Kimi K2.5 is an open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed visual and text tokens atop Kimi-K2-Base

text

Kimi K2 Thinking

Open-source trillion-parameter MoE AI model with thinking