Back to Models
minimax/minimax-m1
Not Available

MiniMax M1

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it to process long sequences—up to 1 million tokens—while maintaining competitive FLOP efficiency. With 456 billion total parameters and 45.9B active per token, this variant is optimized for complex, multi-step reasoning tasks. Trained via a custom reinforcement learning pipeline (CISPO), M1 excels in long-context understanding, software engineering, agentic tool use, and mathematical reasoning. Benchmarks show strong performance across FullStackBench, SWE-bench, MATH, GPQA, and TAU-Bench, often outperforming other open models like DeepSeek R1 and Qwen3-235B.

6/17/2025
1,000,000 tokens
#78 Text (french)
Specifications

Modalities

Input
text
Output
text

Supported Parameters

frequency_penalty
include_reasoning
max_tokens
presence_penalty
reasoning
repetition_penalty
seed
stop
temperature
tool_choice
tools
top_k
top_p

Max Output Tokens

40,000
Leaderboard
Text
🏆OverallELO: 1,363
#127
🇯🇵JapaneseELO: 1,223
#134
🇨🇳ChineseELO: 1,386
#120
🇰🇷KoreanELO: 1,271
#131
🇬🇧EnglishELO: 1,385
#120
frenchELO: 1,412
#78
germanELO: 1,357
#102
spanishELO: 1,348
#125
russianELO: 1,347
#130
💻CodingELO: 1,416
#120
🧮MathELO: 1,372
#118
✍️Creative WritingELO: 1,317
#139
📝Instruction FollowingELO: 1,345
#132
🌶️Hard PromptsELO: 1,381
#125
💬Multi-TurnELO: 1,357
#131