minimax/minimax-m1

Not Available

MiniMax M1

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it to process long sequences—up to 1 million tokens—while maintaining competitive FLOP efficiency. With 456 billion total parameters and 45.9B active per token, this variant is optimized for complex, multi-step reasoning tasks. Trained via a custom reinforcement learning pipeline (CISPO), M1 excels in long-context understanding, software engineering, agentic tool use, and mathematical reasoning. Benchmarks show strong performance across FullStackBench, SWE-bench, MATH, GPQA, and TAU-Bench, often outperforming other open models like DeepSeek R1 and Qwen3-235B.

6/17/2025

1,000,000 tokens

#78 Text (french)

Specifications

Modalities

Input

text

Output

text

Supported Parameters

frequency_penalty

include_reasoning

max_tokens

presence_penalty

reasoning

repetition_penalty

seed

stop

temperature

tool_choice

tools

top_k

top_p

Max Output Tokens

40,000

Leaderboard

Text

🏆OverallELO: 1,363

#127

🇯🇵JapaneseELO: 1,223

#134

🇨🇳ChineseELO: 1,386

#120

🇰🇷KoreanELO: 1,271

#131

🇬🇧EnglishELO: 1,385

#120

frenchELO: 1,412

#78

germanELO: 1,357

#102

spanishELO: 1,348

#125

russianELO: 1,347

#130

💻CodingELO: 1,416

#120

🧮MathELO: 1,372

#118

✍️Creative WritingELO: 1,317

#139

📝Instruction FollowingELO: 1,345

#132

🌶️Hard PromptsELO: 1,381

#125

💬Multi-TurnELO: 1,357

#131