Back to Models
inception/mercury
Not Available

Mercury

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here.

6/26/2025
128,000 tokens
#168 Text (Coding)
Specifications

Modalities

Input
text
Output
text

Supported Parameters

frequency_penalty
max_tokens
presence_penalty
response_format
stop
structured_outputs
temperature
tool_choice
tools
top_k
top_p

Max Output Tokens

16,384
Leaderboard
Text
πŸ†OverallELO: 1,306
#198
πŸ‡¬πŸ‡§EnglishELO: 1,326
#195
πŸ’»CodingELO: 1,367
#168
✍️Creative WritingELO: 1,221
#233
πŸ“Instruction FollowingELO: 1,270
#210
🌢️Hard PromptsELO: 1,321
#188
πŸ’¬Multi-TurnELO: 1,300
#187