Back to Models
inception/mercury
Not Available
Mercury
Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here.
6/26/2025
128,000 tokens
#168 Text (Coding)
Specifications
Modalities
Input
text
Output
text
Supported Parameters
frequency_penalty
max_tokens
presence_penalty
response_format
stop
structured_outputs
temperature
tool_choice
tools
top_k
top_p
Max Output Tokens
16,384Leaderboard
Text
πOverallELO: 1,306
#198π¬π§EnglishELO: 1,326
#195π»CodingELO: 1,367
#168βοΈCreative WritingELO: 1,221
#233πInstruction FollowingELO: 1,270
#210πΆοΈHard PromptsELO: 1,321
#188π¬Multi-TurnELO: 1,300
#187More from Inception