Models

FastMetal supports various LLM models for different use cases. View the models available on your dashboard, or query the /models endpoint.

Available Models

Models are configured by your administrator. Use the /models endpoint to list available models:

ModelDescription
mistral-voxtral-mini-3b-2507
Japan: Mistral's voxtral-mini-3b-2507
anthropic-claude-opus-4-8
Global: Claude Opus 4.8 - Most intelligent, best for agents and coding
anthropic-claude-opus-4-7
Global: Claude Opus 4.7 - Most intelligent, best for agents and coding
anthropic-claude-opus-4-6
Global: Claude Opus 4.6 - Most intelligent, best for agents and coding
anthropic-claude-sonnet-4-6
Japan: Claude Sonnet 4.6 - Best balance of speed and intelligence
anthropic-claude-haiku-4-5
Japan: Claude Haiku 4.5 - Fastest with near-frontier intelligence
minimax-m2.7
Global: Minimax's M2.7
glm-5
Global: Z.ai's GLM-5
llm-jp-3.1-8x13b-instruct4
Japan: Japanese language model optimized for instruction following
Qwen3-Coder-480B-A35B-Instruct-FP8
Japan: Qwen3-Coder-480B-A35B
Qwen3-Coder-30B-A3B-Instruct
Japan: Qwen3-Coder-30B-A3B-Instruct
gpt-oss-120b
Japan: gpt-oss-120b
minimax-m3
Global: MiniMax's M3
qwen3.6-27b
Global: Qwen 3.6 27B
kimi-k2.6
Global: Moonshot AI's Kimi K2.6
glm-4.7
Global: Z.ai's GLM 4.7
glm-4.7-flash
Global: Z.ai's GLM 4.7 Flash
glm-5.1
Global: Z.ai's GLM 5.1
gemini-flash-lite-free
Global: Google's Gemini Flash Lite - free to try
step-3.5-flash-free
Global: Step 3.5 Flash - free to try
glm-4.5-air-free
Global: Z.ai: GLM 4.5 Air - free to try
random-free
Global: Random - free to try

Model Pricing

Pricing is calculated per token. Each model has separate input and output token rates. Check the pricing page or your dashboard for current rates.

ModelInput / 1M tokensOutput / 1M tokens
mistral-voxtral-mini-3b-2507¥5¥5
anthropic-claude-opus-4-8¥840¥4,200
anthropic-claude-opus-4-7¥840¥4,200
anthropic-claude-opus-4-6¥850¥4,200
anthropic-claude-sonnet-4-6¥550¥2,600
anthropic-claude-haiku-4-5¥16¥870
minimax-m2.7¥53¥220
glm-5¥180¥570
llm-jp-3.1-8x13b-instruct4¥16¥79
Qwen3-Coder-480B-A35B-Instruct-FP8¥32¥263
Qwen3-Coder-30B-A3B-Instruct¥16¥79
gpt-oss-120b¥16¥79
minimax-m3¥50.5126¥202.0504
qwen3.6-27b¥48.5763¥533.7497
kimi-k2.6¥114.4952¥574.1598
glm-4.7¥67.3501¥294.6568
glm-4.7-flash¥10.1025¥67.3501
glm-5.1¥165.0078¥518.5959
gemini-flash-lite-free¥0¥0
step-3.5-flash-free¥0¥0
glm-4.5-air-free¥0¥0
random-free¥0¥0

For full pricing details, see the pricing page.

Model Capabilities

Chat/Conversation — Multi-turn dialogue with context
All models support chat/conversation
Text Completion — Single-turn text generation
Text completion support