Models

FastMetal supports various LLM models for different use cases. View the models available on your dashboard, or query the /models endpoint.

Available Models

Models are configured by your administrator. Use the /models endpoint to list available models:

Model	Description
mistral-voxtral-mini-3b-2507	Japan: Mistral's voxtral-mini-3b-2507
anthropic-claude-opus-4-8	Global: Claude Opus 4.8 - Most intelligent, best for agents and coding
anthropic-claude-opus-4-7	Global: Claude Opus 4.7 - Most intelligent, best for agents and coding
anthropic-claude-opus-4-6	Global: Claude Opus 4.6 - Most intelligent, best for agents and coding
anthropic-claude-sonnet-4-6	Japan: Claude Sonnet 4.6 - Best balance of speed and intelligence
anthropic-claude-haiku-4-5	Japan: Claude Haiku 4.5 - Fastest with near-frontier intelligence
minimax-m2.7	Global: Minimax's M2.7
glm-5	Global: Z.ai's GLM-5
llm-jp-3.1-8x13b-instruct4	Japan: Japanese language model optimized for instruction following
Qwen3-Coder-480B-A35B-Instruct-FP8	Japan: Qwen3-Coder-480B-A35B
Qwen3-Coder-30B-A3B-Instruct	Japan: Qwen3-Coder-30B-A3B-Instruct
gpt-oss-120b	Japan: gpt-oss-120b
minimax-m3	Global: MiniMax's M3
qwen3.6-27b	Global: Qwen 3.6 27B
kimi-k2.6	Global: Moonshot AI's Kimi K2.6
glm-4.7	Global: Z.ai's GLM 4.7
glm-4.7-flash	Global: Z.ai's GLM 4.7 Flash
glm-5.1	Global: Z.ai's GLM 5.1
gemini-flash-lite-free	Global: Google's Gemini Flash Lite - free to try
step-3.5-flash-free	Global: Step 3.5 Flash - free to try
glm-4.5-air-free	Global: Z.ai: GLM 4.5 Air - free to try
random-free	Global: Random - free to try

Pricing is calculated per token. Each model has separate input and output token rates. Check the pricing page or your dashboard for current rates.

For full pricing details, see the pricing page.

Chat/Conversation — Multi-turn dialogue with context

All models support chat/conversation

Text Completion — Single-turn text generation

Text completion support