Models
FastMetal supports various LLM models for different use cases. View the models available on your dashboard, or query the /models endpoint.
Available Models
Models are configured by your administrator. Use the /models endpoint to list available models:
| Model | Description |
|---|---|
mistral-voxtral-mini-3b-2507 | Japan: Mistral's voxtral-mini-3b-2507 |
anthropic-claude-opus-4-8 | Global: Claude Opus 4.8 - Most intelligent, best for agents and coding |
anthropic-claude-opus-4-7 | Global: Claude Opus 4.7 - Most intelligent, best for agents and coding |
anthropic-claude-opus-4-6 | Global: Claude Opus 4.6 - Most intelligent, best for agents and coding |
anthropic-claude-sonnet-4-6 | Japan: Claude Sonnet 4.6 - Best balance of speed and intelligence |
anthropic-claude-haiku-4-5 | Japan: Claude Haiku 4.5 - Fastest with near-frontier intelligence |
minimax-m2.7 | Global: Minimax's M2.7 |
glm-5 | Global: Z.ai's GLM-5 |
llm-jp-3.1-8x13b-instruct4 | Japan: Japanese language model optimized for instruction following |
Qwen3-Coder-480B-A35B-Instruct-FP8 | Japan: Qwen3-Coder-480B-A35B |
Qwen3-Coder-30B-A3B-Instruct | Japan: Qwen3-Coder-30B-A3B-Instruct |
gpt-oss-120b | Japan: gpt-oss-120b |
minimax-m3 | Global: MiniMax's M3 |
qwen3.6-27b | Global: Qwen 3.6 27B |
kimi-k2.6 | Global: Moonshot AI's Kimi K2.6 |
glm-4.7 | Global: Z.ai's GLM 4.7 |
glm-4.7-flash | Global: Z.ai's GLM 4.7 Flash |
glm-5.1 | Global: Z.ai's GLM 5.1 |
gemini-flash-lite-free | Global: Google's Gemini Flash Lite - free to try |
step-3.5-flash-free | Global: Step 3.5 Flash - free to try |
glm-4.5-air-free | Global: Z.ai: GLM 4.5 Air - free to try |
random-free | Global: Random - free to try |
Model Pricing
Pricing is calculated per token. Each model has separate input and output token rates. Check the pricing page or your dashboard for current rates.
| Model | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| mistral-voxtral-mini-3b-2507 | ¥5 | ¥5 |
| anthropic-claude-opus-4-8 | ¥840 | ¥4,200 |
| anthropic-claude-opus-4-7 | ¥840 | ¥4,200 |
| anthropic-claude-opus-4-6 | ¥850 | ¥4,200 |
| anthropic-claude-sonnet-4-6 | ¥550 | ¥2,600 |
| anthropic-claude-haiku-4-5 | ¥16 | ¥870 |
| minimax-m2.7 | ¥53 | ¥220 |
| glm-5 | ¥180 | ¥570 |
| llm-jp-3.1-8x13b-instruct4 | ¥16 | ¥79 |
| Qwen3-Coder-480B-A35B-Instruct-FP8 | ¥32 | ¥263 |
| Qwen3-Coder-30B-A3B-Instruct | ¥16 | ¥79 |
| gpt-oss-120b | ¥16 | ¥79 |
| minimax-m3 | ¥50.5126 | ¥202.0504 |
| qwen3.6-27b | ¥48.5763 | ¥533.7497 |
| kimi-k2.6 | ¥114.4952 | ¥574.1598 |
| glm-4.7 | ¥67.3501 | ¥294.6568 |
| glm-4.7-flash | ¥10.1025 | ¥67.3501 |
| glm-5.1 | ¥165.0078 | ¥518.5959 |
| gemini-flash-lite-free | ¥0 | ¥0 |
| step-3.5-flash-free | ¥0 | ¥0 |
| glm-4.5-air-free | ¥0 | ¥0 |
| random-free | ¥0 | ¥0 |
For full pricing details, see the pricing page.
Model Capabilities
Chat/Conversation — Multi-turn dialogue with context
All models support chat/conversationText Completion — Single-turn text generation
Text completion support