All providers
minimax

MiniMax M3 Provider for Claude Code

1M context window with 512K output tokens

Quick start claude-multi add minimax

Use cases

Processing massive codebases in a single context
Full repository refactoring and migration
Long-running agentic tasks with extended tool chains
Multimodal coding with image understanding

MiniMax-M3 is a frontier model with a 1 million token context window and 512K max output. That context size changes what you can do in a single session. You can load entire monorepos, run multi-hour agentic workflows, and let the model build context across hundreds of files without hitting a wall.

Model specs

RoleModelContextMax Output
All rolesMiniMax-M31M512K

The template maps MiniMax-M3 to every role (Opus, Sonnet, Haiku, small/fast). There is only one model, but it handles the full range from quick shell commands to complex multi-step reasoning.

Effort level is set to max and thinking is enabled with REASONING_EFFORT: high. The model gets 32,000 thinking tokens by default, which balances depth against context consumption.

Setup

  1. Create an account at minimax.io and generate an API key
  2. Run the setup command:
Terminal window
claude-multi add minimax
  1. Paste your API key when prompted

The template configures the base URL, model mapping, thinking parameters, and output limits. No manual editing required.

When to pick MiniMax

MiniMax stands out when you need context length. If you work on large codebases, do multi-repository analysis, or run agentic tasks that accumulate a lot of state, the 1M window means fewer compaction cycles and better coherence across long sessions.

The 512K output limit also opens up workflows that other providers cannot match. You can generate entire application scaffolds, produce comprehensive documentation, or get detailed analysis in a single response.

For shorter, more interactive coding sessions, DeepSeek or GLM may offer better cost efficiency at comparable quality.

Pricing details

MiniMax uses pay-per-token billing. Cost scales linearly with usage, and there is no monthly commitment. Check minimax.io for current per-token rates.

  • DeepSeek - strong coding, 1M context, pay-per-token
  • GLM - fixed monthly subscription plan
  • Kimi - competitive pricing, agentic coding
Pricing

Pay-per-token via minimax.io