AI Model

MiniMax: MiniMax M3

MiniMax: MiniMax M3 logoMinimax
Text Generation
Reasoning
Vision
About MiniMax M3

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding, and tool use. It is built on MiniMax Sparse Attention (MSA), which replaces full attention with KV-block selection to cut per-token compute at long context — roughly 1/20 the cost of the previous generation at 1M tokens, with substantially faster prefill and decode while retaining quality across most tasks.

Trained as a native multimodal model on interleaved data and tuned for multi-turn, production-like collaboration via an interactive user-simulator framework, the model is oriented toward sustained, multi-step tasks rather than single-turn execution.

Specifications
Provider
Minimax
Context Length
1,048,576 tokens
Input Types
text, image, video
Output Types
text
Category
Other
Added
5/31/2026

Use MiniMax M3 and 200+ more models

Access all the best AI models in one platform. No API keys, no switching between apps.