MiniMax M3 Officially Open-Sourced: 428 Billion Parameters Native Multimodal Model, Supports Million Contexts
The domestic large model provider MiniMax has officially open-sourced their native multimodal mixture of experts (MoE) model, MiniMax M3, on Hugging Face. It boasts a total of 428 billion parameters, with 23 billion parameters activated per token, natively supporting 1 million ultra-long contexts. The team also released an MXFP8 quantized version, compatible with mainstream frameworks like SGLang and vLLM. The model supports both Thinking/Non-thinking dual inference modes and achieves joint training of text, images, and video during the pre-training phase.
Why it Matters: This marks the first time a domestic large model has simultaneously reached SOTA levels in native multimodal capabilities, million ultra-long contexts, and open-source weights across three dimensions, significantly lowering the barrier for developers to integrate complex multimodal capabilities into AI applications.
#MiniMax #AI #开源 #大模型 #multimodal