Skip to content

vllm.models.minimax_m3.nvidia

Modules:

  • model

    Inference-only MiniMax M3 (text backbone) model.

  • mtp
  • sparse_attention_msa

    MSA (SM100/Blackwell) block-sparse attend for MiniMax M3.