vllm.models.minimax_m3.nvidia ¶
Modules:
-
model–Inference-only MiniMax M3 (text backbone) model.
-
mtp– -
sparse_attention_msa–MSA (SM100/Blackwell) block-sparse attend for MiniMax M3.
vllm.models.minimax_m3.nvidia ¶Modules:
model – Inference-only MiniMax M3 (text backbone) model.
mtp – sparse_attention_msa – MSA (SM100/Blackwell) block-sparse attend for MiniMax M3.