vllm.models.minimax_m3.amd ¶ Modules: model – Inference-only MiniMax M3 (text backbone) model — AMD ROCm implementation. mtp – MiniMax M3 MTP (multi-token prediction) draft model -- ROCm/AMD variant. ops – AMD/ROCm fused Triton ops for MiniMax-M3.