vllm.models.minimax_m3.amd

Modules:

  • model

    Inference-only MiniMax M3 (text backbone) model, AMD ROCm implementation.

  • mtp

    MiniMax M3 MTP (multi-token prediction) draft model, AMD ROCm variant.

  • ops

    AMD ROCm fused Triton ops for MiniMax M3.