Skip to content

vllm.models.minimax_m3.common

Modules:

  • indexer

    MiniMax M3 lightning indexer: side cache, metadata, and impl.

  • mm_preprocess
  • ops

    Cross-platform (Triton) kernels for MiniMax M3 sparse attention.

  • sparse_attention

    Main block-sparse GQA attention for MiniMax M3 sparse layers.

  • vision_tower