vllm.compilation.sequence_parallelism
AllReduceRMSNormPattern
¶
Source code in vllm/compilation/sequence_parallelism.py
EmbeddingAllReduceRMSNormPattern
¶
Bases: AllReduceRMSNormPattern
Source code in vllm/compilation/sequence_parallelism.py
get_inputs
¶
Source code in vllm/compilation/sequence_parallelism.py
register
¶
Source code in vllm/compilation/sequence_parallelism.py
LastAllReduceRMSNormPattern
¶
Bases: AllReduceRMSNormPattern
Source code in vllm/compilation/sequence_parallelism.py
get_inputs
¶
Source code in vllm/compilation/sequence_parallelism.py
register
¶
Source code in vllm/compilation/sequence_parallelism.py
MiddleAllReduceRMSNormPattern
¶
Bases: AllReduceRMSNormPattern
Source code in vllm/compilation/sequence_parallelism.py
get_inputs
¶
Source code in vllm/compilation/sequence_parallelism.py
register
¶
Source code in vllm/compilation/sequence_parallelism.py
SequenceParallelismPass
¶
Bases: VllmInductorPass
Source code in vllm/compilation/sequence_parallelism.py
patterns
instance-attribute
¶
__init__
¶
__init__(config: VllmConfig)