Skip to content

vllm_omni.diffusion.attention.parallel.factory ¶

logger `module-attribute` ¶

logger = init_logger(__name__)

build_parallel_attention_strategy ¶

build_parallel_attention_strategy(
    *, scatter_idx: int, gather_idx: int, use_sync: bool
) -> ParallelAttentionStrategy

Select a parallel attention strategy based on current diffusion config.

Design principle: - Attention kernel backend selection remains in attention/selector.py. - Parallel attention selection is handled here, based on distributed config and initialized process groups.