vllm_omni.diffusion.attention.parallel.factory
build_parallel_attention_strategy
build_parallel_attention_strategy(
*, scatter_idx: int, gather_idx: int, use_sync: bool
) -> ParallelAttentionStrategy
Select a parallel attention strategy based on the current diffusion config.
Design principle:
- Attention kernel backend selection remains in attention/selector.py.
- Parallel attention selection is handled here, based on the distributed config and the initialized process groups.
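The separation described above can be illustrated with a minimal, self-contained sketch. It is not the vllm_omni implementation: `ToyParallelConfig`, its `sequence_parallel_size` field, the two strategy classes, the `group_initialized` flag, and `build_strategy_sketch` are all hypothetical names invented for this example. Only the keyword-only parameters `scatter_idx`, `gather_idx`, and `use_sync` mirror the signature documented here.

```python
from dataclasses import dataclass
from typing import Protocol


class ParallelAttentionStrategy(Protocol):
    """Hypothetical stand-in for the real strategy interface."""

    name: str


@dataclass(frozen=True)
class NoParallelAttention:
    """Fallback used when attention runs on a single rank (hypothetical)."""

    name: str = "none"


@dataclass(frozen=True)
class SequenceParallelAttention:
    """Toy strategy that records how tensors would be scattered and gathered."""

    scatter_idx: int
    gather_idx: int
    use_sync: bool
    name: str = "sequence_parallel"


@dataclass(frozen=True)
class ToyParallelConfig:
    # Hypothetical field: number of ranks sharing one attention computation.
    sequence_parallel_size: int = 1


def build_strategy_sketch(
    config: ToyParallelConfig,
    *,
    scatter_idx: int,
    gather_idx: int,
    use_sync: bool,
    group_initialized: bool,
) -> ParallelAttentionStrategy:
    """Pick a strategy from the distributed config and process-group state.

    Kernel backend selection is deliberately absent: in the design described
    above, that decision is made elsewhere.
    """
    # Without a multi-rank config or an initialized group, fall back to no-op.
    if config.sequence_parallel_size <= 1 or not group_initialized:
        return NoParallelAttention()
    return SequenceParallelAttention(
        scatter_idx=scatter_idx,
        gather_idx=gather_idx,
        use_sync=use_sync,
    )
```

In this sketch the factory only inspects parallelism settings and process-group state, so changing the attention kernel backend would not require touching it, which is the point of the design principle.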