vllm_omni.diffusion.layers.mot.mot_layernorm ¶
Mixture-of-Tokens (MoT) RMS Normalization layer.
Holds two sets of weights (text / gen) and routes tokens to the appropriate weight based on indices. When text_indices is None the layer degrades to a standard RMSNorm using self.weight (und mode).
MoTRMSNorm ¶
Bases: CustomOp
Mixture-of-Tokens RMS Normalization.
In und mode (text_indices is None), every token is normalised with self.weight – exactly like a vanilla RMSNorm.
In gen mode, text tokens are normalised with self.weight and gen tokens are normalised with self.gen_weight, using a single fused Triton kernel that avoids the gather / scatter overhead.