vllm.model_executor.layers.fused_moe.router.fused_topk_bias_router ¶
Classes:
-
FusedTopKBiasRouter–Router using fused top-k with e_score_correction_bias.
FusedTopKBiasRouter ¶
Bases: BaseRouter
Router using fused top-k with e_score_correction_bias.
Source code in vllm/model_executor/layers/fused_moe/router/fused_topk_bias_router.py
_compute_routing(hidden_states, router_logits, indices_type, *, input_ids=None) ¶
Compute routing using fused top-k with bias.
Source code in vllm/model_executor/layers/fused_moe/router/fused_topk_bias_router.py
_topk_softplus_sqrt_torch(topk_weights, topk_indices, token_expert_indices, gating_output, renormalize=False, e_score_correction_bias=None, input_tokens=None, hash_indices_table=None, routed_scaling_factor=1.0) ¶
Pure PyTorch fallback for topk_softplus_sqrt (XPU/CPU).