vllm_gaudi.models.deepseek_v2
¶
_get_hpu_llama_4_scaling
¶
_get_hpu_llama_4_scaling(
original_max_position_embeddings: int,
scaling_beta: float,
positions: Tensor,
) -> Tensor