vllm_gaudi.v1.worker.hpu_dp_utils
¶
HPUDPMetadata
dataclass
¶
Source code in vllm_gaudi/v1/worker/hpu_dp_utils.py
__init__
¶
__init__(
hidden_states_across_dp: Tensor,
topk_ids_across_dp: Tensor,
topk_weights_across_dp: Tensor,
local_hidden_states: Tensor,
) -> None
make
staticmethod
¶
make(
vllm_config: VllmConfig, num_tokens: int
) -> HPUDPMetadata
Source code in vllm_gaudi/v1/worker/hpu_dp_utils.py
dispatch_hidden_states
¶
Source code in vllm_gaudi/v1/worker/hpu_dp_utils.py
dispatch_tensor
¶
dispatch_tensor(
input,
output: Tensor | None = None,
is_sequence_parallel: bool = False,
) -> Tensor
Source code in vllm_gaudi/v1/worker/hpu_dp_utils.py
get_hpu_dp_metadata
¶
get_hpu_dp_metadata() -> Optional[HPUDPMetadata]
override_hpu_dp_metadata
¶
override_hpu_dp_metadata(
hpu_dp_metadata: Optional[HPUDPMetadata],
)
A context manager that overrides the current HPU DP metadata. This is used to override the HPU DP metadata for a specific forward pass.
Source code in vllm_gaudi/v1/worker/hpu_dp_utils.py
set_hpu_dp_metadata
¶
set_hpu_dp_metadata(
vllm_config: VllmConfig, num_tokens: int
)