vllm_gaudi.lora.layers.hpu_row_parallel_linear
¶
HPU-specific LoRA layer for RowParallelLinear.
This module registers HPU-aware LoRA wrappers that can handle HPURowParallelLinear instances created via OOT registration.
HPURowParallelLinearWithLoRA
¶
Bases: RowParallelLinearWithLoRA
LoRA wrapper that can handle HPURowParallelLinear instances.
Source code in vllm_gaudi/lora/layers/hpu_row_parallel_linear.py
can_replace_layer
classmethod
¶
can_replace_layer(
source_layer: Module,
lora_config: LoRAConfig,
packed_modules_list: list,
model_config: PretrainedConfig | None = None,
) -> bool
Source code in vllm_gaudi/lora/layers/hpu_row_parallel_linear.py
HPURowParallelLinearWithShardedLoRA
¶
Bases: RowParallelLinearWithShardedLoRA
Sharded LoRA wrapper that can handle HPURowParallelLinear instances.
Source code in vllm_gaudi/lora/layers/hpu_row_parallel_linear.py
can_replace_layer
classmethod
¶
can_replace_layer(
source_layer: Module,
lora_config: LoRAConfig,
packed_modules_list: list,
model_config: PretrainedConfig | None = None,
) -> bool
Source code in vllm_gaudi/lora/layers/hpu_row_parallel_linear.py
register_hpu_lora_layers
¶
Register HPU LoRA layers in the global lora_utils._all_lora_classes set.
This must be called before LoRA model creation to ensure HPU layers are checked before upstream layers.