vllm_gaudi.models.hunyuan_v1
HpuHunYuanAttention
Bases: HunYuanAttention
Source code in vllm_gaudi/models/hunyuan_v1.py
forward
forward(
    positions: Tensor,
    hidden_states: Tensor,
    kv_states: Optional[tuple[Tensor]] = None,
) -> Tensor
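The signature above suggests a subclass that overrides forward while keeping the base class's arguments, with kv_states as an optional input. A minimal sketch of that override pattern follows. It is an illustration only: BaseAttentionSketch and HpuAttentionSketch are hypothetical stand-ins, the Tensor alias is a plain list so the example runs without torch, and the bodies do not reflect what the real HpuHunYuanAttention.forward computes.

```python
from typing import Optional

# Assumption: a plain list stands in for torch.Tensor so the sketch runs anywhere.
Tensor = list[float]


class BaseAttentionSketch:
    """Hypothetical stand-in for the upstream attention base class."""

    def forward(
        self,
        positions: Tensor,
        hidden_states: Tensor,
        kv_states: Optional[tuple[Tensor, ...]] = None,
    ) -> Tensor:
        # Toy behavior: return a copy of the hidden states unchanged.
        return list(hidden_states)


class HpuAttentionSketch(BaseAttentionSketch):
    """Hypothetical device-specific subclass with the same forward signature."""

    def __init__(self) -> None:
        self.used_shared_kv = False

    def forward(
        self,
        positions: Tensor,
        hidden_states: Tensor,
        kv_states: Optional[tuple[Tensor, ...]] = None,
    ) -> Tensor:
        # Device-specific handling would go here. This sketch only records
        # whether the caller supplied precomputed key/value states.
        self.used_shared_kv = kv_states is not None
        return super().forward(positions, hidden_states, kv_states)


attn = HpuAttentionSketch()
out = attn.forward([0.0, 1.0], [0.5, 0.25])
```

Because the override keeps the base signature, callers written against the base class can pass or omit kv_states without changes.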
HpuHunYuanDenseV1ForCausalLM
Bases: HunYuanDenseV1ForCausalLM
HpuHunYuanMoEV1ForCausalLM
Bases: HunYuanMoEV1ForCausalLM