vllm_gaudi.ops
Modules:
| Name | Description |
|---|---|
| `causal_conv1d_pytorch` | PyTorch reference implementation for the causal conv1d kernels. |
| `hpu_attention` | |
| `hpu_awq` | |
| `hpu_compressed_tensors` | |
| `hpu_conv` | |
| `hpu_fp8` | |
| `hpu_fused_moe` | |
| `hpu_gptq` | |
| `hpu_grouped_topk_router` | |
| `hpu_layernorm` | |
| `hpu_lora` | |
| `hpu_mamba_mixer2` | |
| `hpu_mm_encoder_attention` | |
| `hpu_modelopt` | |
| `hpu_rotary_embedding` | |
| `ops_selector` | Selector module to switch between PyTorch and Triton implementations. |
| `pytorch_implementation` | |
| `ssd_combined` | |
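
The index above describes `ops_selector` only as a switch between PyTorch and Triton implementations. As a rough illustration of that general pattern, and not of the module's actual API, the sketch below keeps a registry of backends and falls back to the reference one when the preferred backend is not registered. Every name here (`register`, `select`, `EXAMPLE_OPS_BACKEND`) is hypothetical, and the kernel is a pure-Python stand-in for a causal 1-D convolution rather than real PyTorch or Triton code.

```python
import os
from typing import Callable, Dict, List, Optional

Kernel = Callable[[List[float], List[float]], List[float]]

# Hypothetical registry; the real ops_selector may be organized differently.
KNOWN_BACKENDS = ("pytorch", "triton")
_IMPLEMENTATIONS: Dict[str, Kernel] = {}


def register(name: str) -> Callable[[Kernel], Kernel]:
    """Register a kernel under a backend name."""
    def decorator(fn: Kernel) -> Kernel:
        _IMPLEMENTATIONS[name] = fn
        return fn
    return decorator


@register("pytorch")
def reference_causal_conv1d(x: List[float], weight: List[float]) -> List[float]:
    """Pure-Python stand-in for a reference causal conv1d.

    Output at step t uses only x[t-k+1..t]; the input is left-padded
    with zeros so no future sample leaks into the result.
    """
    k = len(weight)
    padded = [0.0] * (k - 1) + list(x)
    return [
        sum(w * padded[t + i] for i, w in enumerate(weight))
        for t in range(len(x))
    ]


def select(name: Optional[str] = None, fallback: str = "pytorch") -> Kernel:
    """Pick a backend by name, environment variable, or fallback.

    EXAMPLE_OPS_BACKEND is an assumed variable name for this sketch only.
    """
    name = name or os.environ.get("EXAMPLE_OPS_BACKEND", fallback)
    if name not in KNOWN_BACKENDS:
        raise ValueError(f"unknown backend: {name!r}")
    # A known backend that was never registered (for example, because its
    # dependency is missing) falls back to the reference implementation.
    return _IMPLEMENTATIONS.get(name, _IMPLEMENTATIONS[fallback])
```

Calling `select("triton")` in this sketch returns the reference kernel because no Triton stand-in is registered, while an unrecognized name raises `ValueError` instead of silently falling back.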