vllm.model_executor.kernels.linear.mixed_precision.humming ¶
Humming GEMM as a mixed-precision WNA16Int linear kernel.
vllm.model_executor.kernels.linear.mixed_precision.humming ¶Humming GEMM as a mixed-precision WNA16Int linear kernel.