vllm.v1.worker.gpu.spec_decode.gemma4

Modules:

| Name | Description |
| --- | --- |
| `speculator` | Gemma4 MTP (Multi-Token Prediction) speculator for speculative decoding. |