vllm_omni.worker

Modules:

Name                                 Description
base                                 Base worker class for vLLM-Omni with process-scoped GPU memory accounting.
gpu_ar_model_runner                  AR GPU Model Runner for vLLM-Omni.
gpu_ar_worker
gpu_generation_model_runner          Code2Wav GPU Model Runner for vLLM-Omni.
gpu_generation_worker
gpu_memory_utils                     NVML-based per-process GPU memory utilities.
gpu_model_runner
mixins
omni_connector_model_runner_mixin    Unified data-plane communication mixin for Model Runners.
payload_span                         Helpers for explicit thinker decode span metadata.