vllm_omni.config.omni_config ¶
Structured vLLM-Omni configuration classes.
This module is additive for Phase 2 of RFC #4021. VllmOmniConfig.from_registry builds the structured view directly from the pipeline registry and deploy config so parity can be proven before later PRs cut consumers over to these classes.
StageConfigType module-attribute ¶
StageConfigType: TypeAlias = (
VllmOmniARStageConfig
| VllmOmniGenerationStageConfig
| VllmOmniDiffusionStageConfig
)
BaseVllmOmniStageConfig ¶
Common structured config contract shared by all Omni stage realizations.
cache_config class-attribute instance-attribute ¶
cache_config: OmniStageCacheConfig = field(
default_factory=OmniStageCacheConfig
)
connector_config class-attribute instance-attribute ¶
connector_config: OmniStageConnectorConfig = field(
default_factory=OmniStageConnectorConfig
)
load_config class-attribute instance-attribute ¶
load_config: OmniStageLoadConfig = field(
default_factory=OmniStageLoadConfig
)
model_config class-attribute instance-attribute ¶
model_config: OmniStageModelConfig = field(
default_factory=OmniStageModelConfig
)
parallel_config class-attribute instance-attribute ¶
parallel_config: OmniStageParallelConfig = field(
default_factory=OmniStageParallelConfig
)
quantization_config class-attribute instance-attribute ¶
runtime_config class-attribute instance-attribute ¶
runtime_config: OmniStageRuntimeConfig = field(
default_factory=OmniStageRuntimeConfig
)
scheduler_config class-attribute instance-attribute ¶
scheduler_config: OmniStageSchedulerConfig = field(
default_factory=OmniStageSchedulerConfig
)
OmniStageCacheConfig ¶
Per-stage engine cache and memory behavior.
This is separate from _DiffusionConfigProjection.cache_config, which configures vLLM-Omni diffusion-specific cache backends such as TeaCache and Cache-DiT.
OmniStageConnectorConfig ¶
OmniStageDiffusionParallelConfig ¶
Bases: OmniStageParallelConfig
Diffusion-stage distributed parallelism behavior.
cfg_parallel_size class-attribute instance-attribute ¶
cfg_parallel_size: int = Field(default=1, ge=1, le=3)
hsdp_replicate_size class-attribute instance-attribute ¶
hsdp_replicate_size: int = Field(default=1, ge=1)
OmniStageLoadConfig ¶
Per-stage loading behavior.
OmniStageModelConfig ¶
Per-stage model behavior.
active_stream_window class-attribute instance-attribute ¶
active_stream_window: int = Field(default=0, ge=0)
compilation_config class-attribute instance-attribute ¶
default_sampling_params class-attribute instance-attribute ¶
enable_flashinfer_autotune class-attribute instance-attribute ¶
enable_flashinfer_autotune: bool | None = None
enable_multithread_weight_load class-attribute instance-attribute ¶
enable_multithread_weight_load: bool = True
num_weight_load_threads class-attribute instance-attribute ¶
num_weight_load_threads: int = Field(default=4, ge=1)
subtalker_sampling_params class-attribute instance-attribute ¶
OmniStageParallelConfig ¶
Common per-stage distributed parallelism behavior.
data_parallel_size class-attribute instance-attribute ¶
data_parallel_size: int = Field(default=1, ge=1)
pipeline_parallel_size class-attribute instance-attribute ¶
pipeline_parallel_size: int = Field(default=1, ge=1)
OmniStageRuntimeConfig ¶
Per-stage process placement and runtime behavior.
OmniStageSchedulerConfig ¶
VllmOmniARStageConfig ¶
VllmOmniConfig ¶
Top-level structured Omni config built once from registry inputs.
orchestrator_config class-attribute instance-attribute ¶
orchestrator_config: VllmOmniOrchestratorConfig = field(
default_factory=VllmOmniOrchestratorConfig
)
VllmOmniDiffusionStageConfig ¶
Bases: BaseVllmOmniStageConfig
Structured config for diffusion stages.
diffusion_config class-attribute instance-attribute ¶
diffusion_config: _DiffusionConfigProjection = field(
default_factory=_DiffusionConfigProjection
)
parallel_config class-attribute instance-attribute ¶
parallel_config: OmniStageDiffusionParallelConfig = field(
default_factory=OmniStageDiffusionParallelConfig
)
VllmOmniGenerationStageConfig ¶
VllmOmniOrchestratorConfig ¶
Configuration consumed by the orchestrator process only.
omni_dp_size_local class-attribute instance-attribute ¶
omni_dp_size_local: int = Field(default=1, ge=1)
omni_heartbeat_timeout class-attribute instance-attribute ¶
omni_heartbeat_timeout: float = Field(default=30.0, gt=0.0)
shm_threshold_bytes class-attribute instance-attribute ¶
shm_threshold_bytes: int = Field(default=65536, ge=0)