vllm_omni.entrypoints.omni_base ¶
OutputMessageHandleResult module-attribute ¶
OutputMessageHandleResult = (
tuple[Literal[True], None, None, None]
| tuple[Literal[False], str, int, ClientRequestState]
)
OmniBase ¶
Bases: PDDisaggregationMixin
Shared runtime foundation for AsyncOmni and Omni.
default_sampling_params_list instance-attribute ¶
engine instance-attribute ¶
engine = AsyncOmniEngine(
model=model,
init_timeout=init_timeout,
stage_init_timeout=stage_init_timeout,
diffusion_batch_size=diffusion_batch_size,
transfer_emitter=transfer_metrics,
log_stats=log_stats,
**kwargs,
)
errored property ¶
errored: bool
Whether the engine is in a non-recoverable error state.
True when the orchestrator thread is dead or any stage client has been marked dead (e.g. diffusion worker OOM / process death).
Checks both _engine_dead (StageDiffusionClient) and resources.engine_dead (StageEngineCoreClient / AsyncMPClient) since the two client types store the flag differently.
mod_metrics instance-attribute ¶
mod_metrics = OmniModalityMetrics(
model_name=model, log_stats=log_stats
)
prom_metrics instance-attribute ¶
prom_metrics = OmniPrometheusMetrics(
model_name=model, log_stats=log_stats
)
stage_configs property ¶
stage_configs: list
Expose engine stage configs for PD disaggregation detection and validation.
transfer_metrics instance-attribute ¶
transfer_metrics = OmniTransferMetrics(
model_name=model, log_stats=log_stats
)
from_cli_args classmethod ¶
from_cli_args(
args: TrackingNamespace, model: str | None = None
) -> OmniBase
Build from a TrackingNamespace parsed by TrackingArgumentParser. Only args that are explicitly passed to parse_args are forwarded.
resolve_sampling_params_list ¶
resolve_sampling_params_list(
sampling_params_list: Sequence[Any] | Any | None,
allow_delta_coercion: bool = False,
) -> Sequence[Any]
start_profile ¶
Start profiling specified stages.
Uses vLLM-compatible profile(is_start=True, profile_prefix) interface.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
profile_prefix | str | None | Optional prefix for the trace file names. | None |
stages | list[int] | None | List of stage IDs to profile. If None, profiles all stages. | None |
Returns:
| Type | Description |
|---|---|
list[Any] | List of results from each stage. |