Bases: OmniPlatform, CudaPlatformBase
CUDA/GPU implementation of OmniPlatform (default).
Inherits all CUDA-specific implementations from vLLM's CudaPlatform, and adds Omni-specific interfaces from OmniPlatform.
get_default_ir_op_priority(
vllm_config: VllmConfig,
) -> IrOpPriorityConfig
Copied from vllm/platforms/cuda/platform.py v0.20.0 with force using vllm_c kernels
get_default_stage_config_path() -> str
get_device_capability(
device_id: int = 0,
) -> DeviceCapability | None
get_device_count() -> int
get_device_name(device_id: int = 0) -> str
get_device_version() -> str | None
get_diffusion_attn_backend_cls(
selected_backend: str | None, head_size: int
) -> str
get_free_memory(device: device | None = None) -> int
get_omni_ar_worker_cls() -> str
get_omni_generation_worker_cls() -> str
get_torch_device(local_rank: int | None = None) -> device
has_flash_attn_package() -> bool
supports_torch_inductor() -> bool