vllm_omni.platforms.musa ¶
Modules:
| Name | Description |
|---|---|
platform | |
MUSAOmniPlatform ¶
Bases: OmniPlatform, MUSAPlatformBase
MUSA/Moore Threads GPU implementation of OmniPlatform.
Inherits all MUSA-specific implementations from vllm-musa's MUSAPlatformBase, and adds Omni-specific interfaces from OmniPlatform.
get_device_capability classmethod ¶
get_device_capability(
device_id: int = 0,
) -> DeviceCapability | None
Get the compute capability of the MUSA device.
get_diffusion_attn_backend_cls classmethod ¶
Get the diffusion attention backend class path for MUSA platform.
MUSA supports FLASH_ATTN via the mate package, and SDPA as fallback.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
selected_backend | str | None | User-selected backend name (e.g., "FLASH_ATTN", "TORCH_SDPA"). If None, uses platform default. | required |
head_size | int | Attention head size. | required |
Returns:
| Type | Description |
|---|---|
str | Fully qualified class path of the selected backend. |
get_free_memory classmethod ¶
get_free_memory(device: device | None = None) -> int
Get the free memory on the MUSA device.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
device | device | None | Optional device to query. If None, uses current device. | None |
Returns:
| Type | Description |
|---|---|
int | Free memory in bytes. |
get_torch_device classmethod ¶
get_torch_device(local_rank: int | None = None) -> device
Get the torch device for MUSA platform.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
local_rank | int | None | Optional local rank for multi-GPU setups. | None |
Returns:
| Type | Description |
|---|---|
device | torch.device for MUSA GPU. |