Feature × Feature#
The tables below show mutually exclusive features and the support on Ascend hardware, extended from vLLM table.
The symbols used have the following meanings:
✅ = Full compatibility
🟠 = Partial compatibility
❌ = No compatibility
❔ = Unknown or TBD
Feature |
Async Scheduling |
Flashcomm1 |
Layer Sharding |
Lmhead TP |
Mlapo |
Multistream Moe |
Shared Expert DP |
TP |
Weight nz |
||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
✅ |
|||||||||||||||||||||||||
❌ |
✅ |
||||||||||||||||||||||||
Async Scheduling |
✅ |
✅ |
✅ |
||||||||||||||||||||||
✅ |
✅ |
✅ |
✅ |
||||||||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
|||||||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
||||||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
|||||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
🟠1 |
✅ |
✅ |
||||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
|||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
||||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
|||||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
||||||||||||||
Flashcomm1 |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
🟠2 |
✅ |
✅ |
✅ |
✅ |
||||||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
||||||||||||
Layer Sharding |
✅ |
✅ |
✅ |
✅ |
✅ |
🟠 |
✅ |
✅ |
🟠3 |
✅ |
✅ |
✅ |
✅ |
❔ |
✅ |
||||||||||
Lmhead TP |
✅ |
✅ |
✅ |
✅ |
✅ |
❔ |
✅ |
🟠4 |
✅ |
✅ |
✅ |
✅ |
❌ |
❔ |
✅ |
✅ |
|||||||||
Mlapo |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
🟠5 |
✅ |
✅ |
✅ |
❌ |
❔ |
❌ |
✅ |
✅ |
||||||||
✅ |
✅ |
✅ |
✅ |
✅ |
🟠 |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❌ |
✅ |
✅ |
✅ |
✅ |
||||||||
Multistream Moe |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❔ |
✅ |
✅ |
✅ |
✅ |
||||||
Shared Expert DP |
✅ |
✅ |
✅ |
✅ |
✅ |
🟠1 |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❔ |
✅ |
✅ |
❔ |
✅ |
|||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❌ |
❔ |
❔ |
✅ |
❔ |
✅ |
❔ |
❌ |
❔ |
❔ |
✅ |
|||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❌ |
✅ |
✅ |
✅ |
❔ |
✅ |
❔ |
❌ |
✅ |
✅ |
❔ |
✅ |
||||
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❔ |
✅ |
✅ |
|||
TP |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
|
Weight nz |
✅ |
✅ |
✅ |
✅ |
✅ |
❔ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
✅ |
❌ |
🟠 |
✅ |
✅ |
✅ |
1 Only dcp supports dp while pcp does not support dp.
2 Falshcomm is only enabled on the prefill stage.
3 Layer sharding is only enabled on the prefill stage.
4 Lmhead TP is only enabled in the pure dp scenarios.
5 MLAPO is only supported on the decode stage.