Feature × Feature

The table below shows which features are mutually exclusive and how well each feature combination is supported on Ascend hardware. It extends the corresponding table in vLLM.

The symbols used have the following meanings:

  • ✅ = Full compatibility

  • 🟠 = Partial compatibility

  • ❌ = No compatibility

  • ❔ = Unknown or TBD

Features covered (each appears as both a row and a column of the matrix):

  • ACLGraph Full_Decode_Only

  • ACLGraph Piecewise

  • Async Scheduling

  • APC

  • Chunked Prefill

  • Context Parallel

  • Cpu Binding

  • DP

  • Disaggregated Prefill

  • Eagle3

  • Eplb

  • EP

  • Flashcomm1

  • KV Cache Pool

  • Layer Sharding

  • Lmhead TP

  • Mlapo

  • mm

  • Multistream Moe

  • Shared Expert DP

  • Quantization W4A4

  • Quantization W4A8

  • Quantization W8A8

  • TP

  • Weight nz

Non-empty cells, listed by row (numbers refer to the notes below):

  • DP: 🟠 (note 1)

  • Flashcomm1: 🟠 (note 2)

  • Layer Sharding: 🟠, 🟠 (note 3)

  • Lmhead TP: 🟠 (note 4)

  • Mlapo: 🟠 (note 5)

  • mm: 🟠

  • Shared Expert DP: 🟠 (note 1)

  • Weight nz: 🟠

  • 1 Only DCP supports DP; PCP does not support DP.

  • 2 Flashcomm1 is only enabled in the prefill stage.

  • 3 Layer sharding is only enabled in the prefill stage.

  • 4 Lmhead TP is only enabled in pure DP scenarios.

  • 5 MLAPO is only supported in the decode stage.
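
The stage restrictions in notes 2, 3 and 5 can be captured as a small lookup, which may be handy when validating a deployment plan by hand. The following is a minimal sketch only: `STAGE_RESTRICTIONS` and `is_active` are hypothetical names invented for illustration and are not part of vLLM or vLLM Ascend, and the lookup covers nothing beyond what those three notes state.

```python
# Hypothetical helper, not a vLLM Ascend API.
# Encodes only notes 2, 3 and 5: the stage in which a feature is active.
STAGES = ("prefill", "decode")

STAGE_RESTRICTIONS = {
    "Flashcomm1": "prefill",      # note 2
    "Layer Sharding": "prefill",  # note 3
    "Mlapo": "decode",            # note 5
}


def is_active(feature: str, stage: str) -> bool:
    """Return True unless a note restricts `feature` to a different stage.

    A True result for a feature without a recorded restriction only means
    that the notes do not limit it; it says nothing about overall support.
    """
    if stage not in STAGES:
        raise ValueError(f"unknown stage: {stage!r}")
    required = STAGE_RESTRICTIONS.get(feature)
    return required is None or required == stage


if __name__ == "__main__":
    for feature in STAGE_RESTRICTIONS:
        active = [s for s in STAGES if is_active(feature, s)]
        print(f"{feature}: {', '.join(active)}")
```

Notes 1 and 4 describe conditions on parallelism modes rather than stages, so they are deliberately left out of this lookup.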