Feature Tutorials#
This section provides tutorials for different features of vLLM Ascend.
Feature Tutorials
- PD-Colocated with Mooncake Multi-Instance
- Prefill-Decode Disaggregation (Qwen2.5-VL)
- Prefill-Decode Disaggregation (Deepseek)
- Long-Sequence Context Parallel (Qwen3-235B-A22B)
- Long-Sequence Context Parallel (Deepseek)
- Dynamic Chunked Pipeline Parallel (DeepSeek-V3.1)
- Suffix Speculative Decoding
- Ray Distributed (Qwen3-235B-A22B)