Supported Models#
Get the latest info here: https://github.com/vllm-project/vllm-ascend/issues/1608
Text-Only Language Models#
生成模型#
模型 |
Support |
注释 |
|---|---|---|
DeepSeek V3/3.1 |
✅ |
|
DeepSeek V3.2 EXP |
✅ |
|
DeepSeek R1 |
✅ |
|
DeepSeek 精炼(Qwen/LLama) |
✅ |
|
Qwen3 |
✅ |
|
Qwen3-based |
✅ |
|
Qwen3-Coder |
✅ |
|
Qwen3-Moe |
✅ |
|
Qwen3-Next |
✅ |
|
Qwen2.5 |
✅ |
|
Qwen2 |
✅ |
|
Qwen2-based |
✅ |
|
QwQ-32B |
✅ |
|
LLama2/3/3.1 |
✅ |
|
Internlm |
✅ |
|
百川 |
✅ |
|
Baichuan2 |
✅ |
|
Phi-4-mini |
✅ |
|
MiniCPM |
✅ |
|
MiniCPM3 |
✅ |
|
Ernie4.5 |
✅ |
|
Ernie4.5-Moe |
✅ |
|
Gemma-2 |
✅ |
|
Gemma-3 |
✅ |
|
Phi-3/4 |
✅ |
|
Mistral/Mistral-Instruct |
✅ |
|
GLM-4.5 |
✅ |
|
GLM-4 |
❌ |
|
GLM-4-0414 |
❌ |
|
ChatGLM |
❌ |
|
DeepSeek V2.5 |
🟡 |
需要测试 |
Mllama |
🟡 |
需要测试 |
MiniMax-Text |
🟡 |
需要测试 |
池化模型#
多模态语言模型#
生成模型#
模型 |
Support |
注释 |
|---|---|---|
Qwen2-VL |
✅ |
|
Qwen2.5-VL |
✅ |
|
Qwen3-VL |
✅ |
|
Qwen3-VL-MOE |
✅ |
|
Qwen2.5-Omni |
✅ |
|
QVQ |
✅ |
|
LLaVA 1.5/1.6 |
✅ |
|
InternVL2 |
✅ |
|
InternVL2.5 |
✅ |
|
Qwen2-Audio |
✅ |
|
Aria |
✅ |
|
LLaVA-Next |
✅ |
|
LLaVA-Next-Video |
✅ |
|
MiniCPM-V |
✅ |
|
Mistral3 |
✅ |
|
Phi-3-Vison/Phi-3.5-Vison |
✅ |
|
Gemma3 |
✅ |
|
LLama4 |
❌ |
|
LLama3.2 |
❌ |
|
Keye-VL-8B-Preview |
❌ |
|
Florence-2 |
❌ |
|
GLM-4V |
❌ |
|
InternVL2.0/2.5/3.0 |
❌ |
|
Whisper |
❌ |
|
Ultravox |
🟡 |
需要测试 |