Supported Models#
Text-only Language Models#
Generative Models#
Model |
Supported |
Note |
|---|---|---|
DeepSeek v3 |
✅ |
|
DeepSeek R1 |
✅ |
|
DeepSeek Distill (Qwen/LLama) |
✅ |
|
Qwen3 |
✅ |
|
Qwen3-Moe |
✅ |
|
Qwen2.5 |
✅ |
|
QwQ-32B |
✅ |
|
LLama3.1/3.2 |
✅ |
|
Internlm |
✅ |
|
MiniCPM |
✅ |
|
MiniCPM3 |
✅ |
Pooling Models#
Model |
Supported |
Note |
|---|---|---|
XLM-RoBERTa-based |
✅ |
|
Molmo |
✅ |
Multimodal Language Models#
Generative Models#
Model |
Supported |
Note |
|---|---|---|
Qwen2-VL |
✅ |
|
Qwen2.5-VL |
✅ |
|
InternVL2.5 |
✅ |
|
Qwen2-Audio |
✅ |