Supported Models#

Text-only Language Models#

Generative Models#

Model

Supported

Note

DeepSeek v3

DeepSeek R1

DeepSeek Distill (Qwen/LLama)

Qwen3

Qwen3-Moe

Qwen2.5

QwQ-32B

LLama3.1/3.2

Internlm

MiniCPM

MiniCPM3

Pooling Models#

Model

Supported

Note

XLM-RoBERTa-based

Molmo

Multimodal Language Models#

Generative Models#

Model

Supported

Note

Qwen2-VL

Qwen2.5-VL

InternVL2.5

Qwen2-Audio