Skip to content

vllm_omni.model_executor.models.indextts2

Modules:

Name Description
bigvgan_cuda_graph

CUDA graph wrapper for BigVGAN vocoding in IndexTTS2 Stage 1.

configuration_indextts2
dit_cuda_graph

Per-shape CUDA graph capture for the S2Mel DiT transformer core.

gpt
indextts2_s2mel_decoder

IndexTTS2 Stage 1: S2Mel decoder + BigVGAN vocoder.

indextts2_talker

IndexTTS2 Stage 0: GPT-2 AR Talker with vLLM-native PagedAttention.

pipeline

IndexTTS2 pipeline: GPT AR talker (text → mel codes) → S2Mel + BigVGAN (mel → audio).

preprocess_utils

External model loading, audio I/O, and emotion conditioning for IndexTTS2.

prompt_utils

Prompt helpers for IndexTTS2 talker prefill.

s2mel
tokenizer
utils