vllm_omni.model_executor.models.higgs_audio_v3
vllm-omni integration for boson-ai's higgs-audio v3 (two-stage TTS).
Modules:
| Name | Description |
|---|---|
| configuration_higgs_audio_v3 | Configuration class for higgs-audio v3 (HiggsMultimodalQwen3) in vllm-omni. |
| higgs_audio_v3_code2wav | Stage 1 (codec decoder) for higgs-audio v3. |
| higgs_audio_v3_talker | Stage 0 (talker) for higgs-audio v3 (Qwen3 backbone, fused multi-codebook). |
| higgs_audio_v3_tokenizer | Prompt builder for higgs-audio v3 TTS. |
| pipeline | higgs-audio v3 pipeline: Talker (text -> 8-codebook codec) -> Code2Wav (codec -> 24 kHz PCM). |
| ref_audio_cache | Bounded LRU cache for higgs-audio v3 reference-audio codec encoding. |
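The ref_audio_cache entry describes a bounded LRU cache for reference-audio codec encodings. The sketch below illustrates that general technique only; it is not the vllm-omni implementation. The class name BoundedLRUCache, the max_entries parameter, and the get_or_compute method are hypothetical names chosen for illustration, and the cached values stand in for whatever the real encoder returns.

```python
from collections import OrderedDict
from typing import Callable, Generic, Hashable, TypeVar

K = TypeVar("K", bound=Hashable)
V = TypeVar("V")


class BoundedLRUCache(Generic[K, V]):
    """Hypothetical bounded LRU cache (illustration, not the library's class)."""

    def __init__(self, max_entries: int) -> None:
        if max_entries <= 0:
            raise ValueError("max_entries must be positive")
        self._max_entries = max_entries
        # Insertion order doubles as recency order: oldest entry first.
        self._entries: "OrderedDict[K, V]" = OrderedDict()

    def get_or_compute(self, key: K, compute: Callable[[], V]) -> V:
        """Return the cached value for key, computing and storing it on a miss."""
        if key in self._entries:
            # Cache hit: mark the entry as most recently used.
            self._entries.move_to_end(key)
            return self._entries[key]
        value = compute()
        self._entries[key] = value
        if len(self._entries) > self._max_entries:
            # Over capacity: evict the least recently used entry.
            self._entries.popitem(last=False)
        return value

    def __contains__(self, key: object) -> bool:
        return key in self._entries

    def __len__(self) -> int:
        return len(self._entries)


# Usage: the lambdas stand in for an expensive codec-encoding call.
cache = BoundedLRUCache(max_entries=2)
cache.get_or_compute("speaker_a.wav", lambda: [1, 2, 3])
cache.get_or_compute("speaker_b.wav", lambda: [4, 5, 6])
cache.get_or_compute("speaker_a.wav", lambda: [])  # hit, nothing recomputed
cache.get_or_compute("speaker_c.wav", lambda: [7, 8, 9])  # evicts speaker_b.wav
```

Bounding the cache keeps memory use predictable when many distinct reference clips are seen, while repeated requests with the same clip skip the encoding step.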
HiggsAudioV3Config
Bases: PretrainedConfig
Typed config for higgs-audio v3 (HiggsMultimodalQwen3).
from_pretrained() automatically resolves the token IDs for <|tts|>, <|text|>, and <|audio|>, along with eos_token_id, from the checkpoint tokenizer.
tie_modality_embeddings instance-attribute
tie_modality_embeddings = bool(
get("tie_word_embeddings", True)
)
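The assignment above reads as a lookup with a default of True, so the flag follows tie_word_embeddings when the checkpoint config sets it and is enabled otherwise. A minimal sketch of that behavior, assuming the get call is a dict-style lookup on the raw config values (the snippet does not show its receiver); resolve_tie_flag and raw_config are hypothetical names:

```python
from typing import Any, Mapping


def resolve_tie_flag(raw_config: Mapping[str, Any]) -> bool:
    """Mirror the documented default: tie unless the config says otherwise."""
    # Assumption: the raw config behaves like a mapping with .get().
    return bool(raw_config.get("tie_word_embeddings", True))


# Key absent: the default of True applies.
default_flag = resolve_tie_flag({})
# Key present: the checkpoint's own setting wins.
explicit_flag = resolve_tie_flag({"tie_word_embeddings": False})
```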
from_pretrained classmethod
from_pretrained(
pretrained_model_name_or_path: str, **kwargs: Any
) -> HiggsAudioV3Config
Load config and resolve special token IDs from the checkpoint tokenizer.
Passes the original pretrained_model_name_or_path (a local directory or an HF repo ID) directly to AutoTokenizer.from_pretrained(), so that cache hits, downloads, and local paths are all handled uniformly. Raises an error if the tokenizer is missing any of the required special tokens.
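The resolution step can be sketched as follows. This is an illustration under stated assumptions, not the code in HiggsAudioV3Config: resolve_special_token_ids, TokenizerLike, and StubTokenizer are hypothetical names, the choice of ValueError is an assumption (the documentation does not name the exception type), and the stub replaces a real tokenizer so that the example runs without downloading a checkpoint. The convert_tokens_to_ids method, unk_token_id, and eos_token_id mirror the attributes of Hugging Face tokenizers.

```python
from typing import Dict, Optional, Protocol

REQUIRED_SPECIALS = ("<|tts|>", "<|text|>", "<|audio|>")


class TokenizerLike(Protocol):
    """Minimal tokenizer surface this sketch relies on."""

    unk_token_id: Optional[int]
    eos_token_id: Optional[int]

    def convert_tokens_to_ids(self, token: str) -> Optional[int]: ...


def resolve_special_token_ids(tokenizer: TokenizerLike) -> Dict[str, int]:
    """Map each required special token to its ID, failing loudly if any is absent."""
    resolved: Dict[str, int] = {}
    for token in REQUIRED_SPECIALS:
        token_id = tokenizer.convert_tokens_to_ids(token)
        # An unknown token comes back as None or as the unk ID.
        if token_id is None or token_id == tokenizer.unk_token_id:
            raise ValueError(f"tokenizer is missing required special token {token!r}")
        resolved[token] = token_id
    if tokenizer.eos_token_id is None:
        raise ValueError("tokenizer does not define eos_token_id")
    resolved["eos_token_id"] = tokenizer.eos_token_id
    return resolved


class StubTokenizer:
    """Stand-in for a real tokenizer, used only to exercise the sketch."""

    def __init__(self, vocab: Dict[str, int], eos_token_id: Optional[int],
                 unk_token_id: Optional[int] = None) -> None:
        self._vocab = vocab
        self.eos_token_id = eos_token_id
        self.unk_token_id = unk_token_id

    def convert_tokens_to_ids(self, token: str) -> Optional[int]:
        return self._vocab.get(token, self.unk_token_id)


# The IDs below are made up for the stub; real values come from the checkpoint.
stub = StubTokenizer({"<|tts|>": 10, "<|text|>": 11, "<|audio|>": 12}, eos_token_id=2)
special_ids = resolve_special_token_ids(stub)
```

Failing at load time, rather than at the first generation request, surfaces a mismatched tokenizer before any stage of the pipeline starts.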