vllm_omni.transformers_utils.configs.higgs_audio_v3 ¶
Configuration class for higgs-audio v3 (HiggsMultimodalQwen3) in vllm-omni.
HiggsAudioV3Config.from_pretrained(model_path) returns a config with tts_token_id, text_token_id, audio_continuation_id, and eos_token_id already resolved from the checkpoint tokenizer. If the tokenizer is unavailable or missing required specials, the load raises.
HiggsAudioV3Config ¶
Bases: PretrainedConfig
Typed config for higgs-audio v3 (HiggsMultimodalQwen3).
from_pretrained() automatically resolves <|tts|>, <|text|>, <|audio|> and eos_token_id from the checkpoint tokenizer.
audio_hidden_size instance-attribute ¶
audio_hidden_size = int(
audio_encoder_config.get(
"out_dim", self.text_config.hidden_size
)
)
codebook_size instance-attribute ¶
codebook_size = int(
audio_encoder_config.get("vocab_size", codebook_size)
)
enable_flashinfer_api_unwrap instance-attribute ¶
enable_flashinfer_api_unwrap = bool(
enable_flashinfer_api_unwrap
)
num_codebooks instance-attribute ¶
num_codebooks = int(
audio_encoder_config.get("num_codebooks", num_codebooks)
)
tie_modality_embeddings instance-attribute ¶
tie_modality_embeddings = bool(
audio_encoder_config.get("tie_word_embeddings", True)
)
from_pretrained classmethod ¶
from_pretrained(
pretrained_model_name_or_path: str, **kwargs: Any
) -> HiggsAudioV3Config
Load config and resolve special token IDs from the checkpoint tokenizer.
Passes the original pretrained_model_name_or_path (local dir or HF repo ID) directly to AutoTokenizer.from_pretrained() so it can handle cache hits, downloads, and local paths uniformly. Raises if the tokenizer is missing required specials.