vllm_omni.transformers_utils.configs.fish_speech ¶
Fish Speech S2 Pro config registration with transformers AutoConfig.
Registers FishSpeechConfig (model_type="fish_qwen3_omni") and sub-configs so that AutoConfig.from_pretrained("fishaudio/s2-pro") returns the correct config class.
FishSpeechConfig ¶
Bases: PretrainedConfig
Top-level config for Fish Speech S2 Pro (fish_qwen3_omni).
Wraps text_config (Slow AR) and audio_decoder_config (Fast AR).
audio_decoder_config instance-attribute ¶
audio_decoder_config = (
audio_decoder_config or FishSpeechFastARConfig()
)
sub_configs class-attribute instance-attribute ¶
sub_configs = {
"text_config": FishSpeechSlowARConfig,
"audio_decoder_config": FishSpeechFastARConfig,
}
FishSpeechFastARConfig ¶
Bases: PretrainedConfig
Fast AR (audio_decoder) config -- 4-layer residual codebook predictor.