vllm_omni.model_executor.models.utils ¶
add_prefix_to_loaded_weights ¶
Add a prefix to the names of the loaded weights.
reinit_rotary_inv_freq ¶
reinit_rotary_inv_freq(
model: Module,
base: float = 10000.0,
match: Callable[[str, Module], bool] | None = None,
) -> int
Recompute inv_freq buffers on RoPE modules in-place.
Custom RoPE classes loaded via trust_remote_code that register inv_freq with persistent=False and are not in ROPE_INIT_FUNCTIONS come out of from_pretrained with garbage buffer values (shape and dtype correct, contents not). cos() / sin() of those values produce NaN, so the first forward emits NaN logits. Mainstream HF RoPE classes avoid this via _rope_init_function framework integration.
Recomputes 1.0 / base^(arange(0, head_dim, 2) / head_dim). head_dim is inferred from 2 * inv_freq.numel(). Pass match to override the default selector (modules whose qualified name ends in "rotary_emb" and that expose a 1-D float inv_freq tensor). Returns the number of buffers re-initialised.