vllm_omni.diffusion.model_loader.gguf_adapters.base ¶
GGUFAdapter ¶
Bases: ABC
Base class for model-specific GGUF adapters.
is_compatible staticmethod ¶
is_compatible(
od_config: OmniDiffusionConfig,
model: Module,
source: ComponentSource,
) -> bool
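As a rough illustration of how a static `is_compatible` check could be used to select an adapter, here is a minimal sketch. Everything in it is a stand-in: `GGUFAdapterSketch`, `ToyTransformerAdapter`, `ToyTransformer`, and `pick_adapter` are hypothetical names, the three arguments are typed as `Any` instead of the real `OmniDiffusionConfig`, `Module`, and `ComponentSource`, and the class-name matching rule is an assumption, not the library's logic.

```python
from abc import ABC
from typing import Any, Iterable, Optional, Type


class GGUFAdapterSketch(ABC):
    """Stand-in for the documented base class (not the vllm_omni code)."""

    @staticmethod
    def is_compatible(od_config: Any, model: Any, source: Any) -> bool:
        # The base sketch matches nothing; subclasses override this.
        return False


class ToyTransformer:
    """Hypothetical model class used only for this example."""


class ToyTransformerAdapter(GGUFAdapterSketch):
    @staticmethod
    def is_compatible(od_config: Any, model: Any, source: Any) -> bool:
        # Assumed rule: decide purely from the model's class name.
        return type(model).__name__ == "ToyTransformer"


def pick_adapter(
    adapters: Iterable[Type[GGUFAdapterSketch]],
    od_config: Any,
    model: Any,
    source: Any,
) -> Optional[Type[GGUFAdapterSketch]]:
    """Return the first adapter class that reports compatibility."""
    for adapter_cls in adapters:
        if adapter_cls.is_compatible(od_config, model, source):
            return adapter_cls
    return None


chosen = pick_adapter([ToyTransformerAdapter], None, ToyTransformer(), None)
print(chosen.__name__ if chosen else "no adapter")
```

Because the check is a `staticmethod`, it can run on the adapter class itself, before any adapter instance is constructed.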
MappedTensor dataclass ¶
gguf_quant_weights_iterator ¶
Iterate over the quantized weights in the model's GGUF files and convert them to torch tensors. The order of the yielded items matters: all weight types must be yielded before any weight data. Otherwise, loading weights for packed layers whose parts use different quant types can cause problems.
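The ordering constraint can be sketched as a two-pass generator. This is an illustration only, not the function's actual implementation: it works on an in-memory list of `(name, quant_type, data)` tuples instead of reading GGUF files, uses plain Python values instead of torch tensors, and the `qweight_type` / `qweight` naming and the set of unquantized types are assumptions made for the example.

```python
from typing import Any, FrozenSet, Iterable, Iterator, Tuple

# Hypothetical record shape: (parameter name, quant type label, raw data).
TensorRecord = Tuple[str, str, Any]


def two_pass_quant_iterator(
    tensors: Iterable[TensorRecord],
    unquantized_types: FrozenSet[str] = frozenset({"F32", "F16"}),
) -> Iterator[Tuple[str, Any]]:
    """Yield every quant type first, then every weight payload."""
    records = list(tensors)  # materialize so we can walk the list twice

    # Pass 1: emit only the quant type of each quantized weight.
    for name, quant_type, _ in records:
        if quant_type not in unquantized_types:
            yield name.replace("weight", "qweight_type"), quant_type

    # Pass 2: emit the weight data, renaming quantized weights.
    for name, quant_type, data in records:
        if quant_type not in unquantized_types:
            name = name.replace("weight", "qweight")
        yield name, data


records = [
    ("q_proj.weight", "Q4_K", b"a"),
    ("k_proj.weight", "Q8_0", b"b"),
    ("norm.weight", "F32", b"c"),
]
for key, value in two_pass_quant_iterator(records):
    print(key, value)
```

With this ordering, a loader that packs `q_proj` and `k_proj` into one layer has seen both quant types before the first payload arrives, so it can handle parts with different quant types without buffering data.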