vllm.model_executor.models.transformers.causal ¶
Transformers modeling backend mixin for causal language models.
Classes:
CausalMixin ¶
Bases: VllmModelForTextGeneration
Methods:
-
load_weights–A thin wrapper around
Base.load_weightsto handle the lm_head bias.
Source code in vllm/model_executor/models/transformers/causal.py
load_weights(weights) ¶
A thin wrapper around Base.load_weights to handle the lm_head bias.