vllm_omni.diffusion.models.nextstep_1_1.modeling_nextstep
NextStepModel
Bases: Module
image_head instance-attribute
image_head = FlowMatchingHead(
    input_dim=token_dim,
    cond_dim=hidden_size,
    dim=fm_head_dim,
    layers=fm_head_layers,
)
layers instance-attribute
layers = ModuleList(
    [
        LlamaDecoderLayer(config, layer_idx)
        for layer_idx in range(num_hidden_layers)
    ]
)
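The `layers` attribute follows the usual PyTorch pattern of registering a stack of per-index layers in a `ModuleList` and applying them in order. The snippet below is a minimal sketch of that pattern only. `TinyLayer` is a hypothetical stand-in, not the real `LlamaDecoderLayer`, and the sizes are made-up values chosen for illustration. It does not reproduce how `NextStepModel` actually calls its layers.

```python
import torch
from torch import nn


class TinyLayer(nn.Module):
    """Hypothetical stand-in for LlamaDecoderLayer (assumption, not the real layer)."""

    def __init__(self, hidden_size: int, layer_idx: int):
        super().__init__()
        self.layer_idx = layer_idx
        self.proj = nn.Linear(hidden_size, hidden_size)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # Residual update, so the output shape matches the input shape.
        return hidden_states + self.proj(hidden_states)


# Illustrative sizes only; the real values come from the model config.
hidden_size, num_hidden_layers = 8, 3

# Same construction shape as the attribute above: one layer per index.
layers = nn.ModuleList(
    [TinyLayer(hidden_size, layer_idx) for layer_idx in range(num_hidden_layers)]
)

hidden_states = torch.zeros(2, 5, hidden_size)  # (batch, sequence, hidden)
for layer in layers:
    hidden_states = layer(hidden_states)
```

Using `ModuleList` rather than a plain Python list registers every layer's parameters with the parent module, so they are included in `parameters()` and `state_dict()`.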
forward_model
forward_model(
    inputs_embeds: FloatTensor,
    attention_mask: Tensor | None = None,
    past_key_values: Cache | list[FloatTensor] | None = None,
    use_cache: bool | None = None,
    output_attentions: bool | None = None,
    output_hidden_states: bool | None = None,
    cache_position: LongTensor | None = None,
) -> BaseModelOutputWithPast