vllm_omni.model_executor.models.dynin_omni ¶
Modules:
| Name | Description |
|---|---|
dynin_omni | |
dynin_omni_common | |
dynin_omni_token2audio | |
dynin_omni_token2image | |
dynin_omni_token2text | |
pipeline | Dynin-Omni pipeline topology (frozen). |
DyninOmniForConditionalGeneration ¶
Bases: Module, SupportsMultiModal
STAGE_ALIAS class-attribute instance-attribute ¶
STAGE_ALIAS = {
"tokenizer": "token2text",
"token2token": "token2text",
"detok_text": "token2text",
"token2img": "token2image",
"token2wav": "token2audio",
"token2speech": "token2audio",
}
STAGE_IMPL class-attribute instance-attribute ¶
STAGE_IMPL = {
"token2text": (
".dynin_omni_token2text",
"DyninOmniToken2Text",
),
"token2image": (
".dynin_omni_token2image",
"DyninOmniToken2Image",
),
"token2audio": (
".dynin_omni_token2audio",
"DyninOmniToken2Audio",
),
}
have_multimodal_outputs instance-attribute ¶
have_multimodal_outputs = getattr(
impl, "have_multimodal_outputs", True
)
requires_raw_input_tokens instance-attribute ¶
requires_raw_input_tokens = getattr(
impl, "requires_raw_input_tokens", True
)
supports_multimodal_raw_input_only class-attribute instance-attribute ¶
compute_logits ¶
compute_logits(
hidden_states: Tensor | OmniOutput,
sampling_metadata: Any = None,
) -> Tensor | None
embed_input_ids ¶
embed_input_ids(
input_ids: Tensor,
multimodal_embeddings: Any = None,
is_multimodal: Tensor | None = None,
**kwargs: Any,
) -> Tensor
forward ¶
forward(
input_ids: Tensor | None = None,
positions: Tensor | None = None,
intermediate_tensors: IntermediateTensors | None = None,
inputs_embeds: Tensor | None = None,
**kwargs: Any,
) -> OmniOutput
DyninOmniToken2Audio ¶
Bases: DyninOmniStageBase
Stage-3: token detokenization to speech (or pass-through).
forward ¶
forward(
input_ids: Tensor | None = None,
positions: Tensor | None = None,
intermediate_tensors: IntermediateTensors | None = None,
inputs_embeds: Tensor | None = None,
**kwargs: Any,
) -> OmniOutput
DyninOmniToken2Image ¶
Bases: DyninOmniStageBase
Stage-2: token detokenization to image (or pass-through).
forward ¶
forward(
input_ids: Tensor | None = None,
positions: Tensor | None = None,
intermediate_tensors: IntermediateTensors | None = None,
inputs_embeds: Tensor | None = None,
**kwargs: Any,
) -> OmniOutput
DyninOmniToken2Text ¶
Bases: DyninOmniStageBase
Stage-1: DYNIN generation + text detokenization or pass-through.
hidden_size instance-attribute ¶
hidden_size = resolve_hidden_size(
vllm_config=vllm_config, model=model
)
model instance-attribute ¶
forward ¶
forward(
input_ids: Tensor | None = None,
positions: Tensor | None = None,
intermediate_tensors: IntermediateTensors | None = None,
inputs_embeds: Tensor | None = None,
**kwargs: Any,
) -> OmniOutput