vllm_omni.diffusion.models.gr00t.modeling ¶
Modules:
| Name | Description |
|---|---|
gr00t_n1d7 | |
modules | |
processing_gr00t_n1d7 | |
Gr00tN1d7 ¶
Bases: PreTrainedModel
Gr00tN1d7: VLA model with Cosmos-Reason2-2B (Qwen3-VL) backbone.
backbone instance-attribute ¶
backbone = backbone_cls(
model_name=config.model_name,
select_layer=config.select_layer,
backbone_embedding_dim=config.backbone_embedding_dim,
load_bf16=config.load_bf16,
transformers_loading_kwargs=transformers_loading_kwargs,
)
collator instance-attribute ¶
collator = Gr00tN1d7DataCollator(
model_name=config.model_name,
model_type=config.backbone_model_type,
transformers_loading_kwargs=transformers_loading_kwargs,
)
supports_gradient_checkpointing class-attribute instance-attribute ¶
forward ¶
forward(inputs: dict) -> BatchFeature
Forward pass through the complete model.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
inputs | dict | Dictionary containing: - Action inputs (state, action, embodiment_id, etc.) | required |
Returns:
| Type | Description |
|---|---|
BatchFeature | BatchFeature containing loss and other outputs |
get_action ¶
Generate actions using the complete model.
Gr00tN1d7Processor ¶
Bases: ProcessorMixin
apply_sincos_state_encoding instance-attribute ¶
data_collator_class class-attribute instance-attribute ¶
data_collator_class = Gr00tN1d7DataCollator
embodiment_id_mapping instance-attribute ¶
embodiment_id_mapping = (
embodiment_id_mapping
or EMBODIMENT_TAG_TO_PROJECTOR_INDEX
)
eval_image_transform instance-attribute ¶
processor instance-attribute ¶
processor = build_processor(
model_name, transformers_loading_kwargs
)
state_action_processor instance-attribute ¶
state_action_processor = StateActionProcessor(
modality_configs=modality_configs,
statistics=statistics,
use_percentiles=use_percentiles,
clip_outliers=clip_outliers,
apply_sincos_state_encoding=apply_sincos_state_encoding,
use_relative_action=use_relative_action,
)
statistics instance-attribute ¶
decode_action ¶
decode_action(
action: ndarray,
embodiment_tag: EmbodimentTag,
state: dict[str, ndarray] | None = None,
)
Undo action normalization and convert relative actions to absolute.