Memory Profiling#
Module Contents#
- class vllm.multimodal.profiling.ProcessorInputs(prompt_text: str, mm_data: ~collections.abc.Mapping[str, ~typing.Any | list[typing.Any]], hf_processor_mm_kwargs: ~collections.abc.Mapping[str, object] = <factory>)[source]#
Represents the keyword arguments to
vllm.multimodal.processing.BaseMultiModalProcessor.apply()
.
- class vllm.multimodal.profiling.DummyEncoderData(prompt_token_ids: list[int])[source]#
Dummy data used for profiling.
- class vllm.multimodal.profiling.DummyDecoderData(prompt_token_ids: list[int], multi_modal_data: MultiModalKwargs, multi_modal_placeholders: Mapping[str, Sequence[PlaceholderRange]])[source]#
Dummy data used for profiling.
- multi_modal_data: MultiModalKwargs[source]#
Alias for field number 1
- class vllm.multimodal.profiling.BaseDummyInputsBuilder(info: _I)[source]#
Abstract base class that constructs the dummy data to profile multi-modal models.
- get_dummy_text(mm_counts: Mapping[str, int]) str [source]#
Build the text input corresponding to
mm_counts
.
- class vllm.multimodal.profiling.MultiModalProfiler(processor: BaseMultiModalProcessor[_I])[source]#
Contains code for running memory profiling for multi-modal models.