vllm.pooling_params
PoolingParams
Bases: Struct
API parameters for pooling models.
Attributes:
| Name | Type | Description |
|---|---|---|
| `dimensions` | `Optional[int]` | Reduce the dimensions of embeddings if the model supports matryoshka representation. |
Source code in vllm/pooling_params.py
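To make the `dimensions` attribute more concrete, here is a minimal sketch of what reducing a matryoshka embedding usually means: keep the leading components, then L2-normalize the result. The helper `truncate_embedding` is hypothetical and written only for illustration. It is not part of vLLM, and the library's actual implementation may differ.

```python
import math
from typing import Optional, Sequence


def truncate_embedding(
    embedding: Sequence[float], dimensions: Optional[int] = None
) -> list[float]:
    """Hypothetical helper: keep the first `dimensions` components, then L2-normalize."""
    if dimensions is None:
        # No reduction requested: return the embedding unchanged.
        return list(embedding)
    if dimensions < 1 or dimensions > len(embedding):
        raise ValueError("dimensions must be between 1 and len(embedding)")
    head = list(embedding[:dimensions])
    norm = math.sqrt(sum(x * x for x in head))
    # Guard against a zero vector to avoid division by zero.
    return head if norm == 0.0 else [x / norm for x in head]


full = [3.0, 4.0, 12.0]
reduced = truncate_embedding(full, dimensions=2)
print(len(reduced))
```

Truncation like this is only meaningful for models trained with matryoshka representation learning, which is why the attribute description makes it conditional on model support.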
requires_token_ids
class-attribute
instance-attribute
requires_token_ids: bool = False
Internal use only.
__post_init__
verify
verify(
task: PoolingTask, model_config: ModelConfig
) -> None