vllm.distributed.kv_transfer.kv_connector.v1.mooncake.store.data ¶
Data classes for MooncakeStoreConnector.
Classes:
-
ChunkedTokenDatabase–Maps token positions to store keys and GPU memory addresses.
-
KeyMetadata–Metadata for constructing pool keys.
-
LoadSpec–Specification for loading KV cache from external store.
-
MooncakeStoreConnectorMetadata–Metadata passed from scheduler to worker.
-
PoolKey–Key for addressing KV cache blocks in the distributed store.
-
ReqMeta–Per-request metadata for store put/get operations.
-
RequestTracker–Tracks per-request state across scheduler ticks.
ChunkedTokenDatabase ¶
Maps token positions to store keys and GPU memory addresses.
Methods:
-
prepare_value–Compute memory addresses and sizes for a token range.
-
process_tokens–Process tokens and yield (start_idx, end_idx, pool_key) tuples.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
prepare_value(start, end, block_ids) ¶
Compute memory addresses and sizes for a token range.
Returns:
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
process_tokens(token_len, block_hashes, mask_num=0) ¶
Process tokens and yield (start_idx, end_idx, pool_key) tuples.
Parameters:
-
(token_len¶int) –Total number of tokens.
-
(block_hashes¶list[BlockHash]) –Block hashes computed at
hash_block_sizegranularity. Whenblock_size > hash_block_sizeconsecutive hashes are merged up to the group'sblock_sizeviaBlockHashListWithBlockSize. -
(mask_num¶int, default:0) –Number of tokens to skip from the beginning.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
KeyMetadata dataclass ¶
Metadata for constructing pool keys.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
LoadSpec dataclass ¶
Specification for loading KV cache from external store.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
MooncakeStoreConnectorMetadata ¶
Bases: KVConnectorMetadata
Metadata passed from scheduler to worker.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
PoolKey dataclass ¶
Key for addressing KV cache blocks in the distributed store.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
ReqMeta dataclass ¶
Per-request metadata for store put/get operations.
Methods:
-
from_request_tracker–Create ReqMeta from a RequestTracker.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
from_request_tracker(tracker, block_size, load_spec=None, skip_save=False, block_hashes=None, is_last_chunk=None) staticmethod ¶
Create ReqMeta from a RequestTracker.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/mooncake/store/data.py
RequestTracker dataclass ¶
Tracks per-request state across scheduler ticks.