vllm.v1.kv_offload.mediums ¶
BlockIDsLoadStoreSpec ¶
Bases: LoadStoreSpec
, ABC
Spec for loading/storing KV blocks from given block numbers.
Source code in vllm/v1/kv_offload/mediums.py
__init__ ¶
CPULoadStoreSpec ¶
Bases: BlockIDsLoadStoreSpec
Spec for loading/storing a KV block to CPU memory.
Source code in vllm/v1/kv_offload/mediums.py
GPULoadStoreSpec ¶
Bases: BlockIDsLoadStoreSpec
Spec for loading/storing a KV block to GPU memory.