vllm.v1.kv_offload.backends.cpu
CPUBackend
Bases: Backend
Source code in vllm/v1/kv_offload/backends/cpu.py
__init__
allocate_blocks
allocate_blocks(
    block_hashes: list[BlockHash],
) -> list[BlockStatus]
free
free(block: BlockStatus)
get_load_store_spec
get_load_store_spec(
    block_hashes: Iterable[BlockHash],
    blocks: Iterable[BlockStatus],
) -> LoadStoreSpec
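
The signatures above give only the shape of the interface, so the following is a minimal, self-contained sketch of how a backend with these three methods might behave. It does not import vLLM and is not the library's implementation. ToyCPUBackend, ToyBlockStatus, ToyLoadStoreSpec, the num_blocks constructor argument, the num_free_blocks helper, the block_id field, and the free-list reuse policy are all assumptions made for illustration. Block hashes are modelled as plain bytes.

```python
from collections.abc import Iterable
from dataclasses import dataclass


@dataclass
class ToyBlockStatus:
    """Hypothetical stand-in for BlockStatus: identifies one CPU block slot."""
    block_id: int


@dataclass
class ToyLoadStoreSpec:
    """Hypothetical stand-in for LoadStoreSpec: the block IDs to transfer."""
    block_ids: list[int]


class ToyCPUBackend:
    """Toy backend that mirrors the method names documented above."""

    def __init__(self, num_blocks: int) -> None:
        # num_blocks is an assumed parameter; the real constructor
        # arguments are not shown in this reference page.
        self.num_blocks = num_blocks
        self.next_unused_id = 0          # IDs that were never handed out
        self.freed_ids: list[int] = []   # IDs returned through free()

    def num_free_blocks(self) -> int:
        # Hypothetical helper: never-used slots plus recycled slots.
        return (self.num_blocks - self.next_unused_id) + len(self.freed_ids)

    def allocate_blocks(self, block_hashes: list[bytes]) -> list[ToyBlockStatus]:
        # One block per hash; recycled IDs are preferred over fresh ones.
        if len(block_hashes) > self.num_free_blocks():
            raise ValueError("not enough free blocks")
        blocks = []
        for _ in block_hashes:
            if self.freed_ids:
                block_id = self.freed_ids.pop()
            else:
                block_id = self.next_unused_id
                self.next_unused_id += 1
            blocks.append(ToyBlockStatus(block_id))
        return blocks

    def free(self, block: ToyBlockStatus) -> None:
        # Make the slot available to a later allocate_blocks call.
        self.freed_ids.append(block.block_id)

    def get_load_store_spec(
        self,
        block_hashes: Iterable[bytes],
        blocks: Iterable[ToyBlockStatus],
    ) -> ToyLoadStoreSpec:
        # The toy spec only records which slots take part in the transfer.
        return ToyLoadStoreSpec([block.block_id for block in blocks])


backend = ToyCPUBackend(num_blocks=2)
hashes = [b"hash-0", b"hash-1"]
blocks = backend.allocate_blocks(hashes)
spec = backend.get_load_store_spec(hashes, blocks)
backend.free(blocks[0])
reused = backend.allocate_blocks([b"hash-2"])
```

In this sketch a freed slot is reused before any untouched slot, so reused[0] receives the ID that blocks[0] held. Whether CPUBackend follows the same policy is not stated on this page.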