vllm.v1.executor.abstract ¶
Executor ¶
Bases: ExecutorBase
Abstract class for v1 executors, mainly define some methods for v1. For methods shared by v0 and v1, define them in ExecutorBase
Source code in vllm/v1/executor/abstract.py
24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 |
|
collective_rpc ¶
determine_available_memory ¶
execute_dummy_batch ¶
execute_model ¶
execute_model(
scheduler_output: SchedulerOutput,
non_block: bool = False,
) -> Union[ModelRunnerOutput, Future[ModelRunnerOutput]]
Source code in vllm/v1/executor/abstract.py
get_class staticmethod
¶
get_class(vllm_config: VllmConfig) -> type[Executor]
Source code in vllm/v1/executor/abstract.py
get_kv_cache_specs ¶
get_kv_cache_specs() -> list[dict[str, KVCacheSpec]]
initialize_from_config ¶
initialize_from_config(
kv_cache_configs: list[KVCacheConfig],
) -> None
Initialize the KV caches and begin the model execution loop of the underlying workers.
Source code in vllm/v1/executor/abstract.py
register_failure_callback ¶
register_failure_callback(callback: FailureCallback)
Register a function to be called if the executor enters a permanent failed state.
ExecutorWithExternalLauncher ¶
Bases: ExecutorWithExternalLauncher
, Executor