vllm.benchmarks.lib.ready_checker ¶
Utilities for checking endpoint readiness.
wait_for_endpoint async
¶
wait_for_endpoint(
request_func: RequestFunc,
test_input: RequestFuncInput,
session: ClientSession,
timeout_seconds: int = 600,
retry_interval: int = 5,
) -> RequestFuncOutput
Wait for an endpoint to become available before starting benchmarks.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
request_func | RequestFunc | The async request function to call | required |
test_input | RequestFuncInput | The RequestFuncInput to test with | required |
timeout_seconds | int | Maximum time to wait in seconds (default: 10 minutes) | 600 |
retry_interval | int | Time between retries in seconds (default: 5 seconds) | 5 |
Returns:
Name | Type | Description |
---|---|---|
RequestFuncOutput | RequestFuncOutput | The successful response |
Raises:
Type | Description |
---|---|
ValueError | If the endpoint doesn't become available within the timeout |