vllm.benchmarks

Modules:

Name                   Description
datasets               This module defines a framework for sampling benchmark requests from various
endpoint_request_func  The request function for API endpoints.
latency                Benchmark the latency of processing a single batch of requests.
serve                  Benchmark online serving throughput.
throughput             Benchmark offline inference throughput.
utils