vllm.entrypoints.cli.benchmark.latency
BenchmarkLatencySubcommand
¶
Bases: BenchmarkSubcommandBase
The latency subcommand for vllm bench.
Source code in vllm/entrypoints/cli/benchmark/latency.py
__init__
¶
add_cli_args
¶
add_cli_args(parser: ArgumentParser) -> None
cmd_init
¶
cmd_init() -> list[CLISubcommand]