🐎
Performance Benchmark
switch to 8B
Passed in 1h 54m and blocked
bootstrap
🚀 Ready to compare vLLM against alternatives? This will take 4 hours.
A100 trt llama-8B
Collect the results
🚀 Check the results!
Wait for container to be ready
A100
Description
This file contains the download link for the benchmarking results.
Please download the visualization scripts in the post.
Results reproduction
- Find the Docker image used in the benchmarking pipeline.
- Start a container from that image, and inside the container:
  - Download nightly-benchmarks.zip.
  - In the same folder, run the following code:
export HF_TOKEN=<your HF token>
apt update
apt install -y git
unzip nightly-benchmarks.zip
VLLM_SOURCE_CODE_LOC=./ bash .buildkite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh
The results will be inside ./benchmarks/results.
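Because the full run is long, it may help to check the inputs named in the steps above before starting. The following is a minimal sketch, not part of the vLLM benchmark scripts: the `preflight` function and its exit codes are hypothetical, and it only assumes that `HF_TOKEN` must be set and that `nightly-benchmarks.zip` sits in the current folder.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight helper (an assumption for illustration, not a vLLM script).
# It checks the two inputs the reproduction steps rely on.
preflight() {
  # Path to the downloaded archive; defaults to the name used in the steps.
  local zip_path="${1:-nightly-benchmarks.zip}"

  # The steps export HF_TOKEN first, so treat an empty value as an error.
  if [ -z "${HF_TOKEN:-}" ]; then
    echo "HF_TOKEN is not set" >&2
    return 1
  fi

  # The archive must exist before it can be unzipped.
  if [ ! -f "$zip_path" ]; then
    echo "missing archive: $zip_path" >&2
    return 2
  fi

  echo "ok"
}
```

With the function defined in the current shell, `preflight && unzip nightly-benchmarks.zip` would stop early if either input is missing, rather than failing partway through the benchmark script.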
bootstrap
curl -sSL https://raw.githubusercontent.com/vllm-project/buildkite-ci/main/scripts/kickoff-benchmark.sh | bash
Waited 41s
Ran in 10s

Wait for container to be ready
A100
Total Job Run Time: 3m 31s