Find the docker we use in benchmarking pipeline
Deploy the docker, and inside the docker:
- Download nightly-benchmarks.zip.
- In the same folder, run the following code

export HF_TOKEN=<your HF token>
apt update
apt install -y git
unzip nightly-benchmarks.zip
VLLM_SOURCE_CODE_LOC=./ bash .buildkite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh

And the results will be inside ./benchmarks/results.

bootstrapcurl -sSL https://raw.githubusercontent.com/vllm-project/buildkite-ci/main/scripts/kickoff-benchmark.sh | bash

Ran in 12s

Kuntai Du unblocked 🚀 Ready for comparing vllm against alternatives? This will take 4 hours.
Wed 4th Sep 2024 at 6:38 AM

A100 vllm latest main

Ran in 1h 8m

A100 sglang benchmark

Ran in 1h 9m

A100 lmdeploy benchmark

Ran in 1h 7m

A100 trt llama-8B

Ran in 31m 36s

A100 trt llama-70B

Ran in 56m 58s

Collect the results

Ran in 20s

Wait for container to be ready

A100

Total Job Run Time: 4h 54m