Calibrating a Single Node¶

This procedure explains how to perform calibration for a single Intel® Gaudi® node.

Prerequisites¶

Before you start, familiarize with notes and recommendations.

Calibration procedure¶

Build and install the latest version of vLLM Hardware Plugin for Intel® Gaudi® by following the Installation procedure.
In the plugin project directory, navigate to the calibration subdirectory and install the required dependencies.
```
cd calibration
pip install -r requirements.txt
```
Download the dataset.

Run the calibrate_model.sh script with the obligatory -m, -d, and -o arguments, as in the following example:

./calibrate_model.sh -m /path/to/local/llama3.1/Meta-Llama-3.1-405B-Instruct/ -d dataset-processed.pkl -o /path/to/measurements/vllm-benchmarks/inc -b 128 -t 8 -l 4096
# OR
./calibrate_model.sh -m facebook/opt-125m -d dataset-processed.pkl -o inc/
# OR Calibrate DeepSeek models with dataset NeelNanda/pile-10k
PT_HPU_LAZY_MODE=1 ./calibrate_model.sh -m deepseek-ai/DeepSeek-R1  -d NeelNanda/pile-10k -o inc/ -t 8

Where:

-m <path/ID>: Path to a locally stored model or the model ID from the Hugging Face hub.
-d <path>: Path to the source dataset in the pickle (.pkl) format.
-o <path>: Path to the directory where you want to save the generated measurements.