llmcompressor.observers.mse
Classes:
-
MovingAverageMSEObserver–Compute quantization parameters by finding the optimal min/max values which minimize
MemorylessMSEObserver
Bases: Observer
Compute quantization parameters by finding the optimal min/max values which minimize the mean of quantization error squared.
Source code in src/llmcompressor/observers/mse.py
MovingAverageMSEObserver
Bases: Observer
Compute quantization parameters by finding the optimal min/max values which minimize the mean of quantization error squared, with moving average smoothing.