[New Publication] Demystifying Power-of-Two Quantization: Benchmarking Inference on AVX and RVV
Demystifying Power-of-Two Quantization: Benchmarking Inference on AVX and RVV
The Second National Workshop on Scientific HPC in the pre-Exascale era, ITADATA’25, Turin, Italy
(Github Repo)
- Saleh Jamali Golzar, University of Salerno
- Giuseppe Pagano, University of Salerno
- Biagio Cosenza, University of Salerno