Efficient quantization for CPU-based diffusion models

Hanees, A. L.; Elango, E.

Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/6982

Title:	Efficient quantization for CPU-based diffusion models
Authors:	Hanees, A. L. Elango, E.
Keywords:	Quantization Diffusion Models U-Net Architecture
Issue Date:	14-Dec-2023
Publisher:	Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.
Citation:	12th Annual Science Research Sessions 2023 (ASRS-2023) Conference Proceedings of "Exploration Towards Green Tech Horizons”. 14th December 2023. Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai, Sri Lanka. pp. 36.
Abstract:	The utilization of diffusion models to create visuals from textual descriptions has grown in popularity. However, the significant requirement for computing power still poses a significant obstacle and adds time to procedures. Diffusion models provide difficulties when quantization, a method used to reduce deep learning models for increased efficiency, is used. Comparing to other model types, these models are noticeably more susceptible to quantization, which could lead to deterioration in image quality. In this research, we present a unique method that uses distillation along with quantization aware training to measure the diffusion models. Our findings demonstrate that quantized models can provide inference efficiency on CPUs while retaining great image quality. At https://github.com/intel/intel-extension-for-transformers, the source is accessible to the general public.
URI:	http://ir.lib.seu.ac.lk/handle/123456789/6982
ISBN:	978-955-627-015-0
Appears in Collections:	12th Annual Science Research Session

Files in This Item:

File	Description	Size	Format
5-EFFICIENT QUANTIZATION.pdf		27.31 kB	Adobe PDF	View/Open

Show full item record