Stable Diffusion

Large Self Supervised Learning (SSL) models have been very popular recently. One popular SSL method is Stable Diffusion, where this method uses a generator that adds Gaussian Noises and a discriminator that trys to remove this noise. This new latent text-to-image model runs with a 860M UNet and a 120 text encoder.

GitHub - CompVis/stable-diffusion: A latent text-to-image diffusion model

High-Resolution Image Synthesis with Latent Diffusion Models

The stable diffusion model, although demonstrated a great capability on image generation, is expensive to run. This project will look at how we can use Knowledge Distillation techniques to reduce the runtime of these models through pruning and fixed-point quantization.

Project overview

In particular, this project will look at:

Knowledge Distillation

General KD

Distilling the Knowledge in a Neural Network

Fitnets: Hints for thin deep nets