This repository implements advanced techniques for diffusion models, specifically attention-based inpainting and latent space manipulation. The research and development work here led to the publication "Enhancing Conditional Image Generation with Explainable Latent Space Manipulation".
The primary goal of this project is to enhance text-to-image synthesis by integrating Stable Diffusion with novel attention-based techniques. The project employs a dynamic masking technique based on gradient-based selective attention (Grad-SAM) to achieve precise inpainting and better fidelity preservation.
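To give a rough sense of the idea (this is a minimal sketch, not the repository's exact implementation), the snippet below derives a binary mask from cross-attention maps and their gradients in a Grad-SAM-like fashion; the tensor shapes, layer aggregation, and threshold are assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def grad_sam_mask(attn_maps, attn_grads, mask_size=(64, 64), threshold=0.5):
    """Hypothetical Grad-SAM-style mask from cross-attention maps.

    attn_maps, attn_grads: lists of tensors shaped (heads, tokens, h*w),
        one pair per attention layer, captured during a backward pass.
    """
    layer_scores = []
    for attn, grad in zip(attn_maps, attn_grads):
        # Weight each attention entry by its rectified gradient, then average
        # over heads and text tokens to get a per-location relevance score.
        weighted = attn * F.relu(grad)
        score = weighted.mean(dim=(0, 1))                # (h*w,)
        hw = int(score.numel() ** 0.5)
        score = score.view(1, 1, hw, hw)
        # Upsample every layer's score map to a common resolution.
        layer_scores.append(
            F.interpolate(score, size=mask_size, mode="bilinear", align_corners=False)
        )
    relevance = torch.cat(layer_scores, dim=0).mean(dim=0).squeeze(0)  # (H, W)
    # Normalize to [0, 1] and threshold to obtain the dynamic binary mask.
    relevance = (relevance - relevance.min()) / (relevance.max() - relevance.min() + 1e-8)
    return (relevance > threshold).float()
```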
- Stable Diffusion Pipeline: Implementation of the diffusion model pipeline with enhancements for dynamic masking.
- Gradient-based Selective Attention (Grad-SAM): Integration of Grad-SAM for analyzing and manipulating cross-attention maps.
- Latent Space Manipulation: Techniques for adjusting the latent space to better align generated images with reference features.
- Performance Metrics: Evaluation using Fréchet Inception Distance (FID) and CLIP scores to assess image quality and textual alignment (see the sketch below).
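As one possible way to compute these metrics (using `torchmetrics`, which is an assumption; the repo may use a different evaluation stack), a minimal evaluation sketch might look like this:

```python
import torch
from torchmetrics.image.fid import FrechetInceptionDistance
from torchmetrics.multimodal.clip_score import CLIPScore

# Hypothetical placeholder data: real/generated images as uint8 tensors (N, 3, H, W).
real_images = torch.randint(0, 256, (8, 3, 299, 299), dtype=torch.uint8)
fake_images = torch.randint(0, 256, (8, 3, 299, 299), dtype=torch.uint8)
prompts = ["a photo of a cat sitting on a couch"] * 8

# FID: compares Inception feature statistics of real vs. generated images.
fid = FrechetInceptionDistance(feature=2048)
fid.update(real_images, real=True)
fid.update(fake_images, real=False)
print("FID:", fid.compute().item())

# CLIP score: measures alignment between generated images and their prompts.
clip = CLIPScore(model_name_or_path="openai/clip-vit-base-patch16")
print("CLIP score:", clip(fake_images, prompts).item())
```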
This research was conducted as part of the CS 7476 Advanced Computer Vision course in Spring 2024 under Professor James Hays.