Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms

Repository for the paper "Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms". Forked from the StyleGAN3 repository (https://github.com/NVlabs/stylegan3).

File structure

data_processing/: Utilities for converting the audio dataset to mel-spectrograms that StyleGAN2 can be trained on.
audio_sample_generation/: Utilities for generating audio samples or performing interpolation using a trained model.
fad/: Utilities for computing FAD along with pre-computed statistics of the data.
stylegan3/: Copy of the original StyleGAN3 repository. We use it because it provides additional convenience functions while still allowing training with the StyleGAN2-ADA setup.
audio_inversion_experiments.ipynb: Mel-spectrogram round-tripping experiments showing the effect of various parameters on audio inversion quality.
train.sh: Training script for StyleGAN.
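
To make the FAD entry above more concrete, here is a minimal sketch of the Fréchet distance between two Gaussians, the quantity that FAD (Fréchet Audio Distance) is built on. It is not taken from the fad/ utilities, which may work differently. FAD is typically computed on embeddings from a pretrained audio model, and that step is left out here. The function names `embedding_statistics` and `frechet_distance` are hypothetical and chosen only for illustration.

```python
import numpy as np


def embedding_statistics(embeddings):
    """Return the mean vector and covariance matrix of an (n_samples, dim) array."""
    emb = np.asarray(embeddings, dtype=np.float64)
    return emb.mean(axis=0), np.cov(emb, rowvar=False)


def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Frechet distance between the Gaussians N(mu1, sigma1) and N(mu2, sigma2).

    d^2 = ||mu1 - mu2||^2 + Tr(sigma1) + Tr(sigma2) - 2 * Tr(sqrt(sigma1 @ sigma2))
    """
    mu1 = np.asarray(mu1, dtype=np.float64)
    mu2 = np.asarray(mu2, dtype=np.float64)
    sigma1 = np.asarray(sigma1, dtype=np.float64)
    sigma2 = np.asarray(sigma2, dtype=np.float64)

    diff = mu1 - mu2

    # For positive semi-definite covariances, the eigenvalues of sigma1 @ sigma2
    # are real and non-negative, so Tr(sqrt(sigma1 @ sigma2)) equals the sum of
    # their square roots. Clipping removes small negative values from round-off.
    eigvals = np.linalg.eigvals(sigma1 @ sigma2)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    return float(diff @ diff + np.trace(sigma1) + np.trace(sigma2) - 2.0 * trace_sqrt)
```

Under these assumptions, one would call `embedding_statistics` once on embeddings of real audio and once on embeddings of generated audio, then pass both pairs of statistics to `frechet_distance`. Lower values indicate that the two embedding distributions are closer.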

marcojira/stylegan3-melspectrograms

Folders and files

Latest commit

History

Repository files navigation

Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms

File structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages