You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
I've been trying to run AudioLM and training it using birdsongs to try and see if it can produce good-quality synthetically generated audio data.
I've been using bird songs from the xeno-canto dataset https://www.kaggle.com/datasets/rohanrao/xeno-canto-bird-recordings-extended-a-m that I've preprocessed by converting to single-channel at 22050Hz sample rate and trimming each recording down to 3 seconds. I'm trying to train the soundstream now. At the start it's just noise but I'm still not getting anything after about 20000 steps.
I've read #54 to get some advice. Should my sample_stepcount.ema.flac sound like birdsong when it's properly trained?
Using an A6000 GPU as well.
Thanks
The text was updated successfully, but these errors were encountered:
haydensflee
changed the title
Training AudioLM using
Soundstream training using birdsongs. Help please
Feb 12, 2024
haydensflee
changed the title
Soundstream training using birdsongs. Help please
Soundstream training using birdsongs. Any guidance appreciated!
Feb 12, 2024
Hello,
I've been trying to run AudioLM and training it using birdsongs to try and see if it can produce good-quality synthetically generated audio data.
I've been using bird songs from the xeno-canto dataset https://www.kaggle.com/datasets/rohanrao/xeno-canto-bird-recordings-extended-a-m that I've preprocessed by converting to single-channel at 22050Hz sample rate and trimming each recording down to 3 seconds. I'm trying to train the soundstream now. At the start it's just noise but I'm still not getting anything after about 20000 steps.
I've read #54 to get some advice. Should my sample_stepcount.ema.flac sound like birdsong when it's properly trained?
Using an A6000 GPU as well.
Thanks
The text was updated successfully, but these errors were encountered: