-
Notifications
You must be signed in to change notification settings - Fork 486
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
garbled audio #161
Comments
same here, any solutions? |
same here, mini model works, large model generates just noise. |
same here too, but with the mini model, my |
This issue seems important, the large model just does not work. I suggest to use a test suite, and run the output through whisper or something. |
yeah - I use that - but the amount of failures is a LOT. almost 95% fail or generate garbled audio So - there must be an underlying issue... just not sure where as followed the instructions |
Hey everyone, thanks for opening the issue and helping on identifying where it blocks. It actually comes from the audio encoder. While I still have to figure out how to actually fix it while being backward compatible for person using it with the previous repo version, you can find fixed weights for the current version by using: |
Amazing @ylacombe - this fixed things perfectly! |
Hi
Not sure why, but when I run either of the example code - I get garbled audio - when I use the Mini Model I get the "correct" audio
I have the following setup
Ubuntu 22.04
Python 3.11.4
nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2021 NVIDIA Corporation
Built on Thu_Nov_18_09:45:30_PST_2021
Cuda compilation tools, release 11.5, V11.5.119
Build cuda_11.5.r11.5/compiler.30672275_0
python3 -m pip show torch
Name: torch
Version: 2.5.1
Summary: Tensors and Dynamic neural networks in Python with strong GPU acceleration
Home-page: https://pytorch.org/
Author: PyTorch Team
Author-email: [email protected]
License: BSD-3-Clause
Location: /data/parler-tts/.venv/lib/python3.11/site-packages
Requires: filelock, fsspec, jinja2, networkx, nvidia-cublas-cu12, nvidia-cuda-cupti-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-runtime-cu12, nvidia-cudnn-cu12, nvidia-cufft-cu12, nvidia-curand-cu12, nvidia-cusolver-cu12, nvidia-cusparse-cu12, nvidia-nccl-cu12, nvidia-nvjitlink-cu12, nvidia-nvtx-cu12, sympy, triton, typing-extensions
Required-by: descript-audio-codec, descript-audiotools, julius, parler_tts, torch-stoi, torchaudio
Thanks
The text was updated successfully, but these errors were encountered: