Why is there no sound sometimes when using the 5_HP-Karaoke-UVR.pth model? #98

Lixi20 · 2024-08-15T03:03:27Z

I found that it may be related to the parameters, please help me

from pathlib import Path

from audio_separator.separator import Separator

def main():
    input_audio = "/home/geek/workspace/TestSample/Tom_Holland.wav"
    output_dir = "/home/geek/workspace/TestSample/"
    model_file_dir = "/home/geek/.cache/audio-separator-models"
    vocal_separation_model_filename = "UVR-MDX-NET-Voc_FT.onnx"
    de_reverb_model_filename = "5_HP-Karaoke-UVR.pth"
    output_format = "wav"

    vocal_separation_separator = Separator(
        model_file_dir=model_file_dir,
        output_format=output_format,
        output_dir=output_dir,
        output_single_stem="Vocals",
        sample_rate=44100,
        mdx_params={"hop_length": 1024, "segment_size": 256, "overlap": 0.25, "batch_size": 16, "enable_denoise": False},
        vr_params={"batch_size": 16, "window_size": 320, "aggression": 10, "enable_tta": True,
                   "enable_post_process": False, "post_process_threshold": 0.2, "high_end_process": False},
    )

    vocal_separation_separator.load_model(model_filename=vocal_separation_model_filename)

    vocal_separation_output_file_path = vocal_separation_separator.separate(input_audio)[0]
    print(vocal_separation_output_file_path)

    vocal_separation_separator.load_model(model_filename=de_reverb_model_filename)

    de_reverb_output_file_path = \
    vocal_separation_separator.separate(Path(output_dir) / vocal_separation_output_file_path)[0]
    print(de_reverb_output_file_path)


if __name__ == "__main__":
    main()

When I set the parameters as follows：
mdx_params={"hop_length": 1024, "segment_size": 256, "overlap": 0.25, "batch_size": 16, "enable_denoise": False}, vr_params={"batch_size": 16, "window_size": 320, "aggression": 10, "enable_tta": False, "enable_post_process": False, "post_process_threshold": 0.2, "high_end_process": False},
the de_reverb_output_file_path is no sound：
https://github.com/Lixi20/audio_separator_test/blob/main/TestSample/enable_tta_false/Tom_Holland_(Vocals)UVR-MDX-NET-Voc_FT(Vocals)_5_HP-Karaoke-UVR.wav

And then I set the parameters as follows：
mdx_params={"hop_length": 1024, "segment_size": 256, "overlap": 0.25, "batch_size": 16, "enable_denoise": False}, vr_params={"batch_size": 16, "window_size": 320, "aggression": 10, "enable_tta": True, "enable_post_process": False, "post_process_threshold": 0.2, "high_end_process": False},

the de_reverb_output_file_path has sound：
https://github.com/Lixi20/audio_separator_test/blob/main/TestSample/enable_tta_true/Tom_Holland_(Vocals)UVR-MDX-NET-Voc_FT(Vocals)_5_HP-Karaoke-UVR.wav

I didn't set the enable_tta parameter before, and there was sound, but now it doesn't work. Why is this?

The text was updated successfully, but these errors were encountered:

Lixi20 · 2024-08-15T03:06:38Z

@beveradb

beveradb · 2024-08-16T22:26:04Z

Sorry @Lixi20 but I don't know, I didn't train that model or write the original implementation for the VR architecture!

If you really want to figure it out, you can investigate what the code is doing and try to debug it:

Good luck! If you find a fix, please raise a PR and I'll test & merge it :)

Lixi20 · 2024-08-29T08:36:48Z

here

code

Hope it helps you! !! !!

@beveradb

beveradb · 2024-09-15T18:50:38Z

Thank you so much @Lixi20 - this fix is now live in audio-separator version 0.19.2 🎉

beveradb added bug Something isn't working help wanted Extra attention is needed specific-model-not-working labels Aug 16, 2024

Lixi20 mentioned this issue Sep 6, 2024

fix Audio buffer is not finite everywhere #108

Merged

beveradb closed this as completed Sep 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why is there no sound sometimes when using the 5_HP-Karaoke-UVR.pth model? #98

Why is there no sound sometimes when using the 5_HP-Karaoke-UVR.pth model? #98

Lixi20 commented Aug 15, 2024 •

edited

Loading

Lixi20 commented Aug 15, 2024

beveradb commented Aug 16, 2024

Lixi20 commented Aug 29, 2024 •

edited

Loading

beveradb commented Sep 15, 2024

Why is there no sound sometimes when using the 5_HP-Karaoke-UVR.pth model? #98

Why is there no sound sometimes when using the 5_HP-Karaoke-UVR.pth model? #98

Comments

Lixi20 commented Aug 15, 2024 • edited Loading

Lixi20 commented Aug 15, 2024

beveradb commented Aug 16, 2024

Lixi20 commented Aug 29, 2024 • edited Loading

beveradb commented Sep 15, 2024

Lixi20 commented Aug 15, 2024 •

edited

Loading

Lixi20 commented Aug 29, 2024 •

edited

Loading