Help with debugging incorrect speech timestamps #574

Answered by snakers4
sha-roze asked this question in Q&A
v5 has problems with singing, but v3.1 works:

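import torch

SAMPLING_RATE = 16000  # assumed value; not shown in the original post
USE_ONNX = False       # assumed value; not shown in the original post

# pin the v3.1 tag in repo_or_dir so torch.hub does not fetch the latest model
model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad:v3.1',
                              model='silero_vad',
                              force_reload=True,
                              onnx=USE_ONNX)

# utils is a tuple of helper functions, unpacked in this fixed order
(get_speech_timestamps,
 save_audio,
 read_audio,
 VADIterator,
 collect_chunks) = utils

wav = read_audio('prayer.wav', sampling_rate=SAMPLING_RATE)
# get speech timestamps from full audio file
speech_timestamps = get_speech_timestamps(wav, model,
                                          sampling_rate=SAMPLING_RATE,
                                          visualize_probs=True,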
                                      …
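
When comparing what two model versions return for the same file, it can help to look at the timestamps in seconds rather than samples. The sketch below is not part of the answer above: the helper names `timestamps_to_seconds` and `total_speech_seconds` are hypothetical, and it assumes that `get_speech_timestamps` returned its default output (a list of dicts with `start` and `end` given as sample indices) and that the audio was read at 16 kHz.

```python
def timestamps_to_seconds(speech_timestamps, sampling_rate=16000):
    """Convert sample-index timestamps to seconds, rounded to milliseconds."""
    return [
        {"start": round(ts["start"] / sampling_rate, 3),
         "end": round(ts["end"] / sampling_rate, 3)}
        for ts in speech_timestamps
    ]


def total_speech_seconds(speech_timestamps, sampling_rate=16000):
    """Sum the duration of all detected speech segments, in seconds."""
    total_samples = sum(ts["end"] - ts["start"] for ts in speech_timestamps)
    return total_samples / sampling_rate


# Made-up timestamps for illustration only, not real model output.
example = [{"start": 16000, "end": 48000}, {"start": 64000, "end": 80000}]
print(timestamps_to_seconds(example))
print(total_speech_seconds(example))
```

Running both helpers on the output of each model version for the same file gives a quick way to see whether one version drops or shortens the sung segments.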
