This repository has been archived by the owner on Feb 11, 2024. It is now read-only.
word timestamp #7
Comments
Hey! You can extract alignment information by adding some code after these lines:
LibreASR/libreasr/lib/models.py, lines 853 to 892 in f08c8e8

Save the current encoder timestamp index t. To convert it to seconds:

    # encoder input freq, depends on the model architecture
    # usually 80ms
    encoder_freq = 0.08
    # rough alignment estimate for an output at encoder output index t
    t_seconds = t * encoder_freq

Note that this is just a rough estimate and the actual alignment is usually slightly ...
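
For anyone who wants word-level times rather than per-token ones, here is a minimal sketch of how the saved indices might be grouped. It is not part of LibreASR: the `word_timestamps` helper, the `(token, t)` input format, and the rule that a token with a leading space starts a new word are all assumptions made for illustration. Only the `t * encoder_freq` conversion comes from the comment above.

```python
from typing import List, Tuple

# Seconds per encoder output step; 80 ms is the "usual" value mentioned above
# and depends on the model architecture.
ENCODER_FREQ = 0.08


def to_seconds(t: int, encoder_freq: float = ENCODER_FREQ) -> float:
    """Rough time estimate for an output emitted at encoder output index t."""
    return t * encoder_freq


def word_timestamps(
    tokens: List[Tuple[str, int]], encoder_freq: float = ENCODER_FREQ
) -> List[Tuple[str, float, float]]:
    """Group (token, t) pairs into (word, start_seconds, end_seconds).

    Assumption (hypothetical tokenizer convention): a token that begins
    with a space starts a new word; any other token continues the last word.
    """
    words: List[list] = []
    for token, t in tokens:
        seconds = to_seconds(t, encoder_freq)
        if token.startswith(" ") or not words:
            # New word: start and end both point at this token's time.
            words.append([token.strip(), seconds, seconds])
        else:
            # Continuation: extend the text and move the end time forward.
            words[-1][0] += token
            words[-1][2] = seconds
    return [(w, round(s, 2), round(e, 2)) for w, s, e in words]


if __name__ == "__main__":
    # Made-up tokens and encoder indices, for illustration only.
    example = [(" he", 3), ("llo", 5), (" world", 12)]
    print(word_timestamps(example))
```

The start and end values inherit the same caveat as the per-token estimate: they are approximate, so treat them as a rough guide rather than exact word boundaries.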
Can I get word timestamps when predicting on an audio file?