Skip to content

2.0.0

Latest
Compare
Choose a tag to compare
@jianfch jianfch released this 17 Mar 06:15
· 180 commits to main since this release
0f2f699

-changed python requirement from 3.7+ to 3.8+ (following Whisper)
-more reliable word-level timestamps (using Whisper's new method for word timestamps)
-transcribe() now returns WhisperResult object (allowing easier to manipulation of results)
-WhisperResult contains methods to save result as JSON/SRT/VTT/ASS
-WhisperResult contains methods to regroup segments word by word
-added Silero VAD for generating suppression mask (requires PyTorch 1.2.0+)
-improved non-vad suppression
-added visualize_suppression() for visualizing suppression based on arguments (requires Pillow or opencv-python)
-SRT/VTT/ASS outputs now all support both segment-level and word-level