whisperX Subs

This repository provides utilities for processing audio files with the whisperx tool. It adds three capabilities: formatting subtitles to a user-specified maximum length, downloading video and audio from YouTube, and extracting audio from videos stored on Google Drive.

Using the Notebooks

There are three Jupyter notebooks provided:

whisperx_subs.ipynb: Processes audio files and generates well-formatted subtitles.
youtube_to_audio.ipynb: Downloads video and audio from YouTube, which can then be processed with the whisperx_subs.ipynb notebook.
video_to_audio.ipynb: Extracts audio from videos stored on Google Drive and saves it as a separate file back to Google Drive.

Launching these notebooks gives you a hands-on environment in which to run, modify, and experiment with the code.
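For readers who want to see what the audio-extraction step might look like outside a notebook, here is a minimal sketch. It is not taken from video_to_audio.ipynb: the function names `build_ffmpeg_command` and `extract_audio` are hypothetical, the sketch assumes the `ffmpeg` command-line tool is installed, and the 16 kHz mono WAV output is an assumed choice that suits many speech models rather than a documented setting of this repository.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_ffmpeg_command(video_path: str, audio_path: str,
                         sample_rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that drops the video stream and writes WAV audio.

    Hypothetical helper for illustration; the sample rate is an assumption.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", str(video_path),   # input video
        "-vn",                   # discard the video stream
        "-acodec", "pcm_s16le",  # 16-bit PCM, i.e. plain WAV audio
        "-ar", str(sample_rate), # output sample rate in Hz
        "-ac", "1",              # downmix to a single (mono) channel
        str(audio_path),
    ]


def extract_audio(video_path: str, audio_path: Optional[str] = None) -> str:
    """Run ffmpeg and return the path of the audio file that was written."""
    if audio_path is None:
        # Default: same location and name as the video, with a .wav suffix.
        audio_path = str(Path(video_path).with_suffix(".wav"))
    subprocess.run(build_ffmpeg_command(video_path, audio_path), check=True)
    return audio_path
```

Calling `extract_audio("talk.mp4")` would write `talk.wav` next to the video, provided ffmpeg is available. If Google Drive is mounted as a local folder, both paths can point into that folder so the audio is saved back to Drive.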

About the Tool

This utility builds on the whisperx repository to process audio files and generate transcriptions. The functionality added here constrains the generated subtitles to a maximum length specified by the user, which helps keep subtitles readable and well paced for viewers.
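The idea of a maximum subtitle length can be illustrated with a short sketch. It is not the code used in whisperx_subs.ipynb: the function names and the `max_chars` parameter are hypothetical, and the input is assumed to be a list of word-level entries with `word`, `start`, and `end` keys, each carrying a timestamp. Words are packed greedily into a subtitle until adding the next word would exceed the limit.

```python
from typing import Dict, List


def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def _make_subtitle(words: List[Dict]) -> Dict:
    """Merge a run of words into one subtitle spanning first start to last end."""
    return {
        "start": words[0]["start"],
        "end": words[-1]["end"],
        "text": " ".join(w["word"] for w in words),
    }


def split_into_subtitles(words: List[Dict], max_chars: int = 42) -> List[Dict]:
    """Greedily group words so that each subtitle text has at most max_chars.

    A single word longer than max_chars still becomes its own subtitle.
    Assumes every word has 'start' and 'end' timestamps (an assumption of
    this sketch, not a guarantee of any particular transcription tool).
    """
    subtitles: List[Dict] = []
    current: List[Dict] = []
    for word in words:
        candidate = " ".join(w["word"] for w in current + [word])
        if current and len(candidate) > max_chars:
            subtitles.append(_make_subtitle(current))
            current = [word]
        else:
            current.append(word)
    if current:
        subtitles.append(_make_subtitle(current))
    return subtitles


def to_srt(subtitles: List[Dict]) -> str:
    """Render subtitles as SRT text: index, time range, text, blank line."""
    blocks = []
    for index, sub in enumerate(subtitles, start=1):
        time_range = f"{format_timestamp(sub['start'])} --> {format_timestamp(sub['end'])}"
        blocks.append(f"{index}\n{time_range}\n{sub['text']}\n")
    return "\n".join(blocks)
```

A greedy split keeps the sketch short. A production version might also break at punctuation or long pauses so that subtitle boundaries follow the rhythm of speech.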

Acknowledgements & License

Parts of this project utilize software developed by Max Bain. We would like to express our gratitude to Max Bain for his contribution.

Any use, redistribution, or modification of the software from the whisperx repository should adhere to the licensing terms provided by Max Bain. Please refer to the associated license file for detailed terms and conditions.
