yTwhisper

A Python-based tool to transcribe YouTube videos using OpenAI's Whisper model. This project facilitates the download of YouTube video audio, transcribes it into text, and saves the transcription for easy access and analysis.

📜 Disclaimer and Compliance

Respecting YouTube's Terms of Service (ToS) is of utmost importance. This project is intended solely for research and educational purposes. By using this tool, you agree to comply with YouTube's Terms of Service and ensure that you have the necessary permissions to download and transcribe the content. Unauthorized downloading, redistribution, or commercial use of YouTube content violates YouTube's policies and may infringe on copyright laws.

⚠️ Warning:

Users: Ensure you have explicit permission from content creators before downloading and transcribing their videos. Use this tool responsibly and ethically.
Contributors: By contributing to this project, you agree to uphold these compliance standards and avoid facilitating any misuse of the tool that violates YouTube's ToS or applicable laws.

Legal Notice: This software is provided "as-is", without any express or implied warranties. In no event will the authors be held liable for any damages arising from the use of this software.

🚀 Features

Download Audio: Extracts audio from YouTube videos efficiently.
Transcribe with Whisper: Utilizes OpenAI's Whisper model for accurate transcription.
Save Transcriptions: Stores transcriptions in organized text files for easy access.
Portable and Open-Source: Easily shareable and adaptable for various use cases.

🛠️ Installation

Prerequisites

macOS 11.7 or higher
Homebrew: A package manager for macOS. Install Homebrew
Conda: An open-source package management and environment management system. Install Miniconda or Anaconda
GPU Support (Optional):
- Apple Silicon (M1, M2, etc.): To utilize GPU acceleration via Metal Performance Shaders (MPS).

Step-by-Step Setup

Clone the Repository

git clone [email protected]:dimasusername/ytwhisper.git
cd ytwhisper

Install FFmpeg via Homebrew

Although FFmpeg is included in the Conda environment, installing it via Homebrew can provide system-wide access, which might be beneficial for other applications.
```
brew install ffmpeg
```
Set Up the Conda Environment

a. Create the Conda Environment
```
conda env create -f environment.yml
```
b. Activate the Environment
```
conda activate ytwhisper
```
c. Verify the Environment

Ensure that all dependencies are installed correctly.
```
conda list
```
Additional Recommendations

4.1. Verify GPU Availability

To ensure that your system is utilizing the GPU, you can add the following lines to your script to print device information:
```
import torch
print("Torch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
print("MPS available:", torch.backends.mps.is_available())
```

🎯 Usage

Run the main.py script with the YouTube video URL as an argument:

python main.py "https://www.youtube.com/watch?v=your_video_id"

Example:

python main.py "https://www.youtube.com/watch?v=dQw4w9WgXcQ"

Output

Audio File: Saved in the downloads/ directory.
Transcription: Saved as a .txt file in the transcriptions/ directory.

Troubleshooting

If yt-dlp complains about ffmpeg location, test that ffmpeg actually works. This thread on SO may shed some light.

🧰 Project Structure

yTwhisper/
├── downloads
├── transcriptions
├── .gitignore
├── CODE_OF_CONDUCT.md
├── environment.yml
├── LICENSE
├── main.py
├── README.md
└── TERMS_OF_USE.md

downloads/: Stores downloaded audio files (excluded from version control).
transcriptions/: Stores the resulting transcription text files (excluded from version control).
main.py: Main script to execute the transcription process.
requirements.txt: Lists all Python dependencies.
README.md: Project documentation.
.gitignore: Specifies files and directories to ignore in version control.

🤝 Contributing

Contributions are welcome! Please adhere to the following guidelines:

Respect Compliance Standards: Ensure that any contributions do not facilitate the violation of YouTube's ToS or copyright laws.
Code Quality: Maintain clean, readable, and well-documented code.
Submit Issues and Pull Requests: Use GitHub's issue tracker for reporting bugs or suggesting features. Submit pull requests for proposed changes.

To Contribute:

Fork the repository.
Create a new branch for your feature or bugfix.
Commit your changes with clear messages.
Push to your fork and submit a pull request.

📄 License

This project is licensed under the Apache License 2.0.

🧩 Third-Party Licenses

The ytwhisper project incorporates the following third-party libraries and tools, each governed by their respective licenses:

yt-dlp - Unlicense
OpenAI Whisper - MIT License
ffmpeg-python - MIT License
torch - BSD 3-Clause License
torchvision - BSD 3-Clause License
torchaudio - BSD 3-Clause License
numpy - BSD 3-Clause License
ffmpeg - LGPLv2.1+ / GPLv3+ License

📧 Contact

For questions, suggestions, or support, please open an issue on the GitHub repository or contact [email protected].

🌐 Acknowledgements

The ytwhisper project leverages a variety of open-source libraries and tools. Special thanks to the following projects and communities for making this tool possible:

yt-dlp - Advanced YouTube video downloader and extractor.
OpenAI Whisper - State-of-the-art speech recognition model.
FFmpeg - Comprehensive multimedia framework for audio and video processing.
Ollama - Efficient model management and deployment tool.
PyTorch - Flexible and powerful deep learning framework.
Torchvision - PyTorch's computer vision library.
Torchaudio - PyTorch's audio processing library.
NumPy - Fundamental package for scientific computing with Python.

🙏 Special Thanks To:

The Open-Source Community: For continuously contributing to and maintaining the libraries that power this project.
YouTube API and Developers: For providing the tools that enable video data extraction and processing.
All Contributors: Your feedback, suggestions, and code contributions have been invaluable in shaping this tool.

Note:

Ethical Use: This tool is intended for research and educational purposes. Please ensure you have the necessary permissions to download and transcribe YouTube content.

Compliance: Always adhere to YouTube's Terms of Service and respect content creators' rights.

EOF

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

yTwhisper

📜 Disclaimer and Compliance

🚀 Features

🛠️ Installation

Prerequisites

Step-by-Step Setup

4.1. Verify GPU Availability

🎯 Usage

Output

Troubleshooting

🧰 Project Structure

🤝 Contributing

📄 License

🧩 Third-Party Licenses

📧 Contact

🌐 Acknowledgements

🙏 Special Thanks To:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
TERMS_OF_USE.md		TERMS_OF_USE.md
environment.yml		environment.yml
main.py		main.py

License

dimasusername/ytwhisper

Folders and files

Latest commit

History

Repository files navigation

yTwhisper

📜 Disclaimer and Compliance

🚀 Features

🛠️ Installation

Prerequisites

Step-by-Step Setup

4.1. Verify GPU Availability

🎯 Usage

Output

Troubleshooting

🧰 Project Structure

🤝 Contributing

📄 License

🧩 Third-Party Licenses

📧 Contact

🌐 Acknowledgements

🙏 Special Thanks To:

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages