Parses CBC Youtube videos to extract subtitles. This is then formatted nicely for human consumption
These notes are compiled and summarized by a human here.
- Make sure a non-system Ruby is installed (I'm using 2.6.x)
- Run
bundle install
to install requires gems using Bundler - Run
ruby captions.rb -h
- Run
ruby captions.rb -v VIDEO_ID
to parse a single video to STDOUT (Video ID is in the youtube URL. E.g. https://www.youtube.com/watch?v=VIDEO_ID) - Run
ruby captions.rb -t TOKEN
where token is a Google API token with the Youtube Data API enabled. This will parse the CBC playlist and output all files to the docs folder.