Skip to content

Releases: gteufelberger/vo-scraper

v4.0.0

26 Dec 22:55
Compare
Choose a tag to compare

⚠️ vo-scraper is moving to GitHub! ⚠️

In order to ensure vo-scraper stays available even beyond my graduation, I'm moving all future development to GitHub via https://github.com/gteufelberger/vo-scraper

Changes

  • Check for updates on GitHub instead of GitLab

Fixes:

  • Enable parsing version numbers that contain a v

If you notice any bugs please raise an issue in the issue tracker

v3.2.1

26 Dec 21:48
Compare
Choose a tag to compare

Such a long time without breakage and then someone decided to upload audio-only recordings <.<

Fixes:

If you notice any bugs please raise an issue in the issue tracker

v3.2.0

26 Dec 21:47
Compare
Choose a tag to compare

Spring cleaning, time to make the progress-bar a bit more shiny :3

Features:

  • Nicer progress-bar if tqdm Python package is installed. If the package is not installed the scraper will just fall back to the old progress-bar.

If you notice any bugs please raise an issue in the issue tracker

v3.1.0

26 Dec 21:46
Compare
Choose a tag to compare

A new semester, a new small update

Features:

Breakage:

If you notice any bugs please raise an issue in the issue tracker

v3.0.1

26 Dec 21:44
Compare
Choose a tag to compare

Just two quick fixes:

Features:

  • Skipping lectures if metadata cannot be parsed. This mostly applies when trying to download recordings from a lecture that hasn't uploaded any yet.
  • Remove www. prefix from domain name. www.video.ethz.ch redirects to video.ethz.ch and can be used to download recordings but authentication with an ETH account would fail when using the former domain.

v3.0.0

26 Dec 21:43
Compare
Choose a tag to compare

It's the beginning of autumn, the leaves are dropping and so is this update:

Features:

  • Allow downloading specific resolutions (e.g. HD, 4K, or numeric values like 720p) instead of the previously vaguely defined high, medium, low.
    Default resolution is now HD if nothing is specified.
    This also fixes issues caused when a lecture supplied more then 3 different resolutions.
  • Added requirements.txt
  • Fixed the way the date is added to the filename to be consistent between older and newer lectures. ETH changed the timestamp formatting to include seconds halfway through SS21.
  • Fixed a few typos

Changes:

  • Renamed a few variables
  • Used some more f-strings for print formatting

Breakage:

  • For the video quality high, medium, and low will no longer work. Instead use highest for highest, lowest for lowest, or a specific numeric value for the desired resolution (e.g. 720p or keywords like HD, 4K).
  • As the quality of the video is also part of the filename this means what previously downloaded videos might get redownloaded when using the --all parameter. To prevent this generally it is recommended to use the --history parameter.

If you notice any bugs please raise an issue in the issue tracker

v2.0.0

26 Dec 21:42
Compare
Choose a tag to compare

Oh hey, didn't see you come in :O

Anyway, here's the changelog:

Features:

  • Added option for a parameter file. By default the scraper looks for a file called parameters.txt in the directory it is being run. If it finds it, it will read and apply any parameters described there. Check the Readme for more info.
  • Added usage hints to highlight some features after running the scraper. If you don't want to see them, use the parameter --disable-hints. You can also put it in parameters.txt ;)
  • If you dislike progress bars you can now hide it using --no-progress-bar

Changes:

  • Size of progress bar now depends on your terminal window size.

If you notice any bugs please raise an issue in the issue tracker

v1.3.0

26 Dec 21:40
Compare
Choose a tag to compare

A new semester, a new update:

Features:

Changes:

  • Changed --print-src to --print-source

Breakage:

  • dd12de3 changes the filename of newly downloaded videos and subsequently re-downloads are only prevented if the video on disk already has the new filename.
    Therefore, this will cause a re-download of all recordings specified by you if you rely solely on existing filenames to prevent downloading old videos again.
    Users making use of the history file via --history will not be affected.
  • 213e77a will break any automated scripts making use of --print-src. Make sure to update them to use --print-source if you make use of this flag!

If you notice any bugs please raise an issue in the issue tracker

v1.2.1

26 Dec 21:40
Compare
Choose a tag to compare

Long time no see, here's a small update:

Features:

  • Added the possibility to select multiple videos by specifying a range. Documentation for this has been added to the README file.

Fixes:

  • Fixed no longer being able to authenticate for videos requiring an ETH login.
    To be precise: Something in the backend of video.ethz.ch changed, so the "Referer" header is now required when sending a POST request to acquire a cookie for "ETH" protected videos.
    Closes https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/2

If you notice any bugs please raise an issue in the issue tracker

v1.2

26 Dec 21:38
Compare
Choose a tag to compare

It's Monday morning so it's time for a new release. This one includes:

Features:

  • --destination flag to specify the location to save videos to. Closes https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/1
  • --history flag to specify a file where to print/read episodes URLs from/to so you can finally delete watched lectures without having to worry they might re-download.
  • --version flag to print the program version and exit.
  • Ignore empty lines in link file.
  • Strip special characters such as ? < > : * | " ^ from filenames as they can cause issues on NTFS.

Changes:

  • Moved date further to the front when printing available/selected lecture episodes.
  • Collect all episode links first and then start the download so you no longer have to wait for a download to finish in order to select episodes of the next lecture if you pass multiple links.
  • Moved the code that creates folders so no folders are created when you just want to print the source links of lecture episodes with --print-src.
  • Replaced occurrences of / with os.sep to become more OS agnostic.
  • Check if the scraper is being run directly to allow for use as library.
  • and finally some general code clean up.

If you notice any bugs please raise an issue in the issue tracker