Releases: gteufelberger/vo-scraper
v4.0.0
In order to ensure vo-scraper stays available even beyond my graduation, I'm moving all future development to GitHub via https://github.com/gteufelberger/vo-scraper
Changes
- Check for updates on GitHub instead of GitLab
Fixes:
- Enable parsing version numbers that contain a
v
If you notice any bugs please raise an issue in the issue tracker
v3.2.1
Such a long time without breakage and then someone decided to upload audio-only recordings <.<
Fixes:
- Catch an error caused by audio-only recordings (https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/7)
If you notice any bugs please raise an issue in the issue tracker
v3.2.0
Spring cleaning, time to make the progress-bar a bit more shiny :3
Features:
- Nicer progress-bar if
tqdm
Python package is installed. If the package is not installed the scraper will just fall back to the old progress-bar.
If you notice any bugs please raise an issue in the issue tracker
v3.1.0
A new semester, a new small update
Features:
- Added more keywords to specify resolutions. E.g. you can use
--quality low
to download a recording in360p
. (part of https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/5) - Print selected resolution for downloading at beginning of script (part of https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/5)
- Allow comments in parameter file
Breakage:
- Changed mapping for
HD
from1080p
to720p
and added a mapping forFullHD
to1080p
(part of https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/5)
If you notice any bugs please raise an issue in the issue tracker
v3.0.1
Just two quick fixes:
Features:
- Skipping lectures if metadata cannot be parsed. This mostly applies when trying to download recordings from a lecture that hasn't uploaded any yet.
- Remove
www.
prefix from domain name.www.video.ethz.ch
redirects tovideo.ethz.ch
and can be used to download recordings but authentication with an ETH account would fail when using the former domain.
v3.0.0
It's the beginning of autumn, the leaves are dropping and so is this update:
Features:
- Allow downloading specific resolutions (e.g.
HD
,4K
, or numeric values like720p
) instead of the previously vaguely definedhigh
,medium
,low
.
Default resolution is nowHD
if nothing is specified.
This also fixes issues caused when a lecture supplied more then 3 different resolutions. - Added
requirements.txt
- Fixed the way the date is added to the filename to be consistent between older and newer lectures. ETH changed the timestamp formatting to include seconds halfway through SS21.
- Fixed a few typos
Changes:
- Renamed a few variables
- Used some more f-strings for print formatting
Breakage:
- For the video quality
high
,medium
, andlow
will no longer work. Instead usehighest
for highest,lowest
for lowest, or a specific numeric value for the desired resolution (e.g.720p
or keywords likeHD
,4K
). - As the quality of the video is also part of the filename this means what previously downloaded videos might get redownloaded when using the
--all
parameter. To prevent this generally it is recommended to use the--history
parameter.
If you notice any bugs please raise an issue in the issue tracker
v2.0.0
Oh hey, didn't see you come in :O
Anyway, here's the changelog:
Features:
- Added option for a parameter file. By default the scraper looks for a file called
parameters.txt
in the directory it is being run. If it finds it, it will read and apply any parameters described there. Check the Readme for more info. - Added usage hints to highlight some features after running the scraper. If you don't want to see them, use the parameter
--disable-hints
. You can also put it inparameters.txt
;) - If you dislike progress bars you can now hide it using
--no-progress-bar
Changes:
- Size of progress bar now depends on your terminal window size.
If you notice any bugs please raise an issue in the issue tracker
v1.3.0
A new semester, a new update:
Features:
- Added pseudo-hash to filename. Closes https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/3
- Updated documentation in README. Closes https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/4
Changes:
- Changed
--print-src
to--print-source
Breakage:
- dd12de3 changes the filename of newly downloaded videos and subsequently re-downloads are only prevented if the video on disk already has the new filename.
Therefore, this will cause a re-download of all recordings specified by you if you rely solely on existing filenames to prevent downloading old videos again.
Users making use of the history file via--history
will not be affected. - 213e77a will break any automated scripts making use of
--print-src
. Make sure to update them to use--print-source
if you make use of this flag!
If you notice any bugs please raise an issue in the issue tracker
v1.2.1
Long time no see, here's a small update:
Features:
- Added the possibility to select multiple videos by specifying a range. Documentation for this has been added to the README file.
Fixes:
- Fixed no longer being able to authenticate for videos requiring an ETH login.
To be precise: Something in the backend of video.ethz.ch changed, so the "Referer" header is now required when sending a POST request to acquire a cookie for "ETH" protected videos.
Closes https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/2
If you notice any bugs please raise an issue in the issue tracker
v1.2
It's Monday morning so it's time for a new release. This one includes:
Features:
--destination
flag to specify the location to save videos to. Closes https://gitlab.ethz.ch/tgeorg/vo-scraper/-/issues/1--history
flag to specify a file where to print/read episodes URLs from/to so you can finally delete watched lectures without having to worry they might re-download.--version
flag to print the program version and exit.- Ignore empty lines in link file.
- Strip special characters such as
? < > : * | " ^
from filenames as they can cause issues on NTFS.
Changes:
- Moved date further to the front when printing available/selected lecture episodes.
- Collect all episode links first and then start the download so you no longer have to wait for a download to finish in order to select episodes of the next lecture if you pass multiple links.
- Moved the code that creates folders so no folders are created when you just want to print the source links of lecture episodes with
--print-src
. - Replaced occurrences of
/
withos.sep
to become more OS agnostic. - Check if the scraper is being run directly to allow for use as library.
- and finally some general code clean up.
If you notice any bugs please raise an issue in the issue tracker