The scripts are used to automatically detect broken links within http://www.sable.mcgill.ca/mclab.
Note: if the content of a website has been removed but it is still accessible, the link of the webite is considered a valid one (i.e. http://vineetkumar.net/).
Additional Python libraries should be installed before you run the main script checkall.py
.
pip install selenium
pip install BeautifulSoup
python checkall.py
Check broken-links.txt
to see the output of the command line.
checkall.py
: the main script- Libraries:
crawlib
: from crawlib.pyurllib2