Skip to content

sapiensmalti/smbc-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

smbc-scraper

Scrape Saturday Morning Breakfast Cereal comics (cartoons + title text). Scraper uses the Beautiful Soup 4 library.

To run, install Python 3.x and bs4. Then hit: python3 smbc-scraper.py

A new directory is created for each page where the cartoons are stored in the original format (PNG for the more recent comics and GIF for older ones) and the titles are stored in a text file. Everything is stored in the current directory from which the script is invoked.

As of this writing, the server doesn't block automated requests. If you get blocked, try increasing the random delay range in line 91. Else, you'll need to resort to some IP rotation.

About

Scraper for Saturday Morning Breakfast Cereal.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages