Skip to content

When given a list of URLs from Internet Archive will download associated full text objects and metadata (Written in PHP)

Notifications You must be signed in to change notification settings

elibtronic/iascraper

Repository files navigation

IA Scraper v0.8php -by Tim Ribaric [email protected]

The Internet Archive Scraper

Simply untar the software edit config.php and you should be ready to go
You'll need CURL installed and working in your php config for this to work

The main page will suggest what you should add to your crontab to automate the RSS scraping

The downloading takes a bit so be patient (items can be up to 25megs), it is best to automate the RSS scrape and use that.

Next version will be in Python... Until then this is mostly an exercise in learning Git.

About

When given a list of URLs from Internet Archive will download associated full text objects and metadata (Written in PHP)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages