lukey-luke/simpleCrawler

simpleCrawler - A Simple Web Crawler

A Simple (multi-threaded) Web Crawler written in Python

CustomWebSpiderImg

Full disclosure: Credit goes to Bucky Roberts for developing the Python tutorials on his website and for introducing the idea of the web crawler/web spider.

This project served as an opportunity to familiarize myself with Python and expand upon an existing project.

TODO

  • Make the crawler obey the robots.txt file (done!)
  • Update domain.py to allow more link formats (e.g. example.com in addition to https://www.example.com/) (done!)
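
Both items above can be illustrated with the standard library alone. The following is a minimal sketch, not the code in this repository: the function names (`normalize_url`, `get_domain_name`, `build_robot_parser`, `is_allowed`), the default `https` scheme, and the `simpleCrawler` user-agent string are all assumptions made for the example.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser


def normalize_url(link, default_scheme="https"):
    """Return the link with a scheme, adding one when it is missing.

    Hypothetical helper: lets 'example.com' be handled the same way
    as 'https://www.example.com/'.
    """
    link = link.strip()
    if "://" not in link:
        link = f"{default_scheme}://{link}"
    return link


def get_domain_name(link):
    """Return the last two labels of the host, e.g. 'example.com'.

    Naive assumption: this is wrong for suffixes such as 'co.uk'.
    """
    host = urlparse(normalize_url(link)).hostname or ""
    return ".".join(host.split(".")[-2:])


def build_robot_parser(robots_txt):
    """Build a parser from the text of a robots.txt file."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser, url, user_agent="simpleCrawler"):
    """Return True if robots.txt lets user_agent fetch the URL."""
    return parser.can_fetch(user_agent, normalize_url(url))
```

In a live crawl the rules would normally be downloaded rather than passed in as text, for example with `RobotFileParser.set_url(...)` followed by `read()`. Taking the text as an argument keeps the sketch usable without network access.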
