house_advertisement_crawler

A crawler that contains house advertisement data from cragslists site.

Build With

apt-get install python3-virtualenv

virtualenv -p python3 venv

. venv/bin/activate

pip install the requirements

to run the projects first of all we have to find the links we want to crawl so you have to run:

python3 main.py "find_links"

python3 main.py "extract_pages"

now the crawl is done and you have pure data in case you want to download the image from links:

python3 main.py "download_images"

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.gitignore		.gitignore
README.md		README.md
config.py		config.py
crawl.py		crawl.py
main.py		main.py
mongo.py		mongo.py
parser.py		parser.py
requirements.txt		requirements.txt
storage.py		storage.py