Weedmaps Scraper Module for Nest

Scrapes every strain on Weedmaps and helps you organize them by price and location.

Requirements

  • MongoDB up and running
  • Node.js

Installation

git clone https://github.com/dsalehipour/nest-weedmaps.git
cd nest-weedmaps
npm install

Also, make sure MongoDB is up and running. See Install MongoDB.

Usage

  1. Scrape Weedmaps by running node index.js

What's happening?

After you run index.js, the workers (scraper bots) go to the strains directory, scrape the 40 strains in the grid, store those scraped items in the database, and queue one scraping job per strain, keyed by its href. They then paginate and repeat the process on the next page of the strains directory.

Meanwhile, the other workers pick up the jobs in the queue, scrape each strain page, and update the matching strain in the database, looked up by its href.
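The flow described above can be sketched in plain JavaScript. This is only an illustration and not the module's actual code: the page data, the `db` map (standing in for the MongoDB collection), the `queue` array, and the function names are all hypothetical, and nothing here touches the network or the Nest API. It also runs the two phases one after the other, whereas the real workers run at the same time.

```javascript
// Hypothetical in-memory stand-ins for the strains directory pages.
const listPages = [
  {
    items: [
      { href: '/strains/a', name: 'Strain A' },
      { href: '/strains/b', name: 'Strain B' },
    ],
    next: 1, // index of the next directory page
  },
  {
    items: [{ href: '/strains/c', name: 'Strain C' }],
    next: null, // last page
  },
];

// Hypothetical stand-ins for the individual strain pages.
const detailPages = {
  '/strains/a': { description: 'desc A' },
  '/strains/b': { description: 'desc B' },
  '/strains/c': { description: 'desc C' },
};

const db = new Map(); // stands in for the database, keyed by href
const queue = [];     // stands in for the job queue

// Phase 1: store every item on a directory page and queue a job for it.
function scrapeListPage(index) {
  const page = listPages[index];
  for (const item of page.items) {
    db.set(item.href, { ...item });
    queue.push({ href: item.href });
  }
  return page.next;
}

// Phase 2: scrape one strain page and merge it into the stored item.
function scrapeDetailJob(job) {
  const details = detailPages[job.href];
  db.set(job.href, { ...db.get(job.href), ...details });
}

function run() {
  let next = 0;
  while (next !== null) {
    next = scrapeListPage(next); // paginate until there is no next page
  }
  while (queue.length > 0) {
    scrapeDetailJob(queue.shift()); // drain the queue in FIFO order
  }
  return db;
}

const result = run();
console.log(result.size, 'items stored');
```

Keying both the stored item and the queued job by href is what lets the second phase update an existing record instead of inserting a duplicate.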

Try looking at the scraped data using the mongo shell:

mongo nest
> db.items.count()
> db.items.find().pretty()

Have fun.