Scrap facebook live gaming page and extracts live users and push them to faktory worker to parse their detail(name, uid, username, number of follower, number of likes, contact details - email, social links) and their posts(post-id, text, datetime, hashtags, links, images) and finally save them in MongoDB.
- Install faktory server.
- Install MongoDB and create database
aggero_fb
and two collectionuser_details
andposts
. - pip install -r requirements.txt
- Change
URL_FACTORY
password inutils.py
file.
-
Navigate to
main
package. -
First run
python3 consumer.py
-
Then
python3 producer.py -nup 10 -nps 3
. You must need to provide-nup
and-nps
-nup
- Number of live users to parse- Choices :
int
value greater than 0 orstr
valueall
- Choices :
-nps
- Number of scroll while parsing user posts. Every scroll gives about 18 posts.- Choices :
int
value greater than 0 orstr
valueall
- Choices :
-nup 10 -nps 3
works best for testing,all
may be used in production.
-
Scraping should be started now.
- Configure Tor for proxy.
- Build a error database and send daily error report email to admin.
- By default I am running one worker process but you can modify it in
consumer.py
file as number of cores in your pc. - Create index for MongoDB.
- Write tests.