CafeBazzar Crawler

What is this?

This is a crawler for this site which crawled all categories with its api and save data like:

app name
app path
app link
app author name
app category name
app price
app review count
app install count

in mysql database.

Why should I make it?

I make it for practice, fun and work with Peewee ORM!

How Work it?

This site send json response to user Its has api and crawler work with that. api has a payload we can send app path with that and get information from site but before this we collect all categories from site and send in payload to other api and get all app path in that category and use app paths in get information app.

What we need?

this crawler has a bug, but I can't find that. what that? I get app path and convert to set which duplicate app path deleted and save in database and before this check exists app path if that exists don't save that but two or three app path save duplicate and also this problem in information apps in database

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
crawler		crawler
models		models
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CafeBazzar Crawler

What is this?

Why should I make it?

How Work it?

What we need?

About

Releases

Packages

Languages

MrMohammadY/CafeBazzar

Folders and files

Latest commit

History

Repository files navigation

CafeBazzar Crawler

What is this?

Why should I make it?

How Work it?

What we need?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages