The crawler for WaveGenAI
- Install the required packages
pip install -r requirements.txt
- Install Docker
Run the proxy
docker run -d --rm -it -p 3128:3128 -p 4444:4444 -e "TOR_INSTANCES=40" zhaowde/rotating-tor-http-proxy
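Before starting a long crawl, it can help to confirm that the proxy container is actually forwarding traffic. The snippet below is a minimal sketch, not part of the crawler: it assumes the HTTP proxy is reachable on 127.0.0.1:3128 (the first port published by the command above), and it uses https://api.ipify.org purely as an illustrative "what is my IP" endpoint. The helper names `build_proxy_opener` and `fetch_exit_ip` are hypothetical.

```python
import urllib.request

# Assumption: the proxy container publishes its HTTP proxy on localhost:3128.
PROXY_URL = "http://127.0.0.1:3128"


def build_proxy_opener(proxy_url: str = PROXY_URL) -> urllib.request.OpenerDirector:
    """Return an opener that sends HTTP and HTTPS requests through the proxy."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


def fetch_exit_ip(
    opener: urllib.request.OpenerDirector,
    url: str = "https://api.ipify.org",  # illustrative endpoint, not used by the crawler
    timeout: float = 30.0,
) -> str:
    """Fetch the public IP address as seen by the remote server."""
    with opener.open(url, timeout=timeout) as response:
        return response.read().decode("utf-8").strip()


if __name__ == "__main__":
    opener = build_proxy_opener()
    # With a rotating proxy, repeated requests may report different addresses.
    for _ in range(3):
        print(fetch_exit_ip(opener))
```

If the requests fail right after the container starts, the Tor instances may still be bootstrapping; waiting a little and retrying is usually enough.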
Run the crawler
python main.py --csv --input FILE.txt --overwrite --file_name FILE.csv --num_processes 40
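To launch the crawler from another Python script rather than from the shell, the same command can be assembled as an argument list. This is only a sketch: the flags are copied verbatim from the command above, while the helper name `build_crawler_command` and its parameters are hypothetical, and it assumes `main.py` is in the current working directory.

```python
import subprocess
import sys


def build_crawler_command(input_path: str, output_name: str, num_processes: int = 40) -> list[str]:
    """Assemble the crawler command line using the flags shown in this README."""
    return [
        sys.executable,  # run main.py with the current Python interpreter
        "main.py",
        "--csv",
        "--input", input_path,
        "--overwrite",
        "--file_name", output_name,
        "--num_processes", str(num_processes),
    ]


if __name__ == "__main__":
    # check=True raises CalledProcessError if the crawler exits with a non-zero status.
    subprocess.run(build_crawler_command("FILE.txt", "FILE.csv"), check=True)
```

Passing the arguments as a list avoids shell quoting problems when file names contain spaces.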
This project is licensed under the MIT License. See the LICENSE file for details.