Releases: crawlab-team/crawlab
v0.4.1
v0.4.0
v0.3.5
v0.3.4
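1. Fixed an issue where non-customized spiders could not be viewed in the frontend
2. Fixed an issue where killing the main process did not kill its child processes
3. Fixed incorrect status when a spider exits abnormally
4. Fixed incorrect status after a process is killed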
V0.3.3
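1. Fixed an error caused by special characters in the MongoDB password
2. Fixed an error when creating the temporary directory
3. Fixed incorrect MD5 value comparison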
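The MongoDB password issue is typical of connection strings built by plain string concatenation: characters such as `@`, `:` and `/` in the password change how the URI is parsed. A minimal sketch of the usual remedy, percent-encoding the credentials with Go's standard `net/url` package, is shown below. The function name `buildMongoURI` and the sample values are hypothetical, and this is not necessarily how the fix was implemented in Crawlab.

```go
package main

import (
	"fmt"
	"net/url"
)

// buildMongoURI (hypothetical helper) assembles a MongoDB connection
// string and lets net/url percent-encode the username and password.
func buildMongoURI(user, password, host, db string) string {
	u := url.URL{
		Scheme: "mongodb",
		User:   url.UserPassword(user, password),
		Host:   host,
		Path:   "/" + db,
	}
	return u.String()
}

func main() {
	// Sample values for illustration only.
	uri := buildMongoURI("crawlab", "p@ss:w/rd", "localhost:27017", "crawlab_test")
	fmt.Println(uri)
	// prints: mongodb://crawlab:p%40ss%3Aw%2Frd@localhost:27017/crawlab_test
}
```

With the password escaped, the only literal `@` left in the URI is the one separating the credentials from the host.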
V0.3.2
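1. Refactored the spider synchronization flow to sync spiders directly from GridFS
2. Fixed an issue where spider logs could not be retrieved
3. Fixed an issue where spiders could not be synchronized
4. Fixed an issue where spiders could not be deleted
5. Fixed an issue where tasks could not be stopped properly
6. Improved search in the spider list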
v0.3.1
Features / Enhancement
- Docker Image Optimization. Further split the Docker image into master, worker, and frontend images based on Alpine.
- Unit Tests. Covered part of the backend code with unit tests.
- Frontend Optimization. Optimized the login page, button sizes, and upload UI hints.
- More Flexible Node Registration. Users can pass a variable as the key for node registration instead of the default MAC address.
Bug Fixes
- Uploading Large Spider Files Error. Memory crash when uploading large spider files. #150
- Unable to Sync Spiders. Fixed by raising the write permission level when synchronizing spider files. #114
- Spider Page Issue. Fixed by removing the field "Site". #112
- Node Display Issue. Nodes did not display correctly when running Docker containers on multiple machines. #99
v0.3.0
Features / Enhancement
- Golang Backend: Rewrote the backend from Python to Golang for much better stability and performance.
- Node Network Graph: Visualization of node topology.
- Node System Info: View node system info including OS, CPUs and executables.
- Node Monitoring Enhancement: Nodes are monitored and registered through Redis.
- File Management: Edit spider files online, with code highlighting.
- Login/Register/User Management: Requires users to log in to use Crawlab, and adds user registration, user management, and some role-based authorization.
- Automatic Spider Deployment: Spiders are deployed/synchronized to all online nodes automatically.
- Smaller Docker Image: Slimmed Docker image and reduced Docker image size from 1.3G to ~700M by applying Multi-Stage Build.
Bug Fixes
- Node Status. Node status did not change even after the node actually went offline. #87
- Spider Deployment Error. Fixed through Automatic Spider Deployment #83
- Node not showing. Nodes failed to show as online. #81
- Cron Job not working. Fixed through new Golang backend #64
- Flower Error. Fixed through new Golang backend #57
v0.2.4
Features / Enhancement
- Documentation: Better and much more detailed documentation.
- Better Crontab: Build crontab expressions through the crontab UI.
- Better Performance: Switched from native flask engine to gunicorn. #78