mirror of
https://github.com/crawlab-team/crawlab.git
synced 2026-01-21 17:21:09 +01:00
11 KiB
11 KiB
0.4.4 (unknown)
Features / Enhancement
- Email Notification. Allow users to send email notifications.
- DingTalk Robot Notification. Allow users to send DingTalk Robot notifications.
- API Address Optimization. Added relative URL path in frontend so that users don't have to specify
CRAWLAB_API_ADDRESSexplicitly. - SDK Compatiblity. Allow users to integrate Scrapy or general spiders with Crawlab SDK.
Bug Fixes
nil retuenederror.- Error when using HTTPS.
0.4.3 (2020-01-07)
Features / Enhancement
- Dependency Installation. Allow users to install/uninstall dependencies and add programming languages (Node.js only for now) on the platform web interface.
- Pre-install Programming Languages in Docker. Allow Docker users to set
CRAWLAB_SERVER_LANG_NODEasYto pre-installNode.jsenvironments. - Add Schedule List in Spider Detail Page. Allow users to view / add / edit schedule cron jobs in the spider detail page. #360
- Align Cron Expression with Linux. Change the expression of 6 elements to 5 elements as aligned in Linux.
- Enable/Disable Schedule Cron. Allow users to enable/disable the schedule jobs. #297
- Better Task Management. Allow users to batch delete tasks. #341
- Better Spider Management. Allow users to sort and filter spiders in the spider list page.
- Added Chinese
CHANGELOG. - Added Github Star Button at Nav Bar.
Bug Fixes
- Schedule Cron Task Issue. #423
- Upload Spider Zip File Issue. #403 #407
- Exit due to Network Failure. #340
- Cron Jobs not Running Correctly
- Schedule List Columns Mis-positioned
- Clicking Refresh Button Redirected to 404 Page
0.4.2 (2019-12-26)
Features / Enhancement
- Disclaimer. Added page for Disclaimer.
- Call API to fetch version. #371
- Configure to allow user registration. #346
- Allow adding new users.
- More Advanced File Management. Allow users to add / edit / rename / delete files. #286
- Optimized Spider Creation Process. Allow users to create an empty customized spider before uploading the zip file.
- Better Task Management. Allow users to filter tasks by selecting through certian criterions. #341
Bug Fixes
0.4.1 (2019-12-13)
Features / Enhancement
- Spiderfile Optimization. Stages changed from dictionary to array. #358
- Baidu Tongji Update.
Bug Fixes
0.4.0 (2019-12-06)
Features / Enhancement
- Configurable Spider. Allow users to add spiders using Spiderfile to configure crawling rules.
- Execution Mode. Allow users to select 3 modes for task execution: All Nodes, Selected Nodes and Random.
Bug Fixes
- Task accidentally killed. #306
- Documentation fix. #301 #301
- Direct deploy incompatible with Windows. #288
- Log files lost. #269
0.3.5 (2019-10-28)
Features / Enhancement
- Graceful Showdown. detail
- Node Info Optimization. detail
- Append System Environment Variables to Tasks. detail
- Auto Refresh Task Log. detail
- Enable HTTPS Deployment. detail
Bug Fixes
- Unable to fetch spider list info in schedule jobs. detail
- Unable to fetch node info from worker nodes. detail
- Unable to select node when trying to run spider tasks. detail
- Unable to fetch result count when result volume is large. #260
- Node issue in schedule tasks. #244
0.3.1 (2019-08-25)
Features / Enhancement
- Docker Image Optimization. Split docker further into master, worker, frontend with alpine image.
- Unit Tests. Covered part of the backend code with unit tests.
- Frontend Optimization. Login page, button size, hints of upload UI optimization.
- More Flexible Node Registration. Allow users to pass a variable as key for node registration instead of MAC by default.
Bug Fixes
- Uploading Large Spider Files Error. Memory crash issue when uploading large spider files. #150
- Unable to Sync Spiders. Fixes through increasing level of write permission when synchronizing spider files. #114
- Spider Page Issue. Fixes through removing the field "Site". #112
- Node Display Issue. Nodes do not display correctly when running docker containers on multiple machines. #99
0.3.0 (2019-07-31)
Features / Enhancement
- Golang Backend: Refactored code from Python backend to Golang, much more stability and performance.
- Node Network Graph: Visualization of node typology.
- Node System Info: Available to see system info including OS, CPUs and executables.
- Node Monitoring Enhancement: Nodes are monitored and registered through Redis.
- File Management: Available to edit spider files online, including code highlight.
- Login/Regiser/User Management: Require users to login to use Crawlab, allow user registration and user management, some role-based authorization.
- Automatic Spider Deployment: Spiders are deployed/synchronized to all online nodes automatically.
- Smaller Docker Image: Slimmed Docker image and reduced Docker image size from 1.3G to ~700M by applying Multi-Stage Build.
Bug Fixes
- Node Status. Node status does not change even though it goes offline actually. #87
- Spider Deployment Error. Fixed through Automatic Spider Deployment #83
- Node not showing. Node not able to show online #81
- Cron Job not working. Fixed through new Golang backend #64
- Flower Error. Fixed through new Golang backend #57
0.2.4 (2019-07-07)
Features / Enhancement
- Documentation: Better and much more detailed documentation.
- Better Crontab: Make crontab expression through crontab UI.
- Better Performance: Switched from native flask engine to
gunicorn. #78
Bugs Fixes
- Deleting Spider. Deleting a spider does not only remove record in db but also removing related folder, tasks and schedules. #69
- MongoDB Auth. Allow user to specify
authenticationDatabaseto connect tomongodb. #68 - Windows Compatibility. Added
eventlettorequirements.txt. #59
0.2.3 (2019-06-12)
Features / Enhancement
- Docker: User can run docker image to speed up deployment.
- CLI: Allow user to use command-line interface to execute Crawlab programs.
- Upload Spider: Allow user to upload Customized Spider to Crawlab.
- Edit Fields on Preview: Allow user to edit fields when previewing data in Configurable Spider.
Bugs Fixes
- Spiders Pagination. Fixed pagination problem in spider page.
0.2.2 (2019-05-30)
Features / Enhancement
- Automatic Extract Fields: Automatically extracting data fields in list pages for configurable spider.
- Download Results: Allow downloading results as csv file.
- Baidu Tongji: Allow users to choose to report usage info to Baidu Tongji.
Bug Fixes
- Results Page Pagination: Fixes so the pagination of results page is working correctly. #45
- Schedule Tasks Duplicated Triggers: Set Flask DEBUG as False so that schedule tasks won't trigger twice. #32
- Frontend Environment: Added
VUE_APP_BASE_URLas production mode environment variable so the API call won't be alwayslocalhostin deployed env #30
0.2.1 (2019-05-27)
- Configurable Spider: Allow users to create a spider to crawl data without coding.
0.2 (2019-05-10)
- Advanced Stats: Advanced analytics in spider detail view.
- Sites Data: Added sites list (China) for users to check info such as robots.txt and home page response time/code.
0.1.1 (2019-04-23)
- Basic Stats: User can view basic stats such as number of failed tasks and number of results in spiders and tasks pages.
- Near Realtime Task Info: Periodically (5 sec) polling data from server to allow view task info in a near-realtime fashion.
- Scheduled Tasks: Allow users to set up cron-like scheduled/periodical tasks using apscheduler.
0.1 (2019-04-17)
- Initial Release