Age | Commit message (Collapse) | Author | Files | Lines | |
---|---|---|---|---|---|
2018-05-05 | Rename command line tools | Lars-Dominik Braun | 2 | -124/+0 | |
Move contrib/ scripts to .tools and add entry points to setup.py, rename crocoite-standalone to crocoite-grab. | |||||
2018-05-05 | contrib: Add WARC merging script | Lars-Dominik Braun | 1 | -0/+70 | |
Very useful for distributed, recursive crawls which create one WARC per page. | |||||
2018-05-04 | sopel: Use recursive, distributed controller | Lars-Dominik Braun | 1 | -2/+7 | |
2018-05-04 | IRC plugin: Use argparse | Lars-Dominik Braun | 1 | -17/+33 | |
2018-05-04 | Move page archiving logic to SinglePageController | Lars-Dominik Braun | 1 | -15/+12 | |
In preparation for recursive crawls. | |||||
2018-04-20 | Add screenshot extraction script to contrib/ | Lars-Dominik Braun | 1 | -0/+54 | |
2018-02-22 | irc plugin: Serialize celery operations | Lars-Dominik Braun | 1 | -68/+105 | |
This is a workaround for https://github.com/celery/celery/issues/4480 | |||||
2017-12-25 | Increase default body size | Lars-Dominik Braun | 1 | -4/+4 | |
2017-12-24 | Refactor behavior scripts | Lars-Dominik Braun | 1 | -10/+7 | |
No functional changes, just cleanup. Replaces onload and onsnapshot events. Move screen metric emulation, DOM snapshots and screenshots here as well. | |||||
2017-12-22 | Add simple stats-keeping SiteLoader | Lars-Dominik Braun | 1 | -1/+14 | |
2017-12-19 | Select default behavior scripts by site URL | Lars-Dominik Braun | 1 | -2/+24 | |
2017-12-17 | Add distributed archiving | Lars-Dominik Braun | 1 | -0/+144 | |
Using celery. Also adds a plugin for the IRC bot sopel. Code still needs some love, but it should work. |