Age | Commit message (Expand) | Author | Files | Lines |
---|---|---|---|---|
2018-09-25 | Parallelize recursive grabs | Lars-Dominik Braun | 1 | -1/+3 |
2018-09-25 | Add recursive controller | Lars-Dominik Braun | 1 | -0/+40 |
2018-09-25 | Log extracted links | Lars-Dominik Braun | 1 | -2/+2 |
2018-08-21 | Remove celery and recursion | Lars-Dominik Braun | 1 | -53/+20 |
2018-08-04 | Reintroduce WARC logging | Lars-Dominik Braun | 1 | -8/+8 |
2018-06-20 | Synchronous SiteLoader event handling | Lars-Dominik Braun | 1 | -6/+13 |
2018-05-04 | Share recursive argument parser | Lars-Dominik Braun | 1 | -7/+13 |
2018-05-04 | Support --browser again for local crawls | Lars-Dominik Braun | 1 | -1/+5 |
2018-05-04 | Add distributed recursive crawls | Lars-Dominik Braun | 1 | -23/+18 |
2018-05-04 | Add support for recursive crawls | Lars-Dominik Braun | 1 | -2/+15 |
2018-05-04 | behavior: Add link extraction script | Lars-Dominik Braun | 1 | -2/+3 |
2018-05-04 | Move page archiving logic to SinglePageController | Lars-Dominik Braun | 1 | -114/+21 |
2017-12-25 | Increase default body size | Lars-Dominik Braun | 1 | -3/+3 |
2017-12-24 | Refactor behavior scripts | Lars-Dominik Braun | 1 | -146/+28 |
2017-12-22 | Add simple stats-keeping SiteLoader | Lars-Dominik Braun | 1 | -3/+7 |
2017-12-20 | Increase hardcoded max timeouts | Lars-Dominik Braun | 1 | -2/+2 |
2017-12-19 | Serialize WARC writing | Lars-Dominik Braun | 1 | -3/+3 |
2017-12-19 | Select default behavior scripts by site URL | Lars-Dominik Braun | 1 | -1/+10 |
2017-12-17 | Add distributed archiving | Lars-Dominik Braun | 1 | -145/+206 |
2017-12-06 | Start Chrome browser instance | Lars-Dominik Braun | 1 | -44/+49 |
2017-12-06 | Add flags to disable screenshot/DOM snapshot | Lars-Dominik Braun | 1 | -5/+9 |
2017-12-03 | Fix UTF-8 encoding name | Lars-Dominik Braun | 1 | -1/+1 |
2017-12-03 | Add page screenshot to WARC | Lars-Dominik Braun | 1 | -0/+14 |
2017-11-29 | argparse: Add metavar | Lars-Dominik Braun | 1 | -7/+7 |
2017-11-29 | Refactoring | Lars-Dominik Braun | 1 | -402/+50 |
2017-11-26 | DOM snapshot: Generate valid HTML5 | Lars-Dominik Braun | 1 | -7/+12 |
2017-11-25 | Ignore duplicate URLs when saving DOM snapshot | Lars-Dominik Braun | 1 | -1/+10 |
2017-11-25 | Workaround broken device metrics reset | Lars-Dominik Braun | 1 | -1/+3 |
2017-11-25 | Strip on* HTML attributes | Lars-Dominik Braun | 1 | -1/+27 |
2017-11-25 | Rename --run-before-snapshot and document --on* options | Lars-Dominik Braun | 1 | -3/+3 |
2017-11-24 | DOM snapshot: Save frames/subdocuments as well | Lars-Dominik Braun | 1 | -13/+36 |
2017-11-24 | Reset device metrics | Lars-Dominik Braun | 1 | -2/+5 |
2017-11-24 | Save onsnapshot script to WARC | Lars-Dominik Braun | 1 | -4/+8 |
2017-11-22 | Make <canvas> static before DOM snapshot | Lars-Dominik Braun | 1 | -8/+13 |
2017-11-22 | Emulate different screen sizes | Lars-Dominik Braun | 1 | -0/+25 |
2017-11-22 | Add example fixups for Instagram | Lars-Dominik Braun | 1 | -3/+9 |
2017-11-21 | Move base64 metadata into WARC header | Lars-Dominik Braun | 1 | -1/+1 |
2017-11-21 | Graceful page load timeout | Lars-Dominik Braun | 1 | -9/+26 |
2017-11-20 | Add page created from DOM snapshot | Lars-Dominik Braun | 1 | -6/+101 |
2017-11-17 | Initial import | Lars-Dominik Braun | 1 | -0/+320 |