Age | Commit message | Author | Files | Lines |
---|---|---|---|---|
2018-09-25 | Prevent recursing into arbitrary schemes | Lars-Dominik Braun | 1 | -1/+9 |
2018-09-25 | Parallelize recursive grabs | Lars-Dominik Braun | 1 | -4/+14 |
2018-09-25 | Add recursive controller | Lars-Dominik Braun | 1 | -1/+129 |
2018-09-25 | Log extracted links | Lars-Dominik Braun | 1 | -0/+23 |
2018-08-21 | Remove celery and recursion | Lars-Dominik Braun | 1 | -118/+3 |
2018-08-04 | Add package information to warcinfo | Lars-Dominik Braun | 1 | -6/+16 |
2018-08-04 | Reintroduce WARC logging | Lars-Dominik Braun | 1 | -23/+33 |
2018-06-25 | warc: Save DOM-/image screenshot as WARC conversion | Lars-Dominik Braun | 1 | -7/+1 |
2018-06-20 | Add __slots__ to classes | Lars-Dominik Braun | 1 | -0/+22 |
2018-06-20 | Synchronous SiteLoader event handling | Lars-Dominik Braun | 1 | -99/+161 |
2018-05-05 | Extract only visible and clickable links | Lars-Dominik Braun | 1 | -1/+1 |
2018-05-04 | Add distributed recursive crawls | Lars-Dominik Braun | 1 | -5/+17 |
2018-05-04 | Add support for recursive crawls | Lars-Dominik Braun | 1 | -0/+100 |
2018-05-04 | behavior: Add link extraction script | Lars-Dominik Braun | 1 | -1/+11 |
2018-05-04 | Move page archiving logic to SinglePageController | Lars-Dominik Braun | 1 | -0/+103 |