path: root/crocoite/controller.py
Age         Commit message                                       Author              Files  Lines
2018-09-25  Prevent recursing into arbitrary schemes             Lars-Dominik Braun  1      -1/+9
2018-09-25  Parallelize recursive grabs                          Lars-Dominik Braun  1      -4/+14
2018-09-25  Add recursive controller                             Lars-Dominik Braun  1      -1/+129
2018-09-25  Log extracted links                                  Lars-Dominik Braun  1      -0/+23
2018-08-21  Remove celery and recursion                          Lars-Dominik Braun  1      -118/+3
2018-08-04  Add package information to warcinfo                  Lars-Dominik Braun  1      -6/+16
2018-08-04  Reintroduce WARC logging                             Lars-Dominik Braun  1      -23/+33
2018-06-25  warc: Save DOM-/image screenshot as WARC conversion  Lars-Dominik Braun  1      -7/+1
2018-06-20  Add __slots__ to classes                             Lars-Dominik Braun  1      -0/+22
2018-06-20  Synchronous SiteLoader event handling                Lars-Dominik Braun  1      -99/+161
2018-05-05  Extract only visible and clickable links             Lars-Dominik Braun  1      -1/+1
2018-05-04  Add distributed recursive crawls                     Lars-Dominik Braun  1      -5/+17
2018-05-04  Add support for recursive crawls                     Lars-Dominik Braun  1      -0/+100
2018-05-04  behavior: Add link extraction script                 Lars-Dominik Braun  1      -1/+11
2018-05-04  Move page archiving logic to SinglePageController    Lars-Dominik Braun  1      -0/+103