summaryrefslogtreecommitdiff
path: root/crocoite/cli.py
AgeCommit message (Expand)AuthorFilesLines
2018-10-02irc: Refactoring/beautificationLars-Dominik Braun1-3/+6
2018-09-29Add documentationLars-Dominik Braun1-1/+6
2018-09-29irc: Limit number of processes spawnedLars-Dominik Braun1-1/+2
2018-09-29Add simple IRC botLars-Dominik Braun1-0/+19
2018-09-25Parallelize recursive grabsLars-Dominik Braun1-1/+3
2018-09-25Add recursive controllerLars-Dominik Braun1-0/+40
2018-09-25Log extracted linksLars-Dominik Braun1-2/+2
2018-08-21Remove celery and recursionLars-Dominik Braun1-53/+20
2018-08-04Reintroduce WARC loggingLars-Dominik Braun1-8/+8
2018-06-20Synchronous SiteLoader event handlingLars-Dominik Braun1-6/+13
2018-05-04Share recursive argument parserLars-Dominik Braun1-7/+13
2018-05-04Support --browser again for local crawlsLars-Dominik Braun1-1/+5
2018-05-04Add distributed recursive crawlsLars-Dominik Braun1-23/+18
2018-05-04Add support for recursive crawlsLars-Dominik Braun1-2/+15
2018-05-04behavior: Add link extraction scriptLars-Dominik Braun1-2/+3
2018-05-04Move page archiving logic to SinglePageControllerLars-Dominik Braun1-114/+21
2017-12-25Increase default body sizeLars-Dominik Braun1-3/+3
2017-12-24Refactor behavior scriptsLars-Dominik Braun1-146/+28
2017-12-22Add simple stats-keeping SiteLoaderLars-Dominik Braun1-3/+7
2017-12-20Increase hardcoded max timeoutsLars-Dominik Braun1-2/+2
2017-12-19Serialize WARC writingLars-Dominik Braun1-3/+3
2017-12-19Select default behavior scripts by site URLLars-Dominik Braun1-1/+10
2017-12-17Add distributed archivingLars-Dominik Braun1-145/+206
2017-12-06Start Chrome browser instanceLars-Dominik Braun1-44/+49
2017-12-06Add flags to disable screenshot/DOM snapshotLars-Dominik Braun1-5/+9
2017-12-03Fix UTF-8 encoding nameLars-Dominik Braun1-1/+1
2017-12-03Add page screenshot to WARCLars-Dominik Braun1-0/+14
2017-11-29argparse: Add metavarLars-Dominik Braun1-7/+7
2017-11-29RefactoringLars-Dominik Braun1-402/+50
2017-11-26DOM snapshot: Generate valid HTML5Lars-Dominik Braun1-7/+12
2017-11-25Ignore duplicate URLs when saving DOM snapshotLars-Dominik Braun1-1/+10
2017-11-25Workaround broken device metrics resetLars-Dominik Braun1-1/+3
2017-11-25Strip on* HTML attributesLars-Dominik Braun1-1/+27
2017-11-25Rename --run-before-snapshot and document --on* optionsLars-Dominik Braun1-3/+3
2017-11-24DOM snapshot: Save frames/subdocuments as wellLars-Dominik Braun1-13/+36
2017-11-24Reset device metricsLars-Dominik Braun1-2/+5
2017-11-24Save onsnapshot script to WARCLars-Dominik Braun1-4/+8
2017-11-22Make <canvas> static before DOM snapshotLars-Dominik Braun1-8/+13
2017-11-22Emulate different screen sizesLars-Dominik Braun1-0/+25
2017-11-22Add example fixups for InstagramLars-Dominik Braun1-3/+9
2017-11-21Move base64 metadata into WARC headerLars-Dominik Braun1-1/+1
2017-11-21Graceful page load timeoutLars-Dominik Braun1-9/+26
2017-11-20Add page created from DOM snapshotLars-Dominik Braun1-6/+101
2017-11-17Initial importLars-Dominik Braun1-0/+320