summaryrefslogtreecommitdiff
path: root/crocoite
AgeCommit message (Expand)AuthorFilesLines
2018-11-10tools: Fix WARC mergingLars-Dominik Braun2-18/+205
2018-11-08devtools: Disable websocket pings to ChromeLars-Dominik Braun2-1/+12
2018-11-06Switch single mode to asyncioLars-Dominik Braun5-175/+141
2018-11-06Switch site loader to async DevTools communicationLars-Dominik Braun2-229/+236
2018-11-06Add simple asyncio-based DevTool communicationLars-Dominik Braun2-0/+406
2018-11-03html: Add tests for tag/attribute strippingLars-Dominik Braun1-0/+38
2018-10-30recursive: Actually stop the grab when canceledLars-Dominik Braun1-1/+3
2018-10-30Reduce idle wait time after stopping pageLars-Dominik Braun1-4/+4
2018-10-30Increase default timeoutsLars-Dominik Braun1-2/+2
2018-10-23single: Set and recursive: check exit statusLars-Dominik Braun2-12/+34
2018-10-22behavior: Unload script only if the handle is validLars-Dominik Braun1-2/+4
2018-10-14irc: Add PoC dashboardLars-Dominik Braun3-16/+119
2018-10-14irc: Graceful bot shutdownLars-Dominik Braun3-16/+110
2018-10-11recursive: Gracefully shut down on SIGINT/TERMLars-Dominik Braun2-4/+18
2018-10-10Add timezone to logger datesLars-Dominik Braun1-1/+3
2018-10-03controller: Depth limit does not work with i>1Lars-Dominik Braun1-1/+3
2018-10-03irc: Fix mode parsingLars-Dominik Braun2-7/+37
2018-10-02irc: Refactoring/beautificationLars-Dominik Braun2-101/+266
2018-09-29Add documentationLars-Dominik Braun2-3/+9
2018-09-29irc: Limit number of processes spawnedLars-Dominik Braun2-21/+25
2018-09-29Add simple IRC botLars-Dominik Braun2-0/+273
2018-09-25Prevent recursing into arbitrary schemesLars-Dominik Braun1-1/+9
2018-09-25Parallelize recursive grabsLars-Dominik Braun2-5/+17
2018-09-25Add recursive controllerLars-Dominik Braun2-1/+169
2018-09-25Immediately flush loggerLars-Dominik Braun1-0/+2
2018-09-25Log extracted linksLars-Dominik Braun2-2/+25
2018-08-21Remove celery and recursionLars-Dominik Braun3-317/+23
2018-08-17behavior: Load more comments from FacebookLars-Dominik Braun1-0/+4
2018-08-05test_browser: Properly handle failed requestsLars-Dominik Braun2-15/+14
2018-08-04Properly handle failure to retrieve request bodyLars-Dominik Braun3-5/+50
2018-08-04Reference warcinfo record in every other recordLars-Dominik Braun1-18/+30
2018-08-04Add package information to warcinfoLars-Dominik Braun3-8/+65
2018-08-04Reintroduce WARC loggingLars-Dominik Braun9-76/+337
2018-06-25browser: Fix testcase race conditionLars-Dominik Braun1-0/+4
2018-06-25warc: Add metadata to truncated recordsLars-Dominik Braun1-22/+28
2018-06-25warc: Save DOM-/image screenshot as WARC conversionLars-Dominik Braun6-37/+72
2018-06-21Fix a few issues pointed out by pylintLars-Dominik Braun5-22/+10
2018-06-21browser: Add a few more testsLars-Dominik Braun1-3/+31
2018-06-20Move tests to pytestLars-Dominik Braun2-162/+177
2018-06-20Add __slots__ to classesLars-Dominik Braun5-1/+56
2018-06-20Synchronous SiteLoader event handlingLars-Dominik Braun6-509/+514
2018-06-08browser: Replace --remote-debugging-socket-fdLars-Dominik Braun1-23/+19
2018-06-03behavior: Wrap extract links script in anonymous namespaceLars-Dominik Braun2-2/+5
2018-05-20behavior: Patreon: Load more comments/repliesLars-Dominik Braun1-0/+4
2018-05-20behavior: Click Patreon’s “load more” buttonLars-Dominik Braun1-0/+6
2018-05-05Rename command line toolsLars-Dominik Braun1-0/+97
2018-05-05Extract only visible and clickable linksLars-Dominik Braun2-4/+29
2018-05-04Share recursive argument parserLars-Dominik Braun2-14/+15
2018-05-04Support --browser again for local crawlsLars-Dominik Braun2-2/+6
2018-05-04Add distributed recursive crawlsLars-Dominik Braun3-31/+91