| Age | Commit message | Author | Files | Lines |
|---|---|---|---|---|
| 2018-09-29 | Add simple IRC bot | Lars-Dominik Braun | 3 | -0/+275 |
| 2018-09-25 | Prevent recursing into arbitrary schemes | Lars-Dominik Braun | 1 | -1/+9 |
| 2018-09-25 | Parallelize recursive grabs | Lars-Dominik Braun | 2 | -5/+17 |
| 2018-09-25 | Add recursive controller | Lars-Dominik Braun | 3 | -1/+170 |
| 2018-09-25 | Immediately flush logger | Lars-Dominik Braun | 1 | -0/+2 |
| 2018-09-25 | Log extracted links | Lars-Dominik Braun | 2 | -2/+25 |
| 2018-08-21 | Remove celery and recursion | Lars-Dominik Braun | 6 | -609/+24 |
| 2018-08-19 | README: Add rationale | Lars-Dominik Braun | 1 | -25/+87 |
| 2018-08-17 | behavior: Load more comments from Facebook | Lars-Dominik Braun | 1 | -0/+4 |
| 2018-08-05 | test_browser: Properly handle failed requests | Lars-Dominik Braun | 2 | -15/+14 |
| 2018-08-04 | Properly handle failure to retrieve request body | Lars-Dominik Braun | 3 | -5/+50 |
| 2018-08-04 | Reference warcinfo record in every other record | Lars-Dominik Braun | 1 | -18/+30 |
| 2018-08-04 | Add package information to warcinfo | Lars-Dominik Braun | 3 | -8/+65 |
| 2018-08-04 | Reintroduce WARC logging | Lars-Dominik Braun | 9 | -76/+337 |
| 2018-06-25 | browser: Fix testcase race condition | Lars-Dominik Braun | 1 | -0/+4 |
| 2018-06-25 | warc: Add metadata to truncated records | Lars-Dominik Braun | 1 | -22/+28 |
| 2018-06-25 | warc: Save DOM-/image screenshot as WARC conversion | Lars-Dominik Braun | 7 | -39/+73 |
| 2018-06-21 | Fix travis test command | Lars-Dominik Braun | 1 | -1/+1 |
| 2018-06-21 | Fix a few issues pointed out by pylint | Lars-Dominik Braun | 5 | -22/+10 |
| 2018-06-21 | browser: Add a few more tests | Lars-Dominik Braun | 1 | -3/+31 |
| 2018-06-20 | Move tests to pytest | Lars-Dominik Braun | 6 | -163/+183 |
| 2018-06-20 | Add __slots__ to classes | Lars-Dominik Braun | 5 | -1/+56 |
| 2018-06-20 | Synchronous SiteLoader event handling | Lars-Dominik Braun | 7 | -514/+518 |
| 2018-06-08 | browser: Replace --remote-debugging-socket-fd | Lars-Dominik Braun | 1 | -23/+19 |
| 2018-06-03 | behavior: Wrap extract links script in anonymous namespace | Lars-Dominik Braun | 2 | -2/+5 |
| 2018-05-20 | behavior: Patreon: Load more comments/replies | Lars-Dominik Braun | 1 | -0/+4 |
| 2018-05-20 | behavior: Click Patreon’s “load more” button | Lars-Dominik Braun | 1 | -0/+6 |
| 2018-05-05 | Update documentation | Lars-Dominik Braun | 1 | -4/+4 |
| 2018-05-05 | Rename command line tools | Lars-Dominik Braun | 3 | -62/+37 |
| 2018-05-05 | Extract only visible and clickable links | Lars-Dominik Braun | 2 | -4/+29 |
| 2018-05-05 | contrib: Add WARC merging script | Lars-Dominik Braun | 1 | -0/+70 |
| 2018-05-04 | sopel: Use recursive, distributed controller | Lars-Dominik Braun | 1 | -2/+7 |
| 2018-05-04 | Share recursive argument parser | Lars-Dominik Braun | 2 | -14/+15 |
| 2018-05-04 | Support --browser again for local crawls | Lars-Dominik Braun | 2 | -2/+6 |
| 2018-05-04 | Add distributed recursive crawls | Lars-Dominik Braun | 3 | -31/+91 |
| 2018-05-04 | Add support for recursive crawls | Lars-Dominik Braun | 2 | -2/+115 |
| 2018-05-04 | browser: Replace context manager decorator | Lars-Dominik Braun | 1 | -51/+66 |
| 2018-05-04 | behavior: Add link extraction script | Lars-Dominik Braun | 4 | -5/+43 |
| 2018-05-04 | IRC plugin: Use argparse | Lars-Dominik Braun | 1 | -17/+33 |
| 2018-05-04 | Move page archiving logic to SinglePageController | Lars-Dominik Braun | 7 | -160/+211 |
| 2018-05-04 | Move header unfolding into Item | Lars-Dominik Braun | 2 | -21/+24 |
| 2018-05-04 | Fetch request POST body | Lars-Dominik Braun | 2 | -8/+20 |
| 2018-05-04 | Test chained redirects | Lars-Dominik Braun | 1 | -12/+32 |
| 2018-04-20 | Add screenshot extraction script to contrib/ | Lars-Dominik Braun | 1 | -0/+54 |
| 2018-04-20 | Save screenshot of entire page | Lars-Dominik Braun | 1 | -6/+16 |
| 2018-04-14 | Fix base64 body detection | Lars-Dominik Braun | 2 | -10/+10 |
| 2018-04-14 | Add timeout to request body fetch | Lars-Dominik Braun | 1 | -3/+4 |
| 2018-04-14 | Handle JavaScript dialogs | Lars-Dominik Braun | 1 | -2/+37 |
| 2018-04-04 | behavior: Add selector for YouTube. | Lars-Dominik Braun | 1 | -0/+6 |
| 2018-03-30 | Add click selectors for Instagram | Lars-Dominik Braun | 1 | -0/+8 |