Age | Commit message | Author | Files | Lines |
---|---|---|---|---|
2018-05-04 | Support --browser again for local crawls | Lars-Dominik Braun | 2 | -2/+6 |
2018-05-04 | Add distributed recursive crawls | Lars-Dominik Braun | 3 | -31/+91 |
2018-05-04 | Add support for recursive crawls | Lars-Dominik Braun | 2 | -2/+115 |
2018-05-04 | browser: Replace context manager decorator | Lars-Dominik Braun | 1 | -51/+66 |
2018-05-04 | behavior: Add link extraction script | Lars-Dominik Braun | 4 | -5/+43 |
2018-05-04 | Move page archiving logic to SinglePageController | Lars-Dominik Braun | 5 | -144/+198 |
2018-05-04 | Move header unfolding into Item | Lars-Dominik Braun | 2 | -21/+24 |
2018-05-04 | Fetch request POST body | Lars-Dominik Braun | 2 | -8/+20 |
2018-05-04 | Test chained redirects | Lars-Dominik Braun | 1 | -12/+32 |
2018-04-20 | Save screenshot of entire page | Lars-Dominik Braun | 1 | -6/+16 |
2018-04-14 | Fix base64 body detection | Lars-Dominik Braun | 2 | -10/+10 |
2018-04-14 | Add timeout to request body fetch | Lars-Dominik Braun | 1 | -3/+4 |
2018-04-14 | Handle JavaScript dialogs | Lars-Dominik Braun | 1 | -2/+37 |
2018-04-04 | behavior: Add selector for YouTube. | Lars-Dominik Braun | 1 | -0/+6 |
2018-03-30 | Add click selectors for Instagram | Lars-Dominik Braun | 1 | -0/+8 |
2018-03-25 | Add a few simple tests | Lars-Dominik Braun | 1 | -0/+190 |
2018-03-25 | Replace deprecated logger.warn | Lars-Dominik Braun | 1 | -3/+3 |
2018-03-25 | ChromeService: Close listening socket | Lars-Dominik Braun | 1 | -0/+1 |
2018-03-25 | Move getResponseBody call to Item wrapper | Lars-Dominik Braun | 2 | -13/+21 |
2018-03-18 | browser: Don’t overwrite LogEntry’s args | Lars-Dominik Braun | 1 | -1/+1 |
2018-03-18 | behavior: Add click selectors for reddit | Lars-Dominik Braun | 1 | -7/+27 |
2018-03-05 | Add generic click behavior script | Lars-Dominik Braun | 3 | -37/+119 |
2018-03-04 | Remove instagram behavior script | Lars-Dominik Braun | 2 | -27/+1 |
2018-01-20 | behavior: Scroll all DOM elements | Lars-Dominik Braun | 1 | -0/+6 |
2018-01-20 | twitter: Expand “more replies” links | Lars-Dominik Braun | 1 | -8/+21 |
2017-12-27 | Log messages from browser console | Lars-Dominik Braun | 1 | -0/+12 |
2017-12-25 | Increase default body size | Lars-Dominik Braun | 3 | -5/+34 |
2017-12-24 | Refactor behavior scripts | Lars-Dominik Braun | 6 | -172/+288 |
2017-12-23 | Set fake finished response for redirects | Lars-Dominik Braun | 1 | -1/+4 |
2017-12-23 | Drain tab event queue before stopping | Lars-Dominik Braun | 1 | -0/+2 |
2017-12-22 | Add simple stats-keeping SiteLoader | Lars-Dominik Braun | 3 | -9/+46 |
2017-12-22 | SiteLoader: Save entire finished response | Lars-Dominik Braun | 1 | -2/+9 |
2017-12-22 | Don’t write WARC record if body cannot be retrieved | Lars-Dominik Braun | 1 | -19/+48 |
2017-12-20 | Increase hardcoded max timeouts | Lars-Dominik Braun | 1 | -2/+2 |
2017-12-20 | Fix HTTP headers using the same key more than once | Lars-Dominik Braun | 1 | -2/+15 |
2017-12-19 | Serialize WARC writing | Lars-Dominik Braun | 2 | -3/+38 |
2017-12-19 | Select default behavior scripts by site URL | Lars-Dominik Braun | 4 | -1/+51 |
2017-12-17 | Add Twitter fixups | Lars-Dominik Braun | 1 | -0/+17 |
2017-12-17 | Don’t fetch redirected request body | Lars-Dominik Braun | 1 | -8/+12 |
2017-12-17 | Add distributed archiving | Lars-Dominik Braun | 2 | -151/+221 |
2017-12-06 | Start Chrome browser instance | Lars-Dominik Braun | 2 | -44/+101 |
2017-12-06 | Add flags to disable screenshot/DOM snapshot | Lars-Dominik Braun | 1 | -5/+9 |
2017-12-03 | Fix UTF-8 encoding name | Lars-Dominik Braun | 1 | -1/+1 |
2017-12-03 | Add page screenshot to WARC | Lars-Dominik Braun | 1 | -0/+14 |
2017-11-29 | Add missing timestamp to response data for redirects | Lars-Dominik Braun | 1 | -1/+1 |
2017-11-29 | argparse: Add metavar | Lars-Dominik Braun | 1 | -7/+7 |
2017-11-29 | Use Chrome’s timestamps as WARC-Date | Lars-Dominik Braun | 2 | -8/+14 |
2017-11-29 | Refactoring | Lars-Dominik Braun | 5 | -403/+571 |
2017-11-26 | DOM snapshot: Generate valid HTML5 | Lars-Dominik Braun | 2 | -9/+31 |
2017-11-25 | Ignore duplicate URLs when saving DOM snapshot | Lars-Dominik Braun | 1 | -1/+10 |