| Age | Commit message | Author | Files | Lines | |
|---|---|---|---|---|---|
| 2019-06-17 | cli: Log exit status | Lars-Dominik Braun | 1 | -0/+1 | |
| 2019-05-30 | controller: Fix -recursive stats | Lars-Dominik Braun | 1 | -2/+5 | |
| The stats previously included running jobs. Remove them. | |||||
| 2019-05-30 | controller: Correctly re-raise exceptions | Lars-Dominik Braun | 1 | -1/+2 | |
| asyncio.gather returns the task’s results or exception, not task objects. Probably a copy&paste error. | |||||
| 2019-05-30 | controller: Fix DepthLimit | Lars-Dominik Braun | 2 | -12/+45 | |
| The policy itself must be stateless, since there can be multiple ExtractLinks events (which would cause DepthLimit to reduce its depth every time). | |||||
| 2019-05-26 | behavior: Add clicking for vimeo.com | Lars-Dominik Braun | 1 | -0/+11 | |
| 2019-05-22 | behavior: Extract links from plain-text documents | Lars-Dominik Braun | 1 | -0/+13 | |
| 2019-05-13 | devtools: Try to delete temp Chrome data dir – hard | Lars-Dominik Braun | 1 | -1/+11 | |
| Fixes #17. | |||||
| 2019-05-12 | behavior: Ignore invalid URLs when extracting links | Lars-Dominik Braun | 2 | -2/+18 | |
| Fixes #18. | |||||
| 2019-05-05 | irc: Switch job IDs to proquints | Lars-Dominik Braun | 1 | -4/+41 | |
| They’re easier for humans to read and remember. Plus we don’t really need 128 bits of randomness. Time-based IDs are fine here. | |||||
| 2019-05-05 | irc: Add job info to warcinfo record | Lars-Dominik Braun | 2 | -6/+22 | |
| 2019-05-05 | cli: Allow adding extra data to warcinfo record | Lars-Dominik Braun | 2 | -4/+12 | |
| 2019-05-04 | behavior: Add clicking for imgur.com | Lars-Dominik Braun | 1 | -0/+12 | |
| 2019-05-02 | behavior: Load more content on steamcommunity.com | Lars-Dominik Braun | 1 | -1/+7 | |
| 2019-03-22 | Move documentation to Sphinx | Lars-Dominik Braun | 1 | -0/+44 | |
| 2019-03-22 | behavior: Test DomSnapshot | Lars-Dominik Braun | 1 | -1/+27 | |
| 2019-03-21 | behavior: Test Screenshot | Lars-Dominik Braun | 2 | -16/+61 | |
| 2019-03-21 | behavior: Test crash | Lars-Dominik Braun | 1 | -13/+36 | |
| 2019-03-20 | behavior: Fix Reddit selectors | Lars-Dominik Braun | 1 | -3/+11 | |
| 2019-03-16 | browser: Raise exception if navigation failed | Lars-Dominik Braun | 3 | -8/+12 | |
| Stop early if there’s nothing to do. | |||||
| 2019-03-16 | Add more debug messages | Lars-Dominik Braun | 3 | -2/+23 | |
| …to controller and behavior | |||||
| 2019-03-16 | browser: Use different UUID for loadingFinished/Failed | Lars-Dominik Braun | 1 | -1/+1 | |
| 2019-03-08 | Use yaml.safe_load_all | Lars-Dominik Braun | 2 | -2/+2 | |
| load_all is deprecated. A safe YAML subset is fine for our purpose. See https://msg.pyyaml.org/load | |||||
| 2019-03-08 | behavior: Add “more replies” selector for YouTube | Lars-Dominik Braun | 1 | -0/+4 | |
| 2019-03-08 | behavior: Fix selectors | Lars-Dominik Braun | 1 | -7/+5 | |
| Fix Facebook/Patreon selectors and Instagram example URL. | |||||
| 2019-03-08 | irc: Add config option need_voice | Lars-Dominik Braun | 3 | -26/+53 | |
| Do not hardcode the privilege required to use the bot; make it configurable. | |||||
| 2019-03-06 | irc: Remove unused args for on* | Lars-Dominik Braun | 1 | -3/+3 | |
| onMode will not always receive the nick and user arguments (e.g. when the server sets the mode). Remove them, since they are unused. | |||||
| 2019-03-05 | irc: Fix NAMES reply handling | Lars-Dominik Braun | 1 | -1/+6 | |
| The user list may be sent using multiple reply messages if it is too long. Do not overwrite the previous one. | |||||
| 2019-03-05 | Replace mutable default arguments | Lars-Dominik Braun | 2 | -9/+9 | |
| This fixes IRC permission checks. Previously all users who joined the channel after the bot stored their modes in the same set(). Can be detected with pylint W0102. | |||||
| 2019-02-02 | irc: Fail if bot command is empty | Lars-Dominik Braun | 1 | -1/+1 | |
| 2019-02-02 | irc: Retry if reconnect fails | Lars-Dominik Braun | 1 | -4/+8 | |
| 2019-01-27 | Support manhole debugging | Lars-Dominik Braun | 1 | -0/+5 | |
| Add optional support for manhole to all cli tools. Activated by signal USR1. | |||||
| 2019-01-27 | irc: Add URL blacklist | Lars-Dominik Braun | 2 | -3/+17 | |
| 2019-01-27 | irc: Switch configuration to JSON | Lars-Dominik Braun | 1 | -12/+12 | |
| 2019-01-27 | recursive: Avoid deadlock if unknown exception occurs | Lars-Dominik Braun | 1 | -0/+9 | |
| Kill the subprocess and make sure we retrieve exceptions from .fetch() | |||||
| 2019-01-27 | Increase subprocess’ StreamReader limits | Lars-Dominik Braun | 2 | -2/+2 | |
| We’re sending quite big JSON objects since 3a2fcc69a8eb4237b2862b3e291971d38748f115. | |||||
| 2019-01-26 | controller: Make sure idleTimeout is always applied | Lars-Dominik Braun | 1 | -1/+3 | |
| If the browser goes idle before we enter `while True` we never notice and thus the idleTimeout is never applied. | |||||
| 2019-01-26 | irc: Fix format string | Lars-Dominik Braun | 1 | -6/+6 | |
| 2019-01-10 | browser: Use hypothesis’ domains() | Lars-Dominik Braun | 1 | -5/+2 | |
| Fixes test. | |||||
| 2019-01-07 | controller: Test timeouts | Lars-Dominik Braun | 1 | -0/+106 | |
| Lots of copy&pasta. Unfortunately the controller uses asyncio.sleep in a few places. | |||||
| 2019-01-07 | Log Chrome’s responses to WARC by default | Lars-Dominik Braun | 5 | -19/+32 | |
| We may not be able to reproduce every failure, so logging as much as possible is important to figure out what went wrong. Also, in case a bug is uncovered in the future, we can check the logs and possibly fix it with -errata. | |||||
| 2019-01-05 | browser: Do not overwrite request data when prefetching | Lars-Dominik Braun | 1 | -2/+0 | |
| Needs a testcase. | |||||
| 2019-01-05 | html: Handle CDATA | Lars-Dominik Braun | 1 | -1/+5 | |
| When loading XML documents Chrome presents a pretty-printed version to the user, which still contains the original XML when exporting via DOM.getDocument. Not sure how to test this. | |||||
| 2019-01-05 | controller: Fix PrefixLimit | Lars-Dominik Braun | 1 | -1/+1 | |
| Probably broken by the transition to URL() in commit 5e444dd6511d97308a84ae9c86ebf14547d01f01. And yes, we desperately need some tests for this. | |||||
| 2019-01-04 | behavior: Ignore onstop() failure | Lars-Dominik Braun | 1 | -4/+14 | |
| Fails if the page is reloaded/redirected. See issue #13. | |||||
| 2019-01-04 | logger: Do not log debug by default | Lars-Dominik Braun | 1 | -1/+1 | |
| Must’ve slipped through. | |||||
| 2019-01-04 | coverage: Ignore a few unreachable statements | Lars-Dominik Braun | 2 | -7/+7 | |
| 2019-01-04 | behavior: Support clicking area and add testcase | Lars-Dominik Braun | 2 | -7/+76 | |
| 2019-01-03 | browser: Turn Item into RequestResponsePair | Lars-Dominik Braun | 6 | -485/+627 | |
| Previously Item was just a simple wrapper around Chrome’s Network.* events. This turned out to be quite nasty when testing, so its replacement, RequestResponsePair, adds some level of abstraction. This makes testing a lot easier, since we can now simply instantiate it without building a proper DevTools event. Should come without any functional changes. | |||||
| 2018-12-31 | extract-screenshot: Remove URL from filename | Lars-Dominik Braun | 1 | -8/+19 | |
| URLs can get quite long, overflowing the file name length limit. Instead use sequential filenames and output metadata to stdout. | |||||
| 2018-12-25 | warc: Add tests | Lars-Dominik Braun | 4 | -17/+280 | |
| Using hypothesis-based testcase generation. This is quite nice compared to manual test data generation, since it catches a lot more corner cases (if done right). This commit also fixes a few issues: log records are only written if the log is nonempty; packageUrl paths are properly quoted; old thread checking code is dropped; a placeholder URL is used for scripts without a name. | |||||
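
The 2019-03-05 commit "Replace mutable default arguments" describes users ending up with their modes in the same set(). A minimal sketch of that class of bug follows. The class names `SharedUser` and `User` and the mode letter `"v"` are hypothetical and not taken from the repository. They only illustrate what pylint W0102 warns about.

```python
class SharedUser:
    """Buggy variant: the default set is created once, at definition time."""

    def __init__(self, name, modes=set()):  # flagged by pylint W0102
        self.name = name
        self.modes = modes


class User:
    """Fixed variant: every instance gets its own set unless one is passed."""

    def __init__(self, name, modes=None):
        self.name = name
        self.modes = set() if modes is None else modes


# With the buggy class, granting a mode to one user leaks to the other.
alice, bob = SharedUser("alice"), SharedUser("bob")
alice.modes.add("v")
leaked = "v" in bob.modes

# With the fixed class, the sets are independent.
carol, dave = User("carol"), User("dave")
carol.modes.add("v")
isolated = "v" not in dave.modes
```

Using `None` as a sentinel and building the set inside `__init__` is the usual way to avoid sharing state between instances.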

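The 2019-05-30 commit "controller: Correctly re-raise exceptions" notes that asyncio.gather returns results or exceptions rather than task objects. The sketch below shows that behaviour in isolation. It assumes `return_exceptions=True` is passed, which the commit message does not state, and the coroutine and helper names are made up for illustration.

```python
import asyncio


async def succeed():
    return 42


async def fail():
    raise ValueError("failed")


async def run_all():
    # With return_exceptions=True, gather() yields plain results and
    # exception instances in call order, never Task objects.
    return await asyncio.gather(succeed(), fail(), return_exceptions=True)


def reraise_first(results):
    """Re-raise the first exception found among gathered results."""
    for item in results:
        if isinstance(item, BaseException):
            raise item


results = asyncio.run(run_all())
```

Calling `reraise_first(results)` raises the `ValueError` directly. Calling a task method such as `.result()` on an entry would fail, since the entries are not tasks.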