Age | Commit message | Author | Files | Lines | |
---|---|---|---|---|---|
2019-06-18 | Re-inject behavior scripts on site reload | Lars-Dominik Braun | 7 | -52/+114 | |
Fixes #13. Event handler’s push() is async now. | |||||
2019-06-18 | Fix idle state tracking race condition | Lars-Dominik Braun | 4 | -93/+121 | |
Closes #16. Expose SiteLoader’s page idle changes through events and move state tracking into controller event handler. Relies on tracking time instead of asyncio event, which is more reliable. | |||||
2019-06-17 | devtools: Fix testcase | Lars-Dominik Braun | 1 | -3/+18 | |
The body is only available after receiving the loadingFinished event. | |||||
2019-06-17 | html: Fix CDATA walking | Lars-Dominik Braun | 2 | -5/+42 | |
Missing “from” keyword, returned generator instead of dicts. Properly recreate CDATA elements now. | |||||
2019-06-17 | cli: Log exit status | Lars-Dominik Braun | 1 | -0/+1 | |
2019-05-30 | controller: Fix -recursive stats | Lars-Dominik Braun | 1 | -2/+5 | |
The stats previously included running jobs. Remove them. | |||||
2019-05-30 | controller: Correctly re-raise exceptions | Lars-Dominik Braun | 1 | -1/+2 | |
asyncio.gather returns the task’s results or exception, not task objects. Probably a copy&paste error. | |||||
2019-05-30 | controller: Fix DepthLimit | Lars-Dominik Braun | 2 | -12/+45 | |
The policy itself must be stateless, since there can be multiple ExtractLinks events (which would cause DepthLimit to reduce its depth every time). | |||||
2019-05-26 | behavior: Add clicking for vimeo.com | Lars-Dominik Braun | 1 | -0/+11 | |
2019-05-22 | behavior: Extract links from plain-text documents | Lars-Dominik Braun | 1 | -0/+13 | |
2019-05-13 | devtools: Try to delete temp Chrome data dir – hard | Lars-Dominik Braun | 1 | -1/+11 | |
Fixes #17. | |||||
2019-05-12 | behavior: Ignore invalid URLs when extracting links | Lars-Dominik Braun | 2 | -2/+18 | |
Fixes #18. | |||||
2019-05-05 | irc: Switch job IDs to proquints | Lars-Dominik Braun | 1 | -4/+41 | |
They’re easier to read and remember for humans. Plus we don’t really need 128 bits of randomness. Time-based IDs are fine here. | |||||
2019-05-05 | irc: Add job info to warcinfo record | Lars-Dominik Braun | 2 | -6/+22 | |
2019-05-05 | cli: Allow adding extra data to warcinfo record | Lars-Dominik Braun | 2 | -4/+12 | |
2019-05-04 | behavior: Add clicking for imgur.com | Lars-Dominik Braun | 1 | -0/+12 | |
2019-05-02 | behavior: Load more content on steamcommunity.com | Lars-Dominik Braun | 1 | -1/+7 | |
2019-03-22 | Move documentation to Sphinx | Lars-Dominik Braun | 1 | -0/+44 | |
2019-03-22 | behavior: Test DomSnapshot | Lars-Dominik Braun | 1 | -1/+27 | |
2019-03-21 | behavior: Test Screenshot | Lars-Dominik Braun | 2 | -16/+61 | |
2019-03-21 | behavior: Test crash | Lars-Dominik Braun | 1 | -13/+36 | |
2019-03-20 | behavior: Fix Reddit selectors | Lars-Dominik Braun | 1 | -3/+11 | |
2019-03-16 | browser: Raise exception if navigation failed | Lars-Dominik Braun | 3 | -8/+12 | |
Stop early if there’s nothing to do. | |||||
2019-03-16 | Add more debug messages | Lars-Dominik Braun | 3 | -2/+23 | |
…to controller and behavior | |||||
2019-03-16 | browser: Use different UUID for loadingFinished/Failed | Lars-Dominik Braun | 1 | -1/+1 | |
2019-03-08 | Use yaml.safe_load_all | Lars-Dominik Braun | 2 | -2/+2 | |
load_all is deprecated. A safe YAML subset is fine for our purpose. See https://msg.pyyaml.org/load | |||||
2019-03-08 | behavior: Add “more replies” selector for YouTube | Lars-Dominik Braun | 1 | -0/+4 | |
2019-03-08 | behavior: Fix selectors | Lars-Dominik Braun | 1 | -7/+5 | |
Fix Facebook/Patreon selectors and Instagram example URL. | |||||
2019-03-08 | irc: Add config option need_voice | Lars-Dominik Braun | 3 | -26/+53 | |
Do not hardcode the privilege required to use the bot; make it configurable. | |||||
2019-03-06 | irc: Remove unused args for on* | Lars-Dominik Braun | 1 | -3/+3 | |
onMode will not always receive the nick and user arguments (e.g. when the server sets the mode). Remove them, since they are unused. | |||||
2019-03-05 | irc: Fix NAMES reply handling | Lars-Dominik Braun | 1 | -1/+6 | |
The user list may be sent using multiple reply messages if it is too long. Do not overwrite the previous one. | |||||
2019-03-05 | Replace mutable default arguments | Lars-Dominik Braun | 2 | -9/+9 | |
This fixes IRC permission checks. Previously all users who joined the channel after the bot stored their modes in the same set(). Can be detected with pylint W0102. | |||||
2019-02-02 | irc: Fail if bot command is empty | Lars-Dominik Braun | 1 | -1/+1 | |
2019-02-02 | irc: Retry if reconnect fails | Lars-Dominik Braun | 1 | -4/+8 | |
2019-01-27 | Support manhole debugging | Lars-Dominik Braun | 1 | -0/+5 | |
Add optional support for manhole to all cli tools. Activated by signal USR1. | |||||
2019-01-27 | irc: Add URL blacklist | Lars-Dominik Braun | 2 | -3/+17 | |
2019-01-27 | irc: Switch configuration to JSON | Lars-Dominik Braun | 1 | -12/+12 | |
2019-01-27 | recursive: Avoid deadlock if unknown exception occurs | Lars-Dominik Braun | 1 | -0/+9 | |
Kill the subprocess and make sure we retrieve exceptions from .fetch() | |||||
2019-01-27 | Increase subprocess’ StreamReader limits | Lars-Dominik Braun | 2 | -2/+2 | |
We’re sending quite big JSON objects since 3a2fcc69a8eb4237b2862b3e291971d38748f115. | |||||
2019-01-26 | controller: Make sure idleTimeout is always applied | Lars-Dominik Braun | 1 | -1/+3 | |
If the browser goes idle before we enter `while True` we never notice and thus the idleTimeout is never applied. | |||||
2019-01-26 | irc: Fix format string | Lars-Dominik Braun | 1 | -6/+6 | |
2019-01-10 | browser: Use hypothesis’ domains() | Lars-Dominik Braun | 1 | -5/+2 | |
Fixes test. | |||||
2019-01-07 | controller: Test timeouts | Lars-Dominik Braun | 1 | -0/+106 | |
Lots of copy&pasta. Unfortunately the controller uses asyncio.sleep in a few places. | |||||
2019-01-07 | Log Chrome’s responses to WARC by default | Lars-Dominik Braun | 5 | -19/+32 | |
We may not be able to reproduce every failure, so logging as much as possible is important to figure out what went wrong. Also, in case a bug is uncovered in the future, we can check the logs and possibly fix it with -errata. | |||||
2019-01-05 | browser: Do not overwrite request data when prefetching | Lars-Dominik Braun | 1 | -2/+0 | |
Needs a testcase. | |||||
2019-01-05 | html: Handle CDATA | Lars-Dominik Braun | 1 | -1/+5 | |
When loading XML documents Chrome presents a pretty-printed version to the user, which still contains the original XML when exporting via DOM.getDocument. Not sure how to test this. | |||||
2019-01-05 | controller: Fix PrefixLimit | Lars-Dominik Braun | 1 | -1/+1 | |
Probably broken by the transition to URL() in commit 5e444dd6511d97308a84ae9c86ebf14547d01f01. And yes, we desperately need some tests for this. | |||||
2019-01-04 | behavior: Ignore onstop() failure | Lars-Dominik Braun | 1 | -4/+14 | |
Fails if the page is reloaded/redirected. See issue #13. | |||||
2019-01-04 | logger: Do not log debug by default | Lars-Dominik Braun | 1 | -1/+1 | |
Must’ve slipped through. | |||||
2019-01-04 | coverage: Ignore a few unreachable statements | Lars-Dominik Braun | 2 | -7/+7 | |
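
The 2019-03-05 entry “Replace mutable default arguments” describes users sharing one set() of modes. The following is a minimal Python sketch of that general pitfall and its usual fix, not the repository’s actual code; the function names `join_shared` and `join_fresh` are hypothetical and chosen only for illustration.

```python
def join_shared(nick, modes=set()):
    # Pitfall: the default set is created once, when the function is
    # defined, and is shared by every call that omits `modes`.
    modes.add(nick)
    return modes


def join_fresh(nick, modes=None):
    # Usual fix: use None as a sentinel and create a new set per call.
    if modes is None:
        modes = set()
    modes.add(nick)
    return modes


# Both calls return the very same set object, so entries accumulate.
shared_a = join_shared("alice")
shared_b = join_shared("bob")

# Each call gets its own set.
fresh_a = join_fresh("alice")
fresh_b = join_fresh("bob")
```

Here `shared_a` and `shared_b` refer to one object containing both nicks, while `fresh_a` and `fresh_b` stay independent. This is the pattern pylint reports as W0102 (dangerous-default-value), as the commit message mentions.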