Age | Commit message | Author | Files | Lines | |
---|---|---|---|---|---|
2019-07-04 | devtools: Prefix temp directories | Lars-Dominik Braun | 1 | -1/+1 | |
2019-07-04 | Rename cli utils | Lars-Dominik Braun | 3 | -90/+102 | |
crocoite-recursive is now just crocoite; crocoite-grab is no longer user-facing and is now called crocoite-single. In preparation for the 1.0 release. | |||||
2019-07-03 | irc: Do not respond when not addressed directly | Lars-Dominik Braun | 1 | -1/+1 | |
This fixes annoying messages when using the bot’s nick as the first word of a message, e.g. “chromebot can do that”. | |||||
2019-07-02 | behavior: Add missing uuid’s to logging call | Lars-Dominik Braun | 1 | -2/+5 | |
2019-07-02 | Fix exit status logging | Lars-Dominik Braun | 1 | -1/+1 | |
Fixes commit 158f55eb7fb24fa26727a008ad44964390171060. Logger works only if WARC is still open. | |||||
2019-07-02 | Stabilize WARC headers | Lars-Dominik Braun | 6 | -46/+73 | |
In preparation for the 1.0 release: correct MIME types; add X-Crocoite-Type, so logs, scripts, DOM snapshots and screenshots can be identified easily; remove random WARC headers like X-Chrome-Initiator, which we don’t want to maintain; remove non-standard urn-based package URLs, which can’t be used without a URN registration. | |||||
2019-06-28 | tools: Add missing \n to JSON output | Lars-Dominik Braun | 1 | -0/+1 | |
Fixes 76811bd3f0b3fc8688939e31fdab2c71c89cc75b | |||||
2019-06-27 | extract-screenshot: Allow extracting only the first screenshot | Lars-Dominik Braun | 1 | -1/+6 | |
2019-06-27 | merge: Dump machine-readable info | Lars-Dominik Braun | 1 | -2/+18 | |
2019-06-26 | Allow turning off cert validation | Lars-Dominik Braun | 3 | -11/+37 | |
Add an --insecure switch (shamelessly stolen from curl) to both -grab and -irc. | |||||
2019-06-26 | behavior: screenshot: Extend viewport for fixed elements | Lars-Dominik Braun | 2 | -11/+57 | |
Fixes #14, but needs a test case. | |||||
2019-06-18 | behavior: Fix screenshots | Lars-Dominik Braun | 1 | -4/+16 | |
Chrome’s behavior with respect to screenshots changed in some version, so artificially extending the viewport via device metrics is now required. | |||||
2019-06-18 | Re-inject behavior scripts on site reload | Lars-Dominik Braun | 7 | -52/+114 | |
Fixes #13. Event handler’s push() is async now. | |||||
2019-06-18 | Fix idle state tracking race condition | Lars-Dominik Braun | 4 | -93/+121 | |
Closes #16. Expose SiteLoader’s page idle changes through events and move state tracking into the controller’s event handler. Relies on tracking time instead of an asyncio event, which is more reliable. | |||||
2019-06-17 | devtools: Fix testcase | Lars-Dominik Braun | 1 | -3/+18 | |
The body is only available after receiving the loadingFinished event. | |||||
2019-06-17 | html: Fix CDATA walking | Lars-Dominik Braun | 2 | -5/+42 | |
Missing “from” keyword, returned generator instead of dicts. Properly recreate CDATA elements now. | |||||
2019-06-17 | cli: Log exit status | Lars-Dominik Braun | 1 | -0/+1 | |
2019-05-30 | controller: Fix -recursive stats | Lars-Dominik Braun | 1 | -2/+5 | |
The stats previously included running jobs. Remove them. | |||||
2019-05-30 | controller: Correctly re-raise exceptions | Lars-Dominik Braun | 1 | -1/+2 | |
asyncio.gather returns the tasks’ results or exceptions, not task objects. Probably a copy&paste error. | |||||
2019-05-30 | controller: Fix DepthLimit | Lars-Dominik Braun | 2 | -12/+45 | |
The policy itself must be stateless, since there can be multiple ExtractLinks events (which would cause DepthLimit to reduce its depth every time). | |||||
2019-05-26 | behavior: Add clicking for vimeo.com | Lars-Dominik Braun | 1 | -0/+11 | |
2019-05-22 | behavior: Extract links from plain-text documents | Lars-Dominik Braun | 1 | -0/+13 | |
2019-05-13 | devtools: Try to delete temp Chrome data dir – hard | Lars-Dominik Braun | 1 | -1/+11 | |
Fixes #17. | |||||
2019-05-12 | behavior: Ignore invalid URLs when extracting links | Lars-Dominik Braun | 2 | -2/+18 | |
Fixes #18. | |||||
2019-05-05 | irc: Switch job id’s to proquints | Lars-Dominik Braun | 1 | -4/+41 | |
They’re easier to read and remember for humans. Plus we don’t really need 128 bits of randomness. Time-based id’s are fine here. | |||||
2019-05-05 | irc: Add job info to warcinfo record | Lars-Dominik Braun | 2 | -6/+22 | |
2019-05-05 | cli: Allow adding extra data to warcinfo record | Lars-Dominik Braun | 2 | -4/+12 | |
2019-05-04 | behavior: Add clicking for imgur.com | Lars-Dominik Braun | 1 | -0/+12 | |
2019-05-02 | behavior: Load more content on steamcommunity.com | Lars-Dominik Braun | 1 | -1/+7 | |
2019-03-22 | Move documentation to Sphinx | Lars-Dominik Braun | 1 | -0/+44 | |
2019-03-22 | behavior: Test DomSnapshot | Lars-Dominik Braun | 1 | -1/+27 | |
2019-03-21 | behavior: Test Screenshot | Lars-Dominik Braun | 2 | -16/+61 | |
2019-03-21 | behavior: Test crash | Lars-Dominik Braun | 1 | -13/+36 | |
2019-03-20 | behavior: Fix Reddit selectors | Lars-Dominik Braun | 1 | -3/+11 | |
2019-03-16 | browser: Raise exception if navigation failed | Lars-Dominik Braun | 3 | -8/+12 | |
Stop early if there’s nothing to do. | |||||
2019-03-16 | Add more debug messages | Lars-Dominik Braun | 3 | -2/+23 | |
…to controller and behavior | |||||
2019-03-16 | browser: Use different UUID for loadingFinished/Failed | Lars-Dominik Braun | 1 | -1/+1 | |
2019-03-08 | Use yaml.safe_load_all | Lars-Dominik Braun | 2 | -2/+2 | |
load_all is deprecated. A safe YAML subset is fine for our purpose. See https://msg.pyyaml.org/load | |||||
2019-03-08 | behavior: Add “more replies” selector for YouTube | Lars-Dominik Braun | 1 | -0/+4 | |
2019-03-08 | behavior: Fix selectors | Lars-Dominik Braun | 1 | -7/+5 | |
Fix Facebook/Patreon selectors and Instagram example URL. | |||||
2019-03-08 | irc: Add config option need_voice | Lars-Dominik Braun | 3 | -26/+53 | |
Do not hardcode the privilege required to use the bot; make it configurable. | |||||
2019-03-06 | irc: Remove unused args for on* | Lars-Dominik Braun | 1 | -3/+3 | |
onMode will not always receive the nick and user arguments (e.g. when the server sets a mode). Remove them, since they are unused. | |||||
2019-03-05 | irc: Fix NAMES reply handling | Lars-Dominik Braun | 1 | -1/+6 | |
The user list may be sent using multiple reply messages if it is too long. Do not overwrite the previous one. | |||||
2019-03-05 | Replace mutable default arguments | Lars-Dominik Braun | 2 | -9/+9 | |
This fixes IRC permission checks. Previously all users who joined the channel after the bot stored their modes in the same set(). Can be detected with pylint W0102. | |||||
2019-02-02 | irc: Fail if bot command is empty | Lars-Dominik Braun | 1 | -1/+1 | |
2019-02-02 | irc: Retry if reconnect fails | Lars-Dominik Braun | 1 | -4/+8 | |
2019-01-27 | Support manhole debugging | Lars-Dominik Braun | 1 | -0/+5 | |
Add optional support for manhole to all cli tools. Activated by signal USR1. | |||||
2019-01-27 | irc: Add URL blacklist | Lars-Dominik Braun | 2 | -3/+17 | |
2019-01-27 | irc: Switch configuration to JSON | Lars-Dominik Braun | 1 | -12/+12 | |
2019-01-27 | recursive: Avoid deadlock if unknown exception occurs | Lars-Dominik Braun | 1 | -0/+9 | |
Kill the subprocess and make sure we retrieve exceptions from .fetch() |
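
The 2019-03-05 entry “Replace mutable default arguments” describes a classic Python pitfall: a default such as `set()` is evaluated once, when the function is defined, so every call that relies on the default shares the same object. The sketch below illustrates the effect and the usual fix. The `User` and `FixedUser` classes are hypothetical stand-ins invented for this illustration; they are not taken from the crocoite source.

```python
class User:
    """Hypothetical per-user record with the bug (pylint W0102)."""

    def __init__(self, name, modes=set()):
        # The default set is created once and shared by all instances
        # that do not pass their own `modes`.
        self.name = name
        self.modes = modes


class FixedUser:
    """Same record, using None as a sentinel instead."""

    def __init__(self, name, modes=None):
        self.name = name
        # Build a fresh set per instance unless the caller supplies one.
        self.modes = set() if modes is None else modes


alice, bob = User("alice"), User("bob")
alice.modes.add("v")
# bob never received a mode, but sees alice's because the set is shared.
shared = "v" in bob.modes

fixed_alice, fixed_bob = FixedUser("alice"), FixedUser("bob")
fixed_alice.modes.add("v")
# With the sentinel, each instance owns its own set.
isolated = "v" not in fixed_bob.modes
```

Here `shared` and `isolated` both evaluate to true, which matches the symptom in the commit message: users who joined later stored their modes in the same `set()`.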