summaryrefslogtreecommitdiff
path: root/crocoite/html.py
AgeCommit message (Collapse)AuthorFilesLines
2019-06-17html: Fix CDATA walkingLars-Dominik Braun1-5/+6
Missing “from” keyword, returned generator instead of dicts. Properly recreate CDATA elements now.
2019-01-05html: Handle CDATALars-Dominik Braun1-1/+5
When loading XML documents Chrome presents a pretty-printed version to the user, which still contains the original XML when exporting via DOM.getDocument. Not sure how to test this.
2018-11-19Coding styleLars-Dominik Braun1-4/+4
Fix a few random issues pointed out by pylint, mainly unused imports.
2018-06-21Fix a few issues pointed out by pylintLars-Dominik Braun1-2/+0
2017-11-29RefactoringLars-Dominik Braun1-0/+107
Reusable browser communication and WARC writing.
2017-11-26DOM snapshot: Generate valid HTML5Lars-Dominik Braun1-2/+19
Some tags are “void”, i.e. cannot contain contents and don’t have a closing tag.
2017-11-25Strip on* HTML attributesLars-Dominik Braun1-0/+84
They can carry JavaScript as well and should not be allowed for DOM snapshots.