path: root/lulua/text.py
Age         Commit message                            Author              Files  Lines
2019-11-17  Add more tests                            Lars-Dominik Braun  1      -17/+19
2019-11-16  Add OpenStreetMap label corpus            Lars-Dominik Braun  1      -0/+5
            Extract node labels (name:ar) from OpenStreetMap’s planet dump. Heavily leans towards a few common words (“street”, obviously), but we should be fine since the corpus is not that large.
2019-11-08  Add OpenSubtitles corpus                  Lars-Dominik Braun  1      -0/+18
            See issue #5.
2019-11-06  text: Add TEI.2 parser                    Lars-Dominik Braun  1      -1/+27
2019-10-03  text: Add epub reader and hindawi corpus  Lars-Dominik Braun  1      -21/+50
            See issue #5.
2019-10-03  text: Fail if workers die                 Lars-Dominik Braun  1      -29/+39
2019-09-17  Initial import                            Lars-Dominik Braun  1      -0/+260