summaryrefslogtreecommitdiff
path: root/corpus
AgeCommit message (Collapse)AuthorFilesLines
2021-10-27corpus: Add more books from Hindawi.Lars-Dominik Braun1-1/+1
2021-10-27corpus: quran: Fix description.Lars-Dominik Braun1-1/+1
2020-08-22corpus: Update hindawiLars-Dominik Braun1-2/+2
Add recent additions
2020-05-10report: Add translated source table, asymmetry definitionLars-Dominik Braun8-15/+59
Also fix the layout break point.
2019-11-30Add missing corpuse metadata filesLars-Dominik Braun7-0/+42
2019-11-16Add OpenStreetMap label corpusLars-Dominik Braun1-0/+5
Extract node labels (name:ar) from OpenStreetMap’s planet dump. Heavily leans towards a few common words (“street”, obviously), but we should be fine since the corpus is not that large.