Age | Commit message | Author | Files | Lines | |
---|---|---|---|---|---|
2021-10-27 | corpus: Add more books from Hindawi. | Lars-Dominik Braun | 1 | -1/+1 | |
2021-10-27 | corpus: quran: Fix description. | Lars-Dominik Braun | 1 | -1/+1 | |
2020-08-22 | corpus: Update hindawi | Lars-Dominik Braun | 1 | -2/+2 | |
| Add recent additions | | | | |
2020-05-10 | report: Add translated source table, asymmetry definition | Lars-Dominik Braun | 8 | -15/+59 | |
| Also fix the layout break point. | | | | |
2019-11-30 | Add missing corpus metadata files | Lars-Dominik Braun | 7 | -0/+42 | |
2019-11-16 | Add OpenStreetMap label corpus | Lars-Dominik Braun | 1 | -0/+5 | |
| Extract node labels (name:ar) from OpenStreetMap’s planet dump. Heavily leans towards a few common words (“street”, obviously), but we should be fine since the corpus is not that large. | | | | |
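
The extraction described in the last commit could look roughly like the sketch below. It is not the code from that commit: it assumes the planet dump is read in its OSM XML form, and the function name `iter_arabic_node_labels` and the inline `SAMPLE` document are made up for illustration. Only the Python standard library is used.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def iter_arabic_node_labels(source):
    """Yield the name:ar value of every <node> in an OSM XML stream.

    Hypothetical helper: ways and relations are skipped on purpose,
    because the commit message only mentions node labels.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == "node":
            for tag in elem.iter("tag"):
                if tag.get("k") == "name:ar" and tag.get("v"):
                    yield tag.get("v")
            # Drop the node's children and attributes once it is processed.
            elem.clear()


# Tiny stand-in for a planet dump (made-up data, not real OSM content).
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<osm version="0.6">
  <node id="1" lat="0" lon="0"><tag k="name:ar" v="شارع النيل"/></node>
  <node id="2" lat="0" lon="0"><tag k="name" v="Main Street"/></node>
  <node id="3" lat="0" lon="0"><tag k="name:ar" v="شارع الملك"/></node>
  <way id="4"><tag k="name:ar" v="طريق"/></way>
</osm>
""".encode("utf-8")

labels = list(iter_arabic_node_labels(io.BytesIO(SAMPLE)))

# Counting words shows the skew towards common terms such as "شارع" (street).
word_counts = Counter(word for label in labels for word in label.split())
```

For a real dump, the `io.BytesIO` object would be replaced by an open file handle (for example one returned by `bz2.open` on a compressed XML dump). A PBF dump would need a dedicated reader instead.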