Diffstat (limited to 'lulua/data/report/index.html')
-rw-r--r-- lulua/data/report/index.html | 8
1 files changed, 7 insertions, 1 deletions
diff --git a/lulua/data/report/index.html b/lulua/data/report/index.html
index 96725b7..0e4c779 100644
--- a/lulua/data/report/index.html
+++ b/lulua/data/report/index.html
@@ -230,7 +230,13 @@
From several runs with 100.000 iterations each the layout which had
good scores and looked reasonable to the human eye was picked.
<!-- -->
- Optimal arrengement of layers two and up are still under investigation.
+ Afterwards the second layer was optimized using the same process, but
+ only using data from the Hindawi corpus, because it is the only one
+ with at least some fully diacriticised texts.
+ <!-- -->
+ Finally the different brackets were arranged by hand and the remaining
+ symbols algorithmically distributed on the third layer using the raw
+ Wikitext from the Arabic Wikipedia dataset.
</p>
<p>