Google Code-in 2010 The Apertium project

Categorise translation errors in Afrikaans to Dutch MT

completed by: AureiAnimus

mentors: Francis Tyers

Take two test sentence corpora:




Then count the translation errors in each corpus, assigning each error to exactly one of the following categories: Unknown word, Morphology, Disambiguation, Multiword, Syntactic transfer, Polysemy, Compounding, Separable verb

The total number of categorised errors must tally with the number of errors that the apertium-eval-translator script reports for Word Error Rate (the edit distance between the translation and the reference).
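
The tally check can be sketched in Python as follows. This is only an illustration under stated assumptions, not the apertium-eval-translator implementation: it assumes whitespace tokenisation and a plain word-level Levenshtein distance, so its counts may differ from the script's in edge cases. The function names `word_edit_distance` and `tally_matches` are hypothetical, and the example sentences are placeholder tokens rather than real Afrikaans or Dutch.

```python
from collections import Counter

# The eight error categories named in the task description.
CATEGORIES = [
    "Unknown word", "Morphology", "Disambiguation", "Multiword",
    "Syntactic transfer", "Polysemy", "Compounding", "Separable verb",
]


def word_edit_distance(hypothesis, reference):
    """Word-level Levenshtein distance between two sentences.

    Assumption: words are separated by whitespace; the real evaluation
    script may tokenise differently.
    """
    hyp = hypothesis.split()
    ref = reference.split()
    # previous[j] = distance between the hyp words seen so far and ref[:j]
    previous = list(range(len(ref) + 1))
    for i, hyp_word in enumerate(hyp, start=1):
        current = [i]
        for j, ref_word in enumerate(ref, start=1):
            cost = 0 if hyp_word == ref_word else 1
            current.append(min(
                previous[j] + 1,         # drop a word from the hypothesis
                current[j - 1] + 1,      # add a word from the reference
                previous[j - 1] + cost,  # match or substitute
            ))
        previous = current
    return previous[-1]


def tally_matches(annotations, hypothesis, reference):
    """Return True if the number of categorised errors equals the edit distance.

    `annotations` holds one category label per error found by hand.
    """
    counts = Counter(annotations)
    unexpected = set(counts) - set(CATEGORIES)
    if unexpected:
        raise ValueError(f"unexpected categories: {sorted(unexpected)}")
    return sum(counts.values()) == word_edit_distance(hypothesis, reference)


# Placeholder tokens: "b" is substituted by "x" and "d" is extra, so 2 edits.
machine_output = "a b c d"
reference_text = "a x c"
print(word_edit_distance(machine_output, reference_text))
print(tally_matches(["Unknown word", "Morphology"], machine_output, reference_text))
```

If `tally_matches` returns False for a sentence pair, either an error was missed or one edit was counted under two categories, and the hand annotation for that sentence needs another look.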