Skip navigation
st. Mary's University Institutional Repository St. Mary's University Institutional Repository

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/7504
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAkelew, Helen-
dc.date.accessioned2023-03-07T12:08:38Z-
dc.date.available2023-03-07T12:08:38Z-
dc.date.issued2023-01-
dc.identifier.uri.-
dc.identifier.urihttp://hdl.handle.net/123456789/7504-
dc.description.abstractBoth Ge‟ez and Tigrigna languages, which the native Ethiopian languages, are morphological rich and complex for bi-directional machine translation. To overcome this machine translation problem, this study explored the effect of morpheme-based translation unit for bidirectional Ge‟ez and Tigrigna languages. The corpus was taken from Ten Bible Books that contained 384 that contained 9189 verses. The corpus was used both for developing pre-trained model and for validation. Accordingly, to train the morfessor, 12173 simple Ge‟ez and 16708 Tigrigna words were taken from SQlite database. Explicitly, from the total of 7290 verses data, 80%, that is 7290 Verses were used to develop the pre-trained model and 20% which is 1899 Verses were used for testing or validation purposes. we used Mosses for translation process, MGIZA++ for alignment of word and morpheme, morfessor and IRSTLM techniques for the language modeling. After preparing and designing the prototype and the corpus, different experiments were conducted. BLUE score which is standard for automatic machine translation evaluation was used to measure how much of the system output is correct. Experimental results showed a better performance of 9.23% and 8.67% BLEU scores using morpheme-based from Geez to Tigrigna and from Tigrigna to Geez translation, respectively. That is, it was found out that the model or the system output was correct. Regarding the BLUE metrics evaluation tool, it was also found to show proper validation scores or results. As to the alignment challenges, many-to-many alignment is the major challenge. Hence, there is a need to conduct further research to handle the issue of many-to-many alignment challenge.en_US
dc.language.isoenen_US
dc.publisherST. MARY’S UNIVERSITYen_US
dc.subjectBi-directional Machine Translation, Bilingual Evaluation Understudy, Ge‟ez Language, Tigrigna Languageen_US
dc.titleMorpheme Based Bi-Directional Machine Translation The Case of Ge’ez to Tigrignaen_US
dc.typeThesisen_US
Appears in Collections:Master of computer science

Files in This Item:
File Description SizeFormat 
Helen Akelew Final Thesis.pdf2.7 MBAdobe PDFView/Open
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.