• Login
    View Item 
    •   NRU
    • Conference Proceedings
    • Engineering and Technology
    • View Item
    •   NRU
    • Conference Proceedings
    • Engineering and Technology
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Machine Translation for African Languages: Community Creation of Datasets and Models in Uganda

    Thumbnail
    View/Open
    Conference Paper (370.8Kb)
    Date
    2022
    Author
    Akera, Benjamin
    Mukiibi, Jonathan
    Sanyu Naggayi, Lydia
    Babirye, Claire
    Owomugisha, Isaac
    Nsumba, Solomon
    Nakatumba-Nabende, Joyce
    Bainomugisha, Engineer
    Mwebaze, Ernest
    Quinn, John
    Metadata
    Show full item record
    Abstract
    Reliable machine translation systems are only available for a small proportion of the world’s languages, the key limitation being a shortage of training and evaluation data. We provide a case study in the creation of such resources by NLP teams who are local to the communities in which these languages are spoken. A parallel text corpus, SALT, was created for five Ugandan languages (Luganda, Runyankole, Acholi, Lugbara and Ateso) and various methods were explored to train and evaluate translation models. The resulting models were found to be effective for practical translation applications, even for those languages with no previous NLP data available, achieving mean BLEU score of 26.2 for translations to English, and 19.9 from English. The SALT dataset and models described are publicly available at
    URI
    https://openreview.net/forum?id=BK-z5qzEU-9
    https://nru.uncst.go.ug/handle/123456789/6744
    Collections
    • Engineering and Technology [81]

    Research Dissemination Platform copyright © since 2021  UNCST
    Contact Us | Send Feedback
    Partners
     

     

    Browse

    All of NRU
    Communities & CollectionsBy Issue DateAuthorsTitlesSubjects
    This Collection
    By Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Research Dissemination Platform copyright © since 2021  UNCST
    Contact Us | Send Feedback
    Partners