• Login
    View Item 
    •   NRU
    • Journal Publications
    • Engineering and Technology
    • Engineering and Technology
    • View Item
    •   NRU
    • Journal Publications
    • Engineering and Technology
    • Engineering and Technology
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition

    Thumbnail
    View/Open
    Article (1.125Mb)
    Date
    2022
    Author
    Mukiibi, Jonathan
    Katumba, Andrew
    Nakatumba-Nabende, Joyce
    Hussein, Ali
    Meyer, Josh
    Metadata
    Show full item record
    Abstract
    Building a usable radio monitoring automatic speech recognition (ASR) system is a challenging task for under-resourced languages and yet this is paramount in societies where radio is the main medium of public communication and discussions. Initial efforts by the United Nations in Uganda have proved how understanding the perceptions of rural people who are excluded from social media is important in national planning. However, these efforts are being challenged by the absence of transcribed speech datasets. In this paper, The Makerere Artificial Intelligence research lab releases a Luganda radio speech corpus of 155 hours. To our knowledge, this is the first publicly available radio dataset in sub-Saharan Africa. The paper describes the development of the voice corpus and presents baseline Luganda ASR performance results using Coqui STT toolkit, an open source speech recognition toolkit.
    URI
    https://arxiv.org/abs/2206.09790
    https://nru.uncst.go.ug/handle/123456789/5634
    Collections
    • Engineering and Technology [839]

    Research Dissemination Platform copyright © since 2021  UNCST
    Contact Us | Send Feedback
    Partners
     

     

    Browse

    All of NRU
    Communities & CollectionsBy Issue DateAuthorsTitlesSubjects
    This Collection
    By Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Research Dissemination Platform copyright © since 2021  UNCST
    Contact Us | Send Feedback
    Partners