Bibletts：大型，高保真，多语言和独特的非洲语料库

论文标题

Bibletts：大型，高保真，多语言和独特的非洲语料库

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

论文作者

Meyer, Josh, Adelani, David Ifeoluwa, Casanova, Edresson, Öktem, Alp, Weber, Daniel Whitenack Julian, Kabongo, Salomon, Salesky, Elizabeth, Orife, Iroro, Leong, Colin, Ogayo, Perez, Emezue, Chris, Mukiibi, Jonathan, Osei, Salomey, Agbolo, Apelete, Akinode, Victor, Opoku, Bernard, Olanrewaju, Samuel, Alabi, Jesujoba, Muhammad, Shamsuddeen

论文摘要

Bibletts是一种在撒哈拉以南非洲使用的十种语言的大型，高质量的开放语音数据集。该语料库包含每语言最多86个小时的对齐，工作室质量的48kHz单扬声器录音，从而可以开发高质量的文本到语音模型。所代表的十种语言是：Akuapem Twi，Asante Twi，Chichewa，Ewe，Hausa，Kikuyu，Lingala，Luganda，Luganda，Luo和Yoruba。该语料库是Biblica的公开项目和发行的圣经录音的衍生作品。我们已经对齐，清洁和过滤了原始录音，并还对每种语言的对齐子进行了手工检查。我们为具有Coqui TTS的文本到语音模型提供了结果。数据是根据商业友好的CC-SA许可发布的。

BibleTTS is a large, high-quality, open speech dataset for ten languages spoken in Sub-Saharan Africa. The corpus contains up to 86 hours of aligned, studio quality 48kHz single speaker recordings per language, enabling the development of high-quality text-to-speech models. The ten languages represented are: Akuapem Twi, Asante Twi, Chichewa, Ewe, Hausa, Kikuyu, Lingala, Luganda, Luo, and Yoruba. This corpus is a derivative work of Bible recordings made and released by the Open.Bible project from Biblica. We have aligned, cleaned, and filtered the original recordings, and additionally hand-checked a subset of the alignments for each language. We present results for text-to-speech models with Coqui TTS. The data is released under a commercial-friendly CC-BY-SA license.

下载PDF全文

下载文献需遵守相关版权规定

论文标题