Paper Title

Analysis of Disfluency in Children's Speech

Authors

Trang Tran, Morgan Tinkler, Gary Yeung, Abeer Alwan, Mari Ostendorf

Abstract

Disfluencies are prevalent in spontaneous speech, as shown in many studies of adult speech. Less is understood about children's speech, especially in pre-school children who are still developing their language skills. We present a novel dataset with annotated disfluencies of spontaneous explanations from 26 children (ages 5-8), interviewed twice over a year-long period. Our preliminary analysis reveals significant differences between children's speech in our corpus and adult spontaneous speech from two corpora (Switchboard and CallHome). Children have higher disfluency and filler rates, tend to use nasal filled pauses more frequently, and on average exhibit longer reparandums than repairs, in contrast to adult speakers. Despite the differences, an automatic disfluency detection system trained on adult (Switchboard) speech transcripts performs reasonably well on children's speech, achieving an F1 score that is 10% higher than the score on an adult out-of-domain dataset (CallHome).
