论文标题
ILASR:在生产规模上自动语音识别的隐私性增量学习
ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
论文作者
论文摘要
增量学习是通过流数据大规模构建模型构建和更新的一种范式。对于端到端的自动语音识别(ASR)任务,缺少人类注释的标签以及需要保留模型建设的隐私政策的需求使其成为艰巨的挑战。受这些挑战的促进,在本文中,我们使用基于云的框架为生产系统展示了从隐私保留自动语音识别(ILASR)的增量学习的见解。我们的意思是,通过保留隐私,对未注释的短暂数据的使用情况。该系统是用于增量/持续学习的生产Levalasr模型迈出的一步,该模型提供了接近实时测试床,以在云中进行端到端ASR实验,同时遵守保留隐私的政策。我们表明,即使在没有人类注释的标签的情况下,拟议的系统也可以在六个月的新时间内显着改善生产模型(3%),而在增量学习中,较弱的监督和大批量大小的水平都不同。在新时期,这种改进比测试集的新单词和短语为20%。我们在ASR中以保护隐私的增量方式展示了模型构建的有效性,同时进一步探索了具有有效的教师模型和使用大批量大小的实用性。
Incremental learning is one paradigm to enable model building and updating at scale with streaming data. For end-to-end automatic speech recognition (ASR) tasks, the absence of human annotated labels along with the need for privacy preserving policies for model building makes it a daunting challenge. Motivated by these challenges, in this paper we use a cloud based framework for production systems to demonstrate insights from privacy preserving incremental learning for automatic speech recognition (ILASR). By privacy preserving, we mean, usage of ephemeral data which are not human annotated. This system is a step forward for production levelASR models for incremental/continual learning that offers near real-time test-bed for experimentation in the cloud for end-to-end ASR, while adhering to privacy-preserving policies. We show that the proposed system can improve the production models significantly(3%) over a new time period of six months even in the absence of human annotated labels with varying levels of weak supervision and large batch sizes in incremental learning. This improvement is 20% over test sets with new words and phrases in the new time period. We demonstrate the effectiveness of model building in a privacy-preserving incremental fashion for ASR while further exploring the utility of having an effective teacher model and use of large batch sizes.