论文标题
与双重解码的联合标题和字幕
Joint Generation of Captions and Subtitles with Dual Decoding
论文作者
论文摘要
随着视听内容的数量增加,开发自动字幕和副标题解决方案以符合不断增长的国际观众的期望的需求似乎是增加吞吐量并降低相关后生产成本的唯一可行方式。自动字幕和字幕通常需要紧密地交织,以达到彼此以及视频信号的适当水平和同步。在这项工作中,我们评估了一个双重解码方案,以实现这两个任务之间的强大耦合,并表明如何提高充足性和一致性,而在模型大小和训练复杂性方面几乎没有额外的成本。
As the amount of audio-visual content increases, the need to develop automatic captioning and subtitling solutions to match the expectations of a growing international audience appears as the only viable way to boost throughput and lower the related post-production costs. Automatic captioning and subtitling often need to be tightly intertwined to achieve an appropriate level of consistency and synchronization with each other and with the video signal. In this work, we assess a dual decoding scheme to achieve a strong coupling between these two tasks and show how adequacy and consistency are increased, with virtually no additional cost in terms of model size and training complexity.