恩典：使用数据量表的自动编码器使用数据量表的实时视频通信

论文标题

恩典：使用数据量表的自动编码器使用数据量表的实时视频通信

GRACE: Loss-Resilient Real-Time Video Communication Using Data-Scalable Autoencoder

论文作者

Cheng, Yihua, Arapin, Anton, Zhang, Ziyi, Zhang, Qizheng, Li, Hanchen, Feamster, Nick, Jiang, Junchen

论文摘要

在许多实时视频应用程序中，我们看到一旦收到了其数据包的任何（非空）子集并通过每个新包装来提高质量，允许客户对每个帧的需求日益增长（尤其是在长时间的延迟和动态带宽中）。我们称其为数据量表交付。不幸的是，现有技术（例如FEC，RS和喷泉代码）不足：它们需要交付最少数量的数据包来解码帧，并且/或带有冗余的PAD视频数据以期待数据包丢失，如果不会丢失数据包，则会伤害视频质量。这项工作探讨了一种新方法，这是受神经网络自动编码器的最新进展的启发，这使数据量表可以交付成为可能。我们提出Grace，这是一个具体的数据量表实时视频系统。通过相同的视频编码，Grace的质量略低于传统的编解码器，而没有冗余的丢失，但由于每个错过的数据包，其质量的降低比现有解决方案更优雅，从而使客户可以灵活地在框架延迟和视频质量之间进行交易。格蕾丝（Grace）做出了两种贡献：（1）它训练新的自定义自动编码器，以平衡压缩效率和弹性与各种数据包损失；（2）它使用新的传输方案将自动编码器编码的帧作为单独解码的数据包提供。我们在真实的网络轨迹和视频上测试了GRACE（以及传统的损失方案和编解码器），并表明，尽管Grace的压缩效率略低于高度工程的视频编解码器，但它会大大减少尾部视频框架延迟（在95个百分位时）降低了2 $ \ timple times $）

Across many real-time video applications, we see a growing need (especially in long delays and dynamic bandwidth) to allow clients to decode each frame once any (non-empty) subset of its packets is received and improve quality with each new packet. We call it data-scalable delivery. Unfortunately, existing techniques (e.g., FEC, RS and Fountain Codes) fall short: they require either delivery of a minimum number of packets to decode frames, and/or pad video data with redundancy in anticipation of packet losses, which hurts video quality if no packets get lost. This work explores a new approach, inspired by recent advances of neural-network autoencoders, which make data-scalable delivery possible. We present Grace, a concrete data-scalable real-time video system. With the same video encoding, Grace's quality is slightly lower than traditional codec without redundancy when no packet is lost, but with each missed packet, its quality degrades much more gracefully than existing solutions, allowing clients to flexibly trade between frame delay and video quality. Grace makes two contributions: (1) it trains new custom autoencoders to balance compression efficiency and resilience against a wide range of packet losses; and (2) it uses a new transmission scheme to deliver autoencoder-coded frames as individually decodable packets. We test Grace (and traditional loss-resilient schemes and codecs) on real network traces and videos, and show that while Grace's compression efficiency is slightly worse than heavily engineered video codecs, it significantly reduces tail video frame delay (by 2$\times$ at the 95th percentile) with the marginally lowered video quality

下载PDF全文

下载文献需遵守相关版权规定

论文标题