论文标题
我应该散步吗?从视频数据中估算能源支出
Should I take a walk? Estimating Energy Expenditure from Video Data
论文作者
论文摘要
我们探讨了从他/她的视频观察中自动推断人体育锻炼期间使用的千分化量的问题。为了研究这项研究不足的任务,我们介绍了VID2Burn - 一种Omni-Source基准测试,用于估算视频数据的热量支出,这些视频数据具有高强度和低强度活动,我们根据医学文献中建立的模型得出了基于该模型的能量消耗注释。实际上,训练集只能涵盖一定数量的活动类型,并且验证该模型确实捕获了能量消耗的本质(例如,涉及多少肌肉,涉及多少肌肉以及它们的工作程度),而不是记住训练过程中所见特定活动类别的固定值。理想情况下,模型应超越这种特定类别的偏见,并在培训期间不明确存在的活动类别的视频中回归热量成本。考虑到此属性,VID2Burn伴随着跨类别基准,该任务是在培训期间不存在的体育活动类型来回归热量支出。对针对能源消耗估算任务进行修改的视频识别方法的最新方法的广泛评估证明了此问题的困难,尤其是对于测试时间的新活动类型,标志着新的研究方向。数据集和代码可从https://github.com/kpeng9510/vid2burn获得。
We explore the problem of automatically inferring the amount of kilocalories used by human during physical activity from his/her video observation. To study this underresearched task, we introduce Vid2Burn -- an omni-source benchmark for estimating caloric expenditure from video data featuring both, high- and low-intensity activities for which we derive energy expenditure annotations based on models established in medical literature. In practice, a training set would only cover a certain amount of activity types, and it is important to validate, if the model indeed captures the essence of energy expenditure, (e.g., how many and which muscles are involved and how intense they work) instead of memorizing fixed values of specific activity categories seen during training. Ideally, the models should look beyond such category-specific biases and regress the caloric cost in videos depicting activity categories not explicitly present during training. With this property in mind, Vid2Burn is accompanied with a cross-category benchmark, where the task is to regress caloric expenditure for types of physical activities not present during training. An extensive evaluation of state-of-the-art approaches for video recognition modified for the energy expenditure estimation task demonstrates the difficulty of this problem, especially for new activity types at test-time, marking a new research direction. Dataset and code are available at https://github.com/KPeng9510/Vid2Burn.