论文标题
浓缩咖啡:用于处理异质传感器数据的熵和形状的意识时间序列细分
ESPRESSO: Entropy and ShaPe awaRe timE-Series SegmentatiOn for processing heterogeneous sensor data
论文作者
论文摘要
从高维可穿戴传感器数据,智能设备或物联网数据中提取信息丰富且有意义的时间段是在应用程序识别(HAR),轨迹预测,手势识别和救生型等应用中的至关重要的预处理步骤。在本文中,我们提出了意式浓缩咖啡(熵和形状的意识时间序列分割),这是一种用于多维时间序列的混合分割模型,该模型旨在利用时间序列的熵和时间形状属性。意式浓缩咖啡不同于现有的方法,该方法专门关注时间序列的特定统计或时间属性。作为模型开发的一部分,引入了时间序列$ WCAC $的新型时间表示以及一种贪婪的搜索方法,该方法根据熵指标估算细分市场。浓缩咖啡被证明可以在七个公共可穿戴和无磨损感应的公共数据集中提供出色的性能。此外,我们对这些数据集进行了更深入的研究,以了解浓缩咖啡及其组成方法在不同的数据集特征方面的性能。最后,我们提供了两个有趣的案例研究,以展示如何应用浓缩咖啡可以帮助推断人类的日常活动和情感状态。
Extracting informative and meaningful temporal segments from high-dimensional wearable sensor data, smart devices, or IoT data is a vital preprocessing step in applications such as Human Activity Recognition (HAR), trajectory prediction, gesture recognition, and lifelogging. In this paper, we propose ESPRESSO (Entropy and ShaPe awaRe timE-Series SegmentatiOn), a hybrid segmentation model for multi-dimensional time-series that is formulated to exploit the entropy and temporal shape properties of time-series. ESPRESSO differs from existing methods that focus upon particular statistical or temporal properties of time-series exclusively. As part of model development, a novel temporal representation of time-series $WCAC$ was introduced along with a greedy search approach that estimate segments based upon the entropy metric. ESPRESSO was shown to offer superior performance to four state-of-the-art methods across seven public datasets of wearable and wear-free sensing. In addition, we undertake a deeper investigation of these datasets to understand how ESPRESSO and its constituent methods perform with respect to different dataset characteristics. Finally, we provide two interesting case-studies to show how applying ESPRESSO can assist in inferring daily activity routines and the emotional state of humans.