时间序列的早期分类。基于成本的优化标准和算法

论文标题

时间序列的早期分类。基于成本的优化标准和算法

Early Classification of Time Series. Cost-based Optimization Criterion and Algorithms

论文作者

Achenchabe, Youssef, Bondu, Alexis, Cornuéjols, Antoine, Dachraoui, Asma

论文摘要

越来越多的应用程序需要尽快识别传入时间序列的类别，而不会过分损害预测的准确性。在本文中，我们提出了一个新的优化标准，该标准考虑了错误分类的成本和延迟决定的成本。基于此优化标准，我们得出了一个非洋白质算法的家族，试图预测与等待成本平衡的信息的预期未来增益。在一类算法（基于无监督的算法）中，期望使用时间序列的聚类，而在第二类，基于监督的基于监督的时间序列是根据用于标记它们标记的分类器的置信度级别进行分组的。使用大量延迟成本函数在实际数据集上进行的广泛实验表明，所提出的算法能够令人满意地求解初级性与准确性权衡，而基于监督的方法比不明显的方法更好。此外，所有这些方法在多种条件下的表现要比基于近视策略的最先进方法的状态表现更好，该方法被认为是非常有竞争力的。

An increasing number of applications require to recognize the class of an incoming time series as quickly as possible without unduly compromising the accuracy of the prediction. In this paper, we put forward a new optimization criterion which takes into account both the cost of misclassification and the cost of delaying the decision. Based on this optimization criterion, we derived a family of non-myopic algorithms which try to anticipate the expected future gain in information in balance with the cost of waiting. In one class of algorithms, unsupervised-based, the expectations use the clustering of time series, while in a second class, supervised-based, time series are grouped according to the confidence level of the classifier used to label them. Extensive experiments carried out on real data sets using a large range of delay cost functions show that the presented algorithms are able to satisfactorily solving the earliness vs. accuracy trade-off, with the supervised-based approaches faring better than the unsupervised-based ones. In addition, all these methods perform better in a wide variety of conditions than a state of the art method based on a myopic strategy which is recognized as very competitive.

下载PDF全文

下载文献需遵守相关版权规定

论文标题