论文标题

基于波动的异常检测

Fluctuation-based Outlier Detection

论文作者

Du, Xusheng, Zuo, Enguang, He, Zhenzhen, Yu, Jiong

论文摘要

异常检测是机器学习中的一个重要主题,并且已在广泛的应用中使用。离群值是数量很少并偏离大多数对象的对象。由于这两种特性,我们表明离群值易受一种称为波动的机制。本文提出了一种称为基于波动的异常检测(FBOD)的方法,该方法可实现较低的线性时间复杂性,并纯粹基于波动的概念检测异常值,而无需采用任何距离,密度或隔离度量。与所有现有方法根本不同。 FBOD首先通过使用随机链接将欧几里得结构数据集转换为图形,然后根据图的连接传播特征值。最后,通过比较对象及其邻居的波动之间的差异,FBOD可以以更大的差异为异常来确定对象。将FBOD与八个实际表格数据集和三个视频数据集上的七种最先进算法进行比较的实验结果表明,在大多数情况下,FBOD在大多数情况下都优于其竞争对手,而FBOD仅具有最快算法的执行时间的5%。实验代码可在以下网址获得:https://github.com/fluctuationod/fluctuation op-lase-outlier-detection。

Outlier detection is an important topic in machine learning and has been used in a wide range of applications. Outliers are objects that are few in number and deviate from the majority of objects. As a result of these two properties, we show that outliers are susceptible to a mechanism called fluctuation. This article proposes a method called fluctuation-based outlier detection (FBOD) that achieves a low linear time complexity and detects outliers purely based on the concept of fluctuation without employing any distance, density or isolation measure. Fundamentally different from all existing methods. FBOD first converts the Euclidean structure datasets into graphs by using random links, then propagates the feature value according to the connection of the graph. Finally, by comparing the difference between the fluctuation of an object and its neighbors, FBOD determines the object with a larger difference as an outlier. The results of experiments comparing FBOD with seven state-of-the-art algorithms on eight real-world tabular datasets and three video datasets show that FBOD outperforms its competitors in the majority of cases and that FBOD has only 5% of the execution time of the fastest algorithm. The experiment codes are available at: https://github.com/FluctuationOD/Fluctuation-based-Outlier-Detection.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源