论文标题
在线非线性Neyman-Pearson分类的神经网络方法
A Neural Network Approach for Online Nonlinear Neyman-Pearson Classification
论文作者
论文摘要
我们提出了一个新颖的Neyman-Pearson(NP)分类器,该分类器既是在线和非线性文学中的第一次。所提出的分类器以在线方式在二进制标记的数据流上运行,并最大程度地提高了有关用户指定且可控的误报率的检测能力。我们的NP分类器是单个隐藏层馈电神经网络(SLFN),它以随机傅立叶特征(RFF)初始化,以构建径向基函数的内核空间,并在其隐藏的层中使用正弦激活构建。 RFF的这种使用不仅提供了出色的初始化,具有出色的非线性建模能力,而且还指数级降低了参数复杂性并压实网络以减轻过度拟合的同时,同时大大提高了处理效率。我们依次根据Lagrangian NP目标学习使用随机梯度下降更新的SLFN。结果,我们获得了加快的在线适应和强大的非线性Neyman-Pearson建模。我们的算法适用于大规模数据应用程序,并提供了实时处理的不错的误报率可控性,因为它仅具有O(n)计算和O(1)空间复杂性(N:数据实例数)。在我们在几个真实数据集上进行的一系列实验中,我们的算法高于竞争性最新技术,要么通过在NP分类目标方面胜过具有可比的计算和空间复杂性或实现相当较低的性能,要么以显着较低的复杂性来实现。
We propose a novel Neyman-Pearson (NP) classifier that is both online and nonlinear as the first time in the literature. The proposed classifier operates on a binary labeled data stream in an online manner, and maximizes the detection power about a user-specified and controllable false positive rate. Our NP classifier is a single hidden layer feedforward neural network (SLFN), which is initialized with random Fourier features (RFFs) to construct the kernel space of the radial basis function at its hidden layer with sinusoidal activation. Not only does this use of RFFs provide an excellent initialization with great nonlinear modeling capability, but it also exponentially reduces the parameter complexity and compactifies the network to mitigate overfitting while improving the processing efficiency substantially. We sequentially learn the SLFN with stochastic gradient descent updates based on a Lagrangian NP objective. As a result, we obtain an expedited online adaptation and powerful nonlinear Neyman-Pearson modeling. Our algorithm is appropriate for large scale data applications and provides a decent false positive rate controllability with real time processing since it only has O(N) computational and O(1) space complexity (N: number of data instances). In our extensive set of experiments on several real datasets, our algorithm is highly superior over the competing state-of-the-art techniques, either by outperforming in terms of the NP classification objective with a comparable computational as well as space complexity or by achieving a comparable performance with significantly lower complexity.