论文标题
一种视觉挖掘方法,以改善多个现实学习
A Visual Mining Approach to Improved Multiple-Instance Learning
论文作者
论文摘要
多种现实学习(MIL)是机器学习的范式,旨在对物体(实例)的集合(袋)进行分类,仅将标签分配给袋子。通常通过选择一个实例来表示每个袋子,将MIL问题转换为标准监督学习,从而解决了这个问题。可视化可以是通过将用户的知识纳入分类过程来评估学习方案的有用工具。考虑到多个实体学习是一种无法通过当前可视化技术来处理的范式,我们提出了一种基于多尺度树的可视化,称为MILTREE来支持MIL问题。树的第一级代表袋子,第二级代表属于每个袋子的实例,使用户可以直观地理解MIL数据集。此外,我们为MIL提出了两种新的实例选择方法,这些方法可以帮助用户进一步改进模型。我们的方法可以处理二进制和多类方案。在我们的实验中,SVM用于构建分类器。在Miltree布局的支持下,通过更改由原型实例组成的训练集更新了初始分类模型。实验结果证明了我们方法的有效性,表明Miltree的视觉挖掘可以支持MIL场景中的探索和改进模型,并且在大多数情况下,我们的实例选择方法优于当前可用的替代方案。
Multiple-instance learning (MIL) is a paradigm of machine learning that aims to classify a set (bag) of objects (instances), assigning labels only to the bags. This problem is often addressed by selecting an instance to represent each bag, transforming a MIL problem into standard supervised learning. Visualization can be a useful tool to assess learning scenarios by incorporating the users' knowledge into the classification process. Considering that multiple-instance learning is a paradigm that cannot be handled by current visualization techniques, we propose a multiscale tree-based visualization called MILTree to support MIL problems. The first level of the tree represents the bags, and the second level represents the instances belonging to each bag, allowing users to understand the MIL datasets in an intuitive way. In addition, we propose two new instance selection methods for MIL, which help users improve the model even further. Our methods can handle both binary and multiclass scenarios. In our experiments, SVM was used to build the classifiers. With support of the MILTree layout, the initial classification model was updated by changing the training set, which is composed of the prototype instances. Experimental results validate the effectiveness of our approach, showing that visual mining by MILTree can support exploring and improving models in MIL scenarios and that our instance selection methods outperform the currently available alternatives in most cases.