论文标题
在多组分,多层次不断发展的预测系统的背景下对元级学习的综述
A Review of Meta-level Learning in the Context of Multi-component, Multi-level Evolving Prediction Systems
论文作者
论文摘要
数据的指数增长,数据的多样性和速度提高了对从数据中提取有用模式的自动化或半自动化方法进行研究的需求。它需要深厚的专家知识和广泛的计算资源,以找到针对给定问题的最合适的学习方法映射。在存在大量数据的许多学习算法配置的情况下,这成为一个挑战。因此,需要使用智能推荐引擎来建议数据集的最佳学习算法是什么。专家通常使用的技术是基于试验和错误方法评估和比较一些可能的解决方案相互对抗的解决方案,使用他们在特定域等上的先前经验相对。试验和错误方法与专家的先验知识相结合,尽管计算机上和时间昂贵,但经常被证明在处理处理问题的情况下经常被证明是在固定问题上效法的。但是,这种方法通常不可行,以适用于不断到达数据流的非平稳问题。此外,在非平稳环境中,每当基础数据分布发生变化时,数据的手动分析和测试各种方法都将非常困难或简直是不可行的。在这种情况下和在线预测系统中,有几个任务可以使用元学习来有效地促进最佳建议,包括1)预处理步骤,2)学习算法或其组合,3)适应性机制及其参数,4)4)重建概念提取和5)概念漂移检测。
The exponential growth of volume, variety and velocity of data is raising the need for investigations of automated or semi-automated ways to extract useful patterns from the data. It requires deep expert knowledge and extensive computational resources to find the most appropriate mapping of learning methods for a given problem. It becomes a challenge in the presence of numerous configurations of learning algorithms on massive amounts of data. So there is a need for an intelligent recommendation engine that can advise what is the best learning algorithm for a dataset. The techniques that are commonly used by experts are based on a trial and error approach evaluating and comparing a number of possible solutions against each other, using their prior experience on a specific domain, etc. The trial and error approach combined with the expert's prior knowledge, though computationally and time expensive, have been often shown to work for stationary problems where the processing is usually performed off-line. However, this approach would not normally be feasible to apply to non-stationary problems where streams of data are continuously arriving. Furthermore, in a non-stationary environment, the manual analysis of data and testing of various methods whenever there is a change in the underlying data distribution would be very difficult or simply infeasible. In that scenario and within an on-line predictive system, there are several tasks where Meta-learning can be used to effectively facilitate best recommendations including 1) pre-processing steps, 2) learning algorithms or their combination, 3) adaptivity mechanisms and their parameters, 4) recurring concept extraction, and 5) concept drift detection.