论文标题
线性回归的自动跨域转移学习
Automatic Cross-Domain Transfer Learning for Linear Regression
论文作者
论文摘要
转移学习研究试图使模型诱导可以在不同领域转移。该方法假定每个实例所属的域的特定信息已知。本文有助于将线性回归问题的转移学习能力扩展到域信息不确定或未知的情况。实际上,该框架可以扩展到分类问题。对于普通数据集,我们假设某些潜在域信息可用于转移学习。每个域中的实例可以通过不同的参数来推断。我们从与解释变量$ x $相对应的回归系数的分布以及基于dirichlet过程的响应变量$ y $的分布中获得此域信息,这是更合理的。结果,我们不仅像往常一样转移变量$ x $,而且可以变量$ y $,这是具有挑战性的,因为测试数据没有响应值。先前的工作主要通过基于转导学习的伪标签来克服问题,这引入了严重的偏见。我们提供了一个新颖的框架来分析问题并考虑到这种一般情况:可变$ x $和可变$ y $的联合分布。此外,与以前的工作相比,我们的方法可以很好地控制偏见。我们对新功能空间进行线性回归,该空间由不同的潜在域和目标域组成,该域是根据测试数据组成的。实验结果表明,所提出的模型在实际数据集上表现良好。
Transfer learning research attempts to make model induction transferable across different domains. This method assumes that specific information regarding to which domain each instance belongs is known. This paper helps to extend the capability of transfer learning for linear regression problems to situations where the domain information is uncertain or unknown; in fact, the framework can be extended to classification problems. For normal datasets, we assume that some latent domain information is available for transfer learning. The instances in each domain can be inferred by different parameters. We obtain this domain information from the distribution of the regression coefficients corresponding to the explanatory variable $x$ as well as the response variable $y$ based on a Dirichlet process, which is more reasonable. As a result, we transfer not only variable $x$ as usual but also variable $y$, which is challenging since the testing data have no response value. Previous work mainly overcomes the problem via pseudo-labelling based on transductive learning, which introduces serious bias. We provide a novel framework for analysing the problem and considering this general situation: the joint distribution of variable $x$ and variable $y$. Furthermore, our method controls the bias well compared with previous work. We perform linear regression on the new feature space that consists of different latent domains and the target domain, which is from the testing data. The experimental results show that the proposed model performs well on real datasets.