可解释的经验风险最小化

论文标题

可解释的经验风险最小化

Explainable Empirical Risk Minimization

论文作者

Zhang, L., Karakasidis, G., Odnoblyudova, A., Dogruel, L., Jung, A.

论文摘要

机器学习（ML）方法的成功应用越来越依赖于其可解释性或解释性。设计可解释的ML系统对确保针对人类的自动决策的透明度起着重要作用。 ML方法的解释性也是值得信赖的人工智能的重要组成部分。确保解释性的关键挑战是其对特定人类用户的依赖（“说明”）。机器学习方法的用户可能对机器学习原理具有截然不同的背景知识。一个用户可能拥有机器学习或相关领域的大学学位，而另一个用户可能从未接受过高中数学的正规培训。本文采用信息理论概念来开发一种新颖的措施，以实现ML方法提供的预测的主观解释。给定用户反馈，我们通过预测的条件熵来构建此措施。用户反馈可以从用户调查或生物物理测量值中获得。我们的主要贡献是学习一个假设的可解释的经验风险最小化（EERM）原则，该假设可以在主观解释性和风险之间进行最佳平衡。 EERM原理是灵活的，可以与任意的机器学习模型结合使用。我们为线性模型和决策树提出了EERM的几个实际实现。数值实验证明了EERM在社交媒体上检测不适当语言的应用。

The successful application of machine learning (ML) methods becomes increasingly dependent on their interpretability or explainability. Designing explainable ML systems is instrumental to ensuring transparency of automated decision-making that targets humans. The explainability of ML methods is also an essential ingredient for trustworthy artificial intelligence. A key challenge in ensuring explainability is its dependence on the specific human user ("explainee"). The users of machine learning methods might have vastly different background knowledge about machine learning principles. One user might have a university degree in machine learning or related fields, while another user might have never received formal training in high-school mathematics. This paper applies information-theoretic concepts to develop a novel measure for the subjective explainability of the predictions delivered by a ML method. We construct this measure via the conditional entropy of predictions, given a user feedback. The user feedback might be obtained from user surveys or biophysical measurements. Our main contribution is the explainable empirical risk minimization (EERM) principle of learning a hypothesis that optimally balances between the subjective explainability and risk. The EERM principle is flexible and can be combined with arbitrary machine learning models. We present several practical implementations of EERM for linear models and decision trees. Numerical experiments demonstrate the application of EERM to detecting the use of inappropriate language on social media.

下载PDF全文

下载文献需遵守相关版权规定

论文标题