Paper Title
Unifying Evaluation of Machine Learning Safety Monitors
Paper Authors
Paper Abstract
With the increasing use of Machine Learning (ML) in critical autonomous systems, runtime monitors have been developed to detect prediction errors and keep the system in a safe state during operation. Monitors have been proposed for different applications involving diverse perception tasks and ML models, and specific evaluation procedures and metrics are used in different contexts. This paper introduces three unified safety-oriented metrics, representing the safety benefits of the monitor (Safety Gain), the safety gaps remaining after its use (Residual Hazard), and its negative impact on the system's performance (Availability Cost). Computing these metrics requires defining two return functions, which represent how a given ML prediction will impact expected future rewards and hazards. Three use cases (classification, drone landing, and autonomous driving) are used to demonstrate how metrics from the literature can be expressed in terms of the proposed ones. Experimental results on these examples show how different evaluation choices impact the perceived performance of a monitor. As our formalism requires explicit safety assumptions to be formulated, it allows us to ensure that the evaluation conducted matches the high-level system requirements.
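The abstract only names the three metrics and the two return functions; the Python sketch below is one illustrative way they might fit together, not the paper's actual definitions. The `Outcome` record, the `evaluate_monitor` helper, and the assumption that a rejected prediction triggers a fallback with zero reward and zero hazard are all hypothetical choices made for this example.

```python
"""Minimal sketch of the three safety-oriented metrics named in the abstract.

Assumption (not from the paper): the monitor either accepts an ML prediction,
in which case the system acts on it, or rejects it, in which case a safe
fallback with zero reward and zero hazard is used instead.
"""
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class Outcome:
    prediction: object  # the ML prediction for one input (any representation)
    rejected: bool      # True if the monitor raised an alarm on this prediction


def evaluate_monitor(
    outcomes: Sequence[Outcome],
    reward_return: Callable[[object], float],  # expected future reward of acting on a prediction
    hazard_return: Callable[[object], float],  # expected future hazard of acting on a prediction
) -> dict:
    """Compute illustrative per-sample averages of the three metrics.

    Under the fallback assumption above, the monitor's effect decomposes into
    the hazard it removes (Safety Gain), the hazard it lets through
    (Residual Hazard), and the reward it forfeits (Availability Cost).
    """
    if not outcomes:
        raise ValueError("need at least one outcome")
    safety_gain = sum(hazard_return(o.prediction) for o in outcomes if o.rejected)
    residual_hazard = sum(hazard_return(o.prediction) for o in outcomes if not o.rejected)
    availability_cost = sum(reward_return(o.prediction) for o in outcomes if o.rejected)
    n = len(outcomes)
    return {
        "safety_gain": safety_gain / n,
        "residual_hazard": residual_hazard / n,
        "availability_cost": availability_cost / n,
    }


# Example: the classification use case, with a prediction reduced to whether it
# is correct. The return functions below are hypothetical: hazard 1 for acting
# on an error, reward 1 for acting on a correct output. Residual Hazard then
# reduces to the rate of undetected errors, and Availability Cost to the rate
# of needlessly rejected correct predictions.
outcomes = [
    Outcome(prediction=True, rejected=False),   # correct, accepted
    Outcome(prediction=False, rejected=True),   # error, caught by the monitor
    Outcome(prediction=True, rejected=True),    # correct, needlessly rejected
    Outcome(prediction=False, rejected=False),  # error, missed by the monitor
]
metrics = evaluate_monitor(
    outcomes,
    reward_return=lambda correct: 1.0 if correct else 0.0,
    hazard_return=lambda correct: 0.0 if correct else 1.0,
)
print(metrics)  # {'safety_gain': 0.25, 'residual_hazard': 0.25, 'availability_cost': 0.25}
```

With these particular return functions the three metrics coincide with familiar confusion-matrix rates, which is consistent with the abstract's claim that metrics from the literature can be expressed in terms of the proposed ones; richer use cases such as drone landing or autonomous driving would presumably use task-specific return functions instead.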