论文标题

联合的模仿学习以保留入侵检测

Federated Mimic Learning for Privacy Preserving Intrusion Detection

论文作者

Al-Marri, Noor Ali Al-Athba, Ciftler, Bekir Sait, Abdallah, Mohamed

论文摘要

物联网(IoT)设备易于攻击,因为它们的隐私和安全组件的限制。这些攻击从利用后门到破坏设备的通信网络不等。入侵检测系统(IDS)在确保对这些攻击的IoT设备的信息隐私和安全性方面起着至关重要的作用。最近,由于其高分类准确性,基于深度学习的ID技术变得越来越突出。但是,由于将用户数据传输到集中式服务器,传统的深度学习技术会危害用户隐私。联合学习(FL)是一种流行的保护隐私化学习方法。 FL可以在边缘设备的本地启用培训模型,并将本地型号传输到集中式服务器,而不是传输敏感数据。然而,FL可能会遭受反向工程ML攻击,这些攻击可以从模型中学习有关用户数据的信息。为了克服逆向工程的问题,模仿学习是保留基于ML的ID隐私的另一种方法。在模拟学习中,通过公共数据集对学生模型进行了培训,该数据集用敏感用户数据培训的教师模型标记。在这项工作中,我们提出了一种新颖的方法,该方法结合了FL和模仿学习的优势,即联合的模拟学习来创建分布式ID,同时最大程度地减少了危害用户的隐私风险,并与其他基于ML的IDS技术相比,使用NSL-KDD数据集进行了基准性能。我们的结果表明,通过联合模拟学习,我们可以实现98.11%的检测准确性。

Internet of things (IoT) devices are prone to attacks due to the limitation of their privacy and security components. These attacks vary from exploiting backdoors to disrupting the communication network of the devices. Intrusion Detection Systems (IDS) play an essential role in ensuring information privacy and security of IoT devices against these attacks. Recently, deep learning-based IDS techniques are becoming more prominent due to their high classification accuracy. However, conventional deep learning techniques jeopardize user privacy due to the transfer of user data to a centralized server. Federated learning (FL) is a popular privacy-preserving decentralized learning method. FL enables training models locally at the edge devices and transferring local models to a centralized server instead of transferring sensitive data. Nevertheless, FL can suffer from reverse engineering ML attacks that can learn information about the user's data from model. To overcome the problem of reverse engineering, mimic learning is another way to preserve the privacy of ML-based IDS. In mimic learning, a student model is trained with the public dataset, which is labeled with the teacher model that is trained by sensitive user data. In this work, we propose a novel approach that combines the advantages of FL and mimic learning, namely federated mimic learning to create a distributed IDS while minimizing the risk of jeopardizing users' privacy, and benchmark its performance compared to other ML-based IDS techniques using NSL-KDD dataset. Our results show that we can achieve 98.11% detection accuracy with federated mimic learning.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源