一种基于深厚的强化学习的隐藏的抗束缚方法

论文标题

一种基于深厚的强化学习的隐藏的抗束缚方法

A hidden anti-jamming method based on deep reinforcement learning

论文作者

Wang, Yifan, Liu, Xin, Wang, Mei, Yu, Yu

论文摘要

无线通信的当前大多数抗界算法都仅考虑如何避免攻击攻击，但忽略了沟通波形或频率动作可以由干扰器获得。尽管现有的反犯罪方法可以保证暂时的沟通效果，但是当智能干扰者能够从历史交流活动中学习时，这些反判断方法的长期性能可能会抑制。针对这个问题，提出了一种基于减少干扰者意识概率的概念的隐藏抗界方法。首先，通过计算干扰器和用户的动作之间的相关性来获得干扰器的传感概率。后来，设计了一个深厚的增强学习框架，该框架不仅旨在最大程度地提高通信吞吐量，而且还旨在最大程度地减少干扰器与用户之间的动作相关性。最后，提出了一种隐藏的抗杀伤算法，该算法将瞬时回报与用户的通信质量以及用户与干扰器之间的相关性联系起来。仿真结果表明，与仅考虑避免避免干扰的当前算法相比，所提出的算法不仅避免了干扰器感应的算法，而且还可以改善其抗界面性能。

Most of the current anti-jamming algorithms for wireless communications only consider how to avoid jamming attacks, but ignore that the communication waveform or frequency action may be obtained by the jammers. Although existing anti-jamming methods can guarantee temporary communication effects, the long-term performance of these anti-jamming methods may be depressed when intelligent jammers are capable of learning from historical communication activities. Aiming at this issue, a hidden anti-jamming method based on the idea of reducing the jammer's sense probability is proposed. Firstly, the sensing probability of the jammer is obtained by calculating the correlation between the actions of the jammer and the user. Later, a deep reinforcement learning framework is designed, which aims at not only maximizing the communication throughput but also minimizing the action's correlation between the jammer and the user. Finally, a hidden anti-jamming algorithm is proposed, which links the instantaneous return with the communication quality of users and the correlation between users and jammer. The simulation result shows that the proposed algorithm not only avoids being sensed by the jammer but also improves its anti-jamming performance compared to the current algorithm that only considers jamming avoidance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题