Intelligent Anti-Jamming Decision Algorithm of Bivariate Frequency Hopping Pattern Based on DQN With PER and Pareto

Intelligent Anti-Jamming Decision Algorithm of Bivariate Frequency Hopping Pattern Based on DQN With PER and Pareto

Jiasheng Zhu, Zhijin Zhao, Shilian Zheng
DOI: 10.4018/IJITWE.297970
Article PDF Download
Open access articles are freely available for download

Abstract

To improve the anti-jamming performance of frequency hopping system in complex electromagnetic environment, a Deep Q-Network algorithm with priority experience replay (PER) based on Pareto samples (PPER-DQN) is proposed, which makes intelligent decision for bivariate FH pattern. The system model, state-action space and reward function are designed based on the main parameters of the FH pattern. The DQN is used to improve the flexibility of the FH pattern. Based on the definition of Pareto dominance, the PER based on the TD-error and immediate reward is proposed. To ensure the diversity of the training set, it is formed by Pareto sample set and several random samples. When selecting Pareto sample, the confidence coefficient is introduced to modify its priority. It guarantees the learning value of the training set and improves the learning efficiency of DQN. The simulation results show that the efficiency, convergence speed and stability of the algorithm are effectively improved. And the generated bivariate FH pattern has better performance than the conventional FH pattern.
Article Preview
Top

Introduction

Frequency hopping communication system, as a traditional anti-interference technology, has strong anti-jamming ability. In the past few decades, based on the traditional frequency hopping communication technology, differential frequency hopping, multi sequence frequency hopping, variable speed frequency hopping, cognitive frequency hopping and other technologies have been developed. It can be found that most of these technologies realize anti-interference by timely and appropriately adjusting the important parameters which have a great impact on the performance of frequency hopping system. However, under the influence of increasingly complex electromagnetic environment and gradually intelligent interference strategy, these frequency hopping communication technologies can no longer meet the communication needs.

As the most important parameter of frequency hopping communication system, the research on frequency hopping pattern has been promoted all the time. Researchers often screen frequency points on the basis of accurate spectrum sensing results, and generate frequency hopping patterns in various ways. However, these methods have limited effect under the complex electromagnetic environment. In order to ensure the quality of frequency hopping communication, the application of more intelligent anti-interference technology in the design of frequency hopping pattern is necessary.

With the development of machine learning technology, the anti-interference technology is becoming more and more intelligent. As an important branch of machine learning, reinforcement learning is an algorithm based on Markov decision process (MDP). Its essence is the process in which agents constantly interact with the environment. Reinforcement learning interacts with the environment through an agent with learning ability. The agent selects and executes a certain action according to the current state and the learned environmental experience, and transfer to a new state under the joint action of the action and the environment. At the same time, the environment will also feedback certain rewards or punishments according to the current state and the actions taken. The agent updates the cognition of the current environment through rewards or punishments, and makes subsequent decisions. After completing the learning, the agent will obtain an environment-state-action mapping relationship. It is used to guide the agent to select the appropriate action based on the state in the actual decision-making process and obtain the maximum cumulative reward value from the environment. Therefore, reinforcement learning is suitable for solving intelligent decision-making problems in complex environments.

Complete Article List

Search this Journal:
Reset
Volume 19: 1 Issue (2024)
Volume 18: 1 Issue (2023)
Volume 17: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 16: 4 Issues (2021)
Volume 15: 4 Issues (2020)
Volume 14: 4 Issues (2019)
Volume 13: 4 Issues (2018)
Volume 12: 4 Issues (2017)
Volume 11: 4 Issues (2016)
Volume 10: 4 Issues (2015)
Volume 9: 4 Issues (2014)
Volume 8: 4 Issues (2013)
Volume 7: 4 Issues (2012)
Volume 6: 4 Issues (2011)
Volume 5: 4 Issues (2010)
Volume 4: 4 Issues (2009)
Volume 3: 4 Issues (2008)
Volume 2: 4 Issues (2007)
Volume 1: 4 Issues (2006)
View Complete Journal Contents Listing