A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making

Xie, RongLei; Huang, ChengJing; Wang, ZiYi; Han, Jin

doi:10.1007/978-981-99-0479-2_359

RongLei Xie⁴⁰,
ChengJing Huang⁴⁰,
ZiYi Wang⁴⁰ &
…
Jin Han⁴¹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1010))

Included in the following conference series:

International Conference on Autonomous Unmanned Systems

41 Accesses
1 Citations

Abstract

Aiming at the problem of difficult convergence of three-dimensional space UAV air game, a deep reinforcement learning air game decision-making algorithm based on improved action strategy is proposed. The main contributions of the paper are as follows: First, a heuristic reward function is designed by introducing situational information such as angle and speed, which alleviates the problem of difficult convergence caused by sparse rewards. Second, an action selection strategy based on short-term advantage is designed to avoid a large number of meaningless repetitions data. Simulation experiments show that the decision-making algorithm based on deep reinforcement learning can avoid risks and occupy a favorable position in different initial situations. The proposed improvement mechanism can effectively improve the convergence efficiency of the algorithm and the success rate of the game.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 709.00; Price excludes VAT (USA)

Softcover Book: USD 899.99; Price excludes VAT (USA)

Hardcover Book: USD 899.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A MADDPG-based multi-agent antagonistic algorithm for sea battlefield confrontation

Article 13 April 2022

Research on Action Strategies and Simulations of DRL and MCTS-based Intelligent Round Game

Article 16 June 2021

Accelerating Spatio-Temporal Deep Reinforcement Learning Model for Game Strategy

References

Raivio, T.: Capture set computation of an optimally guided missile. J. Guid. Control. Dyn. 24(6), 1167–1175 (2001)
Article Google Scholar
Virtanen, K., Raivio, T., Hamalainen, R.P.: Modeling pilot’s sequential maneuvering decisions by a multistage influence diagram. J. Guid. Control Dyn. 27(4), 665–677 (2004)
Article Google Scholar
Wang, R., Gao, Z.: Research on decision system in air game simulation using maneuver library. Flight Dyn. 27(6), 72–75 (2009)
Google Scholar
Ernest, N., Cohen, K., Kivelevitch, E., et al.: Genetic fuzzy trees and their application towards autonomous training and control of a squadron of unmanned game aerial vehicles. Unmanned Syst. 3(3), 185–204 (2015)
Article Google Scholar
Huang, C.Q., Dong, K.S., Huang, H.Q., et al.: Autonomous air game maneuver decision using Bayesian inference and moving horizon optimization. J. Syst. Eng. Electron. 29(1), 86–97 (2018)
Article Google Scholar
Ruan, C., Kou, Y., et al.: Research on single step manuvering decision air game based on binary FuzzV comparison method. Command Control Simul. 34(5), 10–13 (2012)
Google Scholar
Liu, P., Ma, Y.: A deep reinforcement learning based intelligent decision method for UCAV air combat. In: Mohamed Ali, M.S., Wahid, H., Mohd Subha, N.A., Sahlan, S., Md. Yunus, M.A., Wahap, A.R. (eds.) AsiaSim 2017. CCIS, vol. 751, pp. 274–286. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-6463-0_24
Chapter Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Playing Atari with Deep Reinforcement Learning[EB/OL]. https://arxiv.xilesou.top/pdf/1312.5602 (2013)
Kurniawan, B., Vamplew, P., Papasimeon, M., et al.: An empirical study of reward structures for actor-critic reinforcement learning in air game manoeuvring simulation. In: AI 2019: Advances in Artificial Intelligence, 32nd Australasian Joint Conference, Adelaide, SA, Australia (2019)
Google Scholar
Zhou, P., Huang, J.T., Zhang, S., et al.: Research on UAV intelligent air game decision and simulation based on deep reinforcement learning. Acta Aeronautica et Astronautica Sinica 43, 126731 (2022). (in Chinese). https://doi.org/10.7527/S1000-6893.2022.26731
Mao, M.Y., Zhang, A., Zhou, D., et al.: Reinforcement learning of UCAV air game based on maneuver prediction. Electron. Opt. Control 26(2), 5–10 22 (2019)
Google Scholar
Kong, W., Zhou, D., Zhao, Y., Yang, W.: Maneuvering strategy generation algorithm for multi-UAV in close-range air game based on deep reinforcement learning and self-play. Control Theory Appl. 39(2), 352–362 (2022)
Google Scholar
Hambling, D.: AI outguns a human fighter pilot. New Sci. 247(3297), 12 (2020)
Article Google Scholar
Austin, F., Carbone, G., Falco, M., et al.: Automated maneuvering decisions for air-to-air game. In: Guidance, Navigation and Control Conference, p. 2393. AIAA, Monterey (1987)
Google Scholar

Download references

Author information

Authors and Affiliations

HIWING Technology Academy of CASIC, Beijing, 100074, China
RongLei Xie, ChengJing Huang & ZiYi Wang
Intelligent Science and Technology, Academy Limited of CASIC, Beijing, 100074, China
Jin Han

Authors

RongLei Xie
View author publications
You can also search for this author in PubMed Google Scholar
ChengJing Huang
View author publications
You can also search for this author in PubMed Google Scholar
ZiYi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jin Han
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to ChengJing Huang .

Editor information

Editors and Affiliations

Unmanned System Research Institute, Northwestern Polytechnical University, Xi'an, Shaanxi, China
Wenxing Fu
Beijing HIWING Scientific and Technological Information Institute, Beijing, China
Mancang Gu
College of Intelligence Science and Technology, National University of Defense Technology, Changsha, Hunan, China
Yifeng Niu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xie, R., Huang, C., Wang, Z., Han, J. (2023). A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making. In: Fu, W., Gu, M., Niu, Y. (eds) Proceedings of 2022 International Conference on Autonomous Unmanned Systems (ICAUS 2022). ICAUS 2022. Lecture Notes in Electrical Engineering, vol 1010. Springer, Singapore. https://doi.org/10.1007/978-981-99-0479-2_359

Download citation

DOI: https://doi.org/10.1007/978-981-99-0479-2_359
Published: 10 March 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0478-5
Online ISBN: 978-981-99-0479-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics