Skip to main content

A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making

  • Conference paper
  • First Online:
Proceedings of 2022 International Conference on Autonomous Unmanned Systems (ICAUS 2022) (ICAUS 2022)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1010))

Included in the following conference series:

Abstract

Aiming at the problem of difficult convergence of three-dimensional space UAV air game, a deep reinforcement learning air game decision-making algorithm based on improved action strategy is proposed. The main contributions of the paper are as follows: First, a heuristic reward function is designed by introducing situational information such as angle and speed, which alleviates the problem of difficult convergence caused by sparse rewards. Second, an action selection strategy based on short-term advantage is designed to avoid a large number of meaningless repetitions data. Simulation experiments show that the decision-making algorithm based on deep reinforcement learning can avoid risks and occupy a favorable position in different initial situations. The proposed improvement mechanism can effectively improve the convergence efficiency of the algorithm and the success rate of the game.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 709.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 899.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 899.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Raivio, T.: Capture set computation of an optimally guided missile. J. Guid. Control. Dyn. 24(6), 1167–1175 (2001)

    Article  Google Scholar 

  2. Virtanen, K., Raivio, T., Hamalainen, R.P.: Modeling pilot’s sequential maneuvering decisions by a multistage influence diagram. J. Guid. Control Dyn. 27(4), 665–677 (2004)

    Article  Google Scholar 

  3. Wang, R., Gao, Z.: Research on decision system in air game simulation using maneuver library. Flight Dyn. 27(6), 72–75 (2009)

    Google Scholar 

  4. Ernest, N., Cohen, K., Kivelevitch, E., et al.: Genetic fuzzy trees and their application towards autonomous training and control of a squadron of unmanned game aerial vehicles. Unmanned Syst. 3(3), 185–204 (2015)

    Article  Google Scholar 

  5. Huang, C.Q., Dong, K.S., Huang, H.Q., et al.: Autonomous air game maneuver decision using Bayesian inference and moving horizon optimization. J. Syst. Eng. Electron. 29(1), 86–97 (2018)

    Article  Google Scholar 

  6. Ruan, C., Kou, Y., et al.: Research on single step manuvering decision air game based on binary FuzzV comparison method. Command Control Simul. 34(5), 10–13 (2012)

    Google Scholar 

  7. Liu, P., Ma, Y.: A deep reinforcement learning based intelligent decision method for UCAV air combat. In: Mohamed Ali, M.S., Wahid, H., Mohd Subha, N.A., Sahlan, S., Md. Yunus, M.A., Wahap, A.R. (eds.) AsiaSim 2017. CCIS, vol. 751, pp. 274–286. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-6463-0_24

    Chapter  Google Scholar 

  8. Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Playing Atari with Deep Reinforcement Learning[EB/OL]. https://arxiv.xilesou.top/pdf/1312.5602 (2013)

  9. Kurniawan, B., Vamplew, P., Papasimeon, M., et al.: An empirical study of reward structures for actor-critic reinforcement learning in air game manoeuvring simulation. In: AI 2019: Advances in Artificial Intelligence, 32nd Australasian Joint Conference, Adelaide, SA, Australia (2019)

    Google Scholar 

  10. Zhou, P., Huang, J.T., Zhang, S., et al.: Research on UAV intelligent air game decision and simulation based on deep reinforcement learning. Acta Aeronautica et Astronautica Sinica 43, 126731 (2022). (in Chinese). https://doi.org/10.7527/S1000-6893.2022.26731

  11. Mao, M.Y., Zhang, A., Zhou, D., et al.: Reinforcement learning of UCAV air game based on maneuver prediction. Electron. Opt. Control 26(2), 5–10 22 (2019)

    Google Scholar 

  12. Kong, W., Zhou, D., Zhao, Y., Yang, W.: Maneuvering strategy generation algorithm for multi-UAV in close-range air game based on deep reinforcement learning and self-play. Control Theory Appl. 39(2), 352–362 (2022)

    Google Scholar 

  13. Hambling, D.: AI outguns a human fighter pilot. New Sci. 247(3297), 12 (2020)

    Article  Google Scholar 

  14. Austin, F., Carbone, G., Falco, M., et al.: Automated maneuvering decisions for air-to-air game. In: Guidance, Navigation and Control Conference, p. 2393. AIAA, Monterey (1987)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to ChengJing Huang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Beijing HIWING Sci. and Tech. Info Inst

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xie, R., Huang, C., Wang, Z., Han, J. (2023). A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making. In: Fu, W., Gu, M., Niu, Y. (eds) Proceedings of 2022 International Conference on Autonomous Unmanned Systems (ICAUS 2022). ICAUS 2022. Lecture Notes in Electrical Engineering, vol 1010. Springer, Singapore. https://doi.org/10.1007/978-981-99-0479-2_359

Download citation

Publish with us

Policies and ethics