期刊文献+

知识驱动的智能博弈对抗行动序列规划方法

Knowledge Driven Course of Action Planning for Intelligent Game Confrontation
在线阅读 下载PDF
导出
摘要 针对基于深度强化学习方法解决实际博弈对抗序列规划问题中存在的探索-利用矛盾、奖赏信号稀疏、数据利用率低、难以稳定收敛等问题,分析了基于知识的学习型智能生成模式,提出基于知识驱动的方法,从用规则教、从数据中学、用问题引导等方面构建了智能博弈对抗行动序列规划模型,为提升探索-利用效率、精准奖励函数、加速算法收敛提供了理论支撑。对基于强化学习的智能博弈对抗问题求解的难点问题进行了讨论,指出下一步深度强化学习算法走向实用的发展方向。 Aiming at the problems of conflict between exploration and utilization,sparse reward signals,low data utilization rate,and difficulty in stable convergence in solving the practical course of action planning for Intelligent Game Confrontation based on deep reinforcement learning.The knowledge-based type-learning intelligent generation mode is analyzed,and the knowledge driven method is proposed.The course of planning model of intelligent game confrontation from the aspects of rule-based teaching,data-based learning and problem-based guidance and other aspects is constructed,which provides theoretical support for improving the exploration utilization efficiency,accurate reward function and accelerating algorithm convergence.The difficult problems of solving the intelligent game confrontation problem based on reinforcement learning are discussed,and the more practical development direction of the next step deep enforcement learning algorithm is pointed out.
作者 陈希亮 曹雷 康凯 李晨溪 CHEN Xiliang;CAO Lei;KANG Kai;LI Chenxi(College of Command and Control Engineering,Army Engineering University,Nanjing 210007,China;Unit 31108 of PLA,Nanjing 210007,China)
出处 《指挥与控制学报》 CSCD 北大核心 2024年第4期509-515,共7页 Journal of Command and Control
基金 国家自然科学基金(62273356)资助。
关键词 深度强化学习 博弈对抗 知识驱动 行动序列规划 deep reinforcement learning intelligent game confrontation knowledge driven course of action planning
分类号 E91 [军事]
  • 相关文献

参考文献3

二级参考文献139

  • 1唐金国.美军任务规划系统的现状、发展和关键技术[J].军事运筹与系统工程,2003,17(3):62-64. 被引量:22
  • 2教材编写组.运筹学[M].北京:清华大学出版社,1982:10.
  • 3亨利法约尔.工业管理与一般管理[M].周安华译.北京:中国社会科学出版社,1982:46.
  • 4彼得德鲁克.管理:任务、责任、实践[M].孙耀君译.北京:中国社会科学出版社,1987.
  • 5黄培生.美军研制新型空中作战管理系统[EB/OL].[2015-09-05].http://mil.news.sina.com.cn/2004-01-11/0950176579.html.
  • 6李有观.盘点各国舰艇作战管理系统[EB/OL].[2015-09-01].http://www.china.com.cn/military/2015-08/07/content_36250759.html.
  • 7KERR B. DARPA demos Deep Green [EB/OL].( 201 1-04-07) [2016-05-10]. http://www, ftleave worthlamp, com/ article/ 2011040 7 / NEWS/ 3040 7 9884.
  • 8SURDU J R. Deep Green[EB/OL]. (2008-05-08) [2016-05-10]. http://www, darpa, mil.
  • 9SURDU J R, KITTKA K. The Deep Green concept [C]//Proceedings of Spring Simulation Multiconfer- ence 2008 Conference on Military Modelling and Simu- lation Symposium. Ottawa:Spring, 2008 : 623-631.
  • 10SURDU J R, STERRETT J, LUNSFORD J. The gaming debate[J]. Training - Simulation Journal, 2010(12) .- 46-48.

共引文献213

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部