Near Space Vehicle Reentry Trajectory Planning Based on Dueling DQN

doi:10.3969/j.issn.1671-0576.2024.02.001

Affiliations: 1. Shanghai Electro-Mechanical Engineering Institute, Shanghai 201109, China; 2. Shanghai Aerospace Electronic Technology Institute, Shanghai 201109, China
About the author: Tian Ruocen (b. 1997), male, M.S., assistant engineer; mainly engaged in research on guidance and control technology.
CLC number: V448.235


Abstract:

To address no-fly-zone avoidance guidance during the reentry phase of near-space vehicles, a Markov decision process (MDP) model of lateral guidance during reentry is constructed. Based on the dueling deep Q-network (Dueling DQN), a lateral guidance law is designed, together with a reentry reward function that accounts for both the range requirement and the no-fly-zone avoidance requirement. Simulation results show that the lateral guidance law avoids no-fly zones by switching the sign of the bank angle and guides the vehicle to the target area with high accuracy, which verifies the effectiveness of the method.
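For orientation, the value aggregation that gives Dueling DQN its name can be sketched in a few lines. A dueling network outputs a state value V(s) and per-action advantages A(s, a), and combines them as Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a'). The abstract does not give the network architecture, state variables, or action set, so the sketch below assumes a hypothetical two-action set (bank-angle sign +1 or -1) and takes V and A as plain numbers rather than network outputs. It illustrates the aggregation step only, not the authors' guidance law.

```python
from typing import List, Sequence

# Assumption for illustration: the agent chooses only the sign of the bank angle.
BANK_SIGNS = (+1, -1)


def dueling_q_values(state_value: float, advantages: Sequence[float]) -> List[float]:
    """Combine V(s) and A(s, a) into Q(s, a) using the mean-advantage baseline."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + adv - mean_adv for adv in advantages]


def greedy_bank_sign(state_value: float, advantages: Sequence[float]) -> int:
    """Return the bank-angle sign whose Q value is largest (greedy policy)."""
    q = dueling_q_values(state_value, advantages)
    best_index = max(range(len(q)), key=q.__getitem__)
    return BANK_SIGNS[best_index]
```

Subtracting the mean advantage makes the split between V and A identifiable: adding a constant to every advantage leaves the resulting Q values, and therefore the selected sign, unchanged.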


Cite this article

Tian Ruocen, Liu Yiji, Xiao Tao, et al. Near Space Vehicle Reentry Trajectory Planning Based on Dueling DQN[J]. Guidance & Fuze (制导与引信), 2024, (2): 1-10


History

Received: 2023-12-11
Published online: 2024-06-26
