Abstract:Aiming at the problem of no-fly zone avoidance guidance in the reentry phase for near space vehicle, the Markov decision process (MDP) model of lateral guidance in the reentry process for near space vehicle was constructed. On the basis of dueling deep Q network (Dueling DQN), the lateral guidance law and the environmental reward feedback function to satisfy the range requirement and the no-fly zone avoidance requirement were designed. The simulation results show that the lateral guidance law can avoid the no-fly zone by changing the sign of roll angle, and guide the aircraft to the target area with high precision, which verifies the effectiveness of the method.