Abstract:Deep Reinforcement Learning (DRL) provides a new technological paradigm for the intelligent flight control of unmanned aerial vehicles. However, confidence in the "black box" Artificial Neural Network (ANN) intelligent model is the main obstacle to practical application. To validate the neural network-based intelligent flight control model designed with DRL through flight test, a longitudinal end-to-end intelligent flight control model that maps the flight state to the elevator/thrust commands is developed for a fixed-wing scaled model aircraft, based on the multi-dimensional continuous state input and action output DRL Proximal Policy Optimization (PPO) algorithm. The robustness of ANN control model is validated through the simulation, and its engineering implementation for the sim-to-real transfer is further carried out. A flexible onboard ANN controller is developed and the model flight demonstration is launched. The test results preliminarily verify the applicability and generalization performance of the ANN controller.