Leveraging optimal control demonstrations in reinforcement learning for powered descent

Callum Wilson,Annalisa Riccardi

Leveraging optimal control demonstrations in reinforcement learning for powered descent

2021

Callum Wilson
Annalisa Riccardi

This work presents an approach to deriving a controller for spacecraft powered descent using reinforcement learning. To assist in the learning process, our approach uses optimal control demonstrations which provide open-loop control for optimal trajectories. Combining these approaches to use the optimal trajectories as demonstrations helps to overcome issues with convergence on desirable policies in the reinforcement learning problem. We demonstrate the applicability of this approach on a simulated 3-DOF Mars lander. The results show that the learned controller is capable of achieving a pinpoint soft landing from a range of initial conditions. Compared to the open-loop optimal trajectories alone, this controller generalises to more initial conditions and can cope with environmental uncertainties.

Keywords:

Convergence (routing)
Reinforcement learning
Intelligent control
Control theory
Computer science
Mars landing
Control engineering
Range (aeronautics)
Optimal control
Soft landing

Correction
Source
Cite
Save

References

Citations