Safe Motion Planning and Learning for Unmanned Aerial Systems

Perk, Baris; İnalhan, Gökhan

doi:10.3390/aerospace9020056

Safe Motion Planning and Learning for Unmanned Aerial Systems

Atıf İçin Kopyala

Perk B. E., İnalhan G.

AEROSPACE, cilt.9, sa.2, 2022 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 9 Sayı: 2
Basım Tarihi: 2022
Doi Numarası: 10.3390/aerospace9020056
Dergi Adı: AEROSPACE
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Anahtar Kelimeler: UAV, artificial intelligence, contraction theory, nonlinear control, primitives, reinforcement learning, imitation learning, maneuvers, TRAJECTORY TRACKING, NONLINEAR-SYSTEMS, STABILITY
İstanbul Teknik Üniversitesi Adresli: Evet

Özet

To control unmanned aerial systems, we rarely have a perfect system model. Safe and aggressive planning is also challenging for nonlinear and under-actuated systems. Expert pilots, however, demonstrate maneuvers that are deemed at the edge of plane envelope. Inspired by biological systems, in this paper, we introduce a framework that leverages methods in the field of control theory and reinforcement learning to generate feasible, possibly aggressive, trajectories. For the control policies, Dynamic Movement Primitives (DMPs) imitate pilot-induced primitives, and DMPs are combined in parallel to generate trajectories to reach original or different goal points. The stability properties of DMPs and their overall systems are analyzed using contraction theory. For reinforcement learning, Policy Improvement with Path Integrals (PI2) was used for the maneuvers. The results in this paper show that PI2 updated policies are a feasible and parallel combination of different updated primitives transfer the learning in the contraction regions. Our proposed methodology can be used to imitate, reshape, and improve feasible, possibly aggressive, maneuvers. In addition, we can exploit trajectories generated by optimization methods, such as Model Predictive Control (MPC), and a library of maneuvers can be instantly generated. For application, 3-DOF (degrees of freedom) Helicopter and 2D-UAV (unmanned aerial vehicle) models are utilized to demonstrate the main results.