The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Unmanned Aerial Vehicles

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

This paper investigates the simulation to reality gap in reinforcement learning (RL) applied to Unmanned Aerial Vehicles (UAVs) with fractional delays in the system (i.e., delays which are non-integer multiple of the sampling period). The consideration of delay has a substantial effect on the nature of the UAV system being studied. Systems with the presence of delays are considered non-Markovian, and the system state vector must be extended to make the system Markovian. Based on this analysis, we presented a sampling scheme that yields efficient RL training of agents that perform well in real-world UAVS deployment. We show that the Markovian system-trained agents do not exhibit excessive oscillations, in contrast to the agent that doesn't consider time delay in the training model. Our methodology for robust low-level control of UAV hovering mode has been validated using real-world experiments. Furthermore, real-world experiments show a qualitative match with a simulation which validates the proposed theoretical framework. A video summary of this paper can be watched in https://www.youtube.com/watch?v=1BSAA7usfK0

Original languageBritish English
Title of host publication2023 21st International Conference on Advanced Robotics, ICAR 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages514-519
Number of pages6
ISBN (Electronic)9798350342291
DOIs
StatePublished - 2023
Event21st International Conference on Advanced Robotics, ICAR 2023 - Abu Dhabi, United Arab Emirates
Duration: 5 Dec 20238 Dec 2023

Publication series

Name2023 21st International Conference on Advanced Robotics, ICAR 2023

Conference

Conference21st International Conference on Advanced Robotics, ICAR 2023
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period5/12/238/12/23

Fingerprint

Dive into the research topics of 'The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Unmanned Aerial Vehicles'. Together they form a unique fingerprint.

Cite this