The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Unmanned Aerial Vehicles

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    This paper investigates the simulation to reality gap in reinforcement learning (RL) applied to Unmanned Aerial Vehicles (UAVs) with fractional delays in the system (i.e., delays which are non-integer multiple of the sampling period). The consideration of delay has a substantial effect on the nature of the UAV system being studied. Systems with the presence of delays are considered non-Markovian, and the system state vector must be extended to make the system Markovian. Based on this analysis, we presented a sampling scheme that yields efficient RL training of agents that perform well in real-world UAVS deployment. We show that the Markovian system-trained agents do not exhibit excessive oscillations, in contrast to the agent that doesn't consider time delay in the training model. Our methodology for robust low-level control of UAV hovering mode has been validated using real-world experiments. Furthermore, real-world experiments show a qualitative match with a simulation which validates the proposed theoretical framework. A video summary of this paper can be watched in https://www.youtube.com/watch?v=1BSAA7usfK0

    Original languageBritish English
    Title of host publication2023 21st International Conference on Advanced Robotics, ICAR 2023
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages514-519
    Number of pages6
    ISBN (Electronic)9798350342291
    DOIs
    StatePublished - 2023
    Event21st International Conference on Advanced Robotics, ICAR 2023 - Abu Dhabi, United Arab Emirates
    Duration: 5 Dec 20238 Dec 2023

    Publication series

    Name2023 21st International Conference on Advanced Robotics, ICAR 2023

    Conference

    Conference21st International Conference on Advanced Robotics, ICAR 2023
    Country/TerritoryUnited Arab Emirates
    CityAbu Dhabi
    Period5/12/238/12/23

    Fingerprint

    Dive into the research topics of 'The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Unmanned Aerial Vehicles'. Together they form a unique fingerprint.

    Cite this