Evaluating Visual-Selective Visual-Inertial Odometry: An End-to-End Multi-Modal Pose Estimation Framework for Underwater Environments

Vidya Sudevan, Fakhreddine Zayer, Sajid Javed, Hamad Karki, Giulia De Masi, Jorge Dias

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    This paper presents an evaluation of the performance of Visual-Selective Visual-Inertial Odometry (VS-VIO), a hybrid learning-based multi-modal pose estimation framework, in the challenging underwater domain. The assessment is based on Root Mean Square (RMSE) scores for translation and rotation vectors, compared to their reference values. The underwater environment, characterized by low lighting and high turbidity due to suspended particles, poses significant challenges for pose estimation. Understanding how hybrid learning-based multi-modal frameworks perform in such conditions is crucial for improving underwater navigation and exploration. In this study, we thoroughly analyze the performance of VS-VIO and its baseline model at the sub-sequence level, focusing on pose error. Additionally, we assess various technical aspects during the inference phase, including inference speed, power consumption, GPU utilization, GPU memory usage, and temperature. All evaluations are conducted using the AQUALOC dataset. Our findings reveal that the policy network within VS-VIO exhibits the ability to dynamically reduce the utilization of the visual modality while maintaining pose estimation accuracy. However, our analysis shows no statistically significant reduction in the percentage of visual modality usage when altering the penalty factor. These insights provide valuable guidelines for enhancing the performance of hybrid learning-based multimodal pose estimation frameworks in challenging underwater environments, contributing to advancements in underwater navigation and exploration technologies.

    Original languageBritish English
    Title of host publication2023 21st International Conference on Advanced Robotics, ICAR 2023
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages639-644
    Number of pages6
    ISBN (Electronic)9798350342291
    DOIs
    StatePublished - 2023
    Event21st International Conference on Advanced Robotics, ICAR 2023 - Abu Dhabi, United Arab Emirates
    Duration: 5 Dec 20238 Dec 2023

    Publication series

    Name2023 21st International Conference on Advanced Robotics, ICAR 2023

    Conference

    Conference21st International Conference on Advanced Robotics, ICAR 2023
    Country/TerritoryUnited Arab Emirates
    CityAbu Dhabi
    Period5/12/238/12/23

    Keywords

    • Hybrid CNN-LSTM Framework
    • Multi-Modal Multi-Rate Data Fusion
    • Pose Estimation
    • Underwater Robotics
    • Visual-Inertial Odometry (VIO)

    Fingerprint

    Dive into the research topics of 'Evaluating Visual-Selective Visual-Inertial Odometry: An End-to-End Multi-Modal Pose Estimation Framework for Underwater Environments'. Together they form a unique fingerprint.

    Cite this