Semantic Segmentation and Depth Estimation of Urban Road Scene Images Using Multi-Task Networks

  • Mohamed Mahyoub
  • , Friska Natalia
  • , Sud Sudirman
  • , Abdulmajeed Hammadi Jasim Al-Jumaily
  • , Panos Liatsis

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Scopus citations

    Abstract

    In autonomous driving, environment perception is an important step in understanding the driving scene. Objects in images captured through a vehicle camera can be detected and classified using semantic segmentation and depth estimation methods. Both these tasks are closely related to each other and this association helps in building a multi-Task neural network where a single network is used to generate both views from a given monocular image. This approach gives the flexibility to include multiple related tasks in a single network. It helps reduce multiple independent networks and improve the performance of all related tasks. The main aim of our research presented in this paper is to build a multi-Task deep learning network for simultaneous semantic segmentation and depth estimation from monocular images. Two decoder-focused U-N et-based multi-Task networks that use a pre-Trained Resnet-50 and DenseNet-121 which shared encoder and task-specific decoder networks with Attention Mechanisms are considered. We also employed multi-Task optimization strategies such as equal weighting and dynamic weight averaging during the training of the models. The corresponding models' performance is evaluated using mean IoU for semantic segmentation and Root Mean Square Error for depth estimation. From our experiments, we found that the performance of these multi-Task networks is on par with the corresponding single-Task networks.

    Original languageBritish English
    Title of host publicationDeSE 2023 - Proceedings
    Subtitle of host publication15th International Conference on Developments in eSystems Engineering
    EditorsDhiya Al-Jumeily, Header Abed Dhahad, Manj Jayabalan, Jade Hind, Jamila Mustafina, Sulaf Assi, Abir Hussain, Hissam Tawfik
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages469-474
    Number of pages6
    ISBN (Electronic)9798350335149
    DOIs
    StatePublished - 2023
    Event15th International Conference on Developments in eSystems Engineering, DeSE 2023 - Baghdad, Iraq
    Duration: 9 Jan 202312 Jan 2023

    Publication series

    NameProceedings - International Conference on Developments in eSystems Engineering, DeSE
    Volume2023-January
    ISSN (Print)2161-1343

    Conference

    Conference15th International Conference on Developments in eSystems Engineering, DeSE 2023
    Country/TerritoryIraq
    CityBaghdad
    Period9/01/2312/01/23

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 11 - Sustainable Cities and Communities
      SDG 11 Sustainable Cities and Communities

    Keywords

    • Deep Learning
    • Depth Estimation
    • Multi-Task Networks
    • Semantic Segmentation
    • Urban Road Scene Analysis

    Fingerprint

    Dive into the research topics of 'Semantic Segmentation and Depth Estimation of Urban Road Scene Images Using Multi-Task Networks'. Together they form a unique fingerprint.

    Cite this