RECURRENT 3-D MULTI-LEVEL VISUAL TRANSFORMER FOR JOINT CLASSIFICATION OF HETEROGENEOUS 2-D AND 3-D RADIOGRAPHIC DATA

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Recent advancements in artificial intelligence algorithms for medical imaging show significant potential in automating the detection of lung infections from chest radiograph scans. However, current approaches often focus solely on either 2-D or 3-D scans, failing to leverage the combined advantages of both modalities. Moreover, conventional slice-based methods place a manual burden on radiologists for slice selection. To overcome these challenges, we propose the Recurrent 3-D Multi-level Vision Transformer (R3DM-ViT) model, capable of handling multimodal data to enhance diagnostic accuracy. Our quantitative evaluations demonstrate that R3DM-ViT surpasses existing methods, achieving an impressive accuracy of 96.67%, F1-score of 96.88%, mean average precision of 96.75%, and mean average recall of 97.02%. This research signifies a significant stride forward in the automated detection of lung infections through multimodal imaging.

Original languageBritish English
Title of host publication2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PublisherIEEE Computer Society
Pages3205-3211
Number of pages7
ISBN (Electronic)9798350349399
DOIs
StatePublished - 2024
Event31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates
Duration: 27 Oct 202430 Oct 2024

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880

Conference

Conference31st IEEE International Conference on Image Processing, ICIP 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period27/10/2430/10/24

Keywords

  • CBMIR
  • Computer-aided diagnosis
  • lung infection
  • Medical image retrieval
  • R3DM-ViT

Fingerprint

Dive into the research topics of 'RECURRENT 3-D MULTI-LEVEL VISUAL TRANSFORMER FOR JOINT CLASSIFICATION OF HETEROGENEOUS 2-D AND 3-D RADIOGRAPHIC DATA'. Together they form a unique fingerprint.

Cite this