CLIFS: CLIP-DRIVEN FEW-SHOT LEARNING FOR BAGGAGE THREAT CLASSIFICATION

Abdelfatah Ahmed, Divya Velayudhan, Mahmoud ElMezain, Muaz Khalifa Alradi, Abderrahmene Boudiaf, Taimur Hassan, Mohamed Deriche, Mohammed Bennamoun, Naoufel Werghi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Baggage screening in airports is a cornerstone in airport security measures. The advent of computer vision technologies in recent years has led to the development of several automated systems for identifying security threats in baggage scans. However, existing methods struggle to adapt to new threat categories when faced with a scarcity of data samples, and the rapid emergence of new threats. Hence, in this paper, we propose a novel CLIP-driven few-shot framework (CLIFS) to explore the potential of multi-modality using text-image fusion through contrastive learning to learn relevant contextual features for recognizing security threats with limited samples. By integrating features from GPT-4 generated captions with image features, CLIFS leverages both visual and textual data to significantly improve threat classification performance with limited samples in a few-shot learning context. Our proposed CLIFS was rigorously tested on the SIXray public available baggage X-ray dataset, where it outperformed state-of-the-art by 31.3% in accuracy and 28.40% in F1-score for the challenging 5-shots scenario, demonstrating its robustness and effectiveness in classifying threats from limited data samples.

Original languageBritish English
Title of host publication2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PublisherIEEE Computer Society
Pages753-759
Number of pages7
ISBN (Electronic)9798350349399
DOIs
StatePublished - 2024
Event31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates
Duration: 27 Oct 202430 Oct 2024

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880

Conference

Conference31st IEEE International Conference on Image Processing, ICIP 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period27/10/2430/10/24

Keywords

  • Baggage threat classification
  • Contrastive Language Image Pretraining
  • Contrastive Loss
  • Few-shot learning
  • Vision-Langugae Model

Fingerprint

Dive into the research topics of 'CLIFS: CLIP-DRIVEN FEW-SHOT LEARNING FOR BAGGAGE THREAT CLASSIFICATION'. Together they form a unique fingerprint.

Cite this