SMO-CLIP: ENHANCING ANOMALOUS SMOKE DENSITY ASSESSMENT USING A HYBRID LLM-VLM APPROACH

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Flare stacks are among the crucial components in the safety and emission control of petrochemical plants. However, due to the imperceptibility of smoke and contaminants, analyzing these released particles during flare stack operation is one of the top challenges. To stress the problem, our work presents a novel solution called SMO-CLIP that can hybridize knowledge from Vision-Language Models (VLMs), specifically the Contrastive Language Image Pretraining (CLIP) model, with extra insights derived from GPT-4 Large Language Model (LLM). Furthermore, two new tasks, Fine-grained Smoke Density Recognition (FSDR) and Coarse-grained Smoke Density Recognition (CSDR) are investigated in this paper to accurately detect and evaluate varying smoke intensities. Notable advancements over current approaches are observed through extensive experiments, demonstrating the superior performance of the proposed approach against state-of-the-art models.

Original languageBritish English
Title of host publication2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PublisherIEEE Computer Society
Pages760-765
Number of pages6
ISBN (Electronic)9798350349399
DOIs
StatePublished - 2024
Event31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates
Duration: 27 Oct 202430 Oct 2024

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880

Conference

Conference31st IEEE International Conference on Image Processing, ICIP 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period27/10/2430/10/24

Keywords

  • Contrastive Language Image Pretraining (CLIP)
  • Large Language Model (LLM)
  • Smoke density recognition
  • Vision-Language Models (VLMs)

Fingerprint

Dive into the research topics of 'SMO-CLIP: ENHANCING ANOMALOUS SMOKE DENSITY ASSESSMENT USING A HYBRID LLM-VLM APPROACH'. Together they form a unique fingerprint.

Cite this