TY - GEN
T1 - SMO-CLIP
T2 - 31st IEEE International Conference on Image Processing, ICIP 2024
AU - Li, Pengfei
AU - Alradi, Muaz Khalifa
AU - Elmezain, Mahmoud Said
AU - Ahmed, Abdelfatah Hassan
AU - Boudiaf, Abderrahmene
AU - Boumaraf, Said
AU - Dias, Jorge
AU - Karki, Hamad
AU - Javed, Sajid
AU - Al Awadhi, Khalid Yousef
AU - Werghi, Naoufel
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Flare stacks are among the crucial components in the safety and emission control of petrochemical plants. However, due to the imperceptibility of smoke and contaminants, analyzing these released particles during flare stack operation is one of the top challenges. To stress the problem, our work presents a novel solution called SMO-CLIP that can hybridize knowledge from Vision-Language Models (VLMs), specifically the Contrastive Language Image Pretraining (CLIP) model, with extra insights derived from GPT-4 Large Language Model (LLM). Furthermore, two new tasks, Fine-grained Smoke Density Recognition (FSDR) and Coarse-grained Smoke Density Recognition (CSDR) are investigated in this paper to accurately detect and evaluate varying smoke intensities. Notable advancements over current approaches are observed through extensive experiments, demonstrating the superior performance of the proposed approach against state-of-the-art models.
AB - Flare stacks are among the crucial components in the safety and emission control of petrochemical plants. However, due to the imperceptibility of smoke and contaminants, analyzing these released particles during flare stack operation is one of the top challenges. To stress the problem, our work presents a novel solution called SMO-CLIP that can hybridize knowledge from Vision-Language Models (VLMs), specifically the Contrastive Language Image Pretraining (CLIP) model, with extra insights derived from GPT-4 Large Language Model (LLM). Furthermore, two new tasks, Fine-grained Smoke Density Recognition (FSDR) and Coarse-grained Smoke Density Recognition (CSDR) are investigated in this paper to accurately detect and evaluate varying smoke intensities. Notable advancements over current approaches are observed through extensive experiments, demonstrating the superior performance of the proposed approach against state-of-the-art models.
KW - Contrastive Language Image Pretraining (CLIP)
KW - Large Language Model (LLM)
KW - Smoke density recognition
KW - Vision-Language Models (VLMs)
UR - https://www.scopus.com/pages/publications/85216835859
U2 - 10.1109/ICIP51287.2024.10648092
DO - 10.1109/ICIP51287.2024.10648092
M3 - Conference contribution
AN - SCOPUS:85216835859
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 760
EP - 765
BT - 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PB - IEEE Computer Society
Y2 - 27 October 2024 through 30 October 2024
ER -