TY - JOUR
T1 - Enhanced Android Malware Detect Models based on Explainable Generative Adversarial Networks
AU - Alshebli , Shamma Adnan
AU - Mun, Hyeran
AU - Kumar, Deepak
AU - Zemerly, Mohamed Jamal
AU - Martino, Luigi
AU - Damiani, Ernesto
AU - Yeun, Chan Yeob
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2025
Y1 - 2025
N2 - With the widespread adoption of smartphones over the past decade, mobile applications have become a primary target for malicious attacks, usually in the form of malware. Recent studies have leveraged artificial intelligence (AI) techniques for malware detection and classification. However, applying such approaches, particularly deep learning (DL) techniques, to mobile malware detection poses significant challenges. These challenges arise from the difficulty of collecting large quantities of mobile malware samples and the inherent class imbalance in the collected datasets. To tackle these issues and enhance the performance of machine learning (ML) and DL detection models, we propose novel detection models based on a generative adversarial network (GAN). Furthermore, our approach not only employs a conditional tabular GAN (CTGAN) for data augmentation to explore the impact of augmentation but also identifies the optimal multiplication factor for achieving the best results. The evaluation results demonstrate that the proposed data augmentation approach significantly improves the performance of mobile malware detection models, especially those based on DL. We have notably figured out that doubling the original dataset is sufficient to enhance the performance of ML models, whereas DL models require additional data to achieve optimal results. Hence, our proposed mechanism is an effective solution for improving mobile malware detection.
AB - With the widespread adoption of smartphones over the past decade, mobile applications have become a primary target for malicious attacks, usually in the form of malware. Recent studies have leveraged artificial intelligence (AI) techniques for malware detection and classification. However, applying such approaches, particularly deep learning (DL) techniques, to mobile malware detection poses significant challenges. These challenges arise from the difficulty of collecting large quantities of mobile malware samples and the inherent class imbalance in the collected datasets. To tackle these issues and enhance the performance of machine learning (ML) and DL detection models, we propose novel detection models based on a generative adversarial network (GAN). Furthermore, our approach not only employs a conditional tabular GAN (CTGAN) for data augmentation to explore the impact of augmentation but also identifies the optimal multiplication factor for achieving the best results. The evaluation results demonstrate that the proposed data augmentation approach significantly improves the performance of mobile malware detection models, especially those based on DL. We have notably figured out that doubling the original dataset is sufficient to enhance the performance of ML models, whereas DL models require additional data to achieve optimal results. Hence, our proposed mechanism is an effective solution for improving mobile malware detection.
KW - Data Augmentation
KW - Deep Learning (DL)
KW - Explainable Artificial Intelligence (XAI)
KW - Generative Adversarial Networks (GAN)
KW - Machine Learning (ML)
KW - Malware Classification
UR - https://www.scopus.com/pages/publications/105010112081
U2 - 10.1109/ACCESS.2025.3585241
DO - 10.1109/ACCESS.2025.3585241
M3 - Article
AN - SCOPUS:105010112081
SN - 2169-3536
JO - IEEE Access
JF - IEEE Access
M1 - 0b0000649422b637
ER -