TY - JOUR
T1 - Predicting early Alzheimer’s with blood biomarkers and clinical features
AU - AlMansoori, Muaath Ebrahim
AU - Jemimah, Sherlyn
AU - Abuhantash, Ferial
AU - AlShehhi, Aamna
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/12
Y1 - 2024/12
N2 - Alzheimer’s disease (AD) is an incurable neurodegenerative disorder that leads to dementia. This study employs explainable machine learning models to detect dementia cases using blood gene expression, single nucleotide polymorphisms (SNPs), and clinical data from Alzheimer’s Disease Neuroimaging Initiative (ADNI). Analyzing 623 ADNI participants, we found that the Support Vector Machine classifier with Mutual Information (MI) feature selection, trained on all three data modalities, achieved exceptional performance (accuracy = 0.95, AUC = 0.94). When using gene expression and SNP data separately, we achieved very good performance (AUC = 0.65, AUC = 0.63, respectively). Using SHapley Additive exPlanations (SHAP), we identified significant features, potentially serving as AD biomarkers. Notably, genetic-based biomarkers linked to axon myelination and synaptic vesicle membrane formation could aid early AD detection. In summary, this genetic-based biomarker approach, integrating machine learning and SHAP, shows promise for precise AD diagnosis, biomarker discovery, and offers novel insights for understanding and treating the disease. This approach addresses the challenges of accurate AD diagnosis, which is crucial given the complexities associated with the disease and the need for non-invasive diagnostic methods.
AB - Alzheimer’s disease (AD) is an incurable neurodegenerative disorder that leads to dementia. This study employs explainable machine learning models to detect dementia cases using blood gene expression, single nucleotide polymorphisms (SNPs), and clinical data from Alzheimer’s Disease Neuroimaging Initiative (ADNI). Analyzing 623 ADNI participants, we found that the Support Vector Machine classifier with Mutual Information (MI) feature selection, trained on all three data modalities, achieved exceptional performance (accuracy = 0.95, AUC = 0.94). When using gene expression and SNP data separately, we achieved very good performance (AUC = 0.65, AUC = 0.63, respectively). Using SHapley Additive exPlanations (SHAP), we identified significant features, potentially serving as AD biomarkers. Notably, genetic-based biomarkers linked to axon myelination and synaptic vesicle membrane formation could aid early AD detection. In summary, this genetic-based biomarker approach, integrating machine learning and SHAP, shows promise for precise AD diagnosis, biomarker discovery, and offers novel insights for understanding and treating the disease. This approach addresses the challenges of accurate AD diagnosis, which is crucial given the complexities associated with the disease and the need for non-invasive diagnostic methods.
KW - Alzheimer’s disease
KW - Blood biomarkers
KW - Clinical features
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85187514296&partnerID=8YFLogxK
U2 - 10.1038/s41598-024-56489-1
DO - 10.1038/s41598-024-56489-1
M3 - Article
C2 - 38472245
AN - SCOPUS:85187514296
SN - 2045-2322
VL - 14
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 6039
ER -