A Statistically Validated Machine Learning Framework for Early Alzheimer's Disease Detection Using Structured Clinical Data

Shehu  Mohammed; Neha Malhotra; Anmol  Singh Rai; Sourabh  Kumar

doi:10.64389/icds.2026.02297

Authors

Shehu Mohammed School of Computer Applications, Lovely Professional University, Phagwara, Punjab, India Author
Neha Malhotra School of Computer Applications, Lovely Professional University, Phagwara, Punjab, India Author
Anmol Singh Rai Shrimann Superspeciality Hospital, Jalandhar, Punjab, India Author
Sourabh Kumar Mittal School of Business, Lovely Professional University, Phagwara, Punjab, India Author

DOI:

https://doi.org/10.64389/icds.2026.02297

Keywords:

Alzheimer's disease, Early Detection, Machine Learning, Ensemble Learning, Statistical Validation, Explainable AI, SHAP

Abstract

Early detection of Alzheimer's disease (AD) is vital in resource-limited settings that rely on structured clinical data. This study presents a rigorous, interpretable benchmarking framework using a dataset of 2,149 subjects (64.6\% cognitively normal; 35.4\% AD), split into training (80\%) and testing (20\%) sets. Five models—Logistic Regression, Random Forest, a Convolutional Neural Network (CNN), a Recurrent Neural Network (RNN), and Optimized Gradient Boosting—were evaluated under identical conditions. Optimized Gradient Boosting achieved the highest performance on the test set ($n = 430$) with 95.10\% accuracy, 92.10\% sensitivity, 96.80\% specificity, an F1-score of 0.93, and the fewest false negatives ($n = 12$). Random Forest also performed strongly (93.95\% accuracy), while linear and deep learning models were less effective. SHAP analysis aligned model predictions with key clinical biomarkers, including functional assessments, activities of daily living (ADL), and MMSE scores, demonstrating that ensemble tree-based models excel in structured clinical settings.

Downloads

Download data is not yet available.

Author Biographies

Neha Malhotra, School of Computer Applications, Lovely Professional University, Phagwara, Punjab, India

Associate professor and Deputy Dean, School of Computer Applications
Anmol Singh Rai, Shrimann Superspeciality Hospital, Jalandhar, Punjab, India

Neurologist and consultant, Shrimann Superspeciality Hospital, Jalandhar, Punjab, India
Sourabh Kumar, Mittal School of Business, Lovely Professional University, Phagwara, Punjab, India

Mittal School of Business, Lovely Professional University, Phagwara, Punjab, India

A Statistically Validated Machine Learning Framework for Early Alzheimer's Disease Detection Using Structured Clinical Data

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biographies

Downloads

Published

Issue

Section

License

How to Cite

Side Bar

Keywords

Crossref