doi.org
Improved logistic regression model for diabetes prediction by integrating PCA and K-means techniques
January 2019 • Changsheng Zhu, Christian Uwa Idemudia, Wenfang Feng
Diabetes causes a large number of deaths each year, and many people living with the disease do not learn of their condition early enough. In this study, we propose a data-mining-based model for early diagnosis and prediction of diabetes using the Pima Indians Diabetes dataset. Although K-means is simple and can be used for a wide variety of data types, it is quite sensitive to the initial positions of the cluster centers, which determine the final clustering result, which either provides a sufficient and eff…
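
The sensitivity of K-means to its initial cluster centers, noted in the abstract, can be illustrated with a minimal sketch. This is not the authors' model and does not use the Pima Indians Diabetes dataset: the four toy points, the two sets of starting centers, and the helper names `sq_dist` and `kmeans` are all assumptions made for illustration. It runs plain Lloyd-style K-means in pure Python from two different starting positions and compares the resulting within-cluster sum of squares.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def assign(points: Sequence[Point], centers: Sequence[Point]) -> List[int]:
    """Label each point with the index of its nearest center."""
    return [
        min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
        for p in points
    ]


def kmeans(points: Sequence[Point], init_centers: Sequence[Point],
           max_iter: int = 100) -> Tuple[List[Point], List[int], float]:
    """Lloyd-style K-means started from caller-supplied centers (hypothetical helper)."""
    centers = [(float(x), float(y)) for x, y in init_centers]
    for _ in range(max_iter):
        labels = assign(points, centers)
        new_centers = []
        for j in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                # Move the center to the mean of its assigned points.
                new_centers.append((
                    sum(p[0] for p in members) / len(members),
                    sum(p[1] for p in members) / len(members),
                ))
            else:
                # Keep an empty cluster's center where it was.
                new_centers.append(centers[j])
        if new_centers == centers:
            break  # converged: no center moved
        centers = new_centers
    labels = assign(points, centers)
    inertia = sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))
    return centers, labels, inertia


# Toy data (assumed): corners of a wide rectangle, so the natural split is left vs. right.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]

# Start A: one center near each short side.
centers_a, labels_a, inertia_a = kmeans(points, [(0.0, 0.5), (4.0, 0.5)])

# Start B: centers stacked in the middle, which traps the algorithm in a poor local optimum.
centers_b, labels_b, inertia_b = kmeans(points, [(2.0, 0.0), (2.0, 1.0)])

print("start A:", labels_a, inertia_a)
print("start B:", labels_b, inertia_b)
```

Both runs converge immediately, yet start A groups the points left versus right while start B groups them bottom versus top with a much larger within-cluster sum of squares. Common mitigations include running several restarts and keeping the best result, or choosing the initial centers more carefully.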