K-Nearest Neighbor learning based diabetes mellitus prediction and analysis for eHealth services

Iqbal H. Sarker*, Md Faisal Faruque, Hamed Alqahtani, Asra Kalim

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

53 Downloads (Pure)

Abstract

Nowadays, eHealth service has become a booming area, which refers to computer-based health care and information delivery to improve health service locally, regionally and worldwide. An effective disease risk prediction model by analyzing electronic health data benefits not only to care a patient but also to provide services through the corresponding data-driven eHealth systems. In this paper, we particularly focus on predicting and analysing diabetes mellitus, an increasingly prevalent chronic disease that refers to a group of metabolic disorders characterized by a high blood sugar level over a prolonged period of time. K-Nearest Neighbor (KNN) is one of the most popular and simplest machine learning techniques to build such a disease risk prediction model utilizing relevant health data. In order to achieve our goal, we present an optimal K-Nearest Neighbor (Opt-KNN) learning based prediction model based on patient's habitual attributes in various dimensions. This approach determines the optimal number of neighbors with low error rate for providing better prediction outcome in the resultant model. The effectiveness of this machine learning eHealth model is examined by conducting experiments on the real-world diabetes mellitus data collected from medical hospitals.

Original languageEnglish
Article numbere4
Pages (from-to)1-9
Number of pages9
JournalEAI Endorsed Transactions on Scalable Information Systems
Volume7
Issue number26
DOIs
Publication statusPublished - 2020

Bibliographical note

Copyright the Author(s) 2020. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • health data analytics
  • diabetes mellitus
  • data science
  • machine learning
  • k-nearest neighbor
  • predictive analytics
  • classification
  • intelligent systems
  • eHealth
  • IoT services

Cite this