Predicting diabetes in healthy population through machine learning

Hasan Abbas, Lejla Alic, Marelyn Rios, Muhammad Abdul-Ghani, Khalid Qaraqe

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

36 Citations (Scopus)

Abstract

In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1% with a recall rate of 81.1% averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages567-570
Number of pages4
ISBN (Electronic)9781728122861
DOIs
Publication statusPublished - Jun 2019
Externally publishedYes
Event32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019 - Cordoba, Spain
Duration: 5 Jun 20197 Jun 2019

Publication series

NameProceedings - IEEE Symposium on Computer-Based Medical Systems
Volume2019-June
ISSN (Print)1063-7125

Conference

Conference32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019
Country/TerritorySpain
CityCordoba
Period5/06/197/06/19

Keywords

  • Disease Prediction
  • Support vector machine
  • Type 2 diabetes

Fingerprint

Dive into the research topics of 'Predicting diabetes in healthy population through machine learning'. Together they form a unique fingerprint.

Cite this