Issue |
BIO Web Conf.
Volume 146, 2024
2nd Biology Trunojoyo Madura International Conference (BTMIC 2024)
|
|
---|---|---|
Article Number | 01081 | |
Number of page(s) | 8 | |
Section | Dense Matter | |
DOI | https://doi.org/10.1051/bioconf/202414601081 | |
Published online | 27 November 2024 |
Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
1 Departemen of Informatics, Faculty of Engineering, University of Trunojoyo Madura, Kamal, Bangkalan, Indonesia
2 Faculty of Engineering and Quantity Surveying, INTI International University, Negeri Sembilan 71800, Malaysia
* Corresponding author: sigit.putro@trunojoyo.ac.id
Diabetes mellitus is a chronic disease often caused by high blood glucose levels and insufficient insulin production. This research aims to address the classification problem of diabetes mellitus using the K-Nearest Neighbor (K-NN) method. The aim of this research is to create a machine learning model that can detect diabetes early. The study was conducted at Syarifah Ambami Rato Ebu Hospital in Bangkalan, utilizing data from 120 patients in 2019, employing data mining techniques to classify diabetes mellitus patients. Additionally, the steps in data mining involve determining significant variables or features for classification Cleansing and normalization and transformation. The research compares training test results with ratios of 90:10, 80:20, and 70:30. Experimental results show that K-NN with a neighbor value of K=11 achieves the highest accuracy rate of 83% a reduced error rate of 16.67%, and the highest AUC value of 0.7407. These results indicate that the 90:10 data split ratio yields the best model performance in terms of accuracy and class differentiation for diabetes mellitus, as well as the lowest error rate compared to other data split ratios. This study provides a better understanding of diabetes mellitus and demonstrates that K-NN is effective in addressing classification problems, focusing on specific variables that influence the disease. Therefore, it can be concluded that K-Nearest Neighbor (K-NN) is a suitable algorithm for classifying diabetes mellitus.
© The Authors, published by EDP Sciences, 2024
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.