Peningkatan Akurasi Diagnosis Penyakit Ginjal Kronis melalui Integrasi Algoritma Naive Bayes dan Algoritma Genetika
Abstract
Chronic Kidney Disease (CKD) is a significant global health challenge that necessitates early diagnosis to prevent severe organ failure. While machine learning techniques such as Naive Bayes (NB) have been widely implemented for medical classification, their performance is often hindered by redundant and irrelevant features within high-dimensional medical datasets. This study aims to address this limitation by reducing the dimensions of non-contributing medical attributes, thereby minimizing bias and improving classification accuracy. Consequently, this study proposes the integration of Genetic Algorithm (GA) as a feature selection method to optimize the performance of the Naive Bayes (NB) algorithm in diagnosing CKD. The dataset, sourced from the UCI Machine Learning Repository, consists of 400 samples and 24 clinical features. A genetic algorithm was employed to identify the optimal feature subset through a binary evolution mechanism, while NB served both as the classifier and the fitness evaluation function. The results demonstrate that GA successfully reduced the data dimensions by 50%, streamlining the initial 24 features into 12 highly discriminative ones. Evaluation using 10-Fold Cross-Validation revealed a significant increase in accuracy, rising from 92.50% using the standard NB to 98.50% with the integrated GA-NB model. Furthermore, the recall reached 98.40%, indicating the model's high capability in minimizing diagnostic errors for affected patients (false negatives). This research proves that GA-based feature selection effectively enhances diagnostic reliability and model efficiency, presenting substantial potential for implementation in clinical decision support systems for medical professionals.
References
J. A. Maharani and M. Masriadi, “Pengaruh Artificial Intelegen Dalam Mendeteksi Kasus Penyakit Tidak Menular,” RIGGS J. Artif. Intell. Digit. Bus., vol. 4, no. 2, pp. 7343–7349, Jul. 2025, doi: 10.31004/riggs.v4i2.1864.
Ermanto and N. Surojudin, “Perbandingan Metode Klasifikasi dalam Memprediksi Penyakit Ginjal Kronis,” J. Inform. Ekon. Bisnis, pp. 715–723, Sep. 2025, doi: 10.37034/infeb.v7i3.1263.
M. A.-Z. Faradeya and E. R. Subhiyakto, “Klasifikasi Penyakit Gagal Jantung Menggunakan Algoritma Naive Bayes,” J. Algoritm., vol. 22, no. 1, pp. 115–127, May 2025, doi: 10.33364/algoritma/v.22-1.2178.
A. Darmawahyuni et al., “Health-Related Data Analysis Using Metaheuristic Optimization and Machine Learning,” IEEE Access, vol. 12, pp. 55342–55356, 2024, doi: 10.1109/ACCESS.2024.3390008.
M. Huang, X. S. Zhang, U. A. Bhatti, Y. Wu, Y. Zhang, and Y. Yasin Ghadi, “An interpretable approach using hybrid graph networks and explainable AI for intelligent diagnosis recommendations in chronic disease care,” Biomed. Signal Process. Control, vol. 91, p. 105913, May 2024, doi: 10.1016/j.bspc.2023.105913.
J. Badriyah, Nilam Ramadhani, Agung Muliawan, Khanun Roisatul Ummah, and Ata Amrullah, “Penerapan Dimensi Reduksi Pada Machine Learning Dalam Klasifikasi Kanker Payudara Berdasarkan Parameter Medis,” J. RESTIKOM Ris. Tek. Inform. dan Komput., vol. 6, no. 3, pp. 526–533, Dec. 2024, doi: 10.52005/restikom.v6i3.379.
P. E. L. Rubini, P. Soundarapandian, “Chronic Kidney Disease [Dataset],” UCI Machine Learning Repository. [Online]. Available: https://doi.org/10.24432/C5G54X .
S. Tiwaskar, M. Rashid, and P. Gokhale, “Impact of machine learning-based imputation techniques on medical datasets- a comparative analysis,” Multimed. Tools Appl., vol. 84, no. 9, pp. 5905–5925, Apr. 2024, doi: 10.1007/s11042-024-19103-0.
L. O. Joel, W. Doorsamy, and B. S. Paul, “A comparative study of imputation techniques for missing values in healthcare diagnostic datasets,” Int. J. Data Sci. Anal., vol. 20, no. 7, pp. 6357–6373, Nov. 2025, doi: 10.1007/s41060-025-00825-9.
I. Izonin, R. Tkachenko, N. Shakhovska, B. Ilchyshyn, and K. K. Singh, “A Two-Step Data Normalization Approach for Improving Classification Accuracy in the Medical Diagnosis Domain,” Mathematics, vol. 10, no. 11, p. 1942, Jun. 2022, doi: 10.3390/math10111942.
N. Singh and P. Singh, “Exploring the effect of normalization on medical data classification,” in 2021 International Conference on Artificial Intelligence and Machine Vision (AIMV), Sep. 2021, pp. 1–5, doi: 10.1109/AIMV53313.2021.9670938.
C. G and J. M. Roogi, “A Quick Review of ML Algorithms,” in 2021 6th International Conference on Communication and Electronics Systems (ICCES), Jul. 2021, pp. 1–5, doi: 10.1109/ICCES51350.2021.9488982.
D. Kaur et al., “Application of Bayesian networks to generate synthetic health data,” J. Am. Med. Informatics Assoc., vol. 28, no. 4, pp. 801–811, Mar. 2021, doi: 10.1093/jamia/ocaa303.
C. Zhang and J. Han, “Data Mining and Knowledge Discovery,” 2021, pp. 797–814.
X. Shu and Y. Ye, “Knowledge Discovery: Methods from data mining and machine learning,” Soc. Sci. Res., vol. 110, p. 102817, Feb. 2023, doi: 10.1016/j.ssresearch.2022.102817.
K. Dissanayake and M. G. Md Johar, “Comparative Study on Heart Disease Prediction Using Feature Selection Techniques on Classification Algorithms,” Appl. Comput. Intell. Soft Comput., vol. 2021, pp. 1–17, Nov. 2021, doi: 10.1155/2021/5581806.
J. X. Wang, “Meta-learning in natural and artificial intelligence,” Curr. Opin. Behav. Sci., vol. 38, pp. 90–95, Apr. 2021, doi: 10.1016/j.cobeha.2021.01.002.
I. K. Nti, O. Nyarko-Boateng, and J. Aning, “Performance of Machine Learning Algorithms with Different K Values in K-fold CrossValidation,” Int. J. Inf. Technol. Comput. Sci., vol. 13, no. 6, pp. 61–71, Dec. 2021, doi: 10.5815/ijitcs.2021.06.05.
A. Tharwat, “Classification assessment methods,” Appl. Comput. Informatics, vol. 17, no. 1, pp. 168–192, Jan. 2021, doi: 10.1016/j.aci.2018.08.003.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Peningkatan Akurasi Diagnosis Penyakit Ginjal Kronis melalui Integrasi Algoritma Naive Bayes dan Algoritma Genetika
Pages: 33 - 43