Machine Learning-Based Breast Cancer Prediction with Classification Probability Analysis

  • Luthfi Ardyansyah, Informatics Engineering, Universitas Muhadi Setiabudi
  • Bambang Irawan, Informatics Engineering, Universitas Muhadi Setiabudi
Keywords: K-Nearest Neighbor, confusion matrix, breast cancer

Abstract

Breast cancer has a high mortality rate among women, so early detection is crucial for improving the chances of recovery. Conventional diagnosis, however, still relies on interpretation by medical personnel and on laboratory procedures that are time-consuming and costly. This study presents a machine learning-based approach to breast cancer prediction and adds a classification probability analysis to make each prediction more informative. A breast cancer dataset was used to train four models: Logistic Regression, Support Vector Machine, Random Forest, and K-Nearest Neighbor. The models were evaluated using accuracy, the confusion matrix, the ROC curve, and AUC. All four models classified the cases with fairly high performance, and one model stood out with the highest accuracy and AUC. The classification probability analysis adds information about how confident each prediction is, which can help medical personnel make more objective clinical decisions.
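To make the evaluation metrics named in the abstract concrete, the following is a minimal pure-Python sketch of how a confusion matrix, accuracy, and AUC can be derived from predicted class probabilities, and how predictions close to the decision threshold might be flagged as low-confidence. It is not the authors' implementation. The labels and probabilities are made-up illustrative values, the encoding 1 = malignant and 0 = benign is an assumption, and the 0.5 threshold and 0.15 uncertainty margin are arbitrary choices.

```python
from typing import List, Tuple


def confusion_matrix(y_true: List[int], y_prob: List[float],
                     threshold: float = 0.5) -> Tuple[int, int, int, int]:
    """Return (tn, fp, fn, tp) after thresholding the predicted probabilities."""
    tn = fp = fn = tp = 0
    for label, prob in zip(y_true, y_prob):
        predicted = 1 if prob >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted == 1:
            fp += 1
        else:
            tn += 1
    return tn, fp, fn, tp


def accuracy(y_true: List[int], y_prob: List[float],
             threshold: float = 0.5) -> float:
    """Fraction of samples whose thresholded prediction matches the label."""
    tn, fp, fn, tp = confusion_matrix(y_true, y_prob, threshold)
    return (tn + tp) / len(y_true)


def roc_auc(y_true: List[int], y_prob: List[float]) -> float:
    """AUC as the chance that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [p for y, p in zip(y_true, y_prob) if y == 1]
    negatives = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for pos in positives:
        for neg in negatives:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def low_confidence(y_prob: List[float], threshold: float = 0.5,
                   margin: float = 0.15) -> List[int]:
    """Indices of predictions whose probability lies within `margin` of the threshold."""
    return [i for i, p in enumerate(y_prob) if abs(p - threshold) < margin]


# Hypothetical data: 1 = malignant, 0 = benign (assumed encoding);
# y_prob is an imagined model's predicted probability of malignancy.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_prob = [0.95, 0.80, 0.55, 0.30, 0.60, 0.20, 0.10, 0.05]

print("confusion (tn, fp, fn, tp):", confusion_matrix(y_true, y_prob))
print("accuracy:", accuracy(y_true, y_prob))
print("AUC:", roc_auc(y_true, y_prob))
print("low-confidence indices:", low_confidence(y_prob))
```

In this toy data, two samples fall close to the threshold. A probability-aware workflow could route such borderline cases to additional review rather than treating them the same as predictions near 0 or 1.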

Published
2026-01-29