Main Article Content

Comparative analysis of machine learning algorithms for breast cancer prediction


Kene Tochukwu Anyachebelu
Sukkushe Hannah Hosea
Muhammad Umar Abdullahi
Maimuna Abdullahi Ibrahim

Abstract

Breast cancer is a global health concern, and early diagnosis is crucial for successful treatment. The objective of this paper is to conduct a comparative analysis of machine-learning algorithms for the prediction of breast cancer. This study used the Wisconsin Diagnostic Breast Cancer Dataset. Data preparation, technique selection, and performance evaluation are included in the study. The inquiry begins by comparing malignant and benign instances according to input factors and diagnostic outcomes. Finding components having an inverse relationship to the diagnosis is prioritized. Next, a careful approach is used to choose attributes to improve the dataset for model construction. The preprocessed data trains and optimizes four well-known machine learning algorithms: Random Forest, Support Vector Machine, K-Nearest Neighbor, and Logistic Regression. The models are evaluated for accuracy, precision, recall, F1-score, and ROC curve. This study aimed to evaluate numerous breast cancer prediction systems to determine their strengths and weaknesses. To provide openness and replicability, the study uses the Jupyter Notebook platform, Python, and data analytic tools. The logistic regression model has a test accuracy percentage of 99.26%, surpassing all other models examined in this study. Furthermore, it has a minimum false positive rate (FPR) of 1 and a false negative rate (FNR) of 4. The model exhibits a higher level of precision in comparison to the studies examined in the literature review. This study is crucial for early diagnosis and therapy development. The effects include lower healthcare expenses, better patient outcomes, and better diagnostics. Machine learning has shown promise in fighting breast cancer, boosting its relevance in healthcare.


Journal Identifiers


eISSN: 2635-3490
print ISSN: 2476-8316