Research Article

A Comparative Assessment of Machine Learning Algorithms for Detecting and Diagnosing Breast Cancer

Authors

  • Md Zahidul Islam MBA Business Analytics, Gannon University, USA
  • Md Nasiruddin Department of management science and quantitative methods, Gannon University, USA
  • Shuvo Dutta Master of Arts in Physics, Western Michigan University, USA
  • Rajesh Sikder PhD Student in Information Technology, University of the Cumberland, KY, USA
  • Chowdhury Badrul Huda Department of Management, Master of Science in Project Management, ST. Francis College, USA
  • Md Rasibul Islam Department of management science and quantitative methods, Gannon University, USA

Abstract

The principal goal of this study was to explore machine-learning techniques deployed for the early detection of breast cancer in the United States. Specifically, three algorithms were trained on a breast cancer dataset: Decision Tree, Random Forest, and Linear Regression. Each model was further evaluated for its performance, to ascertain the best model. Upon review, the Random Forest provided higher classification performance. It was postulated that the Random Forest offered higher accuracy models on the test data because Decision Trees and Linear Regression require more extensive data for them to be more precise in making high-precision predictions. Out of all the models, the Random Forest provided suitable accuracy on test data. Therefore, in this research scope, Random Forest was the most successful and proved effective in accurately identifying breast cancer malignancies. In that light, the proposed random forest can benefit healthcare organizations by facilitating in detection of breast cancer disease by identifying patients in high-risk groups at an early and more treatable stage of disease for improved outcomes and lower healthcare costs. Besides, Random Forest models can assist in identifying high-risk patients in advance for prompt treatment. In that regard, such detection saves lives and decreases long-term healthcare costs for the US government.

Article information

Journal

Journal of Computer Science and Technology Studies

Volume (Issue)

6 (2)

Pages

121-135

Published

2024-06-13

How to Cite

Md Zahidul Islam, Md Nasiruddin, Shuvo Dutta, Rajesh Sikder, Chowdhury Badrul Huda, & Md Rasibul Islam. (2024). A Comparative Assessment of Machine Learning Algorithms for Detecting and Diagnosing Breast Cancer. Journal of Computer Science and Technology Studies, 6(2), 121–135. https://doi.org/10.32996/jcsts.2024.6.2.14

Downloads

Keywords:

Breast Cancer; Random Forest; Decision Tree; Linear Regression; Early Detection; Treatment plans1.