Predictive Modelling of Benign and Malignant Tumors Using Binary Logistic, Support Vector Machine and Extreme Gradient Boosting Models

Gachoki, Peter; Mburu, Moses; Muraya, Moses

dc.contributor.author	Gachoki, Peter
dc.contributor.author	Mburu, Moses
dc.contributor.author	Muraya, Moses
dc.date.accessioned	2023-07-13T11:25:26Z
dc.date.available	2023-07-13T11:25:26Z
dc.date.issued	2019-11-26
dc.identifier.uri	http://repository.chuka.ac.ke/handle/chuka/15631
dc.description.abstract	Breast cancer is the leading type of cancer among women worldwide, with about 2 million new cases and 627,000 deaths every year. Breast tumors can be malignant or benign. Medical screening can be used to detect the type of a diagnosed tumor. Alternatively, predictive modelling can be used to predict whether a tumor is malignant or benign. However, the accuracy of the prediction algorithms is important: a false negative can have dire consequences, since the patient is not put on treatment, which can lead to death, while a false positive may subject an individual to unnecessary stress and medication. Therefore, this study sought to develop and validate a new predictive model based on binary logistic, support vector machine and extreme gradient boosting models in order to improve the prediction accuracy for cancer tumors. The study used the Breast Cancer Wisconsin data set available on Kaggle. The dependent variable was whether a tumor is malignant or benign. The regressors were tumor features such as radius, texture, area, perimeter, smoothness, compactness, concavity, concave points, symmetry and fractal dimension. Data analysis was done using the R statistical software and involved generation of descriptive statistics, data reduction, feature selection and model fitting. Before model fitting, the reduced data was split into a train set and a validation set. The results showed that the binary logistic, support vector machine and extreme gradient boosting models had predictive accuracies of 96.97%, 98.01% and 97.73%, respectively. This was an improvement compared to already existing models. The results also showed that support vector machine and extreme gradient boosting have better predictive power for cancer tumors than binary logistic. This study recommends the use of support vector machine and extreme gradient boosting in cancer tumor prediction, and also recommends further investigation of other algorithms that can improve prediction.	en_US
dc.language.iso	en	en_US
dc.publisher	Science and Education Publishing	en_US
dc.relation.ispartofseries	American Journal of Applied Mathematics and Statistics;
dc.subject	benign	en_US
dc.subject	malignant	en_US
dc.subject	binary logistic	en_US
dc.subject	support vector machine	en_US
dc.subject	extreme gradient boosting	en_US
dc.title	Predictive Modelling of Benign and Malignant Tumors Using Binary Logistic, Support Vector Machine and Extreme Gradient Boosting Models	en_US
dc.type	Article	en_US
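
The evaluation steps named in the abstract (a train/validation split, predictive accuracy, false negatives and false positives) can be illustrated with a minimal Python sketch. This is not the authors' R code. The function names, the 30% validation fraction, the "M"/"B" label coding and the toy labels are all assumptions made for illustration; the abstract does not state the split ratio actually used.

```python
import random


def train_validation_split(rows, validation_fraction=0.3, seed=0):
    """Shuffle rows reproducibly and return (train, validation).

    The 0.3 default is a hypothetical choice, not taken from the study.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_validation = int(round(len(shuffled) * validation_fraction))
    return shuffled[n_validation:], shuffled[:n_validation]


def confusion_counts(actual, predicted, positive="M"):
    """Count TP, TN, FP and FN with `positive` (malignant) as the positive class."""
    counts = {"tp": 0, "tn": 0, "fp": 0, "fn": 0}
    for a, p in zip(actual, predicted):
        if p == positive:
            # Predicted malignant: correct (tp) or a false alarm (fp).
            counts["tp" if a == positive else "fp"] += 1
        else:
            # Predicted benign: a missed malignancy (fn) or correct (tn).
            counts["fn" if a == positive else "tn"] += 1
    return counts


def accuracy(counts):
    """Share of all predictions that were correct."""
    return (counts["tp"] + counts["tn"]) / sum(counts.values())


# Toy validation labels (made up): "M" = malignant, "B" = benign.
actual = ["M", "M", "B", "B", "B", "M", "B", "B"]
predicted = ["M", "B", "B", "B", "M", "M", "B", "B"]

counts = confusion_counts(actual, predicted)
print(counts)            # one false negative and one false positive
print(accuracy(counts))  # 6 correct out of 8 -> 0.75
```

Reporting the false-negative and false-positive counts alongside accuracy reflects the abstract's point that the two kinds of error carry different costs for the patient.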

Files in this item

Name: Predictive Modelling of Benign ...
Size: 519.7Kb
Format: PDF


This item appears in the following Collection(s)

Nursing
