Show simple item record

dc.contributor.authorGachoki, Peter
dc.contributor.authorMburu, Moses
dc.contributor.authorMuraya, Moses
dc.date.accessioned2023-07-13T11:25:26Z
dc.date.available2023-07-13T11:25:26Z
dc.date.issued2019-11-26
dc.identifier.urihttp://repository.chuka.ac.ke/handle/chuka/15631
dc.description.abstractBreast cancer is the leading type of cancer among women worldwide, with about 2 million new cases and 627,000 deaths every year. The breast tumors can be malignant or benign. Medical screening can be used to detect the type of a diagnosed tumor. Alternatively, predictive modelling can also be used to predict whether a tumor is malignant or benign. However, the accuracy of the prediction algorithms is important since any incidence of false negatives may have dire consequence since a person cannot be put under medication, which can lead to death. Moreover, cases of false positives may subject an individual to unnecessary stress and medication. Therefore, this study sought to develop and validate a new predictive model based on binary logistic, support vector machine and extreme gradient boosting models in order to improve the prediction accuracy of the cancer tumors. This study used the Breast Cancer Wilcosin data set available on Kaggle. The dependent variable was whether a tumor is malignant or benign. The regressors were the tumor features such as radius, texture, area, perimeter, smoothness, compactness, concavity, concave points, symmetry and fractional dimension of the tumor. Data analysis was done using the Rstatistical software and it involved, generation of descriptive statistics, data reduction, feature selection and model fitting. Before model fitting was done, the reduced data was split into the train set and the validation set. The results showed that the binary logistic, support vector machine and extreme gradient boosting models had predictive accuracies of 96.97%, 98.01% and 97.73%. This showed an improvement compared to already existing models. The results of this study showed that support vector machine and extreme gradient boosting have better prediction power for cancer tumors compared to binary logistic. This study recommends the use of support vector machine and extreme gradient boosting in cancer tumor prediction and also recommends further investigations for other algorithms that can improve predictionen_US
dc.language.isoenen_US
dc.publisherScience and Education Publishingen_US
dc.relation.ispartofseriesAmerican Journal of Applied Mathematics and Statistics;
dc.subjectbenignen_US
dc.subjectmalignanten_US
dc.subjectbinary logisticen_US
dc.subjectsupport vector machineen_US
dc.subjectextreme gradient boostingen_US
dc.titlePredictive Modelling of Benign and Malignant Tumors Using Binary Logistic, Support Vector Machine and Extreme Gradient Boosting Modelsen_US
dc.typeArticleen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record