
A Two-Stage Method of Cervical Cancer Detection using Hybrid Classification Methods
Cervical cancer is a primary cause of death in women. Human Papilloma Virus (HPV) virus is the main factor that causes cervical cancer. Therefore, risk reduction of cervical cancer is cervical cancer screening and vaccination against infection of HPV. The methods of cervical cancer screening are medical history, HPV virus detection, Pap smear detection, and small pieces of meat cutting. However, Pap smear detection is a popular technique for cervical cancer screening. In addition, the classification method is used to increase the performance of cervical cancer detection. Therefore, our research proposed method of cervical cancer detection using hybrid classification methods with risk factor data of cervical cancer and cervical cancer behavior risk data, which is divided into two stages: primary risk estimation of a chance cervical cancer and cervical cancer prediction. Firstly, the dataset was prepared to handle missing values and imbalanced data. Secondly, we select suitable attributes for model building. Finally, the dataset was used to build the model using hybrid classification methods by bagging with Random Forest. From our experiment, the model for primary risk estimation of a chance cervical cancer and cervical cancer prediction has an accuracy of 99.25% and 98.00%, respectively. The result shows that our proposed method accurately predicts cervical cancer compared to state-of-the-art methods. Therefore, it confirms that our proposed model can be applied to use for primary risk self-estimation of a chance of cervical cancer before deciding to meet a physician. Moreover, it helps the physician with diagnosis and treatment to reduce costs and the death rate.
[1] Alam T., Khan M., Iqbal M., Wahab A., and Mushtaq M., “Cervical Cancer Prediction through Different Screening Methods Using Data Mining,” International Journal of Advanced Computer Science and Applications, vol. 10, no. 2, pp. 388-396, 2019. DOI:10.14569/IJACSA.2019.0100251
[2] Bhavani C. and Govardhan A., “Cervical Cancer Prediction Using Stacked Ensemble Algorithm with SMOTE and RFERF,” Materials Today: Proceedings, vol. 8, no. 3, pp. 3451-3457, 2023. https://doi.org/10.1016/j.matpr.2021.07.269
[3] Cervical Cancer, https://www.medparkhospital.com/content/cervic al-cancer, Last Visited, 2024.
[4] Cervical Cancer, https://www.rama.mahidol.ac.th/cancer_center/th /protfolio/knowledge/gyne/servix, Last Visited, 2024.
[5] Cervical Cancer, Last Visited, 2024.
[6] Chandran V., Sumithra M., Karthick A., George T., Deivakani M., Elakkiya B., Subramaniam U., and Manoharan S., “Diagnosis of Cervical Cancer based on Ensemble Deep Learning Network using Colposcopy Images,” BioMed Research International, vol. 2021, pp. 1-15, 2021. DOI:10.1155/2021/5584004
[7] Curia F., “Cervical Cancer Risk Prediction with Robust Ensemble and Explainable Black Boxes Method,” Health and Technology, vol. 11, pp. 875-885, 2021. https://doi.org/10.1007/s12553- 021-00554-6
[8] Deng X., Luo Y., and Wang C., “Analysis of Risk Factors for Cervical Cancer Based on Machine Learning Methods,” in Proceedings of the 5th IEEE International Conference on Cloud Computing and Intelligence Systems, China, pp. 631-635, 2018. DOI:10.1109/CCIS.2018.8691126
[9] Galgano M., Castle P., Atkins K., Brix W., Nassau S., and Stoler M., “Using Biomarkers as Objective Standards in the Diagnosis of Cervical Biopsies,” The American Journal of Surgical Pathology, vol. 34, no. 8, pp. 1077-1087, 2010. DOI:10.1097/PAS.0b013e3181e8b2c4
[10] Guvenc G., Akyuz A., and Açikel C., “Health Belief Model Scale for Cervical Cancer and Pap Smear Test: Psychometric Testing,” Journal of Advanced Nursing, vol. 67, pp. 428-437, 2011. DOI:10.1111/j.1365-2648.2010.05450.x
[11] Guyon I., Weston J., Barnhill S., and Vapnik V., “Gene Selection for Cancer Classification using Support Vector Machines,” Machine learning, vol. 46, pp. 389-422, 2002. https://doi.org/10.1023/A:1012487302797
[12] Han J., Kamber M., and Pei J., Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, 2012.
[13] Ilyas Q. and Ahmad M., “An Enhanced Ensemble Diagnosis of Cervical Cancer: A Pursuit of Machine Intelligence towards Sustainable Health,” Digital Object Identifier, vol. 9, pp. 12374-12388, 2021. DOI:10.1109/ACCESS.2021.3049165
[14] Kaggle, https://www.kaggle.com/datasets/ranzeet013/cerv ical-cancer-dataset, Last Visited, 2024.
[15] Kalbhor M. and Shinde S., “ColpoClassifier: A Hybrid Framework for Classification of the Cervigrams,” Diagnostics, vol. 13, no. 1103, pp. 1-16, 2023. https://doi.org/10.3390/diagnostics13061103
[16] Kelwin F., Jaime C., and Jessica F., Cervical Cancer (Risk Factors), 2017. https://doi.org/10.24432/C5Z310, Last Visited, 2024.
[17] Kerkar R. and Kulkarni Y., “Screening for Cervical Cancer: An Overview,” Journal Obstet Gynecol India, vol. 56, no. 2, pp. 115-122, 2006. https://jogi.co.in/storage/files/02_ra_screening- 1.pdf
[18] Khanam F. and Mondal R., “Ensemble Machine Learning Algorithms for the Diagnosis of Cervical Cancer,” in Proceedings of the International Conference on Science and Contemporary Technologies, Bangladesh, pp. 1-5, 2021. DOI:10.1109/ICSCT53883.2021.9642612
[19] Khomphaiboonkij U., Sreamsukcharoenchai N., Pitakkarnkul S., Rittiluechai K., and Tangjitgamol S., “Knowledge of Thai Women in Cervical Cancer Etiology and Screening,” PLoS ONE, vol. 20, no. 1, pp. 1-12, 2023. https://doi.org/10.1371/journal.pone.0318184
[20] Kruczkowski M., Drabik‑Kruczkowska A., Marciniak A., Tarczewska M., Kosowska M., and Szczerska M., “Predictions of Cervical Cancer Identification by Photonic Method Combined with Machine Learning,” Scientific Reports, vol. 12, no. 3672, pp. 1-11, 2022. https://doi.org/10.1038/s41598-022-07723-1
[21] Kulpornsirikul M., Srison U., Laosooksathit W., and Tantitamit T., “Prevalence of Cervical High- risk Human Papilloma Viral Infection in Pregnant Women at HRH Maha Chakri Sirindhorn Medical Center Thailand,” Thai Journal of Obstetrics and Gynaecology, vol. 32, no. 2, pp. 128-136, 2024. https://doi.org/10.14456/tjog.2024.14
[22] Laowahutanont P., Chaiwerawattana A., and Imsamran W., “Screening Guidelines Diagnose and Treat Cervical Cancer,” Kosit Press Co Ltd, A Two-Stage Method of Cervical Cancer Detection using Hybrid Classification Methods 459 2018.
[23] Lilhore U., Poongodi M., Kaur A., Simaiya S., Algarni A., Elmannai H., Vijayakumar V., Tunze G., and Hamdi M., “Hybrid Model for Detection of Cervical Cancer Using Causal Analysis and Machine Learning Techniques,” Computational and Mathematical Methods in Medicine, vol. 2022, pp. 1-17, 2022. https://doi.org/10.1155/2022/4688327
[24] Madhukar B., Bharathi S., and Ashwin M., “Classification of Breast Cancer using Ensemble Filter Feature Selection with Triplet Attention Based Efficient Net Classifier,” The International Arab Journal of Information Technology, vol. 21, no. 1, pp. 17-31, 2024. https://doi.org/10.34028/iajit/21/1/2
[25] Mahmoud H., AlArfaj A., and Hafez A., “A Fast Hybrid Classification Algorithm with Feature Reduction for Medical Images,” Applied Bionics and Biomechanics, vol. 2022, pp. 1-11, 2022. https://doi.org/10.1155/2022/1367366
[26] Masot J., Perejon F., Saavedra L., and Morales M., “A Lightweight xAI Approach to Cervical Cancer Classification,” Medical and Biological Engineering and Computing, vol. 62, pp. 2281- 2304, 2024. https://doi.org/10.1007/s11517-024- 03063-6
[27] Mehmood M., Rizwan M., ML M., and Abbas S., “Machine Learning Assisted Cervical Cancer Detection,” Frontiers in Public Health, vol. 9, pp. 1-14, 2021. DOI:10.3389/fpubh.2021.788376
[28] Mukku L. and Thomas J., “Hybrid Decision Fusion based Multimodal Ensemble Framework for Cervical Cancer Detection,” International Journal of Scientific Research in Computer Science, Engineering, and Information Technology, vol. 9, no. 1, pp. 17-24, 2023. https://doi.org/10.32628/CSEIT239018
[29] Nandanwar P. and Dhonde S., “A Novel Approach to Cervical Cancer Detection Using Hybrid Stacked Ensemble Models and Feature Selection,” International Journal of Electrical and Electronics Research, vol. 11, no. 2, pp. 582-589, 2023. DOI: 10.37391/ijeer.110246
[30] Nikookar E., Naderi E., and Rahnavard A., “Cervical Cancer Prediction by Merging Features of Different Colposcopic Images and Using Ensemble Classifier,” Journal of Medical Signals and Sensors, vol. 11, no. 2, pp. 67-78, 2021. DOI:10.4103/jmss.JMSS_16_20
[31] Patsadu O., Watanapa B., Dajpratham P., and Nukoolkit C., “Fall Motion Detection with Fall Severity Level Estimation by Mining Kinect 3D Data Stream,” The International Arab Journal of Information Technology, vol. 15, no. 3, pp. 378- 388, 2018.
[32] Predawan S. and Sompeewong P., “Cervical Cancer Screening Using Data Mining Technique,” Chonburi Hospital Journal, vol. 45, no. 3, pp. 199-208, 2020.
[33] Rahaman M., Li C., Yao Y., Kulwa F., Wu X., Li X., and Wang Q., “Deepcervix: A Deep Learning- based Framework for the Classification of Cervical Cells using Hybrid Deep Feature Fusion Techniques,” Computers in Biology and Medicine, vol. 136, pp. 104649, 2021. https://doi.org/10.1016/j.compbiomed.2021.1046 49
[34] Ramaraju H., Nagaveni Y., and Khazi A., “Use of Schiller’s Test versus Pap Smear to Increase Detection Rate of Cervical Dysplasias,” International Journal of Reproduction, Contraception, Obstetrics and Gynecology, vol. 5, pp. 1446-1450, 2017. https://doi.org/10.18203/2320- 1770.ijrcog20161302
[35] Razali N., Mostafa S., Mustapha A., Wahab M., and Ibrahim N., “Risk Factors of Cervical Cancer using Classification in Data Mining,” Journal of Physics: Conference Series, vol. 1529, pp. 1-8, 2020. DOI:10.1088/1742-6596/1529/2/022102
[36] Sheta A., El-Ashmawi W., and Baareh A., “Heart Disease Diagnosis Using Decision Trees with Feature Selection Method,” The International Arab Journal of Information Technology, vol. 21, no. 3, pp. 427-438, 2024. https://doi.org/10.34028/iajit/21/3/7
[37] Shinde S., Kalbhor M., and Wajire P., “DeepCyto: A Hybrid Framework for Cervical Cancer Classification by using Deep Feature Fusion of Cytology Images,” Mathematical Biosciences and Engineering, vol. 19, no. 7, pp. 6415-6434, 2022. DOI:10.3934/mbe.2022301
[38] Sindiani A., Alshdaifal E., and Alkhatib A., “Investigating Cervical Risk Factors that Lead to Cytological and Biopsy Examination,” MED ARCH, vol. 74, no. 4, pp. 294-297, 2020. DOI:10.5455/medarh.2020.74.294-297
[39] Sonkong K., “Factors Associated with Cervical Cancer Screening among Women Aged 30-60 Years in the Taktok Sub-District Health Promotion Hospital in Tak Province,” Journal of Medicine and Health Sciences, vol. 28, no. 3, pp. 61-74, 2021.
[40] Tanimu J., Hamada M., Hassan M., Kakudi H., and Abiodun J., “A Machine Learning Method for Classification of Cervical Cancer,” Electronics, vol. 11, no. 463, pp. 1-23, 2022. https://doi.org/10.3390/electronics11030463
[41] UCI Machine Learning Repository, https://archive.ics.uci.edu/dataset/383/cervical+c ancer+risk+factors, Last Visited, 2024.
[42] UCI Machine Learning Repository, https://archive.ics.uci.edu/dataset/537/cervical+c ancer+behavior+risk, Last Visited, 2024.
[43] Wesabi Y., Choudhury A., and Won D., 460 The International Arab Journal of Information Technology, Vol. 22, No.3, May 2025 “Classification of Cervical Cancer Dataset,” in Proceedings of the Institute of Industrial and Systems Engineers Annual Conference, Orlando, pp. 1-6, 2018.
[44] Witten I., Frank E., and Hall M., Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann Publishers, 2011.
[45] Zhang L., Zhu Y., Song Y., Han Y., Sun D., Qin S., and Gao Y., “Intelligent Diagnosis of Cervical Cancer Based on Data Mining Algorithm,” Computational and Mathematical Methods in Medicine, vol. 2021, pp. 1-9, 2021. DOI:10.1155/2021/7690902
[46] Zorkafli M., Osman M., Isa I., Ahmad F., and Sulaiman S., “Classification of Cervical Cancer Using Hybrid Multi-layered Perceptron Network Trained by Genetic Algorithm,” Procedia Computer Science, vol. 163, pp. 494-501, 2019. https://doi.org/10.1016/j.procs.2019.12.132