Document Type: Research Articles
Department of Computer Engineering, Islamic Azad University of Qazvin (QIAU), Qazvin, Iran.
Social Determinants of Health Research Center, Qazvin University of Medical Sciences, Qazvin, Iran.
Department of Electrical Engineering, Amirkabir University of Technology, Tehran, Iran.
General Surgeon, Assistant Professor. School of Medicine, Qazvin University of Medical Sciences, Qazvin, Iran.
Objective: Gastric cancer is one of the most common types of cancers, which will result in irreparable harm in the case of misdiagnosis or late diagnosis. The purpose of this study is to investigate the capability of data mining techniques and disease risk factor characteristics to predict and diagnose the gastric cancer. Methods: In this retrospective descriptive-analytic study, we selected 405 samples from two groups of patient and healthy participants. A total of 11 characteristics and risk factors were examined. we used four Machine learning methods, Include support vector machine (SVM), decision tree (DT), naive Bayesian model, and k nearest neighborhood (KNN) to classify the patients with gastric cancer. The evaluation criteria to investigate the model on the database of patients with gastric cancer included Recall, Precision, F-score, and Accuracy. Data was analyzed using MATLAB® software, version 3.2 (Mathworks Inc., Natick, MA, USA). Results: Based on the results achieved from the evaluation of four methods, the accuracy rates of SVM, DT, naive Bayesian model, and KNN algorithms were 90.08, 87.89, 87.60, and 87.60 percent, respectively. The findings showed that the highest level of F-Score was related to the SVM (91.99); whereas, the lowest rate was associated with the KNN algorithm (87.17). Conclusion: According to the findings, the SVM algorithm showed the best results in classification of Test samples. So, this intelligent system can be used as a physician assistant in medical education hospitals, where the diagnosis processes are performed by medical students.