<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>West Asia Organization for Cancer Prevention (WAOCP), APOCP's West Asia Chapter.</PublisherName>
				<JournalTitle>Asian Pacific Journal of Cancer Prevention</JournalTitle>
				<Issn>1513-7368</Issn>
				<Volume>18</Volume>
				<Issue>6</Issue>
				<PubDate PubStatus="epublish">
					<Year>2017</Year>
					<Month>06</Month>
					<Day>01</Day>
				</PubDate>
			</Journal>
<ArticleTitle>Analyzing a Lung Cancer Patient Dataset with the Focus on Predicting Survival Rate One Year after Thoracic Surgery</ArticleTitle>
<VernacularTitle></VernacularTitle>
			<FirstPage>1531</FirstPage>
			<LastPage>1536</LastPage>
			<ELocationID EIdType="pii">46973</ELocationID>
			
<ELocationID EIdType="doi">10.22034/APJCP.2017.18.6.1531</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Peyman </FirstName>
					<LastName>Rezaei Hachesu</LastName>
<Affiliation>Department of Health Information technology, School of Health management and Informatics, Tabriz University of Medical
Sciences, Tabriz, Iran.</Affiliation>

</Author>
<Author>
					<FirstName>Nazila </FirstName>
					<LastName>Moftian</LastName>
<Affiliation>Department of Health Information technology, School of Health management and Informatics, Tabriz University of Medical
Sciences, Tabriz, Iran.</Affiliation>

</Author>
<Author>
					<FirstName>Mahsa </FirstName>
					<LastName>Dehghani</LastName>
<Affiliation>Department of Health Information technology, School of Health management and Informatics, Tabriz University of Medical
Sciences, Tabriz, Iran.</Affiliation>

</Author>
<Author>
					<FirstName>Taha </FirstName>
					<LastName>Samad-Soltani</LastName>
<Affiliation>Department of Health Information technology, School of Health management and Informatics, Tabriz University of Medical
Sciences, Tabriz, Iran.</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2017</Year>
					<Month>02</Month>
					<Day>07</Day>
				</PubDate>
			</History>
		<Abstract> &lt;br /&gt; &lt;strong&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Background: &lt;/span&gt;&lt;/strong&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;Data mining, a new concept introduced in the mid-1990s, can help researchers to gain new, profound insights and facilitate access to unanticipated knowledge sources in biomedical datasets. Many issues in the medical &lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;field are concerned with the diagnosis of diseases based on tests conducted on individuals at risk. Early diagnosis &lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;and treatment can provide a better outcome regarding the survival of lung cancer patients. Researchers can use data mining techniques to create effective diagnostic models. The aim of this study was to evaluate patterns existing in risk factor data of for mortality one year after thoracic surgery for lung cancer. &lt;/span&gt;&lt;/span&gt;&lt;strong&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Methods: &lt;/span&gt;&lt;/strong&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;The dataset used in this study contained 470 records and 17 features. First, the most important variables involved in the incidence of lung cancer were extracted using knowledge discovery and datamining algorithms such as naive Bayes, maximum expectation and then, using a regression analysis algorithm, a questionnaire was developed to predict the risk of death one year after lung surgery. Outliers in the data were excluded and reported using the clustering algorithm. Finally, a calculator was designed to estimate the risk for one-year post-operative mortality based on a scorecard algorithm. &lt;/span&gt;&lt;/span&gt;&lt;strong&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Results: &lt;/span&gt;&lt;/strong&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;The results revealed the most important factor involved in increased mortality to be large tumor size. Roles for type II &lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;diabetes and preoperative dyspnea in lower survival were also identified. The greatest commonality in classification of patients was Forced expiratory volume in first second (FEV1), based on levels of which patients could be classified &lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;into different categories. &lt;/span&gt;&lt;/span&gt;&lt;strong&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Conclusion: &lt;/span&gt;&lt;/strong&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;Development of a questionnaire based on calculations to diagnose disease can &lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;&lt;span style=&quot;font-family: Times New Roman,Times New Roman; font-size: small;&quot;&gt;be used to identify and fill knowledge gaps in clinical practice guidelines. &lt;/span&gt;&lt;/span&gt;</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Data mining</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">lung neoplasms</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">cancer</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Informatics</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">knowledge</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://journal.waocp.org/article_46973_d7127f90ffaed271903894284e1be15a.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
