Logo-jrhs
J Res Health Sci. 2015;15(3): 189-195.
PMID: 26411666
Scopus ID: 84944257565
  Abstract View: 262
  PDF Download: 81

Original Article

Presentation of a model-based data mining to predict lung cancer

Reza Shahhoseini, Ali Ghazvini, Mansour Esmaeilpour, Gholamhossein Pourtaghi, Shahram Tofighi*
*Corresponding Author: Email: shr_tofighi@yahoo.com

Abstract

Background: The data related to patients often have very useful information that can help us to resolve a lot of problems and difficulties in different areas. This study was performed to present a model-based data mining to predict lung cancer in 2014.

Methods: In this exploratory and modeling study, information was collected by two methods: library and field methods. All gathered variables were in the format of form of data transferring from those affected by pulmonary problems (303 records) as well as 26 fields including clinical and environmental variables. The validity of form of data transferring was obtained via consensus and meeting group method using purposive sampling through several meetings among members of research group and lung group. The methodology used was based on classification and prediction method of data mining as well as the method of supervision with algorithms of classification and regression tree using Clementine 12 software.

Results: For clinical variables, model's precision was high in three parts of training, test and validation. For environmental variables, maximum precision of model in training part relevant to C&R algorithm was equal to 76%, in test part relevant to Neural Net algorithm was equal to 61%, and in validation part relevant to Neural Net algorithm was equal to 57%.

Conclusion: In clinical variables, C5.0, CHAID, C & R models were stable and suitable for detection of lung cancer. In addition, in environmental variables, C & R model was stable and suitable for detection of lung cancer. Variables such as pulmonary nodules, effusion of plural fluid, diameter of pulmonary nodules, and place of pulmonary nodules are very important variables that have the greatest impact on detection of lung cancer.

First Name
Last Name
Email Address
Comments
Security code


Abstract View: 263

Your browser does not support the canvas element.


PDF Download: 81

Your browser does not support the canvas element.

Submitted: 11 Apr 2015
Revision: 22 Sep 2015
ePublished: 08 Sep 2015
EndNote EndNote

(Enw Format - Win & Mac)

BibTeX BibTeX

(Bib Format - Win & Mac)

Bookends Bookends

(Ris Format - Mac only)

EasyBib EasyBib

(Ris Format - Win & Mac)

Medlars Medlars

(Txt Format - Win & Mac)

Mendeley Web Mendeley Web
Mendeley Mendeley

(Ris Format - Win & Mac)

Papers Papers

(Ris Format - Win & Mac)

ProCite ProCite

(Ris Format - Win & Mac)

Reference Manager Reference Manager

(Ris Format - Win only)

Refworks Refworks

(Refworks Format - Win & Mac)

Zotero Zotero

(Ris Format - Firefox Plugin)