<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">IJO</journal-id>
<journal-title-group>
<journal-title>International Journal of Oncology</journal-title></journal-title-group>
<issn pub-type="ppub">1019-6439</issn>
<issn pub-type="epub">1791-2423</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/ijo.2016.3499</article-id>
<article-id pub-id-type="publisher-id">ijo-49-01-0361</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject></subj-group></article-categories>
<title-group>
<article-title>Incorporating epistasis interaction of genetic susceptibility single nucleotide polymorphisms in a lung cancer risk prediction model</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>MARCUS</surname><given-names>MICHAEL W.</given-names></name><xref rid="af1-ijo-49-01-0361" ref-type="aff">1</xref><xref rid="fn1-ijo-49-01-0361" ref-type="author-notes">*</xref><xref ref-type="corresp" rid="c1-ijo-49-01-0361"/></contrib>
<contrib contrib-type="author">
<name><surname>RAJI</surname><given-names>OLAIDE Y.</given-names></name><xref rid="af1-ijo-49-01-0361" ref-type="aff">1</xref><xref rid="fn1-ijo-49-01-0361" ref-type="author-notes">*</xref></contrib>
<contrib contrib-type="author">
<name><surname>DUFFY</surname><given-names>STEPHEN W.</given-names></name><xref rid="af2-ijo-49-01-0361" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author">
<name><surname>YOUNG</surname><given-names>ROBERT P.</given-names></name><xref rid="af3-ijo-49-01-0361" ref-type="aff">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>HOPKINS</surname><given-names>RAEWYN J.</given-names></name><xref rid="af3-ijo-49-01-0361" ref-type="aff">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>FIELD</surname><given-names>JOHN K.</given-names></name><xref rid="af1-ijo-49-01-0361" ref-type="aff">1</xref></contrib></contrib-group>
<aff id="af1-ijo-49-01-0361">
<label>1</label>Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Institute of Translational Medicine, Liverpool L7 8TX, UK</aff>
<aff id="af2-ijo-49-01-0361">
<label>2</label>Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK</aff>
<aff id="af3-ijo-49-01-0361">
<label>3</label>School of Biological Sciences, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand</aff>
<author-notes>
<corresp id="c1-ijo-49-01-0361">Correspondence to: Dr Michael W. Marcus, Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Institute of Translational Medicine, The Apex Building, 6 West Derby Street, Liverpool L7 8TX, UK, E-mail: <email>m.w.marcus@liv.ac.uk</email></corresp><fn id="fn1-ijo-49-01-0361">
<label>*</label>
<p>Contributed equally</p></fn></author-notes>
<pub-date pub-type="collection">
<month>07</month>
<year>2016</year></pub-date>
<pub-date pub-type="epub">
<day>25</day>
<month>04</month>
<year>2016</year></pub-date>
<volume>49</volume>
<issue>1</issue>
<fpage>361</fpage>
<lpage>370</lpage>
<history>
<date date-type="received">
<day>08</day>
<month>01</month>
<year>2016</year></date>
<date date-type="accepted">
<day>17</day>
<month>02</month>
<year>2016</year></date></history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2016, Spandidos Publications</copyright-statement>
<copyright-year>2016</copyright-year></permissions>
<abstract>
<p>Incorporation of genetic variants such as single nucleotide polymorphisms (SNPs) into risk prediction models may account for a substantial fraction of attributable disease risk. Genetic data, from 2385 subjects recruited into the Liverpool Lung Project (LLP) between 2000 and 2008, consisting of 20 SNPs independently validated in a candidate-gene discovery study was used. Multifactor dimensionality reduction (MDR) and random forest (RF) were used to explore evidence of epistasis among 20 replicated SNPs. Multivariable logistic regression was used to identify similar risk predictors for lung cancer in the LLP risk model for the epidemiological model and extended model with SNPs. Both models were internally validated using the bootstrap method and model performance was assessed using area under the curve (AUC) and net reclassification improvement (NRI). Using MDR and RF, the overall best classifier of lung cancer status were SNPs rs1799732 (DRD2), rs5744256 (IL-18), rs2306022 (ITGA11) with training accuracy of 0.6592 and a testing accuracy of 0.6572 and a cross-validation consistency of 10/10 with permutation testing P&lt;0.0001. The apparent AUC of the epidemiological model was 0.75 (95&#x00025; CI 0.73&#x02013;0.77). When epistatic data were incorporated in the extended model, the AUC increased to 0.81 (95&#x00025; CI 0.79&#x02013;0.83) which corresponds to 8&#x00025; increase in AUC (DeLong's test P=2.2e-<sup>16</sup>); 17.5&#x00025; by NRI. After correction for optimism, the AUC was 0.73 for the epidemiological model and 0.79 for the extended model. Our results showed modest improvement in lung cancer risk prediction when the SNP epistasis factor was added.</p></abstract>
<kwd-group>
<kwd>epistasis</kwd>
<kwd>single nucleotide polymorphisms</kwd>
<kwd>lung cancer</kwd>
<kwd>risk models</kwd>
<kwd>multifactor dimensionality reduction</kwd>
<kwd>random forest</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Lung cancer risk prediction models provide an estimate of individual's risk of developing lung cancer such that &#x02018;at-risk&#x02019; subjects can be targeted for preventive and treatment interventions (<xref rid="b1-ijo-49-01-0361" ref-type="bibr">1</xref>). Risk models hold promise for improving patient care by aiding the clinicians decision making process regarding choice of interventions and/or treatments. Risk models can also guide selection of individuals at the population level, for screening: this ensures limited resources are focussed on those individuals who are most likely to benefit. This risk guiding strategy ensures minimisation of unnecessary, invasive and potentially harmful interventions. Existing lung cancer absolute risk prediction models are mostly based on traditional epidemiological and/or clinical risk factors (<xref rid="b2-ijo-49-01-0361" ref-type="bibr">2</xref>&#x02013;<xref rid="b7-ijo-49-01-0361" ref-type="bibr">7</xref>), limiting their predictive and discriminative abilities. For an improved precision, incorporation of genetic and molecular markers of disease in risk models has been advocated (<xref rid="b8-ijo-49-01-0361" ref-type="bibr">8</xref>) and aided by recent proliferation of genetic/genomic research which has led to the identification of susceptibility genes and biological markers in many diseases (<xref rid="b9-ijo-49-01-0361" ref-type="bibr">9</xref>&#x02013;<xref rid="b12-ijo-49-01-0361" ref-type="bibr">12</xref>).</p>
<p>Common gene variants involved in lung cancer have been recently identified through a number of large, collaborative, genome-wide association studies. Susceptibility genes identified to date include those on chromosomes 5p15.33, 6p21, and 15q24-25.1 (<xref rid="b13-ijo-49-01-0361" ref-type="bibr">13</xref>&#x02013;<xref rid="b15-ijo-49-01-0361" ref-type="bibr">15</xref>). Apart from these, other genetic loci have also been identified in candidate gene association studies targeting specific molecular pathways; such as genes encoding proteins in cell cycle control, oxidant response, apoptosis, DNA repair, cell adhesion and airways inflammatory response (<xref rid="b16-ijo-49-01-0361" ref-type="bibr">16</xref>,<xref rid="b17-ijo-49-01-0361" ref-type="bibr">17</xref>).</p>
<p>While genomics research has been very fruitful in identifying these common, low-risk allelic variants, there is a growing scepticism regarding their usefulness in risk prediction. It has been shown that risk profiles generated by common low-moderate susceptibility loci, in a simple additive model, provides limited discrimination (<xref rid="b18-ijo-49-01-0361" ref-type="bibr">18</xref>,<xref rid="b19-ijo-49-01-0361" ref-type="bibr">19</xref>). The limited contribution of single nucleotide polymorphisms (SNPs) to risk profiling has been partly blamed on restriction to a limited number of significant alleles, methodological limitations regarding assessment of model performance and statistical approaches for incorporating the variants (<xref rid="b19-ijo-49-01-0361" ref-type="bibr">19</xref>). Whilst the usual approach has been to utilise only the significant variants for risk profiling, an improved disease prediction may be attained by accounting for a large ensemble of markers (<xref rid="b20-ijo-49-01-0361" ref-type="bibr">20</xref>). For the relatively few markers arising from candidate-gene studies, incorporation of the interactive effect of these genes, through epistasis modelling, may provide better predictions beyond that afforded by the limited effect of multiple loci using additive effects (<xref rid="b21-ijo-49-01-0361" ref-type="bibr">21</xref>). Models including epistatic interactions take into account the complex biological relationships among the loci and extend the traditional method that focuses only on additive score using a weighted or unweighted number of risk alleles, which assume independence between the markers (<xref rid="b22-ijo-49-01-0361" ref-type="bibr">22</xref>).</p>
<p>In the past three decades, improvements in risk prediction models brought about by the inclusion of markers and genetic factors were quantified using changes in the area under the receiving-operating characteristic curve (AUC) (<xref rid="b23-ijo-49-01-0361" ref-type="bibr">23</xref>). Recently, an increasing popular measure of evaluating improvements in risk predictions, the net reclassification improvement was introduced (<xref rid="b24-ijo-49-01-0361" ref-type="bibr">24</xref>). This measure involves cross-tabulating categories of predicted risk for 2 models, usually one with the new marker under study and the other without it, to see how persons are classified differently when these models are used (<xref rid="b25-ijo-49-01-0361" ref-type="bibr">25</xref>).</p>
<p>In this study, we investigated the presence of epistasis among a panel of SNPs previously validated individually in lung cancer (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>) and used both area under the receiver operating characteristic (AUC) analysis and net reclassification improvement (NRI) to assess the contribution of adding an interactive epistatic effect to an extensively validated clinical-based risk model for lung cancer.</p></sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title>Study population</title>
<p>This study was performed as part of the Liverpool Lung Project (LLP). Details of recruitment procedure, study design and validation have been previously reported (<xref rid="b3-ijo-49-01-0361" ref-type="bibr">3</xref>,<xref rid="b27-ijo-49-01-0361" ref-type="bibr">27</xref>). Briefly, incident cases of histologically or cytologically confirmed lung cancer, ages between 20 and 80 years, were included. Lung cancer included any of topographical subcategories of code C34 of the International Classification of Disease for Oncology 9th revision. Two population controls per case, matched on year of birth (&#x000B1;2 years) and gender, were selected from registers of general practitioners in Liverpool area. All participants were Caucasians, residents in the Liverpool area. The study protocol was approved by the Liverpool Research Ethics Committee, and all research participants provided written informed consent in accordance with the Declaration of Helsinki.</p>
<p>In this study, we utilised complete genotype data on individuals included in the independent validation of SNPs identified in a candidate-gene genetic association study (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>). The data comprises of 2385 subjects (cases=718, controls=1667) selected from individuals recruited into the LLP between 2000 and 2008. Of this number, 1362 (cases=418 and controls=914) were included in LLP case-control data used to develop the LLP risk model (<xref rid="b3-ijo-49-01-0361" ref-type="bibr">3</xref>). Data on epidemiological, clinical and lifestyle factors were collected using a standardised questionnaire supplemented with hospital case note reviews conducted by trained LLP research nurses. Information documented includes: patients smoking status (smoking duration), previous history of pulmonary diseases (pneumonia, COPD and bronchitis), previous history of malignant diseases excluding skin melanoma, occupational exposure to asbestos, family history of lung cancer with age at onset, and case diagnosis details (date of diagnosis, histological subtype and staging).</p>
<p>Genetic data consist of 20 SNPs independently validated from 157 SNPs screened in a candidate-gene discovery study; details of selection and genotyping have been described elsewhere (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>). Briefly, 157 candidate SNPs were screened in a discovery cohort of 439 subjects (200 controls and 239 lung cancer cases), which identified 30 SNPs associated with either the healthy smokers (protective) or lung cancer (susceptibility) phenotype. After genotyping this 30 SNP panel in a validation cohort of 491 subjects (248 controls and 207 lung cancers) and, using the same protective and susceptibility genotypes from the discovery cohort, a 20 SNP panel were selected based on replication of SNP associations in the validation cohort that includes variants in the metabolism of smoking-derived carcinogens (NAT2 and CYP2E1), inflammatory cytokines &#x0005B;interleukins 1(IL1B), 8(IL8), and 18(IL18), tissue necrosis factor &#x003B1;1 receptor (TNFR1), toll-like receptor 9 (TLR9)&#x0005D;, smoking addiction &#x0005B;dopamine D2 receptor (DRD2) and Dopamine transporter 1(DAT1)&#x0005D;, nicotine dependency &#x0005B;&#x003B1;5-nAChR (CHRNA3)&#x0005D;, antioxidant response to smoking &#x0005B;&#x003B1;1 anti-chymotrypsin (SERPINA3) and extracellular super-oxide dismutase (SOD3)&#x0005D;, cell cycle control, DNA repair and apoptosis (XPD, TP73, Bcl-2, FasL, Cerb1, and REV1) and integrins (ITGA11, ITGB3) implicated in apoptosis. Genomic DNA was extracted from whole blood samples by standard salt-based methods and purified genomic DNA was aliquoted (10 ng/&#x003BC;l concentration) into 96-well plates. Genotyping was performed on a Sequenom&#x02122; system (Sequenom Autoflex Mass Spectrometer and Samsung 24 pin nanodispenser) (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>).</p></sec>
<sec>
<title>Statistical analysis</title>
<p>Characteristics of the subjects in the cases and controls were compared using t-test for continuous variables and &#x003C7;<sup>2</sup> test or Fisher's exact test for discrete variables as appropriate. Genotype and allele frequencies were checked for each SNP for Hardy-Weinberg equilibrium (HWE).</p></sec>
<sec>
<title>Identification of SNPs epistasis</title>
<p>The multifactor dimensionality reduction (MDR) and random forest (RF) were used to investigate gene-gene interactions by identifying SNP combinations that provide the best discrimination of the status of the subjects. MDR is a non-parametric, model-free method that utilises a constructive induction technique to collapse high-dimensional genetic data into a single dimension (<xref rid="b28-ijo-49-01-0361" ref-type="bibr">28</xref>,<xref rid="b29-ijo-49-01-0361" ref-type="bibr">29</xref>). It pools multi-locus genotypes into high and low risk groups using an exhaustive search to identify optimal combination of polymorphisms, which can then be evaluated for its ability to classify or predict disease status. In our implementation of MDR, three separate genotypes were analysed for each SNP. The Relief-F algorithm as implemented in the MDR was used as a first approach to select among the 20 SNPs that are most likely to interact. An exhaustive search of all possible 1&#x02013;5 loci were then explored using 10-fold cross validation as described by Hahn <italic>et al</italic> (<xref rid="b28-ijo-49-01-0361" ref-type="bibr">28</xref>). Cross-validation allows estimation of the prediction error by leaving out a portion of the data as an independent test set. With 10-fold cross-validation, the data are divided into 10 equal parts, the model was developed on 9/10 of the data (i.e. the training data) and then evaluated on the remaining 1/10 of the data (i.e. the independent testing data). This is repeated for each possible 9/10 and 1/10 of the data and the resulting ten prediction errors are averaged (<xref rid="b29-ijo-49-01-0361" ref-type="bibr">29</xref>). MDR, then, seeks to find the single-locus or multi-locus predictor(s) for explaining the outcome (based on a balanced accuracy measure - the arithmetic mean of sensitivity and specificity), based on the available genomic information (<xref rid="b30-ijo-49-01-0361" ref-type="bibr">30</xref>). The prediction accuracy and cross-validation consistency defined as the number of cross-validation replicates (partitions) in which that same n-locus predictor(s) was chosen as the best predictor of lung cancer status i.e. the number of replicates in which it minimised the classification error were used to select the best SNPs in each 1 to 5-locus combination (<xref rid="b31-ijo-49-01-0361" ref-type="bibr">31</xref>). The overall best SNP classifier of lung cancer status was selected as the one with the maximum prediction accuracy and cross-validation consistency and evaluated statistically using 1000-fold permutation test.</p>
<p>For comparison, we used the freely available Willows software package for generating RF (<xref rid="b32-ijo-49-01-0361" ref-type="bibr">32</xref>). RF ranks variables by a variable importance index, a measure which reflects the &#x02018;importance&#x02019; of a variable on the basis of the classification accuracy, while considering the interaction among variables (<xref rid="b33-ijo-49-01-0361" ref-type="bibr">33</xref>). A classification tree was built by the recursive partitioning method; each tree is constructed using a different cohort of bootstrap samples from the original cohort. Approximately one-third of the samples are left out of the bootstrap (oob) samples and hence not used in the construction of the tree. The number of trees was set to 10,000 and the default values of the other parameters as provided by the program were used. Several classification trees were created with replacement from the original data imput into the program. To determine the importance of an SNP, first the values of the SNP in the oob samples are randomly permuted; then both the original oob samples and the permuted oob samples are classified by the corresponding tree. The difference in the correct classification rates between the original and permuted oob samples determines the importance of the SNP, and the variable importance is obtained by averaging the differences over all trees in the random forest (<xref rid="b32-ijo-49-01-0361" ref-type="bibr">32</xref>,<xref rid="b34-ijo-49-01-0361" ref-type="bibr">34</xref>).</p></sec>
<sec>
<title>Risk model predictions and incorporation of SNPs epistasis</title>
<p>Risk prediction was performed using the same risk factors included in the LLP risk model (<xref rid="b3-ijo-49-01-0361" ref-type="bibr">3</xref>). Multivariable logistic regression was employed to generate estimates of predicted 5-year absolute risk of lung cancer in i) a model with epidemiological data and ii) an extended model with both genetic and epidemiological data. The baseline risk (&#x003B1;, the constant term in the regression model) for the prediction of 5-year absolute risk using the extended model with both genetic and epidemiological data was recalculated. The method for calculating the baseline &#x003B1; from age- and gender-specific lung cancer incidence rates from the Liverpool area has been described (<xref rid="b3-ijo-49-01-0361" ref-type="bibr">3</xref>). The only difference is that the probability model now includes information on rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11).</p>
<p>The area under the receiver-operating characteristics (AUC) was used to i) assess the discriminatory ability of the models, and ii) compare the models with and without SNPs. The increase in AUC was evaluated and tested for significance using DeLong test (<xref rid="b35-ijo-49-01-0361" ref-type="bibr">35</xref>). Furthermore, the net reclassification improvement (NRI) was used to assess the added discrimination offered by the addition of SNPs to the risk model (<xref rid="b24-ijo-49-01-0361" ref-type="bibr">24</xref>). Bootstrapping techniques were utilised for internal validation of the models (<xref rid="b36-ijo-49-01-0361" ref-type="bibr">36</xref>). Bootstrap samples were drawn 1000 times to adjust model parameters for overfitting. Improvement in model calibration was assessed using Akaike information criteria (AIC) and Bayesian information criteria (BIC). Unless otherwise stated, all analyses were performed using R version 3.1.1 and STATA<sup>&#x000AE;</sup> version 13.1 (StataCorp LP, College Station, TX, USA).</p></sec></sec>
<sec sec-type="results">
<title>Results</title>
<p>Seven hundred and eighteen cases and 1667 population controls were successfully genotyped for 20 SNPs, which had been independently validated from 157 SNPs screened in a candidate-gene discovery study (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>). <xref rid="tI-ijo-49-01-0361" ref-type="table">Table I</xref> presents the general demographic and clinical characteristics of the study population. Men constituted the majority of the study population cases (57.7&#x00025;) and (58.1&#x00025;) controls. The proportion of ever smokers was significantly higher in cases (93.2&#x00025;) compared with controls (65.2&#x00025;). Significant differences were observed in other risk factors including smoking duration, prior diagnosis of pneumonia, occupational exposure to asbestos, and prior diagnosis of tumour (P&lt;0.001).</p>
<p><xref rid="tII-ijo-49-01-0361" ref-type="table">Table II</xref> presents the results of additive gene-dosage model for all SNPs. Heterozygosity for rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11) conferred an increased risk for lung cancer in reference to the wild-type genotype &#x0005B;OR 1.74 (95&#x00025; CI 1.31&#x02013;2.32); 1.42 (95&#x00025; CI 1.16&#x02013;1.72) and 2.53 (95&#x00025; CI 2.06&#x02013;3.12), respectively&#x0005D;. The homozygote genotype for rs16969968 (CHRNA3/5), rs13181 (ERCC2), rs5744256 (IL-18) and rs2306022 (ITGA11) increased the risk of developing lung cancer with reference to the wild-type &#x0005B;OR 1.46 (95&#x00025; CI 1.11&#x02013;1.93); 1.46 (95&#x00025; CI 1.13&#x02013;1.89); 4.07 (95&#x00025; CI 3.11&#x02013;5.31) 4.09 (95&#x00025; CI 2.21&#x02013;7.56), respectively&#x0005D;.</p>
<p><xref rid="tIII-ijo-49-01-0361" ref-type="table">Table III</xref> summarises the result obtained from the MDR analysis investigating epistatic effects among the SNPs. The best candidate classifiers of lung cancer status based on five SNP loci selected using the cross-validation consistency, training and testing accuracy were as follows: Single locus: rs2306022 (ITGA11); 2 loci: rs5744256 (IL-18), rs2306022 (ITGA11); 3 loci: rs1799732 (DRD2), rs5744256(IL-18), rs2306022 (ITGA11); 4 loci: rs1696998 (CHRNA3/5), rs1799732 (DRD2), rs5744256 (IL-18), rs2306022 (ITGA11); 5 loci: rs1799732 (DRD2), rs763110 (FasL), rs5744256 (IL-18), rs4073 (IL-8), rs2306022 (ITGA11). The 3 loci consisting of SNPs rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11) appears to be the overall best classifier of lung cancer status. These loci had training and testing accuracy of 0.6592 and 0.6572 respectively, and the cross validation consistency of 10/10 (model selected as the best of 3 in 10 CV) P&lt;0.0001 (permutation test).</p>
<p><xref rid="tIV-ijo-49-01-0361" ref-type="table">Table IV</xref> shows the importance score results in the RF. RF ranks variables by a variable importance index, which is an indication of the importance of a variable on the basis of classification accuracy while considering interaction among variables. The three SNPs &#x0005B;rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11)&#x0005D; selected as the overall best classifier of lung cancer status in MDR were also ranked top 3 by RF using variable importance index.</p>
<p><xref rid="tV-ijo-49-01-0361" ref-type="table">Table V</xref> summarises reclassifications for cases and controls using epidemiological model and models with SNPs &#x0005B;rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11)&#x0005D;. Subjects were categorised into three different thresholds; low-risk (&lt;0.91), intermediate risk (0.91 to 5.12), and high-risk (&gt;5.12) groups. The threshold values were defined from the predicted 5-year absolute risks for the original LLP control samples (n=1,272), assuming the risk distribution in this group is similar to that of the general Liverpool population. The upper threshold (5.12) corresponds to the value for the top 20&#x00025; of predicted absolute risks in the population; individuals whose 5-year predicted absolute risk is equal to or above this value are designated as &#x02018;high risk&#x02019; group. The lower threshold value of 0.91 corresponds to the bottom 40&#x00025; of absolute risks in the control population and represents the &#x02018;low risk&#x02019; group. This definition of high risk and low risk groups was used in an earlier study (<xref rid="b13-ijo-49-01-0361" ref-type="bibr">13</xref>). Overall, 42.7&#x00025; of cases (311/727) and 35.7&#x00025; of controls (592/1657) had their predicted risks re-classified into other risk groups when SNPs were incorporated into risk prediction model. This reclassification showed improvement (upward shift) in approximately 25&#x00025; of cases and became worse (downward shift) for 18&#x00025; resulting in a net gain of ~6&#x00025;. The net gain was higher for controls (10&#x00025;) with overall improvement in risk (downward shift) for 23&#x00025; and worse performance (upward shift) for 13&#x00025;. The NRI was estimated at 13.5&#x00025; (P&lt;0.001).</p>
<p><xref rid="tVI-ijo-49-01-0361" ref-type="table">Table VI</xref> depicts the odds ratios (OR) and 95&#x00025; confidence intervals (95&#x00025; CI) of the multivariate logistic regression models for the epidemiological model and the extended model with SNPs. The ORs and 95&#x00025; CI for both models were comparable which suggests the absence of any serious confounding effects of SNPs on the relationship between each of the other clinical and epidemiological risk factors and lung cancer risk. Model fit was assessed using Akaike information criterion (AIC) and Bayes information criteria (BIC). There was an improvement in model fit as indicated by the reduction of the AIC from 2098.42 from the epidemiological model to 1930.14 for the extended model with SNPs. Likewise, a similar reduction was observed in BIC from 2167.75 from the epidemiological model to 2016.80 for the extended model with SNPs. <xref rid="f1-ijo-49-01-0361" ref-type="fig">Fig. 1</xref> shows the AUC of the epidemiological model and extended model with SNPs. The apparent AUC of the epidemiological model without SNPs was 0.75 (95&#x00025; CI 0.73&#x02013;0.77). When epistatic data were incorporated in the extended model, the AUC increased to 0.81 (95&#x00025; CI 0.79&#x02013;0.83) which corresponds to 8&#x00025; increase in AUC for the model with SNPs (DeLong's test P=2.2e-<sup>16</sup>). After correction for optimism, the AUC was 0.73 for the epidemiological model and 0.79 for the extended model.</p></sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>This study demonstrates the use of comprehensive analytical techniques for investigating the contribution of adding an interactive effect of a panel of genetic markers (SNPs) to the prediction of individual absolute risk of developing lung cancer, using a risk model similar to the LLP model (<xref rid="b3-ijo-49-01-0361" ref-type="bibr">3</xref>). Using genotype data from 2385 individuals included in the independent validation of SNPs identified in a candidate-gene genetic association study from the LLP case-control study, we found the 3 loci genotype interaction rs2306022 (ITGA11), rs5744256 (IL-18) and rs1799732 (DRD2) provided the best classifier of disease status using both MDR and RF. Adding these SNPs to a clinically-based lung cancer risk model lead to an increase in AUC (0.75 to 0.81); and increase in net reclassification (NRI=17.5&#x00025;).</p>
<p>We utilised two different approaches; discrimination and reclassification to evaluate the contribution of adding an interactive epistatic effect to a risk model for lung cancer. AUC is the most popular metric used for measuring the discriminatory power of a model to correctly classify subjects with or without a disease. Our result showed 8&#x00025; increase in AUC (DeLong's test P=2.2e-<sup>16</sup>) for risk prediction in the extended model with SNPs (AUC=0.81) compared with the epidemiological model without SNPs (AUC=0.75), which is higher than that reported by Li <italic>et al</italic> (<xref rid="b9-ijo-49-01-0361" ref-type="bibr">9</xref>). Li <italic>et al</italic>, in a Chinese case-control study, genotyped five SNPs identified in Genome Wide Association study of 5068 subjects. The genetic risk scores based on these SNPs were estimated by two approaches: a simple risk alleles count (cGRS) and a weighted method (wGRS). Their AUC in combination with the bootstrap resampling method was used to assess the predictive performance of the genetic risk score for lung cancer. Smoking history contributed significantly to lung cancer (P&lt;0.001) risk &#x0005B;AUC=0.619 (0.603&#x02013;0.634)&#x0005D;, and incorporated with wGRS gave an AUC value of 0.639 (0.621&#x02013;0.652) after adjustment for over-fitting (<xref rid="b9-ijo-49-01-0361" ref-type="bibr">9</xref>). For clinical risk prediction, it is expedient that a new risk model correctly classify individuals into higher or lower risk categories (<xref rid="b37-ijo-49-01-0361" ref-type="bibr">37</xref>). Pencina <italic>et al</italic> introduced a new metric, the NRI that assesses the improvement in model performance by quantifying the degree of correct classification (<xref rid="b24-ijo-49-01-0361" ref-type="bibr">24</xref>). By applying the NRI, we demonstrated that the addition of SNPs lead to a 17.5&#x00025; improvement in the risk classification of the subjects.</p>
<p>This study is the first to replicate the association between the ITGA11 locus and lung cancer described by Young and colleagues (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>). ITGA11 (integrin &#x003B1;11) belongs to the family of transmembrane receptors that mediate physical interactions between cells and extracellular matrix protein collagens (<xref rid="b38-ijo-49-01-0361" ref-type="bibr">38</xref>). ITGA11 is localised to stromal fibroblast and commonly overexpressed in non-small cell lung cancer (NSCLC) (<xref rid="b38-ijo-49-01-0361" ref-type="bibr">38</xref>). Earlier studies have reported that the interactions of tumour cells with the stroma play a crucial role in tumour growth, invasion, metastases, angiogenesis, and chemoresistance (<xref rid="b38-ijo-49-01-0361" ref-type="bibr">38</xref>&#x02013;<xref rid="b41-ijo-49-01-0361" ref-type="bibr">41</xref>). It has been shown that carcinoma-associated fibroblasts in NSCLC express higher levels of ITGA11. One of the factors which are affected by higher levels of ITGA11 during tumour growth is IGF2 (<xref rid="b38-ijo-49-01-0361" ref-type="bibr">38</xref>,<xref rid="b42-ijo-49-01-0361" ref-type="bibr">42</xref>). Higher levels of IGF2, in turn, can stimulate growth of tumour epithelial cells leading to tumour progression and metastasis (<xref rid="b38-ijo-49-01-0361" ref-type="bibr">38</xref>). IL18 (Interleukin-18) is a multifunctional cytokine (an extracellular signalling molecule) that augments IFN-&#x003B3; production and affects tumour immune response, leukocyte recruitment, cancer proliferation, and angiogenesis (<xref rid="b43-ijo-49-01-0361" ref-type="bibr">43</xref>,<xref rid="b44-ijo-49-01-0361" ref-type="bibr">44</xref>). An earlier study reported the presence of IL-18 in induced sputum of lung cancer patients (<xref rid="b45-ijo-49-01-0361" ref-type="bibr">45</xref>). Farjadfar <italic>et al</italic> also reported an association between IL-18 and lung cancer in a case-control study including 73 lung cancer patients (53 squamous carcinoma and 20 small cell lung carcinoma), and 97 healthy regional aged-matched individuals (<xref rid="b46-ijo-49-01-0361" ref-type="bibr">46</xref>). They suggested that their finding may be attributed to the disruption of the potential of cAMP responsive element-binding protein site and subsequent reduction in IL-18 production as observed in other cancer types (<xref rid="b46-ijo-49-01-0361" ref-type="bibr">46</xref>). Reduced production of IL-18 can result in decreased IFN-&#x003B3; synthesis, imbalanced Th1/Th2 differentiation, insufficient activation of natural killer cells and CD8<sup>+</sup> lymphocytes (<xref rid="b46-ijo-49-01-0361" ref-type="bibr">46</xref>,<xref rid="b47-ijo-49-01-0361" ref-type="bibr">47</xref>) impairment of cancer cell apoptosis and efficient angiogenesis (<xref rid="b47-ijo-49-01-0361" ref-type="bibr">47</xref>,<xref rid="b48-ijo-49-01-0361" ref-type="bibr">48</xref>). DDR2 is a receptor tyrosine kinase that binds collagen as its endogenous ligand (<xref rid="b49-ijo-49-01-0361" ref-type="bibr">49</xref>). It has been previously shown to promote cell migration, proliferation, and survival when activated by ligand binding and phosphorylation (<xref rid="b49-ijo-49-01-0361" ref-type="bibr">49</xref>,<xref rid="b50-ijo-49-01-0361" ref-type="bibr">50</xref>). Harmmerman <italic>et al</italic> reported that DDR2 mutations are present in 4&#x00025; of small cell lung carcinomas; gain-of-function mutations in this gene are important oncogenic events and are amenable to therapy with dasatinib (<xref rid="b49-ijo-49-01-0361" ref-type="bibr">49</xref>). However, the mechanism of this mutation is unknown.</p>
<p>Since epistasis is known to contribute to unexplained genetic variation of common diseases, some genetic variants may have a weak and insignificant independent effect, but strong epistatic effect (biological interaction) with other variants. The integration of genetic variants in risk prediction models beyond the traditional epidemiological covariates have been applauded as the way forward in lung cancer risk prediction modelling (<xref rid="b8-ijo-49-01-0361" ref-type="bibr">8</xref>). The result presented in this study supports this notion. Genetic factors function primarily through complex mechanisms that involve interactions between multiple genes and environmental factors (<xref rid="b21-ijo-49-01-0361" ref-type="bibr">21</xref>,<xref rid="b22-ijo-49-01-0361" ref-type="bibr">22</xref>). However, the effect of interaction will be disregarded if the genetic effect is examined in isolation, without taking cognisance of potential interactions with other unknown factors (<xref rid="b31-ijo-49-01-0361" ref-type="bibr">31</xref>). The inherent nonlinearity implies that epistasis can occur among polymorphisms even in the absence of independent effect of the components, presenting computational intensive difficulties and statistical challenges because an infinite number of combinations that needs to be evaluated (<xref rid="b21-ijo-49-01-0361" ref-type="bibr">21</xref>,<xref rid="b22-ijo-49-01-0361" ref-type="bibr">22</xref>). The use of nonparametric and genetic model-free machine learning algorithms such as MDR (<xref rid="b28-ijo-49-01-0361" ref-type="bibr">28</xref>,<xref rid="b29-ijo-49-01-0361" ref-type="bibr">29</xref>) and RF (<xref rid="b32-ijo-49-01-0361" ref-type="bibr">32</xref>,<xref rid="b33-ijo-49-01-0361" ref-type="bibr">33</xref>) have been proposed to overcome the caveat of the traditional parametric statistics and have proven to be useful in this study. Here we see that the addition of the three SNPs increases the AUC, indicating that the interaction of these loci may be important. There was an improvement in model fit, as indicated by the reduction of the AIC and BIC. Furthermore, the SNPs used in this study were internally validated using a two stage design as described by Young <italic>et al</italic> (<xref rid="b26-ijo-49-01-0361" ref-type="bibr">26</xref>) and the use of HWE to minimise genotyping error are methodological advantages utilised to minimise false positive results.</p>
<p>To the best of our knowledge, this is the first study to evaluate the addition of these specific interactions of SNPs to a lung cancer risk model. However, the result of this study must be considered in the light of a number of limitations. First, our prediction model used covariates in the LLP risk model but did not include other risk factors for lung cancer such as chronic obstructive pulmonary diseases. However, the objective of this study is to evaluate the contribution of adding an interactive effect of a panel of genetic SNPs to the LLP risk model and the model has been validated in three independent external datasets with good discrimination and calibration (<xref rid="b27-ijo-49-01-0361" ref-type="bibr">27</xref>). Second, our study demonstrates how a modest increase in AUC can lead to a substantial improvement in reclassification as quantified by the NRI. This finding supports a suggestion by Pencina <italic>et al</italic> that a small increase in AUC might still be suggestive of a meaningful improvement (<xref rid="b24-ijo-49-01-0361" ref-type="bibr">24</xref>). Third, the LLP comprise predominantly Caucasians and therefore, the lack of ethnic diversity implies that this model may be less applicable in non-white population. Fourth, our approach to reclassification did not distinguish between persons with competing events and those without an event because both are classified as not having the event of interest. Fifth, the lack of validation of the epistatic model in an independent population is a limitation, however, the application of bootstrap correction for optimism addresses in part the lack of independent validation. Sixth, many of the 20 SNPs from <xref rid="tII-ijo-49-01-0361" ref-type="table">Table II</xref> failed to replicate in the current study, particularly given the larger sample size (718 cases, 1667 controls) in the current study when compared with previous study (248 cases, 207 controls). A plausible explanation for this observation may be due to the fact that the non-significant SNPs play lesser or no role in epistatic interaction. Finally, our threshold values for risk classification was based on the predicted 5-year absolute risk for original LLP control samples but the appropriateness of these threshold values in other populations is uncertain. Using different values could have affected the results of our reclassification analyses and subsequent clinical implications.</p>
<p>In conclusion, our result shows in principle how an SNP epistatic factor can be incorporated into an epidemiological risk prediction model. In this study, inclusion of SNPs rs1799732 (DRD2), rs5744256 (IL-18), rs2306022 (ITGA11) resulted in a modest improvement in lung cancer risk prediction.</p></sec></body>
<back>
<ack>
<title>Acknowledgements</title>
<p>The Liverpool Lung project was principally funded by the Roy Castle Lung Cancer Foundation, UK. M.W.M. was funded by National Institute of Health Research Health Technology Assessment (NIHR-HTA) under grant reference no. 09/61/01. R.P.Y. is a stockholder and unpaid Chief Scientific Officer of Synergenz Bioscience Inc. who hold patents on some gene markers of lung cancer risk. For the remaining authors no conflict of interest is declared.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-ijo-49-01-0361"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Field</surname><given-names>JK</given-names></name><name><surname>Chen</surname><given-names>Y</given-names></name><name><surname>Marcus</surname><given-names>MW</given-names></name><name><surname>Mcronald</surname><given-names>FE</given-names></name><name><surname>Raji</surname><given-names>OY</given-names></name><name><surname>Duffy</surname><given-names>SW</given-names></name></person-group><article-title>The contribution of risk prediction models to early detection of lung cancer</article-title><source>J Surg Oncol</source><volume>108</volume><fpage>304</fpage><lpage>311</lpage><year>2013</year><pub-id pub-id-type="doi">10.1002/jso.23384</pub-id><pub-id pub-id-type="pmid">23996507</pub-id></element-citation></ref>
<ref id="b2-ijo-49-01-0361"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bach</surname><given-names>PB</given-names></name><name><surname>Kattan</surname><given-names>MW</given-names></name><name><surname>Thornquist</surname><given-names>MD</given-names></name><name><surname>Kris</surname><given-names>MG</given-names></name><name><surname>Tate</surname><given-names>RC</given-names></name><name><surname>Barnett</surname><given-names>MJ</given-names></name><name><surname>Hsieh</surname><given-names>LJ</given-names></name><name><surname>Begg</surname><given-names>CB</given-names></name></person-group><article-title>Variations in lung cancer risk among smokers</article-title><source>J Natl Cancer Inst</source><volume>95</volume><fpage>470</fpage><lpage>478</lpage><year>2003</year><pub-id pub-id-type="doi">10.1093/jnci/95.6.470</pub-id><pub-id pub-id-type="pmid">12644540</pub-id></element-citation></ref>
<ref id="b3-ijo-49-01-0361"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cassidy</surname><given-names>A</given-names></name><name><surname>Myles</surname><given-names>JP</given-names></name><name><surname>van Tongeren</surname><given-names>M</given-names></name><name><surname>Page</surname><given-names>RD</given-names></name><name><surname>Liloglou</surname><given-names>T</given-names></name><name><surname>Duffy</surname><given-names>SW</given-names></name><name><surname>Field</surname><given-names>JK</given-names></name></person-group><article-title>The LLP risk model: An individual risk prediction model for lung cancer</article-title><source>Br J Cancer</source><volume>98</volume><fpage>270</fpage><lpage>276</lpage><year>2008</year><pub-id pub-id-type="doi">10.1038/sj.bjc.6604158</pub-id></element-citation></ref>
<ref id="b4-ijo-49-01-0361"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hoggart</surname><given-names>C</given-names></name><name><surname>Brennan</surname><given-names>P</given-names></name><name><surname>Tjonneland</surname><given-names>A</given-names></name><name><surname>Vogel</surname><given-names>U</given-names></name><name><surname>Overvad</surname><given-names>K</given-names></name><name><surname>&#x000D8;stergaard</surname><given-names>JN</given-names></name><name><surname>Kaaks</surname><given-names>R</given-names></name><name><surname>Canzian</surname><given-names>F</given-names></name><name><surname>Boeing</surname><given-names>H</given-names></name><name><surname>Steffen</surname><given-names>A</given-names></name><etal/></person-group><article-title>A risk model for lung cancer incidence</article-title><source>Cancer Prev Res (Phila)</source><volume>5</volume><fpage>834</fpage><lpage>846</lpage><year>2012</year><pub-id pub-id-type="doi">10.1158/1940-6207.CAPR-11-0237</pub-id></element-citation></ref>
<ref id="b5-ijo-49-01-0361"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Park</surname><given-names>S</given-names></name><name><surname>Nam</surname><given-names>BH</given-names></name><name><surname>Yang</surname><given-names>HR</given-names></name><name><surname>Lee</surname><given-names>JA</given-names></name><name><surname>Lim</surname><given-names>H</given-names></name><name><surname>Han</surname><given-names>JT</given-names></name><name><surname>Park</surname><given-names>IS</given-names></name><name><surname>Shin</surname><given-names>HR</given-names></name><name><surname>Lee</surname><given-names>JS</given-names></name></person-group><article-title>Individualized risk prediction model for lung cancer in Korean men</article-title><source>PLoS One</source><volume>8</volume><fpage>e54823</fpage><year>2013</year><pub-id pub-id-type="doi">10.1371/journal.pone.0054823</pub-id><pub-id pub-id-type="pmid">23408946</pub-id><pub-id pub-id-type="pmcid">3567090</pub-id></element-citation></ref>
<ref id="b6-ijo-49-01-0361"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Spitz</surname><given-names>MR</given-names></name><name><surname>Hong</surname><given-names>WK</given-names></name><name><surname>Amos</surname><given-names>CI</given-names></name><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Schabath</surname><given-names>MB</given-names></name><name><surname>Dong</surname><given-names>Q</given-names></name><name><surname>Shete</surname><given-names>S</given-names></name><name><surname>Etzel</surname><given-names>CJ</given-names></name></person-group><article-title>A risk model for prediction of lung cancer</article-title><source>J Natl Cancer Inst</source><volume>99</volume><fpage>715</fpage><lpage>726</lpage><year>2007</year><pub-id pub-id-type="doi">10.1093/jnci/djk153</pub-id><pub-id pub-id-type="pmid">17470739</pub-id></element-citation></ref>
<ref id="b7-ijo-49-01-0361"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tammemagi</surname><given-names>CM</given-names></name><name><surname>Pinsky</surname><given-names>PF</given-names></name><name><surname>Caporaso</surname><given-names>NE</given-names></name><name><surname>Kvale</surname><given-names>PA</given-names></name><name><surname>Hocking</surname><given-names>WG</given-names></name><name><surname>Church</surname><given-names>TR</given-names></name><name><surname>Riley</surname><given-names>TL</given-names></name><name><surname>Commins</surname><given-names>J</given-names></name><name><surname>Oken</surname><given-names>MM</given-names></name><name><surname>Berg</surname><given-names>CD</given-names></name><etal/></person-group><article-title>Lung cancer risk prediction: Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial models and validation</article-title><source>J Natl Cancer Inst</source><volume>103</volume><fpage>1058</fpage><lpage>1068</lpage><year>2011</year><pub-id pub-id-type="doi">10.1093/jnci/djr173</pub-id><pub-id pub-id-type="pmid">21606442</pub-id><pub-id pub-id-type="pmcid">3131220</pub-id></element-citation></ref>
<ref id="b8-ijo-49-01-0361"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Young</surname><given-names>RP</given-names></name><name><surname>Hopkins</surname><given-names>RJ</given-names></name></person-group><article-title>Incorporating genomic data into multivariate risk models for lung cancer</article-title><source>Genet Med</source><volume>15</volume><fpage>667</fpage><lpage>668</lpage><year>2013</year><pub-id pub-id-type="doi">10.1038/gim.2013.89</pub-id><pub-id pub-id-type="pmid">23912891</pub-id></element-citation></ref>
<ref id="b9-ijo-49-01-0361"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>H</given-names></name><name><surname>Yang</surname><given-names>L</given-names></name><name><surname>Zhao</surname><given-names>X</given-names></name><name><surname>Wang</surname><given-names>J</given-names></name><name><surname>Qian</surname><given-names>J</given-names></name><name><surname>Chen</surname><given-names>H</given-names></name><name><surname>Fan</surname><given-names>W</given-names></name><name><surname>Liu</surname><given-names>H</given-names></name><name><surname>Jin</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>W</given-names></name><etal/></person-group><article-title>Prediction of lung cancer risk in a Chinese population using a multifactorial genetic model</article-title><source>BMC Med Genet</source><volume>13</volume><fpage>118</fpage><year>2012</year><pub-id pub-id-type="doi">10.1186/1471-2350-13-118</pub-id><pub-id pub-id-type="pmid">23228068</pub-id><pub-id pub-id-type="pmcid">3573944</pub-id></element-citation></ref>
<ref id="b10-ijo-49-01-0361"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Raji</surname><given-names>OY</given-names></name><name><surname>Agbaje</surname><given-names>OF</given-names></name><name><surname>Duffy</surname><given-names>SW</given-names></name><name><surname>Cassidy</surname><given-names>A</given-names></name><name><surname>Field</surname><given-names>JK</given-names></name></person-group><article-title>Incorporation of a genetic factor into an epidemiologic model for prediction of individual risk of lung cancer: The Liverpool Lung Project</article-title><source>Cancer Prev Res (Phila)</source><volume>3</volume><fpage>664</fpage><lpage>669</lpage><year>2010</year><pub-id pub-id-type="doi">10.1158/1940-6207.CAPR-09-0141</pub-id></element-citation></ref>
<ref id="b11-ijo-49-01-0361"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Spitz</surname><given-names>MR</given-names></name><name><surname>Etzel</surname><given-names>CJ</given-names></name><name><surname>Dong</surname><given-names>Q</given-names></name><name><surname>Amos</surname><given-names>CI</given-names></name><name><surname>Wei</surname><given-names>Q</given-names></name><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Hong</surname><given-names>WK</given-names></name></person-group><article-title>An expanded risk prediction model for lung cancer</article-title><source>Cancer Prev Res (Phila)</source><volume>1</volume><fpage>250</fpage><lpage>254</lpage><year>2008</year><pub-id pub-id-type="doi">10.1158/1940-6207.CAPR-08-0060</pub-id></element-citation></ref>
<ref id="b12-ijo-49-01-0361"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Beane</surname><given-names>J</given-names></name><name><surname>Sebastiani</surname><given-names>P</given-names></name><name><surname>Whitfield</surname><given-names>TH</given-names></name><name><surname>Steiling</surname><given-names>K</given-names></name><name><surname>Dumas</surname><given-names>YM</given-names></name><name><surname>Lenburg</surname><given-names>ME</given-names></name><name><surname>Spira</surname><given-names>A</given-names></name></person-group><article-title>A prediction model for lung cancer diagnosis that integrates genomic and clinical features</article-title><source>Cancer Prev Res (Phila)</source><volume>1</volume><fpage>56</fpage><lpage>64</lpage><year>2008</year><pub-id pub-id-type="doi">10.1158/1940-6207.CAPR-08-0011</pub-id></element-citation></ref>
<ref id="b13-ijo-49-01-0361"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Young</surname><given-names>RP</given-names></name><name><surname>Hopkins</surname><given-names>RJ</given-names></name><name><surname>Whittington</surname><given-names>CF</given-names></name><name><surname>Hay</surname><given-names>BA</given-names></name><name><surname>Epton</surname><given-names>MJ</given-names></name><name><surname>Gamble</surname><given-names>GD</given-names></name></person-group><article-title>Individual and cumulative effects of GWAS susceptibility loci in lung cancer: Associations after sub-phenotyping for COPD</article-title><source>PLoS One</source><volume>6</volume><fpage>e16476</fpage><year>2011</year><pub-id pub-id-type="doi">10.1371/journal.pone.0016476</pub-id><pub-id pub-id-type="pmid">21304900</pub-id><pub-id pub-id-type="pmcid">3033394</pub-id></element-citation></ref>
<ref id="b14-ijo-49-01-0361"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hung</surname><given-names>RJ</given-names></name><name><surname>McKay</surname><given-names>JD</given-names></name><name><surname>Gaborieau</surname><given-names>V</given-names></name><name><surname>Boffetta</surname><given-names>P</given-names></name><name><surname>Hashibe</surname><given-names>M</given-names></name><name><surname>Zaridze</surname><given-names>D</given-names></name><name><surname>Mukeria</surname><given-names>A</given-names></name><name><surname>Szeszenia-Dabrowska</surname><given-names>N</given-names></name><name><surname>Lissowska</surname><given-names>J</given-names></name><name><surname>Rudnai</surname><given-names>P</given-names></name><etal/></person-group><article-title>A susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor subunit genes on 15q25</article-title><source>Nature</source><volume>452</volume><fpage>633</fpage><lpage>637</lpage><year>2008</year><pub-id pub-id-type="doi">10.1038/nature06885</pub-id><pub-id pub-id-type="pmid">18385738</pub-id></element-citation></ref>
<ref id="b15-ijo-49-01-0361"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Truong</surname><given-names>T</given-names></name><name><surname>Hung</surname><given-names>RJ</given-names></name><name><surname>Amos</surname><given-names>CI</given-names></name><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Bickeb&#x000F6;ller</surname><given-names>H</given-names></name><name><surname>Rosenberger</surname><given-names>A</given-names></name><name><surname>Sauter</surname><given-names>W</given-names></name><name><surname>Illig</surname><given-names>T</given-names></name><name><surname>Wichmann</surname><given-names>HE</given-names></name><name><surname>Risch</surname><given-names>A</given-names></name><etal/></person-group><article-title>Replication of lung cancer susceptibility loci at chromosomes 15q25, 5p15, and 6p21: A pooled analysis from the International Lung Cancer Consortium</article-title><source>J Natl Cancer Inst</source><volume>102</volume><fpage>959</fpage><lpage>971</lpage><year>2010</year><pub-id pub-id-type="doi">10.1093/jnci/djq178</pub-id><pub-id pub-id-type="pmid">20548021</pub-id><pub-id pub-id-type="pmcid">2897877</pub-id></element-citation></ref>
<ref id="b16-ijo-49-01-0361"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hosgood</surname><given-names>HD</given-names><suffix>III</suffix></name><name><surname>Menashe</surname><given-names>I</given-names></name><name><surname>Shen</surname><given-names>M</given-names></name><name><surname>Yeager</surname><given-names>M</given-names></name><name><surname>Yuenger</surname><given-names>J</given-names></name><name><surname>Rajaraman</surname><given-names>P</given-names></name><name><surname>He</surname><given-names>X</given-names></name><name><surname>Chatterjee</surname><given-names>N</given-names></name><name><surname>Caporaso</surname><given-names>NE</given-names></name><name><surname>Zhu</surname><given-names>Y</given-names></name><etal/></person-group><article-title>Pathway-based evaluation of 380 candidate genes and lung cancer susceptibility suggests the importance of the cell cycle pathway</article-title><source>Carcinogenesis</source><volume>29</volume><fpage>1938</fpage><lpage>1943</lpage><year>2008</year><pub-id pub-id-type="doi">10.1093/carcin/bgn178</pub-id><pub-id pub-id-type="pmid">18676680</pub-id><pub-id pub-id-type="pmcid">2722857</pub-id></element-citation></ref>
<ref id="b17-ijo-49-01-0361"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>G</given-names></name><name><surname>Gramling</surname><given-names>S</given-names></name><name><surname>Munoz</surname><given-names>D</given-names></name><name><surname>Cheng</surname><given-names>D</given-names></name><name><surname>Azad</surname><given-names>AK</given-names></name><name><surname>Mirshams</surname><given-names>M</given-names></name><name><surname>Chen</surname><given-names>Z</given-names></name><name><surname>Xu</surname><given-names>W</given-names></name><name><surname>Roberts</surname><given-names>H</given-names></name><name><surname>Shepherd</surname><given-names>FA</given-names></name><etal/></person-group><article-title>Two novel BRM insertion promoter sequence variants are associated with loss of BRM expression and lung cancer risk</article-title><source>Oncogene</source><volume>30</volume><fpage>3295</fpage><lpage>3304</lpage><year>2011</year><pub-id pub-id-type="doi">10.1038/onc.2011.81</pub-id><pub-id pub-id-type="pmid">21478907</pub-id><pub-id pub-id-type="pmcid">3400140</pub-id></element-citation></ref>
<ref id="b18-ijo-49-01-0361"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gail</surname><given-names>MH</given-names></name></person-group><article-title>Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk</article-title><source>J Natl Cancer Inst</source><volume>100</volume><fpage>1037</fpage><lpage>1041</lpage><year>2008</year><pub-id pub-id-type="doi">10.1093/jnci/djn180</pub-id><pub-id pub-id-type="pmid">18612136</pub-id><pub-id pub-id-type="pmcid">2528005</pub-id></element-citation></ref>
<ref id="b19-ijo-49-01-0361"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Spitz</surname><given-names>MR</given-names></name><name><surname>Amos</surname><given-names>CI</given-names></name><name><surname>D'Amelio</surname><given-names>A</given-names><suffix>Jr</suffix></name><name><surname>Dong</surname><given-names>Q</given-names></name><name><surname>Etzel</surname><given-names>C</given-names></name></person-group><article-title>Re: Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk</article-title><source>J Natl Cancer Inst</source><volume>101</volume><fpage>1731</fpage><lpage>1732</lpage><year>2009</year><comment>author reply 1732</comment><pub-id pub-id-type="doi">10.1093/jnci/djp394</pub-id><pub-id pub-id-type="pmid">19903803</pub-id><pub-id pub-id-type="pmcid">2794300</pub-id></element-citation></ref>
<ref id="b20-ijo-49-01-0361"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname><given-names>Z</given-names></name><name><surname>Sun</surname><given-names>W</given-names></name><name><surname>Wang</surname><given-names>K</given-names></name><name><surname>Hakonarson</surname><given-names>H</given-names></name></person-group><article-title>Multiple testing in genome-wide association studies via hidden Markov models</article-title><source>Bioinformatics</source><volume>25</volume><fpage>2802</fpage><lpage>2808</lpage><year>2009</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btp476</pub-id><pub-id pub-id-type="pmid">19654115</pub-id></element-citation></ref>
<ref id="b21-ijo-49-01-0361"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moore</surname><given-names>JH</given-names></name><name><surname>Williams</surname><given-names>SM</given-names></name></person-group><article-title>Epistasis and its implications for personal genetics</article-title><source>Am J Hum Genet</source><volume>85</volume><fpage>309</fpage><lpage>320</lpage><year>2009</year><pub-id pub-id-type="doi">10.1016/j.ajhg.2009.08.006</pub-id><pub-id pub-id-type="pmid">19733727</pub-id><pub-id pub-id-type="pmcid">2771593</pub-id></element-citation></ref>
<ref id="b22-ijo-49-01-0361"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname><given-names>Q</given-names></name><name><surname>Hu</surname><given-names>T</given-names></name><name><surname>Moore</surname><given-names>JH</given-names></name></person-group><article-title>Epistasis, complexity, and multi-factor dimensionality reduction</article-title><source>Methods Mol Biol</source><volume>1019</volume><fpage>465</fpage><lpage>477</lpage><year>2013</year><pub-id pub-id-type="doi">10.1007/978-1-62703-447-0_22</pub-id></element-citation></ref>
<ref id="b23-ijo-49-01-0361"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hanley</surname><given-names>JA</given-names></name><name><surname>McNeil</surname><given-names>BJ</given-names></name></person-group><article-title>The meaning and use of the area under a receiver operating characteristic (ROC) curve</article-title><source>Radiology</source><volume>143</volume><fpage>29</fpage><lpage>36</lpage><year>1982</year><pub-id pub-id-type="doi">10.1148/radiology.143.1.7063747</pub-id><pub-id pub-id-type="pmid">7063747</pub-id></element-citation></ref>
<ref id="b24-ijo-49-01-0361"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pencina</surname><given-names>MJ</given-names></name><name><surname>D'Agostino</surname><given-names>RB</given-names><suffix>Sr</suffix></name><name><surname>D'Agostino</surname><given-names>RB</given-names><suffix>Jr</suffix></name><name><surname>Vasan</surname><given-names>RS</given-names></name></person-group><article-title>Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond</article-title><source>Stat Med</source><volume>27</volume><fpage>157</fpage><lpage>172</lpage><year>2008</year><pub-id pub-id-type="doi">10.1002/sim.2929</pub-id></element-citation></ref>
<ref id="b25-ijo-49-01-0361"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Leening</surname><given-names>MJ</given-names></name><name><surname>Vedder</surname><given-names>MM</given-names></name><name><surname>Witteman</surname><given-names>JC</given-names></name><name><surname>Pencina</surname><given-names>MJ</given-names></name><name><surname>Steyerberg</surname><given-names>EW</given-names></name></person-group><article-title>Net reclassification improvement: computation, interpretation, and controversies: a literature review and clinician's guide</article-title><source>Ann Intern Med</source><volume>160</volume><fpage>122</fpage><lpage>131</lpage><year>2014</year><pub-id pub-id-type="doi">10.7326/M13-1522</pub-id><pub-id pub-id-type="pmid">24592497</pub-id></element-citation></ref>
<ref id="b26-ijo-49-01-0361"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Young</surname><given-names>RP</given-names></name><name><surname>Hopkins</surname><given-names>RJ</given-names></name><name><surname>Hay</surname><given-names>BA</given-names></name><name><surname>Epton</surname><given-names>MJ</given-names></name><name><surname>Mills</surname><given-names>GD</given-names></name><name><surname>Black</surname><given-names>PN</given-names></name><name><surname>Gardner</surname><given-names>HD</given-names></name><name><surname>Sullivan</surname><given-names>R</given-names></name><name><surname>Gamble</surname><given-names>GD</given-names></name></person-group><article-title>Lung cancer susceptibility model based on age, family history and genetic variants</article-title><source>PLoS One</source><volume>4</volume><fpage>e5302</fpage><year>2009</year><pub-id pub-id-type="doi">10.1371/journal.pone.0005302</pub-id><pub-id pub-id-type="pmid">19390575</pub-id><pub-id pub-id-type="pmcid">2668761</pub-id></element-citation></ref>
<ref id="b27-ijo-49-01-0361"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Raji</surname><given-names>OY</given-names></name><name><surname>Duffy</surname><given-names>SW</given-names></name><name><surname>Agbaje</surname><given-names>OF</given-names></name><name><surname>Baker</surname><given-names>SG</given-names></name><name><surname>Christiani</surname><given-names>DC</given-names></name><name><surname>Cassidy</surname><given-names>A</given-names></name><name><surname>Field</surname><given-names>JK</given-names></name></person-group><article-title>Predictive accuracy of the Liverpool Lung Project risk model for stratifying patients for computed tomography screening for lung cancer: A case-control and cohort validation study</article-title><source>Ann Intern Med</source><volume>157</volume><fpage>242</fpage><lpage>250</lpage><year>2012</year><pub-id pub-id-type="doi">10.7326/0003-4819-157-4-201208210-00004</pub-id><pub-id pub-id-type="pmid">22910935</pub-id><pub-id pub-id-type="pmcid">3723683</pub-id></element-citation></ref>
<ref id="b28-ijo-49-01-0361"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hahn</surname><given-names>LW</given-names></name><name><surname>Ritchie</surname><given-names>MD</given-names></name><name><surname>Moore</surname><given-names>JH</given-names></name></person-group><article-title>Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions</article-title><source>Bioinformatics</source><volume>19</volume><fpage>376</fpage><lpage>382</lpage><year>2003</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btf869</pub-id><pub-id pub-id-type="pmid">12584123</pub-id></element-citation></ref>
<ref id="b29-ijo-49-01-0361"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moore</surname><given-names>JH</given-names></name><name><surname>Gilbert</surname><given-names>JC</given-names></name><name><surname>Tsai</surname><given-names>CT</given-names></name><name><surname>Chiang</surname><given-names>FT</given-names></name><name><surname>Holden</surname><given-names>T</given-names></name><name><surname>Barney</surname><given-names>N</given-names></name><name><surname>White</surname><given-names>BC</given-names></name></person-group><article-title>A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility</article-title><source>J Theor Biol</source><volume>241</volume><fpage>252</fpage><lpage>261</lpage><year>2006</year><pub-id pub-id-type="doi">10.1016/j.jtbi.2005.11.036</pub-id><pub-id pub-id-type="pmid">16457852</pub-id></element-citation></ref>
<ref id="b30-ijo-49-01-0361"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Motsinger</surname><given-names>AA</given-names></name><name><surname>Ritchie</surname><given-names>MD</given-names></name></person-group><article-title>The effect of reduction in cross-validation intervals on the performance of multifactor dimensionality reduction</article-title><source>Genet Epidemiol</source><volume>30</volume><fpage>546</fpage><lpage>555</lpage><year>2006</year><pub-id pub-id-type="doi">10.1002/gepi.20166</pub-id><pub-id pub-id-type="pmid">16800004</pub-id></element-citation></ref>
<ref id="b31-ijo-49-01-0361"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cordell</surname><given-names>HJ</given-names></name></person-group><article-title>Detecting gene-gene interactions that underlie human diseases</article-title><source>Nat Rev Genet</source><volume>10</volume><fpage>392</fpage><lpage>404</lpage><year>2009</year><pub-id pub-id-type="doi">10.1038/nrg2579</pub-id><pub-id pub-id-type="pmid">19434077</pub-id><pub-id pub-id-type="pmcid">2872761</pub-id></element-citation></ref>
<ref id="b32-ijo-49-01-0361"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>H</given-names></name><name><surname>Wang</surname><given-names>M</given-names></name><name><surname>Chen</surname><given-names>X</given-names></name></person-group><article-title>Willows: A memory efficient tree and forest construction package</article-title><source>BMC Bioinformatics</source><volume>10</volume><fpage>130</fpage><year>2009</year><pub-id pub-id-type="doi">10.1186/1471-2105-10-130</pub-id><pub-id pub-id-type="pmid">19416535</pub-id><pub-id pub-id-type="pmcid">2683818</pub-id></element-citation></ref>
<ref id="b33-ijo-49-01-0361"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bureau</surname><given-names>A</given-names></name><name><surname>Dupuis</surname><given-names>J</given-names></name><name><surname>Falls</surname><given-names>K</given-names></name><name><surname>Lunetta</surname><given-names>KL</given-names></name><name><surname>Hayward</surname><given-names>B</given-names></name><name><surname>Keith</surname><given-names>TP</given-names></name><name><surname>Van Eerdewegh</surname><given-names>P</given-names></name></person-group><article-title>Identifying SNPs predictive of phenotype using random forests</article-title><source>Genet Epidemiol</source><volume>28</volume><fpage>171</fpage><lpage>182</lpage><year>2005</year><pub-id pub-id-type="doi">10.1002/gepi.20041</pub-id></element-citation></ref>
<ref id="b34-ijo-49-01-0361"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>X</given-names></name><name><surname>Wang</surname><given-names>M</given-names></name><name><surname>Zhang</surname><given-names>H</given-names></name></person-group><article-title>The use of classification trees for bioinformatics</article-title><source>Wiley Interdiscip Rev Data Min Knowl Discov</source><volume>1</volume><fpage>55</fpage><lpage>63</lpage><year>2011</year><pub-id pub-id-type="doi">10.1002/widm.14</pub-id><pub-id pub-id-type="pmid">22523608</pub-id><pub-id pub-id-type="pmcid">3329156</pub-id></element-citation></ref>
<ref id="b35-ijo-49-01-0361"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>DeLong</surname><given-names>ER</given-names></name><name><surname>DeLong</surname><given-names>DM</given-names></name><name><surname>Clarke-Pearson</surname><given-names>DL</given-names></name></person-group><article-title>Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach</article-title><source>Biometrics</source><volume>44</volume><fpage>837</fpage><lpage>845</lpage><year>1988</year><pub-id pub-id-type="doi">10.2307/2531595</pub-id><pub-id pub-id-type="pmid">3203132</pub-id></element-citation></ref>
<ref id="b36-ijo-49-01-0361"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Steyerberg</surname><given-names>EW</given-names></name><name><surname>Harrell</surname><given-names>FE</given-names><suffix>Jr</suffix></name><name><surname>Borsboom</surname><given-names>GJ</given-names></name><name><surname>Eijkemans</surname><given-names>MJ</given-names></name><name><surname>Vergouwe</surname><given-names>Y</given-names></name><name><surname>Habbema</surname><given-names>JD</given-names></name></person-group><article-title>Internal validation of predictive models: Efficiency of some procedures for logistic regression analysis</article-title><source>J Clin Epidemiol</source><volume>54</volume><fpage>774</fpage><lpage>781</lpage><year>2001</year><pub-id pub-id-type="doi">10.1016/S0895-4356(01)00341-9</pub-id><pub-id pub-id-type="pmid">11470385</pub-id></element-citation></ref>
<ref id="b37-ijo-49-01-0361"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cook</surname><given-names>NR</given-names></name></person-group><article-title>Use and misuse of the receiver operating characteristic curve in risk prediction</article-title><source>Circulation</source><volume>115</volume><fpage>928</fpage><lpage>935</lpage><year>2007</year><pub-id pub-id-type="doi">10.1161/CIRCULATIONAHA.106.672402</pub-id><pub-id pub-id-type="pmid">17309939</pub-id></element-citation></ref>
<ref id="b38-ijo-49-01-0361"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname><given-names>CQ</given-names></name><name><surname>Popova</surname><given-names>SN</given-names></name><name><surname>Brown</surname><given-names>ER</given-names></name><name><surname>Barsyte-Lovejoy</surname><given-names>D</given-names></name><name><surname>Navab</surname><given-names>R</given-names></name><name><surname>Shih</surname><given-names>W</given-names></name><name><surname>Li</surname><given-names>M</given-names></name><name><surname>Lu</surname><given-names>M</given-names></name><name><surname>Jurisica</surname><given-names>I</given-names></name><name><surname>Penn</surname><given-names>LZ</given-names></name><etal/></person-group><article-title>Integrin alpha 11 regulates IGF2 expression in fibroblasts to enhance tumorigenicity of human non-small-cell lung cancer cells</article-title><source>Proc Natl Acad Sci USA</source><volume>104</volume><fpage>11754</fpage><lpage>11759</lpage><year>2007</year><pub-id pub-id-type="doi">10.1073/pnas.0703040104</pub-id><pub-id pub-id-type="pmid">17600088</pub-id><pub-id pub-id-type="pmcid">1913903</pub-id></element-citation></ref>
<ref id="b39-ijo-49-01-0361"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bhowmick</surname><given-names>NA</given-names></name><name><surname>Chytil</surname><given-names>A</given-names></name><name><surname>Plieth</surname><given-names>D</given-names></name><name><surname>Gorska</surname><given-names>AE</given-names></name><name><surname>Dumont</surname><given-names>N</given-names></name><name><surname>Shappell</surname><given-names>S</given-names></name><name><surname>Washington</surname><given-names>MK</given-names></name><name><surname>Neilson</surname><given-names>EG</given-names></name><name><surname>Moses</surname><given-names>HL</given-names></name></person-group><article-title>TGF-beta signaling in fibroblasts modulates the oncogenic potential of adjacent epithelia</article-title><source>Science</source><volume>303</volume><fpage>848</fpage><lpage>851</lpage><year>2004</year><pub-id pub-id-type="doi">10.1126/science.1090922</pub-id><pub-id pub-id-type="pmid">14764882</pub-id></element-citation></ref>
<ref id="b40-ijo-49-01-0361"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gleave</surname><given-names>M</given-names></name><name><surname>Hsieh</surname><given-names>JT</given-names></name><name><surname>Gao</surname><given-names>CA</given-names></name><name><surname>von Eschenbach</surname><given-names>AC</given-names></name><name><surname>Chung</surname><given-names>LW</given-names></name></person-group><article-title>Acceleration of human prostate cancer growth in vivo by factors produced by prostate and bone fibroblasts</article-title><source>Cancer Res</source><volume>51</volume><fpage>3753</fpage><lpage>3761</lpage><year>1991</year><pub-id pub-id-type="pmid">1712249</pub-id></element-citation></ref>
<ref id="b41-ijo-49-01-0361"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mueller</surname><given-names>MM</given-names></name><name><surname>Fusenig</surname><given-names>NE</given-names></name></person-group><article-title>Friends or foes - bipolar effects of the tumour stroma in cancer</article-title><source>Nat Rev Cancer</source><volume>4</volume><fpage>839</fpage><lpage>849</lpage><year>2004</year><pub-id pub-id-type="doi">10.1038/nrc1477</pub-id><pub-id pub-id-type="pmid">15516957</pub-id></element-citation></ref>
<ref id="b42-ijo-49-01-0361"><label>42</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>KK</given-names></name><name><surname>Liu</surname><given-names>N</given-names></name><name><surname>Radulovich</surname><given-names>N</given-names></name><name><surname>Wigle</surname><given-names>DA</given-names></name><name><surname>Johnston</surname><given-names>MR</given-names></name><name><surname>Shepherd</surname><given-names>FA</given-names></name><name><surname>Minden</surname><given-names>MD</given-names></name><name><surname>Tsao</surname><given-names>MS</given-names></name></person-group><article-title>Novel candidate tumor marker genes for lung adenocarcinoma</article-title><source>Oncogene</source><volume>21</volume><fpage>7598</fpage><lpage>7604</lpage><year>2002</year><pub-id pub-id-type="doi">10.1038/sj.onc.1205953</pub-id><pub-id pub-id-type="pmid">12386823</pub-id></element-citation></ref>
<ref id="b43-ijo-49-01-0361"><label>43</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dinarello</surname><given-names>CA</given-names></name></person-group><article-title>IL-18: A TH1-inducing, proinflammatory cytokine and new member of the IL-1 family</article-title><source>J Allergy Clin Immunol</source><volume>103</volume><fpage>11</fpage><lpage>24</lpage><year>1999</year><pub-id pub-id-type="doi">10.1016/S0091-6749(99)70518-X</pub-id><pub-id pub-id-type="pmid">9893178</pub-id></element-citation></ref>
<ref id="b44-ijo-49-01-0361"><label>44</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kojima</surname><given-names>H</given-names></name><name><surname>Aizawa</surname><given-names>Y</given-names></name><name><surname>Yanai</surname><given-names>Y</given-names></name><name><surname>Nagaoka</surname><given-names>K</given-names></name><name><surname>Takeuchi</surname><given-names>M</given-names></name><name><surname>Ohta</surname><given-names>T</given-names></name><name><surname>Ikegami</surname><given-names>H</given-names></name><name><surname>Ikeda</surname><given-names>M</given-names></name><name><surname>Kurimoto</surname><given-names>M</given-names></name></person-group><article-title>An essential role for NF-kappa B in IL-18-induced IFN-gamma expression in KG-1 cells</article-title><source>J Immunol</source><volume>162</volume><fpage>5063</fpage><lpage>5069</lpage><year>1999</year><pub-id pub-id-type="pmid">10227974</pub-id></element-citation></ref>
<ref id="b45-ijo-49-01-0361"><label>45</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rovina</surname><given-names>N</given-names></name><name><surname>Hillas</surname><given-names>G</given-names></name><name><surname>Dima</surname><given-names>E</given-names></name><name><surname>Vlastos</surname><given-names>F</given-names></name><name><surname>Loukides</surname><given-names>S</given-names></name><name><surname>Veldekis</surname><given-names>D</given-names></name><name><surname>Roussos</surname><given-names>C</given-names></name><name><surname>Alhanatis</surname><given-names>M</given-names></name><name><surname>Bakakos</surname><given-names>P</given-names></name></person-group><article-title>VEGF and IL-18 in induced sputum of lung cancer patients</article-title><source>Cytokine</source><volume>54</volume><fpage>277</fpage><lpage>281</lpage><year>2011</year><pub-id pub-id-type="doi">10.1016/j.cyto.2011.02.014</pub-id><pub-id pub-id-type="pmid">21398142</pub-id></element-citation></ref>
<ref id="b46-ijo-49-01-0361"><label>46</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Farjadfar</surname><given-names>A</given-names></name><name><surname>Mojtahedi</surname><given-names>Z</given-names></name><name><surname>Ghayumi</surname><given-names>MA</given-names></name><name><surname>Erfani</surname><given-names>N</given-names></name><name><surname>Haghshenas</surname><given-names>MR</given-names></name><name><surname>Ghaderi</surname><given-names>A</given-names></name></person-group><article-title>Interleukin-18 promoter polymorphism is associated with lung cancer: A case-control study</article-title><source>Acta Oncol</source><volume>48</volume><fpage>971</fpage><lpage>976</lpage><year>2009</year><pub-id pub-id-type="doi">10.1080/02841860902878145</pub-id><pub-id pub-id-type="pmid">19642044</pub-id></element-citation></ref>
<ref id="b47-ijo-49-01-0361"><label>47</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nakanishi</surname><given-names>K</given-names></name><name><surname>Yoshimoto</surname><given-names>T</given-names></name><name><surname>Tsutsui</surname><given-names>H</given-names></name><name><surname>Okamura</surname><given-names>H</given-names></name></person-group><article-title>Interleukin-18 is a unique cytokine that stimulates both Th1 and Th2 responses depending on its cytokine milieu</article-title><source>Cytokine Growth Factor Rev</source><volume>12</volume><fpage>53</fpage><lpage>72</lpage><year>2001</year><pub-id pub-id-type="doi">10.1016/S1359-6101(00)00015-0</pub-id><pub-id pub-id-type="pmid">11312119</pub-id></element-citation></ref>
<ref id="b48-ijo-49-01-0361"><label>48</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Okano</surname><given-names>F</given-names></name><name><surname>Yamada</surname><given-names>K</given-names></name></person-group><article-title>Canine interleukin-18 induces apoptosis and enhances Fas ligand mRNA expression in a canine carcinoma cell line</article-title><source>Anticancer Res</source><volume>20</volume><fpage>3411</fpage><lpage>3415</lpage><year>2000</year><pub-id pub-id-type="pmid">11131641</pub-id></element-citation></ref>
<ref id="b49-ijo-49-01-0361"><label>49</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hammerman</surname><given-names>PS</given-names></name><name><surname>Sos</surname><given-names>ML</given-names></name><name><surname>Ramos</surname><given-names>AH</given-names></name><name><surname>Xu</surname><given-names>C</given-names></name><name><surname>Dutt</surname><given-names>A</given-names></name><name><surname>Zhou</surname><given-names>W</given-names></name><name><surname>Brace</surname><given-names>LE</given-names></name><name><surname>Woods</surname><given-names>BA</given-names></name><name><surname>Lin</surname><given-names>W</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name><etal/></person-group><article-title>Mutations in the DDR2 kinase gene identify a novel therapeutic target in squamous cell lung cancer</article-title><source>Cancer Discov</source><volume>1</volume><fpage>78</fpage><lpage>89</lpage><year>2011</year><pub-id pub-id-type="doi">10.1158/2159-8274.CD-11-0005</pub-id></element-citation></ref>
<ref id="b50-ijo-49-01-0361"><label>50</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ford</surname><given-names>CE</given-names></name><name><surname>Lau</surname><given-names>SK</given-names></name><name><surname>Zhu</surname><given-names>CQ</given-names></name><name><surname>Andersson</surname><given-names>T</given-names></name><name><surname>Tsao</surname><given-names>MS</given-names></name><name><surname>Vogel</surname><given-names>WF</given-names></name></person-group><article-title>Expression and mutation analysis of the discoidin domain receptors 1 and 2 in non-small cell lung carcinoma</article-title><source>Br J Cancer</source><volume>96</volume><fpage>808</fpage><lpage>814</lpage><year>2007</year><pub-id pub-id-type="doi">10.1038/sj.bjc.6603614</pub-id><pub-id pub-id-type="pmid">17299390</pub-id><pub-id pub-id-type="pmcid">2360060</pub-id></element-citation></ref></ref-list></back>
<floats-group>
<fig id="f1-ijo-49-01-0361" position="float">
<label>Figure 1</label>
<caption>
<p>Performance of lung cancer risk model with and without the SNP epistatic effect.</p></caption>
<graphic xlink:href="IJO-49-01-0361-g00.gif"/></fig>
<table-wrap id="tI-ijo-49-01-0361" position="float">
<label>Table I</label>
<caption>
<p>Epidemiology, clinical and lifestyle characteristics of the subjects by case-control status.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left">Characteristics</th>
<th valign="bottom" align="center">Case (n=718)</th>
<th valign="bottom" align="center">Control (n=1667)</th>
<th valign="bottom" align="center">All subjects (n=2385)</th></tr></thead>
<tbody>
<tr>
<td colspan="4" valign="top" align="left">Age (yrs.)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&lt;60</td>
<td valign="top" align="right">162 (22.6)</td>
<td valign="top" align="center">457 (27.41)</td>
<td valign="top" align="center">619 (25.9)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;60&#x02013;70</td>
<td valign="top" align="right">264 (36.8)</td>
<td valign="top" align="center">647 (38.8)</td>
<td valign="top" align="center">911 (38.2)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;70+</td>
<td valign="top" align="right">292 (40.7)</td>
<td valign="top" align="center">563 (33.8)</td>
<td valign="top" align="center">855 (35.9</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Gender</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Male</td>
<td valign="top" align="right">414 (57.7)</td>
<td valign="top" align="center">969 (58.1)</td>
<td valign="top" align="center">1383 (58.0)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Female</td>
<td valign="top" align="right">304 (42.3)</td>
<td valign="top" align="center">698 (41.9)</td>
<td valign="top" align="center">1002 (42.0)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Smoking status<xref rid="tfn1-ijo-49-01-0361" ref-type="table-fn">a</xref></td></tr>
<tr>
<td valign="top" align="left">&#x02003;Never</td>
<td valign="top" align="right">43 (6.0)</td>
<td valign="top" align="center">575 (34.5)</td>
<td valign="top" align="center">618 (25.9)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Former</td>
<td valign="top" align="right">316 (44.0)</td>
<td valign="top" align="center">820 (49.2)</td>
<td valign="top" align="center">1136 (47.6)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Current</td>
<td valign="top" align="right">353 (49.2)</td>
<td valign="top" align="center">267 (16.0)</td>
<td valign="top" align="center">620 (26.0)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Smoking duration (yrs.)<xref rid="tfn1-ijo-49-01-0361" ref-type="table-fn">a</xref></td></tr>
<tr>
<td valign="top" align="left">&#x02003;Never</td>
<td valign="top" align="right">43 (6.0)</td>
<td valign="top" align="center">575 (34.5)</td>
<td valign="top" align="center">618 (25.9)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;1&#x02013;20</td>
<td valign="top" align="right">38 (5.3)</td>
<td valign="top" align="center">341 (20.5)</td>
<td valign="top" align="center">379 (15.9)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;21&#x02013;40</td>
<td valign="top" align="right">175 (24.4)</td>
<td valign="top" align="center">440 (26.4)</td>
<td valign="top" align="center">615 (25.8)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;41&#x02013;60</td>
<td valign="top" align="right">399 (55.6)</td>
<td valign="top" align="center">278 (16.7)</td>
<td valign="top" align="center">677 (28.4)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&gt;60</td>
<td valign="top" align="right">51 (7.1)</td>
<td valign="top" align="center">27 (1.6)</td>
<td valign="top" align="center">78 (3.3)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Previous pneumonia<xref rid="tfn1-ijo-49-01-0361" ref-type="table-fn">a</xref></td></tr>
<tr>
<td valign="top" align="left">&#x02003;Yes</td>
<td valign="top" align="right">105 (14.6)</td>
<td valign="top" align="center">243 (14.6)</td>
<td valign="top" align="center">348 (14.6)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;No</td>
<td valign="top" align="right">590 (82.2)</td>
<td valign="top" align="center">1420 (85.2)</td>
<td valign="top" align="center">2010 (84.3)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Previous malignant</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Yes</td>
<td valign="top" align="right">183 (26.3)</td>
<td valign="top" align="center">38 (2.3)</td>
<td valign="top" align="center">221 (9.4)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;No</td>
<td valign="top" align="right">512 (73.7)</td>
<td valign="top" align="center">1625 (97.7)</td>
<td valign="top" align="center">2136 (90.6)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Asbestos exposure<xref rid="tfn1-ijo-49-01-0361" ref-type="table-fn">a</xref></td></tr>
<tr>
<td valign="top" align="left">&#x02003;Yes</td>
<td valign="top" align="right">134 (18.7)</td>
<td valign="top" align="center">158 (9.5)</td>
<td valign="top" align="center">292 (12.2)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;No</td>
<td valign="top" align="right">395 (55.0)</td>
<td valign="top" align="center">1505 (90.3)</td>
<td valign="top" align="center">1900 (79.7)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Family lung CA</td></tr>
<tr>
<td valign="top" align="left">&#x02003;No history</td>
<td valign="top" align="right">566 (78.8)</td>
<td valign="top" align="center">1348 (80.9)</td>
<td valign="top" align="center">1914 (80.3)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Early onset</td>
<td valign="top" align="right">74 (10.3)</td>
<td valign="top" align="center">101 (6.1)</td>
<td valign="top" align="center">175 (7.3)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Late onset</td>
<td valign="top" align="right">78 (10.9)</td>
<td valign="top" align="center">218 (13.0)</td>
<td valign="top" align="center">296 (12.4)</td></tr>
<tr>
<td colspan="4" valign="top" align="left">Histology</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Squamous cell carcinoma</td>
<td valign="top" align="right">239 (33.3)</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center"/></tr>
<tr>
<td valign="top" align="left">&#x02003;Adenocarcinoma</td>
<td valign="top" align="right">228 (31.8)</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center"/></tr>
<tr>
<td valign="top" align="left">&#x02003;Small cell</td>
<td valign="top" align="right">87 (12.1)</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center"/></tr>
<tr>
<td valign="top" align="left">&#x02003;NSCLC</td>
<td valign="top" align="right">77 (10.7)</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center"/></tr>
<tr>
<td valign="top" align="left">&#x02003;Other</td>
<td valign="top" align="right">87 (12.1)</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center"/></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-ijo-49-01-0361">
<label>a</label>
<p>Numbers do not add up to total due to missing data; NSCLC, non-small cell lung cancer.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="tII-ijo-49-01-0361" position="float">
<label>Table II</label>
<caption>
<p>Univariable analysis of associations between 20 candidate SNPs and lung cancer (<xref rid="b33-ijo-49-01-0361" ref-type="bibr">33</xref>).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" rowspan="5" align="left">SNP</th>
<th valign="bottom" rowspan="5" align="center">Chromosome</th>
<th valign="bottom" rowspan="5" align="center">Gene</th>
<th colspan="5" valign="bottom" align="center">Genotype</th>
<th valign="bottom" rowspan="3" align="center">Additive model assumption</th></tr>
<tr>
<th colspan="5" valign="bottom" align="left">
<hr/></th></tr>
<tr>
<th valign="bottom" align="center">Wild<xref rid="tfn2-ijo-49-01-0361" ref-type="table-fn">a</xref></th>
<th colspan="2" valign="bottom" align="center">Heterozygote</th>
<th colspan="2" valign="bottom" align="center">Homozygote</th></tr>
<tr>
<th valign="bottom" align="left">
<hr/></th>
<th colspan="2" valign="bottom" align="left">
<hr/></th>
<th colspan="2" valign="bottom" align="left">
<hr/></th>
<th valign="bottom" align="left">
<hr/></th></tr>
<tr>
<th valign="bottom" align="center">ca/co (&#x00025;)</th>
<th valign="bottom" align="center">ca/co (&#x00025;)</th>
<th valign="bottom" align="center">OR (95&#x00025; CI)</th>
<th valign="bottom" align="center">ca/co (&#x00025;)</th>
<th valign="bottom" align="center">OR (95&#x00025; CI)</th>
<th valign="bottom" align="center">P-value<sub>trend</sub></th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left">rs2279115</td>
<td valign="top" align="left">18q21.3</td>
<td valign="top" align="left">Bcl-2</td>
<td valign="top" align="center">30.1/29.0</td>
<td valign="top" align="center">49.0/50.4</td>
<td valign="top" align="center">0.91 (0.75, 1.11)</td>
<td valign="top" align="center">20.1/20.6</td>
<td valign="top" align="center">0.91 (0.71, 1.17)</td>
<td valign="top" align="center">0.91</td></tr>
<tr>
<td valign="top" align="left">rs10115703</td>
<td valign="top" align="left">9p22.3</td>
<td valign="top" align="left">Cerb1</td>
<td valign="top" align="center">86.2/84.7</td>
<td valign="top" align="center">12.7/14.6</td>
<td valign="top" align="center">0.85 (0.66, 1.10)</td>
<td valign="top" align="center">1.1/0.7</td>
<td valign="top" align="center">1.66 (0.66, 4.15)</td>
<td valign="top" align="center">0.21</td></tr>
<tr>
<td valign="top" align="left">rs16969968</td>
<td valign="top" align="left">15q25.1</td>
<td valign="top" align="left">&#x003B1;5-nAChR</td>
<td valign="top" align="center">40.1/44.9</td>
<td valign="top" align="center">45.7/44.1</td>
<td valign="top" align="center">1.16 (0.96, 1.40)</td>
<td valign="top" align="center">14.2/10.9</td>
<td valign="top" align="center">1.46 (1.11, 1.93)</td>
<td valign="top" align="center">0.012</td></tr>
<tr>
<td valign="top" align="left">rs2031920</td>
<td valign="top" align="left">10q26.3</td>
<td valign="top" align="left">CYP2E1</td>
<td valign="top" align="center">94.7/94.7</td>
<td valign="top" align="center">5.2/5.2</td>
<td valign="top" align="center">0.99 (0.67, 1.47)</td>
<td valign="top" align="center">0.1/0.1</td>
<td valign="top" align="center">1.16 (0.11, 12.8)</td>
<td valign="top" align="center">0.71</td></tr>
<tr>
<td valign="top" align="left">rs6413429</td>
<td valign="top" align="left">5p15.33</td>
<td valign="top" align="left">DAT1</td>
<td valign="top" align="center">87.2/86.4</td>
<td valign="top" align="center">12.5/13.3</td>
<td valign="top" align="center">0.93 (0.72, 1.21)</td>
<td valign="top" align="center">0.3/0.3</td>
<td valign="top" align="center">0.92 (0.18, 4.76)</td>
<td valign="top" align="center">0.74</td></tr>
<tr>
<td valign="top" align="left">rs1799732</td>
<td valign="top" align="left">11q23.2</td>
<td valign="top" align="left">DRD2</td>
<td valign="top" align="center">79.5/79.4</td>
<td valign="top" align="center">13.0/7.4</td>
<td valign="top" align="center">1.74 (1.31, 2.32)</td>
<td valign="top" align="center">7.5/13.1</td>
<td valign="top" align="center">0.57 (0.42, 0.78)</td>
<td valign="top" align="center">0.30</td></tr>
<tr>
<td valign="top" align="left">rs13181</td>
<td valign="top" align="left">19q13.32</td>
<td valign="top" align="left">XPD(ERCC2)</td>
<td valign="top" align="center">38.6/39.9</td>
<td valign="top" align="center">43.3/47.3</td>
<td valign="top" align="center">0.95 (0.78, 1.15)</td>
<td valign="top" align="center">18.1/12.8</td>
<td valign="top" align="center">1.46 (1.13, 1.89)</td>
<td valign="top" align="center">0.10</td></tr>
<tr>
<td valign="top" align="left">rs763110</td>
<td valign="top" align="left">1q24.3</td>
<td valign="top" align="left">FasL</td>
<td valign="top" align="center">42.5/40.2</td>
<td valign="top" align="center">43.4/46.6</td>
<td valign="top" align="center">0.88 (0.73, 1.07)</td>
<td valign="top" align="center">14.1/13.2</td>
<td valign="top" align="center">1.00 (0.77, 1.32)</td>
<td valign="top" align="center">0.27</td></tr>
<tr>
<td valign="top" align="left">rs5744256</td>
<td valign="top" align="left">11q23.1</td>
<td valign="top" align="left">IL18</td>
<td valign="top" align="center">32.7/47.2</td>
<td valign="top" align="center">43.7/44.5</td>
<td valign="top" align="center">1.42 (1.16, 1.72)</td>
<td valign="top" align="center">23.6/8.3</td>
<td valign="top" align="center">4.07 (3.11, 5.31)</td>
<td valign="top" align="center">&lt;0.0001</td></tr>
<tr>
<td valign="top" align="left">rs16944</td>
<td valign="top" align="left">2q13</td>
<td valign="top" align="left">IL1B</td>
<td valign="top" align="center">42.9/46.1</td>
<td valign="top" align="center">44.7/43.2</td>
<td valign="top" align="center">1.11 (0.92, 1.34)</td>
<td valign="top" align="center">12.4/10.7</td>
<td valign="top" align="center">1.24 (0.93, 1.65)</td>
<td valign="top" align="center">0.24</td></tr>
<tr>
<td valign="top" align="left">rs4073</td>
<td valign="top" align="left">4q13.3</td>
<td valign="top" align="left">IL8</td>
<td valign="top" align="center">27.6/29.9</td>
<td valign="top" align="center">51.3/47.4</td>
<td valign="top" align="center">1.17 (0.96, 1.44)</td>
<td valign="top" align="center">21.7/22.7</td>
<td valign="top" align="center">1.01 (0.79, 1.30)</td>
<td valign="top" align="center">0.50</td></tr>
<tr>
<td valign="top" align="left">rs2306022</td>
<td valign="top" align="left">15q23</td>
<td valign="top" align="left">ITGA11</td>
<td valign="top" align="center">65.9/83.6</td>
<td valign="top" align="center">30.6/15.4</td>
<td valign="top" align="center">2.53 (2.06, 3.12)</td>
<td valign="top" align="center">3.5/1.0</td>
<td valign="top" align="center">4.09 (2.21, 7.56)</td>
<td valign="top" align="center">&lt;0.0001</td></tr>
<tr>
<td valign="top" align="left">rs2317676</td>
<td valign="top" align="left">17q21.32</td>
<td valign="top" align="left">ITGB3</td>
<td valign="top" align="center">87.9/87.5</td>
<td valign="top" align="center">11.6/12.2</td>
<td valign="top" align="center">0.95 (0.72, 1.24)</td>
<td valign="top" align="center">0.6/0.3</td>
<td valign="top" align="center">1.54 (0.43, 5.48)</td>
<td valign="top" align="center">0.88</td></tr>
<tr>
<td valign="top" align="left">rs1799930</td>
<td valign="top" align="left">8p22</td>
<td valign="top" align="left">NAT2</td>
<td valign="top" align="center">50.3/48.4</td>
<td valign="top" align="center">39.4/42.7</td>
<td valign="top" align="center">0.89 (0.74, 1.07)</td>
<td valign="top" align="center">10.3/8.9</td>
<td valign="top" align="center">1.12 (0.82, 1.52)</td>
<td valign="top" align="center">0.95</td></tr>
<tr>
<td valign="top" align="left">rs3087386</td>
<td valign="top" align="left">2q11.2</td>
<td valign="top" align="left">REV1</td>
<td valign="top" align="center">31.6/31.4</td>
<td valign="top" align="center">49.7/49.4</td>
<td valign="top" align="center">0.99 (0.82, 1.22)</td>
<td valign="top" align="center">18.7/19.3</td>
<td valign="top" align="center">0.96 (0.75, 1.24)</td>
<td valign="top" align="center">0.63</td></tr>
<tr>
<td valign="top" align="left">rs4934</td>
<td valign="top" align="left">14q32.13</td>
<td valign="top" align="left">SERPINA3</td>
<td valign="top" align="center">26.9/27.3</td>
<td valign="top" align="center">50.3/49.2</td>
<td valign="top" align="center">1.04 (0.84, 1.28)</td>
<td valign="top" align="center">22.8/23.5</td>
<td valign="top" align="center">0.99 (0.77, 1.26)</td>
<td valign="top" align="center">0.99</td></tr>
<tr>
<td valign="top" align="left">rs1799895</td>
<td valign="top" align="left">4p15.2</td>
<td valign="top" align="left">SOD3</td>
<td valign="top" align="center">96.7/97.2</td>
<td valign="top" align="center">3.3/2.7</td>
<td valign="top" align="center">1.25 (0.75, 2.06)</td>
<td valign="top" align="center">0.0/0.1</td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">0.44</td></tr>
<tr>
<td valign="top" align="left">rs5743836</td>
<td valign="top" align="left">3p21.2</td>
<td valign="top" align="left">TLR9</td>
<td valign="top" align="center">71.2/69.0</td>
<td valign="top" align="center">25.4/28.1</td>
<td valign="top" align="center">0.88 (0.72, 1.07)</td>
<td valign="top" align="center">3.5/2.9</td>
<td valign="top" align="center">1.15 (0.70, 1.88)</td>
<td valign="top" align="center">0.24</td></tr>
<tr>
<td valign="top" align="left">rs1139417</td>
<td valign="top" align="left">12p13.31</td>
<td valign="top" align="left">TNFR1</td>
<td valign="top" align="center">32.0/31.5</td>
<td valign="top" align="center">49.3/50.8</td>
<td valign="top" align="center">0.96 (0.78, 1.16)</td>
<td valign="top" align="center">18.7/17.8</td>
<td valign="top" align="center">1.03 (0.80, 1.34)</td>
<td valign="top" align="center">0.96</td></tr>
<tr>
<td valign="top" align="left">rs2273953</td>
<td valign="top" align="left">1p36.33</td>
<td valign="top" align="left">TP73</td>
<td valign="top" align="center">58.5/62.8</td>
<td valign="top" align="center">35.5/31.7</td>
<td valign="top" align="center">1.20 (0.99, 1.45)</td>
<td valign="top" align="center">6.0/5.5</td>
<td valign="top" align="center">1.17 (0.80, 1.70)</td>
<td valign="top" align="center">0.11</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn2-ijo-49-01-0361">
<label>a</label>
<p>Reference genotype; ca, cases; co, controls.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="tIII-ijo-49-01-0361" position="float">
<label>Table III</label>
<caption>
<p>Comparison of different Multi-locus SNP combinations using MDR.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left">Model of inheritance</th>
<th valign="bottom" align="center">No. of loci</th>
<th valign="bottom" align="center">Selected SNPs in selected best model</th>
<th valign="bottom" align="center">Cross Validation consistency (CV)</th>
<th valign="bottom" align="center">Balanced training accuracy</th>
<th valign="bottom" align="center">Balanced testing accuracy</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left">Additive effect</td>
<td valign="top" align="center">1</td>
<td valign="top" align="left">ITGA11_rs2306022</td>
<td valign="top" align="center">10/10</td>
<td valign="top" align="center">0.5886</td>
<td valign="top" align="center">0.5886</td></tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center">2</td>
<td valign="top" align="left">IL18_rs5744256<break/>ITGA11_rs2306022</td>
<td valign="top" align="center">10/10</td>
<td valign="top" align="center">0.6418</td>
<td valign="top" align="center">0.6418</td></tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center">3</td>
<td valign="top" align="left">DRD2_rs1799732<break/>IL18_rs5744256<break/>ITGA11_rs2306022</td>
<td valign="top" align="center">10/10</td>
<td valign="top" align="center">0.6575</td>
<td valign="top" align="center">0.6538</td></tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center">4</td>
<td valign="top" align="left">CHRNA3_A5_rs16969968<break/>DRD2_rs1799732<break/>IL18_rs5744256<break/>ITGA11_rs2306022</td>
<td valign="top" align="center">4/10</td>
<td valign="top" align="center">0.6652</td>
<td valign="top" align="center">0.6321</td></tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center">5</td>
<td valign="top" align="left">DRD2_rs1799732 FASL_rs763110<break/>IL18_rs5744256 IL8_rs4073<break/>ITGA11_rs2306022</td>
<td valign="top" align="center">6/10</td>
<td valign="top" align="center">0.6869</td>
<td valign="top" align="center">0.6178</td></tr></tbody></table></table-wrap>
<table-wrap id="tIV-ijo-49-01-0361" position="float">
<label>Table IV</label>
<caption>
<p>Importance score results in the random forest.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left">SNP</th>
<th valign="bottom" align="center">Gene name</th>
<th valign="bottom" align="center">Variable importance</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left">rs5744256<xref rid="tfn3-ijo-49-01-0361" ref-type="table-fn">a</xref></td>
<td valign="top" align="left">IL18</td>
<td valign="top" align="center">18.0783</td></tr>
<tr>
<td valign="top" align="left">rs2306022<xref rid="tfn3-ijo-49-01-0361" ref-type="table-fn">a</xref></td>
<td valign="top" align="left">ITGA11</td>
<td valign="top" align="center">14.2703</td></tr>
<tr>
<td valign="top" align="left">rs1799732<xref rid="tfn3-ijo-49-01-0361" ref-type="table-fn">a</xref></td>
<td valign="top" align="left">DRD2</td>
<td valign="top" align="center">4.4401</td></tr>
<tr>
<td valign="top" align="left">rs4934</td>
<td valign="top" align="left">SERPINA3</td>
<td valign="top" align="center">2.8533</td></tr>
<tr>
<td valign="top" align="left">rs13181</td>
<td valign="top" align="left">XPD(ERCC2)</td>
<td valign="top" align="center">2.7543</td></tr>
<tr>
<td valign="top" align="left">rs16969968</td>
<td valign="top" align="left">&#x003B1;5-nAChR</td>
<td valign="top" align="center">2.4906</td></tr>
<tr>
<td valign="top" align="left">rs16944</td>
<td valign="top" align="left">IL1B</td>
<td valign="top" align="center">2.1737</td></tr>
<tr>
<td valign="top" align="left">rs1139417</td>
<td valign="top" align="left">TNFR1</td>
<td valign="top" align="center">1.5054</td></tr>
<tr>
<td valign="top" align="left">rs2273953</td>
<td valign="top" align="left">TP73</td>
<td valign="top" align="center">1.4667</td></tr>
<tr>
<td valign="top" align="left">rs3087386</td>
<td valign="top" align="left">REV1</td>
<td valign="top" align="center">1.4185</td></tr>
<tr>
<td valign="top" align="left">rs1799930</td>
<td valign="top" align="left">NAT2</td>
<td valign="top" align="center">1.1701</td></tr>
<tr>
<td valign="top" align="left">rs10115703</td>
<td valign="top" align="left">Cerb1</td>
<td valign="top" align="center">0.9366</td></tr>
<tr>
<td valign="top" align="left">rs2279115</td>
<td valign="top" align="left">Bcl-2</td>
<td valign="top" align="center">0.8465</td></tr>
<tr>
<td valign="top" align="left">rs5743836</td>
<td valign="top" align="left">TLR9</td>
<td valign="top" align="center">0.7407</td></tr>
<tr>
<td valign="top" align="left">rs4073</td>
<td valign="top" align="left">IL8</td>
<td valign="top" align="center">0.6093</td></tr>
<tr>
<td valign="top" align="left">rs763110</td>
<td valign="top" align="left">FasL</td>
<td valign="top" align="center">0.4508</td></tr>
<tr>
<td valign="top" align="left">rs2317676</td>
<td valign="top" align="left">ITGB3</td>
<td valign="top" align="center">0.0477</td></tr>
<tr>
<td valign="top" align="left">rs2031920</td>
<td valign="top" align="left">CYP2E1</td>
<td valign="top" align="center">&#x02212;0.048</td></tr>
<tr>
<td valign="top" align="left">rs1799895</td>
<td valign="top" align="left">SOD3</td>
<td valign="top" align="center">&#x02212;0.1922</td></tr>
<tr>
<td valign="top" align="left">rs6413429</td>
<td valign="top" align="left">DAT1</td>
<td valign="top" align="center">&#x02212;0.3696</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn3-ijo-49-01-0361">
<label>a</label>
<p>Top 3 ranked SNPs using variable importance.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="tV-ijo-49-01-0361" position="float">
<label>Table V</label>
<caption>
<p>Reclassification of predicted risk for cases and controls using the epidemiological model and extended model with rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" rowspan="3" align="left">Epidemiological model</th>
<th colspan="4" valign="bottom" align="center">Extended model with rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11)</th>
<th valign="bottom" rowspan="3" align="center">Total</th></tr>
<tr>
<th colspan="4" valign="bottom" align="left">
<hr/></th></tr>
<tr>
<th valign="bottom" align="center">&lt;0.91&#x00025;</th>
<th valign="bottom" align="center">0.91 to 2.5&#x00025;</th>
<th valign="bottom" align="center">&gt;2.5 to 5.12&#x00025;</th>
<th valign="bottom" align="center">&gt;5.12&#x00025;</th></tr></thead>
<tbody>
<tr>
<td colspan="6" valign="top" align="left">Cases</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&lt;0.91&#x00025;</td>
<td valign="top" align="center">69 (57.5)</td>
<td valign="top" align="center">43 (35.8)</td>
<td valign="top" align="center">8 (6.7)</td>
<td valign="top" align="center">0 (0)</td>
<td valign="top" align="right">120</td></tr>
<tr>
<td valign="top" align="left">&#x02003;0.91 to 2.5&#x00025;</td>
<td valign="top" align="center">15 (12.4)</td>
<td valign="top" align="center">46 (38.0)</td>
<td valign="top" align="center">46 (38.0)</td>
<td valign="top" align="center">14 (11.6)</td>
<td valign="top" align="right">121</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&gt;2.5 to 5.12&#x00025;</td>
<td valign="top" align="center">0 (0)</td>
<td valign="top" align="center">43 (26.7)</td>
<td valign="top" align="center">49 (30.4)</td>
<td valign="top" align="center">69 (42.9)</td>
<td valign="top" align="right">161</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&gt;5.12</td>
<td valign="top" align="center">2 (0.6)</td>
<td valign="top" align="center">9 (2.8)</td>
<td valign="top" align="center">62 (19.1)</td>
<td valign="top" align="center">252 (77.5)</td>
<td valign="top" align="right">325</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Total</td>
<td valign="top" align="center">86</td>
<td valign="top" align="center">141</td>
<td valign="top" align="center">165</td>
<td valign="top" align="center">335</td>
<td valign="top" align="right">727</td></tr>
<tr>
<td colspan="6" valign="top" align="left">Controls</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&lt;0.91&#x00025;</td>
<td valign="top" align="center">726 (89.9)</td>
<td valign="top" align="center">77 (9.5)</td>
<td valign="top" align="center">4 (0.5)</td>
<td valign="top" align="center">1 (0.1)</td>
<td valign="top" align="right">808</td></tr>
<tr>
<td valign="top" align="left">&#x02003;0.91 to 2.5&#x00025;</td>
<td valign="top" align="center">180 (45.0)</td>
<td valign="top" align="center">147 (36.7)</td>
<td valign="top" align="center">58 (14.5)</td>
<td valign="top" align="center">15 (3.8)</td>
<td valign="top" align="right">400</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&gt;2.5 to 5.12&#x00025;</td>
<td valign="top" align="center">20 (8.8)</td>
<td valign="top" align="center">85 (37.4)</td>
<td valign="top" align="center">70 (30.8)</td>
<td valign="top" align="center">52 (22.9)</td>
<td valign="top" align="right">227</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&gt;5.12&#x00025;</td>
<td valign="top" align="center">3 (1.3)</td>
<td valign="top" align="center">29 (13.1)</td>
<td valign="top" align="center">68 (30.6)</td>
<td valign="top" align="center">122 (55.0)</td>
<td valign="top" align="right">222</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Total</td>
<td valign="top" align="center">929</td>
<td valign="top" align="center">338</td>
<td valign="top" align="center">200</td>
<td valign="top" align="center">190</td>
<td valign="top" align="right">1657</td></tr></tbody></table></table-wrap>
<table-wrap id="tVI-ijo-49-01-0361" position="float">
<label>Table VI</label>
<caption>
<p>Summary of multivariable risk model for the epidemiological model and the extended model with rs1799732 (DRD2), rs5744256 (IL-18) and rs2306022 (ITGA11).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left"/>
<th colspan="2" valign="bottom" align="center">Epidemiological model</th>
<th colspan="2" valign="bottom" align="center">Extended model with SNPs</th></tr>
<tr>
<th valign="bottom" align="left"/>
<th colspan="2" valign="bottom" align="left">
<hr/></th>
<th colspan="2" valign="bottom" align="left">
<hr/></th></tr>
<tr>
<th valign="bottom" align="left">Covariates</th>
<th valign="bottom" align="center">OR (95&#x00025;CI)</th>
<th valign="bottom" align="center">P-values</th>
<th valign="bottom" align="center">OR (95&#x00025;CI)</th>
<th valign="bottom" align="center">P-values</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left">Age</td>
<td valign="top" align="center">1.01 (0.99&#x02013;1.02)</td>
<td valign="top" align="right">0.312</td>
<td valign="top" align="center">1.00 (0.99&#x02013;1.02)</td>
<td valign="top" align="right">0.610</td></tr>
<tr>
<td valign="top" align="left">Gender</td>
<td valign="top" align="center">1.24 (0.95&#x02013;1.63)</td>
<td valign="top" align="right">0.107</td>
<td valign="top" align="center">1.14 (0.87&#x02013;1.52)</td>
<td valign="top" align="right">0.340</td></tr>
<tr>
<td colspan="5" valign="top" align="left">Smoking duration (years)</td></tr>
<tr>
<td valign="top" align="left">&#x02003;None</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="right"/>
<td valign="top" align="center">1.00</td>
<td valign="top" align="right"/></tr>
<tr>
<td valign="top" align="left">&#x02003;1&#x02013;19</td>
<td valign="top" align="center">1.41 (0.82&#x02013;2.42)</td>
<td valign="top" align="right">0.209</td>
<td valign="top" align="center">1.23 (0.69&#x02013;2.18)</td>
<td valign="top" align="right">0.476</td></tr>
<tr>
<td valign="top" align="left">&#x02003;20&#x02013;39</td>
<td valign="top" align="center">4.30 (2.81&#x02013;6.57)</td>
<td valign="top" align="right">&lt;0.001</td>
<td valign="top" align="center">4.90 (3.10&#x02013;7.73)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td valign="top" align="left">&#x02003;40&#x02013;59</td>
<td valign="top" align="center">11.12 (5.41&#x02013;22.86)</td>
<td valign="top" align="right">&lt;0.001</td>
<td valign="top" align="center">15.70 (7.22&#x02013;34.14)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td valign="top" align="left">&#x02003;&#x02265;60</td>
<td valign="top" align="center">13.91 (9.26&#x02013;20.91)</td>
<td valign="top" align="right">&lt;0.001</td>
<td valign="top" align="center">18.58 (11.90&#x02013;29.01)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td valign="top" align="left">Pneumonia</td>
<td valign="top" align="center">1.53 (1.12&#x02013;2.09)</td>
<td valign="top" align="right">0.007</td>
<td valign="top" align="center">1.55 (1.11&#x02013;2.15)</td>
<td valign="top" align="right">0.008</td></tr>
<tr>
<td valign="top" align="left">Asbestos</td>
<td valign="top" align="center">3.25 (2.34&#x02013;4.52)</td>
<td valign="top" align="right">&lt;0.001</td>
<td valign="top" align="center">3.10 (2.19&#x02013;4.39)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td valign="top" align="left">Previous tumour</td>
<td valign="top" align="center">16.97 (11.25&#x02013;25.61)</td>
<td valign="top" align="right">&lt;0.001</td>
<td valign="top" align="center">16.52 (10.79&#x02013;25.31)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td colspan="5" valign="top" align="left">Family history of lung cancer</td></tr>
<tr>
<td valign="top" align="left">&#x02003;None</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="right"/>
<td valign="top" align="center"/>
<td valign="top" align="right"/></tr>
<tr>
<td valign="top" align="left">&#x02003;Early onset (&lt;60 years)</td>
<td valign="top" align="center">1.33 (0.84&#x02013;2.09)</td>
<td valign="top" align="right">0.223</td>
<td valign="top" align="center">1.11 (0.69&#x02013;1.80)</td>
<td valign="top" align="right">0.659</td></tr>
<tr>
<td valign="top" align="left">&#x02003;Late onset (&#x02265;60 years)</td>
<td valign="top" align="center">1.07 (0.76&#x02013;1.54)</td>
<td valign="top" align="right">0.672</td>
<td valign="top" align="center">1.14 (0.78&#x02013;1.66)</td>
<td valign="top" align="right">0.495</td></tr>
<tr>
<td valign="top" align="left">rs1799732</td>
<td valign="top" align="center"/>
<td valign="top" align="right"/>
<td valign="top" align="center">0.78 (0.63&#x02013;0.97)</td>
<td valign="top" align="right">0.028</td></tr>
<tr>
<td valign="top" align="left">rs5744256</td>
<td valign="top" align="center"/>
<td valign="top" align="right"/>
<td valign="top" align="center">2.04 (1.69&#x02013;2.46)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td valign="top" align="left">Rs2306022</td>
<td valign="top" align="center"/>
<td valign="top" align="right"/>
<td valign="top" align="center">4.04 (3.10&#x02013;5.26)</td>
<td valign="top" align="right">&lt;0.001</td></tr>
<tr>
<td colspan="5" valign="top" align="left">Goodness of fit statistic</td></tr>
<tr>
<td valign="top" align="left">&#x02003;AIC</td>
<td valign="top" align="center">2098.42</td>
<td valign="top" align="right"/>
<td valign="top" align="center">1930.14</td>
<td valign="top" align="right"/></tr>
<tr>
<td valign="top" align="left">&#x02003;BIC</td>
<td valign="top" align="center">2167.75</td>
<td valign="top" align="right"/>
<td valign="top" align="center">2016.80</td>
<td valign="top" align="right"/></tr></tbody></table></table-wrap></floats-group></article>
