<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ETM</journal-id>
<journal-title-group>
<journal-title>Experimental and Therapeutic Medicine</journal-title>
</journal-title-group>
<issn pub-type="ppub">1792-0981</issn>
<issn pub-type="epub">1792-1015</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/etm.2019.7958</article-id>
<article-id pub-id-type="publisher-id">ETM-0-0-7958</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Smoking alters the evolutionary trajectory of non-small cell lung cancer</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Yu</surname><given-names>Xiao-Jun</given-names></name>
<xref rid="af1-etm-0-0-7958" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Chen</surname><given-names>Gang</given-names></name>
<xref rid="af2-etm-0-0-7958" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Yang</surname><given-names>Jun</given-names></name>
<xref rid="af2-etm-0-0-7958" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Yu</surname><given-names>Guo-Can</given-names></name>
<xref rid="af2-etm-0-0-7958" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Zhu</surname><given-names>Peng-Fei</given-names></name>
<xref rid="af2-etm-0-0-7958" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Jiang</surname><given-names>Zheng-Ke</given-names></name>
<xref rid="af3-etm-0-0-7958" ref-type="aff">3</xref></contrib>
<contrib contrib-type="author"><name><surname>Feng</surname><given-names>Kan</given-names></name>
<xref rid="af1-etm-0-0-7958" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Lu</surname><given-names>Yong</given-names></name>
<xref rid="af1-etm-0-0-7958" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Bao</surname><given-names>Bin</given-names></name>
<xref rid="af1-etm-0-0-7958" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Zhong</surname><given-names>Fang-Ming</given-names></name>
<xref rid="af2-etm-0-0-7958" ref-type="aff">2</xref>
<xref rid="c1-etm-0-0-7958" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-etm-0-0-7958"><label>1</label>Department of Thoracic Surgery, The First People&#x0027;s Hospital of Fuyang Hangzhou, Hangzhou, Zhejiang 311400, P.R. China</aff>
<aff id="af2-etm-0-0-7958"><label>2</label>Department of Thoracic Surgery, Hangzhou Red Cross Hospital, Hangzhou, Zhejiang 310003, P.R. China</aff>
<aff id="af3-etm-0-0-7958"><label>3</label>Department of Surgery, Hangzhou Fuyang Hospital of Traditional Chinese Medicine, Hangzhou, Zhejiang 311400, P.R. China</aff>
<author-notes>
<corresp id="c1-etm-0-0-7958"><italic>Correspondence to</italic>: Dr Fang-Ming Zhong, Department of Thoracic Surgery, Hangzhou Red Cross Hospital, 208 East Ring Road, Xiacheng, Hangzhou, Zhejiang 310003, P.R. China, E-mail: <email>fangming25740@163.com</email></corresp>
</author-notes>
<pub-date pub-type="ppub">
<month>11</month>
<year>2019</year></pub-date>
<pub-date pub-type="epub">
<day>29</day>
<month>08</month>
<year>2019</year></pub-date>
<volume>18</volume>
<issue>5</issue>
<fpage>3315</fpage>
<lpage>3324</lpage>
<history>
<date date-type="received"><day>20</day><month>07</month><year>2018</year></date>
<date date-type="accepted"><day>16</day><month>05</month><year>2019</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Yu et al.</copyright-statement>
<copyright-year>2019</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>Smoking is the biggest risk factor for lung cancer. Smokers have a much higher chance of developing lung tumors with a worse survival rate; however, non-smokers also develop lung tumors. A number of questions remain including the underlying difference between smoker and non-smoker lung cancer patients and the involvement of genetic and epigenetic processes in tumor development. The present study analyzed the mutation data of 100 non-small cell lung cancer (NSCLC) patients, 12 non-smokers, 48 ex-smokers and 40 smokers, from Tracking Non-Small Cell Lung Cancer Evolution through Therapy Consortium. A total of 68 genes exhibited different mutation patterns across non-smokers, ex-smokers and smokers. A number of these 68 genes encode membrane proteins with biological regulation, metabolic process, and response to stimulus functions. For each group of patients, the top 10 most frequently mutated genes were selected and their oncogenetic tree inferred, which reflected how the genes evolve during tumor genesis. By comparing the oncogenetic trees of non-smokers and smokers, it was identified that in non-smokers, the mutation of epidermal growth factor receptor (EGFR) was an early genetic alteration event and EGFR was the key driver, but in smokers, the mutation of titin (TTN) was more important. Based on network analysis, TTN can interact with spectrin &#x03B1; erythrocytic 1 through calmodulin 2 and troponin C1. These genetic differences during tumorigenesis of non-smoker and smoker lung cancer patients provided novel insights into the effects of smoking on the evolutionary trajectory of non-small cell lung cancer and may prove helpful for targeted therapy of different lung cancer subtypes.</p>
</abstract>
<kwd-group>
<kwd>smoking</kwd>
<kwd>evolutionary trajectories</kwd>
<kwd>non-small cell lung cancer</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Lung cancer patients make up ~14&#x0025; of newly diagnosed cancer cases and is the second most widespread cancer worldwide (<xref rid="b1-etm-0-0-7958" ref-type="bibr">1</xref>). Of those, ~85&#x0025; are non-small cell lung cancer (NSCLC) (<xref rid="b2-etm-0-0-7958" ref-type="bibr">2</xref>). Lung cancer not only has high incidence, but also high death rate. It is a huge healthcare and economic burden for both developing and developed countries.</p>
<p>There are many possible factors that may contribute to the genesis of lung cancer (<xref rid="b2-etm-0-0-7958" ref-type="bibr">2</xref>). Genetics can explain a large proportion of lung cancer occurrence as many single nucleotide polymorphisms have been discovered to be associated with lung cancer susceptibility by genome-wide association studies (<xref rid="b3-etm-0-0-7958" ref-type="bibr">3</xref>). Environment factors, such as air pollution (<xref rid="b4-etm-0-0-7958" ref-type="bibr">4</xref>), particulate matter 2.5 (<xref rid="b5-etm-0-0-7958" ref-type="bibr">5</xref>) and smoking, can facilitate the epigenetic dysfunctions which will interact with genetic changes and trigger tumorgenesis (<xref rid="b2-etm-0-0-7958" ref-type="bibr">2</xref>,<xref rid="b6-etm-0-0-7958" ref-type="bibr">6</xref>&#x2013;<xref rid="b9-etm-0-0-7958" ref-type="bibr">9</xref>). Cigarette smoke includes over 5,000 compounds (<xref rid="b10-etm-0-0-7958" ref-type="bibr">10</xref>), such as nicotine, free radicals, benzopyrene, catechols, polonium-210 and heavy metals (<xref rid="b11-etm-0-0-7958" ref-type="bibr">11</xref>). Many of these compounds are strong carcinogenic chemicals (<xref rid="b12-etm-0-0-7958" ref-type="bibr">12</xref>), which can interfere with DNA mismatch repair and cause somatic mutations. Cigarette smoking accounts for 87&#x0025; of lung cancer deaths (<xref rid="b13-etm-0-0-7958" ref-type="bibr">13</xref>) and is the leading risk factor.</p>
<p>Unfortunately, the genetic mechanisms of smoking leading to lung carcinogenesis are largely unknown and many observations are contradictory (<xref rid="b10-etm-0-0-7958" ref-type="bibr">10</xref>). For example, benzoapyrene, a carcinogenic chemical from smoke, can induce lung tumors in mice but not in rats (<xref rid="b14-etm-0-0-7958" ref-type="bibr">14</xref>). On the molecular level, several well-established signaling pathways, such as cyclooxygenase and its derived prostanoids, peroxisome proliferator-activated receptor &#x03B3; and arachidonate 15-lipoxygenase, epidermal growth factor receptor (EGFR) and the P13K/AKT/mTOR and vascular endothelial growth factor-dependent angiogenetic pathway, have been reported to have important roles (<xref rid="b10-etm-0-0-7958" ref-type="bibr">10</xref>). As a complex systems disease (<xref rid="b2-etm-0-0-7958" ref-type="bibr">2</xref>), lung cancer dysfunctions are dynamic and the evolution of smoking-induced lung cancer, i.e. the series of genetic events, can elucidate a more realistic picture of tumorigenesis. With the rapid development of next-generation sequencing, the somatic mutations in cancer patients can be more easily identified. Based on somatic mutation data, the evolutionary trajectories of cancer patients can be reconstructed. Caravagna <italic>et al</italic> (<xref rid="b15-etm-0-0-7958" ref-type="bibr">15</xref>) developed an algorithm called Pipeline for Cancer Inference (PiCnIc) to analyze the colon adenocarcinoma and rectum adenocarcinoma (COAD/READ) somatic mutation data from The Cancer Genome Atlas project. The underlying somatic evolution based on Suppes&#x0027; probabilistic causation was reconstructed (<xref rid="b16-etm-0-0-7958" ref-type="bibr">16</xref>) and it was determined that mutations in APC regulator of WNT signaling pathway, KRAS proto-oncogene, and tumor protein p53 were primary events for micro-satellite stable COAD/READ tumors, which was consistent with previous literature. Brown <italic>et al</italic> (<xref rid="b17-etm-0-0-7958" ref-type="bibr">17</xref>), performed phylogenetic analysis on whole-exome sequencing and copy number profiling data of primary and metastatic breast cancer samples and inferred the phylogeny of genomic alterations during breast cancer progression. The study utilized the Dollo parsimony method and the branch and bound exhaustive search algorithm described in Felsenstein (<xref rid="b18-etm-0-0-7958" ref-type="bibr">18</xref>), to reconstruct the phylogenetic tree.</p>
<p>To investigate the genomic alterations triggered by smoking, the present study analyzed the somatic mutations in 100 NSCLC patients. The different genomic alterations amongst non-smokers, ex-smokers and smokers were identified and the most frequent genetic alterations of each smoking subgroup were analyzed to construct oncogenetic trees, which revealed the evolutionary trajectories of smoking NSCLC. The present results provided novel insights into NSCLC development due to smoking and also identified potential intervention targets for treating NSCLC patients.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>NSCLC somatic mutation dataset</title>
<p>TRAcking Cancer Evolution through therapy (TRACERx) Consortium is a multi-million pound project funded by Cancer Research UK to better understand the genetic risks of lung cancer through exploring the human genome. The present study obtained the somatic mutation data and smoking status data of 100 NSCLC patients from Jamal-Hanjani <italic>et al</italic> (<xref rid="b19-etm-0-0-7958" ref-type="bibr">19</xref>). The clinical information of these 100 patients are provided in <xref rid="SD1-etm-0-0-7958" ref-type="supplementary-material">Table SI</xref>. The dataset consists of 12 people who never smoked in their life, 48 people who used to smoke but have quit smoking for &#x003E;20 years and 40 current smokers or recent ex-smokers. The somatic mutations were annotated to genes. If there were non-synonymous exonic alterations within a gene, this was considered as a mutated gene and it was allocated &#x2018;1&#x2019;; otherwise genes were classed as &#x2018;0&#x2019;. There were 11,345 genes that were mutated in at least 1 of the 100 NSCLC patients. An 11345&#x00D7;100 matrix was produced where rows denoted genes, the columns were patients and the binary value indicated whether the particular gene was mutated in this patient.</p>
<p>Unlike the TRACERx study by Jamal-Hanjani <italic>et al</italic> (<xref rid="b19-etm-0-0-7958" ref-type="bibr">19</xref>), which analyzed the intratumor heterogeneity by constructing phylogenetic trees for each patient, the present study was interested in characterizing the general mutation pattern within patient subtypes.</p>
</sec>
<sec>
<title>Identifying the mutated genes amongst different smoking status groups</title>
<p>To identify the various mutated genes amongst different smoking status groups, the Fisher&#x0027;s Exact Test (<xref rid="b20-etm-0-0-7958" ref-type="bibr">20</xref>) was applied for the confusion table of mutation status and smoking status. P&#x003C;0.05 was considered to indicate statistical significance.</p>
</sec>
<sec>
<title>Construction of the evolutionary trajectories for different smoking status groups</title>
<p>How the most frequently mutated genes evolved in different smoking status groups was analyzed using Oncotree (<xref rid="b21-etm-0-0-7958" ref-type="bibr">21</xref>,<xref rid="b22-etm-0-0-7958" ref-type="bibr">22</xref>), a widely used method for oncogenetic tree deduction (<xref rid="b23-etm-0-0-7958" ref-type="bibr">23</xref>).</p>
<p>In an oncogenetic tree model, the evolutionary trajectories of tumor genesis are simplified and the causality between genetic alteration events is assumed to occur sequentially. In addition, the causation of a genetic alteration event by another is independent of other causations.</p>
<p>The Oncotree method involves several steps. First, a set of the most relevant genetic events is selected. For the present study, the top 10 most frequent genetic alterations for each smoking status group were considered as relevant for the progression of the tumor group and therefore were selected to be modeled. Then, each pair of such genetic events was assigned a weight corresponding to the probabilities of joint or individual occurrence. Finally, based on the assigned weights, the optimal oncogenetic tree was inferred as maximum-weight branching (<xref rid="b21-etm-0-0-7958" ref-type="bibr">21</xref>,<xref rid="b22-etm-0-0-7958" ref-type="bibr">22</xref>).</p>
<p>The method was applied for the present study using R package Oncotree (<uri xlink:href="http://cran.r-project.org/web/packages/Oncotree/">http://cran.r-project.org/web/packages/Oncotree/</uri>).</p>
</sec>
<sec>
<title>Annotation of the biological function of the mutated genes</title>
<p>WebGestalt was used to annotate the biological function of the mutated genes (<xref rid="b24-etm-0-0-7958" ref-type="bibr">24</xref>). WebGestalt is a widely used online enrichment tool to model organisms including human, mouse, rat, yeast, fruit fly and <italic>Caenorhabditis elegans</italic>. It has many annotation databases integrated, including Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, DrugBank and Online Mendelian Inheritance in Man. The P-value of overrepresentation enrichment analysis was multiple test-adjusted as the false discovery rate (FDR). In the present study, the enriched categories with FDR&#x003C;0.2 were considered as significant.</p>
</sec>
</sec>
</sec>
<sec sec-type="result|discussion">
<title>Results and Discussion</title>
<sec>
<title/>
<sec>
<title>A total of 68 genes demonstrate different mutation patterns amongst smoking status groups</title>
<p>Fisher&#x0027;s exact test was used to identify the different mutated genes amongst the various smoking status groups. A total of 68 gene mutations were considered as significant to smoking status (P&#x003C;0.05; <xref rid="tI-etm-0-0-7958" ref-type="table">Table I</xref>). The OncoPrinter plots of these 68 genes in the three different smoking status groups, non-smoker, ex-smoker and smoker, are displayed in <xref rid="f1-etm-0-0-7958" ref-type="fig">Fig. 1</xref>. The genes were ranked based on the mutation frequency in all lung cancer patients. Zinc finger homeobox 4 (ZFHX4), usherin (USH2A), CUB and Sushi multiple domains 1 (CSMD1), CUB and Sushi multiple domains 2 (CSMD2), spectrin &#x03B1; erythrocytic 1 (SPTA1), pappalysin 2 (PAPPA2), dynein axonemal heavy chain 9 (DNAH9), contactin-associated protein like 5 (CNTNAP5), additional sex combs like 3 (ASXL3) were highly mutated in ex-smokers and smokers, but not in non-smokers. The mutation rate was associated the smoking status with the current smokers demonstrating the highest rate of mutated genes. There were several non-smoker specific mutations, such as lysine demethylase 8 (KDM8), zinc finger protein 677 (ZNF677), TEA domain transcription factor 1 (TEAD1) and phosphatidylinositol glycan anchor biosynthesis class M (PIGM). These non-smoker specific mutations suggested that tumorigenesis of lung cancer in non-smoker patients was different from the tumorigenesis of lung cancer in smoking patients.</p>
</sec>
<sec>
<title>Biological functions of the 68 gene mutations associated with smoking status</title>
<p>The 68 gene mutations associated with smoking status were annotated using Gene Ontology (GO) and the biological process (BP), cellular component (CC) and molecular function (MF) categories (<xref rid="f2-etm-0-0-7958" ref-type="fig">Fig. 2</xref>). Numerous genes were annotated to be membrane proteins with biological regulation, metabolic process, and response to stimulus functions (<xref rid="f2-etm-0-0-7958" ref-type="fig">Fig. 2</xref>). These results were expected since smoke is a xenobiotic stimulus to the human body and the chemicals can affect normal metabolic processes, and alter the biological regulations. Rigorous statistical test for the enrichment significance using WebGestalt was performed for deeper investigation into gene function (<xref rid="b24-etm-0-0-7958" ref-type="bibr">24</xref>) with significantly enriched BP (<xref rid="tII-etm-0-0-7958" ref-type="table">Table II</xref>), CC (<xref rid="tIII-etm-0-0-7958" ref-type="table">Table III</xref>) and MF (<xref rid="tIV-etm-0-0-7958" ref-type="table">Table IV</xref>) categories. It was demonstrated that the organ development, morphogenesis of an epithelial fold, muscle tissue morphogenesis and the muscle organ morphogenesis categories were enriched (<xref rid="tII-etm-0-0-7958" ref-type="table">Table II</xref>). These genes may serve an important role in tumor initiation and help transform the normal lung tissue to tumor tissue. Proteins associated with the plasma membrane were enriched (<xref rid="tIII-etm-0-0-7958" ref-type="table">Table III</xref>), which was consistent with the preliminary biological function analysis (<xref rid="f2-etm-0-0-7958" ref-type="fig">Fig. 2</xref>), and indicated that the mutated genes were involved in stimulus response. In addition, enrichment of proteins associated with muscle/fiber functions suggested that the mutated genes may change the lung muscle structure. Significant enrichment of multiple binding functions proved that the mutated genes were key players in signaling transduction and regulation (<xref rid="tIV-etm-0-0-7958" ref-type="table">Table IV</xref>), which may amplify the dysfunctions and accelerate tumorigenesis.</p>
</sec>
<sec>
<title>Evolutionary trajectories of non-smoker, ex-smoker and smoker lung cancer patients</title>
<p>Cancer is a complex multigene and multiprocess disease. The tumorigenesis of colorectal cancer is well studied (<xref rid="b25-etm-0-0-7958" ref-type="bibr">25</xref>,<xref rid="b26-etm-0-0-7958" ref-type="bibr">26</xref>) and can be used as a perfect example to explain the roles of mutations in causing pathway dysfunctions. The process includes several steps (<xref rid="b25-etm-0-0-7958" ref-type="bibr">25</xref>): i) Mutation of mismatch-repair (MMR) gene; ii) microsatellite instability (MSI) pathway dysfunction caused by MMR mutation; iii) normal epithelium becomes small adenoma; iv) chromosomal instability and mutations in KRAS and BRAF; v) serrated adenoma pathway dysfunction triggered by BRAF mutation; vi) small adenoma becomes large adenoma; and vii) mutations of PIK3CA, PTEN, tumor protein p53 (TP53), BAX, SMAD4 and transforming growth factor &#x03B2; receptor 2 accelerate the progression from large adenoma to cancer.</p>
<p>Similarly, lung cancer must also have several mutational events, which occur sequentially to initiate and accelerate tumorigenesis. Smoking is a major risk factor that can cause genetic and epigenetic changes that alter the tumorigenesis procedures. Research into this process will help explain the mechanism differences between smoker and non-smoker lung cancer patients.</p>
<p>The Oncotree method was used to produce oncogenetic trees of the top 10 most frequent mutated genes in non-smoker, ex-smoker and smoker lung cancer patients (<xref rid="f3-etm-0-0-7958" ref-type="fig">Fig. 3</xref>). For non-smokers, the early events were EGFR and titin (TTN) mutation. The late EGFR events were mutations of PIGM and zinc finger protein 677, while TTN was followed by mutations of TEAD1, olfactory receptor family 6 subfamily P member 1, catenin &#x03B2; 1, huntingtin interacting protein 1, protocadherin &#x03B3; subfamily A 8 and SUMO1/sentrin specific peptidase 7. For ex-smokers, TTN was also an early event but more early events were detected compared with non-smokers, including mutations of ryanodine receptor 2, ZFHX4 and CSMD1. For smokers, the results revealed the highest number of early events, including mutations of TTN, ryanodine receptor 2, USH2A, SPTA1 and CSMD1. Results demonstrated that smoking increased spontaneous mutations and formed more complex oncogenetic trees. For non-smokers, EGFR was the primary mutation whilst in ex-smokers and smokers, the importance of TTN was increased. Almost all smokers had the TTN mutation.</p>
</sec>
<sec>
<title>Oncogenetic differences between non-smoker, ex-smoker and smoker lung cancer patients</title>
<p>Based on the oncogenetic trees of non-smoker, ex-smoker and smoker lung cancer patients (<xref rid="f3-etm-0-0-7958" ref-type="fig">Fig. 3</xref>), the key driver gene of non-smoker lung cancer patients was EGFR, whilst the key driver gene of smoker lung cancer patients was TTN.</p>
<p>EGFR is a well-known oncogene that affects the PI3K and RAS pathway and accelerates cell growth and survival (<xref rid="b27-etm-0-0-7958" ref-type="bibr">27</xref>). EGFR is widely expressed in &#x003E;60&#x0025; of NSCLC patients and is a clinically relevant target of tyrosine kinase inhibitors (TKIs). EGFR mutations are more frequent in Asians, females, non-smokers and lung adenocarcinomas (<xref rid="b28-etm-0-0-7958" ref-type="bibr">28</xref>,<xref rid="b29-etm-0-0-7958" ref-type="bibr">29</xref>). The present findings determined that EGFR was the key driver gene of non-smoker lung cancer patients which was in agreement with the literature (<xref rid="b28-etm-0-0-7958" ref-type="bibr">28</xref>,<xref rid="b29-etm-0-0-7958" ref-type="bibr">29</xref>).</p>
<p>TTN encodes a protein of striated muscle and is the key component for striated muscle assembly and function. TTN mutation is very frequent in the majority of cancer types with the second highest mutation rate behind TP53 in The Cancer Genome Atlas dataset (<xref rid="b30-etm-0-0-7958" ref-type="bibr">30</xref>). In the present study, 65 patients had the TTN mutation and 35 patients did not. For the 65 patients with TTN mutation, there were 2 adenosquamous carcinoma, 2 carcinosarcoma, 31 invasive adenocarcinoma, 1 large cell carcinoma and 29 squamous cell carcinoma patients. For the 35 patients without TTN mutations, there were 1 adenosquamous carcinoma, 30 invasive adenocarcinoma, 1 large cell neuroendocrine and 3 squamous cell carcinoma patients. Although its mechanisms remain largely unknown, TTN has great potential for investigation due to its roles in tumorigenesis and progression (<xref rid="b30-etm-0-0-7958" ref-type="bibr">30</xref>). The present study determined that TTN may function through regulating DNAH9, USH2A, SPTA1 or CSMD2 based on the oncogenetic trees (<xref rid="f3-etm-0-0-7958" ref-type="fig">Fig. 3</xref>). Although the oncogenetic tree only demonstrated the process of genetic alteration occurrence, it provided hints of functional regulations; however, this needs to be further confirmed. To explore the possible regulation mechanisms of TTN, the protein functional association network STRING (<xref rid="b31-etm-0-0-7958" ref-type="bibr">31</xref>,<xref rid="b32-etm-0-0-7958" ref-type="bibr">32</xref>) was used with medium confidence (&#x003E;0.4). It was determined that TTN can interact with SPTA1 through calmodulin 2 (CALM2) and troponin C1 (TNNC1; <xref rid="f4-etm-0-0-7958" ref-type="fig">Fig. 4</xref>). The STRING confidence scores of each interaction (<xref rid="SD2-etm-0-0-7958" ref-type="supplementary-material">Table SII</xref>) were 0.722 for TTN and CALM2, 0.962 for SPTA1 and CALM2, 0.965 for TTN and TNNC1 and 0.537 for SPTA1 and TNNC1. These results provided insight into how TTN may function in lung cancer of smoking patients, or even other types of cancer.</p>
<p>There were limitations to the oncogenetic tree model. Firstly, the model was based on association rather than causality and the results could not be treated as actual biological regulations, therefore these should be further investigated with experimental methods. Secondly, the oncogenetic tree model cannot handle a large number of genes. The input genes should be carefully picked based on mutation frequency or biological literature with only the highly possible genes analyzed. It is not a general method that can be applied on a genome wide scale. Finally, the sample size should be large enough to capture the association so results generated on small datasets need to be interpreted with caution.</p>
<p>In conclusion, lung cancer is a complex multigene, multiprocess disease with complex genetic and environmental risk factors. Smoking is the biggest risk factor that can alter the genetics and epigenetics of lung tissue causing cancer. Smokers have a much greater chance of developing lung cancer. The present study compared the mutation patterns of non-smoker, ex-smoker and smoker lung cancer patients and identified 68 genes that were significantly differentially mutated amongst smoking status groups. Furthermore, oncogenetic trees were constructed of the top 10 most frequently mutated genes in each group and analyzed. It was identified that in non-smoker lung cancer patients, the key driver gene was EGFR, whilst in smoker lung cancer patients the key driver gene was TTN. The EGFR mutation finding in non-smokers is in line with previous literature. A potential mechanism for the high frequency mutated gene TTN in tumorigenesis was suggested. The present study provided novel insights into the effect of smoking on altering the evolutionary trajectory of lung cancer and its progression.</p>
</sec>
</sec>
</sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="SD1-etm-0-0-7958" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data1.xlsx"/>
</supplementary-material>
<supplementary-material id="SD2-etm-0-0-7958" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data2.xlsx"/>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding</title>
<p>No funding was received.</p>
</sec>
<sec>
<title>Availability of data and materials</title>
<p>The datasets generated and/or analyzed during the present study are available from the corresponding author on reasonable request.</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>FMZ designed the experiment and XJY performed the experiment. GC, JY, GCY and PFZ analyzed the data and performed data analysis. ZKJ, KF, YL and BB contributed to the study design. KF and YL wrote the article. ZKJ and BB revised the article. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="b1-etm-0-0-7958"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Siegel</surname><given-names>R</given-names></name><name><surname>Naishadham</surname><given-names>D</given-names></name><name><surname>Jemal</surname><given-names>A</given-names></name></person-group><article-title>Cancer statistics, 2012</article-title><source>CA Cancer J Clin</source><volume>62</volume><fpage>10</fpage><lpage>29</lpage><year>2012</year><pub-id pub-id-type="doi">10.3322/caac.20138</pub-id><pub-id pub-id-type="pmid">22237781</pub-id></element-citation></ref>
<ref id="b2-etm-0-0-7958"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Jiang</surname><given-names>M</given-names></name><name><surname>Kong</surname><given-names>X</given-names></name><name><surname>Cai</surname><given-names>YD</given-names></name></person-group><article-title>Dysfunctions associated with methylation, microRNA expression and gene expression in lung cancer</article-title><source>PLoS One</source><volume>7</volume><fpage>e43441</fpage><year>2012</year><pub-id pub-id-type="doi">10.1371/journal.pone.0043441</pub-id><pub-id pub-id-type="pmid">22912875</pub-id></element-citation></ref>
<ref id="b3-etm-0-0-7958"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Boss&#x00E9;</surname><given-names>Y</given-names></name><name><surname>Amos</surname><given-names>CI</given-names></name></person-group><article-title>A decade of GWAS results in lung cancer</article-title><source>Cancer Epidemiol Biomarkers Prev</source><volume>27</volume><fpage>363</fpage><lpage>379</lpage><year>2018</year><pub-id pub-id-type="doi">10.1158/1055-9965.EPI-16-0794</pub-id><pub-id pub-id-type="pmid">28615365</pub-id></element-citation></ref>
<ref id="b4-etm-0-0-7958"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname><given-names>CL</given-names></name><name><surname>He</surname><given-names>SW</given-names></name><name><surname>Zhang</surname><given-names>YD</given-names></name><name><surname>Duan</surname><given-names>HX</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Huang</surname><given-names>YC</given-names></name><name><surname>Li</surname><given-names>GF</given-names></name><name><surname>Wang</surname><given-names>P</given-names></name><name><surname>Ma</surname><given-names>LJ</given-names></name><name><surname>Zhou</surname><given-names>GB</given-names></name><name><surname>Cao</surname><given-names>Y</given-names></name></person-group><article-title>Air pollution and DNA methylation alterations in lung cancer: A systematic and comparative study</article-title><source>Oncotarget</source><volume>8</volume><fpage>1369</fpage><lpage>1391</lpage><year>2017</year><pub-id pub-id-type="pmid">27901495</pub-id></element-citation></ref>
<ref id="b5-etm-0-0-7958"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shu</surname><given-names>Y</given-names></name><name><surname>Zhu</surname><given-names>L</given-names></name><name><surname>Yuan</surname><given-names>F</given-names></name><name><surname>Kong</surname><given-names>X</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Cai</surname><given-names>YD</given-names></name></person-group><article-title>Analysis of the relationship between PM2.5 and lung cancer based on protein-protein interactions</article-title><source>Comb Chem High Throughput Screen</source><volume>19</volume><fpage>100</fpage><lpage>108</lpage><year>2016</year><pub-id pub-id-type="doi">10.2174/1386207319666151110123345</pub-id><pub-id pub-id-type="pmid">26552437</pub-id></element-citation></ref>
<ref id="b6-etm-0-0-7958"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>C</given-names></name><name><surname>Zhang</surname><given-names>YH</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Cai</surname><given-names>Y</given-names></name></person-group><article-title>Identification of transcription factors that may reprogram lung adenocarcinoma</article-title><source>Artif Intell Med</source><volume>83</volume><fpage>52</fpage><lpage>57</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.artmed.2017.03.010</pub-id><pub-id pub-id-type="pmid">28377053</pub-id></element-citation></ref>
<ref id="b7-etm-0-0-7958"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>BQ</given-names></name><name><surname>You</surname><given-names>J</given-names></name><name><surname>Chen</surname><given-names>L</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name><name><surname>Zhang</surname><given-names>N</given-names></name><name><surname>Li</surname><given-names>HP</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Kong</surname><given-names>XY</given-names></name><name><surname>Cai</surname><given-names>YD</given-names></name></person-group><article-title>Identification of lung-cancer-related genes with the shortest path approach in a protein-protein interaction network</article-title><source>Biomed Res Int</source><volume>2013</volume><fpage>267375</fpage><year>2013</year><pub-id pub-id-type="pmid">23762832</pub-id></element-citation></ref>
<ref id="b8-etm-0-0-7958"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>BQ</given-names></name><name><surname>You</surname><given-names>J</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Cai</surname><given-names>YD</given-names></name></person-group><article-title>Classification of non-small cell lung cancer based on copy number alterations</article-title><source>PLoS One</source><volume>9</volume><fpage>e88300</fpage><year>2014</year><pub-id pub-id-type="doi">10.1371/journal.pone.0088300</pub-id><pub-id pub-id-type="pmid">24505469</pub-id></element-citation></ref>
<ref id="b9-etm-0-0-7958"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Yang</surname><given-names>J</given-names></name><name><surname>Cai</surname><given-names>YD</given-names></name></person-group><article-title>Novel candidate key drivers in the integrative network of genes, microRNAs, methylations and copy number variations in squamous cell lung carcinoma</article-title><source>Biomed Res Int</source><volume>2015</volume><fpage>358125</fpage><year>2015</year><pub-id pub-id-type="pmid">25802847</pub-id></element-citation></ref>
<ref id="b10-etm-0-0-7958"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tonini</surname><given-names>G</given-names></name><name><surname>D&#x0027;Onofrio</surname><given-names>L</given-names></name><name><surname>Dell&#x0027;Aquila</surname><given-names>E</given-names></name><name><surname>Pezzuto</surname><given-names>A</given-names></name></person-group><article-title>New molecular insights in tobacco-induced lung cancer</article-title><source>Future Oncol</source><volume>9</volume><fpage>649</fpage><lpage>655</lpage><year>2013</year><pub-id pub-id-type="doi">10.2217/fon.13.32</pub-id><pub-id pub-id-type="pmid">23647294</pub-id></element-citation></ref>
<ref id="b11-etm-0-0-7958"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hecht</surname><given-names>SS</given-names></name></person-group><article-title>More than 500 trillion molecules of strong carcinogens per cigarette: Use in product labelling?</article-title><source>Tob Control</source><volume>20</volume><fpage>387</fpage><year>2011</year><pub-id pub-id-type="doi">10.1136/tc.2011.042853</pub-id><pub-id pub-id-type="pmid">21670075</pub-id></element-citation></ref>
<ref id="b12-etm-0-0-7958"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>L</given-names></name><name><surname>Chu</surname><given-names>C</given-names></name><name><surname>Lu</surname><given-names>J</given-names></name><name><surname>Kong</surname><given-names>X</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Cai</surname><given-names>YD</given-names></name></person-group><article-title>A computational method for the identification of new candidate carcinogenic and non-carcinogenic chemicals</article-title><source>Mol Biosyst</source><volume>11</volume><fpage>2541</fpage><lpage>2550</lpage><year>2015</year><pub-id pub-id-type="doi">10.1039/C5MB00276A</pub-id><pub-id pub-id-type="pmid">26194467</pub-id></element-citation></ref>
<ref id="b13-etm-0-0-7958"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zon</surname><given-names>RT</given-names></name><name><surname>Goss</surname><given-names>E</given-names></name><name><surname>Vogel</surname><given-names>VG</given-names></name><name><surname>Chlebowski</surname><given-names>RT</given-names></name><name><surname>Jatoi</surname><given-names>I</given-names></name><name><surname>Robson</surname><given-names>ME</given-names></name><name><surname>Wollins</surname><given-names>DS</given-names></name><name><surname>Garber</surname><given-names>JE</given-names></name><name><surname>Brown</surname><given-names>P</given-names></name><name><surname>Kramer</surname><given-names>BS</given-names></name><collab collab-type="corp-author">American Society of Clinical Oncology</collab></person-group><article-title>American society of clinical oncology policy statement: The role of the oncologist in cancer prevention and risk assessment</article-title><source>J Clin Oncol</source><volume>27</volume><fpage>986</fpage><lpage>993</lpage><year>2009</year><pub-id pub-id-type="doi">10.1200/JCO.2008.16.3691</pub-id><pub-id pub-id-type="pmid">19075281</pub-id></element-citation></ref>
<ref id="b14-etm-0-0-7958"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nesnow</surname><given-names>S</given-names></name><name><surname>Ross</surname><given-names>JA</given-names></name><name><surname>Stoner</surname><given-names>GD</given-names></name><name><surname>Mass</surname><given-names>MJ</given-names></name></person-group><article-title>Mechanistic linkage between DNA adducts, mutations in oncogenes and tumorigenesis of carcinogenic environmental polycyclic aromatic hydrocarbons in strain A/J mice</article-title><source>Toxicology</source><volume>105</volume><fpage>403</fpage><lpage>413</lpage><year>1995</year><pub-id pub-id-type="doi">10.1016/0300-483X(95)03238-B</pub-id><pub-id pub-id-type="pmid">8571376</pub-id></element-citation></ref>
<ref id="b15-etm-0-0-7958"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Caravagna</surname><given-names>G</given-names></name><name><surname>Graudenzi</surname><given-names>A</given-names></name><name><surname>Ramazzotti</surname><given-names>D</given-names></name><name><surname>Sanz-Pamplona</surname><given-names>R</given-names></name><name><surname>De Sano</surname><given-names>L</given-names></name><name><surname>Mauri</surname><given-names>G</given-names></name><name><surname>Moreno</surname><given-names>V</given-names></name><name><surname>Antoniotti</surname><given-names>M</given-names></name><name><surname>Mishra</surname><given-names>B</given-names></name></person-group><article-title>Algorithmic methods to infer the evolutionary trajectories in cancer progression</article-title><source>Proc Natl Acad Sci USA</source><volume>113</volume><fpage>E4025</fpage><lpage>E4034</lpage><year>2016</year><pub-id pub-id-type="doi">10.1073/pnas.1520213113</pub-id><pub-id pub-id-type="pmid">27357673</pub-id></element-citation></ref>
<ref id="b16-etm-0-0-7958"><label>16</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Suppes</surname><given-names>P</given-names></name></person-group><chapter-title>A probabilistic theory of causality</chapter-title><publisher-name>North-Holland Pub. Co.</publisher-name><publisher-loc>Amsterdam</publisher-loc><year>1970</year><pub-id pub-id-type="pmid">5418277</pub-id></element-citation></ref>
<ref id="b17-etm-0-0-7958"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname><given-names>D</given-names></name><name><surname>Smeets</surname><given-names>D</given-names></name><name><surname>Sz&#x00E9;kely</surname><given-names>B</given-names></name><name><surname>Larsimont</surname><given-names>D</given-names></name><name><surname>Sz&#x00E1;sz</surname><given-names>AM</given-names></name><name><surname>Adnet</surname><given-names>PY</given-names></name><name><surname>Roth&#x00E9;</surname><given-names>F</given-names></name><name><surname>Rouas</surname><given-names>G</given-names></name><name><surname>Nagy</surname><given-names>ZI</given-names></name><name><surname>Farag&#x00F3;</surname><given-names>Z</given-names></name><etal/></person-group><article-title>Phylogenetic analysis of metastatic progression in breast cancer using somatic mutations and copy number aberrations</article-title><source>Nat Commun</source><volume>8</volume><fpage>14944</fpage><year>2017</year><pub-id pub-id-type="doi">10.1038/ncomms15759</pub-id><pub-id pub-id-type="pmid">28429735</pub-id></element-citation></ref>
<ref id="b18-etm-0-0-7958"><label>18</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Rohlf</surname><given-names>FJ</given-names></name></person-group><chapter-title>J. Felsenstein J, Inferring Phylogenies</chapter-title><publisher-name>Sinauer Associates Inc.</publisher-name><publisher-loc>Sunderland, MA</publisher-loc><year>2004</year></element-citation></ref>
<ref id="b19-etm-0-0-7958"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jamal-Hanjani</surname><given-names>M</given-names></name><name><surname>Wilson</surname><given-names>GA</given-names></name><name><surname>McGranahan</surname><given-names>N</given-names></name><name><surname>Birkbak</surname><given-names>NJ</given-names></name><name><surname>Watkins</surname><given-names>TBK</given-names></name><name><surname>Veeriah</surname><given-names>S</given-names></name><name><surname>Shafi</surname><given-names>S</given-names></name><name><surname>Johnson</surname><given-names>DH</given-names></name><name><surname>Mitter</surname><given-names>R</given-names></name><name><surname>Rosenthal</surname><given-names>R</given-names></name><etal/></person-group><article-title>Tracking the evolution of non-small-cell lung cancer</article-title><source>N Engl J Med</source><volume>376</volume><fpage>2109</fpage><lpage>2121</lpage><year>2017</year><pub-id pub-id-type="doi">10.1056/NEJMoa1616288</pub-id><pub-id pub-id-type="pmid">28445112</pub-id></element-citation></ref>
<ref id="b20-etm-0-0-7958"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname><given-names>RA</given-names></name></person-group><article-title>The logic of inductive inference</article-title><source>J Royal Stat Soc</source><volume>98</volume><fpage>39</fpage><lpage>82</lpage><year>1935</year><pub-id pub-id-type="doi">10.2307/2342435</pub-id></element-citation></ref>
<ref id="b21-etm-0-0-7958"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Szabo</surname><given-names>A</given-names></name><name><surname>Boucher</surname><given-names>K</given-names></name></person-group><article-title>Estimating an oncogenetic tree when false negatives and positives are present</article-title><source>Math Biosci</source><volume>176</volume><fpage>219</fpage><lpage>236</lpage><year>2002</year><pub-id pub-id-type="doi">10.1016/S0025-5564(02)00086-X</pub-id><pub-id pub-id-type="pmid">11916510</pub-id></element-citation></ref>
<ref id="b22-etm-0-0-7958"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Desper</surname><given-names>R</given-names></name><name><surname>Jiang</surname><given-names>F</given-names></name><name><surname>Kallioniemi</surname><given-names>OP</given-names></name><name><surname>Moch</surname><given-names>H</given-names></name><name><surname>Papadimitriou</surname><given-names>CH</given-names></name><name><surname>Sch&#x00E4;ffer</surname><given-names>AA</given-names></name></person-group><article-title>Inferring tree models for oncogenesis from comparative genome hybridization data</article-title><source>J Comput Biol</source><volume>6</volume><fpage>37</fpage><lpage>51</lpage><year>1999</year><pub-id pub-id-type="doi">10.1089/cmb.1999.6.37</pub-id><pub-id pub-id-type="pmid">10223663</pub-id></element-citation></ref>
<ref id="b23-etm-0-0-7958"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>XC</given-names></name><name><surname>Liu</surname><given-names>C</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name><name><surname>Zhong</surname><given-names>Y</given-names></name></person-group><article-title>The occurrence of genetic alterations during the progression of breast carcinoma</article-title><source>Biomed Res Int</source><volume>2016</volume><fpage>5237827</fpage><year>2016</year><pub-id pub-id-type="pmid">27190992</pub-id></element-citation></ref>
<ref id="b24-etm-0-0-7958"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>B</given-names></name><name><surname>Kirov</surname><given-names>S</given-names></name><name><surname>Snoddy</surname><given-names>J</given-names></name></person-group><article-title>WebGestalt: An integrated system for exploring gene sets in various biological contexts</article-title><source>Nucleic Acids Res</source><volume>33</volume><issue>(Web Server Issue)</issue><fpage>W741</fpage><lpage>W748</lpage><year>2005</year><pub-id pub-id-type="doi">10.1093/nar/gki475</pub-id><pub-id pub-id-type="pmid">15980575</pub-id></element-citation></ref>
<ref id="b25-etm-0-0-7958"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Markowitz</surname><given-names>SD</given-names></name><name><surname>Bertagnolli</surname><given-names>MM</given-names></name></person-group><article-title>Molecular origins of cancer: Molecular basis of colorectal cancer</article-title><source>N Engl J Med</source><volume>361</volume><fpage>2449</fpage><lpage>2460</lpage><year>2009</year><pub-id pub-id-type="doi">10.1056/NEJMra0804588</pub-id><pub-id pub-id-type="pmid">20018966</pub-id></element-citation></ref>
<ref id="b26-etm-0-0-7958"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Calvert</surname><given-names>PM</given-names></name><name><surname>Frucht</surname><given-names>H</given-names></name></person-group><article-title>The genetics of colorectal cancer</article-title><source>Ann Intern Med</source><volume>137</volume><fpage>603</fpage><lpage>612</lpage><year>2002</year><pub-id pub-id-type="doi">10.7326/0003-4819-137-7-200210010-00012</pub-id><pub-id pub-id-type="pmid">12353948</pub-id></element-citation></ref>
<ref id="b27-etm-0-0-7958"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vogelstein</surname><given-names>B</given-names></name><name><surname>Papadopoulos</surname><given-names>N</given-names></name><name><surname>Velculescu</surname><given-names>VE</given-names></name><name><surname>Zhou</surname><given-names>S</given-names></name><name><surname>Diaz</surname><given-names>LA</given-names><suffix>Jr</suffix></name><name><surname>Kinzler</surname><given-names>KW</given-names></name></person-group><article-title>Cancer genome landscapes</article-title><source>Science</source><volume>339</volume><fpage>1546</fpage><lpage>1558</lpage><year>2013</year><pub-id pub-id-type="doi">10.1126/science.1235122</pub-id><pub-id pub-id-type="pmid">23539594</pub-id></element-citation></ref>
<ref id="b28-etm-0-0-7958"><label>28</label><element-citation publication-type="journal"><article-title>Proceedings from the 10th annual meeting of molecularly targeted therapy in non-small cell lung cancer</article-title><source>J Thorac Oncol</source><volume>5</volume><supplement>(12 Suppl 6)</supplement><fpage>S433</fpage><lpage>S496</lpage><year>2010</year><pub-id pub-id-type="doi">10.1097/JTO.0b013e3181fd6fe1</pub-id></element-citation></ref>
<ref id="b29-etm-0-0-7958"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tokumo</surname><given-names>M</given-names></name><name><surname>Toyooka</surname><given-names>S</given-names></name><name><surname>Kiura</surname><given-names>K</given-names></name><name><surname>Shigematsu</surname><given-names>H</given-names></name><name><surname>Tomii</surname><given-names>K</given-names></name><name><surname>Aoe</surname><given-names>M</given-names></name><name><surname>Ichimura</surname><given-names>K</given-names></name><name><surname>Tsuda</surname><given-names>T</given-names></name><name><surname>Yano</surname><given-names>M</given-names></name><name><surname>Tsukuda</surname><given-names>K</given-names></name><etal/></person-group><article-title>The relationship between epidermal growth factor receptor mutations and clinicopathologic features in non-small cell lung cancers</article-title><source>Clin Cancer Res</source><volume>11</volume><fpage>1167</fpage><lpage>1173</lpage><year>2005</year><pub-id pub-id-type="pmid">15709185</pub-id></element-citation></ref>
<ref id="b30-etm-0-0-7958"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>N</given-names></name><name><surname>Hong</surname><given-names>Y</given-names></name><name><surname>Kwon</surname><given-names>D</given-names></name><name><surname>Yoon</surname><given-names>S</given-names></name></person-group><article-title>Somatic mutaome profile in human cancer tissues</article-title><source>Genomics Inform</source><volume>11</volume><fpage>239</fpage><lpage>244</lpage><year>2013</year><pub-id pub-id-type="doi">10.5808/GI.2013.11.4.239</pub-id><pub-id pub-id-type="pmid">24465236</pub-id></element-citation></ref>
<ref id="b31-etm-0-0-7958"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Szklarczyk</surname><given-names>D</given-names></name><name><surname>Franceschini</surname><given-names>A</given-names></name><name><surname>Wyder</surname><given-names>S</given-names></name><name><surname>Forslund</surname><given-names>K</given-names></name><name><surname>Heller</surname><given-names>D</given-names></name><name><surname>Huerta-Cepas</surname><given-names>J</given-names></name><name><surname>Simonovic</surname><given-names>M</given-names></name><name><surname>Roth</surname><given-names>A</given-names></name><name><surname>Santos</surname><given-names>A</given-names></name><name><surname>Tsafou</surname><given-names>KP</given-names></name><etal/></person-group><article-title>STRING v10: Protein-protein interaction networks, integrated over the tree of life</article-title><source>Nucleic Acids Res</source><volume>43</volume><issue>(Database Issue)</issue><fpage>D447</fpage><lpage>D452</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/nar/gku1003</pub-id><pub-id pub-id-type="pmid">25352553</pub-id></element-citation></ref>
<ref id="b32-etm-0-0-7958"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Szklarczyk</surname><given-names>D</given-names></name><name><surname>Gable</surname><given-names>AL</given-names></name><name><surname>Lyon</surname><given-names>D</given-names></name><name><surname>Junge</surname><given-names>A</given-names></name><name><surname>Wyder</surname><given-names>S</given-names></name><name><surname>Huerta-Cepas</surname><given-names>J</given-names></name><name><surname>Simonovic</surname><given-names>M</given-names></name><name><surname>Doncheva</surname><given-names>NT</given-names></name><name><surname>Morris</surname><given-names>JH</given-names></name><name><surname>Bork</surname><given-names>P</given-names></name><etal/></person-group><article-title>STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets</article-title><source>Nucleic Acids Res</source><volume>47</volume><fpage>D607</fpage><lpage>D613</lpage><year>2019</year><pub-id pub-id-type="doi">10.1093/nar/gky1131</pub-id><pub-id pub-id-type="pmid">30476243</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-etm-0-0-7958" position="float">
<label>Figure 1.</label>
<caption><p>OncoPrinter plot of the 68 mutated genes in non-smokers, ex-smokers and smokers. The genes were ranked by the mutation frequency in all lung cancer patients. ZFHX4, USH2A, CSMD1, CSMD2, SPTA1, PAPPA2, DNAH9, CNTNAP5 and ASXL3 were highly mutated in ex-smokers and smokers but not in non-smokers. Smokers had the highest rate of mutated genes, with smoking status directly correlated with number of mutations. KDM8, ZNF677, TEAD1 and PIGM were non-smoker specific mutations. The different mutation patterns suggested the tumor genesis of non-smoker lung cancer patients was different from the tumor genesis of smoking lung cancer patients. ZFHX4, zinc finger homeobox 4; USH2A, usherin; CSMD1, CUB and Sushi multiple domains 1; CSMD2, CUB and Sushi multiple domains 2; SPTA1, spectrin &#x03B1; erythrocytic 1; PAPPA2, pappalysin 2; DNAH9, dynein axonemal heavy chain 9; CNTNAP5, contactin-associated protein like 5; ASXL3, additional sex combs like 3; KDM8, lysine demethylase 8; ZNF677, zinc finger protein 677; TEAD1, TEA domain transcription factor 1; PIGM, phosphatidylinositol glycan anchor biosynthesis class M.</p></caption>
<graphic xlink:href="etm-18-05-3315-g00.tif"/>
</fig>
<fig id="f2-etm-0-0-7958" position="float">
<label>Figure 2.</label>
<caption><p>GO analysis for biological process, cellular component and molecular function categories of the 68 mutated genes associated with smoking status. GO, Gene Ontology.</p></caption>
<graphic xlink:href="etm-18-05-3315-g01.tif"/>
</fig>
<fig id="f3-etm-0-0-7958" position="float">
<label>Figure 3.</label>
<caption><p>Oncogenetic trees of the top 10 most frequent mutated genes in non-smoker, ex-smoker and smoker lung cancer patients. (A) Oncogenetic tree for non-smoker lung cancer patients. (B) Oncogenetic tree for ex-smoker lung cancer patients. (C) Oncogenetic tree for smoker lung cancer patients.</p></caption>
<graphic xlink:href="etm-18-05-3315-g02.tif"/>
</fig>
<fig id="f4-etm-0-0-7958" position="float">
<label>Figure 4.</label>
<caption><p>STRING network of TTN and SPTA1. TTN can interact with SPTA1 through CALM2 and TNNC1. TTN, titin; SPTA1, spectrin &#x03B1; erythrocytic 1; CALM2, calmodulin 2; TNNC1, troponin C1; SPTB, spectrin &#x03B2; erythrocyctic; ANK1, ankyrin 1; DES, desmin; TNNI3, troponin I3; ACTN2, actinin &#x03B1; 2; NEB, nebulin; MYL2, myosin light chain 2; TCAP, titin-cap.</p></caption>
<graphic xlink:href="etm-18-05-3315-g03.tif"/>
</fig>
<table-wrap id="tI-etm-0-0-7958" position="float">
<label>Table I.</label>
<caption><p>A total of 68 genes that demonstrated different mutation patterns amongst non-smokers, ex-smokers and smokers.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Gene symbol</th>
<th align="center" valign="bottom">Gene name</th>
<th align="center" valign="bottom">NCBI gene ID</th>
<th align="center" valign="bottom">Fisher&#x0027;s exact test P-value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">EGFR</td>
<td align="left" valign="top">Epidermal growth factor receptor</td>
<td align="right" valign="top">1956</td>
<td align="center" valign="top">0.00052</td>
</tr>
<tr>
<td align="left" valign="top">TTN</td>
<td align="left" valign="top">Titin</td>
<td align="right" valign="top">7273</td>
<td align="center" valign="top">0.00071</td>
</tr>
<tr>
<td align="left" valign="top">ZFHX4</td>
<td align="left" valign="top">Zinc finger homeobox 4</td>
<td align="right" valign="top">79776</td>
<td align="center" valign="top">0.00433</td>
</tr>
<tr>
<td align="left" valign="top">USH2A</td>
<td align="left" valign="top">Usherin</td>
<td align="right" valign="top">7399</td>
<td align="center" valign="top">0.00549</td>
</tr>
<tr>
<td align="left" valign="top">SPTA1</td>
<td align="left" valign="top">Spectrin &#x03B1;, erythrocytic 1</td>
<td align="right" valign="top">6708</td>
<td align="center" valign="top">0.00753</td>
</tr>
<tr>
<td align="left" valign="top">TRPV6</td>
<td align="left" valign="top">Transient receptor potential cation channel subfamily V member 6</td>
<td align="right" valign="top">55503</td>
<td align="center" valign="top">0.00988</td>
</tr>
<tr>
<td align="left" valign="top">SEC16A</td>
<td align="left" valign="top">SEC16 homolog A, endoplasmic reticulum export factor</td>
<td align="right" valign="top">9919</td>
<td align="center" valign="top">0.00988</td>
</tr>
<tr>
<td align="left" valign="top">SCN1A</td>
<td align="left" valign="top">Sodium voltage-gated channel &#x03B1; subunit 1</td>
<td align="right" valign="top">6323</td>
<td align="center" valign="top">0.01216</td>
</tr>
<tr>
<td align="left" valign="top">ZNF677</td>
<td align="left" valign="top">Zinc finger protein 677</td>
<td align="right" valign="top">342926</td>
<td align="center" valign="top">0.01333</td>
</tr>
<tr>
<td align="left" valign="top">TEAD1</td>
<td align="left" valign="top">TEA domain transcription factor 1</td>
<td align="right" valign="top">7003</td>
<td align="center" valign="top">0.01333</td>
</tr>
<tr>
<td align="left" valign="top">PIGM</td>
<td align="left" valign="top">Phosphatidylinositol glycan anchor biosynthesis class M</td>
<td align="right" valign="top">93183</td>
<td align="center" valign="top">0.01333</td>
</tr>
<tr>
<td align="left" valign="top">EPG5</td>
<td align="left" valign="top">Ectopic P-granules autophagy protein 5 homolog</td>
<td align="right" valign="top">57724</td>
<td align="center" valign="top">0.01427</td>
</tr>
<tr>
<td align="left" valign="top">TENM3</td>
<td align="left" valign="top">Teneurin transmembrane protein 3</td>
<td align="right" valign="top">55714</td>
<td align="center" valign="top">0.01482</td>
</tr>
<tr>
<td align="left" valign="top">OR6P1</td>
<td align="left" valign="top">Olfactory receptor family 6 subfamily P member 1</td>
<td align="right" valign="top">128366</td>
<td align="center" valign="top">0.01494</td>
</tr>
<tr>
<td align="left" valign="top">PAPPA2</td>
<td align="left" valign="top">Pappalysin 2</td>
<td align="right" valign="top">60676</td>
<td align="center" valign="top">0.01743</td>
</tr>
<tr>
<td align="left" valign="top">ZNF783</td>
<td align="left" valign="top">Zinc finger family member 783</td>
<td align="right" valign="top">100289678</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">CTNNB1</td>
<td align="left" valign="top">Catenin &#x03B2; 1</td>
<td align="right" valign="top">1499</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">SPATA13</td>
<td align="left" valign="top">Spermatogenesis associated 13</td>
<td align="right" valign="top">221178</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">HIP1</td>
<td align="left" valign="top">Huntingtin interacting protein 1</td>
<td align="right" valign="top">3092</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">SENP7</td>
<td align="left" valign="top">SUMO1/sentrin specific peptidase 7</td>
<td align="right" valign="top">57337</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">PCDHGA8</td>
<td align="left" valign="top">Protocadherin &#x03B3; subfamily A, 8</td>
<td align="right" valign="top">9708</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">SNPH</td>
<td align="left" valign="top">Syntaphilin</td>
<td align="right" valign="top">9751</td>
<td align="center" valign="top">0.01769</td>
</tr>
<tr>
<td align="left" valign="top">ENPEP</td>
<td align="left" valign="top">Glutamyl aminopeptidase</td>
<td align="right" valign="top">2028</td>
<td align="center" valign="top">0.01819</td>
</tr>
<tr>
<td align="left" valign="top">KCNH2</td>
<td align="left" valign="top">Potassium voltage-gated channel subfamily H member 2</td>
<td align="right" valign="top">3757</td>
<td align="center" valign="top">0.01819</td>
</tr>
<tr>
<td align="left" valign="top">NLGN3</td>
<td align="left" valign="top">Neuroligin 3</td>
<td align="right" valign="top">54413</td>
<td align="center" valign="top">0.01819</td>
</tr>
<tr>
<td align="left" valign="top">MS4A14</td>
<td align="left" valign="top">Membrane spanning 4-domains A14</td>
<td align="right" valign="top">84689</td>
<td align="center" valign="top">0.01819</td>
</tr>
<tr>
<td align="left" valign="top">DEPDC5</td>
<td align="left" valign="top">DEP domain containing 5</td>
<td align="right" valign="top">9681</td>
<td align="center" valign="top">0.01819</td>
</tr>
<tr>
<td align="left" valign="top">SMARCA4</td>
<td align="left" valign="top">SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 4</td>
<td align="right" valign="top">6597</td>
<td align="center" valign="top">0.02044</td>
</tr>
<tr>
<td align="left" valign="top">LYST</td>
<td align="left" valign="top">Lysosomal trafficking regulator</td>
<td align="right" valign="top">1130</td>
<td align="center" valign="top">0.02157</td>
</tr>
<tr>
<td align="left" valign="top">CNTN4</td>
<td align="left" valign="top">Contactin 4</td>
<td align="right" valign="top">152330</td>
<td align="center" valign="top">0.02157</td>
</tr>
<tr>
<td align="left" valign="top">ZNF536</td>
<td align="left" valign="top">Zinc finger protein 536</td>
<td align="right" valign="top">9745</td>
<td align="center" valign="top">0.02420</td>
</tr>
<tr>
<td align="left" valign="top">CNTNAP5</td>
<td align="left" valign="top">Contactin associated protein like 5</td>
<td align="right" valign="top">129684</td>
<td align="center" valign="top">0.02459</td>
</tr>
<tr>
<td align="left" valign="top">ASXL3</td>
<td align="left" valign="top">Additional sex combs like 3, transcriptional regulator</td>
<td align="right" valign="top">80816</td>
<td align="center" valign="top">0.02459</td>
</tr>
<tr>
<td align="left" valign="top">DNAH9</td>
<td align="left" valign="top">Dynein axonemal heavy chain 9</td>
<td align="right" valign="top">1770</td>
<td align="center" valign="top">0.02568</td>
</tr>
<tr>
<td align="left" valign="top">CNGA2</td>
<td align="left" valign="top">Cyclic nucleotide gated channel &#x03B1; 2</td>
<td align="right" valign="top">1260</td>
<td align="center" valign="top">0.02841</td>
</tr>
<tr>
<td align="left" valign="top">KCNH5</td>
<td align="left" valign="top">Potassium voltage-gated channel subfamily H member 5</td>
<td align="right" valign="top">27133</td>
<td align="center" valign="top">0.02841</td>
</tr>
<tr>
<td align="left" valign="top">ZEB2</td>
<td align="left" valign="top">Zinc finger E-box binding homeobox 2</td>
<td align="right" valign="top">9839</td>
<td align="center" valign="top">0.02841</td>
</tr>
<tr>
<td align="left" valign="top">PHLPP2</td>
<td align="left" valign="top">PH domain and leucine rich repeat protein phosphatase 2</td>
<td align="right" valign="top">23035</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">GLI2</td>
<td align="left" valign="top">GLI family zinc finger 2</td>
<td align="right" valign="top">2736</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">GPR35</td>
<td align="left" valign="top">G protein-coupled receptor 35</td>
<td align="right" valign="top">2859</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">ATP13A5</td>
<td align="left" valign="top">Atpase 13A5</td>
<td align="right" valign="top">344905</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">MYF5</td>
<td align="left" valign="top">Myogenic factor 5</td>
<td align="right" valign="top">4617</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">PCDHGB7</td>
<td align="left" valign="top">Protocadherin &#x03B3; subfamily B, 7</td>
<td align="right" valign="top">56099</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">WBSCR17</td>
<td align="left" valign="top">Williams-Beuren syndrome chromosome region 17</td>
<td align="right" valign="top">64409</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">BAZ1B</td>
<td align="left" valign="top">Bromodomain adjacent to zinc finger domain 1B</td>
<td align="right" valign="top">9031</td>
<td align="center" valign="top">0.02918</td>
</tr>
<tr>
<td align="left" valign="top">COL6A5</td>
<td align="left" valign="top">Collagen type VI &#x03B1;5 chain</td>
<td align="right" valign="top">256076</td>
<td align="center" valign="top">0.03141</td>
</tr>
<tr>
<td align="left" valign="top">CSMD1</td>
<td align="left" valign="top">CUB and Sushi multiple domains 1</td>
<td align="right" valign="top">64478</td>
<td align="center" valign="top">0.03183</td>
</tr>
<tr>
<td align="left" valign="top">RYR2</td>
<td align="left" valign="top">Ryanodine receptor 2</td>
<td align="right" valign="top">6262</td>
<td align="center" valign="top">0.03217</td>
</tr>
<tr>
<td align="left" valign="top">TSHZ3</td>
<td align="left" valign="top">Teashirt zinc finger homeobox 3</td>
<td align="right" valign="top">57616</td>
<td align="center" valign="top">0.03459</td>
</tr>
<tr>
<td align="left" valign="top">KDM8</td>
<td align="left" valign="top">Lysine demethylase 8</td>
<td align="right" valign="top">79831</td>
<td align="center" valign="top">0.03728</td>
</tr>
<tr>
<td align="left" valign="top">NALCN</td>
<td align="left" valign="top">Sodium leak channel, non-selective</td>
<td align="right" valign="top">259232</td>
<td align="center" valign="top">0.03732</td>
</tr>
<tr>
<td align="left" valign="top">MALRD1</td>
<td align="left" valign="top">MAM and LDL receptor class A domain containing 1</td>
<td align="right" valign="top">340895</td>
<td align="center" valign="top">0.03732</td>
</tr>
<tr>
<td align="left" valign="top">DOCK10</td>
<td align="left" valign="top">Dedicator of cytokinesis 10</td>
<td align="right" valign="top">55619</td>
<td align="center" valign="top">0.03732</td>
</tr>
<tr>
<td align="left" valign="top">DNAH11</td>
<td align="left" valign="top">Dynein axonemal heavy chain 11</td>
<td align="right" valign="top">8701</td>
<td align="center" valign="top">0.03857</td>
</tr>
<tr>
<td align="left" valign="top">TAF1L</td>
<td align="left" valign="top">TATA-box binding protein associated factor 1 like</td>
<td align="right" valign="top">138474</td>
<td align="center" valign="top">0.04006</td>
</tr>
<tr>
<td align="left" valign="top">PRUNE2</td>
<td align="left" valign="top">Prune homolog 2</td>
<td align="right" valign="top">158471</td>
<td align="center" valign="top">0.04006</td>
</tr>
<tr>
<td align="left" valign="top">PLCH1</td>
<td align="left" valign="top">Phospholipase C eta 1</td>
<td align="right" valign="top">23007</td>
<td align="center" valign="top">0.04006</td>
</tr>
<tr>
<td align="left" valign="top">KIAA1549L</td>
<td align="left" valign="top">KIAA1549 like</td>
<td align="right" valign="top">25758</td>
<td align="center" valign="top">0.04006</td>
</tr>
<tr>
<td align="left" valign="top">RPTOR</td>
<td align="left" valign="top">Regulatory associated protein of MTOR complex 1</td>
<td align="right" valign="top">57521</td>
<td align="center" valign="top">0.04165</td>
</tr>
<tr>
<td align="left" valign="top">CSMD2</td>
<td align="left" valign="top">CUB and Sushi multiple domains 2</td>
<td align="right" valign="top">114784</td>
<td align="center" valign="top">0.04312</td>
</tr>
<tr>
<td align="left" valign="top">CDH23</td>
<td align="left" valign="top">Cadherin related 23</td>
<td align="right" valign="top">64072</td>
<td align="center" valign="top">0.04357</td>
</tr>
<tr>
<td align="left" valign="top">KIAA1324L</td>
<td align="left" valign="top">KIAA1324 like</td>
<td align="right" valign="top">222223</td>
<td align="center" valign="top">0.04374</td>
</tr>
<tr>
<td align="left" valign="top">NUP205</td>
<td align="left" valign="top">Nucleoporin 205</td>
<td align="right" valign="top">23165</td>
<td align="center" valign="top">0.04374</td>
</tr>
<tr>
<td align="left" valign="top">TBC1D4</td>
<td align="left" valign="top">TBC1 domain family member 4</td>
<td align="right" valign="top">9882</td>
<td align="center" valign="top">0.04374</td>
</tr>
<tr>
<td align="left" valign="top">FLNC</td>
<td align="left" valign="top">Filamin C</td>
<td align="right" valign="top">2318</td>
<td align="center" valign="top">0.04717</td>
</tr>
<tr>
<td align="left" valign="top">CHD7</td>
<td align="left" valign="top">Chromodomain helicase DNA binding protein 7</td>
<td align="right" valign="top">55636</td>
<td align="center" valign="top">0.04717</td>
</tr>
<tr>
<td align="left" valign="top">DNAH17</td>
<td align="left" valign="top">Dynein axonemal heavy chain 17</td>
<td align="right" valign="top">8632</td>
<td align="center" valign="top">0.04717</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-etm-0-0-7958"><p>NCBI, National Center for Biotechnology; ID, identification.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tII-etm-0-0-7958" position="float">
<label>Table II.</label>
<caption><p>Significantly enriched GO biological process categories of the 68 mutated genes associated with smoking status.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">GO ID</th>
<th align="center" valign="bottom">Description</th>
<th align="center" valign="bottom">P-value</th>
<th align="center" valign="bottom">FDR</th>
<th align="center" valign="bottom">Overlap genes</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">GO:0007423</td>
<td align="left" valign="top">Sensory organ development</td>
<td align="center" valign="top">4.48&#x00D7;10<sup>&#x2212;5</sup></td>
<td align="center" valign="top">0.1801</td>
<td align="left" valign="top">CTNNB1, EGFR, GLI2, MYF5, CHD7, TENM3, CDH23, SMARCA4, USH2A, ZEB2</td>
</tr>
<tr>
<td align="left" valign="top">GO:0098655</td>
<td align="left" valign="top">Cation transmembrane transport</td>
<td align="center" valign="top">0.0001085</td>
<td align="center" valign="top">0.1801</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, GPR35, ATP13A5, KCNH2, NLGN3, TRPV6, CHD7, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0034765</td>
<td align="left" valign="top">Regulation of ion transmembrane transport</td>
<td align="center" valign="top">0.0001252</td>
<td align="center" valign="top">0.1801</td>
<td align="left" valign="top">NALCN, KCNH5, GPR35, KCNH2, NLGN3, CHD7, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0042391</td>
<td align="left" valign="top">Regulation of membrane potential</td>
<td align="center" valign="top">0.0001286</td>
<td align="center" valign="top">0.1801</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, GPR35, KCNH2, NLGN3, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0034762</td>
<td align="left" valign="top">Regulation of transmembrane transport</td>
<td align="center" valign="top">0.0001394</td>
<td align="center" valign="top">0.1801</td>
<td align="left" valign="top">NALCN, KCNH5, GPR35, KCNH2, NLGN3, CHD7, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0006812</td>
<td align="left" valign="top">Cation transport</td>
<td align="center" valign="top">0.0001504</td>
<td align="center" valign="top">0.1801</td>
<td align="left" valign="top">CNGA2, CTNNB1, NALCN, KCNH5, GPR35, ATP13A5, KCNH2, NLGN3, TRPV6, CHD7, RYR2, SCN1A, CDH23</td>
</tr>
<tr>
<td align="left" valign="top">GO:0060571</td>
<td align="left" valign="top">Morphogenesis of an epithelial fold</td>
<td align="center" valign="top">0.0002124</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">CTNNB1, EGFR, GLI2</td>
</tr>
<tr>
<td align="left" valign="top">GO:0043010</td>
<td align="left" valign="top">Camera-type eye development</td>
<td align="center" valign="top">0.0002382</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">CTNNB1, EGFR, MYF5, CHD7, TENM3, SMARCA4, ZEB2</td>
</tr>
<tr>
<td align="left" valign="top">GO:0060415</td>
<td align="left" valign="top">Muscle tissue morphogenesis</td>
<td align="center" valign="top">0.0002664</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">MYF5, CHD7, RYR2, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0048644</td>
<td align="left" valign="top">Muscle organ morphogenesis</td>
<td align="center" valign="top">0.0002865</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">MYF5, CHD7, RYR2, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0001508</td>
<td align="left" valign="top">Action potential</td>
<td align="center" valign="top">0.0003213</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">NALCN, GPR35, KCNH2, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030001</td>
<td align="left" valign="top">Metal ion transport</td>
<td align="center" valign="top">0.0003543</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">CNGA2, CTNNB1, NALCN, KCNH5, GPR35, KCNH2, TRPV6, CHD7, RYR2, SCN1A, CDH23</td>
</tr>
<tr>
<td align="left" valign="top">GO:0043269</td>
<td align="left" valign="top">Regulation of ion transport</td>
<td align="center" valign="top">0.0003573</td>
<td align="center" valign="top">0.1974</td>
<td align="left" valign="top">CTNNB1, NALCN, KCNH5, GPR35, KCNH2, NLGN3, CHD7, RYR2, SCN1A</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-etm-0-0-7958"><p>GO, Gene Ontology; ID, identification; FDR, false discovery rate.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tIII-etm-0-0-7958" position="float">
<label>Table III.</label>
<caption><p>Significantly enriched GO cellular component categories of the 68 mutated genes associated with smoking status.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">GO ID</th>
<th align="center" valign="bottom">Description</th>
<th align="center" valign="bottom">P-value</th>
<th align="center" valign="bottom">FDR</th>
<th align="center" valign="bottom">Overlap genes</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">GO:0030018</td>
<td align="left" valign="top">Z disc</td>
<td align="center" valign="top">4.49&#x00D7;10<sup>&#x2212;5</sup></td>
<td align="center" valign="top">0.0294</td>
<td align="left" valign="top">CTNNB1, FLNC, RYR2, SCN1A, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0031674</td>
<td align="left" valign="top">I band</td>
<td align="center" valign="top">6.91E-05</td>
<td align="center" valign="top">0.0294</td>
<td align="left" valign="top">CTNNB1, FLNC, RYR2, SCN1A, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0044459</td>
<td align="left" valign="top">Plasma membrane part</td>
<td align="center" valign="top">0.000105</td>
<td align="center" valign="top">0.0297</td>
<td align="left" valign="top">CNGA2, CTNNB1, EGFR, ENPEP, SPATA13, PHLPP2, KCNH5, GPR35, HIP1, ATP13A5, KCNH2, NLGN3, TRPV6, TENM3, PCDHGB7, SCN1A, SPTA1, USH2A, PCDHGA8, SNPH</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030017</td>
<td align="left" valign="top">Sarcomere</td>
<td align="center" valign="top">0.00029</td>
<td align="center" valign="top">0.0572</td>
<td align="left" valign="top">CTNNB1, FLNC, RYR2, SCN1A, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0042995</td>
<td align="left" valign="top">Cell projection</td>
<td align="center" valign="top">0.000359</td>
<td align="center" valign="top">0.0572</td>
<td align="left" valign="top">CNGA2, CTNNB1, CNTN4, DNAH9, SPATA13, PHLPP2, GLI2, TENM3, RPTOR, TSHZ3, CDH23, SPTA1, USH2A, DNAH11, SNPH</td>
</tr>
<tr>
<td align="left" valign="top">GO:0044449</td>
<td align="left" valign="top">Contractile fiber part</td>
<td align="center" valign="top">0.000456</td>
<td align="center" valign="top">0.0572</td>
<td align="left" valign="top">CTNNB1, FLNC, RYR2, SCN1A, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030016</td>
<td align="left" valign="top">Myofibril</td>
<td align="center" valign="top">0.000471</td>
<td align="center" valign="top">0.0572</td>
<td align="left" valign="top">CTNNB1, FLNC, RYR2, SCN1A, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0043292</td>
<td align="left" valign="top">Contractile fiber</td>
<td align="center" valign="top">0.000588</td>
<td align="center" valign="top">0.0625</td>
<td align="left" valign="top">CTNNB1, FLNC, RYR2, SCN1A, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030122</td>
<td align="left" valign="top">AP-2 adaptor complex</td>
<td align="center" valign="top">0.000942</td>
<td align="center" valign="top">0.0801</td>
<td align="left" valign="top">EGFR, HIP1</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030128</td>
<td align="left" valign="top">Clathrin coat of endocytic vesicle</td>
<td align="center" valign="top">0.000942</td>
<td align="center" valign="top">0.0801</td>
<td align="left" valign="top">EGFR, HIP1</td>
</tr>
<tr>
<td align="left" valign="top">GO:0098590</td>
<td align="left" valign="top">Plasma membrane region</td>
<td align="center" valign="top">0.001172</td>
<td align="center" valign="top">0.0906</td>
<td align="left" valign="top">CNGA2, CTNNB1, EGFR, ENPEP, SPATA13, PHLPP2, HIP1, NLGN3, USH2A, SNPH</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030132</td>
<td align="left" valign="top">Clathrin coat of coated pit</td>
<td align="center" valign="top">0.001618</td>
<td align="center" valign="top">0.1146</td>
<td align="left" valign="top">EGFR, HIP1</td>
</tr>
<tr>
<td align="left" valign="top">GO:0097458</td>
<td align="left" valign="top">Neuron part</td>
<td align="center" valign="top">0.002216</td>
<td align="center" valign="top">0.1449</td>
<td align="left" valign="top">CNTN4, PHLPP2, HIP1, TENM3, RPTOR, TSHZ3, CDH23, SMARCA4, SPTA1, USH2A, SNPH</td>
</tr>
<tr>
<td align="left" valign="top">GO:0090575</td>
<td align="left" valign="top">RNA polymerase II transcription factor complex</td>
<td align="center" valign="top">0.002476</td>
<td align="center" valign="top">0.1478</td>
<td align="left" valign="top">TAF1L, CTNNB1, MYF5</td>
</tr>
<tr>
<td align="left" valign="top">GO:0005929</td>
<td align="left" valign="top">Cilium</td>
<td align="center" valign="top">0.003008</td>
<td align="center" valign="top">0.1478</td>
<td align="left" valign="top">CNGA2, DNAH9, PHLPP2, GLI2, USH2A, DNAH11</td>
</tr>
<tr>
<td align="left" valign="top">GO:0043234</td>
<td align="left" valign="top">Protein complex</td>
<td align="center" valign="top">0.003037</td>
<td align="center" valign="top">0.1478</td>
<td align="left" valign="top">TAF1L, CTNNB1, DNAH9, EGFR, NUP205, COL6A5, HIP1, MYF5, RPTOR, RYR2, SMARCA4, TEAD1, TTN, USH2A, DNAH11, DEPDC5</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030125</td>
<td align="left" valign="top">Clathrin vesicle coat</td>
<td align="center" valign="top">0.003127</td>
<td align="center" valign="top">0.1478</td>
<td align="left" valign="top">EGFR, HIP1</td>
</tr>
<tr>
<td align="left" valign="top">GO:0031226</td>
<td align="left" valign="top">Intrinsic component of plasma membrane</td>
<td align="center" valign="top">0.003156</td>
<td align="center" valign="top">0.1478</td>
<td align="left" valign="top">CNGA2, ENPEP, KCNH5, GPR35, ATP13A5, KCNH2, NLGN3, TRPV6, TENM3, PCDHGB7, SCN1A, SPTA1, PCDHGA8</td>
</tr>
<tr>
<td align="left" valign="top">GO:0031253</td>
<td align="left" valign="top">Cell projection membrane</td>
<td align="center" valign="top">0.003305</td>
<td align="center" valign="top">0.1478</td>
<td align="left" valign="top">CNGA2, CTNNB1, SPATA13, PHLPP2, USH2A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0098858</td>
<td align="left" valign="top">Actin-based cell projection</td>
<td align="center" valign="top">0.003565</td>
<td align="center" valign="top">0.1515</td>
<td align="left" valign="top">CTNNB1, SPATA13, CDH23, USH2A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0030131</td>
<td align="left" valign="top">Clathrin adaptor complex</td>
<td align="center" valign="top">0.004254</td>
<td align="center" valign="top">0.1718</td>
<td align="left" valign="top">EGFR, HIP1</td>
</tr>
<tr>
<td align="left" valign="top">GO:0031090</td>
<td align="left" valign="top">Organelle membrane</td>
<td align="center" valign="top">0.004448</td>
<td align="center" valign="top">0.1718</td>
<td align="left" valign="top">CNGA2, EGFR, ENPEP, PHLPP2, NUP205, HIP1, MALRD1, RPTOR, RYR2, WBSCR17, DEPDC5, SNPH, TBC1D4, SEC16A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0044441</td>
<td align="left" valign="top">Ciliary part</td>
<td align="center" valign="top">0.004724</td>
<td align="center" valign="top">0.1746</td>
<td align="left" valign="top">CNGA2, DNAH9, PHLPP2, GLI2, USH2A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0044798</td>
<td align="left" valign="top">Nuclear transcription factor complex</td>
<td align="center" valign="top">0.005127</td>
<td align="center" valign="top">0.1816</td>
<td align="left" valign="top">TAF1L, CTNNB1, MYF5</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn3-etm-0-0-7958"><p>GO, Gene Ontology; ID, identification; FDR, false discovery rate.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tIV-etm-0-0-7958" position="float">
<label>Table IV.</label>
<caption><p>Significantly enriched GO molecular function categories of the 68 mutated genes associated with smoking status.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">GO ID</th>
<th align="center" valign="bottom">Description</th>
<th align="center" valign="bottom">P-value</th>
<th align="center" valign="bottom">FDR</th>
<th align="center" valign="bottom">Overlap genes</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">GO:0044877</td>
<td align="left" valign="top">Macromolecular complex binding</td>
<td align="center" valign="top">2.18&#x00D7;10<sup>&#x2212;5</sup></td>
<td align="center" valign="top">0.0308</td>
<td align="left" valign="top">CTNNB1, EGFR, FLNC, GLI2, HIP1, CHD7, RPTOR, TSHZ3, SMARCA4, SPTA1, TTN, USH2A, KDM8, BAZ1B, DEPDC5</td>
</tr>
<tr>
<td align="left" valign="top">GO:0070577</td>
<td align="left" valign="top">Lysine-acetylated histone binding</td>
<td align="center" valign="top">5.57E-05</td>
<td align="center" valign="top">0.0393</td>
<td align="left" valign="top">TAF1L, SMARCA4, BAZ1B</td>
</tr>
<tr>
<td align="left" valign="top">GO:0005516</td>
<td align="left" valign="top">Calmodulin binding</td>
<td align="center" valign="top">9.33E-05</td>
<td align="center" valign="top">0.0418</td>
<td align="left" valign="top">CNGA2, EGFR, KCNH5, TRPV6, RYR2, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0051015</td>
<td align="left" valign="top">Actin filament binding</td>
<td align="center" valign="top">0.000118</td>
<td align="center" valign="top">0.0418</td>
<td align="left" valign="top">EGFR, FLNC, HIP1, SPTA1, TTN</td>
</tr>
<tr>
<td align="left" valign="top">GO:0005261</td>
<td align="left" valign="top">Cation channel activity</td>
<td align="center" valign="top">0.000273</td>
<td align="center" valign="top">0.0742</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, TRPV6, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0003682</td>
<td align="left" valign="top">Chromatin binding</td>
<td align="center" valign="top">0.0004</td>
<td align="center" valign="top">0.0742</td>
<td align="left" valign="top">CTNNB1, EGFR, GLI2, CHD7, TSHZ3, SMARCA4, KDM8, BAZ1B</td>
</tr>
<tr>
<td align="left" valign="top">GO:0000155</td>
<td align="left" valign="top">Phosphorelay sensor kinase activity</td>
<td align="center" valign="top">0.000443</td>
<td align="center" valign="top">0.0742</td>
<td align="left" valign="top">KCNH5, KCNH2</td>
</tr>
<tr>
<td align="left" valign="top">GO:0004673</td>
<td align="left" valign="top">Protein histidine kinase activity</td>
<td align="center" valign="top">0.000443</td>
<td align="center" valign="top">0.0742</td>
<td align="left" valign="top">KCNH5, KCNH2</td>
</tr>
<tr>
<td align="left" valign="top">GO:0046982</td>
<td align="left" valign="top">Protein heterodimerization activity</td>
<td align="center" valign="top">0.000472</td>
<td align="center" valign="top">0.0742</td>
<td align="left" valign="top">CTNNB1, EGFR, KCNH5, HIP1, MYF5, TENM3, SPTA1</td>
</tr>
<tr>
<td align="left" valign="top">GO:0005244</td>
<td align="left" valign="top">Voltage-gated ion channel activity</td>
<td align="center" valign="top">0.000918</td>
<td align="center" valign="top">0.118</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0022832</td>
<td align="left" valign="top">Voltage-gated channel activity</td>
<td align="center" valign="top">0.000918</td>
<td align="center" valign="top">0.118</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0016775</td>
<td align="left" valign="top">Phosphotransferase activity, nitrogenous group as acceptor</td>
<td align="center" valign="top">0.001053</td>
<td align="center" valign="top">0.1241</td>
<td align="left" valign="top">KCNH5, KCNH2</td>
</tr>
<tr>
<td align="left" valign="top">GO:0005216</td>
<td align="left" valign="top">Ion channel activity</td>
<td align="center" valign="top">0.001884</td>
<td align="center" valign="top">0.1874</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, TRPV6, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0046873</td>
<td align="left" valign="top">Metal ion transmembrane transporter activity</td>
<td align="center" valign="top">0.001917</td>
<td align="center" valign="top">0.1874</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, TRPV6, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0001159</td>
<td align="left" valign="top">Core promoter proximal region DNA binding</td>
<td align="center" valign="top">0.001988</td>
<td align="center" valign="top">0.1874</td>
<td align="left" valign="top">GLI2, MYF5, CHD7, SMARCA4, TEAD1, ZNF536</td>
</tr>
<tr>
<td align="left" valign="top">GO:0022838</td>
<td align="left" valign="top">Substrate-specific channel activity</td>
<td align="center" valign="top">0.002202</td>
<td align="center" valign="top">0.1896</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, TRPV6, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0008324</td>
<td align="left" valign="top">Cation transmembrane transporter activity</td>
<td align="center" valign="top">0.002279</td>
<td align="center" valign="top">0.1896</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, ATP13A5, KCNH2, TRPV6, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0022836</td>
<td align="left" valign="top">Gated channel activity</td>
<td align="center" valign="top">0.002541</td>
<td align="center" valign="top">0.1903</td>
<td align="left" valign="top">CNGA2, NALCN, KCNH5, KCNH2, RYR2, SCN1A</td>
</tr>
<tr>
<td align="left" valign="top">GO:0032403</td>
<td align="left" valign="top">Protein complex binding</td>
<td align="center" valign="top">0.002557</td>
<td align="center" valign="top">0.1903</td>
<td align="left" valign="top">EGFR, FLNC, HIP1, RPTOR, SPTA1, TTN, USH2A, DEPDC5</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn4-etm-0-0-7958"><p>GO, Gene Ontology; ID, identification; FDR, false discovery rate.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>
