<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">MCO</journal-id>
<journal-title-group>
<journal-title>Molecular and Clinical Oncology</journal-title>
</journal-title-group>
<issn pub-type="ppub">2049-9450</issn>
<issn pub-type="epub">2049-9469</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">MCO-0-0-02198</article-id>
<article-id pub-id-type="doi">10.3892/mco.2020.2198</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Comparative analysis of target gene exon sequencing by cognitive technology using a next generation sequencing platform in patients with lung cancer</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Chen</surname><given-names>Yu</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Yan</surname><given-names>Wenqing</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Xie</surname><given-names>Zhi</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Guo</surname><given-names>Weibang</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Lu</surname><given-names>Danxia</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Lv</surname><given-names>Zhiyi</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Zhang</surname><given-names>Xuchao</given-names></name>
<xref rid="af1-mco-0-0-02198" ref-type="aff">1</xref>
<xref rid="af2-mco-0-0-02198" ref-type="aff">2</xref>
<xref rid="c1-mco-0-0-02198" ref-type="corresp"/>
</contrib>
</contrib-group>
<aff id="af1-mco-0-0-02198"><label>1</label>Guangdong Provincial Key Laboratory of Translational Medicine in Lung Cancer, Medical Research Center, Guangdong Provincial People&#x0027;s Hospital, Guangdong Academy of Medical Sciences, School of Medicine, South China University of Technology, Guangzhou, Guangdong 510080, P.R. China</aff>
<aff id="af2-mco-0-0-02198"><label>2</label>Guangdong Provincial Key Laboratory of Translational Medicine in Lung Cancer, Medical Research Center, Guangdong Provincial People&#x0027;s Hospital, Guangdong Academy of Medical Sciences, The Second School of Clinical Medicine, Southern Medical University, Guangzhou, Guangdong 510280, P.R. China</aff>
<author-notes>
<corresp id="c1-mco-0-0-02198"><italic>Correspondence to:</italic> Professor Xuchao Zhang, Guangdong Provincial Key Laboratory of Translational Medicine in Lung Cancer, Medical Research Center, Guangdong Provincial People&#x0027;s Hospital, Guangdong Academy of Medical Sciences, School of Medicine, South China University of Technology, 106 Zhongshan Er Road, Guangzhou, Guangdong 510080, P.R. China <email>zhxuchao3000@126.com</email></corresp>
</author-notes>
<pub-date pub-type="ppub">
<month>02</month>
<year>2021</year></pub-date>
<pub-date pub-type="epub">
<day>24</day>
<month>12</month>
<year>2020</year></pub-date>
<volume>14</volume>
<issue>2</issue>
<elocation-id>36</elocation-id>
<history>
<date date-type="received">
<day>31</day>
<month>03</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>12</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Chen et al.</copyright-statement>
<copyright-year>2020</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>Next generation sequencing (NGS) technology is an increasingly important clinical tool for therapeutic decision-making. However, interpretation of NGS data presents challenges at the point of care, due to limitations in understanding the clinical importance of gene variants and efficiently translating results into actionable information for the clinician. The present study compared two approaches for annotating and reporting actionable genes and gene mutations from tumor samples: The traditional approach of manual curation, annotation and reporting using an experienced molecular tumor bioinformationist; and a cloud-based cognitive technology, with the goal to detect gene mutations of potential significance in Chinese patients with lung cancer. Data from 285 gene-targeted exon sequencing previously conducted on 115 patient tissue samples between 2014 and 2016 and subsequently manually annotated and evaluated by the Guangdong Lung Cancer Institute (GLCI) research team were analyzed by the Watson for Genomics (WfG) cognitive genomics technology. A comparative analysis of the annotation results of the two methods was conducted to identify quantitative and qualitative differences in the mutations generated. The complete congruence rate of annotation results between WfG analysis and the GLCI bioinformatician was 43.48&#x0025;. In 65 (56.52&#x0025;) samples, WfG analysis identified and interpreted, on average, 1.54 more mutation sites in each sample than the manual GLCI review. These mutation sites were located on 27 genes, including <italic>EP300</italic>, <italic>ARID1A</italic>, <italic>STK11</italic> and <italic>DNMT3A</italic>. Mutations in the EP300 gene were most prevalent, and present in 30.77&#x0025; samples. The Tumor Mutation Burden (TMB) interpreted by WfG analysis (1.82) was significantly higher than the TMB (0.73) interpreted by GLCI review. Compared with manual curation by a bioinformatician, WfG analysis provided comprehensive insights and additional genetic alterations to inform clinical therapeutic strategies for patients with lung cancer. These findings suggest the valuable role of cognitive computing to increase efficiency in the comprehensive detection and interpretation of genetic alterations which may inform opportunities for targeted cancer therapies.</p>
</abstract>
<kwd-group>
<kwd>next generation sequencing</kwd>
<kwd>lung cancer</kwd>
<kwd>precision medicine</kwd>
<kwd>artificial intelligence</kwd>
<kwd>genomics</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Lung cancer, common worldwide, is the leading cause of cancer deaths in China. Effective and efficient measures to target therapies are needed to reduce disease burden and improve patient prognoses and outcomes. Precision medicine uses genomic analysis, including next generation sequencing, to identify the genetic profile of individual patients and lung cancer cells to identify specific individual receptivity to available therapies. The 2015 launch of the Precision Medicine Initiative by US President Obama accelerated the integration of next generation sequencing (NGS) methods in genomic medicine, especially in oncology care (<xref rid="b1-mco-0-0-02198" ref-type="bibr">1</xref>,<xref rid="b2-mco-0-0-02198" ref-type="bibr">2</xref>). NGS technology is becoming more widely employed as a novel genetic screening, prognostic and diagnostic technique for clinical disease management (<xref rid="b3-mco-0-0-02198 b4-mco-0-0-02198 b5-mco-0-0-02198" ref-type="bibr">3-5</xref>) and is becoming an effective and acceptable method for clinical gene detection (<xref rid="b6-mco-0-0-02198" ref-type="bibr">6</xref>,<xref rid="b7-mco-0-0-02198" ref-type="bibr">7</xref>). Although it is still in the early stage of clinical application for the diagnosis and treatment of tumors (<xref rid="b8-mco-0-0-02198 b9-mco-0-0-02198 b10-mco-0-0-02198" ref-type="bibr">8-10</xref>), its continuous innovation has generated increased awareness and interest in the role of genetic markers and the molecular mechanisms of diseases. Genome alterations play a significant role in disease recurrence for lung cancer (<xref rid="b11-mco-0-0-02198" ref-type="bibr">11</xref>). Large-scale genomic sequencing studies have revealed the complex genomic landscape of lung cancer, with tumor heterogeneity (<xref rid="b12-mco-0-0-02198" ref-type="bibr">12</xref>).</p>
<p>NGS shows promise in the treatment of lung cancer to identify candidate biomarkers for early diagnosis, identify prognostic factors, and detect actionable mutations to guide targeted therapy decisions (<xref rid="b13-mco-0-0-02198" ref-type="bibr">13</xref>). However, NGS generates massive volumes of data. Analyzing and interpreting this data creates challenges for analysts and clinicians alike. As a result, the analysis of data and annotation of variation have become a major bottleneck that inhibits wider clinical adoption and usefulness of NGS technology.</p>
<p>There are many tools available to analyze NGS data for variants (<xref rid="b14-mco-0-0-02198" ref-type="bibr">14</xref>). The standard method for annotation of NGS mutations at cancer care centers like Guangdong Lung Cancer Institute (GLCI) includes retrieval, analysis and comparison of the annotation results with databases like DrugBank, COSMIC, dbSNP, OMIM, ClinVar, 5000 Exomes and 1000 Genomes. Genes are further analyzed through screening under a series of conditions examining zygosity, variant type, variant effect, location, filtered coverage and minor allele frequency, on the basis of preliminary analysis of NGS data.</p>
<p>This study compared the annotation and interpretation of NGS results at a large volume cancer center in Guangdong, China using standard methods and using IBM Watson for Genomics (WfG), a cloud-based cognitive computing system. WfG is trained to analyze molecular data at a massive scale to provide clinically actionable insights that are supported by all available relevant evidence. The tool is built on several different predictive models that can perform analysis across the whole-genome and accesses a comprehensive database of structured and unstructured data sources using Natural Language Processing (NLP). (Over 200 sources include DrugBank, NCI, COSMIC, ClinVar, and 1000 Genomes, as well as evidence extracted from the universe of biological and medical literature.) WfG is in use in selected markets (<xref rid="b13-mco-0-0-02198" ref-type="bibr">13</xref>,<xref rid="b15-mco-0-0-02198" ref-type="bibr">15</xref>,<xref rid="b16-mco-0-0-02198" ref-type="bibr">16</xref>), however, its application in the analysis and annotation of results of NGS data in Chinese patients has not been reported. This study examined and compared the NGS data annotation process by comparing the results of the gene mutation annotation for Chinese patients with lung cancer generated by Guangdong Lung Cancer Institute (GLCI) bioinformaticians and WfG. The ultimate goal of this study was to leverage insights from the analysis to inform individual treatment decisions to benefit future Chinese patients with lung cancer.</p>
</sec>
<sec sec-type="Materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Materials</title>
<p>In terms of patient specimen collection and analysis, we do have approval from our institutional ethics board and informed consent from each patient. Actually, we have established a tissue repository center (Tumor Sample Bank) in our cancer center which was approved by Human Biomaterial and Genetic Resource Office of China. Researchers at Guangdong Lung Cancer Institute (GLCI) obtained a variety of tissue samples from the Tumor Sample Bank in GLCI of Guangdong Provincial People&#x0027;s Hospital, Guangzhou, China. The samples, from 115 randomly selected patients diagnosed with lung cancer at Guangdong General Hospital between 2014 and 2016, included 10 formalin fixed paraffin-embedded (FFPE) samples, 12 small samples collected after puncture, 4 plasma samples and 89 large tumor samples collected during surgery. All patients were well informed and signed the informed consent.</p>
</sec>
<sec>
<title>DNA extraction</title>
<p>Researchers used QIAGEN QlAampDNA Mini Kit and QIAGEN QlAampBlood Mini Kit for the gDNA extraction process from each tissue sample (i.e., the FFPE samples, small samples collected after puncture and large samples collected in surgery) and DNA from 1-4 ml plasma samples, respectively. DNA quantitative analysis was completed using Qubit analyzer.</p>
</sec>
<sec>
<title>Design and synthesis of target capture probe</title>
<p>Hybrid capture in target areas was implemented using SureSelect<sup>XT</sup> Custom library. This probe library was designed through SureDesign software based on genome hg19/GRCh37; target areas were lung cancer-related high-frequency mutation gene exon areas.</p>
</sec>
<sec>
<title>Establishment of NGS library</title>
<p>The NGS library was established using free DNA in plasma and gDNA in tissue samples. Free DNA was extracted from 1-2 ml plasma to establish the library, without requirement of fragmentation. Approximately, 50-1,000 ng of gDNA was extracted from each tissue sample and cut into 100-200 bp segments through enzyme digestion to establish the library. The NGS library was established using Ion Xpress&#x2122; Plus Fragment Library Kit and Ion Xpress&#x2122; Barcode Adapters 1-16 Kit, and the selection, purification and recovery of DNA was completed via Agencourt AMPure XP beads. Target areas were captured after hybridization through SureSelectXT Custom library at 65&#x02DA;C for 16-24 h following 11 cycles of polymerase chain reaction (PCR) amplification in pre-library after segment selection. The target area sequencing library was obtained from the captured library after purification and 9 cycles of PCR amplification. Finally, QIAxcel and Qubit were used to detect the library segment length and library concentration, respectively.</p>
</sec>
<sec>
<title>Sequencing and data analysis</title>
<p>The library was diluted into 12 pM using water according to its concentration and connected with microballoon using Ion PI&#x2122; Template OT2 200 Kit v2, after which the samples were spotted on P1 chips for sequencing. The sequencing data were compared with human genome Hg 19 using Suite software (Life Technologies, Version 5.0.2), and mutations were detected using Variant Caller software (Life Technologies, Version 5.0.2.1), so as to form corresponding variant calling files (VCF).</p>
</sec>
<sec>
<title>Target sequencing data interpretation by GLCI using standard methods</title>
<p>Integrative Genomics Viewer (IGV) software was applied by GLCI bioinformaticians to annotate the gene mutation information by comparing with the databases DrugBank, COSMIC, dbSNP, OMIM, ClinVar, 5000 Exomes and 1000 Genomes.</p>
</sec>
<sec>
<title>Gene variant interpretation executed by Watson for Genomics</title>
<p>Upon sequence completion by GLCI lab, the research team accessed the cloud-based cognitive computing tool, Watson for Genomics (WfG). The following information was uploaded to WfG: (a) tumor type, (b) a list of variants as a variant calling &#xFB01;le (.vcf). Uploading this data required approximately 1 min for each sample, by a data technician. After these data were uploaded, WfG performed the Molecular Profile Analysis (MPA) for each gene with a variant. A subpart of the WfG cognitive tool, the MPA reviews evidence from functional studies and protein structure and applies programming logic to classify variants into five categories: Pathogenic, likely pathogenic, benign, likely benign, and variables of unknown significance (VUS). Alterations categorized as benign or likely benign were removed from the report. Next, WfG identified a gene as actionable if: a) the variant was pathogenic or likely pathogenic; b) the variant was directly targetable or part of a pathway that was targetable based on evidence from the literature; and c) a U.S. Food and Drug Administration-approved or investigational target therapy was available.</p>
</sec>
<sec>
<title>Statistical analysis</title>
<p>Statistical analysis of the difference between tumor mutation burden obtained by WFG and GLCI was performed by the paired t-test analysis. Counting data was expressed by the number of cases/percentage n(&#x0025;), and the measurement data were expressed by mean number (mean &#x00B1; SD). t-test analysis was performed using SPSS v22.0 software (IBM Corp.). P&#x003C;0.05 was considered to indicate a statistically significant difference.</p>
</sec>
</sec>
</sec>
<sec sec-type="Results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>Congruence rate of gene mutation interpretation results of GLCI and WfG analyses/between two methods</title>
<p>&#x2018;Complete congruence&#x2019; was defined as having completely consistent interpretation results from the two methods, which included the following two scenarios: In the first scenario, no mutation sites were identified by either method, i.e., Complete Congruence, 0 alterations reported. In the second scenario, the same single or multiple mutation sites were interpreted by both methods, i.e., Complete Congruence, 1 or more identical alterations reported by both methods.</p>
<p>&#x2018;Partial congruence&#x2019; was defined as the partially consistent interpretation results from the two methods, i.e., mutation sites interpreted by WfG analysis contained those by GLCI bioinformaticians. &#x2018;No congruence&#x2019; was defined as totally inconsistent interpretation results from the two methods, i.e., no mutation sites were interpreted by GLCI bioinformaticians while 1 or more mutation site were interpreted by WfG analysis, or vice versa.</p>
<p>After annotation of the sequencing results of all 115 samples, across all samples WfG identified 180 alterations whereas GLCI identified 80 mutation sites. The congruence rate in detecting mutation sites across the entire sample was 44.44&#x0025;. Complete congruence was found in the analyses of 50 samples (congruence rate 43.48&#x0025;.) Of these completely congruent samples, 29 had no reported mutation sites (Complete Congruence, 0 alterations), and 21 had 29 mutation sites (Complete Congruence, 1 or more alterations).</p>
<p>In the remaining 65 samples (56.52&#x0025;), a total of 180 mutation sites were found after analysis by both methods. Compared with GLCI bioinformaticians, WfG analysis interpreted more mutation sites (100), with an average of 1.54 more mutation sites in each sample. (In one sample, WfG interpreted 11 more mutation sites than were interpreted by the GLCI bioinformatician). In 37 samples (32.17&#x0025;) Partial Congruence was found in the analysis: GLCI interpreted 51 mutation sites and WfG interpreted 103 mutation sites. (WfG interpreted an average of 1.41 more mutation sites in each sample). Finally, in the 28 samples with no congruence, WfG reported 48 mutation sites and GLCI reported zero mutation sites. In these samples, WfG analysis interpreted 1.71 more mutation sites in each sample when compared to GLCI analysis WfG (<xref rid="f1-mco-0-0-02198" ref-type="fig">Fig. 1</xref> and <xref rid="tI-mco-0-0-02198" ref-type="table">Table I</xref>).</p>
</sec>
<sec>
<title>Differential genes and signal pathways in data interpretation results between two methods</title>
<p>NGS data from 115 samples were annotated by WfG and GLCI bioinformaticians respectively. In 50 samples (43.5&#x0025;), no mutations were found by either GLCI or WfG. However, in the remaining 65 samples (56.5&#x0025;), WfG analysis interpreted more mutation sites (100), whereas in 54 (47.0&#x0025;) samples, WfG analysis interpreted 1-2 more mutation sites when compared with GLCI bioinformaticians&#x0027; results (<xref rid="tII-mco-0-0-02198" ref-type="table">Table II</xref>).</p>
<p>The 100 mutation sites that were additionally identified by WfG analysis were located on 27 genes, including <italic>EP300</italic>, <italic>ARID1A</italic>, <italic>STK11</italic> and <italic>DNMT3A</italic>. The mutation rate was 30.77&#x0025; (20/65) in <italic>EP300</italic> gene; 18.46&#x0025; (12/65), 16.92&#x0025; (11/65) and 13.85&#x0025; (9/65) in <italic>ARID1A</italic>, <italic>STK11</italic> and <italic>DNMT3A</italic> genes, respectively. In the 37 (32.17&#x0025;) samples out of 65, 51 mutation sites were interpreted by GLCI bioinformaticians while 103 by WfG analysis, with 51 co-interpreted mutation sites located on <italic>EGFR</italic>, <italic>KRAS</italic> and <italic>TP53</italic> genes. WfG interpreted 52 more mutation sites located on <italic>EP300</italic>, <italic>ARID1A</italic>, <italic>STK11</italic>, <italic>DNMT3A</italic>, <italic>PIK3R1</italic>, <italic>PTEN</italic>, <italic>ATM</italic>, <italic>BCL2</italic>, <italic>MSH6</italic>, <italic>NF1</italic>, <italic>PTPN11</italic>, <italic>ATR</italic>, <italic>BRCA1</italic>, <italic>CDKN2A</italic>, <italic>CREBBP</italic>, <italic>MET</italic>, <italic>RNF43</italic> and <italic>TSC1</italic> genes. In another 28 (24.35&#x0025;) samples out of 65, no mutation sites were interpreted by GLCI bioinformaticians while 48 mutation sites were interpreted by WfG analysis and were located on <italic>EP300</italic>, <italic>ARID1A</italic>, <italic>STK11</italic>, <italic>TSC2</italic>, <italic>BRCA2</italic>, <italic>MSH6</italic>, <italic>BRCA1</italic>, <italic>DNMT3A</italic>, <italic>NF2</italic>, <italic>ATM</italic>, <italic>PIK3R1</italic>, <italic>PTEN</italic>, <italic>BRAF</italic>, <italic>CDKN1B</italic>, <italic>IDH1</italic>, <italic>IDH2</italic>, <italic>KIT</italic>, <italic>PMS2</italic>, <italic>RNF43</italic> and <italic>TSC1</italic> genes (<xref rid="f2-mco-0-0-02198" ref-type="fig">Fig. 2</xref>).</p>
</sec>
<sec>
<title>Analysis of congruence rate of common driver gene mutation annotation</title>
<p>The two methods reported similar annotation results in the mutations analysis of 4 common driver genes of <italic>EGFR</italic>, <italic>KRAS</italic>, <italic>TP53</italic> and <italic>ERBB2</italic>. Of the 115 samples, the total number of mutation sites for these 4 genes was 82, 80 of which were identically reported by both methods (congruence rate 97.56&#x0025;). The congruence rates of <italic>EGFR</italic>, <italic>KRAS</italic>, <italic>TP53</italic> and <italic>ERBB2</italic> were 100&#x0025; (29/29), 100&#x0025; (12/12), 94.87&#x0025; (37/39) and 100&#x0025; (2/2) respectively (<xref rid="f3-mco-0-0-02198" ref-type="fig">Fig. 3</xref>).</p>
</sec>
<sec>
<title>Analysis of tumor mutation burden (TMB) interpretation results between two methods</title>
<p>The average number of missense mutations obtained before preliminary analysis and filtration of NGS data of target 0.947 kb genes in the 115 samples was 130.43, with a TMB of 137.78. After WfG interpretation and filtration, the average number of missense mutations and TMB obtained was 1.72 and 1.82, significantly higher than the 0.69 and 0.73, respectively, obtained by GLCI bioinformaticians (P&#x003C;0.05) (<xref rid="f4-mco-0-0-02198" ref-type="fig">Fig. 4</xref>).</p>
</sec>
<sec>
<title>New high-Minor Allele Frequency (MAF) mutations indicated by WfG</title>
<p>In one of the 37 samples with Partial Congruence, (sample no. 29002) WfG analysis identified 4 new mutation sites with high MAF. The MAF of <italic>TP53</italic> G245V mutation found by both methods was 5.25&#x0025;, while the MAF was 11.95, 14.05, 35.15 and 32.02&#x0025; in new mutation sites <italic>ATR</italic> G492fs, <italic>DNMT3A</italic> A222fs, <italic>PTEN</italic> R130Q and <italic>STK11</italic> G257fs interpreted by WfG only, respectively (<xref rid="f5-mco-0-0-02198" ref-type="fig">Fig. 5</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="Discussion">
<title>Discussion</title>
<p>Several cancer centers are beginning to use artificial intelligence (AI) computing systems to analyze NGS data (<xref rid="b14-mco-0-0-02198" ref-type="bibr">14</xref>). The speed and volume of research, discovery and reporting on new genes and mutations, and their relationships to tumorigenesis is accelerating. Monitoring and integrating this knowledge for use in clinical decision-making is a task well-suited to cognitive computing technologies. WfG is a cognitive computing technology which can learn new information and analyze data at a rate that far exceeds manual curation and analysis (<xref rid="b14-mco-0-0-02198" ref-type="bibr">14</xref>). During this study, the number of cases analyzed and annotated by one GLCI bioinformatician working full time was approximately 10 cases per one week. In contrast, WfG completed analysis and annotation of each sample in &#x007E;3 min, evidence that WfG was able to perform this analysis at a much faster rate than even a highly trained human analyst.</p>
<p>To compare the effectiveness of the WfG cognitive computing tool with human-only targeted panels at identifying potentially actionable gene mutations for Chinese patients with lung cancer, we retrospectively analyzed 115 cases from GLCI that had undergone targeted DNA sequencing of 285 genes and subsequent analysis by the GLCI bioinformaticians. We conducted an independent analysis of these 115 samples using WfG. WfG was provided the full list of variants on each sequenced sample and identified 180 mutation sites in the 115 samples vs. only 80 mutation sites identified by GLCI bioinformaticians. This indicated that WfG could interpret more mutation sites than GLCI bioinformaticians using standard methods. This is particularly valuable as it is well known that target therapy or individualized therapy focuses on specific variant genes or specified gene mutation sites. WfG&#x0027;s ability to interpret more gene mutation sites from 56.52&#x0025; samples demonstrates that it can provide more opportunities for targeted therapy for patients with lung cancer and additional useful information for clinical doctors developing therapeutic strategies.</p>
<p>In this study, the congruence rate of the analyses of NGS results was 43.48&#x0025; and congruence rate of mutations was 44.44&#x0025; between WfG analysis and GLCI bioinformaticians&#x0027;, and is similar to the results of other studies which have compared WfG and manual mutation analysis (<xref rid="b13-mco-0-0-02198" ref-type="bibr">13</xref>,<xref rid="b16-mco-0-0-02198" ref-type="bibr">16</xref>). There are several possible reasons that fewer mutation sites were interpreted by GLCI bioinformaticians than by WfG. For example, the GLCI biological analysis was completed by one individual bioinformatician and was relatively conservative, emphasizing mutation sites that were common driver gene mutation sites that had been studied thoroughly, as demonstrated by the 97.56&#x0025; congruence rate of significant driver gene mutation sites. This is because GLCI bioinformaticians mainly focus on driving genes, while the new analysis methods are more comprehensive, which not only does not lose the driving gene targets, but also increases the discovery of many new rare mutation targets. A defining feature of the WfG cognitive tool and its analysis is its ability to retrieve almost all the research published worldwide, and to extract actionable information to analyze and annotate gene mutation sites. In contrast, the bioinformatician relied on extensive individual experience in reading the literature and analyzing results. As to most uncommon variant genes, WfG annotation was superior to the GLCI bioinformatician&#x0027;s. In this study results, the genes in the list interpreted by WfG analysis only, such as <italic>MSH6</italic>, <italic>DNMT3A</italic>, <italic>NF2</italic>, <italic>ATM</italic>, <italic>PIK3R1</italic>, <italic>CDKN1B</italic>, <italic>IDH1</italic>, <italic>IDH2</italic>, <italic>KIT</italic>, <italic>PMS2</italic>, <italic>RNF43</italic> and <italic>TSC1</italic>, reflect WfG&#x0027;s ability to access and integrate more recent research consistently during analysis. This may also explain discrepancies between the annotation results of bioinformaticians and WfG analysis in samples with mutation sites occurring in uncommon genes. This may also suggest that uncommon variant genes should be of increased concern to bioinformaticians, especially in this rapidly evolving arena.</p>
<p>This retrospective analysis of individual tissue samples representing 115 cases offers an opportunity to review treatment therapies with the benefit of new information gleaned from WfG. In 24.35&#x0025; samples, targeted therapy was not performed for patients as no clinically significant mutation sites were found by GLCI bioinformaticians. In contrast, mutation sites were found in uncommon genes by WfG analysis in 56.52&#x0025; of the samples. Patients with these mutations may have been eligible to participate in clinical trials with corresponding agents, had this information been available to their clinicians at the point of care. Going forward, it is therefore possible that cognitive technologies such as WfG will be able to assist clinicians by providing the comprehensive and timely analysis needed to help them guide patients to appropriate therapies. This is consistent with the view of Itahashi <italic>et al</italic> (<xref rid="b17-mco-0-0-02198" ref-type="bibr">17</xref>). They believe that WFG is useful for a clinician at a general hospital additional survey of evidence by a clinician is required when evaluating functions (<xref rid="b17-mco-0-0-02198" ref-type="bibr">17</xref>). In our sample of Chinese patients with lung cancer, opportunities for targeted therapy might have been available for 24.35&#x0025; of patients, informed by the WfG analysis. Of particular interest, in one sample in which there was no congruence between GLCI and WfG analysis, and in which there was no common mutation gene variation, this analysis found that the MAF value of multiple new gene mutation sites interpreted by WfG analysis was evidently higher than that of discovered insignificant gene mutation sites. This finding suggests that the therapeutic strategies for patients with findings like these might be changed accordingly.</p>
<p>This report is a preliminary study to compare AI-aided analysis using the cognitive computing technology WfG to the standard manual method. Limitations of the study included relatively small sample size, absence of in-association analysis with clinical therapy and, as a result, no specific information about how the results of this analysis might have affected clinical outcomes. Our study did not systematically measure the time required to annotate, interpret and report NGS results by either the standard method used by GLCI, or by WfG. Future studies will be necessary to address these limitations.</p>
<p>In conclusion, this study provides evidence that analysis of NGS results by the cognitive computing technology WfG can provide an accurate and comprehensive interpretation of more gene mutation sites through a more rapid process than routine manual analysis, generating potential opportunities of targeted therapy for cancer patients. The basis of targeted therapy is to obtain the effective gene mutation information of patients in time. For patients diagnosed with lung cancer in China and elsewhere, the timely provision of actionable information that may affect treatment options can be critical to individual therapy and research. The abundance of mutation sites interpreted uniquely by WfG analysis in partial samples was relatively high, further suggesting opportunities to optimize clinical decision making for a greater number of affected patients in the future.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>This abstract was presented at the 2018 ASCO Annual Meeting (June 1, 2018; Chicago, USA) and was published as Abstract no. e24254.</p>
</ack>
<sec>
<title>Funding</title>
<p>The present study was supported by the following grants: Guangdong Provincial Natural Science Program (grant no. 2019A1515010900; to XZ); GDPH Dengfeng Program (grant nos. DFJH201903, KJ012019444 and 8197103306; to XZ); Guangdong Provincial Applied S&#x0026;T R&#x0026;D Program &#x005B;grant no. 2016B020237006; to Professor Peng Li (Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China) and XZ&#x005D;.</p>
</sec>
<sec>
<title>Availability of data and materials</title>
<p>The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>YC and XZ designed the study, YC, WY, ZX, WG, DL and ZL performed the experiments. YC interpreted the data and drafted the initial manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>The protocol of the present study was approved by the Ethics Committee of Guangdong Provincial People&#x0027;s Hospital and written informed consent was obtained from all patients.</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="b1-mco-0-0-02198"><label>1</label><element-citation publication-type="journal"><comment>The White House Office of the Press Secretary. Remarks by the President in State of the Union. (2015-01-20) &#x005B;2015-03-20&#x005D;. <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.whitehouse.gov/the-press-office/2015/01/20/remarks-president-state-union-address-january-20-2015">https://www.whitehouse.gov/the-press-office/2015/01/20/remarks-president-state-union-address-january-20-2015</ext-link>.</comment></element-citation></ref>
<ref id="b2-mco-0-0-02198"><label>2</label><element-citation publication-type="journal"><comment>The White House Office of the Press Secretary. FACT SHEET: President Obama&#x0027;s Precision Medicine Initiative. (2015-01-30) &#x005B;2015-03-20&#x005D;. <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.whitehouse.gov/the-press-office/2015/01/30/fact-sheet-president-obama&#x0027;s-precision-medicine-initiative">https://www.whitehouse.gov/the-press-office/2015/01/30/fact-sheet-president-obama&#x0027;s-precision-medicine-initiative</ext-link>.</comment></element-citation></ref>
<ref id="b3-mco-0-0-02198"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Y</given-names></name><name><surname>Muzny</surname><given-names>DM</given-names></name><name><surname>Reid</surname><given-names>JG</given-names></name><name><surname>Bainbridge</surname><given-names>MN</given-names></name><name><surname>Willis</surname><given-names>A</given-names></name><name><surname>Ward</surname><given-names>PA</given-names></name><name><surname>Braxton</surname><given-names>A</given-names></name><name><surname>Beuten</surname><given-names>J</given-names></name><name><surname>Xia</surname><given-names>F</given-names></name><name><surname>Niu</surname><given-names>Z</given-names></name><etal/></person-group><article-title>Clinical whole-exome sequencing for the diagnosis of mendelian disorders</article-title><source>N Engl J Med</source><volume>369</volume><fpage>1502</fpage><lpage>1511</lpage><year>2013</year><pub-id pub-id-type="pmid">24088041</pub-id><pub-id pub-id-type="doi">10.1056/NEJMoa1306555</pub-id></element-citation></ref>
<ref id="b4-mco-0-0-02198"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>H</given-names></name><name><surname>Deignan</surname><given-names>JL</given-names></name><name><surname>Dorrani</surname><given-names>N</given-names></name><name><surname>Strom</surname><given-names>SP</given-names></name><name><surname>Kantarci</surname><given-names>S</given-names></name><name><surname>Quintero-Rivera</surname><given-names>F</given-names></name><name><surname>Das</surname><given-names>K</given-names></name><name><surname>Toy</surname><given-names>T</given-names></name><name><surname>Harry</surname><given-names>B</given-names></name><name><surname>Yourshaw</surname><given-names>M</given-names></name><etal/></person-group><article-title>Clinical exome sequencing for genetic identification of rare Mendelian disorders</article-title><source>JAMA</source><volume>312</volume><fpage>1880</fpage><lpage>1887</lpage><year>2014</year><pub-id pub-id-type="pmid">25326637</pub-id><pub-id pub-id-type="doi">10.1001/jama.2014.14604</pub-id></element-citation></ref>
<ref id="b5-mco-0-0-02198"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dewey</surname><given-names>FE</given-names></name><name><surname>Grove</surname><given-names>ME</given-names></name><name><surname>Pan</surname><given-names>C</given-names></name><name><surname>Goldstein</surname><given-names>BA</given-names></name><name><surname>Bernstein</surname><given-names>JA</given-names></name><name><surname>Chaib</surname><given-names>H</given-names></name><name><surname>Merker</surname><given-names>JD</given-names></name><name><surname>Goldfeder</surname><given-names>RL</given-names></name><name><surname>Enns</surname><given-names>GM</given-names></name><name><surname>David</surname><given-names>SP</given-names></name><etal/></person-group><article-title>Clinical interpretation and implications of whole-genome sequencing</article-title><source>JAMA</source><volume>311</volume><fpage>1035</fpage><lpage>1045</lpage><year>2014</year><pub-id pub-id-type="pmid">24618965</pub-id><pub-id pub-id-type="doi">10.1001/jama.2014.1717</pub-id></element-citation></ref>
<ref id="b6-mco-0-0-02198"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hayes</surname><given-names>DN</given-names></name><name><surname>Kim</surname><given-names>WY</given-names></name></person-group><article-title>The next steps in next-gen sequencing of cancer genomes</article-title><source>J Clin Invest</source><volume>125</volume><fpage>462</fpage><lpage>468</lpage><year>2015</year><pub-id pub-id-type="pmid">25642706</pub-id><pub-id pub-id-type="doi">10.1172/JCI68339</pub-id></element-citation></ref>
<ref id="b7-mco-0-0-02198"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Garraway</surname><given-names>LA</given-names></name></person-group><article-title>Genomics-driven oncology: Framework for an emerging paradigm</article-title><source>J Clin Oncol</source><volume>31</volume><fpage>1806</fpage><lpage>1814</lpage><year>2013</year><pub-id pub-id-type="pmid">23589557</pub-id><pub-id pub-id-type="doi">10.1200/JCO.2012.46.8934</pub-id></element-citation></ref>
<ref id="b8-mco-0-0-02198"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Good</surname><given-names>BM</given-names></name><name><surname>Ainscough</surname><given-names>BJ</given-names></name><name><surname>McMichael</surname><given-names>JF</given-names></name><name><surname>Su</surname><given-names>AI</given-names></name><name><surname>Griffith</surname><given-names>OL</given-names></name></person-group><article-title>Organizing knowledge to enable personalization of medicine in cancer</article-title><source>Genome Biol</source><volume>15</volume><issue>438</issue><year>2014</year><pub-id pub-id-type="pmid">25222080</pub-id><pub-id pub-id-type="doi">10.1186/s13059-014-0438-7</pub-id></element-citation></ref>
<ref id="b9-mco-0-0-02198"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Griffith</surname><given-names>M</given-names></name><name><surname>Miller</surname><given-names>CA</given-names></name><name><surname>Griffith</surname><given-names>OL</given-names></name><name><surname>Krysiak</surname><given-names>K</given-names></name><name><surname>Skidmore</surname><given-names>ZL</given-names></name><name><surname>Ramu</surname><given-names>A</given-names></name><name><surname>Walker</surname><given-names>JR</given-names></name><name><surname>Dang</surname><given-names>HX</given-names></name><name><surname>Trani</surname><given-names>L</given-names></name><name><surname>Larson</surname><given-names>DE</given-names></name><etal/></person-group><article-title>Optimizing cancer genome sequencing and analysis</article-title><source>Cell Syst</source><volume>1</volume><fpage>210</fpage><lpage>223</lpage><year>2015</year><pub-id pub-id-type="pmid">26645048</pub-id><pub-id pub-id-type="doi">10.1016/j.cels.2015.08.015</pub-id></element-citation></ref>
<ref id="b10-mco-0-0-02198"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hyman</surname><given-names>DM</given-names></name><name><surname>Solit</surname><given-names>DB</given-names></name><name><surname>Arcila</surname><given-names>ME</given-names></name><name><surname>Cheng</surname><given-names>DT</given-names></name><name><surname>Sabbatini</surname><given-names>P</given-names></name><name><surname>Baselga</surname><given-names>J</given-names></name><name><surname>Berger</surname><given-names>MF</given-names></name><name><surname>Ladanyi</surname><given-names>M</given-names></name></person-group><article-title>Precision medicine at memorial sloan kettering cancer center: Clinical next generation sequencing enabling next-generation targeted therapy trials</article-title><source>Drug Discov Today</source><volume>20</volume><fpage>1422</fpage><lpage>1428</lpage><year>2015</year><pub-id pub-id-type="pmid">26320725</pub-id><pub-id pub-id-type="doi">10.1016/j.drudis.2015.08.005</pub-id></element-citation></ref>
<ref id="b11-mco-0-0-02198"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname><given-names>Z</given-names></name><name><surname>Sun</surname><given-names>R</given-names></name><name><surname>Curtis</surname><given-names>C</given-names></name></person-group><article-title>A population genetics perspective on the determinants of intra-tumor heterogeneity</article-title><source>Biochim Biophys Acta Rev Cancer</source><volume>1867</volume><fpage>109</fpage><lpage>126</lpage><year>2017</year><pub-id pub-id-type="pmid">28274726</pub-id><pub-id pub-id-type="doi">10.1016/j.bbcan.2017.03.001</pub-id></element-citation></ref>
<ref id="b12-mco-0-0-02198"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jamal-Hanjani</surname><given-names>M</given-names></name><name><surname>Hackshaw</surname><given-names>A</given-names></name><name><surname>Ngai</surname><given-names>Y</given-names></name><name><surname>Shaw</surname><given-names>J</given-names></name><name><surname>Dive</surname><given-names>C</given-names></name><name><surname>Quezada</surname><given-names>S</given-names></name><name><surname>Middleton</surname><given-names>G</given-names></name><name><surname>de Bruin</surname><given-names>E</given-names></name><name><surname>Le Quesne</surname><given-names>J</given-names></name><name><surname>Shafi</surname><given-names>S</given-names></name><etal/></person-group><article-title>Tracking genomic cancer evolution for precision medicine: The lung TRACERx Study</article-title><source>PLoS Biol</source><volume>12</volume><issue>e1001906</issue><year>2014</year><pub-id pub-id-type="pmid">25003521</pub-id><pub-id pub-id-type="doi">10.1371/journal.pbio.1001906</pub-id></element-citation></ref>
<ref id="b13-mco-0-0-02198"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Patel</surname><given-names>NM</given-names></name><name><surname>Michelini</surname><given-names>VV</given-names></name><name><surname>Snell</surname><given-names>JM</given-names></name><name><surname>Balu</surname><given-names>S</given-names></name><name><surname>Hoyle</surname><given-names>AP</given-names></name><name><surname>Parker</surname><given-names>JS</given-names></name><name><surname>Hayward</surname><given-names>MC</given-names></name><name><surname>Eberhard</surname><given-names>DA</given-names></name><name><surname>Salazar</surname><given-names>AH</given-names></name><name><surname>McNeillie</surname><given-names>P</given-names></name><etal/></person-group><article-title>Enhancing next-generation sequencing-guided cancer care through cognitive computing</article-title><source>Oncologist</source><volume>23</volume><fpage>179</fpage><lpage>185</lpage><year>2018</year><pub-id pub-id-type="pmid">29158372</pub-id><pub-id pub-id-type="doi">10.1634/theoncologist.2017-0170</pub-id></element-citation></ref>
<ref id="b14-mco-0-0-02198"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pabinger</surname><given-names>S</given-names></name><name><surname>Dander</surname><given-names>A</given-names></name><name><surname>Fischer</surname><given-names>M</given-names></name><name><surname>Snajder</surname><given-names>R</given-names></name><name><surname>Sperk</surname><given-names>M</given-names></name><name><surname>Efremova</surname><given-names>M</given-names></name><name><surname>Krabichler</surname><given-names>B</given-names></name><name><surname>Speicher</surname><given-names>MR</given-names></name><name><surname>Zschocke</surname><given-names>J</given-names></name><name><surname>Trajanoski</surname><given-names>Z</given-names></name></person-group><article-title>A survey of tools for variant analysis of next-generation genome sequencing data</article-title><source>Brief Bioinform</source><volume>15</volume><fpage>256</fpage><lpage>278</lpage><year>2014</year><pub-id pub-id-type="pmid">23341494</pub-id><pub-id pub-id-type="doi">10.1093/bib/bbs086</pub-id></element-citation></ref>
<ref id="b15-mco-0-0-02198"><label>15</label><element-citation publication-type="journal"><article-title>Oncologists partner with Watson on genomics</article-title><source>Cancer Discov</source><volume>5</volume><issue>788</issue><year>2015</year><pub-id pub-id-type="pmid">26080837</pub-id><pub-id pub-id-type="doi">10.1158/2159-8290.CD-NB2015-090</pub-id></element-citation></ref>
<ref id="b16-mco-0-0-02198"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wrzeszczynski</surname><given-names>KO</given-names></name><name><surname>Frank</surname><given-names>MO</given-names></name><name><surname>Koyama</surname><given-names>T</given-names></name><name><surname>Rhrissorrakrai</surname><given-names>K</given-names></name><name><surname>Robine</surname><given-names>N</given-names></name><name><surname>Utro</surname><given-names>F</given-names></name><name><surname>Emde</surname><given-names>AK</given-names></name><name><surname>Chen</surname><given-names>BJ</given-names></name><name><surname>Arora</surname><given-names>K</given-names></name><name><surname>Shah</surname><given-names>M</given-names></name><etal/></person-group><article-title>Comparing sequencing assays and human-machine analyses in actionable genomics for glioblastoma</article-title><source>Neurol Genet</source><volume>3</volume><issue>e164</issue><year>2017</year><pub-id pub-id-type="pmid">28740869</pub-id><pub-id pub-id-type="doi">10.1212/NXG.0000000000000164</pub-id></element-citation></ref>
<ref id="b17-mco-0-0-02198"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Itahashi</surname><given-names>K</given-names></name><name><surname>Kondo</surname><given-names>S</given-names></name><name><surname>Kubo</surname><given-names>T</given-names></name><name><surname>Fujiwara</surname><given-names>Y</given-names></name><name><surname>Kato</surname><given-names>M</given-names></name><name><surname>Ichikawa</surname><given-names>H</given-names></name><name><surname>Koyama</surname><given-names>T</given-names></name><name><surname>Tokumasu</surname><given-names>R</given-names></name><name><surname>Xu</surname><given-names>J</given-names></name><name><surname>Huettner</surname><given-names>CS</given-names></name><etal/></person-group><article-title>Evaluating clinical genome sequence analysis by Watson for genomics</article-title><source>Front Med (Lausanne)</source><volume>5</volume><issue>305</issue><year>2018</year><pub-id pub-id-type="pmid">30474028</pub-id><pub-id pub-id-type="doi">10.3389/fmed.2018.00305</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-mco-0-0-02198" position="float">
<label>Figure 1</label>
<caption><p>Comparing patient-paired samples using different annotation methods of Watson for Genomics and Guangdong Lung Cancer Institute bioinformationist.</p></caption>
<graphic xlink:href="mco-14-02-02198-g00.tif" />
</fig>
<fig id="f2-mco-0-0-02198" position="float">
<label>Figure 2</label>
<caption><p>Mutation genes identified by GLCI and WfG. (A) Partial congruence and (B) no congruence. GLCI, Guangdong Lung Cancer Institute; WfG, Watson for Genomics.</p></caption>
<graphic xlink:href="mco-14-02-02198-g01.tif" />
</fig>
<fig id="f3-mco-0-0-02198" position="float">
<label>Figure 3</label>
<caption><p>Congruence rates of common driver gene mutation in analysis by both methods. GLCI, Guangdong Lung Cancer Institute; ERBB2, erb-b2 receptor tyrosine kinase 2; WfG, Watson for Genomics.</p></caption>
<graphic xlink:href="mco-14-02-02198-g02.tif" />
</fig>
<fig id="f4-mco-0-0-02198" position="float">
<label>Figure 4</label>
<caption><p>Comparison of tumor mutation burden (across all samples) for the two methods. <sup>&#x002A;</sup>P&#x003C;0.05. GLCI, Guangdong Lung Cancer Institute; WfG, Watson for Genomics.</p></caption>
<graphic xlink:href="mco-14-02-02198-g03.tif" />
</fig>
<fig id="f5-mco-0-0-02198" position="float">
<label>Figure 5</label>
<caption><p>MAF of five mutation sites in one sample (no. 29002) as interpreted by both methods. MAF, minor allele frequency; ATR, ATR serine/threonine kinase; DNMT3A, DNA methyltransferase 3&#x03B1;; STK11, serine/threonine kinase 11; GLCI, Guangdong Lung Cancer Institute; WfG, Watson for Genomics.</p></caption>
<graphic xlink:href="mco-14-02-02198-g04.tif" />
</fig>
<table-wrap id="tI-mco-0-0-02198" position="float">
<label>Table I</label>
<caption><p>Number of mutation sites identified by WfG and GLCI.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle">Type of uniformity</th>
<th align="center" valign="middle">No.</th>
<th align="center" valign="middle">Mutation sites identified by WfG, n</th>
<th align="center" valign="middle">Mutation sites identified by GLCI, n</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">Complete congruence, 0 alterations</td>
<td align="center" valign="middle">29</td>
<td align="center" valign="middle">0</td>
<td align="center" valign="middle">0</td>
</tr>
<tr>
<td align="left" valign="middle">Complete congruence, 1 or more alterations</td>
<td align="center" valign="middle">21</td>
<td align="center" valign="middle">29</td>
<td align="center" valign="middle">29</td>
</tr>
<tr>
<td align="left" valign="middle">Partial congruence</td>
<td align="center" valign="middle">37</td>
<td align="center" valign="middle">103</td>
<td align="center" valign="middle">51</td>
</tr>
<tr>
<td align="left" valign="middle">No congruence</td>
<td align="center" valign="middle">28</td>
<td align="center" valign="middle">48</td>
<td align="center" valign="middle">0</td>
</tr>
<tr>
<td align="left" valign="middle">Total</td>
<td align="center" valign="middle">115</td>
<td align="center" valign="middle">180</td>
<td align="center" valign="middle">80</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>GLCI, Guangdong Lung Cancer Institute; WfG, Watson for Genomics.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tII-mco-0-0-02198" position="float">
<label>Table II</label>
<caption><p>Variability in mutation sites interpreted by Watson for Genomics and Guangdong Lung Cancer Institute.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle">Variability (number of different mutation sites)</th>
<th align="center" valign="middle">Incidence, no. of samples</th>
<th align="center" valign="middle">Share of samples, &#x0025;</th>
<th align="center" valign="middle">Cumulative percentage, &#x0025;</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">0</td>
<td align="center" valign="middle">50</td>
<td align="center" valign="middle">43.5</td>
<td align="center" valign="middle">43.5</td>
</tr>
<tr>
<td align="left" valign="middle">1</td>
<td align="center" valign="middle">41</td>
<td align="center" valign="middle">35.7</td>
<td align="center" valign="middle">79.1</td>
</tr>
<tr>
<td align="left" valign="middle">2</td>
<td align="center" valign="middle">13</td>
<td align="center" valign="middle">11.3</td>
<td align="center" valign="middle">90.4</td>
</tr>
<tr>
<td align="left" valign="middle">3</td>
<td align="center" valign="middle">3</td>
<td align="center" valign="middle">2.6</td>
<td align="center" valign="middle">93.0</td>
</tr>
<tr>
<td align="left" valign="middle">4</td>
<td align="center" valign="middle">4</td>
<td align="center" valign="middle">3.5</td>
<td align="center" valign="middle">96.5</td>
</tr>
<tr>
<td align="left" valign="middle">5</td>
<td align="center" valign="middle">2</td>
<td align="center" valign="middle">1.7</td>
<td align="center" valign="middle">98.3</td>
</tr>
<tr>
<td align="left" valign="middle">6</td>
<td align="center" valign="middle">1</td>
<td align="center" valign="middle">0.9</td>
<td align="center" valign="middle">99.1</td>
</tr>
<tr>
<td align="left" valign="middle">11</td>
<td align="center" valign="middle">1</td>
<td align="center" valign="middle">0.9</td>
<td align="center" valign="middle">100.0</td>
</tr>
<tr>
<td align="left" valign="middle">Total</td>
<td align="center" valign="middle">115</td>
<td align="center" valign="middle">100.0</td>
<td align="center" valign="middle">&#x00A0;</td>
</tr>
</tbody>
</table>
</table-wrap>
</floats-group>
</article>
