<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">OL</journal-id>
<journal-title-group>
<journal-title>Oncology Letters</journal-title>
</journal-title-group>
<issn pub-type="ppub">1792-1074</issn>
<issn pub-type="epub">1792-1082</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/ol.2020.11976</article-id>
<article-id pub-id-type="publisher-id">OL-0-0-11976</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Integrated analysis of whole genome and transcriptome sequencing in a young patient with gastric cancer provides insights for precision therapy</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Hu</surname><given-names>Kongwang</given-names></name>
<xref rid="af1-ol-0-0-11976" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Yu</surname><given-names>Weiqiang</given-names></name>
<xref rid="af2-ol-0-0-11976" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Ajayi</surname><given-names>Olugbenga Emmanuel</given-names></name>
<xref rid="af2-ol-0-0-11976" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Li</surname><given-names>Longlong</given-names></name>
<xref rid="af1-ol-0-0-11976" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Huang</surname><given-names>Zhiguo</given-names></name>
<xref rid="af1-ol-0-0-11976" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Rong</surname><given-names>Qiqi</given-names></name>
<xref rid="af2-ol-0-0-11976" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Wang</surname><given-names>Shuaili</given-names></name>
<xref rid="af2-ol-0-0-11976" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Wu</surname><given-names>Qing-Fa</given-names></name>
<xref rid="af2-ol-0-0-11976" ref-type="aff">2</xref>
<xref rid="c1-ol-0-0-11976" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-ol-0-0-11976"><label>1</label>Division of Gastrointestinal Surgery, Department of General Surgery, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui 230022, P.R. China</aff>
<aff id="af2-ol-0-0-11976"><label>2</label>Hefei National Laboratory for Physical Sciences at Microscale, CAS Key Laboratory of Innate Immunity and Chronic Disease, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230026, P.R. China</aff>
<author-notes>
<corresp id="c1-ol-0-0-11976"><italic>Correspondence to</italic>: Professor Qing-Fa Wu, Hefei National Laboratory for Physical Sciences at Microscale, CAS Key Laboratory of Innate Immunity and Chronic Disease, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, 443 HuangShan Road, Hefei, Anhui 230026, P.R. China, E-mail: <email>wuqf@ustc.edu.cn</email></corresp>
</author-notes>
<pub-date pub-type="ppub">
<month>10</month>
<year>2020</year></pub-date>
<pub-date pub-type="epub">
<day>12</day>
<month>08</month>
<year>2020</year></pub-date>
<volume>20</volume>
<issue>4</issue>
<elocation-id>115</elocation-id>
<history>
<date date-type="received"><day>04</day><month>06</month><year>2019</year></date>
<date date-type="accepted"><day>17</day><month>07</month><year>2020</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Hu et al.</copyright-statement>
<copyright-year>2020</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>Gastric cancer is a leading cause of cancer-associated deaths worldwide and is considered to be an age-related disease. In younger patients, gastric cancer is biologically more aggressive, and prognosis is worse compared with that in elderly patients. In the present case report, the whole genome and transcriptome was sequenced in a 26-year-old patient with gastric cancer who presented with gastric cancer-related symptoms and was admitted to the First Affiliated Anhui Medical Hospital (Hefei, China) in December 2016. In total, 9 germline and 4 somatic mutations were identified in the patient, and there were more deleterious sites in the germline mutated genes. Genes with somatic mutations, such as <italic>MUC2, MUC4, SLC8A2</italic>, and with structural variations, including <italic>CCND3, FGFR2</italic> and <italic>FGFR3</italic>, were found to be differentially expressed. Cancer-associated pathways, such as the &#x2018;calcium signaling pathway&#x2019;, &#x2018;cGMP-PKG signaling pathway&#x2019; and &#x2018;transcriptional mis-regulation&#x2019; were also enriched at both the genomic and transcriptomic levels. The genes found to have germline (<italic>SFRP4</italic>), somatic (<italic>MUC2, MUC4, SLC8A2</italic>) mutations, or structural variations (<italic>CCND3, FGFR2</italic> and <italic>FGFR3</italic>) were differentially expressed in the patient and could be promising precision therapy targets.</p>
</abstract>
<kwd-group>
<kwd>gastric cancer</kwd>
<kwd>young age</kwd>
<kwd>germline mutations</kwd>
<kwd>somatic mutations</kwd>
<kwd>TCGA STAD</kwd>
</kwd-group></article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Gastric cancer (GC) is a leading cause of cancer-related deaths worldwide and is known to be an age-related disease (<xref rid="b1-ol-0-0-11976" ref-type="bibr">1</xref>). The mean age of patients with GC at diagnosis is ~60 years and &#x003C;3&#x0025; of GC cases are reported in patients &#x003C;30 years of age (<xref rid="b2-ol-0-0-11976" ref-type="bibr">2</xref>,<xref rid="b3-ol-0-0-11976" ref-type="bibr">3</xref>). Several reports have found that younger patients (~30 years) are often diagnosed with advanced stages of GC and have worse prognosis compared with that in elderly patients (~60 years) (<xref rid="b4-ol-0-0-11976" ref-type="bibr">4</xref>,<xref rid="b5-ol-0-0-11976" ref-type="bibr">5</xref>). In younger patients, the cancer was found to spread rapidly and was biologically more aggressive (<xref rid="b6-ol-0-0-11976" ref-type="bibr">6</xref>). Integrative genomic approaches, which associate genomic and transcriptomic analysis have been found to increase the identification of numerous events and processes associated with tumor aggressiveness and may assist in selecting candidate genes from normal and pathological samples for targeted therapy (<xref rid="b7-ol-0-0-11976" ref-type="bibr">7</xref>). A previous study aimed to investigate the biological and genomic mechanisms in the progression of GC has led to the development of target-oriented therapy for advanced GC to be used in a clinical setting, with base substitution mutations in genes such as tumor protein p53 (<italic>TP53</italic>), AT-rich interaction domain 1A (<italic>ARID1A</italic>), KRAS proto-oncogene, GTPase (<italic>KRAS</italic>), phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (<italic>PIK3CA</italic>), ring finger protein 43 (<italic>RNF43</italic>), APC regulator of WNT signaling pathway (<italic>APC</italic>), RAS p21 protein activator 1 (<italic>RASA1</italic>) and erb-b2 receptor tyrosine kinase 2 (<italic>ERBB2</italic>) identified as important therapeutic targets in anti-cancer treatments (<xref rid="b8-ol-0-0-11976" ref-type="bibr">8</xref>). Developing personalized therapy for optimal individual cancer patient outcome has become more feasible due to the rapid advancement in next-generation sequencing techniques and technologies that enable fast and comprehensive characterizations of tumors at the molecular level (<xref rid="b9-ol-0-0-11976" ref-type="bibr">9</xref>&#x2013;<xref rid="b12-ol-0-0-11976" ref-type="bibr">12</xref>). As every patient harbors a unique combination of variants that influence the risk, onset, and progression of the disease, an effective personalized therapy is dependent on a well-profiled genome from the individual patient with cancer and understanding the oncogenic mechanisms that regulate the progression of the tumor (<xref rid="b13-ol-0-0-11976" ref-type="bibr">13</xref>). With a view to identifying potential clinically actionable therapeutic targets that may inform individualized treatment strategies, the whole genome was sequenced in a 26-year-old patient with advanced stage GC, by integrating the whole genome and whole transcriptome sequencing data.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Sample collection and preservation</title>
<p>A 26-year-old female patient was admitted to the First Affiliated Anhui Medical Hospital in December 2016 having presented with GC-related symptoms. The patient felt uncomfortable in the upper left quadrant of her abdomen and had repeated black melena for one month. This symptom recurred and became more serious for 2 weeks. The patient felt full and uncomfortable in the upper left quadrant of her abdomen repeatedly following a meal, one month prior to the development of black melena. She also had nausea, although there was no hematemesis or hematochezia. The abdominal discomfort worsened and the patient received treatment in Luan Jinkai Hospital. Thereafter, the patient visited the First Affiliated Hospital of Anhui Medical University where the present study was conducted. The gastroscopy of the patient indicated GC with obstruction, while pathology indicated antrum considered as carcinoma mucocellulare. Pathological examination of the samples was performed using hematoxylin and eosin staining. Briefly, tissues were sliced with a dimension of 1.5&#x00D7;1.5&#x00D7;0.2-0.3 cm and soaked in 40&#x00B0;C warm water. The sections were deparaffinized in xylene and were gradually hydrated through graded ethanol (100, 95, 80 and 70&#x0025;) each for 3 min at 25&#x00B0;C. The sections were stained in hematoxylin solution for 10 min at 25&#x00B0;C and differentiated in 1&#x0025; hydrochloric alcohol. Then the slides were rinsed with tap water and distilled water until the nuclei became blue, and dehydrated in 95&#x0025; ethanol. The sections were counterstained in 1&#x0025; eosin solution for 5 min at 25&#x00B0;C, washed with 70&#x0025; ethanol twice and absolute ethanol, and were cleared in two washes of xylene. The sections were mounted with neutral balsam and observed under a light microscope magnification, &#x00D7;400 (&#x00D7;40 objective and &#x00D7;10 ocular magnification; BX-42; Olympus). An abdominal computer tomography scan &#x002B; enhancement showed that both armpits had multiple small lymph nodes and the gastric antrum had thickened walls along with multiple swollen lymph nodes in the surrounding area. The patient was diagnosed with a T3N3bM0 (advanced stage IIIC) GC using AJCC TNM staging system 7th edition (<xref rid="b14-ol-0-0-11976" ref-type="bibr">14</xref>).</p>
<p>The patient underwent a gastrostomy for tumor removal. Tumor tissue and adjacent normal tissue ~5-cm away from the tumor were resected and preserved in liquid nitrogen until use.</p>
</sec>
<sec>
<title>DNA and RNA extraction and library preparation</title>
<p>Genomic DNA was isolated from the resected tissues using a genomic DNA isolation kit (Bioo Scientific; <uri xlink:href="http://www.biooscientific.com">http://www.biooscientific.com</uri>) according to the manufacturer&#x0027;s recommendations. Extracted DNA was quantified using a NanoDrop ND-1000 spectrophotometer (NanoDrop Technologies; Thermo Fisher Scientific, Inc.) and the integrity was assessed using 1&#x0025; agarose gel electrophoresis and visualized in gel imager (Tanon; <uri xlink:href="http://www.biotanon.com">http://www.biotanon.com</uri>). RNA was extracted from the tumor and adjacent normal tissue using TRIzol<sup>&#x00AE;</sup> (Invitrogen; Thermo Fisher Scientific, Inc.), and eluted in RNAse-free water. RNA quantity and quality were assessed using an Agilent 2100 Bioanalyzer (Agilent Technologies Inc.). DNA and RNA libraries were prepared according to standard protocols according to Bioo Scientific.</p>
</sec>
<sec>
<title>Whole transcriptome sequencing and pre-analysis</title>
<p>For whole transcriptome sequencing, raw sequencing reads were produced using an Illumina X10 sequencer (Illumina Inc.) with 150-bp paired-end reads according to the manufacturer&#x0027;s instructions. Raw RNA-Sequencing (RNA-Seq) data was trimmed by removing the adaptors and low quality reads using trim_galore v0.4.4 software (<xref rid="b15-ol-0-0-11976" ref-type="bibr">15</xref>). The data was subsequently aligned to the human reference genome GRCh37.p13 using STAR v2.6.1a software (<xref rid="b16-ol-0-0-11976" ref-type="bibr">16</xref>) in a 2-pass mapping. Read count in the tumor and adjacent normal tissues was calculated using htseq-count v0.10.0 software (<xref rid="b17-ol-0-0-11976" ref-type="bibr">17</xref>), and differentially expressed genes were detected using edgeR v3.24.3 software (<xref rid="b18-ol-0-0-11976" ref-type="bibr">18</xref>). Genes with P&#x2264;0.001 and log<sub>2</sub> fold change &#x2265;1 were considered as differentially expressed.</p>
</sec>
<sec>
<title>Whole genome sequencing and pre-analysis</title>
<p>Whole genome sequencing of the prepared libraries was performed using the Illumina X10 sequencer (Illumina Inc.) with 150-bp paired-end reads according to the manufacturer&#x0027;s instructions. The raw fastq data was mapped to the human reference genome (b37) provided by the Broad Institute with the Burrows-Wheeler Aligner v.7.0.12 (<xref rid="b19-ol-0-0-11976" ref-type="bibr">19</xref>). Read duplications were marked using the Picard tool v.2.10.10; local realignment and base quality score recalibration was performed using Genome Analysis ToolKit (GATK) v. 3.8-0 (<xref rid="b20-ol-0-0-11976" ref-type="bibr">20</xref>) and the final bam files were used for variant calling.</p>
</sec>
<sec>
<title>Somatic and germline variants detection</title>
<p>Germline variants were called using the GATK HaplotyperCaller joint v3.8 software (<xref rid="b21-ol-0-0-11976" ref-type="bibr">21</xref>) and then filtered using GATK variant quality score recalibration (VQSR). Variants that passed using the VQSR module with coverage &#x2265;6X and supporting reads &#x2265;0 were retained, while the variants showing genotype &#x2018;0/0&#x2019; or &#x2018;./.&#x2019; in the tumor sample were filtered out. For somatic variant calling, three different types of software were used: GATK HaplotypeCaller joint v3.8, Mutect v1.1.4 and MuTect2 v3.8 (<xref rid="b22-ol-0-0-11976" ref-type="bibr">22</xref>). In Mutect and MuTect2, all variants that were flagged as &#x2018;PASS&#x2019; were kept. Other variants were retained with a coverage &#x2265;5X and supporting &#x2265;3 reads, if they were flagged among a subset of &#x2018;alternative (<xref rid="b23-ol-0-0-11976" ref-type="bibr">23</xref>) allele in normal&#x2019;, &#x2018;triallelic site&#x2019;, &#x2018;possible contamination&#x2019;, &#x2018;clustered read position&#x2019; in Mutect, or &#x2018;germline risk&#x2019;, &#x2018;alt allele in normal&#x2019;, &#x2018;t lod fstar&#x2019; (tumor does not meet likelihood threshold), &#x2018;str contraction&#x2019; (site filtered due to contraction of short tandem repeat region), &#x2018;triallelic site&#x2019; (site filtered because &#x003E; two alternate alleles pass tumor likelihood threshold) in MuTect2. Germline variants were removed from both the Mutect and MuTect2 output data. In all three softwares, variant allele frequency in the tumor sample had to be 3 times higher compared with that in the normal sample, and &#x003E;5&#x0025; of the variant reads in the tumor sample was required. Functional annotations for both germline and somatic variants were added to each mutation using the ANNOVAR software v.2018Apr16 (<xref rid="b17-ol-0-0-11976" ref-type="bibr">17</xref>) and several publicly available databases, such as 1,000 Genome Project (<xref rid="b24-ol-0-0-11976" ref-type="bibr">24</xref>), the Exome Aggregation Consortium (<xref rid="b25-ol-0-0-11976" ref-type="bibr">25</xref>), the Genome Aggregation Database (<xref rid="b26-ol-0-0-11976" ref-type="bibr">26</xref>) and the compiled scores prediction system dbnsfp33a (<xref rid="b27-ol-0-0-11976" ref-type="bibr">27</xref>), dbSNP (<xref rid="b28-ol-0-0-11976" ref-type="bibr">28</xref>) and COSMIC (<xref rid="b29-ol-0-0-11976" ref-type="bibr">29</xref>).</p>
</sec>
<sec>
<title>Transition to transversion ratio calculation</title>
<p>The transition to transversion ratio for germline variants was calculated by dividing total transitions by total transversions. Transitions refer to variations that involve a change from purine to purine, while transversions refer to variations that involve a change from purine to pyrimidine or vice versa (<xref rid="b30-ol-0-0-11976" ref-type="bibr">30</xref>). The transition to transversion ratio between homologous strands of DNA is generally ~2, and for human, the ratio is ~2.1 (<xref rid="b31-ol-0-0-11976" ref-type="bibr">31</xref>).</p>
</sec>
<sec>
<title>Copy number alternation and structural variation analysis</title>
<p>Copy number variants (CNV), tumor purity and ploidy was analyzed using cnv_facets software (<uri xlink:href="https://github.com/dariober/cnv_facets">https://github.com/dariober/cnv_facets</uri>) based on FACETS v0.5.14 (<xref rid="b32-ol-0-0-11976" ref-type="bibr">32</xref>), and run with the &#x2018;-cval 25 400&#x2019; command. Duplicated and deleted segments were retained and annotated using an ENCODE gene symbol in R v3.5.1 (<xref rid="b33-ol-0-0-11976" ref-type="bibr">33</xref>). iCallSV (<uri xlink:href="https://github.com/rhshah/iCallSV">https://github.com/rhshah/iCallSV</uri>), which applied Delly v0.7.5 prediction method (<xref rid="b34-ol-0-0-11976" ref-type="bibr">34</xref>) was used to detect structural variations and fusions using default parameters. Inter- and intra-chromosomal fusions &#x003E;50 kb were retained and only in-frame deletions, duplications and fusions were selected for subsequent analysis.</p>
</sec>
<sec>
<title>Mutation significance analysis</title>
<p>The 1,000 Genome Project (<xref rid="b24-ol-0-0-11976" ref-type="bibr">24</xref>), the Exome Aggregation Consortium (<xref rid="b25-ol-0-0-11976" ref-type="bibr">25</xref>) and the Genome Aggregation Database (<xref rid="b26-ol-0-0-11976" ref-type="bibr">26</xref>) provided alternative allele frequency data in different populations that ANNOVAR software applied to the present variations. Annotations using a set of algorithm scoring system from dbNSFP v.33a, including FATHMM, LRT, Mutation Assessor, Mutation Taster, M-CAP, Polyphen2, PROVEAN and SIFT (<xref rid="b35-ol-0-0-11976" ref-type="bibr">35</xref>), were also applied to the variations. Then the common variants were filtered out, which are sites with a minor allele frequency &#x2265;1&#x0025;, to get the most informative ones. Mutations predicted to be deleterious by at least two algorithms of dbNSFP were considered to be potential pathogenic sites of these informative variants.</p>
</sec>
<sec>
<title>Identification of mutations in cancer associated genes</title>
<p>To identify genes with a high possibility of involvement in cancer development, the Cancer Gene Census (CGC) (<uri xlink:href="https://cancer.sanger.ac.uk/census">https://cancer.sanger.ac.uk/census</uri>) (<xref rid="b36-ol-0-0-11976" ref-type="bibr">36</xref>) and Cancer Predisposition Genes (CPG) (<xref rid="b37-ol-0-0-11976" ref-type="bibr">37</xref>) databases were used. In total, 850 candidate genes from cancer-associated databases (723 genes from CGC and 114 genes from CPG with 87 overlapping genes), were investigated. The gene lists were downloaded directly from the two databases with no specific selection criteria.</p>
</sec>
<sec>
<title>Pathway analysis</title>
<p>Gene enrichment was performed using the R package clusterProfiler v3.10.1 software (<xref rid="b38-ol-0-0-11976" ref-type="bibr">38</xref>), which implements a hypergeometric model to test for gene set overrepresentation relative to a background gene set. Enrichment was analyzed using the functions: &#x2018;Enriched Kyoto Encyclopedia of Genes and Genomes (KEGG)&#x2019;, &#x2018;gseKEGG&#x2019; (Gene Set Enrichment Analysis (GSEA) (<xref rid="b39-ol-0-0-11976" ref-type="bibr">39</xref>) of KEGG, and a function of clusterProfiler), P&#x003C;0.05, <uri xlink:href="http://OrgDb=org.Hs.eg.db">OrgDb=org.Hs.eg.db</uri> (a database for Genome Wide Annotation for Human, referenced by clusterProfiler).</p>
</sec>
<sec>
<title>Integrated analysis of whole genome and transcriptome sequencing data</title>
<p>The differentially expressed genes were compared with mutated genes identified by whole genome sequencing. To achieve this, genes that were mutated and with differentially expressed transcripts were merged in R v3.5.1 (<xref rid="b33-ol-0-0-11976" ref-type="bibr">33</xref>).</p>
</sec>
<sec>
<title>Analysis of public GC RNA-Seq data</title>
<p>The RNA-Seq data from The Cancer Genome Atlas stomach adenocarcinoma (TCGA STAD) was downloaded from TCGAbiolinks online tool (<uri xlink:href="https://github.com/BioinformaticsFMRP/TCGAbiolinks">https://github.com/BioinformaticsFMRP/TCGAbiolinks</uri>). To identify the most significantly differentially expressed genes in GC, the results from three RNA-Seq differential expression analysis tools, including limma v3.42.2 (<xref rid="b40-ol-0-0-11976" ref-type="bibr">40</xref>), DEseq2 v1.26.0 (<xref rid="b41-ol-0-0-11976" ref-type="bibr">41</xref>) and edgeR v3.28.1 (<xref rid="b18-ol-0-0-11976" ref-type="bibr">18</xref>), were combined and the overlapping genes were separated for further analysis. For all three tools, P&#x003C;0.05 and the average log<sub>2</sub> fold change &#x00B1; standard deviation were used as the cut-off values. Gene Ontology (<xref rid="b42-ol-0-0-11976" ref-type="bibr">42</xref>) enrichment analysis was performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 functional annotation clustering tool to determine the potential functions of the differentially expressed genes. The KEGG pathway analysis was performed to determine the involvement of differentially expressed genes in different biological pathways. The-log10 (P) denotes the enrichment score, indicating the significance of the pathway associations.</p>
</sec>
<sec>
<title>Kaplan-Meier plotter (KM-Plotter) analysis</title>
<p>KM-plotter (<uri xlink:href="https://kmplot.com/analysis/">https://kmplot.com/analysis/</uri>) is an online database and tool which can be used for the discovery and validation of survival biomarkers (<xref rid="b43-ol-0-0-11976" ref-type="bibr">43</xref>). It contains cancer gene expression data and survival information from Gene Expression Omnibus (<xref rid="b44-ol-0-0-11976" ref-type="bibr">44</xref>), European Genome-phenome Archive (EGA) (<uri xlink:href="https://www.ebi.ac.uk/ega/home">https://www.ebi.ac.uk/ega/home</uri>), and TCGA (<uri xlink:href="https://www.cancer.gov/tcga">https://www.cancer.gov/tcga</uri>), and uses each percentile of mRNA expression between the lower (25&#x0025;) and upper quartiles (75&#x0025;) of expression as a cut-off point to divide patients into high and low expression groups. The KM-plotter software, which included available transcriptome and survival data (both overall and disease-free survival times), was used to analyze a total of 1,440 patients with GC, based on default parameters.</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>Tumor purity and coverage statistics</title>
<p>The proportion and the ploidy of tumor cells in the sample were ~22&#x0025; and 1.74, respectively. Using whole genome sequencing, the total number of paired reads were 8.13 and 8.34 billion, the mean read coverage was 37.16 and 35.48, while the mean library insert size was 319 and 323 bp for normal and tumor tissue samples, respectively. For the consensus coding sequence region, ~90&#x0025; of the bases had over 30X coverage. Using whole transcriptome sequencing, 1.04 and 1.11 billion clean reads were obtained with an average input read length of 253 and 261 bp for normal and tumor tissue, respectively.</p>
</sec>
<sec>
<title>Gene expression analysis</title>
<p>The RNA-Seq pipeline was used (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Fig. S1A</xref>) for the whole transcriptome sequencing. Genes with P&#x2264;0.001 and absolute log<sub>2</sub> fold change value &#x2265;1 were considered as significantly differentially expressed genes. In total, 766 down- and 765 upregulated genes were obtained (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Fig. S1B</xref>). The RNA-Seq result was subsequently matched to the whole genome sequencing data for further analysis.</p>
</sec>
<sec>
<title>Germline mutation analysis</title>
<p>Germline mutation analysis revealed a total of 4,228,339 single nucleotide polymorphisms (SNPs) and Indels with 95.03&#x0025; of the sites previously reported in the dbSNP database. The transition to transversion ratio was 2.05, indicating that the germline analysis had high confidence, as the empirical value for whole genome analysis is ~2.1 (<xref rid="b31-ol-0-0-11976" ref-type="bibr">31</xref>). Among the germline variants, 21,405 SNPs/Indels were located in the exonic regions, of which 10,136 were predicted to alter the protein, including 9,504 non-synonymous single nucleotide variations (SNVs), 112 frameshift deletions, 90 frameshift insertions, 173 non-frameshift deletions, 169 non-frameshift insertions, 78 stop-gain, and 10 stop-loss. More than 1&#x0025; minor allele frequency variants from publicly available databases were removed, as those sites are common in the population. Of the candidate genes from the CGC and CPG databases, 13 SNPs and two Indels were identified in 15 genes (<xref rid="tI-ol-0-0-11976" ref-type="table">Table I</xref>). A deleterious score evaluation of the 15 variants revealed that mutations in <italic>DOCK8</italic> (p.D63N), <italic>HLF</italic> (p.V59A), <italic>NKX2-1</italic> (p.G322S), <italic>NRG1</italic> (p.S17C), <italic>PRDM2</italic> (p.S823T), <italic>WNK2</italic> (p.C652R) were deleterious, while mutations in <italic>KMT2D</italic> (p.V401M), <italic>RAD51D</italic> (p.A181V) and <italic>ROS1</italic> (p.E1605Q) were possibly damaging.</p>
</sec>
<sec>
<title>Somatic mutation analysis</title>
<p>A total of 3 softwares from the Broad Institute were used to identify the somatic variants and 114 variants in the exonic regions were found. This included 45 non-synonymous SNVs, 6 frameshift deletions, 11 frameshift insertions, 17 non-frameshift deletions, 3 non-frameshift insertions, 2 stop-gain and 30 synonymous SNVs. Similar to the germline mutation analysis, common variants were filtered out with publicly available datasets to select the most informative somatic variants. In total, 29 variants were obtained, including 13 mutations which were reported in the Catalogue of Somatic Mutations in Cancer database (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Table SI</xref>). Of these, 3 protein-truncating variants (<italic>BPNT1</italic>, p.S144I; <italic>FRG1</italic>, p.<italic>N153D;</italic> and <italic>TAS2R31</italic>, p.L59F) were predicted to be deleterious and one variant (<italic>KRTAP5-3</italic>, p.C185S) was predicted to be possibly damaging. However, for transcriptional regulation, 3 genes (<italic>MUC2, MUC4</italic> and <italic>SLC8A2</italic>) were found to be differentially expressed.</p>
</sec>
<sec>
<title>Large-scale variation analysis</title>
<p>To identify large-scale variations in the patient, the copy number alterations and complex structural events in the tumor sample were investigated. The structural variant discovery tool, DELLY, reported 701 structural events; however, only in-frame events or fusions over 50 kb between two genes were retained. Ultimately, 20 significant structural variations were reported (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Table SII</xref>). Another sensitive tool, FACETS, detected 58 large segments, of which 62 were deletions and 112 were duplications found in the coding regions (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Table SIII</xref>). Combining the results from the two tools, nine genes (<italic>CDKN1B, FBXW7, MUC4, CCND3, ETV6, FGFR2, FGFR3, TFEB</italic> and <italic>KMT2C</italic>) were reported in either the CGC or CPG databases. <italic>CDKN1B, FBXW7</italic> and <italic>MUC4</italic> showed duplications, and there were deletions in <italic>CCND3, ETV6, FGFR2, FGFR3</italic> and <italic>TFEB</italic>. Although <italic>KMT2C</italic> rearranged with <italic>BAGE2</italic>, the latter was not reported in the 2 databases. The mRNA expression levels of the genes with structural variations was also investigated and 21 were found to be differentially expressed in the tumor sample (<xref rid="f1-ol-0-0-11976" ref-type="fig">Fig. 1</xref>).</p>
</sec>
<sec>
<title>Integrated pathway analysis results</title>
<p>RNA-Seq and mutation data were analyzed to characterize the genomic alterations in known signaling pathways. The GO enrichment for somatic mutated genes revealed terms, which included 3 mucin family genes, <italic>MUC2, MUC4</italic> and <italic>MUC6</italic>, and were enriched in &#x2018;maintenance of gastrointestinal epithelium&#x2019;, &#x2018;O-glycan processing&#x2019;, and digestive system process, while <italic>KRTAP5-3, TCHH, RPTN, POU3F1</italic> were enriched in epidermal cell differentiation pathway (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Table SIV</xref>). The KEGG enrichment analysis revealed <italic>SLC8A2</italic> and <italic>SLC25A5</italic> were enriched in calcium signaling and cGMP-PKG signaling pathways (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Table SIV</xref>). From the RNA-Seq data, GSEA showed that the upregulated genes were enriched in &#x2018;cGMP-PKG signaling&#x2019;, &#x2018;calcium signaling&#x2019;, and &#x2018;oxytocin signaling&#x2019; pathways, while the downregulated genes were enriched in &#x2018;cell cycle&#x2019; pathway, &#x2018;oxidative phosphorylation&#x2019; pathway, &#x2018;transcriptional mis-regulation in cancer&#x2019; pathway and &#x2018;GC pathway&#x2019; (<xref rid="f2-ol-0-0-11976" ref-type="fig">Fig. 2</xref>).</p>
</sec>
<sec>
<title>Comparison with TCGA STAD GC features</title>
<p>To ensure the present study was representative and valuable, the genomic features of the patient was compared with data from TCGA STAD dataset, which was downloaded from TCGAbiolinks, and 329 (136 up- and 193 downregulated genes) differentially expressed genes were identified, which were also found in the RNA-Seq analysis. A total of three tools were used to identify the most significantly differentially expressed genes, and 136 up- and 193 downregulated genes were identified (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Fig. S2A</xref>), and were subsequently investigated in the tumor tissue sample obtained from the patient with GC and compared with that in the adjacent normal tissue. In total, 27 genes were identified, including five that were up- and 22 that were downregulated (<xref rid="f3-ol-0-0-11976" ref-type="fig">Fig. 3A</xref>). GO and KEGG pathway enrichment analysis revealed one (GO:MF) and three (KEGG) significant terms, respectively, including digestion and gastric acid secretion (<xref rid="f3-ol-0-0-11976" ref-type="fig">Fig. 3B</xref>).</p>
<p>To determine the associated functional genomic alterations of these 27 genes, the previous studies that reported them were analyzed. In the present study, there were no somatic mutations in the 27 genes, however, 12 of the genes, including gastrokine 2 (<italic>GKN2</italic>), prosaposin-like 1 (<italic>PSAPL1</italic>), ethanolamine-phosphate phospho-lyase (<italic>ETNPPL</italic>), cytochrome P450 family 2 subfamily W member 1 (<italic>CYP2W1</italic>), <italic>SFRP4</italic>, collagen type XXII &#x03B1; 1 chain, prostate stem cell antigen, lipase F, gastric type, solute carrier family 15 member 1 (<italic>SLC15A1</italic>), ATPase H&#x002B;/K&#x002B; transporting subunit &#x03B2; (<italic>ATP4B</italic>), aldehyde dehydrogenase 3 family member A1 (<italic>ALDH3A1</italic>) and cystatin SN (<italic>CST1</italic>) were mutated in germline. Notably, two (<italic>SFRP4</italic> and <italic>SLC15A1</italic>) of the 12 germline mutated genes have been reported in previous studies. Germline mutations in <italic>SFRP4</italic> (P320T and R340K) were reported to be strongly associated with human cancer types (<xref rid="b45-ol-0-0-11976" ref-type="bibr">45</xref>) while mutations in <italic>SLC15A1</italic> were linked with bowel diseases (<xref rid="b46-ol-0-0-11976" ref-type="bibr">46</xref>).</p>
<p>The KM plotter database was subsequently used with the log-rank test to investigate the association between the expression level of the 27 genes and clinical outcomes (both disease-free and overall survival times). In the database, patients with GC are divided into high and low expression groups, with each percentile of mRNA expression between the lower (25&#x0025;) and upper quartiles (75&#x0025;) used as a cut-off point. A total of 10 genes (<italic>ALDH3A1, ETNPPL</italic>, activity regulated cytoskeleton associated protein (<italic>ARC</italic>), <italic>ATP4B, CST1, CYP2W1, GKN2, PSAPL1, SFRP4</italic> and <italic>SLC15A1</italic>) with differential expression were significantly associated (P&#x003C;0.05) with survival times. In all the 10 genes, those with high expression (red curve) were correlated with a poorer survival while genes with low expression (black curve) correlated with higher survival in patients with GC (<xref rid="SD1-ol-0-0-11976" ref-type="supplementary-material">Fig. S2B</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>In an attempt to improve the understanding of the pathobiology of GC in young patients and provide insights for personalized treatment strategies, the genome of a 26-year old patient with GC was performed using both whole genome and whole transcriptome sequencing. Germline and somatic mutations analyses of the whole genome sequencing data revealed 15 cancer related genes with germline mutations, 27 genes with 29 informative somatic mutations and nine genes with structural events that could underlie the progression of GC in the patient. Of the 27 genes containing somatic mutations, 26 have not been previously reported as driver genes in TCGA (<xref rid="b42-ol-0-0-11976" ref-type="bibr">42</xref>). Only <italic>MUC6</italic>, a immunohistochemical marker used to support the diagnosis of gastric-type EA (<xref rid="b47-ol-0-0-11976" ref-type="bibr">47</xref>), has been reported as a driver gene. In the mutation significance analysis, a set of scoring system was used to predict deleterious amino acid substitutions. This analysis revealed nine germline variants and four somatic mutations with potentially pathogenic sites. Compared with somatic mutations, more deleterious germline sites were detected and genes containing these sites were all found to be cancer-associated, suggesting that inheritance disorders might be a key risk factor for the development of GC in the patient.</p>
<p>To improve the understanding of the inter-relationships between genomic alterations and the transcriptome, the mutated genes at the transcriptional level were also investigated. It was found that genes with somatic mutations, such as <italic>MUC2, MUC4, SLC8A2</italic>, and with somatic structural mutations, such as <italic>CCND3, FGFR2</italic> and <italic>FGFR3</italic>, were differentially expressed, suggesting that they could be promising precision therapy targets.</p>
<p>Mucin genes are known to maintain the gastrointestinal epithelium, and persistent mucosal inflammation increase the risk of developing GC (<xref rid="b48-ol-0-0-11976" ref-type="bibr">48</xref>). In addition, the fibroblast growth factor receptor (FGFR) pathway plays a key role in GC pathogenesis, and detection of FGFR2 copy number in the plasma circulating tumor DNA are potential predictive biomarkers to <italic>FGFR</italic> inhibition (<xref rid="b49-ol-0-0-11976" ref-type="bibr">49</xref>).</p>
<p>With respect to heterogeneity of GC, it was hypothesized that mutations in different genes that are involved in the same pathway could result in similar clinical phenotypes. Therefore, different pathways were investigated that could be dysregulated in the patient with GC. The calcium signaling pathway, cGMP-PKG signaling pathway and transcriptional mis-regulation were enriched at both the genomic and transcriptomic levels, suggesting that these pathways might play crucial roles in young patients with GC. Notably, these pathways were not found from the analysis of TCGA STAD dataset (<xref rid="b42-ol-0-0-11976" ref-type="bibr">42</xref>), suggesting that the pathogenesis of GC in young individuals, particularly in the patient in the present study may be different. Nevertheless, the genomic features of the young patient with GC was compared with TCGA STAD data and 27 differential expression genes were identified, particularly germline mutations in <italic>SFRP4</italic> (P320T and R340K), which are strongly associated with human cancers (<xref rid="b45-ol-0-0-11976" ref-type="bibr">45</xref>). <italic>SFRP4</italic> is a member of the SFRP family (<xref rid="b50-ol-0-0-11976" ref-type="bibr">50</xref>). SFRP4s contain a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins (<xref rid="b50-ol-0-0-11976" ref-type="bibr">50</xref>) and act as soluble modulators of Wnt signaling, which is a well-known pathway that is involved in tumorigenesis of GC (<xref rid="b51-ol-0-0-11976" ref-type="bibr">51</xref>). This indicates that such germline mutations could be informative in designing patient specific therapies. Several germline mutations were found in the 27 genes; however, no somatic alterations or structural events were identified, suggesting that genetic factors may be important in the development of early GC.</p>
<p>Notably, the youngest patient in TCGA STAD dataset was 30-years-old and &#x003E;99&#x0025; (439/443) of the patients were &#x003E;40 years old, thus, TCGA STAD data could have originated from elderly patients with GC. This supports the requirement to investigate the genome in younger patients with GC to identify possible therapeutic targets. The patient in the present study was diagnosed at age 26, which was much younger compared with patients in TCGA STAD dataset. TCGA-STAD RNA-seq data was reanalyzed and 331 dysregulated genes were found (138 were up- and 193 were downregulated), however, only 27 of these were found in the patient in the present study, suggesting that there was a considerable difference between GC in young and elderly patients. There may be indications that the pathogenesis of the patient was substantially different from typical patients with GC, as cancer-related pathways, which were significantly enriched at both genomic and transcriptomic levels in the patient were also not found in TCGA STAD dataset.</p>
<p>In conclusion, genes with germline (<italic>SFRP4</italic>), and somatic mutations (<italic>MUC2, MUC4, SLC8A2</italic>), and those with structural variations (<italic>CCND3, FGFR2</italic> and <italic>FGFR3</italic>), which were differentially expressed in the patient could be promising precision therapy targets.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="SD1-ol-0-0-11976" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="pdf" xlink:href="Supplementary_Data.pdf"/>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding</title>
<p>This study was supported in part by the Hefei National Laboratory for Physical Sciences at Microscale (grant no. HFNLM20180012).</p>
</sec>
<sec>
<title>Availability of data and materials</title>
<p>We have deposited the data to National Genomics Data Center with the project number PRJCA002949. The datasets are not publicly accessible due to local data protection laws. However, access to the data can be granted upon a reasonable request to the corresponding author.</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>QW and KH conceived the study and designed the experiments. LL and ZH provided the tissue samples and supplied the clinical and pathological information of the patient. QR, OEA and SW performed sample preparation, DNA isolation for sequencing and library construction. KH and WY performed the bioinformatics analysis. OEA, WY, KH and QW drafted and revised the manuscript and supplementary information. All authors approved the final version of the manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>This study was approved by the Ethics Committee of the First Affiliated Hospital of Anhui Medical University. The patient provided written informed consent for the use of the resected tissue, and the sample was obtained under Institutional Review Board approval.</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>GC</term><def><p>gastric cancer</p></def></def-item>
<def-item><term>CGC</term><def><p>Cancer Gene Census</p></def></def-item>
<def-item><term>CPG</term><def><p>Cancer Predisposition Genes</p></def></def-item>
<def-item><term>Indels</term><def><p>insertions and deletions</p></def></def-item>
<def-item><term>SNP</term><def><p>single nucleotide polymorphism</p></def></def-item>
<def-item><term>SNV</term><def><p>single nucleotide variation</p></def></def-item>
<def-item><term>GO</term><def><p>Gene Ontology</p></def></def-item>
<def-item><term>KEGG</term><def><p>Kyoto Encyclopedia of Genes and Genomes</p></def></def-item>
<def-item><term>TCGA STAD</term><def><p>The Cancer Genome Atlas stomach adenocarcinoma</p></def></def-item>
</def-list>
</glossary>
<ref-list>
<title>References</title>
<ref id="b1-ol-0-0-11976"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>De Vita</surname><given-names>F</given-names></name><name><surname>Di Martino</surname><given-names>N</given-names></name><name><surname>Fabozzi</surname><given-names>A</given-names></name><name><surname>Laterza</surname><given-names>MM</given-names></name><name><surname>Ventriglia</surname><given-names>J</given-names></name><name><surname>Savastano</surname><given-names>B</given-names></name><name><surname>Petrillo</surname><given-names>A</given-names></name><name><surname>Gambardella</surname><given-names>V</given-names></name><name><surname>Sforza</surname><given-names>V</given-names></name><name><surname>Marano</surname><given-names>L</given-names></name><etal/></person-group><article-title>Clinical management of advanced gastric cancer: The role of new molecular drugs</article-title><source>World J Gastroentero</source><volume>20</volume><fpage>14537</fpage><lpage>14558</lpage><year>2014</year><pub-id pub-id-type="doi">10.3748/wjg.v20.i40.14537</pub-id></element-citation></ref>
<ref id="b2-ol-0-0-11976"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mori</surname><given-names>M</given-names></name><name><surname>Sugimachi</surname><given-names>K</given-names></name><name><surname>Ohiwa</surname><given-names>T</given-names></name><name><surname>Okamura</surname><given-names>T</given-names></name><name><surname>Tamura</surname><given-names>S</given-names></name><name><surname>Inokuchi</surname><given-names>K</given-names></name></person-group><article-title>Early gastric carcinoma in Japanese patients under 30 years of age</article-title><source>Br J Surg</source><volume>72</volume><fpage>289</fpage><lpage>291</lpage><year>1985</year><pub-id pub-id-type="doi">10.1002/bjs.1800720415</pub-id><pub-id pub-id-type="pmid">3986480</pub-id></element-citation></ref>
<ref id="b3-ol-0-0-11976"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lim</surname><given-names>S</given-names></name><name><surname>Lee</surname><given-names>HS</given-names></name><name><surname>Kim</surname><given-names>HS</given-names></name><name><surname>Kim</surname><given-names>YI</given-names></name><name><surname>Kim</surname><given-names>WH</given-names></name></person-group><article-title>Alteration of E-cadherin-mediated adhesion protein is common, but microsatellite instability is uncommon in young age gastric cancers</article-title><source>Histopathology</source><volume>42</volume><fpage>128</fpage><lpage>136</lpage><year>2003</year><pub-id pub-id-type="doi">10.1046/j.1365-2559.2003.01546.x</pub-id><pub-id pub-id-type="pmid">12558744</pub-id></element-citation></ref>
<ref id="b4-ol-0-0-11976"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname><given-names>BR</given-names></name><name><surname>Stabile</surname><given-names>BE</given-names></name></person-group><article-title>Extreme aggressiveness and lethality of gastric adenocarcinoma in the very young</article-title><source>Arch Surg</source><volume>144</volume><fpage>506</fpage><lpage>510</lpage><year>2009</year><pub-id pub-id-type="doi">10.1001/archsurg.2009.77</pub-id><pub-id pub-id-type="pmid">19528381</pub-id></element-citation></ref>
<ref id="b5-ol-0-0-11976"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Quijano Orvananos</surname><given-names>F</given-names></name><name><surname>Moreno Paquentin</surname><given-names>E</given-names></name><name><surname>Alvarez</surname><given-names>JJ</given-names></name><name><surname>Martinez Munive</surname><given-names>A</given-names></name><name><surname>Butron Perez</surname><given-names>L</given-names></name></person-group><article-title>Gastric carcinoma in patients under 35 years</article-title><source>Rev Gastroenterol Mex</source><volume>64</volume><fpage>75</fpage><lpage>77</lpage><year>1999</year><comment>(In Spanish)</comment><pub-id pub-id-type="pmid">10532131</pub-id></element-citation></ref>
<ref id="b6-ol-0-0-11976"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Theuer</surname><given-names>CP</given-names></name><name><surname>Kurosaki</surname><given-names>T</given-names></name><name><surname>Taylor</surname><given-names>TH</given-names></name><name><surname>Anton-Culver</surname><given-names>H</given-names></name></person-group><article-title>Unique features of gastric carcinoma in the young: A population-based analysis</article-title><source>Cancer</source><volume>83</volume><fpage>25</fpage><lpage>33</lpage><year>1998</year><pub-id pub-id-type="doi">10.1002/(SICI)1097-0142(19980701)83:1&lt;25::AID-CNCR4&gt;3.0.CO;2-D</pub-id><pub-id pub-id-type="pmid">9655289</pub-id></element-citation></ref>
<ref id="b7-ol-0-0-11976"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wierinckx</surname><given-names>A</given-names></name><name><surname>Roche</surname><given-names>M</given-names></name><name><surname>Raverot</surname><given-names>G</given-names></name><name><surname>Legras-Lachuer</surname><given-names>C</given-names></name><name><surname>Croze</surname><given-names>S</given-names></name><name><surname>Nazaret</surname><given-names>N</given-names></name><name><surname>Rey</surname><given-names>C</given-names></name><name><surname>Auger</surname><given-names>C</given-names></name><name><surname>Jouanneau</surname><given-names>E</given-names></name><name><surname>Chanson</surname><given-names>P</given-names></name><etal/></person-group><article-title>Integrated genomic profiling identifies loss of chromosome 11p impacting transcriptomic activity in aggressive pituitary PRL Tumors</article-title><source>Brain Pathol</source><volume>21</volume><fpage>533</fpage><lpage>543</lpage><year>2011</year><pub-id pub-id-type="pmid">21251114</pub-id></element-citation></ref>
<ref id="b8-ol-0-0-11976"><label>8</label><element-citation publication-type="journal"><collab collab-type="corp-author">Cancer Genome Atlas Research Network</collab><person-group person-group-type="author"><name><surname>Bass</surname><given-names>AJ</given-names></name><name><surname>Thorsson</surname><given-names>V</given-names></name><name><surname>Shmulevich</surname><given-names>I</given-names></name><name><surname>Reynolds</surname><given-names>SM</given-names></name><name><surname>Miller</surname><given-names>M</given-names></name><name><surname>Bernard</surname><given-names>B</given-names></name><name><surname>Hinoue</surname><given-names>T</given-names></name><name><surname>Laird</surname><given-names>PW</given-names></name><name><surname>Curtis</surname><given-names>C</given-names></name><etal/></person-group><article-title>Comprehensive molecular characterization of gastric adenocarcinoma</article-title><source>Nature</source><volume>513</volume><fpage>202</fpage><lpage>209</lpage><year>2014</year><pub-id pub-id-type="doi">10.1038/nature13480</pub-id><pub-id pub-id-type="pmid">25079317</pub-id></element-citation></ref>
<ref id="b9-ol-0-0-11976"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chmielecki</surname><given-names>J</given-names></name><name><surname>Meyerson</surname><given-names>M</given-names></name></person-group><article-title>DNA sequencing of cancer: What Have We Learned?</article-title><source>Annu Rev Med</source><volume>65</volume><fpage>63</fpage><lpage>79</lpage><year>2014</year><pub-id pub-id-type="doi">10.1146/annurev-med-060712-200152</pub-id><pub-id pub-id-type="pmid">24274178</pub-id></element-citation></ref>
<ref id="b10-ol-0-0-11976"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Garraway</surname><given-names>LA</given-names></name><name><surname>Lander</surname><given-names>ES</given-names></name></person-group><article-title>Lessons from the Cancer Genome</article-title><source>Cell</source><volume>153</volume><fpage>17</fpage><lpage>37</lpage><year>2013</year><pub-id pub-id-type="doi">10.1016/j.cell.2013.03.002</pub-id><pub-id pub-id-type="pmid">23540688</pub-id></element-citation></ref>
<ref id="b11-ol-0-0-11976"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vogelstein</surname><given-names>B</given-names></name><name><surname>Papadopoulos</surname><given-names>N</given-names></name><name><surname>Velculescu</surname><given-names>VE</given-names></name><name><surname>Zhou</surname><given-names>SB</given-names></name><name><surname>Diaz</surname><given-names>LA</given-names></name><name><surname>Kinzler</surname><given-names>KW</given-names></name></person-group><article-title>Cancer genome landscapes</article-title><source>Science</source><volume>339</volume><fpage>1546</fpage><lpage>1558</lpage><year>2013</year><pub-id pub-id-type="doi">10.1126/science.1235122</pub-id><pub-id pub-id-type="pmid">23539594</pub-id></element-citation></ref>
<ref id="b12-ol-0-0-11976"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Garraway</surname><given-names>LA</given-names></name></person-group><article-title>Genomics-driven oncology: Framework for an emerging paradigm</article-title><source>J Clin Oncol</source><volume>31</volume><fpage>1806</fpage><lpage>1814</lpage><year>2013</year><pub-id pub-id-type="doi">10.1200/JCO.2012.46.8934</pub-id><pub-id pub-id-type="pmid">23589557</pub-id></element-citation></ref>
<ref id="b13-ol-0-0-11976"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>R</given-names></name></person-group><article-title>Abstract 4495: Development and clinical application of an integrative genomic approach to personalized cancer therapy</article-title><source>Cancer Res</source><fpage>76</fpage><year>2016</year><comment>doi: 10.1158/1538-7445.AM2016-4495</comment></element-citation></ref>
<ref id="b14-ol-0-0-11976"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Edge</surname><given-names>SB</given-names></name><name><surname>Compton</surname><given-names>CC</given-names></name></person-group><article-title>The American Joint Committee on Cancer: The 7th edition of the AJCC cancer staging manual and the future of TNM</article-title><source>Ann Surg Oncol</source><volume>17</volume><fpage>1471</fpage><lpage>1474</lpage><year>2010</year><pub-id pub-id-type="doi">10.1245/s10434-010-0985-4</pub-id><pub-id pub-id-type="pmid">20180029</pub-id></element-citation></ref>
<ref id="b15-ol-0-0-11976"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Krueger</surname><given-names>F</given-names></name></person-group><article-title>Trim Galore: A wrapper tool around Cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files, with some extra functionality for MspI-digested RRBS-type (Reduced Representation Bisufite-Seq) libraries</article-title><year>2012</year></element-citation></ref>
<ref id="b16-ol-0-0-11976"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dobin</surname><given-names>A</given-names></name><name><surname>Davis</surname><given-names>CA</given-names></name><name><surname>Schlesinger</surname><given-names>F</given-names></name><name><surname>Drenkow</surname><given-names>J</given-names></name><name><surname>Zaleski</surname><given-names>C</given-names></name><name><surname>Jha</surname><given-names>S</given-names></name><name><surname>Batut</surname><given-names>P</given-names></name><name><surname>Chaisson</surname><given-names>M</given-names></name><name><surname>Gingeras</surname><given-names>TR</given-names></name></person-group><article-title>STAR: Ultrafast universal RNA-seq aligner</article-title><source>Bioinformatics</source><volume>29</volume><fpage>15</fpage><lpage>21</lpage><year>2013</year><pub-id pub-id-type="doi">10.1093/bioinformatics/bts635</pub-id><pub-id pub-id-type="pmid">23104886</pub-id></element-citation></ref>
<ref id="b17-ol-0-0-11976"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Anders</surname><given-names>S</given-names></name><name><surname>Pyl</surname><given-names>PT</given-names></name><name><surname>Huber</surname><given-names>W</given-names></name></person-group><article-title>HTSeq-a Python framework to work with high-throughput sequencing data</article-title><source>Bioinformatics</source><volume>31</volume><fpage>166</fpage><lpage>169</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btu638</pub-id><pub-id pub-id-type="pmid">25260700</pub-id></element-citation></ref>
<ref id="b18-ol-0-0-11976"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname><given-names>MD</given-names></name><name><surname>McCarthy</surname><given-names>DJ</given-names></name><name><surname>Smyth</surname><given-names>GK</given-names></name></person-group><article-title>edgeR: A Bioconductor package for differential expression analysis of digital gene expression data</article-title><source>Bioinformatics</source><volume>26</volume><fpage>139</fpage><lpage>140</lpage><year>2010</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btp616</pub-id><pub-id pub-id-type="pmid">19910308</pub-id></element-citation></ref>
<ref id="b19-ol-0-0-11976"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>H</given-names></name></person-group><article-title>Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM</article-title><comment>(Submitted on 16 Mar 2013 (v1), last revised 26 May, 2013 (this version, v2))</comment></element-citation></ref>
<ref id="b20-ol-0-0-11976"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>DePristo</surname><given-names>MA</given-names></name><name><surname>Banks</surname><given-names>E</given-names></name><name><surname>Poplin</surname><given-names>R</given-names></name><name><surname>Garimella</surname><given-names>KV</given-names></name><name><surname>Maguire</surname><given-names>JR</given-names></name><name><surname>Hartl</surname><given-names>C</given-names></name><name><surname>Philippakis</surname><given-names>AA</given-names></name><name><surname>del Angel</surname><given-names>G</given-names></name><name><surname>Rivas</surname><given-names>MA</given-names></name><etal/></person-group><article-title>A framework for variation discovery and genotyping using next-generation DNA sequencing data</article-title><source>Nat Genet</source><volume>43</volume><fpage>491</fpage><lpage>498</lpage><year>2011</year><pub-id pub-id-type="doi">10.1038/ng.806</pub-id><pub-id pub-id-type="pmid">21478889</pub-id></element-citation></ref>
<ref id="b21-ol-0-0-11976"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Poplin</surname><given-names>R</given-names></name><name><surname>Ruano-Rubio</surname><given-names>V</given-names></name><name><surname>DePristo</surname><given-names>MA</given-names></name><name><surname>Fennell</surname><given-names>TJ</given-names></name><name><surname>Carneiro</surname><given-names>MO</given-names></name><name><surname>Van der Auwera</surname><given-names>GA</given-names></name><name><surname>Kling</surname><given-names>DE</given-names></name><name><surname>Gauthier</surname><given-names>LD</given-names></name><name><surname>Levy-Moonshine</surname><given-names>A</given-names></name><name><surname>Roazen</surname><given-names>D</given-names></name><etal/></person-group><article-title>Scaling accurate genetic variant discovery to tens of thousands of samples</article-title><source>BioRxiv</source><month>July</month><day>24</day><year>2017</year><comment>doi: https://doi.org/10.1101/201178</comment></element-citation></ref>
<ref id="b22-ol-0-0-11976"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cibulskis</surname><given-names>K</given-names></name><name><surname>Lawrence</surname><given-names>MS</given-names></name><name><surname>Carter</surname><given-names>SL</given-names></name><name><surname>Sivachenko</surname><given-names>A</given-names></name><name><surname>Jaffe</surname><given-names>D</given-names></name><name><surname>Sougnez</surname><given-names>C</given-names></name><name><surname>Gabriel</surname><given-names>S</given-names></name><name><surname>Meyerson</surname><given-names>M</given-names></name><name><surname>Lander</surname><given-names>ES</given-names></name><name><surname>Getz</surname><given-names>G</given-names></name></person-group><article-title>Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples</article-title><source>Nat Biotechnol</source><volume>31</volume><fpage>213</fpage><lpage>219</lpage><year>2013</year><pub-id pub-id-type="doi">10.1038/nbt.2514</pub-id><pub-id pub-id-type="pmid">23396013</pub-id></element-citation></ref>
<ref id="b23-ol-0-0-11976"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Colaprico</surname><given-names>A</given-names></name><name><surname>Silva</surname><given-names>TC</given-names></name><name><surname>Olsen</surname><given-names>C</given-names></name><name><surname>Garofano</surname><given-names>L</given-names></name><name><surname>Cava</surname><given-names>C</given-names></name><name><surname>Garolini</surname><given-names>D</given-names></name><name><surname>Sabedot</surname><given-names>TS</given-names></name><name><surname>Malta</surname><given-names>TM</given-names></name><name><surname>Pagnotta</surname><given-names>SM</given-names></name><name><surname>Castiglioni</surname><given-names>I</given-names></name><etal/></person-group><article-title>TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data</article-title><source>Nucleic Acids Res</source><volume>44</volume><fpage>e71</fpage><year>2016</year><pub-id pub-id-type="doi">10.1093/nar/gkv1507</pub-id><pub-id pub-id-type="pmid">26704973</pub-id></element-citation></ref>
<ref id="b24-ol-0-0-11976"><label>24</label><element-citation publication-type="journal"><collab collab-type="corp-author">1000 Genomes Project Consortium</collab><person-group person-group-type="author"><name><surname>Auton</surname><given-names>A</given-names></name><name><surname>Brooks</surname><given-names>LD</given-names></name><name><surname>Durbin</surname><given-names>RM</given-names></name><name><surname>Garrison</surname><given-names>EP</given-names></name><name><surname>Kang</surname><given-names>HM</given-names></name><name><surname>Korbel</surname><given-names>JO</given-names></name><name><surname>Marchini</surname><given-names>JL</given-names></name><name><surname>McCarthy</surname><given-names>S</given-names></name><name><surname>McVean</surname><given-names>GA</given-names></name><name><surname>Abecasis</surname><given-names>GR</given-names></name></person-group><article-title>A global reference for human genetic variation</article-title><source>Nature</source><volume>526</volume><fpage>68</fpage><lpage>74</lpage><year>2015</year><pub-id pub-id-type="doi">10.1038/nature15393</pub-id><pub-id pub-id-type="pmid">26432245</pub-id></element-citation></ref>
<ref id="b25-ol-0-0-11976"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Karczewski</surname><given-names>KJ</given-names></name><name><surname>Weisburd</surname><given-names>B</given-names></name><name><surname>Thomas</surname><given-names>B</given-names></name><name><surname>Solomonson</surname><given-names>M</given-names></name><name><surname>Ruderfer</surname><given-names>DM</given-names></name><name><surname>Kavanagh</surname><given-names>D</given-names></name><name><surname>Hamamsy</surname><given-names>T</given-names></name><name><surname>Lek</surname><given-names>M</given-names></name><name><surname>Samocha</surname><given-names>KE</given-names></name><name><surname>Cummings</surname><given-names>BB</given-names></name><etal/></person-group><article-title>The ExAC browser: Displaying reference data information from over 60 000 exomes</article-title><source>Nucleic Acids Res</source><volume>45</volume><fpage>D840</fpage><lpage>D845</lpage><year>2016</year><pub-id pub-id-type="doi">10.1093/nar/gkw971</pub-id><pub-id pub-id-type="pmid">27899611</pub-id></element-citation></ref>
<ref id="b26-ol-0-0-11976"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Karczewski</surname><given-names>KJ</given-names></name><name><surname>Karczewski</surname><given-names>LF</given-names></name></person-group><article-title>The genome aggregation database (gnomAD)</article-title><year>2017</year></element-citation></ref>
<ref id="b27-ol-0-0-11976"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>X</given-names></name><name><surname>Wu</surname><given-names>C</given-names></name><name><surname>Li</surname><given-names>C</given-names></name><name><surname>Boerwinkle</surname><given-names>E</given-names></name></person-group><article-title>dbNSFP v3.0: A One-stop database of functional predictions and annotations for human nonsynonymous and splice-site SNVs</article-title><source>Hum Mutat</source><volume>37</volume><fpage>235</fpage><lpage>241</lpage><year>2016</year><pub-id pub-id-type="doi">10.1002/humu.22932</pub-id><pub-id pub-id-type="pmid">26555599</pub-id></element-citation></ref>
<ref id="b28-ol-0-0-11976"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sherry</surname><given-names>ST</given-names></name><name><surname>Ward</surname><given-names>MH</given-names></name><name><surname>Kholodov</surname><given-names>M</given-names></name><name><surname>Baker</surname><given-names>J</given-names></name><name><surname>Phan</surname><given-names>L</given-names></name><name><surname>Smigielski</surname><given-names>EM</given-names></name><name><surname>Sirotkin</surname><given-names>K</given-names></name></person-group><article-title>dbSNP: The NCBI database of genetic variation</article-title><source>Nucleic Acids Res</source><volume>29</volume><fpage>308</fpage><lpage>311</lpage><year>2001</year><pub-id pub-id-type="doi">10.1093/nar/29.1.308</pub-id><pub-id pub-id-type="pmid">11125122</pub-id></element-citation></ref>
<ref id="b29-ol-0-0-11976"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tate</surname><given-names>JG</given-names></name><name><surname>Bamford</surname><given-names>S</given-names></name><name><surname>Jubb</surname><given-names>HC</given-names></name><name><surname>Sondka</surname><given-names>Z</given-names></name><name><surname>Beare</surname><given-names>DM</given-names></name><name><surname>Bindal</surname><given-names>N</given-names></name><name><surname>Boutselakis</surname><given-names>H</given-names></name><name><surname>Cole</surname><given-names>CG</given-names></name><name><surname>Creatore</surname><given-names>C</given-names></name><name><surname>Dawson</surname><given-names>E</given-names></name><etal/></person-group><article-title>COSMIC: The catalogue of somatic mutations in cancer</article-title><source>Nucleic Acids Res</source><volume>47</volume><fpage>D941</fpage><lpage>D947</lpage><year>2019</year><pub-id pub-id-type="doi">10.1093/nar/gky1015</pub-id><pub-id pub-id-type="pmid">30371878</pub-id></element-citation></ref>
<ref id="b30-ol-0-0-11976"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kimura</surname><given-names>M</given-names></name></person-group><article-title>A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide-sequences</article-title><source>J Mol Evol</source><volume>16</volume><fpage>111</fpage><lpage>120</lpage><year>1980</year><pub-id pub-id-type="doi">10.1007/BF01731581</pub-id><pub-id pub-id-type="pmid">7463489</pub-id></element-citation></ref>
<ref id="b31-ol-0-0-11976"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mwenifumbo</surname><given-names>JC</given-names></name><name><surname>Marra</surname><given-names>MA</given-names></name></person-group><article-title>Cancer genome-sequencing study design</article-title><source>Nat Rev Genet</source><volume>14</volume><fpage>321</fpage><lpage>332</lpage><year>2013</year><pub-id pub-id-type="doi">10.1038/nrg3445</pub-id><pub-id pub-id-type="pmid">23594910</pub-id></element-citation></ref>
<ref id="b32-ol-0-0-11976"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shen</surname><given-names>R</given-names></name><name><surname>Seshan</surname><given-names>VE</given-names></name></person-group><article-title>FACETS: Allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing</article-title><source>Nucleic Acids Res</source><volume>44</volume><fpage>e131</fpage><year>2016</year><pub-id pub-id-type="doi">10.1093/nar/gkw520</pub-id><pub-id pub-id-type="pmid">27270079</pub-id></element-citation></ref>
<ref id="b33-ol-0-0-11976"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Team</surname><given-names>RC</given-names></name></person-group><article-title>R: A language and environment for statistical computing</article-title><year>2013</year></element-citation></ref>
<ref id="b34-ol-0-0-11976"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rausch</surname><given-names>T</given-names></name><name><surname>Zichner</surname><given-names>T</given-names></name><name><surname>Schlattl</surname><given-names>A</given-names></name><name><surname>Stutz</surname><given-names>AM</given-names></name><name><surname>Benes</surname><given-names>V</given-names></name><name><surname>Korbel</surname><given-names>JO</given-names></name></person-group><article-title>DELLY: Structural variant discovery by integrated paired-end and split-read analysis</article-title><source>Bioinformatics</source><volume>28</volume><fpage>i333</fpage><lpage>i339</lpage><year>2012</year><pub-id pub-id-type="doi">10.1093/bioinformatics/bts378</pub-id><pub-id pub-id-type="pmid">22962449</pub-id></element-citation></ref>
<ref id="b35-ol-0-0-11976"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dong</surname><given-names>C</given-names></name><name><surname>Wei</surname><given-names>P</given-names></name><name><surname>Jian</surname><given-names>X</given-names></name><name><surname>Gibbs</surname><given-names>R</given-names></name><name><surname>Boerwinkle</surname><given-names>E</given-names></name><name><surname>Wang</surname><given-names>K</given-names></name><name><surname>Liu</surname><given-names>X</given-names></name></person-group><article-title>Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies</article-title><source>Hum Mol Genet</source><volume>24</volume><fpage>2125</fpage><lpage>2137</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/hmg/ddu733</pub-id><pub-id pub-id-type="pmid">25552646</pub-id></element-citation></ref>
<ref id="b36-ol-0-0-11976"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Futreal</surname><given-names>PA</given-names></name><name><surname>Coin</surname><given-names>L</given-names></name><name><surname>Marshall</surname><given-names>M</given-names></name><name><surname>Down</surname><given-names>T</given-names></name><name><surname>Hubbard</surname><given-names>T</given-names></name><name><surname>Wooster</surname><given-names>R</given-names></name><name><surname>Rahman</surname><given-names>N</given-names></name><name><surname>Stratton</surname><given-names>MR</given-names></name></person-group><article-title>A census of human cancer genes</article-title><source>Nat Rev Cancer</source><volume>4</volume><fpage>177</fpage><lpage>183</lpage><year>2004</year><pub-id pub-id-type="doi">10.1038/nrc1299</pub-id><pub-id pub-id-type="pmid">14993899</pub-id></element-citation></ref>
<ref id="b37-ol-0-0-11976"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rahman</surname><given-names>N</given-names></name></person-group><article-title>Realizing the promise of cancer predisposition genes</article-title><source>Nature</source><volume>505</volume><fpage>302</fpage><lpage>308</lpage><year>2014</year><pub-id pub-id-type="doi">10.1038/nature12981</pub-id><pub-id pub-id-type="pmid">24429628</pub-id></element-citation></ref>
<ref id="b38-ol-0-0-11976"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname><given-names>GC</given-names></name><name><surname>Wang</surname><given-names>LG</given-names></name><name><surname>Han</surname><given-names>YY</given-names></name><name><surname>He</surname><given-names>QY</given-names></name></person-group><article-title>clusterProfiler: An R package for comparing biological themes among gene clusters</article-title><source>OMICS</source><volume>16</volume><fpage>284</fpage><lpage>287</lpage><year>2012</year><pub-id pub-id-type="doi">10.1089/omi.2011.0118</pub-id><pub-id pub-id-type="pmid">22455463</pub-id></element-citation></ref>
<ref id="b39-ol-0-0-11976"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Subramanian</surname><given-names>A</given-names></name><name><surname>Tamayo</surname><given-names>P</given-names></name><name><surname>Mootha</surname><given-names>VK</given-names></name><name><surname>Mukherjee</surname><given-names>S</given-names></name><name><surname>Ebert</surname><given-names>BL</given-names></name><name><surname>Gillette</surname><given-names>MA</given-names></name><name><surname>Paulovich</surname><given-names>A</given-names></name><name><surname>Pomeroy</surname><given-names>SL</given-names></name><name><surname>Golub</surname><given-names>TR</given-names></name><name><surname>Lander</surname><given-names>ES</given-names></name><name><surname>Mesirov</surname><given-names>JP</given-names></name></person-group><article-title>Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles</article-title><source>Proc Natl Acad Sci USA</source><volume>102</volume><fpage>15545</fpage><lpage>15550</lpage><year>2005</year><pub-id pub-id-type="doi">10.1073/pnas.0506580102</pub-id><pub-id pub-id-type="pmid">16199517</pub-id></element-citation></ref>
<ref id="b40-ol-0-0-11976"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ritchie</surname><given-names>ME</given-names></name><name><surname>Phipson</surname><given-names>B</given-names></name><name><surname>Wu</surname><given-names>D</given-names></name><name><surname>Hu</surname><given-names>Y</given-names></name><name><surname>Law</surname><given-names>CW</given-names></name><name><surname>Shi</surname><given-names>W</given-names></name><name><surname>Smyth</surname><given-names>GK</given-names></name></person-group><article-title>limma powers differential expression analyses for RNA-sequencing and microarray studies</article-title><source>Nucleic Acids Res</source><volume>43</volume><fpage>e47</fpage><year>2015</year><pub-id pub-id-type="doi">10.1093/nar/gkv007</pub-id><pub-id pub-id-type="pmid">25605792</pub-id></element-citation></ref>
<ref id="b41-ol-0-0-11976"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Love</surname><given-names>MI</given-names></name><name><surname>Huber</surname><given-names>W</given-names></name><name><surname>Anders</surname><given-names>S</given-names></name></person-group><article-title>Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2</article-title><source>Genome Biol</source><volume>15</volume><fpage>550</fpage><year>2014</year><pub-id pub-id-type="doi">10.1186/s13059-014-0550-8</pub-id><pub-id pub-id-type="pmid">25516281</pub-id></element-citation></ref>
<ref id="b42-ol-0-0-11976"><label>42</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bailey</surname><given-names>MH</given-names></name><name><surname>Tokheim</surname><given-names>C</given-names></name><name><surname>Porta-Pardo</surname><given-names>E</given-names></name><name><surname>Sengupta</surname><given-names>S</given-names></name><name><surname>Bertrand</surname><given-names>D</given-names></name><name><surname>Weerasinghe</surname><given-names>A</given-names></name><name><surname>Colaprico</surname><given-names>A</given-names></name><name><surname>Wendl</surname><given-names>MC</given-names></name><name><surname>Kim</surname><given-names>J</given-names></name><name><surname>Reardon</surname><given-names>B</given-names></name><etal/></person-group><article-title>Comprehensive characterization of cancer driver genes and mutations</article-title><source>Cell</source><volume>174</volume><fpage>1034</fpage><lpage>1035</lpage><year>2018</year><pub-id pub-id-type="doi">10.1016/j.cell.2018.07.034</pub-id><pub-id pub-id-type="pmid">30096302</pub-id></element-citation></ref>
<ref id="b43-ol-0-0-11976"><label>43</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sz&#x00E1;sz</surname><given-names>AM</given-names></name><name><surname>L&#x00E1;nczky</surname><given-names>A</given-names></name><name><surname>Nagy</surname><given-names>&#x00C1;</given-names></name><name><surname>F&#x00F6;rster</surname><given-names>S</given-names></name><name><surname>Hark</surname><given-names>K</given-names></name><name><surname>Green</surname><given-names>JE</given-names></name><name><surname>Boussioutas</surname><given-names>A</given-names></name><name><surname>Busuttil</surname><given-names>R</given-names></name><name><surname>Szab&#x00F3;</surname><given-names>A</given-names></name><name><surname>Gy&#x0151;rffy</surname><given-names>B</given-names></name></person-group><article-title>Cross-validation of survival associated biomarkers in gastric cancer using transcriptomic data of 1,065 patients</article-title><source>Oncotarget</source><volume>7</volume><fpage>49322</fpage><lpage>49333</lpage><year>2016</year><pub-id pub-id-type="doi">10.18632/oncotarget.10337</pub-id><pub-id pub-id-type="pmid">27384994</pub-id></element-citation></ref>
<ref id="b44-ol-0-0-11976"><label>44</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Taiaroa</surname><given-names>G</given-names></name><name><surname>Rawlinson</surname><given-names>D</given-names></name><name><surname>Featherstone</surname><given-names>L</given-names></name><etal/></person-group><article-title>Direct RNA sequencing and early evolution of SARS-CoV-2</article-title><source>bioRxiv</source><year>2020</year><comment>(Epub ahead of print)</comment></element-citation></ref>
<ref id="b45-ol-0-0-11976"><label>45</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>He</surname><given-names>ML</given-names></name><name><surname>Chen</surname><given-names>Y</given-names></name><name><surname>Chen</surname><given-names>Q</given-names></name><name><surname>He</surname><given-names>Y</given-names></name><name><surname>Zhao</surname><given-names>J</given-names></name><name><surname>Wang</surname><given-names>J</given-names></name><name><surname>Yang</surname><given-names>H</given-names></name><name><surname>Kung</surname><given-names>HF</given-names></name></person-group><article-title>Multiple gene dysfunctions lead to high cancer-susceptibility: Evidences from a whole-exome sequencing study</article-title><source>Am J Cancer Res</source><volume>1</volume><fpage>562</fpage><lpage>573</lpage><year>2011</year><pub-id pub-id-type="pmid">21984973</pub-id></element-citation></ref>
<ref id="b46-ol-0-0-11976"><label>46</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zucchelli</surname><given-names>M</given-names></name><name><surname>Torkvist</surname><given-names>L</given-names></name><name><surname>Bresso</surname><given-names>F</given-names></name><name><surname>Halfvarson</surname><given-names>J</given-names></name><name><surname>Hellquist</surname><given-names>A</given-names></name><name><surname>Anedda</surname><given-names>F</given-names></name><name><surname>Assadi</surname><given-names>G</given-names></name><name><surname>Lindgren</surname><given-names>GB</given-names></name><name><surname>Svanfeldt</surname><given-names>M</given-names></name><name><surname>Janson</surname><given-names>M</given-names></name><etal/></person-group><article-title>PepT1 oligopeptide transporter (SLC15A1) gene polymorphism in inflammatory bowel disease</article-title><source>Inflamm Bowel Dis</source><volume>15</volume><fpage>1562</fpage><lpage>1569</lpage><year>2009</year><pub-id pub-id-type="doi">10.1002/ibd.20963</pub-id><pub-id pub-id-type="pmid">19462432</pub-id></element-citation></ref>
<ref id="b47-ol-0-0-11976"><label>47</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hodgson</surname><given-names>A</given-names></name><name><surname>Parra-Herran</surname><given-names>C</given-names></name><name><surname>Mirkovic</surname><given-names>J</given-names></name></person-group><article-title>Immunohistochemical expression of HIK1083 and MUC6 in endometrial carcinomas</article-title><source>Histopathology</source><volume>75</volume><fpage>552</fpage><lpage>558</lpage><year>2019</year><pub-id pub-id-type="doi">10.1111/his.13887</pub-id><pub-id pub-id-type="pmid">31021421</pub-id></element-citation></ref>
<ref id="b48-ol-0-0-11976"><label>48</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>M</given-names></name><name><surname>Huang</surname><given-names>L</given-names></name><name><surname>Qiu</surname><given-names>H</given-names></name><name><surname>Fu</surname><given-names>Q</given-names></name><name><surname>Li</surname><given-names>W</given-names></name><name><surname>Yu</surname><given-names>Q</given-names></name><name><surname>Sun</surname><given-names>L</given-names></name><name><surname>Zhang</surname><given-names>L</given-names></name><name><surname>Hu</surname><given-names>G</given-names></name><name><surname>Hu</surname><given-names>J</given-names></name><name><surname>Yuan</surname><given-names>X</given-names></name></person-group><article-title><italic>Helicobacter pylori</italic> infection synergizes with three inflammation-related genetic variants in the GWASs to increase risk of gastric cancer in a Chinese population</article-title><source>PLoS One</source><volume>8</volume><fpage>e74976</fpage><year>2013</year><pub-id pub-id-type="doi">10.1371/journal.pone.0074976</pub-id><pub-id pub-id-type="pmid">24069371</pub-id></element-citation></ref>
<ref id="b49-ol-0-0-11976"><label>49</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hierro</surname><given-names>C</given-names></name><name><surname>Alsina</surname><given-names>M</given-names></name><name><surname>Sanchez</surname><given-names>M</given-names></name><name><surname>Serra</surname><given-names>V</given-names></name><name><surname>Rodon</surname><given-names>J</given-names></name><name><surname>Tabernero</surname><given-names>J</given-names></name></person-group><article-title>Targeting the fibroblast growth factor receptor 2 in gastric cancer: Promise or pitfall?</article-title><source>Ann Oncol</source><volume>28</volume><fpage>1207</fpage><lpage>1216</lpage><year>2017</year><pub-id pub-id-type="doi">10.1093/annonc/mdx081</pub-id><pub-id pub-id-type="pmid">28327938</pub-id></element-citation></ref>
<ref id="b50-ol-0-0-11976"><label>50</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cruciat</surname><given-names>CM</given-names></name><name><surname>Niehrs</surname><given-names>C</given-names></name></person-group><article-title>Secreted and transmembrane wnt inhibitors and activators</article-title><source>Cold Spring Harb Perspect Biol</source><volume>5</volume><fpage>a015081</fpage><year>2013</year><pub-id pub-id-type="doi">10.1101/cshperspect.a015081</pub-id><pub-id pub-id-type="pmid">23085770</pub-id></element-citation></ref>
<ref id="b51-ol-0-0-11976"><label>51</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Herr</surname><given-names>P</given-names></name><name><surname>Hausmann</surname><given-names>G</given-names></name><name><surname>Basler</surname><given-names>K</given-names></name></person-group><article-title>WNT secretion and signalling in human disease</article-title><source>Trends Mol Med</source><volume>18</volume><fpage>483</fpage><lpage>493</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.molmed.2012.06.008</pub-id><pub-id pub-id-type="pmid">22796206</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-ol-0-0-11976" position="float">
<label>Figure 1.</label>
<caption><p>Differentially expressed genes with structural alternations. Structural variation and copy number alternation analysis identified 21 genes, which were differentially expressed. The angular axis shows log<sub>2</sub> fold change between tumor and adjacent normal tissues.</p></caption>
<graphic xlink:href="ol-20-04-11976-g00.tif"/>
</fig>
<fig id="f2-ol-0-0-11976" position="float">
<label>Figure 2.</label>
<caption><p>Gene Set Enrichment Analysis revealed dysregulated pathways in RNA-seq. Upregulated genes are indicated by blue and the downregulated by red.</p></caption>
<graphic xlink:href="ol-20-04-11976-g01.tif"/>
</fig>
<fig id="f3-ol-0-0-11976" position="float">
<label>Figure 3.</label>
<caption><p>Genomic features in the young patient in the present study compared with that in patients from TCGA STAD dataset. (A) Heatmap of the 27 genes, which were dysregulated in both TCGA STAD and in the present study. Counts per million (CPM) indicates the expression level of each gene and log (CPM&#x002B;1), which ranged from &#x2212;2 (blue) to 6 (red), was used for sample clustering and visualization. (B). Gene Ontology enrichment analysis and KEGG pathway analysis identified several significant pathways. &#x002A;P&#x003C;0.05 and &#x002A;&#x002A;P&#x003C;0.01. DEG, differentially expressed gene; KEGG, Kyoto Encyclopedia of Genes and Genomes; MF, molecular functions; CC, cellular components; BP, biological processes; TCGA, The Cancer Genome Atlas; STAD, stomach adenocarcinoma; hydrogen:potas &#x2026; ATPase activity, hydrogen:potassium-exchanging ATPase activity.</p></caption>
<graphic xlink:href="ol-20-04-11976-g02.tif"/>
</fig>
<table-wrap id="tI-ol-0-0-11976" position="float">
<label>Table I.</label>
<caption><p>Functional assessment of the germline mutations and prediction of their effect.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Gene</th>
<th align="center" valign="bottom">Databse</th>
<th align="center" valign="bottom">Chromosome</th>
<th align="center" valign="bottom">Position</th>
<th align="center" valign="bottom">Reference allele</th>
<th align="center" valign="bottom">Alteration</th>
<th align="center" valign="bottom">Type</th>
<th align="center" valign="bottom">AA. Change</th>
<th align="center" valign="bottom">Cosmic82</th>
<th align="center" valign="bottom">Final prediction</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">AKAP9</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">91739445</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">T</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.T3899I</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">U</td>
</tr>
<tr>
<td align="left" valign="top">DOCK8</td>
<td align="center" valign="top">CPG</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">286491</td>
<td align="center" valign="top">G</td>
<td align="center" valign="top">A</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.D63N</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">D</td>
</tr>
<tr>
<td align="left" valign="top">HIF1A</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">62207747</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">A</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.T669N</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">U</td>
</tr>
<tr>
<td align="left" valign="top">HLF</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">53345172</td>
<td align="center" valign="top">T</td>
<td align="center" valign="top">C</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.V59A</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">D</td>
</tr>
<tr>
<td align="left" valign="top">KMT2D</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">49446404</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">T</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.V401M</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">P</td>
</tr>
<tr>
<td align="left" valign="top">MLLT3</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">20414344</td>
<td align="center" valign="top">CTG</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">Non-FS DEL</td>
<td align="left" valign="top">p.163_164del</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">U</td>
</tr>
<tr>
<td align="left" valign="top">MN1</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">28194933</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">TGCTGC</td>
<td align="left" valign="top">Non-FS</td>
<td align="left" valign="top">p.Q533delins</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">U</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td align="center" valign="top">TGCTGC</td>
<td align="left" valign="top">INS</td>
<td align="left" valign="top">QQQQQ</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">NKX2-1</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">36986635</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">T</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.G322S</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">D</td>
</tr>
<tr>
<td align="left" valign="top">NRG1</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">32505286</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">G</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.S17C</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">D</td>
</tr>
<tr>
<td align="left" valign="top">PRDM2</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">14107360</td>
<td align="center" valign="top">T</td>
<td align="center" valign="top">A</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.S823T</td>
<td align="center" valign="top">Y</td>
<td align="center" valign="top">D</td>
</tr>
<tr>
<td align="left" valign="top">PTPRC</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">198675962</td>
<td align="center" valign="top">A</td>
<td align="center" valign="top">G</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.N101S</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">U</td>
</tr>
<tr>
<td align="left" valign="top">RAD51D</td>
<td align="center" valign="top">CPG</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">33428245</td>
<td align="center" valign="top">G</td>
<td align="center" valign="top">A</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.A181V</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">P</td>
</tr>
<tr>
<td align="left" valign="top">RNF213</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">78346531</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">A</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.P4250T</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">U</td>
</tr>
<tr>
<td align="left" valign="top">ROS1</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">117662652</td>
<td align="center" valign="top">C</td>
<td align="center" valign="top">G</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.E1605Q</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">P</td>
</tr>
<tr>
<td align="left" valign="top">WNK2</td>
<td align="center" valign="top">CGC</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">96015284</td>
<td align="center" valign="top">T</td>
<td align="center" valign="top">C</td>
<td align="left" valign="top">Non-SNV</td>
<td align="left" valign="top">p.C652R</td>
<td align="center" valign="top">N</td>
<td align="center" valign="top">D</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-ol-0-0-11976"><p>Non-SNV, non-synonymous single nucleotide variation; non-FS DEL, non-frameshift deletion; non-FS INS, non-frameshift insertion; CGC, Cancer Gene Census; CPG, Cancer Predisposition Genes; P, possibly damaging; D, deleterious; U, uncertain significance; Y, yes; N, no; AA, amino acid.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>
