<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Molecular Medicine Reports</journal-id>
<journal-title-group>
<journal-title>Molecular Medicine Reports</journal-title>
</journal-title-group>
<issn pub-type="ppub">1791-2997</issn>
<issn pub-type="epub">1791-3004</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/mmr.2017.7412</article-id>
<article-id pub-id-type="publisher-id">mmr-16-05-6499</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Colon cancer recurrence-associated genes revealed by WGCNA co-expression network analysis</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Zhai</surname><given-names>Xiaofeng</given-names></name>
<xref rid="af1-mmr-16-05-6499" ref-type="aff">1</xref>
<xref rid="af2-mmr-16-05-6499" ref-type="aff">2</xref>
<xref rid="fn1-mmr-16-05-6499" ref-type="author-notes">&#x002A;</xref></contrib>
<contrib contrib-type="author"><name><surname>Xue</surname><given-names>Qingfeng</given-names></name>
<xref rid="af3-mmr-16-05-6499" ref-type="aff">3</xref>
<xref rid="fn1-mmr-16-05-6499" ref-type="author-notes">&#x002A;</xref></contrib>
<contrib contrib-type="author"><name><surname>Liu</surname><given-names>Qun</given-names></name>
<xref rid="af1-mmr-16-05-6499" ref-type="aff">1</xref>
<xref rid="af2-mmr-16-05-6499" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Guo</surname><given-names>Yuyu</given-names></name>
<xref rid="af1-mmr-16-05-6499" ref-type="aff">1</xref>
<xref rid="af2-mmr-16-05-6499" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Chen</surname><given-names>Zhe</given-names></name>
<xref rid="af1-mmr-16-05-6499" ref-type="aff">1</xref>
<xref rid="af2-mmr-16-05-6499" ref-type="aff">2</xref>
<xref rid="c1-mmr-16-05-6499" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-mmr-16-05-6499"><label>1</label>Department of Integrative Oncology, Changhai Hospital of Traditional Chinese Medicine, Second Military Medical University, Shanghai 200433, P.R. China</aff>
<aff id="af2-mmr-16-05-6499"><label>2</label>Department of Integrative Oncology, Changhai Hospital, Shanghai 200433, P.R. China</aff>
<aff id="af3-mmr-16-05-6499"><label>3</label>Department of Anesthesiology, 264 Hospital of The People&#x0027;s Liberation Army, Taiyuan, Shanxi 030001, P.R. China</aff>
<author-notes>
<corresp id="c1-mmr-16-05-6499"><italic>Correspondence to</italic>: Dr Zhe Chen, Department of Integrative Oncology, Changhai Hospital, 168 Changhai Road, Shanghai 200433, P.R. China, E-mail: <email>chendhdhd@163.com</email></corresp>
<fn id="fn1-mmr-16-05-6499"><label>&#x002A;</label><p>Contributed equally</p></fn>
</author-notes>
<pub-date pub-type="ppub"><month>05</month><year>2017</year></pub-date>
<pub-date pub-type="epub"><day>31</day><month>08</month><year>2017</year></pub-date>
<volume>16</volume>
<issue>5</issue>
<fpage>6499</fpage>
<lpage>6505</lpage>
<history>
<date date-type="received"><day>31</day><month>03</month><year>2016</year></date>
<date date-type="accepted"><day>20</day><month>03</month><year>2017</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Zhai et al.</copyright-statement>
<copyright-year>2017</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>The present study aimed to identify the recurrence-associated genes in colon cancer, which may provide theoretical evidence for the development of novel methods to prevent tumor recurrence. Colon cancer and matched normal samples microarray data (E-GEOD-39582) were downloaded from ArrayExpress. Genes with significant variation were identified, followed by the screening of differentially expressed genes (DEGs). Subsequently, the co-expression network of DEGs was constructed using the weighted correlation network analysis (WGCNA) method, which was verified using the validation dataset. The significant modules associated with recurrence in the network were subsequently screened and verified in another independent dataset E-GEOD-33113. Function and pathway enrichment analyses were also conducted to determine the roles of selected genes. Survival analysis was performed to identify the association between these genes and survival. A total of 434 DEGs were identified in the colon samples, and stress-associated endoplasmic reticulum protein family member 2 (SERP2) and long non-coding RNA-0219 (LINC0219) were determined to be the vital DEGs between all the three sub-type groups with different clinical features. The brown module was identified to be the most significant module in the co-expression network associated with the recurrence of colon cancer, which was verified in the E-GEOD-33113 dataset. Top 10 genes in the brown module, including EGF containing fibulin like extracellular matrix protein 2 (EFEMP2), fibrillin 1 (FBN1) and secreted protein acidic and cysteine rich (SPARC) were also associated with survival time of colon cancer patients. Further analysis revealed that the function of cell adhesion, biological adhesion, extracellular matrix (ECM) organization, pathways of ECM-receptor interaction and focal adhesion were the significantly changed terms in colon cancer. In conclusion, SERP2, EFEMP2, FBN1, SPARC, and LINC0219 were revealed to be the recurrence-associated molecular and prognostic indicators in colon cancer by WGCNA co-expression network analysis.</p>
</abstract>
<kwd-group>
<kwd>colon cancer</kwd>
<kwd>co-expression</kwd>
<kwd>differentially expressed genes</kwd>
<kwd>recurrence</kwd>
<kwd>survival</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Colon cancer is one of the most common malignant tumors, with a high incident rate in the 40&#x2013;50 age group. Colon cancer affects ~150,000 patients in the USA annually (<xref rid="b1-mmr-16-05-6499" ref-type="bibr">1</xref>). Due to the changing of diets, colon cancer has become the 4th cause for malignant tumor mortality in China and there are ~140,000 diagnosed cases annually (<xref rid="b2-mmr-16-05-6499" ref-type="bibr">2</xref>). Surgery is the primary therapy for colon cancer and patients exhibit 5-year survival rate of 50&#x0025; following surgery (<xref rid="b3-mmr-16-05-6499" ref-type="bibr">3</xref>).</p>
<p>However, 15&#x2013;20&#x0025; patients experience recurrence following treatment. Tumor recurrence following curative surgery is a major hindrance for the improvement of overall survival (<xref rid="b4-mmr-16-05-6499" ref-type="bibr">4</xref>). Therefore, it is important to identify the molecular changes in patients and to determine the underlying reason for colon cancer recurrence. Biomarkers have been used as tools in the detection and management of the disease in patients with colon cancer (<xref rid="b5-mmr-16-05-6499" ref-type="bibr">5</xref>). For instance, the CpG island methylator phenotype is independently associated with an unfavorable prognosis in patients with colon cancer (<xref rid="b6-mmr-16-05-6499" ref-type="bibr">6</xref>). Epithelial cell adhesion molecule, cluster of differentiation (CD)26, musashi RNA binding protein 1, CD29, CD24, leucine rich repeat containing G protein-coupled receptor 5 and aldehyde dehydrogenase 1 family member A1 have been identified as potential putative markers for colon cancer (<xref rid="b7-mmr-16-05-6499" ref-type="bibr">7</xref>,<xref rid="b8-mmr-16-05-6499" ref-type="bibr">8</xref>). DNA methylation may also predict recurrence of resected stage III proximal colon cancer (<xref rid="b9-mmr-16-05-6499" ref-type="bibr">9</xref>). MicroRNA-93 inhibited the early relapse of colon cancer by targeting cell cycle-associated genes (<xref rid="b10-mmr-16-05-6499" ref-type="bibr">10</xref>). Phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit a mutation in colorectal cancer may act as a predictive molecular biomarker for adjuvant aspirin therapy in colon cancer (<xref rid="b11-mmr-16-05-6499" ref-type="bibr">11</xref>).</p>
<p>An improved understanding of the biology of recurrence may improve the development of novel recurrence prevention or treatment methods in colon cancer. In order to investigate the recurrence-associated genes in colon cancer for future therapy, a co-expression network of differentially expressed genes (DEGs) in colon cancer was constructed in the present study and the most significant modules in the network were used to reveal the recurrence-associated genes. Subsequently, the functions of recurrence-associated genes were enriched to determine the importance of these genes in the relapse of patients with colon cancer.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Microarray profiles</title>
<p>Two microarray profiles of colon cancer samples including recurrence information (E-GEOD-39,582 and E-GEOD-33,113) were downloaded from ArrayExpress (<uri xlink:href="http://www.ebi.ac.uk/arrayexpress/">http://www.ebi.ac.uk/arrayexpress/</uri>). E-GEOD-39,582 included 566 samples, based on the platform of AFFY HG-U133_Plus_2, which were divided into the training dataset and the validation dataset for the weighted correlation network analysis (WGCNA) network construction. There were a total of 90 colon cancer samples in E-GEOD-33,113, with the recurrence status and clinical information of the samples, which was used as the validation dataset for mining vital module associated with recurrence.</p>
</sec>
<sec>
<title>Sample classification</title>
<p>Samples in the profiles were classified using Consensus Clustering in R (<uri xlink:href="http://www.bioconductor.org/packages/release/bioc/html/ConsensusClusterPlus.html">http://www.bioconductor.org/packages/release/bioc/html/ConsensusClusterPlus.html</uri>) to identify their sub-types. The parameters set in this analysis are presented in <xref rid="tI-mmr-16-05-6499" ref-type="table">Table I</xref>. Following the classification of samples, the association between colon cancer samples and corresponding clinical performance [survival, sex, cghdata, chemotherapy, mismatch repair (Mmr) status, tumor node metastasis (TNM) stage, and Tumor location] were determiend using the &#x03C7;<sup>2</sup> test. P&#x003C;0.05 was considered to indicate a statistically significant difference.</p>
</sec>
<sec>
<title>Pretreatment of genes expression data</title>
<p>Data in the profiles were initially normalized using the robust multi-array analysis (RMA) method in Affy package (<uri xlink:href="http://www.bioconductor.org/packages/release/bioc/html/affy.html">http://www.bioconductor.org/packages/release/bioc/html/affy.html</uri>) and were transformed using a log<sub>2</sub> transformation. Probes were converted into gene symbols and the average expression value was used as the only value of the gene with multiple corresponding probes. Following that, the cv method in the genefilter package was applied to filter the genes with significant variation. Genes with coefficient of variation &#x003E;0.4 were recognized as the candidate genes.</p>
</sec>
<sec>
<title>Screening of DEGs</title>
<p>The limma package in R (<uri xlink:href="http://www.bioconductor.org/packages/release/bioc/html/limma.html">http://www.bioconductor.org/packages/release/bioc/html/limma.html</uri>) was used to identify the DEGs from the candidate genes. P&#x003C;0.01 and log fold change (FC) &#x003E;1.5 were set as the cut-off criteria.</p>
</sec>
<sec>
<title>Construction of the WGCNA co-expression network</title>
<p>The WGCNA method (<xref rid="b12-mmr-16-05-6499" ref-type="bibr">12</xref>) was used to construct the co-expression network of the genes in the test samples of colon cancer. The interaction coefficient between genes was calculated using the following formula:</p>
<disp-formula>
<alternatives>
<mml:math id="umml1" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="normal">S</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow><mml:mi>&#x03B2;</mml:mi></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width=".16em" /><mml:mspace width=".16em" /><mml:msub><mml:mi mathvariant="normal">S</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>|</mml:mo><mml:mtext>cor</mml:mtext><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi mathvariant="normal">x</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="normal">x</mml:mi><mml:mi mathvariant="normal">j</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>|</mml:mo><mml:mspace width=".16em" /><mml:mspace width=".16em" /><mml:mspace width=".16em" /><mml:mo stretchy="false">(</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math>
<graphic xlink:href="MMR-16-05-6499-g00.tif"/>
</alternatives>
</disp-formula>
<p>Where x<sub>i</sub> and x<sub>j</sub> are the expression vectors of gene i and j, respectively. The Pearson coefficient of these two vectors was cor, which was transited into the interaction coefficient using S<sub>ij</sub>. This transition intended to give more weight to the strong connections and reduce the importance of the weak connections in the predicted co-expression network in order to improve the reliability of the co-expression network.</p>
<p>The connection coefficient will be transformed into a weighted coefficient W<sub>ij</sub> using the following formula:</p>
<disp-formula>
<alternatives>
<mml:math id="umml2" display="block"><mml:mrow><mml:mtable columnalign="right"><mml:mtr columnalign="right"><mml:mtd columnalign="right"><mml:mrow><mml:msub><mml:mi mathvariant="normal">W</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi mathvariant="normal">l</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub><mml:mo>&#x002B;</mml:mo><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>min</mml:mo><mml:mo>{</mml:mo><mml:msub><mml:mi mathvariant="normal">k</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="normal">k</mml:mi><mml:mi mathvariant="normal">j</mml:mi></mml:msub><mml:mo>}</mml:mo><mml:mo>&#x002B;</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2013;</mml:mo><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mspace width=".16em" /><mml:mspace width=".16em" /><mml:mo stretchy="false">(</mml:mo><mml:mn>2</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign="right"><mml:mtd columnalign="right"><mml:mrow><mml:msub><mml:mi mathvariant="normal">l</mml:mi><mml:mrow><mml:mtext>ij</mml:mtext></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mi mathvariant="normal">u</mml:mi></mml:munder><mml:mrow><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mtext>iu</mml:mtext></mml:mrow></mml:msub><mml:mspace width=".16em" /><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mtext>uj</mml:mtext></mml:mrow></mml:msub><mml:mspace width=".16em" /><mml:mo>,</mml:mo><mml:mspace width=".16em" /><mml:mspace width=".16em" /><mml:mspace width=".16em" /><mml:msub><mml:mi mathvariant="normal">k</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mi mathvariant="normal">u</mml:mi></mml:munder><mml:mrow><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mtext>iu</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:math>
<graphic xlink:href="MMR-16-05-6499-g01.tif"/>
</alternatives>
</disp-formula>
<p>Subsequently, the co-expression network would be constructed based on the W matrix, followed by module mining. The reliability of the minded modules was verified using the verify sample set E-GEOD-33113. The topological properties were also confirmed. Modules with module significance &#x003C;0.05 were identified to be recurrence-associated modules.</p>
</sec>
<sec>
<title>Verification of recurrence-associated modules</title>
<p>The average expression value of genes in the significant modules in each sample was calculated. Subsequently, the samples were ranked based on the expression level of modules. According to 1/4 and 3/4 value of expression level, the modules were divided into high expression level, median expression level, and low expression level. Finally, the Kaplan-Meier (KM) curves of recurrence-associated modules expression level and recurrence status were drawn. The significance of the different expression level modules was compared, and P&#x003C;0.05 was considered to indicate a statistically significant difference. The identified vital module was verified using the independent dataset E-GEOD-33113.</p>
</sec>
<sec>
<title>Function enrichment of recurrence associated modules</title>
<p>In order to determine the functions of the genes in the recurrence-associated modules, genes were subjected to Database for Annotation, Visualization and Integrated Discovery (<xref rid="b13-mmr-16-05-6499" ref-type="bibr">13</xref>) for function and pathway enrichment. P&#x003C;0.05 was the threshold used for the significant terms.</p>
</sec>
<sec>
<title>Survival analysis of genes in recurrence-associated modules</title>
<p>Survival analysis was performed on the genes in the recurrence-associated modules. The degree (k) of a gene and the significance (P) between each gene and sample survival time were also calculated using a Cox regression. Subsequently, the interaction coefficient (coef) between k and -log<sub>10</sub>(P) was computed to identify the hub genes associated with survival. Finally, selected genes were determined in the verifed samples for an association with survival. Recurrence associated genes were also subject to sample classification.</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>Data preprocessing and sample classification</title>
<p>From the 19,846 irredundant genes in E-GEOD-39582, 6,600 were the variation genes. Consensus Clustering analysis revealed that cumulative distribution function (CDF) was at a high level when there were 3 sub-types, accompanied with a satisfied classification effect (<xref rid="f1-mmr-16-05-6499" ref-type="fig">Fig. 1</xref>). Therefore, the 556 samples in E-GEOD-39582 were divided into 3 subgroups: G1, G2, and G3 (<xref rid="f2-mmr-16-05-6499" ref-type="fig">Fig. 2</xref>). There were significant survival differences among these 3 types of samples, and as depicted in <xref rid="f3-mmr-16-05-6499" ref-type="fig">Fig. 3</xref>, G1 exhibited the highest survival status, whereas G3 exhibited the lowest survival status. Additionally, the c<sup>2</sup> test determined that all clinical data, with the exception of sex, were significantly different among these 3 groups (<xref rid="tII-mmr-16-05-6499" ref-type="table">Table II</xref>).</p>
</sec>
<sec>
<title>DEGs among the 3 sub-groups</title>
<p>From the 434 DEGs, 76 were the DEGs between G1 and G2, 390 were the DEGs between G1 and G3 and 63 were DEGs between G2 and G3. A total of 2 DEGs were identified in all 3 groups (<xref rid="f4-mmr-16-05-6499" ref-type="fig">Fig. 4</xref>). These two common DEGs were stress-associated endoplasmic reticulum protein family member 2 (SERP2) and long non-coding RNA-0219 (LINC0219).</p>
</sec>
<sec>
<title>Co-expression network construction and module mining</title>
<p>The co-expression network of the training dataset was divided into 4 modules (<xref rid="f5-mmr-16-05-6499" ref-type="fig">Fig. 5</xref>), which were verified with the validation dataset. By calculating the correlation coefficient, the connections between the genes in each module and survival status of the samples were identified. It was determined that the Brown module had the highest module significance and there were various survival-associated genes (hub genes) in this module (<xref rid="f6-mmr-16-05-6499" ref-type="fig">Fig. 6</xref>). These connections were also verified in the validation dataset.</p>
</sec>
<sec>
<title>Recurrence-associated module analysis</title>
<p>All 431 genes in the Brown module were subject to a clustering analysis and it was revealed that there were 3 types of samples (<xref rid="f7-mmr-16-05-6499" ref-type="fig">Fig. 7</xref>). As presented in <xref rid="f7-mmr-16-05-6499" ref-type="fig">Fig. 7</xref>, certain G2 type samples were also observed in with G1 and G2 type samples, suggesting it may have a medium role for the connection between the G1 and G3 status. Additionally, the recurrence interval of G3 samples was short, accompanied with downregulated genes.</p>
<p>The top 10 genes were collagen type VI &#x03B1; 3 chain (COL6A3), EGF containing fibulin like extracellular matrix protein 2 (EFEMP2), fibrillin 1 (FBN1), follistatin-like 1 (FSTL1), glycosyltransferase 8 domain containing 2 (GLT8D2), heart development protein with EGF like domains 1 (HEG1), RAB31, member RAS oncogene family (RAB31), secreted protein acidic and cysteine rich (SPARC), SPARC/osteonectin, cwcv and kazal like domains proteoglycan 1 (SPOCK1) and TIMP metallopeptidase inhibitor 2 (TIMP2) in the Brown module with higher degrees were associated with survival (<xref rid="tIII-mmr-16-05-6499" ref-type="table">Table III</xref>). Genes in the Brown module were primarily enriched in tumor recurrence-associated functions and pathways, including cell adhesion, biological adhesion, ECM organism, the ECM-receptor interaction and the focal adhesion pathways (<xref rid="tIV-mmr-16-05-6499" ref-type="table">Table IV</xref>).</p>
</sec>
<sec>
<title>Validation of recurrence related modules</title>
<p>As presented in the KM curves (<xref rid="f8-mmr-16-05-6499" ref-type="fig">Fig. 8</xref>), there was a significant difference in terms of recurrence status (P=1.5&#x00D7;10<sup>&#x2212;6</sup>) among the samples with high, median, and low gene expression level of the Brown module. High expression levels inidcated higher incidence of recurrence, whereas low expression levels show lower recurrence incidence. This association between gene expression level and recurrence incidence was validated in the dataset of E-GEOD-33113 (P=2.8&#x00D7;10<sup>&#x2212;2</sup>).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>Classification of the colon samples was downloaded from ArrayExpress revealed that there were significant differences of survival status and clinical data among the 3 subtype samples. From the 434 DEGs, SERP2 and LINC0129 were the common DEGs of the 3 subgroups, suggesting they may have an important role in the recurrence of colon cancer. The Brown module was the recurrence-associated module in the co-expression network of DEGs and the top 10 genes (COL6A3, EFEMP2, FBN1, FSTL1, GLT8D2, HEG1, RAB31, SPARC, SPOCK1, and TIMP2) in this module with higher degrees were demonstrated to be significantly associated with survival. Enrichment analysis revealed that genes in the Brown module were primarily enriched in tumor recurrence-associated functions and pathways, including cell and biological adhesion, ECM organization, the ECM-receptor interaction, and the focal adhesion pathways. Additionally, the association between the module and tumor recurrence were verified in another dataset E-GEOD-33113.</p>
<p>SERP2 belongs to the serine proteinase inhibitor family, which are key regulators for the biological pathways that initiate coagulation, inflammation, angiogenesis, apoptosis, complement activation response and ECM composition (<xref rid="b14-mmr-16-05-6499" ref-type="bibr">14</xref>). SERP2 methylation was identified to be a marker for the detection and diagnosis of colon cancer (<xref rid="b15-mmr-16-05-6499" ref-type="bibr">15</xref>). Expression of SERP2 is reported to be an early event in colon cancer, and is associated with carcinogenesis and its development (<xref rid="b16-mmr-16-05-6499" ref-type="bibr">16</xref>). LINC0129 is a long non-coding RNA (lincRNA) and lincRNAs are RNAs &#x003E;200 nt, which are not translated into proteins. Dysfunctions of lincRNAs have been associated with cancer. A previous study revealed that the downregulation of lincRNA BRAF-activated non-protein coding RNA may promote the proliferation of colorectal cancer cells (<xref rid="b17-mmr-16-05-6499" ref-type="bibr">17</xref>). Another previous study reported that lincRNA HOX transcript antisense RNA expression may be a poor prognosis indicator in colon cancer (<xref rid="b18-mmr-16-05-6499" ref-type="bibr">18</xref>). Additionally, overexpression of lincRNA prostate cancer associated transcript 1 was identified to be a novel biomarker of poor prognosis in patients with colon cancer (<xref rid="b19-mmr-16-05-6499" ref-type="bibr">19</xref>). However, there are currently no direct findings that have determined the connection between LINC0129 and colon cancer, all aforementioned findings may have suggested that it may have an important role in colorectal cancer by contributing to the process of relapse.</p>
<p>EFEMP2 is a serum biomarker for the early detection of colon cancer (<xref rid="b20-mmr-16-05-6499" ref-type="bibr">20</xref>) and a superior biomarker compared with carcinoembryonic-antigen, which is the sole biomarker currently used for the diagnosis and treatment monitor in colon cancer (<xref rid="b21-mmr-16-05-6499" ref-type="bibr">21</xref>). FBN1 is a component of the extracellular microfibril and the hypermethylation status of its promoter is a specific and sensitive biomarker for colon cancer (<xref rid="b22-mmr-16-05-6499" ref-type="bibr">22</xref>). SPARC is a matricellular protein involved in cell migration, angiogenesis and tissue remodeling. High SPARC expression may be associated with an improved clinical outcome in stage II colon cancer (<xref rid="b23-mmr-16-05-6499" ref-type="bibr">23</xref>). A previous study determined that the absence of stromal SPARC is an independent prognostic predicator for poor prognosis of colon cancer (<xref rid="b24-mmr-16-05-6499" ref-type="bibr">24</xref>). The high degrees of these genes in the recurrence-associated modules indicated that they have important roles in colon cancer relapse. Among the significantly enriched pathways, the ECM-receptor interaction and focal adhesion pathways were functionally clearly associated with the progression and prognosis of colon cancer (<xref rid="b25-mmr-16-05-6499" ref-type="bibr">25</xref>).</p>
<p>By constructing the co-expression network of genes and identifying the recurrence related modules in the network, the present study identified several survival and recurrence-associated genes in colon cancer. These genes, including SERP2, EFEMP2, FBN1, SPARC, and LINC0219 were identified as recurrence-associated molecular and prognosis indicators in colon cancer.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>The present study was supported by the Shanghai Municipal Commission of Health and Family Planning (grant no. ZYSNXD-CC-ZDYJ032), the Fund from Shanghai Science and Technology Committee (grant no. 12401907800) and the Shanghai Science and Technology Committee (grant no 15401931700).</p>
</ack>
<ref-list>
<title>References</title>
<ref id="b1-mmr-16-05-6499"><label>1</label><element-citation publication-type="book"><collab collab-type="corp-author">American Cancer Society</collab><source>Colorectal cancer facts &#x0026; figures, 2011&#x2013;2013</source><publisher-loc>Atlanta, GA</publisher-loc><publisher-name>American Cancer Society</publisher-name><year>2011</year></element-citation></ref>
<ref id="b2-mmr-16-05-6499"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname><given-names>Z</given-names></name><name><surname>Baige</surname><given-names>L</given-names></name><name><surname>Shen</surname><given-names>G</given-names></name><name><surname>Lina</surname><given-names>S</given-names></name><name><surname>Lingxia</surname><given-names>L</given-names></name><name><surname>Chengyan</surname><given-names>H</given-names></name></person-group><article-title>Study on the Serum Periostin Concentrations of colon cancer patients</article-title><source>Chin J Lab Dia</source><volume>4</volume><fpage>42</fpage><year>2015</year></element-citation></ref>
<ref id="b3-mmr-16-05-6499"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yangming</surname><given-names>G</given-names></name><name><surname>Chunxiao</surname><given-names>W</given-names></name><name><surname>Minlu</surname><given-names>Z</given-names></name><name><surname>Minlu</surname><given-names>Z</given-names></name><name><surname>Peng</surname><given-names>P</given-names></name><name><surname>Kai</surname><given-names>GU</given-names></name><name><surname>Pingping</surname><given-names>B</given-names></name><name><surname>Zhezhou</surname><given-names>H</given-names></name><name><surname>Yongmei</surname><given-names>X</given-names></name><name><surname>Ying</surname><given-names>Z</given-names></name></person-group><article-title>Colorectal cancer survival analysis in major areas in Shanghai</article-title><source>China Oncol</source><volume>25</volume><fpage>497</fpage><lpage>504</lpage><year>2015</year></element-citation></ref>
<ref id="b4-mmr-16-05-6499"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gerger</surname><given-names>A</given-names></name><name><surname>Zhang</surname><given-names>W</given-names></name><name><surname>Yang</surname><given-names>D</given-names></name><name><surname>Bohanes</surname><given-names>P</given-names></name><name><surname>Ning</surname><given-names>Y</given-names></name><name><surname>Winder</surname><given-names>T</given-names></name><name><surname>LaBonte</surname><given-names>MJ</given-names></name><name><surname>Wilson</surname><given-names>PM</given-names></name><name><surname>Benhaim</surname><given-names>L</given-names></name><name><surname>Paez</surname><given-names>D</given-names></name><etal/></person-group><article-title>Common cancer stem cell gene variants predict colon cancer recurrence</article-title><source>Clin Cancer Res</source><volume>17</volume><fpage>6934</fpage><lpage>6943</lpage><year>2011</year><pub-id pub-id-type="doi">10.1158/1078-0432.CCR-11-1180</pub-id><pub-id pub-id-type="pmid">21918173</pub-id></element-citation></ref>
<ref id="b5-mmr-16-05-6499"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Duffy</surname><given-names>M</given-names></name><name><surname>Lamerz</surname><given-names>R</given-names></name><name><surname>Haglund</surname><given-names>C</given-names></name><name><surname>Nicolini</surname><given-names>A</given-names></name><name><surname>Kalousov&#x00E1;</surname><given-names>M</given-names></name><name><surname>Holubec</surname><given-names>L</given-names></name><name><surname>Sturgeon</surname><given-names>C</given-names></name></person-group><article-title>Tumor markers in colorectal cancer, gastric cancer and gastrointestinal stromal cancers: European group on tumor markers 2014 guidelines update</article-title><source>Int J Cancer</source><volume>134</volume><fpage>2513</fpage><lpage>2522</lpage><year>2014</year><pub-id pub-id-type="doi">10.1002/ijc.28384</pub-id><pub-id pub-id-type="pmid">23852704</pub-id></element-citation></ref>
<ref id="b6-mmr-16-05-6499"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Juo</surname><given-names>YY</given-names></name><name><surname>Johnston</surname><given-names>FM</given-names></name><name><surname>Zhang</surname><given-names>DY</given-names></name><name><surname>Juo</surname><given-names>HH</given-names></name><name><surname>Wang</surname><given-names>H</given-names></name><name><surname>Pappou</surname><given-names>EP</given-names></name><name><surname>Yu</surname><given-names>T</given-names></name><name><surname>Easwaran</surname><given-names>H</given-names></name><name><surname>Baylin</surname><given-names>S</given-names></name><name><surname>van Engeland</surname><given-names>M</given-names></name><name><surname>Ahuja</surname><given-names>N</given-names></name></person-group><article-title>Prognostic value of CpG island methylator phenotype among colorectal cancer patients: A systematic review and meta-analysis</article-title><source>Ann Oncol</source><volume>25</volume><fpage>2314</fpage><lpage>2327</lpage><year>2014</year><pub-id pub-id-type="doi">10.1093/annonc/mdu149</pub-id><pub-id pub-id-type="pmid">24718889</pub-id><pub-id pub-id-type="pmcid">4239805</pub-id></element-citation></ref>
<ref id="b7-mmr-16-05-6499"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sanders</surname><given-names>MA</given-names></name><name><surname>Majumdar</surname><given-names>AP</given-names></name></person-group><article-title>Colon cancer stem cells: Implications in carcinogenesis</article-title><source>Front Biosci (Landmark Ed)</source><volume>16</volume><fpage>1651</fpage><lpage>1662</lpage><year>2011</year><pub-id pub-id-type="doi">10.2741/3811</pub-id><pub-id pub-id-type="pmid">21196254</pub-id><pub-id pub-id-type="pmcid">3913054</pub-id></element-citation></ref>
<ref id="b8-mmr-16-05-6499"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vermeulen</surname><given-names>L</given-names></name><name><surname>Todaro</surname><given-names>M</given-names></name><name><surname>de Sousa Mello</surname><given-names>F</given-names></name><name><surname>Sprick</surname><given-names>MR</given-names></name><name><surname>Kemper</surname><given-names>K</given-names></name><name><surname>Alea</surname><given-names>M Perez</given-names></name><name><surname>Richel</surname><given-names>DJ</given-names></name><name><surname>Stassi</surname><given-names>G</given-names></name><name><surname>Medema</surname><given-names>JP</given-names></name></person-group><article-title>Single-cell cloning of colon cancer stem cells reveals a multi-lineage differentiation capacity</article-title><source>Proc Natl Acad Sci USA</source><volume>105</volume><fpage>13427</fpage><lpage>13432</lpage><conf-date>2008</conf-date><pub-id pub-id-type="doi">10.1073/pnas.0805706105</pub-id><pub-id pub-id-type="pmid">18765800</pub-id><pub-id pub-id-type="pmcid">2533206</pub-id></element-citation></ref>
<ref id="b9-mmr-16-05-6499"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ahn</surname><given-names>JB</given-names></name><name><surname>Chung</surname><given-names>WB</given-names></name><name><surname>Maeda</surname><given-names>O</given-names></name><name><surname>Shin</surname><given-names>SJ</given-names></name><name><surname>Kim</surname><given-names>HS</given-names></name><name><surname>Chung</surname><given-names>HC</given-names></name><name><surname>Kim</surname><given-names>NK</given-names></name><name><surname>Issa</surname><given-names>JP</given-names></name></person-group><article-title>DNA methylation predicts recurrence from resected stage III proximal colon cancer</article-title><source>Cancer</source><volume>117</volume><fpage>1847</fpage><lpage>1854</lpage><year>2011</year><pub-id pub-id-type="doi">10.1002/cncr.25737</pub-id><pub-id pub-id-type="pmid">21509761</pub-id></element-citation></ref>
<ref id="b10-mmr-16-05-6499"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>IP</given-names></name><name><surname>Tsai</surname><given-names>HL</given-names></name><name><surname>Hou</surname><given-names>MF</given-names></name><name><surname>Chen</surname><given-names>KC</given-names></name><name><surname>Tsai</surname><given-names>PC</given-names></name><name><surname>Huang</surname><given-names>SW</given-names></name><name><surname>Chou</surname><given-names>WW</given-names></name><name><surname>Wang</surname><given-names>JY</given-names></name><name><surname>Juo</surname><given-names>SH</given-names></name></person-group><article-title>MicroRNA-93 inhibits tumor growth and early relapse of human colorectal cancer by affecting genes involved in the cell cycle</article-title><source>Carcinogenesis</source><volume>33</volume><fpage>1522</fpage><lpage>1530</lpage><year>2012</year><pub-id pub-id-type="doi">10.1093/carcin/bgs166</pub-id><pub-id pub-id-type="pmid">22581829</pub-id></element-citation></ref>
<ref id="b11-mmr-16-05-6499"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liao</surname><given-names>X</given-names></name><name><surname>Lochhead</surname><given-names>P</given-names></name><name><surname>Nishihara</surname><given-names>R</given-names></name><name><surname>Morikawa</surname><given-names>T</given-names></name><name><surname>Kuchiba</surname><given-names>A</given-names></name><name><surname>Yamauchi</surname><given-names>M</given-names></name><name><surname>Imamura</surname><given-names>Y</given-names></name><name><surname>Qian</surname><given-names>ZR</given-names></name><name><surname>Baba</surname><given-names>Y</given-names></name><name><surname>Shima</surname><given-names>K</given-names></name><etal/></person-group><article-title>Aspirin use, tumor PIK3CA mutation, and colorectal-cancer survival</article-title><source>N Engl J Med</source><volume>367</volume><fpage>1596</fpage><lpage>1606</lpage><year>2012</year><pub-id pub-id-type="doi">10.1056/NEJMoa1207756</pub-id><pub-id pub-id-type="pmid">23094721</pub-id><pub-id pub-id-type="pmcid">3532946</pub-id></element-citation></ref>
<ref id="b12-mmr-16-05-6499"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Langfelder</surname><given-names>P</given-names></name><name><surname>Horvath</surname><given-names>S</given-names></name></person-group><article-title>WGCNA: An R package for weighted correlation network analysis</article-title><source>BMC Bioinformatics</source><volume>9</volume><fpage>559</fpage><year>2008</year><pub-id pub-id-type="doi">10.1186/1471-2105-9-559</pub-id><pub-id pub-id-type="pmid">19114008</pub-id><pub-id pub-id-type="pmcid">2631488</pub-id></element-citation></ref>
<ref id="b13-mmr-16-05-6499"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>da</surname><given-names>W Huang</given-names></name><name><surname>Sherman</surname><given-names>BT</given-names></name><name><surname>Lempicki</surname><given-names>RA</given-names></name></person-group><article-title>Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources</article-title><source>Nat Protoc</source><volume>4</volume><fpage>44</fpage><lpage>57</lpage><year>2009</year><pub-id pub-id-type="pmid">19131956</pub-id></element-citation></ref>
<ref id="b14-mmr-16-05-6499"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Richardson</surname><given-names>J</given-names></name><name><surname>Viswanathan</surname><given-names>K</given-names></name><name><surname>Lucas</surname><given-names>A</given-names></name></person-group><article-title>Serpins, the vasculature, and viral therapeutics</article-title><source>Front Biosci</source><volume>11</volume><fpage>1042</fpage><lpage>1056</lpage><year>2006</year><pub-id pub-id-type="doi">10.2741/1862</pub-id><pub-id pub-id-type="pmid">16146796</pub-id></element-citation></ref>
<ref id="b15-mmr-16-05-6499"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>X</given-names></name><name><surname>Song</surname><given-names>YF</given-names></name><name><surname>Lu</surname><given-names>HN</given-names></name><name><surname>Wang</surname><given-names>DP</given-names></name><name><surname>Zhang</surname><given-names>XS</given-names></name><name><surname>Huang</surname><given-names>SL</given-names></name><name><surname>Sun</surname><given-names>BL</given-names></name><name><surname>Huang</surname><given-names>ZG</given-names></name></person-group><article-title>Combined detection of plasma GATA5 and SFRP2 methylation is a valid noninvasive biomarker for colorectal cancer and adenomas</article-title><source>World J Gastroenterol</source><volume>21</volume><fpage>2629</fpage><lpage>2637</lpage><year>2015</year><pub-id pub-id-type="doi">10.3748/wjg.v21.i9.2629</pub-id><pub-id pub-id-type="pmid">25759530</pub-id><pub-id pub-id-type="pmcid">4351212</pub-id></element-citation></ref>
<ref id="b16-mmr-16-05-6499"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ning</surname><given-names>L</given-names></name><name><surname>Haixing</surname><given-names>J</given-names></name><name><surname>Zhenyu</surname><given-names>H</given-names></name><name><surname>Shanyu</surname><given-names>Q</given-names></name><name><surname>Xin</surname><given-names>L</given-names></name></person-group><article-title>Expression of SFRP2, &#x03B2;-catenin and their roles in colorectal carcinoma</article-title><source>J Dig Oncol (Electronic Version)</source><year>2008</year></element-citation></ref>
<ref id="b17-mmr-16-05-6499"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname><given-names>Y</given-names></name><name><surname>Liu</surname><given-names>Y</given-names></name><name><surname>Wang</surname><given-names>J</given-names></name><name><surname>Jie</surname><given-names>D</given-names></name><name><surname>Yun</surname><given-names>T</given-names></name><name><surname>Li</surname><given-names>W</given-names></name><name><surname>Yan</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>K</given-names></name><name><surname>Feng</surname><given-names>J</given-names></name></person-group><article-title>Downregulated long noncoding RNA BANCR promotes the proliferation of colorectal cancer cells via downregualtion of p21 expression</article-title><source>PLoS One</source><volume>10</volume><fpage>e0122679</fpage><year>2015</year><pub-id pub-id-type="doi">10.1371/journal.pone.0122679</pub-id><pub-id pub-id-type="pmid">25928067</pub-id><pub-id pub-id-type="pmcid">4415816</pub-id></element-citation></ref>
<ref id="b18-mmr-16-05-6499"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kogo</surname><given-names>R</given-names></name><name><surname>Shimamura</surname><given-names>T</given-names></name><name><surname>Mimori</surname><given-names>K</given-names></name><name><surname>Kawahara</surname><given-names>K</given-names></name><name><surname>Imoto</surname><given-names>S</given-names></name><name><surname>Sudo</surname><given-names>T</given-names></name><name><surname>Tanaka</surname><given-names>F</given-names></name><name><surname>Shibata</surname><given-names>K</given-names></name><name><surname>Suzuki</surname><given-names>A</given-names></name><name><surname>Komune</surname><given-names>S</given-names></name><etal/></person-group><article-title>Long noncoding RNA HOTAIR regulates polycomb-dependent chromatin modification and is associated with poor prognosis in colorectal cancers</article-title><source>Cancer Res</source><volume>71</volume><fpage>6320</fpage><lpage>6326</lpage><year>2011</year><pub-id pub-id-type="doi">10.1158/0008-5472.CAN-11-1021</pub-id><pub-id pub-id-type="pmid">21862635</pub-id></element-citation></ref>
<ref id="b19-mmr-16-05-6499"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ge</surname><given-names>X</given-names></name><name><surname>Chen</surname><given-names>Y</given-names></name><name><surname>Liao</surname><given-names>X</given-names></name><name><surname>Liu</surname><given-names>D</given-names></name><name><surname>Li</surname><given-names>F</given-names></name><name><surname>Ruan</surname><given-names>H</given-names></name><name><surname>Jia</surname><given-names>W</given-names></name></person-group><article-title>Overexpression of long noncoding RNA PCAT-1 is a novel biomarker of poor prognosis in patients with colorectal cancer</article-title><source>Med Oncol</source><volume>30</volume><fpage>588</fpage><year>2013</year><pub-id pub-id-type="doi">10.1007/s12032-013-0588-6</pub-id><pub-id pub-id-type="pmid">23640607</pub-id></element-citation></ref>
<ref id="b20-mmr-16-05-6499"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yao</surname><given-names>L</given-names></name><name><surname>Lao</surname><given-names>W</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Tang</surname><given-names>X</given-names></name><name><surname>Hu</surname><given-names>X</given-names></name><name><surname>He</surname><given-names>C</given-names></name><name><surname>Hu</surname><given-names>X</given-names></name><name><surname>Xu</surname><given-names>LX</given-names></name></person-group><article-title>Identification of EFEMP2 as a serum biomarker for the early detection of colorectal cancer with lectin affinity capture assisted secretome analysis of cultured fresh tissues</article-title><source>J Proteome Res</source><volume>11</volume><fpage>3281</fpage><lpage>3294</lpage><year>2012</year><pub-id pub-id-type="doi">10.1021/pr300020p</pub-id><pub-id pub-id-type="pmid">22506683</pub-id></element-citation></ref>
<ref id="b21-mmr-16-05-6499"><label>21</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>McPherson</surname><given-names>RA</given-names></name><name><surname>Pincus</surname><given-names>MR</given-names></name></person-group><source>Henry&#x0027;s Clinical Diagnosis and Management by Laboratory Methods</source><edition>22nd edition</edition><publisher-name>Elsevier Saunders</publisher-name><publisher-loc>Philadelphia, PA</publisher-loc><year>2011</year><pub-id pub-id-type="doi">10.1016/B978-1-4377-0974-2.00028-2</pub-id></element-citation></ref>
<ref id="b22-mmr-16-05-6499"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname><given-names>Q</given-names></name><name><surname>Song</surname><given-names>Y</given-names></name><name><surname>Zhang</surname><given-names>H</given-names></name><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Xia</surname><given-names>P</given-names></name><name><surname>Dang</surname><given-names>C</given-names></name></person-group><article-title>Detection of hypermethylated fibrillin-1 in the stool samples of colorectal cancer patients</article-title><source>Med Oncol</source><volume>30</volume><fpage>695</fpage><year>2013</year><pub-id pub-id-type="doi">10.1007/s12032-013-0695-4</pub-id><pub-id pub-id-type="pmid">23963856</pub-id></element-citation></ref>
<ref id="b23-mmr-16-05-6499"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chew</surname><given-names>A</given-names></name><name><surname>Salama</surname><given-names>P</given-names></name><name><surname>Robbshaw</surname><given-names>A</given-names></name><name><surname>Klopcic</surname><given-names>B</given-names></name><name><surname>Zeps</surname><given-names>N</given-names></name><name><surname>Platell</surname><given-names>C</given-names></name><name><surname>Lawrance</surname><given-names>IC</given-names></name></person-group><article-title>SPARC, FOXP3, CD8 and CD45 correlation with disease recurrence and long-term disease-free survival in colorectal cancer</article-title><source>PLoS One</source><volume>6</volume><fpage>e22047</fpage><year>2011</year><pub-id pub-id-type="doi">10.1371/journal.pone.0022047</pub-id><pub-id pub-id-type="pmid">21818290</pub-id><pub-id pub-id-type="pmcid">3144212</pub-id></element-citation></ref>
<ref id="b24-mmr-16-05-6499"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname><given-names>JF</given-names></name><name><surname>Wang</surname><given-names>HK</given-names></name><name><surname>Xiao</surname><given-names>H</given-names></name><name><surname>Li</surname><given-names>N</given-names></name><name><surname>Cheng</surname><given-names>CX</given-names></name><name><surname>Zhao</surname><given-names>YZ</given-names></name><name><surname>Ma</surname><given-names>YB</given-names></name><name><surname>Gao</surname><given-names>JZ</given-names></name><name><surname>Bai</surname><given-names>RB</given-names></name><name><surname>Zheng</surname><given-names>HX</given-names></name></person-group><article-title>Relationship and prognostic significance of SPARC and VEGF protein expression in colon cancer</article-title><source>J Exp Clin Cancer Res</source><volume>29</volume><fpage>71</fpage><year>2010</year><pub-id pub-id-type="doi">10.1186/1756-9966-29-71</pub-id><pub-id pub-id-type="pmid">20565704</pub-id><pub-id pub-id-type="pmcid">2895582</pub-id></element-citation></ref>
<ref id="b25-mmr-16-05-6499"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lascorz</surname><given-names>J</given-names></name><name><surname>Chen</surname><given-names>B</given-names></name><name><surname>Hemminki</surname><given-names>K</given-names></name><name><surname>F&#x00F6;rsti</surname><given-names>A</given-names></name></person-group><article-title>Consensus pathways implicated in prognosis of colorectal cancer identified through systematic enrichment analysis of gene expression profiling studies</article-title><source>PLoS One</source><volume>6</volume><fpage>e18867</fpage><year>2011</year><pub-id pub-id-type="doi">10.1371/journal.pone.0018867</pub-id><pub-id pub-id-type="pmid">21541025</pub-id><pub-id pub-id-type="pmcid">3081819</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-mmr-16-05-6499" position="float">
<label>Figure 1.</label>
<caption><p>Correlation between the number of subgroups and CDF values. CDF, cumulative distribution function.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g02.tif"/>
</fig>
<fig id="f2-mmr-16-05-6499" position="float">
<label>Figure 2.</label>
<caption><p>The three subgroups of the samples in E-GEOD-39582.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g03.tif"/>
</fig>
<fig id="f3-mmr-16-05-6499" position="float">
<label>Figure 3.</label>
<caption><p>Recurrence analysis of the three subgroups.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g04.tif"/>
</fig>
<fig id="f4-mmr-16-05-6499" position="float">
<label>Figure 4.</label>
<caption><p>Differentially expressed genes in the 3 subgroups of G1, G2, and G3.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g05.tif"/>
</fig>
<fig id="f5-mmr-16-05-6499" position="float">
<label>Figure 5.</label>
<caption><p>Clustering results of minded modules in train (upper) and valid (below) dataset. Genes in modules are marked with different colors (blue, yellow, brown and turquoise), with grey color representing no genes in any modules.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g06.tif"/>
</fig>
<fig id="f6-mmr-16-05-6499" position="float">
<label>Figure 6.</label>
<caption><p>Distribution of survival related genes in all modules. Genes are presented in x-axis, and the enrichment significance is shown in y-axis.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g07.tif"/>
</fig>
<fig id="f7-mmr-16-05-6499" position="float">
<label>Figure 7.</label>
<caption><p>Clustering results of genes in the brown module. Pink, green, light green, and light blue represent G1-, G2-, and G3-type samples, respectively.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g08.tif"/>
</fig>
<fig id="f8-mmr-16-05-6499" position="float">
<label>Figure 8.</label>
<caption><p>Correlation analysis between the genes and recurrence time. Kaplan-Meier curves for the brown module in (A) E-GEOD-39582 and (B) E-GEOD-33113 datasets.</p></caption>
<graphic xlink:href="MMR-16-05-6499-g09.tif"/>
</fig>
<table-wrap id="tI-mmr-16-05-6499" position="float">
<label>Table I.</label>
<caption><p>Parameters used in sample classification using consensus clustering.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Parameter</th>
<th align="center" valign="bottom">Value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Max cluster group</td>
<td align="center" valign="top">20</td>
</tr>
<tr>
<td align="left" valign="top">Subsample number</td>
<td align="center" valign="top">5,000</td>
</tr>
<tr>
<td align="left" valign="top">Proportion sample</td>
<td align="center" valign="top">0.9</td>
</tr>
<tr>
<td align="left" valign="top">Proportion feature</td>
<td align="center" valign="top">1</td>
</tr>
<tr>
<td align="left" valign="top">Distance method</td>
<td align="center" valign="top">Pearson</td>
</tr>
<tr>
<td align="left" valign="top">Clustering method</td>
<td align="center" valign="top">K-means dist</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="tII-mmr-16-05-6499" position="float">
<label>Table II.</label>
<caption><p>Association analysis on clinical data with sub-samples groups.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Characteristic</th>
<th align="center" valign="bottom">&#x03C7;<sup>2</sup></th>
<th align="center" valign="bottom">df</th>
<th align="center" valign="bottom">P-value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Sex</td>
<td align="center" valign="top">1.1653</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">5.58&#x00D7;10<sup>&#x2212;1</sup></td>
</tr>
<tr>
<td align="left" valign="top">CGH data</td>
<td align="center" valign="top">17.0540</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">1.98&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">Chemotherapy</td>
<td align="center" valign="top">19.7370</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">5.63&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">MMR status</td>
<td align="center" valign="top">23.4140</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">1.05&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">TNM stage</td>
<td align="center" valign="top">26.2990</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9.34&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">Tumor location</td>
<td align="center" valign="top">14.7690</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">6.21&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-mmr-16-05-6499"><p>MMR, mismatch repair; TNM, tumor node metastasis; CGH, comparative genomic hybridization; df, degrees of freedom.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tIII-mmr-16-05-6499" position="float">
<label>Table III.</label>
<caption><p>Top 10 genes with high degrees in Brown module.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Gene name</th>
<th align="center" valign="bottom">Coefficient</th>
<th align="center" valign="bottom">P-value</th>
<th align="center" valign="bottom">k total</th>
<th align="center" valign="bottom">k within</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><italic>COL6A3</italic></td>
<td align="center" valign="top">2.17&#x00D7;10<sup>&#x2212;4</sup></td>
<td align="center" valign="top">1.9&#x00D7;10<sup>&#x2212;6</sup></td>
<td align="center" valign="top">136.82</td>
<td align="center" valign="top">91.86</td>
</tr>
<tr>
<td align="left" valign="top"><italic>EFEMP2</italic></td>
<td align="center" valign="top">2.10&#x00D7;10<sup>&#x2212;3</sup></td>
<td align="center" valign="top">9.2&#x00D7;10<sup>&#x2212;7</sup></td>
<td align="center" valign="top">172.24</td>
<td align="center" valign="top">110.00</td>
</tr>
<tr>
<td align="left" valign="top"><italic>FBN1</italic></td>
<td align="center" valign="top">1.42&#x00D7;10<sup>&#x2212;3</sup></td>
<td align="center" valign="top">5.7&#x00D7;10<sup>&#x2212;7</sup></td>
<td align="center" valign="top">137.03</td>
<td align="center" valign="top">92.14</td>
</tr>
<tr>
<td align="left" valign="top"><italic>FSTL1</italic></td>
<td align="center" valign="top">1.50&#x00D7;10<sup>&#x2212;3</sup></td>
<td align="center" valign="top">5.4&#x00D7;10<sup>&#x2212;7</sup></td>
<td align="center" valign="top">143.42</td>
<td align="center" valign="top">91.08</td>
</tr>
<tr>
<td align="left" valign="top"><italic>GLT8D2</italic></td>
<td align="center" valign="top">2.91&#x00D7;10<sup>&#x2212;3</sup></td>
<td align="center" valign="top">7.5&#x00D7;10<sup>&#x2212;7</sup></td>
<td align="center" valign="top">137.54</td>
<td align="center" valign="top">91.21</td>
</tr>
<tr>
<td align="left" valign="top"><italic>HEG1</italic></td>
<td align="center" valign="top">2.17&#x00D7;10<sup>&#x2212;3</sup></td>
<td align="center" valign="top">2.3&#x00D7;10<sup>&#x2212;5</sup></td>
<td align="center" valign="top">168.71</td>
<td align="center" valign="top">94.84</td>
</tr>
<tr>
<td align="left" valign="top"><italic>RAB31</italic></td>
<td align="center" valign="top">5.27&#x00D7;10<sup>&#x2212;4</sup></td>
<td align="center" valign="top">5.9&#x00D7;10<sup>&#x2212;5</sup></td>
<td align="center" valign="top">138.09</td>
<td align="center" valign="top">93.74</td>
</tr>
<tr>
<td align="left" valign="top"><italic>SPARC</italic></td>
<td align="center" valign="top">3.06&#x00D7;10<sup>&#x2212;4</sup></td>
<td align="center" valign="top">9.8&#x00D7;10<sup>&#x2212;8</sup></td>
<td align="center" valign="top">132.03</td>
<td align="center" valign="top">95.57</td>
</tr>
<tr>
<td align="left" valign="top"><italic>SPOCK1</italic></td>
<td align="center" valign="top">1.49&#x00D7;10<sup>&#x2212;3</sup></td>
<td align="center" valign="top">9.7&#x00D7;10<sup>&#x2212;7</sup></td>
<td align="center" valign="top">128.75</td>
<td align="center" valign="top">88.57</td>
</tr>
<tr>
<td align="left" valign="top"><italic>TIMP2</italic></td>
<td align="center" valign="top">5.34&#x00D7;10<sup>&#x2212;4</sup></td>
<td align="center" valign="top">4.2&#x00D7;10<sup>&#x2212;7</sup></td>
<td align="center" valign="top">166.30</td>
<td align="center" valign="top">105.73</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="tIV-mmr-16-05-6499" position="float">
<label>Table IV.</label>
<caption><p>Significantly enriched functional terms of genes in Brown module.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Term ID</th>
<th align="center" valign="bottom">Function</th>
<th align="center" valign="bottom">Count</th>
<th align="center" valign="bottom">P-value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">GO:0007155</td>
<td align="left" valign="top">Cell adhesion</td>
<td align="center" valign="top">84</td>
<td align="center" valign="top">7.29&#x00D7;10<sup>&#x2212;36</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0022610</td>
<td align="left" valign="top">Biological adhesion</td>
<td align="center" valign="top">84</td>
<td align="center" valign="top">8.12&#x00D7;10<sup>&#x2212;36</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0030198</td>
<td align="left" valign="top">Extracellular matrix organization</td>
<td align="center" valign="top">30</td>
<td align="center" valign="top">2.31&#x00D7;10<sup>&#x2212;23</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0043062</td>
<td align="left" valign="top">Extracellular structure organization</td>
<td align="center" valign="top">32</td>
<td align="center" valign="top">1.46&#x00D7;10<sup>&#x2212;19</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0001501</td>
<td align="left" valign="top">Skeletal system development</td>
<td align="center" valign="top">40</td>
<td align="center" valign="top">2.50&#x00D7;10<sup>&#x2212;17</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0001944</td>
<td align="left" valign="top">Vasculature development</td>
<td align="center" valign="top">35</td>
<td align="center" valign="top">1.35&#x00D7;10<sup>&#x2212;16</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0001568</td>
<td align="left" valign="top">Blood vessel development</td>
<td align="center" valign="top">34</td>
<td align="center" valign="top">5.71&#x00D7;10<sup>&#x2212;16</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0009611</td>
<td align="left" valign="top">Response to wounding</td>
<td align="center" valign="top">45</td>
<td align="center" valign="top">3.90&#x00D7;10<sup>&#x2212;13</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0006928</td>
<td align="left" valign="top">Cell motion</td>
<td align="center" valign="top">40</td>
<td align="center" valign="top">1.36&#x00D7;10<sup>&#x2212;11</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0042060</td>
<td align="left" valign="top">Wound healing</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">2.72&#x00D7;10<sup>&#x2212;11</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0016477</td>
<td align="left" valign="top">Cell migration</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">5.08&#x00D7;10<sup>&#x2212;10</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0048870</td>
<td align="left" valign="top">Cell motility</td>
<td align="center" valign="top">29</td>
<td align="center" valign="top">1.19&#x00D7;10<sup>&#x2212;9</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0051674</td>
<td align="left" valign="top">Localization of cell</td>
<td align="center" valign="top">29</td>
<td align="center" valign="top">1.19&#x00D7;10<sup>&#x2212;9</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0048514</td>
<td align="left" valign="top">Blood vessel morphogenesis</td>
<td align="center" valign="top">24</td>
<td align="center" valign="top">1.25&#x00D7;10<sup>&#x2212;9</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0051270</td>
<td align="left" valign="top">Regulation of cell motion</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">6.93&#x00D7;10<sup>&#x2212;9</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0035295</td>
<td align="left" valign="top">Tube development</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">1.44&#x00D7;10<sup>&#x2212;8</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0030334</td>
<td align="left" valign="top">Regulation of cell migration</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">2.15&#x00D7;10<sup>&#x2212;8</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0040012</td>
<td align="left" valign="top">Regulation of locomotion</td>
<td align="center" valign="top">21</td>
<td align="center" valign="top">3.35&#x00D7;10<sup>&#x2212;8</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0042127</td>
<td align="left" valign="top">Regulation of cell proliferation</td>
<td align="center" valign="top">45</td>
<td align="center" valign="top">1.12&#x00D7;10<sup>&#x2212;7</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0016337</td>
<td align="left" valign="top">Cell-cell adhesion</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">2.97&#x00D7;10<sup>&#x2212;6</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0007167</td>
<td align="left" valign="top">Enzyme linked receptor protein signaling pathway</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">2.49&#x00D7;10<sup>&#x2212;5</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0010033</td>
<td align="left" valign="top">Response to organic substance</td>
<td align="center" valign="top">36</td>
<td align="center" valign="top">5.15&#x00D7;10<sup>&#x2212;5</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0008285</td>
<td align="left" valign="top">Negative regulation of cell proliferation</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">5.67&#x00D7;10<sup>&#x2212;5</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0006954</td>
<td align="left" valign="top">Inflammatory response</td>
<td align="center" valign="top">21</td>
<td align="center" valign="top">1.07&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0000902</td>
<td align="left" valign="top">Cell morphogenesis</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">1.32&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0032989</td>
<td align="left" valign="top">Cellular component morphogenesis</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">2.24&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0008284</td>
<td align="left" valign="top">Positive regulation of cell proliferation</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">3.97&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">GO:0030182</td>
<td align="left" valign="top">Neuron differentiation</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">8.49&#x00D7;10<sup>&#x2212;4</sup></td>
</tr>
<tr>
<td align="left" valign="top">hsa04512</td>
<td align="left" valign="top">ECM-receptor interaction</td>
<td align="center" valign="top">26</td>
<td align="center" valign="top">1.80&#x00D7;10<sup>&#x2212;21</sup></td>
</tr>
<tr>
<td align="left" valign="top">hsa04510</td>
<td align="left" valign="top">Focal adhesion</td>
<td align="center" valign="top">35</td>
<td align="center" valign="top">1.85&#x00D7;10<sup>&#x2212;20</sup></td>
</tr>
<tr>
<td align="left" valign="top">hsa05200</td>
<td align="left" valign="top">Pathways in cancer</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">3.45&#x00D7;10<sup>&#x2212;5</sup></td>
</tr>
<tr>
<td align="left" valign="top">hsa04060</td>
<td align="left" valign="top">Cytokine-cytokine receptor interaction</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">4.34&#x00D7;10<sup>&#x2212;3</sup></td>
</tr>
<tr>
<td align="left" valign="top">hsa04810</td>
<td align="left" valign="top">Regulation of actin cytoskeleton</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">5.83&#x00D7;10<sup>&#x2212;3</sup></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-mmr-16-05-6499"><p>GO, gene ontology.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>