<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">OL</journal-id>
<journal-title-group>
<journal-title>Oncology Letters</journal-title>
</journal-title-group>
<issn pub-type="ppub">1792-1074</issn>
<issn pub-type="epub">1792-1082</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/ol.2019.11063</article-id>
<article-id pub-id-type="publisher-id">OL-0-0-11063</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Identification of a four-long non-coding RNA signature in predicting breast cancer survival</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Zhu</surname><given-names>Mingjie</given-names></name>
<xref rid="af1-ol-0-0-11063" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Lv</surname><given-names>Qing</given-names></name>
<xref rid="af1-ol-0-0-11063" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Huang</surname><given-names>Hu</given-names></name>
<xref rid="af1-ol-0-0-11063" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Sun</surname><given-names>Chunlei</given-names></name>
<xref rid="af1-ol-0-0-11063" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Pang</surname><given-names>Da</given-names></name>
<xref rid="af2-ol-0-0-11063" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Wu</surname><given-names>Junqiang</given-names></name>
<xref rid="af1-ol-0-0-11063" ref-type="aff">1</xref>
<xref rid="c1-ol-0-0-11063" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-ol-0-0-11063"><label>1</label>Department of Breast Surgery, The Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu 214035, P.R. China</aff>
<aff id="af2-ol-0-0-11063"><label>2</label>Department of Breast Surgery, Harbin Medical University Cancer Hospital, Harbin, Heilongjiang 150081, P.R. China</aff>
<author-notes>
<corresp id="c1-ol-0-0-11063"><italic>Correspondence to</italic>: Dr Junqiang Wu, Department of Breast Surgery, The Affiliated Hospital of Jiangnan University, 200 Huihe Road, Wuxi, Jiangsu 214035, P.R. China, E-mail: <email>dr_junqiang_wu@163.com</email></corresp>
</author-notes>
<pub-date pub-type="ppub">
<month>01</month>
<year>2020</year></pub-date>
<pub-date pub-type="epub">
<day>07</day>
<month>11</month>
<year>2019</year></pub-date>
<volume>19</volume>
<issue>1</issue>
<fpage>221</fpage>
<lpage>228</lpage>
<history>
<date date-type="received"><day>03</day><month>01</month><year>2019</year></date>
<date date-type="accepted"><day>04</day><month>10</month><year>2019</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Zhu et al.</copyright-statement>
<copyright-year>2019</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>Long non-coding RNAs (lncRNAs) serve key roles in tumorigenesis and are differentially expressed in cancer. Using bioinformatics and statistical methods, the present study aimed to identify an lncRNA signature to predict breast cancer survival. The gene expression data of 768 patients with breast cancer were downloaded from The Cancer Genome Atlas database, and Cox regression, Kaplan-Meier and receiver operating characteristic (ROC) analyses were performed to construct and validate a predictive model. Gene Ontology term enrichment and Kyoto Encyclopedia of Genes and Genomes pathway analysis were employed to predict the functions of the indicated lncRNAs. A signature consisting of four lncRNAs, including <italic>PVT1, MAPT-AS1, LINC00667</italic> and <italic>LINC00938</italic>, was identified, and patients were subsequently divided into high- and low-risk groups according to the median risk score. Kaplan-Meier analysis confirmed that patients in the high-risk group exhibited significantly poorer overall survival rate in both the training (P=0.0151) and the validation set (P=0.0016); furthermore, ROC analysis confirmed that the model could predict patient survival with a certain sensitivity and specificity. In conclusion, the four-lncRNA signature presents a potential prognostic biomarker for breast cancer that may be relevant for clinical application.</p>
</abstract>
<kwd-group>
<kwd>breast cancer</kwd>
<kwd>long non-coding RNA</kwd>
<kwd>overall survival</kwd>
<kwd>prognostic predictor</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Breast cancer is one of the most common malignancies in women in both developed and developing countries, and each year &#x003E;1,300,000 cases of breast cancer are reported globally (<xref rid="b1-ol-0-0-11063" ref-type="bibr">1</xref>). For decades, clinicopathological features have been utilized to evaluate the prognosis of patients with breast cancer, including tumor size, clinical stage, intrinsic subtype and lymph node status (<xref rid="b2-ol-0-0-11063" ref-type="bibr">2</xref>&#x2013;<xref rid="b5-ol-0-0-11063" ref-type="bibr">5</xref>). However, these factors are limited in their prognostic capability and are only useful in a number of patients (<xref rid="b6-ol-0-0-11063" ref-type="bibr">6</xref>). It is widely accepted that the underlying molecular mechanisms of breast cancer are complex and may involve the alterations of specific genomic regions, as well as epigenetic modifications in mammary epithelial cells (<xref rid="b7-ol-0-0-11063" ref-type="bibr">7</xref>,<xref rid="b8-ol-0-0-11063" ref-type="bibr">8</xref>). To the best of our knowledge, patients with similar disease characteristics, who have received similar treatments may present with markedly different clinical outcomes. Therefore, accurately predicting patient outcome and subsequently selecting the appropriate treatment, reducing morbidity and prolonging survival time are of great clinical importance.</p>
<p>Long non-coding RNAs (lncRNAs) are a class of RNAs with &#x003E;200 nucleotides and no known protein coding capability (<xref rid="b9-ol-0-0-11063" ref-type="bibr">9</xref>,<xref rid="b10-ol-0-0-11063" ref-type="bibr">10</xref>). lncRNAs have been implicated in a wide range of biological processes, including tumor-suppressor modulation, RNA-RNA interactions, and epigenetic and post-transcriptional regulation (<xref rid="b11-ol-0-0-11063" ref-type="bibr">11</xref>&#x2013;<xref rid="b14-ol-0-0-11063" ref-type="bibr">14</xref>). As additional biological functions of lncRNAs are identified, they have become the focus of an increasing number of studies. These studies have revealed that lncRNAs serve a role in carcinogenesis and possess specific expression patterns in cancer (<xref rid="b15-ol-0-0-11063" ref-type="bibr">15</xref>&#x2013;<xref rid="b17-ol-0-0-11063" ref-type="bibr">17</xref>). To date, several lncRNAs have been regarded as diagnostic and/or prognostic biomarkers for specific malignancies, such as lncRNA HOX transcript antisense RNA (<italic>HOTAIR</italic>), which is overexpressed in breast cancer, thus promoting cancer cell invasion and metastasis by altering the methylation and gene expression of histone H3K27 via polycomb repressive complex 2 (<xref rid="b18-ol-0-0-11063" ref-type="bibr">18</xref>). Another lncRNA, metastasis-associated lung adenocarcinoma transcript 1, was first identified in non-small cell lung cancer (NSCLC); its high level of expression was strongly correlated with an increased risk of NSCLC, and was associated with metastasis and poor patient outcome (<xref rid="b19-ol-0-0-11063" ref-type="bibr">19</xref>).</p>
<p>However, the predictive ability of single lncRNAs is still unsatisfactory, resulting in high numbers of both false positive and negative results (<xref rid="b20-ol-0-0-11063" ref-type="bibr">20</xref>). Therefore, the present study aimed to identify a four-lncRNA signature able to predict the overall survival (OS) rate of patients with breast cancer, and to validate the prognostic value of the identified lncRNAs using high-throughput sequencing data from The Cancer Genome Atlas (TCGA) database.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Breast cancer gene expression data from TCGA and Gene Expression Omnibus (GEO) databases</title>
<p>Breast cancer gene expression data, including coding and non-coding RNA sequence data, were acquired from TCGA (<uri xlink:href="https://cancergenome.nih.gov/">https://cancergenome.nih.gov/</uri>) together with the corresponding clinical information. Until 2017, 1,098 breast cancer samples were available from TCGA, though in the present study only those including patient survival status were selected (n=768); this enabled the determination of any association between the expression of lncRNAs of the lncRNA-expression signature and the corresponding OS time for breast cancer. These 768 breast cancer samples were divided equally into a training set (to identify the gene expression signature) and a validation set (to validate the gene expression signature). To confirm the expression levels of the differentially expressed genes, the gene expression dataset GSE5764 (<xref rid="b21-ol-0-0-11063" ref-type="bibr">21</xref>), containing 10 breast cancer tissue samples and 20 non-cancerous samples, was downloaded from GEO (<uri xlink:href="https://www.ncbi.nlm.nih.gov/geo/">https://www.ncbi.nlm.nih.gov/geo/</uri>; Affymetrix GPL570 platform, Affymetrix Human Genome U133 Plus 2.0 Array; Affymetrix; Thermo Fisher Scientific, Inc.).</p>
</sec>
<sec>
<title>Identification of lncRNAs</title>
<p>RNA genes downloaded from TCGA were compared with published lncRNAs from the MiTranscriptome database (<uri xlink:href="http://mitranscriptome.org/">http://mitranscriptome.org/</uri>). Potential lncRNAs were identified as transcriptome sequences that were mapped to corresponding lncRNAs, rather than any protein-coding region, and were not identified as protein-coding genes in the National Center for Biotechnology Information database (<uri xlink:href="https://www.ncbi.nlm.nih.gov/">https://www.ncbi.nlm.nih.gov/</uri>).</p>
</sec>
<sec>
<title>Gene expression data analysis</title>
<p>Raw read counts of the transcriptomic data from TCGA were normalized using the quartile normalization method and logarithmically transformed to a normal distribution. The Bioconductor package DESeq2 (<uri xlink:href="https://www.bioconductor.org/packages/release/bioc/html/DESeq2.html">https://www.bioconductor.org/packages/release/bioc/html/DESeq2.html</uri>, version 1.24.0) was used to perform the normalization and identify differentially expressed genes in breast cancer samples compared with adjacent normal tissues, with an adjusted P&#x003C;0.05 and an absolute log2-based fold-change &#x003E;0.5. For gene expression data from GEO, the R package limma (<uri xlink:href="https://www.bioconductor.org/packages/release/bioc/html/limma.html">https://www.bioconductor.org/packages/release/bioc/html/limma.html</uri>, version 3.40.6) was used to conduct differential expression analysis.</p>
</sec>
<sec>
<title>Establishment of a prognostic signature</title>
<p>To establish a prognostic signature for breast cancer, a two-step method was employed using the R package SIS (<uri xlink:href="https://CRAN.R-project.org/package=SIS">https://CRAN.R-project.org/package=SIS</uri>, version 0.8&#x2013;6) for sure independence screening. Firstly, univariate Cox regression analysis was performed to identify survival-associated genes. Secondly, SIS (based on the least absolute shrinkage and selection operator, Cox-penalized regression model) was used to identify important variables and construct multi-gene-based prognostic signatures for OS rate prediction.</p>
</sec>
<sec>
<title>Guilt by association analysis</title>
<p>To identify genes that correlated with the four lncRNAs of the prognostic signature, data from TCGA were used to evaluate the pairwise Pearson&#x0027;s correlation between the expression levels of the target lncRNAs. Only associated genes with an absolute r&#x2265;0.3 and a significant correlation (P&#x003C;0.05) were retained. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG; <uri xlink:href="https://www.genome.jp/kegg/">http://www.genome.jp/kegg/</uri>) pathway analyses were performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID; <uri xlink:href="https://david.ncifcrf.gov/">http://david.ncifcrf.gov/</uri>).</p>
</sec>
<sec>
<title>Statistical analysis</title>
<p>Kaplan-Meier analysis was used to estimate the performance of the prognostic signatures, and log-rank test was performed to evaluate statistical significance. A risk score was calculated for each patient according to the formula of the four-lncRNA signature, and patients were divided into high- and low-risk groups using the median score as a cut-off. Receiver operating characteristic (ROC) analysis was used to evaluate the sensitivity and specificity of the four-lncRNA signature and other biomarkers, including TP53, MKI67, ESR1, PGR, ERBB2 and HOTAIR. All statistical analyses were conducted using R 3.5.2 (<uri xlink:href="https://www.r-project.org/">https://www.r-project.org/</uri>), and P&#x003C;0.05 was considered to indicate a statistically significant difference.</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>Patient characteristics</title>
<p>All 768 patients were diagnosed with breast cancer based on clinicopathological evaluation. The clinical stage and histological subtype were determined using the Tumor-Node-Metastasis staging (<xref rid="b22-ol-0-0-11063" ref-type="bibr">22</xref>) and immunohistochemical molecular typing methods, respectively. According to the data, the estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2) status of each patient was indicated as positive, negative or indeterminate. In addition, the ranges of the OS and relapse-free survival times were 1&#x2013;8,605 and 1&#x2013;8,556 days, respectively. Patient characteristics are displayed in <xref rid="tI-ol-0-0-11063" ref-type="table">Table I</xref>.</p>
</sec>
<sec>
<title>Differential expression analysis and determination of the four-lncRNA signature in the training set</title>
<p>Differential expression analysis was employed to select differentially expressed RNAs in normal and cancerous tissues; a total of 8,854 upregulated and 5,939 downregulated RNAs were identified. Subsequently, all possible combinations of four lncRNAs were analyzed and compared using the two-step Cox regression method. A total of 7 models consisting of four lncRNAs were identified (<xref rid="SD1-ol-0-0-11063" ref-type="supplementary-material">Table SI</xref>); among these candidates, one model was identified as the most suitable for predicting the OS of patients with breast cancer. Patients were divided into high- and low-risk groups using the median risk score as a cut-off, with the risk score calculated as follows: Risk score=(&#x2212;0.015&#x00D7; expression value of <italic>PVT1</italic>) &#x002B; (&#x2212;0.193&#x00D7; expression value of <italic>MAPT-AS1</italic>) &#x002B; (&#x2212;0.116&#x00D7; expression value of <italic>LINC00667</italic>) &#x002B; (0.098&#x00D7; expression value of <italic>LINC00938</italic>). The coefficients of this formula were derived from multivariate Cox regression analysis (<xref rid="SD1-ol-0-0-11063" ref-type="supplementary-material">Table SI</xref>). Kaplan-Meier analysis was also performed to determine the association between the expression levels of the four-lncRNA signature and patient OS. Compared with those of the low-risk group, high-risk patients exhibited significantly poorer OS rate (log-rank P=0.0151; <xref rid="f1-ol-0-0-11063" ref-type="fig">Fig. 1A</xref>).</p>
</sec>
<sec>
<title>Validation of the four-lncRNA signature in the validation set</title>
<p>To confirm the predictive capacity of the four-lncRNA signature identified in the training set, the equivalent analyses were also performed in the validation set. Patients were divided into low- and high-risk groups, and the differences between patient OS rates were compared using Kaplan-Meier analysis. Patients in the high-risk group possessed significantly lower OS rate than those of patients in the low-risk group (log-rank P=0.0016; <xref rid="f1-ol-0-0-11063" ref-type="fig">Fig. 1B</xref>), which was consistent with the findings from the training set. Furthermore, ROC analysis was performed to evaluate the sensitivity and specificity of survival prediction; the area under the curve (AUC) was 0.641 (<xref rid="f2-ol-0-0-11063" ref-type="fig">Fig. 2A</xref>), indicating that the four-lncRNA signature was able to accurately predict the survival of patients with breast cancer.</p>
</sec>
<sec>
<title>Four-lncRNA signature in different clinical stages and molecular subtypes</title>
<p>ROC analyses were performed to investigate whether the four-lncRNA signature was applicable to different breast cancer stages and molecular subtypes. In stages I&#x2013;IV, the AUC values were 0.595, 0.687, 0.634 and 0.645, respectively (<xref rid="SD1-ol-0-0-11063" ref-type="supplementary-material">Fig. S1</xref>), indicating that the four-lncRNA signature was able to predict the survival of patients at different clinical stages of breast cancer. Regarding subtype, the AUC values in the four molecular subgroups were 0.637, 0.654, 0.688 and 0.613, respectively (<xref rid="SD1-ol-0-0-11063" ref-type="supplementary-material">Fig. S2</xref>), suggesting that the four-lncRNA signature served as a prognostic indicator for patients with different breast cancer subtypes.</p>
</sec>
<sec>
<title>Performance of the four-lncRNA signature compared with that of known biomarkers and individual lncRNAs</title>
<p>For further clarification, the performance of the four-lncRNA signature was compared with that of several known breast cancer biomarkers, including <italic>TP53, MKI67, ESR1, PGR, ERBB2</italic> and <italic>HOTAIR</italic>, using ROC and Kaplan-Meier analyses. The sensitivity and specificity of these six known biomarkers are displayed in <xref rid="f2-ol-0-0-11063" ref-type="fig">Fig. 2B</xref>. The AUC values of <italic>TP53, MKI67, ESR1, PGR, ERBB2</italic> and <italic>HOTAIR</italic> were 0.574, 0.483, 0.510, 0.501, 0.501 and 0.473 (data not shown), respectively, while the AUC value (0.641) of the four-lncRNA signature was greater. Kaplan-Meier analysis revealed that only <italic>TP53</italic> was significantly associated with patient OS (<xref rid="f3-ol-0-0-11063" ref-type="fig">Fig. 3A</xref>; log-rank P=0.0396), while the other selected biomarkers were not (<xref rid="f3-ol-0-0-11063" ref-type="fig">Fig. 3B-F</xref>). Moreover, the four lncRNAs from the identified model were also evaluated. ROC curve analysis generated AUC values for <italic>PVT1, MAPT-AS1, LINC00667</italic> and <italic>LINC00938</italic> as 0.532, 0.553, 0.550 and 0.480, respectively (<xref rid="SD1-ol-0-0-11063" ref-type="supplementary-material">Fig. S3</xref>), which were smaller than that of the four-lncRNA signature. In addition, the results of Kaplan-Meier analysis indicated that the differential expression of these lncRNAs was not significantly associated with the OS rate of patients with breast cancer (log-rank P&#x003E;0.05; <xref rid="SD1-ol-0-0-11063" ref-type="supplementary-material">Fig. S4</xref>).</p>
</sec>
<sec>
<title>Relative expression levels and potential biological functions of the four lncRNAs of the lncRNA signature</title>
<p>To further investigate the potential functions of the four lncRNAs of the signature, gene expression data from the GSE5764 dataset were downloaded from the GEO database, and differential expression analysis was performed. The fold-change values of <italic>PVT1, MAPT-AS1, LINC00667</italic> and <italic>LINC00938</italic> were 2.031, 3.057, 1.579, 0.455, respectively. This result indicated that <italic>PVT1, MAPT-AS1</italic> and <italic>LINC00667</italic> were upregulated in breast cancer tissues, and that <italic>LINC00938</italic> was downregulated. GO enrichment and KEGG pathway analyses were conducted. According to the results of GO analysis, the primary <italic>PVT1</italic>-associated functions were &#x2018;transcription elongation&#x2019;, &#x2018;mitochondrial electron transport&#x2019; and &#x2018;endoplasmic reticulum-associated degradation&#x2019; (<xref rid="f4-ol-0-0-11063" ref-type="fig">Fig. 4A</xref>). <italic>MAPT-AS1</italic> was associated with &#x2018;cilium morphogenesis&#x2019; and &#x2018;cilium assembly&#x2019;, &#x2018;intraciliary retrograde transport&#x2019; and &#x2018;neurological system process&#x2019; (<xref rid="f4-ol-0-0-11063" ref-type="fig">Fig. 4B</xref>). <italic>LINC00667</italic> was associated with &#x2018;regulation of transcription, DNA-templated&#x2019;, &#x2018;centrosome organization&#x2019; and &#x2018;regulation of cell morphogenesis&#x2019; (<xref rid="f4-ol-0-0-11063" ref-type="fig">Fig. 4C</xref>), and <italic>LINC00983</italic> was associated with biological processes involved in &#x2018;cilium assembly&#x2019; and &#x2018;S-adenosylmethionine metabolic process&#x2019; (<xref rid="f4-ol-0-0-11063" ref-type="fig">Fig. 4D</xref>). Moreover, KEGG analysis also indicated several biological processes and pathways that potentially associated with the signature lncRNAs, including &#x2018;Notch signaling pathway&#x2019;, &#x2018;oxidative phosphorylation&#x2019; and &#x2018;Huntington&#x0027;s disease&#x2019; (<xref rid="f4-ol-0-0-11063" ref-type="fig">Fig. 4E</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>In the present study, the RNA expression data of 768 patients from TCGA were analyzed, and 14,793 RNAs that were differently expressed in normal vs. cancerous tissues were selected. Following multivariate Cox regression analysis (data not shown), a model consisting of four lncRNAs (<italic>PVT1, MAPT-AS1, LINC00667</italic> and <italic>LINC00938</italic>) was determined to predict the survival of patients with breast cancer, while Kaplan-Meier and ROC analyses confirmed that this model was able to predict OS to an acceptable degree of specificity and sensitivity.</p>
<p>In previous years, an increasing number of lncRNAs have been identified and are widely considered to be a novel class of gene regulators in various types of cancer (<xref rid="b11-ol-0-0-11063" ref-type="bibr">11</xref>,<xref rid="b23-ol-0-0-11063" ref-type="bibr">23</xref>). With the development of high-throughput sequencing, a gradually increasing number of sequencing data have been used to study cancer-associated lncRNAs. Using transcriptome sequencing, Zou <italic>et al</italic> (<xref rid="b24-ol-0-0-11063" ref-type="bibr">24</xref>) identified two novel lncRNAs, LCE5A-1 and KCTD6-3, that are associated with head and neck carcinogenesis, and Ylip&#x00E4;&#x00E4; <italic>et al</italic> (<xref rid="b25-ol-0-0-11063" ref-type="bibr">25</xref>) established prostate cancer associated transcript-1 as a novel oncogenic lncRNA, confirming its association with castration-resistant prostate cancers. To date, aberrant lncRNA expression levels have been observed in numerous cancer types, making these lncRNAs novel and reliable biomarkers for cancer diagnosis and prognosis (<xref rid="b26-ol-0-0-11063" ref-type="bibr">26</xref>,<xref rid="b27-ol-0-0-11063" ref-type="bibr">27</xref>). The present study profiled the expression of lncRNAs associated with breast cancer prognosis using next-generation sequencing data, and determined a set of four lncRNAs that, when combined, may be used as a potential biomarker for the prognosis of breast cancer.</p>
<p>Previous studies have identified various biomarkers with prognostic value in breast cancer. <italic>TP53</italic> is a recognized tumor-suppressor gene, the encoded protein of which responds to a diverse range of cellular stimuli to regulate the expression of target genes. Mutations in these genes are associated with a variety of human cancers, such as gastric cancer, colorectal cancer and breast cancer (<xref rid="b28-ol-0-0-11063" ref-type="bibr">28</xref>). <italic>MKI67</italic> is a nucleoprotein-coding gene; its expression product, Ki-67, has been identified as a biomarker of cell proliferation, which is regarded as a predictor of patient outcome (<xref rid="b29-ol-0-0-11063" ref-type="bibr">29</xref>). Additionally, lncRNA <italic>HOTAIR</italic> has been confirmed to promote breast cancer invasion and metastasis (<xref rid="b30-ol-0-0-11063" ref-type="bibr">30</xref>). The roles of <italic>ESR1, PGR</italic> and <italic>ERBB2</italic>, also known as <italic>ER, PR</italic> and <italic>HER2</italic>, respectively, have been widely recognized for breast cancer molecular typing and prognostics (<xref rid="b31-ol-0-0-11063" ref-type="bibr">31</xref>). In the present study, the prognostic value of several commonly used clinical prognostic molecular indicators was evaluated using ROC analysis and was compared with that of the four-lncRNA signature. <italic>ESR1, PGR, ERBB2, MKI67</italic> and <italic>TP53</italic> were all included in the analysis, and the result showed that the AUC value of the four-lncRNA signature was greater than that of the aforementioned biomarkers, which confirmed the four lncRNA signature as a potentially superior prognostic predictor. Additionally, according to the results of Kaplan-Meier analysis, the four individual lncRNAs of the signature were not adequate as independent prognostic predictors. This showed that, although a particular lncRNA may be associated with breast cancer, it may not reliably predict patient survival. However, the combination of these four lncRNAs was able to predict the outcomes of patients with breast cancer with satisfactory sensitivity and specificity.</p>
<p>lncRNAs are expressed at numerous cellular locations and fulfill a wide variety of regulatory roles at almost all stages of gene expression (<xref rid="b32-ol-0-0-11063" ref-type="bibr">32</xref>). Although specific lncRNAs have been implicated in a number of biological processes, the majority of their functions are not fully understood. Of the four lncRNAs of the signature, <italic>PVT1</italic> is located on chromosome 8q24.21; a previous study has demonstrated that supernumerary copies of this chromosomal region are associated with various types of cancer, including breast and ovarian cancer, acute myeloid leukemia and Hodgkin&#x0027;s lymphoma (<xref rid="b33-ol-0-0-11063" ref-type="bibr">33</xref>). <italic>MAPT-AS1</italic> is an 840-bp lncRNA transcribed from the anti-sense strand of the <italic>MAPT</italic> promoter, and has been identified as a potential epigenetic regulator of <italic>MAPT</italic> expression in Parkinson&#x0027;s disease (<xref rid="b34-ol-0-0-11063" ref-type="bibr">34</xref>). However, to the best of our knowledge, there have been no reports of the association between <italic>MAPT-AS1</italic> and tumorigenesis to date. Furthermore, the biological functions of <italic>LINC00667</italic> and <italic>LINC00938</italic> remain to be elucidated. To date, to the best of our knowledge, there is no indication as to why the four-lncRNA signature may serve as a prognostic marker. lncRNAs function in complex ways, and the potential association between these molecules are crucial to understanding their underlying mechanisms of action.</p>
<p>In the present study, differential expression analysis was performed on gene expression data from TCGA and GEO databases. Compared with non-cancerous samples, <italic>PVT1, MAPT-AS1</italic> and <italic>LINC00667</italic> were upregulated, while <italic>LINC00938</italic> was downregulated in breast cancer tissues. Therefore, <italic>PVT1, MAPT-AS1</italic> and <italic>LINC00667</italic> were considered to be candidate oncogenes, while <italic>LINC00938</italic> may serve as a cancer-suppressor gene. Moreover, GO and KEGG analyses were employed to investigate the potential functions of the four lncRNAs. The results showed that <italic>PVT1</italic> and <italic>LINC00667</italic> were associated with transcription regulation, while <italic>MAPT-AS1, LINC00667</italic> and <italic>LINC00938</italic> were associated with cellular mitosis, and <italic>PVT1</italic> was associated with mitochondrial energy metabolism. These fundamental biological processes are essentially involved in tumorigenesis and cancer progression (<xref rid="b35-ol-0-0-11063" ref-type="bibr">35</xref>,<xref rid="b36-ol-0-0-11063" ref-type="bibr">36</xref>). Additionally, KEGG analysis indicated that <italic>LINC00667</italic> was associated with the &#x2018;Notch signaling pathway&#x2019;, and previous study has demonstrated that dysregulated Notch signaling is oncogenic, inhibits apoptosis and promotes cell survival (<xref rid="b37-ol-0-0-11063" ref-type="bibr">37</xref>).</p>
<p>In conclusion, the present study identified a four-lncRNA signature with predictive value for breast cancer prognosis, which may be used as a novel biomarker for the prognosis of patients with breast cancer. Although the signature may contribute to the prognostic evaluation of breast cancer, one of the limitations of the present study is that it was a bioinformatics analysis, and therefore further studies are required using clinical samples, in order to evaluate the identified four-lncRNA signature, in addition to determining the functional mechanisms of these lncRNAs.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="SD1-ol-0-0-11063" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="pdf" xlink:href="Supplementary_Data.pdf"/>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding</title>
<p>No funding was received.</p>
</sec>
<sec>
<title>Availability of data and materials</title>
<p>The breast cancer gene expression data, together with the corresponding clinical information are available from TCGA (<uri xlink:href="https://cancergenome.nih.gov/">https://cancergenome.nih.gov/</uri>). The dataset GSE5764 was downloaded from the GEO database (<uri xlink:href="https://www.ncbi.nlm.nih.gov/geo/">https://www.ncbi.nlm.nih.gov/geo/</uri>). The data of associated genes and pathways for GO and KEGG analyses are available in the DAVID database (<uri xlink:href="https://david.ncifcrf.gov/">https://david.ncifcrf.gov/</uri>).</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>JW and MZ designed the study and conducted bioinformatic analysis. QL, HH, DP, CS and MZ sorted the data and participated in the statistical analysis. MZ drafted the manuscript. DP and CS participated in drafting the manuscript and providing research guidance. JW reviewed and edited the manuscript. All authors read and approved the final manuscript. All authors agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>KEGG</term><def><p>Kyoto Encyclopedia of Genes and Genomes</p></def></def-item>
<def-item><term>lncRNA</term><def><p>long non-coding RNA</p></def></def-item>
<def-item><term>NSCLC</term><def><p>non-small cell lung cancer</p></def></def-item>
<def-item><term>OS</term><def><p>overall survival</p></def></def-item>
<def-item><term>ROC</term><def><p>receiver operating characteristic</p></def></def-item>
<def-item><term>SIS</term><def><p>sure independence screening</p></def></def-item>
<def-item><term>TCGA</term><def><p>The Cancer Genome Atlas</p></def></def-item>
</def-list>
</glossary>
<ref-list>
<title>References</title>
<ref id="b1-ol-0-0-11063"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Siegel</surname><given-names>RL</given-names></name><name><surname>Miller</surname><given-names>KD</given-names></name><name><surname>Jemal</surname><given-names>A</given-names></name></person-group><article-title>Cancer statistics, 2017</article-title><source>CA Cancer J Clin</source><volume>67</volume><fpage>7</fpage><lpage>30</lpage><year>2017</year><pub-id pub-id-type="doi">10.3322/caac.21387</pub-id><pub-id pub-id-type="pmid">28055103</pub-id></element-citation></ref>
<ref id="b2-ol-0-0-11063"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Michaelson</surname><given-names>JS</given-names></name><name><surname>Silverstein</surname><given-names>M</given-names></name><name><surname>Wyatt</surname><given-names>J</given-names></name><name><surname>Weber</surname><given-names>G</given-names></name><name><surname>Moore</surname><given-names>R</given-names></name><name><surname>Halpern</surname><given-names>E</given-names></name><name><surname>Kopans</surname><given-names>DB</given-names></name><name><surname>Hughes</surname><given-names>K</given-names></name></person-group><article-title>Predicting the survival of patients with breast carcinoma using tumor size</article-title><source>Cancer</source><volume>95</volume><fpage>713</fpage><lpage>723</lpage><year>2002</year><pub-id pub-id-type="doi">10.1002/cncr.10742</pub-id><pub-id pub-id-type="pmid">12209713</pub-id></element-citation></ref>
<ref id="b3-ol-0-0-11063"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rakha</surname><given-names>EA</given-names></name><name><surname>El-Sayed</surname><given-names>ME</given-names></name><name><surname>Menon</surname><given-names>S</given-names></name><name><surname>Green</surname><given-names>AR</given-names></name><name><surname>Lee</surname><given-names>AH</given-names></name><name><surname>Ellis</surname><given-names>IO</given-names></name></person-group><article-title>Histologic grading is an independent prognostic factor in invasive lobular carcinoma of the breast</article-title><source>Breast Cancer Res Treat</source><volume>111</volume><fpage>121</fpage><lpage>127</lpage><year>2008</year><pub-id pub-id-type="doi">10.1007/s10549-007-9768-4</pub-id><pub-id pub-id-type="pmid">17929165</pub-id></element-citation></ref>
<ref id="b4-ol-0-0-11063"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shokouh</surname><given-names>TZ</given-names></name><name><surname>Ezatollah</surname><given-names>A</given-names></name><name><surname>Barand</surname><given-names>P</given-names></name></person-group><article-title>Interrelationships Between Ki67, HER2/neu, p53, ER, and PR Status and their associations with tumor grade and lymph node involvement in breast carcinoma subtypes: Retrospective-observational analytical study</article-title><source>Medicine (Baltimore)</source><volume>94</volume><fpage>e1359</fpage><year>2015</year><pub-id pub-id-type="doi">10.1097/MD.0000000000001359</pub-id><pub-id pub-id-type="pmid">26266392</pub-id></element-citation></ref>
<ref id="b5-ol-0-0-11063"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname><given-names>J</given-names></name><name><surname>Luo</surname><given-names>DX</given-names></name><name><surname>Huang</surname><given-names>C</given-names></name><name><surname>Shen</surname><given-names>Y</given-names></name><name><surname>Bu</surname><given-names>Y</given-names></name><name><surname>Markwell</surname><given-names>S</given-names></name><name><surname>Gao</surname><given-names>J</given-names></name><name><surname>Liu</surname><given-names>J</given-names></name><name><surname>Zu</surname><given-names>X</given-names></name><name><surname>Cao</surname><given-names>Z</given-names></name><etal/></person-group><article-title>AKR1B10 overexpression in breast cancer: Association with tumor size, lymph node metastasis and patient survival and its potential as a novel serum marker</article-title><source>Int J Cancer</source><volume>131</volume><fpage>E862</fpage><lpage>E871</lpage><year>2012</year><pub-id pub-id-type="doi">10.1002/ijc.27618</pub-id><pub-id pub-id-type="pmid">22539036</pub-id></element-citation></ref>
<ref id="b6-ol-0-0-11063"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Crabb</surname><given-names>SJ</given-names></name><name><surname>Bajdik</surname><given-names>CD</given-names></name><name><surname>Leung</surname><given-names>S</given-names></name><name><surname>Speers</surname><given-names>CH</given-names></name><name><surname>Kennecke</surname><given-names>H</given-names></name><name><surname>Huntsman</surname><given-names>DG</given-names></name><name><surname>Gelmon</surname><given-names>KA</given-names></name></person-group><article-title>Can clinically relevant prognostic subsets of breast cancer patients with four or more involved axillary lymph nodes be identified through immunohistochemical biomarkers? A tissue microarray feasibility study</article-title><source>Breast Cancer Res</source><volume>10</volume><fpage>R6</fpage><year>2008</year><pub-id pub-id-type="doi">10.1186/bcr1847</pub-id><pub-id pub-id-type="pmid">18194560</pub-id></element-citation></ref>
<ref id="b7-ol-0-0-11063"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>M Braden</surname><given-names>A</given-names></name><name><surname>V Stankowski</surname><given-names>R</given-names></name><name><surname>M Engel</surname><given-names>J</given-names></name><name><surname>A Onitilo</surname><given-names>A</given-names></name></person-group><article-title>Breast cancer biomarkers: Risk assessment, diagnosis, prognosis, prediction of treatment efficacy and toxicity, and recurrence</article-title><source>Curr Pharm Des</source><volume>20</volume><fpage>4879</fpage><lpage>4898</lpage><year>2014</year><pub-id pub-id-type="doi">10.2174/1381612819666131125145517</pub-id><pub-id pub-id-type="pmid">24283956</pub-id></element-citation></ref>
<ref id="b8-ol-0-0-11063"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nagini</surname><given-names>S</given-names></name></person-group><article-title>Breast Cancer: Current molecular therapeutic targets and new players</article-title><source>Anticancer Agents Med Chem</source><volume>17</volume><fpage>152</fpage><lpage>163</lpage><year>2017</year><pub-id pub-id-type="doi">10.2174/1871520616666160502122724</pub-id><pub-id pub-id-type="pmid">27137076</pub-id></element-citation></ref>
<ref id="b9-ol-0-0-11063"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><collab collab-type="corp-author">ENCODE Project Consortium</collab><name><surname>Birney</surname><given-names>E</given-names></name><name><surname>Stamatoyannopoulos</surname><given-names>JA</given-names></name><name><surname>Dutta</surname><given-names>A</given-names></name><name><surname>Guig&#x00F3;</surname><given-names>R</given-names></name><name><surname>Gingeras</surname><given-names>TR</given-names></name><name><surname>Margulies</surname><given-names>EH</given-names></name><name><surname>Weng</surname><given-names>Z</given-names></name><name><surname>Snyder</surname><given-names>M</given-names></name><name><surname>Dermitzakis</surname><given-names>ET</given-names></name><etal/></person-group><article-title>Identification and analysis of functional elements in 1&#x0025; of the human genome by the ENCODE pilot project</article-title><source>Nature</source><volume>447</volume><fpage>799</fpage><lpage>816</lpage><year>2007</year><pub-id pub-id-type="doi">10.1038/nature05874</pub-id><pub-id pub-id-type="pmid">17571346</pub-id></element-citation></ref>
<ref id="b10-ol-0-0-11063"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname><given-names>MB</given-names></name><name><surname>Johnston</surname><given-names>RL</given-names></name><name><surname>Inostroza-Ponta</surname><given-names>M</given-names></name><name><surname>Fox</surname><given-names>AH</given-names></name><name><surname>Fortini</surname><given-names>E</given-names></name><name><surname>Moscato</surname><given-names>P</given-names></name><name><surname>Dinger</surname><given-names>ME</given-names></name><name><surname>Mattick</surname><given-names>JS</given-names></name></person-group><article-title>Genome-wide analysis of long noncoding RNA stability</article-title><source>Genome Res</source><volume>22</volume><fpage>885</fpage><lpage>898</lpage><year>2012</year><pub-id pub-id-type="doi">10.1101/gr.131037.111</pub-id><pub-id pub-id-type="pmid">22406755</pub-id></element-citation></ref>
<ref id="b11-ol-0-0-11063"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Prensner</surname><given-names>JR</given-names></name><name><surname>Chinnaiyan</surname><given-names>AM</given-names></name></person-group><article-title>The emergence of lncRNAs in cancer biology</article-title><source>Cancer Discov</source><volume>1</volume><fpage>391</fpage><lpage>407</lpage><year>2011</year><pub-id pub-id-type="doi">10.1158/2159-8290.CD-11-0209</pub-id><pub-id pub-id-type="pmid">22096659</pub-id></element-citation></ref>
<ref id="b12-ol-0-0-11063"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rinn</surname><given-names>JL</given-names></name><name><surname>Chang</surname><given-names>HY</given-names></name></person-group><article-title>Genome regulation by long noncoding RNAs</article-title><source>Annu Rev Biochem</source><volume>81</volume><fpage>145</fpage><lpage>166</lpage><year>2012</year><pub-id pub-id-type="doi">10.1146/annurev-biochem-051410-092902</pub-id><pub-id pub-id-type="pmid">22663078</pub-id></element-citation></ref>
<ref id="b13-ol-0-0-11063"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nagano</surname><given-names>T</given-names></name><name><surname>Fraser</surname><given-names>P</given-names></name></person-group><article-title>No-nonsense functions for long noncoding RNAs</article-title><source>Cell</source><volume>145</volume><fpage>178</fpage><lpage>181</lpage><year>2011</year><pub-id pub-id-type="doi">10.1016/j.cell.2011.03.014</pub-id><pub-id pub-id-type="pmid">21496640</pub-id></element-citation></ref>
<ref id="b14-ol-0-0-11063"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yoon</surname><given-names>JH</given-names></name><name><surname>Abdelmohsen</surname><given-names>K</given-names></name><name><surname>Srikantan</surname><given-names>S</given-names></name><name><surname>Yang</surname><given-names>X</given-names></name><name><surname>Martindale</surname><given-names>JL</given-names></name><name><surname>De</surname><given-names>S</given-names></name><name><surname>Huarte</surname><given-names>M</given-names></name><name><surname>Zhan</surname><given-names>M</given-names></name><name><surname>Becker</surname><given-names>KG</given-names></name><name><surname>Gorospe</surname><given-names>M</given-names></name></person-group><article-title>LincRNA-p21 suppresses target mRNA translation</article-title><source>Mol Cell</source><volume>47</volume><fpage>648</fpage><lpage>655</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.molcel.2012.06.027</pub-id><pub-id pub-id-type="pmid">22841487</pub-id></element-citation></ref>
<ref id="b15-ol-0-0-11063"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname><given-names>WX</given-names></name><name><surname>Koirala</surname><given-names>P</given-names></name><name><surname>Mo</surname><given-names>YY</given-names></name></person-group><article-title>LncRNA-mediated regulation of cell signaling in cancer</article-title><source>Oncogene</source><volume>36</volume><fpage>5661</fpage><lpage>5667</lpage><year>2017</year><pub-id pub-id-type="doi">10.1038/onc.2017.184</pub-id><pub-id pub-id-type="pmid">28604750</pub-id></element-citation></ref>
<ref id="b16-ol-0-0-11063"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bhan</surname><given-names>A</given-names></name><name><surname>Soleimani</surname><given-names>M</given-names></name><name><surname>Mandal</surname><given-names>SS</given-names></name></person-group><article-title>Long noncoding RNA and cancer: A new paradigm</article-title><source>Cancer Res</source><volume>77</volume><fpage>3965</fpage><lpage>3981</lpage><year>2017</year><pub-id pub-id-type="doi">10.1158/0008-5472.CAN-16-2634</pub-id><pub-id pub-id-type="pmid">28701486</pub-id></element-citation></ref>
<ref id="b17-ol-0-0-11063"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sanchez Calle</surname><given-names>A</given-names></name><name><surname>Kawamura</surname><given-names>Y</given-names></name><name><surname>Yamamoto</surname><given-names>Y</given-names></name><name><surname>Takeshita</surname><given-names>F</given-names></name><name><surname>Ochiya</surname><given-names>T</given-names></name></person-group><article-title>Emerging roles of long non-coding RNA in cancer</article-title><source>Cancer Sci</source><volume>109</volume><fpage>2093</fpage><lpage>2100</lpage><year>2018</year><pub-id pub-id-type="doi">10.1111/cas.13642</pub-id><pub-id pub-id-type="pmid">29774630</pub-id></element-citation></ref>
<ref id="b18-ol-0-0-11063"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gupta</surname><given-names>RA</given-names></name><name><surname>Shah</surname><given-names>N</given-names></name><name><surname>Wang</surname><given-names>KC</given-names></name><name><surname>Kim</surname><given-names>J</given-names></name><name><surname>Horlings</surname><given-names>HM</given-names></name><name><surname>Wong</surname><given-names>DJ</given-names></name><name><surname>Tsai</surname><given-names>MC</given-names></name><name><surname>Hung</surname><given-names>T</given-names></name><name><surname>Argani</surname><given-names>P</given-names></name><name><surname>Rinn</surname><given-names>JL</given-names></name><etal/></person-group><article-title>Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis</article-title><source>Nature</source><volume>464</volume><fpage>1071</fpage><lpage>1076</lpage><year>2010</year><pub-id pub-id-type="doi">10.1038/nature08975</pub-id><pub-id pub-id-type="pmid">20393566</pub-id></element-citation></ref>
<ref id="b19-ol-0-0-11063"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gutschner</surname><given-names>T</given-names></name><name><surname>Hammerle</surname><given-names>M</given-names></name><name><surname>Diederichs</surname><given-names>S</given-names></name></person-group><article-title>MALAT1-a paradigm for long noncoding RNA function in cancer</article-title><source>J Mol Med (Berl)</source><volume>91</volume><fpage>791</fpage><lpage>801</lpage><year>2013</year><pub-id pub-id-type="doi">10.1007/s00109-013-1028-y</pub-id><pub-id pub-id-type="pmid">23529762</pub-id></element-citation></ref>
<ref id="b20-ol-0-0-11063"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chandra Gupta</surname><given-names>S</given-names></name><name><surname>Nandan Tripathi</surname><given-names>Y</given-names></name></person-group><article-title>Potential of long non-coding RNAs in cancer patients: From biomarkers to therapeutic targets</article-title><source>Int J Cancer</source><volume>140</volume><fpage>1955</fpage><lpage>1967</lpage><year>2017</year><pub-id pub-id-type="doi">10.1002/ijc.30546</pub-id><pub-id pub-id-type="pmid">27925173</pub-id></element-citation></ref>
<ref id="b21-ol-0-0-11063"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Turashvili</surname><given-names>G</given-names></name><name><surname>Bouchal</surname><given-names>J</given-names></name><name><surname>Baumforth</surname><given-names>K</given-names></name><name><surname>Wei</surname><given-names>W</given-names></name><name><surname>Dziechciarkova</surname><given-names>M</given-names></name><name><surname>Ehrmann</surname><given-names>J</given-names></name><name><surname>Klein</surname><given-names>J</given-names></name><name><surname>Fridman</surname><given-names>E</given-names></name><name><surname>Skarda</surname><given-names>J</given-names></name><name><surname>Srovnal</surname><given-names>J</given-names></name><etal/></person-group><article-title>Novel markers for differentiation of lobular and ductal invasive breast carcinomas by laser microdissection and microarray analysis</article-title><source>BMC Cancer</source><volume>7</volume><fpage>55</fpage><year>2007</year><pub-id pub-id-type="doi">10.1186/1471-2407-7-55</pub-id><pub-id pub-id-type="pmid">17389037</pub-id></element-citation></ref>
<ref id="b22-ol-0-0-11063"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hortobagyi</surname><given-names>GN</given-names></name><name><surname>Edge</surname><given-names>SB</given-names></name><name><surname>Giuliano</surname><given-names>A</given-names></name></person-group><article-title>New and Important Changes in the TNM staging system for breast cancer</article-title><source>Am Soc Clin Oncol Educ Book</source><volume>38</volume><fpage>457</fpage><lpage>467</lpage><year>2018</year><pub-id pub-id-type="doi">10.1200/EDBK_201313</pub-id><pub-id pub-id-type="pmid">30231399</pub-id></element-citation></ref>
<ref id="b23-ol-0-0-11063"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>G</given-names></name><name><surname>Wang</surname><given-names>Z</given-names></name><name><surname>Wang</surname><given-names>D</given-names></name><name><surname>Qiu</surname><given-names>C</given-names></name><name><surname>Liu</surname><given-names>M</given-names></name><name><surname>Chen</surname><given-names>X</given-names></name><name><surname>Zhang</surname><given-names>Q</given-names></name><name><surname>Yan</surname><given-names>G</given-names></name><name><surname>Cui</surname><given-names>Q</given-names></name></person-group><article-title>LncRNA Disease: A database for long-non-coding RNA-associated diseases</article-title><source>Nucleic Acids Res</source><volume>41</volume><issue>(Database Issue)</issue><fpage>D983</fpage><lpage>D986</lpage><year>2013</year><pub-id pub-id-type="pmid">23175614</pub-id></element-citation></ref>
<ref id="b24-ol-0-0-11063"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname><given-names>AE</given-names></name><name><surname>Ku</surname><given-names>J</given-names></name><name><surname>Honda</surname><given-names>TK</given-names></name><name><surname>Yu</surname><given-names>V</given-names></name><name><surname>Kuo</surname><given-names>SZ</given-names></name><name><surname>Zheng</surname><given-names>H</given-names></name><name><surname>Xuan</surname><given-names>Y</given-names></name><name><surname>Saad</surname><given-names>MA</given-names></name><name><surname>Hinton</surname><given-names>A</given-names></name><name><surname>Brumund</surname><given-names>KT</given-names></name><etal/></person-group><article-title>Transcriptome sequencing uncovers novel long noncoding and small nucleolar RNAs dysregulated in head and neck squamous cell carcinoma</article-title><source>RNA</source><volume>21</volume><fpage>1122</fpage><lpage>1134</lpage><year>2015</year><pub-id pub-id-type="doi">10.1261/rna.049262.114</pub-id><pub-id pub-id-type="pmid">25904139</pub-id></element-citation></ref>
<ref id="b25-ol-0-0-11063"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ylip&#x00E4;&#x00E4;</surname><given-names>A</given-names></name><name><surname>Kivinummi</surname><given-names>K</given-names></name><name><surname>Kohvakka</surname><given-names>A</given-names></name><name><surname>Annala</surname><given-names>M</given-names></name><name><surname>Latonen</surname><given-names>L</given-names></name><name><surname>Scaravilli</surname><given-names>M</given-names></name><name><surname>Kartasalo</surname><given-names>K</given-names></name><name><surname>Lepp&#x00E4;nen</surname><given-names>SP</given-names></name><name><surname>Karakurt</surname><given-names>S</given-names></name><name><surname>Sepp&#x00E4;l&#x00E4;</surname><given-names>J</given-names></name><etal/></person-group><article-title>Transcriptome sequencing reveals PCAT5 as a novel ERG-regulated long noncoding RNA in prostate cancer</article-title><source>Cancer Res</source><volume>75</volume><fpage>4026</fpage><lpage>4031</lpage><year>2015</year><pub-id pub-id-type="doi">10.1158/0008-5472.CAN-15-0217</pub-id><pub-id pub-id-type="pmid">26282172</pub-id></element-citation></ref>
<ref id="b26-ol-0-0-11063"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gutschner</surname><given-names>T</given-names></name><name><surname>Diederichs</surname><given-names>S</given-names></name></person-group><article-title>The hallmarks of cancer: A long non-coding RNA point of view</article-title><source>RNA Biol</source><volume>9</volume><fpage>703</fpage><lpage>719</lpage><year>2012</year><pub-id pub-id-type="doi">10.4161/rna.20481</pub-id><pub-id pub-id-type="pmid">22664915</pub-id></element-citation></ref>
<ref id="b27-ol-0-0-11063"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gibb</surname><given-names>EA</given-names></name><name><surname>Brown</surname><given-names>CJ</given-names></name><name><surname>Lam</surname><given-names>WL</given-names></name></person-group><article-title>The functional role of long non-coding RNA in human carcinomas</article-title><source>Mol Cancer</source><volume>10</volume><fpage>38</fpage><year>2011</year><pub-id pub-id-type="doi">10.1186/1476-4598-10-38</pub-id><pub-id pub-id-type="pmid">21489289</pub-id></element-citation></ref>
<ref id="b28-ol-0-0-11063"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Song</surname><given-names>CV</given-names></name><name><surname>Teo</surname><given-names>SH</given-names></name><name><surname>Taib</surname><given-names>NA</given-names></name><name><surname>Yip</surname><given-names>CH</given-names></name></person-group><article-title>Surgery for BRCA, TP53 and PALB2: A literature review</article-title><source>Ecancermedicalscience</source><volume>12</volume><fpage>863</fpage><year>2018</year><pub-id pub-id-type="doi">10.3332/ecancer.2018.863</pub-id><pub-id pub-id-type="pmid">30174725</pub-id></element-citation></ref>
<ref id="b29-ol-0-0-11063"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sundara Rajan</surname><given-names>S</given-names></name><name><surname>Hanby</surname><given-names>AM</given-names></name><name><surname>Horgan</surname><given-names>K</given-names></name><name><surname>Thygesen</surname><given-names>HH</given-names></name><name><surname>Speirs</surname><given-names>V</given-names></name></person-group><article-title>The potential utility of geminin as a predictive biomarker in breast cancer</article-title><source>Breast Cancer Res Treat</source><volume>143</volume><fpage>91</fpage><lpage>98</lpage><year>2014</year><pub-id pub-id-type="doi">10.1007/s10549-013-2786-5</pub-id><pub-id pub-id-type="pmid">24292956</pub-id></element-citation></ref>
<ref id="b30-ol-0-0-11063"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>S&#x00F8;rensen</surname><given-names>KP</given-names></name><name><surname>Thomassen</surname><given-names>M</given-names></name><name><surname>Tan</surname><given-names>Q</given-names></name><name><surname>Bak</surname><given-names>M</given-names></name><name><surname>Cold</surname><given-names>S</given-names></name><name><surname>Burton</surname><given-names>M</given-names></name><name><surname>Larsen</surname><given-names>MJ</given-names></name><name><surname>Kruse</surname><given-names>TA</given-names></name></person-group><article-title>Long non-coding RNA HOTAIR is an independent prognostic marker of metastasis in estrogen receptor-positive primary breast cancer</article-title><source>Breast Cancer Res Treat</source><volume>142</volume><fpage>529</fpage><lpage>536</lpage><year>2013</year><pub-id pub-id-type="doi">10.1007/s10549-013-2776-7</pub-id><pub-id pub-id-type="pmid">24258260</pub-id></element-citation></ref>
<ref id="b31-ol-0-0-11063"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>YF</given-names></name><name><surname>Liao</surname><given-names>YY</given-names></name><name><surname>Yang</surname><given-names>M</given-names></name><name><surname>Peng</surname><given-names>NF</given-names></name><name><surname>Xie</surname><given-names>SR</given-names></name><name><surname>Xie</surname><given-names>YF</given-names></name></person-group><article-title>Discordances in ER, PR and HER2 receptors between primary and recurrent/metastatic lesions and their impact on survival in breast cancer patients</article-title><source>Med Oncol</source><volume>31</volume><fpage>214</fpage><year>2014</year><pub-id pub-id-type="doi">10.1007/s12032-014-0214-2</pub-id><pub-id pub-id-type="pmid">25216864</pub-id></element-citation></ref>
<ref id="b32-ol-0-0-11063"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wilusz</surname><given-names>JE</given-names></name><name><surname>Sunwoo</surname><given-names>H</given-names></name><name><surname>Spector</surname><given-names>DL</given-names></name></person-group><article-title>Long noncoding RNAs: Functional surprises from the RNA world</article-title><source>Genes Dev</source><volume>23</volume><fpage>1494</fpage><lpage>1504</lpage><year>2009</year><pub-id pub-id-type="doi">10.1101/gad.1800909</pub-id><pub-id pub-id-type="pmid">19571179</pub-id></element-citation></ref>
<ref id="b33-ol-0-0-11063"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tseng</surname><given-names>YY</given-names></name><name><surname>Moriarity</surname><given-names>BS</given-names></name><name><surname>Gong</surname><given-names>W</given-names></name><name><surname>Akiyama</surname><given-names>R</given-names></name><name><surname>Tiwari</surname><given-names>A</given-names></name><name><surname>Kawakami</surname><given-names>H</given-names></name><name><surname>Ronning</surname><given-names>P</given-names></name><name><surname>Reuland</surname><given-names>B</given-names></name><name><surname>Guenther</surname><given-names>K</given-names></name><name><surname>Beadnell</surname><given-names>TC</given-names></name><etal/></person-group><article-title>PVT1 dependence in cancer with MYC copy-number increase</article-title><source>Nature</source><volume>512</volume><fpage>82</fpage><lpage>86</lpage><year>2014</year><pub-id pub-id-type="doi">10.1038/nature13311</pub-id><pub-id pub-id-type="pmid">25043044</pub-id></element-citation></ref>
<ref id="b34-ol-0-0-11063"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Coupland</surname><given-names>KG</given-names></name><name><surname>Kim</surname><given-names>WS</given-names></name><name><surname>Halliday</surname><given-names>GM</given-names></name><name><surname>Hallupp</surname><given-names>M</given-names></name><name><surname>Dobson-Stone</surname><given-names>C</given-names></name><name><surname>Kwok</surname><given-names>JB</given-names></name></person-group><article-title>Role of the long non-coding RNA MAPT-AS1 in regulation of microtubule associated protein tau (MAPT) expression in Parkinson&#x0027;s disease</article-title><source>PLoS One</source><volume>11</volume><fpage>e0157924</fpage><year>2016</year><pub-id pub-id-type="doi">10.1371/journal.pone.0157924</pub-id><pub-id pub-id-type="pmid">27336847</pub-id></element-citation></ref>
<ref id="b35-ol-0-0-11063"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bradner</surname><given-names>JE</given-names></name><name><surname>Hnisz</surname><given-names>D</given-names></name><name><surname>Young</surname><given-names>RA</given-names></name></person-group><article-title>Transcriptional Addiction in Cancer</article-title><source>Cell</source><volume>168</volume><fpage>629</fpage><lpage>643</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.cell.2016.12.013</pub-id><pub-id pub-id-type="pmid">28187285</pub-id></element-citation></ref>
<ref id="b36-ol-0-0-11063"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Valcarcel-Jimenez</surname><given-names>L</given-names></name><name><surname>Gaude</surname><given-names>E</given-names></name><name><surname>Torrano</surname><given-names>V</given-names></name><name><surname>Frezza</surname><given-names>C</given-names></name><name><surname>Carracedo</surname><given-names>A</given-names></name></person-group><article-title>Mitochondrial Metabolism: Yin and yang for tumor progression</article-title><source>Trends Endocrinol Metab</source><volume>28</volume><fpage>748</fpage><lpage>757</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.tem.2017.06.004</pub-id><pub-id pub-id-type="pmid">28938972</pub-id></element-citation></ref>
<ref id="b37-ol-0-0-11063"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Aithal</surname><given-names>MG</given-names></name><name><surname>Rajeswari</surname><given-names>N</given-names></name></person-group><article-title>Role of Notch signalling pathway in cancer and its association with DNA methylation</article-title><source>J Genet</source><volume>92</volume><fpage>667</fpage><lpage>675</lpage><year>2013</year><pub-id pub-id-type="doi">10.1007/s12041-013-0284-5</pub-id><pub-id pub-id-type="pmid">24371188</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-ol-0-0-11063" position="float">
<label>Figure 1.</label>
<caption><p>Kaplan-Meier analysis of overall survival of patients using the four-long non-coding RNA signature. Kaplan-Meier curves for (A) the training-set patients (n=384) and (B) the validation-set patients (n=384). Two-sided log-rank test was performed to evaluate the survival differences between the two curves.</p></caption>
<graphic xlink:href="ol-19-01-0221-g00.tif"/>
</fig>
<fig id="f2-ol-0-0-11063" position="float">
<label>Figure 2.</label>
<caption><p>ROC analysis shows the sensitivity and specificity of the biomarkers in predicting the overall survival of patients. (A) ROC curves of the four-lncRNA signature. (B) ROC curves of several known biomarkers: <underline>TP53, MKI67, ESR1, PGR, ERBB2 and HOTAIR</underline>. lncRNA, long non-coding RNA; ROC, receiver operating characteristic.</p></caption>
<graphic xlink:href="ol-19-01-0221-g01.tif"/>
</fig>
<fig id="f3-ol-0-0-11063" position="float">
<label>Figure 3.</label>
<caption><p>Kaplan-Meier analysis of different known biomarkers for patients in the validation set. (A) <italic>TP53</italic> expression and overall survival of patients. (B) <italic>MKI67</italic> expression and overall survival of patients. (C) <italic>ESR1</italic> expression and overall survival rate of patients. (D) <italic>PGR</italic> expression and overall survival rate of patients. (E) <italic>ERBB2</italic> expression and overall survival rate of patients. (F) <italic>HOTAIR</italic> expression and overall survival rate of patients. HOTAIR, HOX transcript antisense RNA; PGR, progesterone receptor.</p></caption>
<graphic xlink:href="ol-19-01-0221-g02.tif"/>
</fig>
<fig id="f4-ol-0-0-11063" position="float">
<label>Figure 4.</label>
<caption><p>Potential biological functions of the four lncRNAs from the four-lncRNA signature. (A) GO analysis for <italic>PVT1</italic>. (B) GO analysis for <italic>MAPT-AS1</italic>. (C) GO analysis for <italic>LINC00667</italic>. (D) GO analysis for <italic>LINC00938</italic>. (E) Kyoto Encyclopedia of Genes and Genomes analysis for the four lncRNAs. GO, Gene Ontology; lncRNA, long non-coding RNA.</p></caption>
<graphic xlink:href="ol-19-01-0221-g03.tif"/>
</fig>
<table-wrap id="tI-ol-0-0-11063" position="float">
<label>Table I.</label>
<caption><p>Demographic characteristics of the 768 patients with breast cancer included in the present study.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Characteristics</th>
<th align="center" valign="bottom">Training set (n=384)</th>
<th align="center" valign="bottom">Validation set (n=384)</th>
<th align="center" valign="bottom">Total set (n=768), &#x0025;</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Sex</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Male</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">7 (0.91)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Female</td>
<td align="center" valign="top">380</td>
<td align="center" valign="top">381</td>
<td align="center" valign="top">761 (99.09)</td>
</tr>
<tr>
<td align="left" valign="top">TNM stage (22)</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Stage I</td>
<td align="center" valign="top">66</td>
<td align="center" valign="top">67</td>
<td align="center" valign="top">133 (17.32)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Stage II</td>
<td align="center" valign="top">226</td>
<td align="center" valign="top">223</td>
<td align="center" valign="top">449 (58.46)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Stage III</td>
<td align="center" valign="top">85</td>
<td align="center" valign="top">86</td>
<td align="center" valign="top">171 (22.27)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Stage IV</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">15 (1.95)</td>
</tr>
<tr>
<td align="left" valign="top">ER status</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Negative</td>
<td align="center" valign="top">90</td>
<td align="center" valign="top">92</td>
<td align="center" valign="top">182 (23.70)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Positive</td>
<td align="center" valign="top">291</td>
<td align="center" valign="top">290</td>
<td align="center" valign="top">581 (75.65)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Indeterminate</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">5 (0.65)</td>
</tr>
<tr>
<td align="left" valign="top">PR status</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Negative</td>
<td align="center" valign="top">125</td>
<td align="center" valign="top">128</td>
<td align="center" valign="top">253 (32.94)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Positive</td>
<td align="center" valign="top">256</td>
<td align="center" valign="top">254</td>
<td align="center" valign="top">510 (66.41)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Indeterminate</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">5 (0.65)</td>
</tr>
<tr>
<td align="left" valign="top">HER2 status</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Negative</td>
<td align="center" valign="top">228</td>
<td align="center" valign="top">231</td>
<td align="center" valign="top">459 (59.77)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Positive</td>
<td align="center" valign="top">92</td>
<td align="center" valign="top">87</td>
<td align="center" valign="top">179 (23.31)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Indeterminate</td>
<td align="center" valign="top">64</td>
<td align="center" valign="top">66</td>
<td align="center" valign="top">130 (16.93)</td>
</tr>
<tr>
<td align="left" valign="top">Vital status</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Alive</td>
<td align="center" valign="top">326</td>
<td align="center" valign="top">326</td>
<td align="center" valign="top">652 (84.90)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Deceased</td>
<td align="center" valign="top">58</td>
<td align="center" valign="top">58</td>
<td align="center" valign="top">116 (15.10)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;OS time (range), days</td>
<td align="center" valign="top">7-8,556</td>
<td align="center" valign="top">1-8,605</td>
<td align="center" valign="top">1-8,605</td>
</tr>
<tr>
<td align="left" valign="top">RFS status</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Relapsed</td>
<td align="center" valign="top">295</td>
<td align="center" valign="top">288</td>
<td align="center" valign="top">583 (75.91)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Relapse-free</td>
<td align="center" valign="top">89</td>
<td align="center" valign="top">96</td>
<td align="center" valign="top">185 (24.09)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;RFS time (range), days</td>
<td align="center" valign="top">7-8,556</td>
<td align="center" valign="top">1-8,391</td>
<td align="center" valign="top">1-8,556</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-ol-0-0-11063"><p>TNM, Tumor-Node-Metastasis; RFS, relapse-free survival; OS, overall survival; HER2, human epidermal growth factor receptor 2; ER, estrogen receptor; PR, progesterone receptor.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>
