Functional annotation of noncoding variants and prioritization of cancer-associated lncRNAs in lung cancer

Li,Hua; Lv,Xin

doi:10.3892/ol.2016.4604

July-2016 Volume 12 Issue 1

Full Size Image

Journals

International Journal of Molecular Medicine

International Journal of Molecular Medicine is an international journal devoted to molecular mechanisms of human disease.

International Journal of Oncology

International Journal of Oncology is an international journal devoted to oncology research and cancer treatment.

Molecular Medicine Reports

Covers molecular medicine topics such as pharmacology, pathology, genetics, neuroscience, infectious diseases, molecular cardiology, and molecular surgery.

Oncology Reports

Oncology Reports is an international journal devoted to fundamental and applied research in Oncology.

Experimental and Therapeutic Medicine

Experimental and Therapeutic Medicine is an international journal devoted to laboratory and clinical medicine.

Oncology Letters

Oncology Letters is an international journal devoted to Experimental and Clinical Oncology.

Biomedical Reports

Explores a wide range of biological and medical fields, including pharmacology, genetics, microbiology, neuroscience, and molecular cardiology.

Molecular and Clinical Oncology

International journal addressing all aspects of oncology research, from tumorigenesis and oncogenes to chemotherapy and metastasis.

World Academy of Sciences Journal

Multidisciplinary open-access journal spanning biochemistry, genetics, neuroscience, environmental health, and synthetic biology.

International Journal of Functional Nutrition

Open-access journal combining biochemistry, pharmacology, immunology, and genetics to advance health through functional nutrition.

International Journal of Epigenetics

Publishes open-access research on using epigenetics to advance understanding and treatment of human disease.

Medicine International

An International Open Access Journal Devoted to General Medicine.

July-2016 Volume 12 Issue 1

Full Size Image

Article

Functional annotation of noncoding variants and prioritization of cancer-associated lncRNAs in lung cancer

Authors:
- Hua Li
- Xin Lv
View Affiliations / Copyright

Affiliations: Department of Anesthesiology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200072, P.R. China
Pages: 222-230
|
Published online on: May 18, 2016

https://doi.org/10.3892/ol.2016.4604
Expand metrics +

Abstract

Multiple computational tools have been widely applied to the detection of coding driver mutations in cancer; however, the prioritization of pathogenic non-coding variants remains a difficult and demanding task. The present study was performed to distinguish non‑coding disease‑causing mutations from neutral ones, and to prioritize potential cancer-associated long non-coding RNAs (lncRNAs) with a logistic regression model in lung cancer. A logistic regression model was constructed, combining 19,153 disease‑associated ClinVar and Human Gene Mutation Database pathogenic variants as the response variable and non‑coding features as the predictor variable. Validation of the model was conducted with genome‑wide association study (GWAS) disease‑ or trait‑associated single nucleotide polymorphisms (SNPs) and recurrent somatic mutations. High scoring regions were characterized with respect to their distribution in various features and gene classes; potential cancer‑associated lncRNA candidates were prioritized, combining the fraction of high‑scoring regions and average score predicted by the logistic regression model. H3K79me2 was the most negative factor that contributed to the model, while conserved regions were most positively informative to the model. The area under the receiver operating characteristic curve of the model was 0.89. The model assigned a significantly higher score to GWAS SNPs and recurrent somatic mutations compared with neutral SNPs (mean, 5.9012 vs. 5.5238; P<0.001, Mann‑Whitney U test) and non‑recurrent mutations (mean, 5.4677 vs. 5.2277, P<0.001, Mann‑Whitney U test), respectively. It was observed that regions, including splicing sites and untranslated regions, and gene classes, including cancer genes and cancer-associated lncRNAs, had an increased enrichment of high‑scoring regions. In total, 2,679 cancer‑associated lncRNAs were determined and characterized. A total of 104 of these lncRNAs were differentially expressed between lung cancer and normal specimens. The logistic regression model is a useful and efficient scoring system to prioritize non‑coding pathogenic variants and lncRNAs, and may provide the basis for detecting non‑coding driver lncRNAs in lung cancer.

View Figures

View References

1	Roschke AV and Rozenblum E: Multi-layered cancer chromosomal instability phenotype. Front Oncol. 3:1–13. 2013. View Article : Google Scholar : PubMed/NCBI
2	Robison K: Application of second-generation sequencing to cancer genomics. Brief Bioinform. 11:524–534. 2010. View Article : Google Scholar : PubMed/NCBI
3	Greenman C, Stephens P, Smith R, Dalgliesh GL, Hunter C, Bignell G, Davies H, Teague J, Butler A, Stevens C, et al: Patterns of somatic mutation in human cancer genomes. Nature. 446:153–158. 2007. View Article : Google Scholar : PubMed/NCBI
4	Sim NL, Kumar P, Hu J, Henikoff S, Schneider G and Ng PC: SIFT web server: Predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 40:W452–W457. 2012. View Article : Google Scholar : PubMed/NCBI
5	Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS and Sunyaev SR: A method and server for predicting damaging missense mutations. Nat Methods. 7:248–249. 2010. View Article : Google Scholar : PubMed/NCBI
6	Huang FW, Hodis E, Xu MJ, Kryukov GV, Chin L and Garraway LA: Highly recurrent TERT promoter mutations in human melanoma. Science. 339:957–959. 2013. View Article : Google Scholar : PubMed/NCBI
7	Horn S, Figl A, Rachakonda PS, Fischer C, Sucker A, Gast A, Kadel S, Moll I, Nagore E, Hemminki K, et al: TERT promoter mutations in familial and sporadic melanoma. Science. 339:959–961. 2013. View Article : Google Scholar : PubMed/NCBI
8	ENCODE Project Consortium: An integrated encyclopedia of DNA elements in the human genome. Nature. 489:57–74. 2012. View Article : Google Scholar : PubMed/NCBI
9	Lowe CB and Haussler D: 29 mammalian genomes reveal novel exaptations of mobile elements for likely regulatory functions in the human genome. PLoS One. 7:e431282012. View Article : Google Scholar : PubMed/NCBI
10	Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, et al: The NIH roadmap epigenomics mapping consortium. Nat Biotechnol. 28:1045–1048. 2010. View Article : Google Scholar : PubMed/NCBI
11	Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, Karczewski KJ, Park J, Hitz BC, Weng S, et al: Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 22:1790–1797. 2012. View Article : Google Scholar : PubMed/NCBI
12	Ward LD and Kellis M: HaploReg: A resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 40:D930–D934. 2012. View Article : Google Scholar : PubMed/NCBI
13	Khurana E, Fu Y, Colonna V, Mu XJ, Kang HM, Lappalainen T, Sboner A, Lochovsky L, Chen J, Harmanci A, et al: Integrative annotation of variants from 1092 humans: Application to cancer genomics. Science. 342:12355872013. View Article : Google Scholar : PubMed/NCBI
14	Li J, Drubay D, Michiels S and Gautheret D: Mining the coding and non-coding genome for cancer drivers. Cancer Lett. 369:307–315. 2015. View Article : Google Scholar : PubMed/NCBI
15	Cooper GM, Stone EA and Asimenos G: NISC Comparative Sequencing Program, Green ED, Batzoglou S and Sidow A: Distribution and intensity of constraint in mammalian genomic sequence. Genome Res. 15:901–913. 2005. View Article : Google Scholar : PubMed/NCBI
16	Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, et al: Evolutionarily conserved elements in vertebrate, insect, worm and yeast genomes. Genome Res. 15:1034–1050. 2005. View Article : Google Scholar : PubMed/NCBI
17	Pollard KS, Hubisz MJ, Rosenbloom KR and Siepel A: Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 20:110–121. 2010. View Article : Google Scholar : PubMed/NCBI
18	Kircher M, Witten DM, Jain P, O'Roak BJ, Cooper GM and Shendure J: A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. 46:310–315. 2014. View Article : Google Scholar : PubMed/NCBI
19	Fu Y, Liu Z, Lou S, Bedford J, Mu XJ, Yip KY, Khurana E and Gerstein M: FunSeq2: A framework for prioritizing noncoding regulatory variants in cancer. Genome Biol. 15:4802014. View Article : Google Scholar : PubMed/NCBI
20	Jeon Y, Sarma K and Lee JT: New and Xisting regulatory mechanisms of X chromosome inactivation. Curr Opin Genet Dev. 22:62–71. 2012. View Article : Google Scholar : PubMed/NCBI
21	Mattick JS, Amaral PP, Dinger ME, Mercer TR and Mehler MF: RNA regulation of epigenetic processes. Bioessays. 31:51–59. 2009. View Article : Google Scholar : PubMed/NCBI
22	Wapinski O and Chang HY: Long noncoding RNAs and human disease. Trends Cell Biol. 21:354–361. 2011. View Article : Google Scholar : PubMed/NCBI
23	Ørom UA, Derrien T, Beringer M, Gumireddy K, Gardini A, Bussotti G, Lai F, Zytnicki M, Notredame C, Huang Q, et al: Long noncoding RNAs with enhancer-like function in human cells. Cell. 143:46–58. 2010. View Article : Google Scholar : PubMed/NCBI
24	Clark MB and Mattick JS: Long noncoding RNAs in cell biology. Semin Cell Dev Biol. 22:366–376. 2011. View Article : Google Scholar : PubMed/NCBI
25	Rando TA and Chang HY: Aging, rejuvenation, and epigenetic reprogramming: Resetting the aging clock. Cell. 148:46–57. 2012. View Article : Google Scholar : PubMed/NCBI
26	Nie L, Wu HJ, Hsu JM, Chang SS, Labaff AM, Li CW, Wang Y, Hsu JL and Hung MC: Long non-coding RNAs: Versatile master regulators of gene expression and crucial players in cancer. Am J Transl Res. 4:127–150. 2012.PubMed/NCBI
27	Gutschner T and Diederichs S: The hallmarks of cancer: A long non-coding RNA point of view. RNA Biol. 9:703–719. 2012. View Article : Google Scholar : PubMed/NCBI
28	Fang Z, Wu L, Wang L, Yang Y, Meng Y and Yang H: Increased expression of the long non-coding RNA UCA1 in tongue squamous cell carcinomas: A possible correlation with cancer metastasis. Oral Surg Oral Med Oral Pathol Oral Radiol. 117:89–95. 2014. View Article : Google Scholar : PubMed/NCBI
29	Gupta RA, Shah N, Wang KC, Kim J, Horlings HM, Wong DJ, Tsai MC, Hung T, Argani P, Rinn JL, et al: Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 464:1071–1076. 2010. View Article : Google Scholar : PubMed/NCBI
30	Guffanti A, Iacono M, Pelucchi P, Kim N, Soldà G, Croft LJ, Taft RJ, Rizzi E, Askarian-Amiri M, Bonnal RJ, et al: A transcriptional sketch of a primary human breast cancer by 454 deep sequencing. BMC Genomics. 10:1632009. View Article : Google Scholar : PubMed/NCBI
31	Garding A, Bhattacharya N, Claus R, Ruppel M, Tschuch C, Filarsky K, Idler I, Zucknick M, Caudron-Herger M, Oakes C, et al: Epigenetic upregulation of lncRNAs at 13q14.3 in leukemia is linked to the ln Cis downregulation of a gene cluster that targets NF-kB. PLoS Genet. 9:e10033732013. View Article : Google Scholar : PubMed/NCBI
32	Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SA, Behjati S, Biankin AV, Bignell GR, Bolli N, Borg A, Børresen-Dale AL, et al: Signatures of mutational processes in human cancer. Nature. 500:415–421. 2013. View Article : Google Scholar : PubMed/NCBI
33	1000 Genomes Project Consortium. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT and McVean GA: An integrated map of genetic variation from 1,092 human genomes. Nature. 491:56–65. 2012. View Article : Google Scholar : PubMed/NCBI
34	Landrum MJ, Lee JM, Riley GR, Jang W, Rubinstein WS, Church DM and Maglott DR: ClinVar: Public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42:D980–D985. 2014. View Article : Google Scholar : PubMed/NCBI
35	Stenson PD, Mort M, Ball EV, Howells K, Phillips AD, Thomas NS and Cooper DN: The human gene mutation database: 2008 update. Genome Med. 1:132009. View Article : Google Scholar : PubMed/NCBI
36	Beck T, Hastings RK, Gollapudi S, Free RC and Brookes AJ: GWAS central: A comprehensive resource for the comparison and interrogation of genome-wide association studies. Eur J Hum Genet. 22:949–952. 2014. View Article : Google Scholar : PubMed/NCBI
37	Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S, et al: GENCODE: The reference human genome annotation for the ENCODE project. Genome Res. 22:1760–1774. 2012. View Article : Google Scholar : PubMed/NCBI
38	Cabili MN, Trapnell C, Goff L, Koziol M, Tazon-Vega B, Regev A and Rinn JL: Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 25:1915–1927. 2011. View Article : Google Scholar : PubMed/NCBI
39	Pruitt KD, Tatusova T and Maglott DR: NCBI reference sequences (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35:D61–D65. 2007. View Article : Google Scholar : PubMed/NCBI
40	Ward AJ and Cooper TA: The pathobiology of splicing. J Pathol. 220:152–163. 2010.PubMed/NCBI
41	Karolchik D, Barber GP, Casper J, Clawson H, Cline MS, Diekhans M, Dreszer TR, Fujita PA, Guruvadoo L, Haeussler M, et al: The UCSC genome browser database: 2014 update. Nucleic Acids Res. 42:D764–D770. 2014. View Article : Google Scholar : PubMed/NCBI
42	Smith MA, Gesell T, Stadler PF and Mattick JS: Widespread purifying selection on RNA structure in mammals. Nucleic Acids Res. 41:8220–8236. 2013. View Article : Google Scholar : PubMed/NCBI
43	Schuster-Böckler B and Lehner B: Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature. 488:504–507. 2012. View Article : Google Scholar : PubMed/NCBI
44	Forbes SA, Bindal N, Bamford S, Cole C, Kok CY, Beare D, Jia M, Shepherd R, Leung K, Menzies A, et al: COSMIC: Mining complete cancer genomes in the catalogue of somatic mutations in cancer. Nucleic Acids Res. 39:D945–D950. 2011. View Article : Google Scholar : PubMed/NCBI
45	Ju YS, Lee WC, Shin JY, Lee S, Bleazard T, Won JK, Kim YT, Kim JI, Kang JH and Seo JS: Fusion of KIF5B and RET transforming gene in lung adenocarcinoma revealed from whole-genome and transcriptome sequencing. Genome Res. 22:436–445. 2012. View Article : Google Scholar : PubMed/NCBI
46	Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M and Gingeras TR: STAR: Ultrafast universal RNA-seq aligner. Bioinformatics. 29:15–21. 2013. View Article : Google Scholar : PubMed/NCBI
47	Quinlan AR and Hall IM: BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. 26:841–842. 2010. View Article : Google Scholar : PubMed/NCBI
48	Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL and Pachter L: Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 7:562–578. 2012. View Article : Google Scholar : PubMed/NCBI
49	Love MI, Huber W and Anders S: Moderated estimation of fold change and dispersion for RNA-Seq data with DESeq2. Genome Biol. 15:5502014. View Article : Google Scholar : PubMed/NCBI
50	Dees ND, Zhang Q, Kandoth C, Wendl MC, Schierding W, Koboldt DC, Mooney TB, Callaway MB, Dooling D, Mardis ER, et al: MuSiC: Identifying mutational significance in cancer genomes. Genome Res. 22:1589–1598. 2012. View Article : Google Scholar : PubMed/NCBI
51	Hon GC, Hawkins RD and Ren B: Predictive chromatin signatures in the mammalian genome. Hum Mol Genet. 18:R195–R201. 2009. View Article : Google Scholar : PubMed/NCBI
52	Ritchie GRS, Dunham I, Zeggini E and Flicek P: Functional annotation of noncoding sequence variants. Nat Methods. 11:294–296. 2014. View Article : Google Scholar : PubMed/NCBI
53	Washietl S, Kellis M and Garber M: Evolutionary dynamics and tissue specificity of human long noncoding. RNAs in six mammals. 616–628. 2014.
54	Loewen G, Jayawickramarajah J, Zhuo Y and Shan B: Functions of lncRNA HOTAIR in lung cancer. J Hematol Oncol. 7:902014. View Article : Google Scholar : PubMed/NCBI
55	Yang MH, Hu ZY, Xu C, Xie LY, Wang XY, Chen SY and Li ZG: MALAT1 promotes colorectal cancer cell proliferation/migration/invasion via PRKA kinase anchor protein 9. Biochim Biophys Acta. 1852:166–174. 2015. View Article : Google Scholar : PubMed/NCBI
56	Shen L, Chen L, Wang Y, Jiang X, Xia H and Zhuang Z: Long noncoding RNA MALAT1 promotes brain metastasis by inducing epithelial-mesenchymal transition in lung cancer. J Neurooncol. 121:101–108. 2015. View Article : Google Scholar : PubMed/NCBI
57	Okugawa Y, Toiyama Y, Hur K, Toden S, Saigusa S, Tanaka K, Inoue Y, Mohri Y, Kusunoki M, Boland CR and Goel A: Metastasis-associated long non-coding RNA drives gastric cancer development and promotes peritoneal metastasis. Carcinogenesis. 35:2731–2739. 2014. View Article : Google Scholar : PubMed/NCBI

Journals

International Journal of Molecular Medicine

International Journal of Oncology

Molecular Medicine Reports

Oncology Reports

Experimental and Therapeutic Medicine

Oncology Letters

Biomedical Reports

Molecular and Clinical Oncology

World Academy of Sciences Journal

International Journal of Functional Nutrition

International Journal of Epigenetics

Medicine International

Functional annotation of noncoding variants and prioritization of cancer-associated lncRNAs in lung cancer

This article is mentioned in:

Abstract

Figure 1

Figure 2

Figure 3

Figure 4

Related Articles