In silico identification of breast cancer genes by combined multiple high throughput analyses

  • Authors:
    • Dejun Shen
    • Jianbo He
    • Helena R. Chang
  • View Affiliations

  • Published online on: February 1, 2005     https://doi.org/10.3892/ijmm.15.2.205
  • Pages: 205-212
Metrics: Total Views: 0 (Spandidos Publications: | PMC Statistics: )
Total PDF Downloads: 0 (Spandidos Publications: | PMC Statistics: )


Abstract

Publicly available human genomic sequence data provide an unprecedented opportunity for researchers to decode the functionality of human genome. Such information is extremely valuable in cancer prevention diagnosis and treatment. Cancer Genome Anatomy Project (CGAP) and Gene Expression Omnibus (GEO) are two bioinformatic infrastructures for studying functional genomics. The goal of this study is to explore the feasibility of incorporating the Internet-available bioinformatic databases to discover human breast cancer-related genes. Several tools including the Gene Finder, Virtual Northern (vNorthern) and SAGE digital gene expression displayer (DGED) were used to analyze differential gene expression between benign and malignant breast tissues. A pilot study was performed using both EST and SAGE vNorthern to analyze the expression of a panel of known genes, including high abundance genes β-actin and G3PDH, low abundance genes BRCA1 and p53, tissue-specific genes CEA and PSA and two breast cancer-related genes Her2/neu and MUC1. We found a high expression of β-actin and G3PDH and a low expression of BRCA1 and p53 across different types of tissues as well as a tissue-specific expression of CEA in colon and PSA in prostate. A further analysis of 30 known breast cancer-related genes in breast cancer tissues by vNorthern demonstrated a high expression of oncogenes and low expression of tumor suppressor genes. An open-end analysis of two pools of breast cancer and benign breast tissue libraries by SAGE DGED produced 53 differentially expressed genes according to the screening criteria of a >five-fold difference and p<0.01. Further analysis by EST vNorthern and virtual microarray analysis reduced the candidate genes to six, with four down-regulated genes, ANXA1, CAV1, KRT5 and MMP7, and two up-regulated genes, ERBB2 and G1P3 in breast cancer. These findings were validated by a real-time RT-PCR analysis in eight paired human breast cancer tissue samples. We conclude that the combined multiple high throughput analyses is an effective data mining strategy in cancer gene identification. This approach may improve the usage of public available genomic data through strategic data mining of high throughput analysis.

Related Articles

Journal Cover

February 2005
Volume 15 Issue 2

Print ISSN: 1107-3756
Online ISSN:1791-244X

Sign up for eToc alerts

Recommend to Library

Copy and paste a formatted citation
x
Spandidos Publications style
Shen D, He J and Chang HR: In silico identification of breast cancer genes by combined multiple high throughput analyses. Int J Mol Med 15: 205-212, 2005
APA
Shen, D., He, J., & Chang, H.R. (2005). In silico identification of breast cancer genes by combined multiple high throughput analyses. International Journal of Molecular Medicine, 15, 205-212. https://doi.org/10.3892/ijmm.15.2.205
MLA
Shen, D., He, J., Chang, H. R."In silico identification of breast cancer genes by combined multiple high throughput analyses". International Journal of Molecular Medicine 15.2 (2005): 205-212.
Chicago
Shen, D., He, J., Chang, H. R."In silico identification of breast cancer genes by combined multiple high throughput analyses". International Journal of Molecular Medicine 15, no. 2 (2005): 205-212. https://doi.org/10.3892/ijmm.15.2.205