Differential proteomic analysis of pathway biomarkers in human breast cancer by integrated bioinformatics
- Authors:
- Published online on: August 24, 2012 https://doi.org/10.3892/ol.2012.881
- Pages: 1097-1103
Abstract
Introduction
Breast cancer is the most common neoplasia in women and its pathogenesis is related to an acquired or inherited genetic disorder influenced by environmental, behavioral or reproductive factors (1). Cancer biomarker discovery is important for both cancer biology and clinical applications. These markers may come from DNA, RNA, miRNA or proteins (2), with proteins being the most significant (3).
The development and improvement of biotechnologies has allowed researchers to perform high-throughput analyses of genomes, transcriptomes and proteomes in health and disease, and identify hundreds of potential biomarkers (4), offering the potential to discover diagnostic, prognostic or therapeutic targets. However, less than two dozen cancer biomarkers are currently approved by the Food and Drug Administration (FDA) (5), including only 9 protein biomarkers identified in the blood (6). Due to the lack of sensitivity and specificity of these known biomarkers (7), researchers continue to search for more significant targets. Proteomics is a promising approach for the discovery of cancer targets and biomarkers (8). The mapping of proteome profiles and differential proteomics has been widely performed in breast cancer to identify potential biomarkers (9). The identified proteins were reported to have potential clinical significance, and certain proteins may be used as potential diagnostic, prognostic or predictive biomarkers (10,11,12,13,14). However, due to the heterogeneity in the different studies, including experimental design, sample collection and classification and analytical method (15), these results lack good reproducibility and require further validation before they can be used in clinical detection and to explain the underlying mechanisms of breast cancer. In addition, few protein candidates were warranted to be specific to breast cancer, and were often differentially expressed in other cancer types (16). Hence, research encountered the challenge of how to decipher and use these individual results and bring them into clinical applications. In addition, an understanding of the underlying biological mechanisms of carcinogenesis and the altered molecular events in breast cancer at integrated pathway levels is necessary. Proteins do not act alone to perform biological functions, but through complex biological pathways. The discovery of these intricate pathways is essential to understanding biological mechanisms. A wide variety of cancers may also link to the same pathways that affect tumorigenesis and progression through altering protein expressions. Therefore, once these pathways are known, it may be easier to monitor different aspects of cancer progression and develop a therapeutic strategy by focusing on pathways instead of individual proteins. The enriched pathways or functions may be the most probable cause of cancer (17), and the enriched proteins involved in these processes could in turn serve as target agents in diagnosis or treatment. Several monoclonal antibodies and small molecular inhibitors have been developed to target certain molecular pathways involved in cell growth, survival and metastasis in breast cancer (18,19,20). Therefore, integrated bioinformatics should be applied when discovering cancer-associated pathways and networks to improve our understanding of cancer biology, as well as cancer diagnosis and therapeutics.
In the present study, we integrated protein profiles in breast cancers from literature and public databases to perform bioinformatical analyses. Differential serum proteomics between human breast cancer patients and healthy volunteers were performed, and western blotting was used to validate the differential expression of secretory proteins in breast cancer. Our aim was not to discover all the breast cancer-associated proteins but rather to focus on screening enriched signaling pathways in breast cancer which may provide new insights into breast cancer research. This bioinformatical insight into breast cancer-associated protein profiles may potentially provide clues for identifying new functional modules in breast cancer and may be used to understand the underlying tumorigenesis process.
Materials and methods
Patient characteristics and serum collection
Blood samples were collected from 25 breast cancer patients and 20 healthy volunteers at the Yantai Yu-Huang-Ding Hospital, China, with written informed consent and approval of the Yu-Huang-Ding Hospital research and ethics committee. Venous blood was drawn from each subject into 10-ml fasting blood tubes and allowed to clot at room temperature for 1 h. Serum was separated by centrifugation at 2000 g for 15 min at 4°C. Proteins in the supernatant were precipitated by mixing with 4 volumes of ice-cold acetone and allowing it to stand at −20°C for 1 h. After being centrifuged at 12000 g for 1 h, washed with 90% acetone and dried, the proteins were taken up in 2 ml lysis solution (7 M urea, 2 M thiourea and 65 mM dithiothreitol) and stored at −80°C.
Collection of breast cancer-associated proteins
Breast cancer protein profiles were constructed by retrieving public databases (Uniprot, Release 2011_11, http://www.uniprot.org) and literature on differential proteomes in breast cancer (dbDEPC 2.0, http://lifecenter.sgst.cn/dbdepc/index.do). All human proteins in the Uniprot database were downloaded with all annotations. Breast cancer-associated proteins were further selected manually from the downloaded data using the keywords ‘cancer’, ‘tumor’ or ‘carcinoma’. The regulation levels were recorded as ‘over-regulation’, ‘downregulation’ and ‘no-annotation’. All proteins extracted from the literature were further grouped into over- and downregulation clusters, and certain proteins with controversial expression in different reports were clustered into the no-annotation group.
Bioinformatics analysis
Ontological analysis
All breast cancer-associated proteins were classified into different protein classes according to Interpro (www.ebi.ac.uk/interpro) and Gene Ontology (GO) annotation (www.geneontology.org), and significantly enriched functions were further selected according to Panther (http://www.pantherdb.org/).
Pathway analysis
Ingenuity Pathway Analysis v8.0-2803 (Ingenuity Systems, Redwood City, CA, USA) was used to analyze pathways and networks involving the breast cancer-associated proteins. The following settings were used: reference set, Ingenuity Knowledge Base (genes only); network analysis, direct and indirect relationships; molecules per network, 35; networks per analysis, 25. All species, tissues and cell lines were used for the analysis. IPA uses Fisher’s exact test to determine which pathways (canonical pathways, toxicity pathways or biological functions) are significantly linked to the input protein set compared with the whole Ingenuity Knowledge Base.
Selection of secretory and cell surface proteins
Secretory proteins and cell surface proteins are promising biomarkers. All cancer-associated proteins were compared with the serum/plasma proteome (21) to select secretory proteins, and compared with the cell surfaceome (22) to select cell surface proteins. GO was used to further filter the results.
Proteomic analysis of human serum from patients with breast cancer and healthy volunteers
Two-dimensional gel electrophoresis and mass spectrometric analyses were performed as described in our previous study (23). Gels were made in triplicate to confirm the spot patterns and were scanned with a Z320 scanner (Founder, Beijing, China). The gel images were processed with ImageMaster software (GE Healthcare, Piscataway, NJ, USA). Images were briefly checked manually to eliminate artefacts. Following spot detection, a match set was built including all the experimental and control gels. The significance of the differential expression of protein spots between experimental and control groups was estimated by mean ratio >2.0 and the independent samples t-test; P<0.05 was considered to indicate a statistically significant result.
Western blot analysis
A total of 50 μg of pooled proteins were separated at 12% SDS-PAGE by electrophoresis and then transferred onto a nitrocellulose membrane (Millipore, Bedford, MA, USA). The membranes were blocked with 5% milk TBST (10 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.1% Tween-20) overnight at 4°C and then hybridized with the following primary antibodies: anti-ORM2 (sc-51020), anti-APOA1 (sc-69755), anti-GC (sc-18706) and anti-CLU (sc-166907), all from Santa Cruz Biotechnology, Inc. (Santa Cruz, CA, USA). The immune complexes were visualized with a DAB staining kit (Zhongshan Jinqiao Technology, China).
Results
Overview of breast cancer-associated proteins
The breast cancer-associated protein profile was mainly comprised of proteins screened from the Uniprot database and literature reports. A total of 803 breast cancer-associated proteins were screened from literature reports and 243 were selected in the Uniprot database. Finally, 1031 breast cancer-associated proteins were obtained including 514 (49.6%) upregulated and 318 (30.8%) downregulated proteins. One hundred and seventy-two proteins required further confirmation of expression.
Bioinformatics analysis
From protein lists to biological functions
According to GO analysis, Interpro and literature annotations, the enriched biological functions are listed in Table I. Metabolic process, chaperone transcription and catalytic activity-associated proteins were more significantly enriched in breast cancer-associated proteins. The comparison of up- and downregulated proteins demonstrated that more chaperone, cell adhesion, transporter and antioxidant-related proteins were present in the upregulated group, while defense/immunity and extracellular proteins were present in the downregulated group.
From biological functions to pathways
To explore the enrichment pathways of these proteins with different biological functions, a pathway analysis was performed using Ingenuity Pathway Analysis tools and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Thirty-three pathways were identified in breast cancers, including 25 pathways enriched in upregulated proteins and 8 pathways with downregulated proteins (Table II).
From biological pathways to networks
Network generation was performed using Ingenuity Pathway Analysis tools. This generated 25 networks using all the breast cancer-associated proteins and 13 networks using the upregulated proteins. Several functions were linked to more than 3 of the networks in the upregulated group and the main ones included cancer, genetic disorder, cell death and cellular movement.
Cancer/testis proteins, secretory proteins and cell surface proteins
Cancer/testis antigens are potential cancer biomarkers with restricted expression in the testis and certain cancer cells. The earlier comparison of breast cancer proteins showed that 119 testis proteins including 11 testis-specific proteins were included in this study. Twenty-one proteins were common to both the breast cancer proteins and cancer/testis antigens. The comparison of breast cancer proteins with human serum/plasma and cell surface proteins (surfaceome) revealed that the cancer proteins included 153 secretory proteins and 69 cell surface proteins
Experimental analysis by 2D-electrophoresis and western blotting
All gels were statistically analyzed through ImageMaster software, and 11 proteins were shown to be differentially expressed between breast cancer patients and healthy donors (Fig. 1A). Of the 4 proteins tested by western blotting, ORM2 and CLU were verified to be highly expressed in the serum of breast cancer patients, and APOA1 and GC were expressed at low levels (Fig. 1B).
Discussion
With the development and improvement of proteomic technology, cancer proteomes are evolving quickly and aid in the search for biomarkers and therapeutic targets. Breast cancer proteomes were widely analyzed and certain important proteins were identified (9,10). However, these data were heterogeneous and many of them lacked experimental verification and validation. In our preliminary study, 11 serum proteins were found to be differentially expressed between breast cancer patients and healthy individuals. The results provided insight into certain aspects of breast cancer, but had little correlation with all the breast cancer proteomes available. Thus, the integration of protein lists containing breast cancer proteomes to understand the correlations involved in regulation functions and networks may aid in the screening of diagnostic markers or therapeutic targets.
In the present study, integrated bioinformatical tools were used to analyze the enriched pathways and networks that are associated with breast cancer. The breast cancer protein profile was constructed, including proteins in the published literature of differential proteomics and the Uniprot database. Protein expressions in breast cancer were evaluated, and broadly categorized into three groups: upregulated, downregulated and those that had contradictory expressions in breast cancer. An ontological analysis indicated that most of these breast cancer-associated proteins were classified into different enrichment functional groups that may be involved in different aspects of cancer biology. The pathway networks analysis showed that several signaling pathways and networks were significantly associated with human breast cancer. These pathways and functions were relevant to the cancer microenvironment, invasion and metastasis processes as briefly described below.
Previous studies have shown that the cancer microenvironment is deficient in oxygen, low in glucose (24), and is is usually followed by highl glycolytic activity (25,26). Studies of breast cancers showed that hypoxic conditions may alter the expression of certain proteins that serve as hypoxic markers (27). Fifteen glycolytic enzymes were investigated in the present study which may be involved in tumor proliferation in hypoxic conditions. The protein ubiquitination pathway and the PI3K/Akt signaling pathway have been reported to be associated with the development of human breast cancers (28,29). Certain drugs have been applied to affect these pathways in the treatment of breast cancer (30,31,32,33). The RhoA signaling pathway was strongly correlated with tumor cell invasion and metastasis (34), and the Nrf2-mediated oxidative stress response activated genes encoding detoxification enzymes and antioxidant proteins to protect cells from oxidative stress. In breast cancer cells, NRP/B was able to enhance oxidative stress responses via the Nrf2 pathway (35). More than 13 cell adhesion proteins in the present study may play roles in the intercellular and cell-extracellular matrix interactions of cancer, leading to cancer invasion or metastasis (36), or participating in signal transduction, cell growth and differentiation (37). E-cadherin is one prominent adhesion molecule that forms the E-cadherin-catenin complex which plays a role in epithelial cell-cell adhesion and differentiation (38), in particular serving as a potent invasion/tumor suppressor of breast cancer (39). Previous studies indicated that the downregulation of E-cadherin was relevant to several pathways including the integrin-linked kinase (ILK) signaling pathway (40,41). The ILK signaling pathway was one prominent enrichment pathway in breast cancer, and may play an important role in hormonal cancer progression (42).
Some of these canonical pathways are known to participate in cancer processes, but the mechanism and the altered individual members are not well studied. We hypothesized that the altered expressions of different members or activators in these enriched pathways may be associated with the tumor process and its microenvironment alteration, and may serve as important targets or agents in the diagnosis and treatment of breast cancer.
More than 200 cancer/testis antigens have been listed in the cancer/testis database (http://www.cta.lncc.br/). The significant feature of these proteins is their restricted expression in the testis and low or no expression in normal tissues (43); therefore, they may be used as potential cancer vaccine targets. Twenty-one cancer/testis antigens (8%) and 119 testis proteins (11 testis-specific proteins) were included in the breast cancer protein profiles and these proteins warrant further study. Among these proteins, a recent study showed that MAGE-A3 and A4 in the peripheral blood of breast cancer patients may have potential prognostic and predictive implications (44). All of these proteins are promising specific tumor markers of breast cancer.
Due to the limitations of protein identification using current proteomic techniques, some molecules in certain pathways may be missing. However, the present study gives a new bioinformatical insight into breast cancer at systems biology levels by integrating the individual studies to identify enriched biological and molecular pathways, providing vital evidence for future research. Further studies are warranted to substantiate the enriched functions and pathways. The studies may advance our understanding of cancer biomarker discovery, and also facilitate the biological interpretation of cancer biology in a network context.