<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Molecular Medicine Reports</journal-id>
<journal-title-group>
<journal-title>Molecular Medicine Reports</journal-title></journal-title-group>
<issn pub-type="ppub">1791-2997</issn>
<issn pub-type="epub">1791-3004</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/mmr.2013.1526</article-id>
<article-id pub-id-type="publisher-id">mmr-08-02-0529</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject></subj-group></article-categories>
<title-group>
<article-title>Analysis of a novel protein in human colorectal adenocarcinoma</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>WANG</surname><given-names>KAICHENG</given-names></name></contrib>
<contrib contrib-type="author">
<name><surname>CHEN</surname><given-names>YAO</given-names></name><xref ref-type="corresp" rid="c1-mmr-08-02-0529"/></contrib>
<aff id="af1-mmr-08-02-0529">Department of Anatomy, Premedical and Forensic Medical Institute, Sichuan University, Chengdu, Sichuan 610041, P.R. China</aff></contrib-group>
<author-notes>
<corresp id="c1-mmr-08-02-0529">Correspondence to: Dr Yao Chen, Department of Anatomy, Premedical and Forensic Medical Institute, Sichuan University, 17 Renmin South Road, Chengdu, Sichuan 610041, P.R. China, E-mail: <email>chenyao62@scu.edu.cn</email></corresp></author-notes>
<pub-date pub-type="ppub">
<month>8</month>
<year>2013</year></pub-date>
<pub-date pub-type="epub">
<day>14</day>
<month>06</month>
<year>2013</year></pub-date>
<volume>8</volume>
<issue>2</issue>
<fpage>529</fpage>
<lpage>534</lpage>
<history>
<date date-type="received">
<day>04</day>
<month>02</month>
<year>2013</year></date>
<date date-type="accepted">
<day>29</day>
<month>05</month>
<year>2013</year></date></history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2013, Spandidos Publications</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<license-p>This is an open-access article licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License. The article may be redistributed, reproduced, and reused for non-commercial purposes, provided the original source is properly cited.</license-p></license></permissions>
<abstract>
<p>Colorectal adenocarcinoma (CRC) is the third most common type of cancer worldwide with a low 5-year survival rate. The present study aimed to investigate the structure and function of a novel protein identified from human colorectal adenocarcinoma (CRC). A differentially expressed sequence tag (GenBank accession number, ES274081) was collected from GenBank. Bioinformatics tools were employed to obtain the sequence of the full-length cDNA in order to localize the open reading frame and to predict the protein sequence. Mass spectrometry was used to analyze the structure of this novel protein and western blot analysis was used to confirm the expression of this protein in human CRC tissue samples. The full-length cDNA was composed of 4,283-bp nucleotides and the sequence information was obtained (GenBank accession number, NM_001013649). The corresponding protein molecule contained 165 amino acids, with a monoisotopic molecular weight of 18.6033 kDa and an isoelectric point of 8.43, determined by mass spectrometry. The protein structure and its function in adenocarcinoma were further explored. In the present study, a novel protein, which may be involved in nuclear signal transduction, was identified using bioinformatics, mass spectrometry and western blot analysis.</p></abstract>
<kwd-group>
<kwd>novel protein</kwd>
<kwd>human colorectal adenocarcinoma</kwd>
<kwd>bioinformatics</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Colorectal adenocarcinoma (CRC) is the third most common type of cancer worldwide accounting for 8.9&#x02013;9.4&#x00025; of all cancer cases (<xref rid="b1-mmr-08-02-0529" ref-type="bibr">1</xref>,<xref rid="b2-mmr-08-02-0529" ref-type="bibr">2</xref>). The 5-year survival rate for poorly differentiatied CRC is 29&#x00025; (<xref rid="b3-mmr-08-02-0529" ref-type="bibr">3</xref>).</p>
<p>Development of human CRC is a multistep process, involving numerous pathological changes in gene expression and protein function. In a previous study, a cDNA subtraction library was established and from this, 86 differentially expressed sequence tags in human CRC were identified using the suppression subtractive hybridization technique combined with cDNA microarray (<xref rid="b4-mmr-08-02-0529" ref-type="bibr">4</xref>). Among these newly identified differentially expressed tags, mRNA expression of ES274081 was observed to be downregulated in human CRC by quantitative real time-polymerase chain reaction (<xref rid="b5-mmr-08-02-0529" ref-type="bibr">5</xref>), indicating a potential role in the development of human CRC. In the present study, to explore the function of NM_001013649, the corresponding protein, HCRCN81, was identified using bioinformatic tools, mass spectrometry and western blot analysis.</p></sec>
<sec sec-type="methods">
<title>Materials and methods</title>
<sec>
<title>Cloning of differentially expressed tags in human CRC</title>
<p>Differentially expressed tags were identified from a cDNA subtraction library, using the suppression subtractive hybridization technique combined with cDNA microarray (GenBank accession number, ES274081). Internet sources and bioinformatics analysis software packages are shown in <xref rid="tI-mmr-08-02-0529" ref-type="table">Table I</xref>.</p></sec>
<sec>
<title>Bioinformatical analysis</title>
<p>Using the website, <ext-link xlink:href="http://www.ncbi.nlm.nih.gov/unigene" ext-link-type="uri">http://www.ncbi.nlm.nih.gov/unigene</ext-link>, a search was performed on the accession number (ES274081) and the sequence information of a 4,283-bp cDNA molecule (GenBank accession number, NM_001013649) was obtained. Using the open reading frame (ORF) finder tool provided by NCBI, the initiation codon was predicted according to the principle that the initiation codon in the Kozak sequence must be suitable for translation initiation (<xref rid="b6-mmr-08-02-0529" ref-type="bibr">6</xref>). According to the Chou-Fasman prediction method of the secondary structure of proteins, 41 amino acids are capable of forming 2 helices. The hydrophobic portion of NM_001013649 was analyzed with ProtScale and the &#x003B1;-helix and &#x003B2;-sheet structures were predicted using PredictProtein. In addition, the coded amino acid sequence was analyzed using BLAST, CPHmodels, SMART, Pfam and Motif Scan.</p></sec>
<sec>
<title>Mass spectrometric analysis</title>
<p>Mass spectrometry (Proevolab, Beijing, China) was used to confirm the predicted amino acid sequence of NM_001013649 (<xref rid="b7-mmr-08-02-0529" ref-type="bibr">7</xref>). Briefly, the protein sample was digested on ice with trypsin (0.01 &#x003BC;g/&#x003BC;l; 10 &#x003BC;l). Following removal of trypsin, the digested sample was incubated with an ammonium bicarbonate solution (5 &#x003BC;l; 25 mM) at 37&#x000B0;C overnight. The incubated sample (2 &#x003BC;l) was mixed. The liquid gradient was set at 136 min and the MS acquisition was set at 110 min. For the first-round scan, Fourier transform mass spectrometry was used with a range of 400&#x02013;1,500 Da. For the second-round scan, linear ion trap quadrupole was used.</p></sec>
<sec>
<title>Western blot analysis</title>
<p>Tissue samples were obtained from two patients: a 74-year-old female patient with rectal tubular villous carcinoma, with a complication of moderate epithelial hyperplasia and a 76-year-old male patient with rectal adenocarcinoma. Tumor and normal tissue located 5 cm from the tumor tissues were collected to detect the expression of the target protein using 12&#x00025; SDS-PAGE. For each reaction, 30 &#x003BC;g samples were used and the antibody serum was diluted by 1:500. The antibody was made to order by CWBio, Beijing, China (patent number: 201210445862.6). Informed consent was obtained from the patients and the study was approved by the Ethics Committee of Sichuan University.</p></sec></sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title>Bioinformatic analyses</title>
<p>Using the NCBI database, a 4,283 bp target full-length cDNA located in chromosome 2, ORF 68 (2p11.2) was obtained. An initiation codon (ATG) and a termination codon (TGA) were located at the 73&#x02013;75 and 571&#x02013;573 nucleotides of this sequence, respectively, which defined the longest ORF in this sequence, representing 166 amino acids (<xref rid="f1-mmr-08-02-0529" ref-type="fig">Fig. 1</xref>).</p>
<p>The corresponding protein molecule contained 166 amino acids with a comparative MW of 18,750.9 kDa and a pI of 8.44. Protein hydrophilic analysis (<xref rid="f2-mmr-08-02-0529" ref-type="fig">Fig. 2</xref>) revealed the presence of a hydrophilic fragment, in the region of 80&#x02013;90 amino acids and a hydrophobic fragment in the region of 115&#x02013;125 amino acids. Using the tool NetNGlyc, no <italic>O</italic>- or <italic>N</italic>-linkage glycosylation locus was observed in this sequence. Using NetPhos, potential phosphorylation sites were observed at the following amino acid residues: 39, 56, 78, 86, 89, 92, 93, 94, 95, 98, 103, 116, 120, 134, 138, 141 and 164 (<xref rid="f3-mmr-08-02-0529" ref-type="fig">Fig. 3</xref>). In addition, no transmembrane region or orientation was identifed in this sequence. The results of the coiled-coil analysis is shown in <xref rid="f4-mmr-08-02-0529" ref-type="fig">Fig. 4</xref>. &#x003B2;-turn analysis indicated that a &#x003B2;-turn may be present near amino acid residues 6, 13, 60, 93, 131 and 145 (<xref rid="f5-mmr-08-02-0529" ref-type="fig">Fig. 5</xref>).</p></sec>
<sec>
<title>HCRCN81</title>
<p>In the predicted secondary structure of the HCRCN81 protein, 56 amino acid residues were involved in the formation of &#x003B1;-helices, including the 25&#x02013;31, 38&#x02013;54, 86&#x02013;88, 130&#x02013;139 and 145&#x02013;163 residues, accounting for 33.73&#x00025; of the entire protein sequence. By contrast, 16 amino acids were involved in the formation of &#x003B2;-sheets, including the 69&#x02013;70, 107&#x02013;113 and 119&#x02013;125 amino acid residues, accounting for 9.64&#x00025; of the protein sequence (<xref rid="f6-mmr-08-02-0529" ref-type="fig">Fig. 6</xref>). Using the tool CPHmodels (<ext-link xlink:href="http://www.cbs.dtu.dk/services/CPHmodels/" ext-link-type="uri">http://www.cbs.dtu.dk/services/CPHmodels/</ext-link>) provided by the Center for Biological Sequence Analysis (Lyngby, Denmark) the tertiary structure of this protein was predicted (<xref rid="f7-mmr-08-02-0529" ref-type="fig">Fig. 7</xref>).</p>
<p>In addition, the localization pattern of the protein, HCRCN81, was explored. Using SignalP, no signal peptide of 1&#x02013;70 amino acids was identified (<xref rid="f8-mmr-08-02-0529" ref-type="fig">Fig. 8</xref>). Utilizing TargetP, it was noted that there was a reduced possibility of the protein, HCRCN81, to be localized in mitochondria, endoplasmic reticulum or other components of the secretory pathway. Analysis using the tool NetNES revealed the presence of a leucine-rich nuclear export signal (NES) at amino acid residues 147&#x02013;155 of the protein, HCRCN81 (<xref rid="f9-mmr-08-02-0529" ref-type="fig">Fig. 9</xref>).</p>
<p>Analysis of the protein sequence using SMART and Pfam (<xref rid="b8-mmr-08-02-0529" ref-type="bibr">8</xref>), revealed two areas with low complexity at residues 52&#x02013;65 and 87&#x02013;104 of HCRCN81. This protein is a member of the UPF0561 family and shares a domain containing 1&#x02013;126 amino acids with the UPF0561 family members. Using SMART, three regions in HCRCN81 that may be homologs of other known structures were identified: i) 11&#x02013;39 amino acids shared with SCOP: d1fn9a (E-value 2.20e&#x0002B;00), which was present in the outer capsid protein &#x003C3;3 (<xref rid="b9-mmr-08-02-0529" ref-type="bibr">9</xref>); ii) 90&#x02013;108 amino acids shared with SCOP: d1eg3a2 (E-value 7.30e&#x0002B;00), which was present in the EF-hand domain (<xref rid="b10-mmr-08-02-0529" ref-type="bibr">10</xref>); iii) 110&#x02013;145 amino acids shared with SCOPL: d1qlma_(E-value 1.10e&#x0002B;00), which was present in methenyltetrahydromethanopterin cyclohydrolase (<xref rid="b11-mmr-08-02-0529" ref-type="bibr">11</xref>).</p>
<p>Analysis using Motif Scan revealed cAMP- and cGMP-dependent protein kinase phosphorylation sites at residues 53&#x02013;56 and 161&#x02013;164; casein kinase II phosphorylation sites at residues 98&#x02013;101 and 141&#x02013;144; protein kinase C phosphorylation sites at residues 134&#x02013;136; tyrosine kinase phosphorylation sites at residues 63&#x02013;70 and bipartite nuclear localization signal at residues 30&#x02013;44.</p>
<p>Using the tool, PeptideMass, it was observed that HCRCN81 may be digested into peptides, the of which sequences are shown in <xref rid="f10-mmr-08-02-0529" ref-type="fig">Fig. 10</xref> (selected 500&#x02013;3,000 kDa). Mass spectrometric analysis revealed that the protein molecule contained 165 amino acids, had lost a histidine in the terminal region, had a monoisotopic MW of 18.6033 and a pI of 8.43. The sequences are shown in <xref rid="f11-mmr-08-02-0529" ref-type="fig">Fig. 11</xref>, which covered 81.3&#x00025; of the entire protein molecule. Taking the remaining the peptide sequences into consideration, results of mass spectrometry were considered to be accurate.</p>
<p>Western blot analysis demonstrated a clear band at the position of 18 kDa (<xref rid="f12-mmr-08-02-0529" ref-type="fig">Fig. 12</xref>).</p></sec></sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>Following completion of the Human Genome Project and current progression of the post-Human Genome Project, a large amount of genomic information is available at present. To improve understanding of the significant amount of the notable increase of biological information, bioinformatics has become a useful tool for biological and biomedical studies. In the early stages of the Human Genome Project, small clones were prioritized for full-length sequencing (<xref rid="b12-mmr-08-02-0529" ref-type="bibr">12</xref>). A number of software and network tools have been significant in cloning novel genes. In particular, a large amount of information on expressed sequence tags (ESTs) has led to marked changes in the methods used for identifying and cloning novel genes. A series of gene expression analyses (<xref rid="b13-mmr-08-02-0529" ref-type="bibr">13</xref>) and large-scale sequencing studies of ORF tags have made a significant contribution to the definition of the transcriptome and the improvement of genome annotation (<xref rid="b14-mmr-08-02-0529" ref-type="bibr">14</xref>).</p>
<p>In the present study, a number of network tools and software sets were used to analyze the target sequence, which was beneficial in studying the function of the HCRCN81 protein. More specifically, the full-length cDNA sequence from the NCBI database was used, BLAST was used to search the ESTs and DNAstar was employed for <italic>in silico</italic> cloning. An overlap of two EST alkali bases of &gt;40 bp, with a shared similarity of 95&#x00025; in the overlapping region, confirmed an accurate match in sequences (<xref rid="b15-mmr-08-02-0529" ref-type="bibr">15</xref>). Following this, the EST was extended for as long as possible and the matching sequences were assembled together forming a new EST for BLAST analysis to identify more matching sequences. This approach was repeated until no more matching sequences were found. Following this, the obtained full-length cDNA sequence was analyzed using bioinformatic tools and databases from the internet (<xref rid="b16-mmr-08-02-0529" ref-type="bibr">16</xref>) and a novel protein, HCRCN81, involved in human CRC, was identified. For this identification, the first step was to translate the cDNA sequence into an amino acid sequence. Initially, numerous initiation codons were identified in NM_001013649. Following this, ORF was used for analysis and it was noted that the majority of these initiation codons resulted in sequences that were too short for a protein molecule. Therefore, the longest peptide sequence was determined as the protein molecule. This result was validated by uniGene in NCBI. The full-length amino acid sequence was then obtained and the physical and chemical characteristics of the protein molecule were analyzed using multiple tools. In particular, ProtParam and ProtScale provided by ExPASy were used to determine the MW and theoretical pI. Additional software sets and tools were used to determine the hydrophobic region, phosphorylation sites and <italic>O</italic>- and <italic>N</italic>-linkage glycosylation sites within the sequence. Following this, the protein structure was predicted. Jpred and PredictProtein were used to analyze the secondary structure. Results of the two approaches were consistent. To predict the tertiary structure of the target protein, the sequence was analyzed using SWISS-MODEL and CBS. The SWISS-MODEL did not yield a result, however, CBS analysis revealed a partial structure of the protein between 108&#x02013;166 amino acids.</p>
<p>In addition, the function of this protein was investigated. Since NLS and NES were predicted to be present in this protein, it was speculated that the protein may be involved in nucleic function. The outer capsid protein &#x003C3;3 is capable of stimulating translation by blocking the activation of the dsRNA-dependent protein kinase, EIF2AK2/PKR (<xref rid="b17-mmr-08-02-0529" ref-type="bibr">17</xref>). The function of methenyltetrahydromethanopterin cyclohydrolase is to catalyze the reversible interconversion from 5-formyl-H(<xref rid="b4-mmr-08-02-0529" ref-type="bibr">4</xref>)MPT to methenyl-H(<xref rid="b4-mmr-08-02-0529" ref-type="bibr">4</xref>)MPT(&#x0002B;), by performing the following catalytic reaction: 5,10-methenyl-5,6,7,8-tetrahydromethanopterin &#x0002B; H<sub>2</sub>O &#x0003D; 5-formyl-5,6,7,8-tetrahydromethanopterin. In addition, casein kinase II activation has been reported as a downstream event of Wnt signaling activation (<xref rid="b18-mmr-08-02-0529" ref-type="bibr">18</xref>) and tyrosine kinases have been found to transport into the nuclear regions where gene expression may be modified (<xref rid="b19-mmr-08-02-0529" ref-type="bibr">19</xref>). Considering the presence of numerous potential phosphorylation sites, particularly kinase phosphorylation sites in the EF-hand domain, it was hypothesized that, following synthesis, this target protein may be transported into the nucleus, due to the presence of an NLS, which may be phosphorylated, leading to the exposure of its functional regions. It may then bind to Ca<sup>2&#x0002B;</sup> through the EF-hand domain (<xref rid="b10-mmr-08-02-0529" ref-type="bibr">10</xref>), exposing the NES region, which, in turn, may lead to its transportation out of the nucleus. Consistent with this hypothesis, it has been reported that the EF-hand is directly associated with chronic inflammatory disorders and cancer (<xref rid="b15-mmr-08-02-0529" ref-type="bibr">15</xref>). However, the function of the UPF0651 family remains unclear. A previous study observed downregulation of the mRNA of this gene in human CRC (<xref rid="b5-mmr-08-02-0529" ref-type="bibr">5</xref>). Combined, these observations indicate that this novel protein is potentially key to the development of human CRC.</p>
<p>Mass spectrometry was used to confirm the hypothesis. PeptideMass was used to predict the small peptide sequences following trypsin treatment. By comparing the prediction and the mass spectrometry results, it was confirmed that the prediction of the amino acid sequence of the protein HCRCN81 was accurate.</p>
<p>Western blot analysis, using antibody serum specific for HCRCN81, revealed that HCRCN81 was expressed in tumor and adjacent normal tissues. Previous studies (<xref rid="b4-mmr-08-02-0529" ref-type="bibr">4</xref>,<xref rid="b5-mmr-08-02-0529" ref-type="bibr">5</xref>) have revealed that mRNA expression of NM_001013649 is downregulated in human CRC tissue. Among 30 human CRC tissue samples, 5 revealed upregulated NM_001013649 mRNA expression, whereas 25 demonstrated downregulated NM_001013649 mRNA expression, accounting for 83&#x00025; of all tested samples. This observation indicated the potential involvement of NM_001013649 in CRC pathogenesis. In addition, NM_001013649 downregulation was detected in 91&#x00025; of the moderately differentiated samples (21/23) but only in 50&#x00025; of the poorly differentiated tissue samples (3/6) and the difference was considered to be statistically significant (Fisher&apos;s exact probability test, P&lt;0.05). This observation reveals a possible correlation between NM_001013649.3 transcriptional expression and CRC tumor stage.</p>
<p>In the present study, using bioinformatic tools, the structure of a novel protein, HCRCN81, translated from NM_001013649.3 mRNA, was analyzed. Using western blot analysis, protein expression of HCRCN81 in cell lines and colorectal tissue samples was detected, with a MW of ~18 kDa, consistent with the hypothesis. In addition, potential NLS and NES from the protein sequence of HCRCN81 and the EF-hand domain were predicted. This observation was in accordance with the hypothesis that HCRCN81 may function as a cell cycle regulator in the nucleus, which, in turn, may explain its possible role in CRC pathogenesis. Mass spectrometry was used to determine the amino acid sequence of the protein, the result of which was consistent with the prediction using bioinformatic tools. The bioinformatic analysis of the present study also indicated that HCRCN81 is a member of the UPF0561 family. The protein function of the UPF0561 family has not been well characterized. The current study indicates that proteins of the UPF0561 family may have a similar function to HCRCN81 in cell cycle regulation and additional studies on this family must be performed.</p></sec></body>
<back>
<ack>
<title>Acknowledgements</title>
<p>This study was supported by a grant from the Sichuan University for Stomatological Key Laboratories (SKLODSCU20090021).</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-mmr-08-02-0529"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pisani</surname><given-names>P</given-names></name><name><surname>Parkin</surname><given-names>DM</given-names></name><name><surname>Ferlay</surname><given-names>J</given-names></name></person-group><article-title>Estimates of the worldwide incidence of eighteen major cancers in 1985. Implications for prevention and projections of future burden</article-title><source>Int J Cancer</source><volume>55</volume><fpage>891</fpage><lpage>903</lpage><year>1993</year></element-citation></ref>
<ref id="b2-mmr-08-02-0529"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Parkin</surname><given-names>DM</given-names></name><name><surname>Bray</surname><given-names>F</given-names></name><name><surname>Ferlay</surname><given-names>J</given-names></name><name><surname>Pisani</surname><given-names>P</given-names></name></person-group><article-title>Global cancer statistics</article-title><source>CA Cancer J Clin</source><volume>55</volume><fpage>74</fpage><lpage>108</lpage><year>2005</year></element-citation></ref>
<ref id="b3-mmr-08-02-0529"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Burton</surname><given-names>S</given-names></name><name><surname>Norman</surname><given-names>AR</given-names></name><name><surname>Brown</surname><given-names>G</given-names></name><name><surname>Abulafi</surname><given-names>AM</given-names></name><name><surname>Swift</surname><given-names>RI</given-names></name></person-group><article-title>Predictive poor prognostic factors in colonic carcinoma</article-title><source>Surg Oncol</source><volume>15</volume><fpage>71</fpage><lpage>78</lpage><year>2006</year></element-citation></ref>
<ref id="b4-mmr-08-02-0529"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>Y</given-names></name><name><surname>Zhang</surname><given-names>YZ</given-names></name><name><surname>Zhou</surname><given-names>ZG</given-names></name><name><surname>Wang</surname><given-names>G</given-names></name><name><surname>Yi</surname><given-names>ZH</given-names></name></person-group><article-title>Identification of differently expressed genes in human colorectal adenocarcinoma</article-title><source>World J Gastroenterol</source><volume>12</volume><fpage>1025</fpage><lpage>1032</lpage><year>2006</year></element-citation></ref>
<ref id="b5-mmr-08-02-0529"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname><given-names>Q</given-names></name><name><surname>Zhang</surname><given-names>C</given-names></name><name><surname>Chen</surname><given-names>Y</given-names></name></person-group><article-title>NM_001013649.3 gene is downregulated in human colorectal adenocarcinoma</article-title><source>Mol Med Rep</source><volume>4</volume><fpage>1279</fpage><lpage>1281</lpage><year>2011</year></element-citation></ref>
<ref id="b6-mmr-08-02-0529"><label>6</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>C</given-names></name><name><surname>He</surname><given-names>F</given-names></name></person-group><source>Bioinformatics methods and practices</source><publisher-name>Sci Press</publisher-name><fpage>64</fpage><lpage>142</lpage><year>2002</year></element-citation></ref>
<ref id="b7-mmr-08-02-0529"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Guthals</surname><given-names>A</given-names></name><name><surname>Bandeira</surname><given-names>N</given-names></name></person-group><article-title>Peptide identification by tandem mass spectrometry with alternate fragmentation modes</article-title><source>Mol Cell Proteomics</source><volume>11</volume><fpage>550</fpage><lpage>557</lpage><year>2012</year></element-citation></ref>
<ref id="b8-mmr-08-02-0529"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schultz</surname><given-names>J</given-names></name><name><surname>Milpetz</surname><given-names>F</given-names></name><name><surname>Bork</surname><given-names>P</given-names></name><name><surname>Ponting</surname><given-names>CP</given-names></name></person-group><article-title>SMART, a simple modular architecture research tool: Identification of signaling domains</article-title><source>Proc Natl Acad Sci USA</source><volume>95</volume><fpage>5857</fpage><lpage>5864</lpage><year>1998</year></element-citation></ref>
<ref id="b9-mmr-08-02-0529"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schiff</surname><given-names>LA</given-names></name><name><surname>Nibert</surname><given-names>ML</given-names></name><name><surname>Co</surname><given-names>MS</given-names></name><etal/></person-group><article-title>Distinct binding sites for zinc and double-stranded RNA in the reovirus outer capsid protein sigma 3</article-title><source>Mol Cell Biol</source><volume>8</volume><fpage>273</fpage><lpage>283</lpage><year>1988</year></element-citation></ref>
<ref id="b10-mmr-08-02-0529"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chazin</surname><given-names>WJ</given-names></name></person-group><article-title>Relating form and function of EF-hand calcium binding proteins</article-title><source>Acc Chem Res</source><volume>44</volume><fpage>171</fpage><lpage>179</lpage><year>2011</year></element-citation></ref>
<ref id="b11-mmr-08-02-0529"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Upadhyay</surname><given-names>V</given-names></name><name><surname>Demmer</surname><given-names>U</given-names></name><name><surname>Warkentin</surname><given-names>E</given-names></name><etal/></person-group><article-title>Structure and catalytic mechanism of N(5),N(10)-methenyl-tetrahydromethanopterin cyclohydrolase</article-title><source>Biochemistry</source><volume>51</volume><fpage>8435</fpage><lpage>8443</lpage><year>2012</year></element-citation></ref>
<ref id="b12-mmr-08-02-0529"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kawai</surname><given-names>J</given-names></name><name><surname>Shinagawa</surname><given-names>A</given-names></name><name><surname>Shibata</surname><given-names>K</given-names></name><etal/></person-group><article-title>The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium: Functional annotation of a full-length mouse cDNA collection</article-title><source>Nature</source><volume>409</volume><fpage>685</fpage><lpage>690</lpage><year>2011</year></element-citation></ref>
<ref id="b13-mmr-08-02-0529"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Saha</surname><given-names>S</given-names></name><name><surname>Sparks</surname><given-names>AB</given-names></name><name><surname>Rago</surname><given-names>C</given-names></name><etal/></person-group><article-title>Using the transcriptome to annotate the genome</article-title><source>Nature Biotechnol</source><volume>20</volume><fpage>508</fpage><lpage>512</lpage><year>2002</year></element-citation></ref>
<ref id="b14-mmr-08-02-0529"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Camargo</surname><given-names>AA</given-names></name><name><surname>Samaia</surname><given-names>HP</given-names></name><name><surname>Dias-Neto</surname><given-names>E</given-names></name><etal/></person-group><article-title>The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome</article-title><source>Proc Natl Acad Sci USA</source><volume>98</volume><fpage>12103</fpage><lpage>12108</lpage><year>2001</year></element-citation></ref>
<ref id="b15-mmr-08-02-0529"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Radha</surname><given-names>V</given-names></name><name><surname>Nambirajan</surname><given-names>S</given-names></name><name><surname>Swarup</surname><given-names>G</given-names></name></person-group><article-title>Association of Lyn tyrosine kinase with the nuclear matrix and cell-cycle-dependent changes in matrix-associated tyrosine kinase activity</article-title><source>Eur J Biochem</source><volume>236</volume><fpage>352</fpage><lpage>359</lpage><year>1996</year></element-citation></ref>
<ref id="b16-mmr-08-02-0529"><label>16</label><element-citation publication-type="book"><article-title>General Higher Education Eleventh Five-Year national planning materials</article-title><source>Genomics</source><person-group person-group-type="editor"><name><surname>Yang</surname><given-names>J</given-names></name></person-group><publisher-name>Higher Education Press</publisher-name><publisher-loc>China</publisher-loc><fpage>74</fpage><lpage>88</lpage><year>2002</year></element-citation></ref>
<ref id="b17-mmr-08-02-0529"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Farsetta</surname><given-names>DL</given-names></name><name><surname>Chandran</surname><given-names>K</given-names></name><name><surname>Nibert</surname><given-names>ML</given-names></name></person-group><article-title>Transcriptional activities of reovirus RNA polymerase in recoated cores. Initiation and elongation are regulated by separate mechanisms</article-title><source>J Biol Chem</source><volume>275</volume><fpage>39693</fpage><lpage>39701</lpage><year>2000</year></element-citation></ref>
<ref id="b18-mmr-08-02-0529"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gebhardt</surname><given-names>C</given-names></name><name><surname>N&#x000E9;meth</surname><given-names>J</given-names></name><name><surname>Angel</surname><given-names>P</given-names></name><name><surname>Hess</surname><given-names>J</given-names></name></person-group><article-title>S100A8 and S100A9 in inflammation and cancer</article-title><source>Biochem Pharmacol</source><volume>72</volume><fpage>1622</fpage><lpage>1631</lpage><year>2006</year></element-citation></ref>
<ref id="b19-mmr-08-02-0529"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname><given-names>Y</given-names></name><name><surname>Wang</surname><given-names>HY</given-names></name></person-group><article-title>Casein kinase 2 is activated and essential for Wnt/beta-catenin signaling</article-title><source>J Biol Chem</source><volume>281</volume><fpage>18394</fpage><lpage>18400</lpage><year>2006</year></element-citation></ref></ref-list></back>
<floats-group>
<fig id="f1-mmr-08-02-0529" position="float">
<label>Figure 1</label>
<caption>
<p>ORF finder.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g00.gif"/></fig>
<fig id="f2-mmr-08-02-0529" position="float">
<label>Figure 2</label>
<caption>
<p>Hydrophobic analysis.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g01.gif"/></fig>
<fig id="f3-mmr-08-02-0529" position="float">
<label>Figure 3</label>
<caption>
<p>Phosphate analysis.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g02.gif"/></fig>
<fig id="f4-mmr-08-02-0529" position="float">
<label>Figure 4</label>
<caption>
<p>Coiled-coil analysis.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g03.gif"/></fig>
<fig id="f5-mmr-08-02-0529" position="float">
<label>Figure 5</label>
<caption>
<p>&#x003B2;-turn analysis.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g04.gif"/></fig>
<fig id="f6-mmr-08-02-0529" position="float">
<label>Figure 6</label>
<caption>
<p>Secondary structure analysis.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g05.gif"/></fig>
<fig id="f7-mmr-08-02-0529" position="float">
<label>Figure 7</label>
<caption>
<p>Tertiary structure.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g06.gif"/></fig>
<fig id="f8-mmr-08-02-0529" position="float">
<label>Figure 8</label>
<caption>
<p>SignalP result.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g07.gif"/></fig>
<fig id="f9-mmr-08-02-0529" position="float">
<label>Figure 9</label>
<caption>
<p>NetNES result.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g08.gif"/></fig>
<fig id="f10-mmr-08-02-0529" position="float">
<label>Figure 10</label>
<caption>
<p>PeptideMass.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g09.gif"/></fig>
<fig id="f11-mmr-08-02-0529" position="float">
<label>Figure 11</label>
<caption>
<p>Mass spectrometry result.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g10.gif"/></fig>
<fig id="f12-mmr-08-02-0529" position="float">
<label>Figure 12</label>
<caption>
<p>Western blot analysis. T1, protein sample from tumor tissues of a 74-year-old female patient with rectal tubular and villous carcinoma complicated with moderate epithelial hyperplasia. N1, protein sample from normal tissues 5 cm from the tumor mass. T2, protein sample from tumor tissues of a 76-year-old male patient with rectal adenocarcinoma. N2, protein sample from normal tissues 5 cm away from the tumor mass.</p></caption>
<graphic xlink:href="MMR-08-02-0529-g11.gif"/></fig>
<table-wrap id="tI-mmr-08-02-0529" position="float">
<label>Table I</label>
<caption>
<p>Internet sources.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Internet Sources</th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://www.ncbi.nlm.nih.gov/unigene" ext-link-type="uri">http://www.ncbi.nlm.nih.gov/unigene</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://blast.ncbi.nlm.nih.gov/Blast.cgi" ext-link-type="uri">http://blast.ncbi.nlm.nih.gov/Blast.cgi</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://www.ncbi.nlm.nih.gov/gorf/gorf.html" ext-link-type="uri">http://www.ncbi.nlm.nih.gov/gorf/gorf.html</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://web.expasy.org/protscale/" ext-link-type="uri">http://web.expasy.org/protscale/</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://www.predictprotein.org/" ext-link-type="uri">http://www.predictprotein.org/</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://www.cbs.dtu.dk/services/NetNGlyc/" ext-link-type="uri">http://www.cbs.dtu.dk/services/NetNGlyc/</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://www.cbs.dtu.dk/services/NetPhos/" ext-link-type="uri">http://www.cbs.dtu.dk/services/NetPhos/</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://www.cbs.dtu.dk/services/CPHmodels/" ext-link-type="uri">http://www.cbs.dtu.dk/services/CPHmodels/</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://smart.embl-heidelberg.de/" ext-link-type="uri">http://smart.embl-heidelberg.de/</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://hits.isb-sib.ch/cgi-bin/PFSCAN" ext-link-type="uri">http://hits.isb-sib.ch/cgi-bin/PFSCAN</ext-link></td></tr>
<tr>
<td align="left" valign="top"><ext-link xlink:href="http://web.expasy.org/peptide_mass" ext-link-type="uri">http://web.expasy.org/peptide_mass</ext-link></td></tr></tbody></table></table-wrap></floats-group></article>
