<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">IJO</journal-id>
<journal-title-group>
<journal-title>International Journal of Oncology</journal-title></journal-title-group>
<issn pub-type="ppub">1019-6439</issn>
<issn pub-type="epub">1791-2423</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/ijo.2015.3136</article-id>
<article-id pub-id-type="publisher-id">ijo-47-04-1393</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject></subj-group></article-categories>
<title-group>
<article-title>Multifaceted roles of 5&#x02032;-regulatory region of the cancer associated gene <italic>B4GALT1</italic> and its comparison with the gene family</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>AL-OBAIDE</surname><given-names>MOHAMMED A. IBRAHIM</given-names></name><xref rid="af1-ijo-47-04-1393" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author">
<name><surname>ALOBYDI</surname><given-names>HYTHAM</given-names></name><xref rid="af2-ijo-47-04-1393" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author">
<name><surname>ABDELSALAM</surname><given-names>ABDELSALAM G.</given-names></name><xref rid="af3-ijo-47-04-1393" ref-type="aff">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>ZHANG</surname><given-names>RUIWEN</given-names></name><xref rid="af4-ijo-47-04-1393" ref-type="aff">4</xref></contrib>
<contrib contrib-type="author">
<name><surname>SRIVENUGOPAL</surname><given-names>KALKUNTE S.</given-names></name><xref rid="af1-ijo-47-04-1393" ref-type="aff">1</xref><xref ref-type="corresp" rid="c1-ijo-47-04-1393"/></contrib></contrib-group>
<aff id="af1-ijo-47-04-1393">
<label>1</label>Department of Biomedical Sciences, School of Pharmacy, Texas Tech University Health Sciences Center, Amarillo, TX 79106, USA</aff>
<aff id="af2-ijo-47-04-1393">
<label>2</label>Biomedica, LLC, Sterling Heights, MI, USA</aff>
<aff id="af3-ijo-47-04-1393">
<label>3</label>Department of Mathematics, Statistics and Physics, College of Arts and Sciences, Qatar University, Doha, Qatar</aff>
<aff id="af4-ijo-47-04-1393">
<label>4</label>Department of Pharmaceutical Sciences, School of Pharmacy, Texas Tech University Health Sciences Center, Amarillo, TX 79106, USA</aff>
<author-notes>
<corresp id="c1-ijo-47-04-1393">Correspondence to: Dr Kalkunte Srivenugopal, Department of Biomedical Sciences, School of Pharmacy, Texas Tech University Health Sciences Center, 1406 S. Coulter Drive, Amarillo, TX 79106, USA, E-mail: <email>kalkunte.srivenugopal@ttuhsc.edu</email></corresp></author-notes>
<pub-date pub-type="collection">
<month>10</month>
<year>2015</year></pub-date>
<pub-date pub-type="epub">
<day>27</day>
<month>08</month>
<year>2015</year></pub-date>
<volume>47</volume>
<issue>4</issue>
<fpage>1393</fpage>
<lpage>1404</lpage>
<history>
<date date-type="received">
<day>25</day>
<month>06</month>
<year>2015</year></date>
<date date-type="accepted">
<day>05</day>
<month>08</month>
<year>2015</year></date></history>
<permissions>
<copyright-statement>Copyright: &#x000A9; Al-Obaide.</copyright-statement>
<copyright-year>2015</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0">
<license-p>This is an open access article distributed under the terms of a Creative Commons Attribution License</license-p></license></permissions>
<abstract>
<p>&#x003B2;1,4-Galactosylransferases are a family of enzymes encoded by seven <italic>B4GALT</italic> genes and are involved in the development of anticancer drug resistance and metastasis. Among these genes, the <italic>B4GALT1</italic> shows significant variations in the transcript origination sites in different cell types/tissues and encodes an interesting dually partitioning &#x003B2;-1, 4-galacto-syltransferase protein. We identified at 5&#x02032;-end of <italic>B4GALT1</italic> a 1.454 kb sequence forming a transcription regulatory region, referred to by us as the TR1-PE1, had all characteristics of a bidirectional promoter directing the transcription of <italic>B4GALT1</italic> in a divergent manner along with its long non-coding RNA (lncRNA) antisense counterpart <italic>B4GALT1-AS1</italic>. The TR1-PE1 showed unique dinucleotide base-stacking energy values specific to transcription factor binding sites (TFBSs), INR and BRE, and harbored CpG Island (CGI) that showed GC skew with potential for R-loop formation at the transcription starting sites (TSSs). The 5&#x02032;-regulatory axis of <italic>B4GALT1</italic> also included five more novel TFBSs for CTCF, GLI1, TCF7L2, GATA3 and SOX5, in addition to unique (TG)<sub>18</sub> repeats in conjunction with 22 nucleotide TG-associated sequence (TGAS). The five lncRNA <italic>B4GALT1-AS1</italic> transcripts showed significant complementarity with <italic>B4GALT1</italic> mRNA. In contrast, the rest of <italic>B4GALT</italic> genes showed fewer lncRNAs, and all lacked the (TG)<sub>18</sub> and TGAS. Our results are strongly supported by the FANTOM5 study which showed tissue-specific variations in transcript origination sites for this gene. We suggest that the unique expression patterns for the <italic>B4GALT1</italic> in normal and malignant tissues are controlled by a differential usage of 5&#x02032;-<italic>B4GALT1</italic> regulatory units along with a post-transcriptional regulation by the antisense RNA, which in turn govern the cell-matrix interactions, neoplastic progression, anticancer drug sensitivity, and could be utilized in personalized therapy.</p></abstract>
<kwd-group>
<kwd>bidirectional promoter</kwd>
<kwd>drug resistance</kwd>
<kwd>lncRNA</kwd>
<kwd>metastasis</kwd>
<kwd>personalized therapy</kwd>
<kwd>transcription factor binding sites</kwd>
<kwd>TG repeats</kwd>
<kwd>TG-associated sequences</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>The enzymatic glycosylation of proteins and lipids is a fundamental process in biology. &#x003B2;1, 4-Galactosylransferases are a family of enzymes that catalyze the formation of &#x003B2;4-N-acetyllactosamine linkages in extracellular matrices by transferring the UDP-bound galactose to terminal N-acetylglucosamines in carbohydrate chains (<xref rid="b1-ijo-47-04-1393" ref-type="bibr">1</xref>,<xref rid="b2-ijo-47-04-1393" ref-type="bibr">2</xref>). At least 7 members of this family (Gal T1 to Gal T7) have been characterized, reflecting the redundancy and a fine regulation of the glycoforms synthesized by these enzymes in specific cell-cell and cell-matrix adhesions. Of these, <italic>B4GALT1</italic> is the most studied and its cDNA was first cloned in 1986 (<xref rid="b3-ijo-47-04-1393" ref-type="bibr">3</xref>); since then, it has served as a prototype for characterization of several galatosyltrasferases across the mammalian species (<xref rid="b4-ijo-47-04-1393" ref-type="bibr">4</xref>). The <italic>B4GALT1</italic> gene generates two types of protein isoforms termed the long (399 amino acids) and short (386 amino acids). The two isoforms are type II membrane-bound glycoproteins and reside in Golgi apparatus and a portion of the longer protein functions on the cell surface as a recognition molecule by binding with appropriate glycoside substrates (<xref rid="b5-ijo-47-04-1393" ref-type="bibr">5</xref>). Consequently, the <italic>B4GALT1</italic> plays important roles in numerous physiological and pathological processes such as inflammation, sperm-egg interaction, embryogenesis, and morphogenesis, development of the central nervous system, cell migration, cancer progression and metastasis (<xref rid="b6-ijo-47-04-1393" ref-type="bibr">6</xref>&#x02013;<xref rid="b8-ijo-47-04-1393" ref-type="bibr">8</xref>).</p>
<p>Although considered mainly as a housekeeping gene, a large number of studies have shown a differential expression of <italic>B4GALT1</italic> in mammary gland, brain and other normal tissues such as the cartilage (<xref rid="b7-ijo-47-04-1393" ref-type="bibr">7</xref>,<xref rid="b9-ijo-47-04-1393" ref-type="bibr">9</xref>,<xref rid="b10-ijo-47-04-1393" ref-type="bibr">10</xref>). On the other hand, this gene has been highly implicated in oncogenesis and tumor progression (<xref rid="b6-ijo-47-04-1393" ref-type="bibr">6</xref>,<xref rid="b11-ijo-47-04-1393" ref-type="bibr">11</xref>&#x02013;<xref rid="b13-ijo-47-04-1393" ref-type="bibr">13</xref>). Leukemia, melanoma and cancers of the breast, lung, ovary, liver, prostate and their metastatic counterparts have been shown to possess elevated levels of expression and cell-surface <italic>B4GALT1</italic> (<xref rid="b11-ijo-47-04-1393" ref-type="bibr">11</xref>,<xref rid="b12-ijo-47-04-1393" ref-type="bibr">12</xref>,<xref rid="b14-ijo-47-04-1393" ref-type="bibr">14</xref>). Thus, its expression levels may alter the distribution and profile of cancer antigens. Also, in a nude mouse model, the number of peritoneal dissemination foci of the antisense <italic>B4GALT1</italic>-transfected ovarian tumor cells was smaller than that of the control cells, suggesting the involvement of this enzyme in the invasive and metastatic ovarian cancer (<xref rid="b15-ijo-47-04-1393" ref-type="bibr">15</xref>). Furthermore, <italic>B4GALT1</italic> has been strongly linked with multidrug resistance (<xref rid="b8-ijo-47-04-1393" ref-type="bibr">8</xref>,<xref rid="b16-ijo-47-04-1393" ref-type="bibr">16</xref>), tumor sensitivity to cisplatin (<xref rid="b17-ijo-47-04-1393" ref-type="bibr">17</xref>,<xref rid="b18-ijo-47-04-1393" ref-type="bibr">18</xref>) and promotion of cell death signaling pathways (<xref rid="b19-ijo-47-04-1393" ref-type="bibr">19</xref>). Additionally, the gene undergoes promoter methylation in several cancer types and consequent silencing of the gene (<xref rid="b13-ijo-47-04-1393" ref-type="bibr">13</xref>,<xref rid="b20-ijo-47-04-1393" ref-type="bibr">20</xref>,<xref rid="b21-ijo-47-04-1393" ref-type="bibr">21</xref>).</p>
<p>Little information is available on the regulation of <italic>B4GALT1</italic> transcription and the involvement of regulatory elements in the process. A single promoter is believed to mediate the varied and tissue-specific expression of the <italic>B4GALT1</italic> (<xref rid="b6-ijo-47-04-1393" ref-type="bibr">6</xref>,<xref rid="b13-ijo-47-04-1393" ref-type="bibr">13</xref>,<xref rid="b20-ijo-47-04-1393" ref-type="bibr">20</xref>,<xref rid="b21-ijo-47-04-1393" ref-type="bibr">21</xref>). However, a recent landmark analysis of promoters called the FANTOM5 (functional annotation of the mammalian genome 5) chose the <italic>B4GALT1</italic> as a representative gene and showed significant variations in the transcript origination sites in different cell types and patient specimens (<xref rid="b22-ijo-47-04-1393" ref-type="bibr">22</xref>). They reported a <italic>B4GALT1</italic> promoter at 5&#x02032; end and observed 266 bp CGI associated with <italic>B4GALT1</italic> transcription. Encouraged by these observations, we undertook a systematic study to characterize the alternative promoters of <italic>B4GALT1</italic>. In the present study, we report the characteristics of the <italic>B4GALT1</italic> alternative promoters and molecular features of the 5&#x02032; regulatory region of the gene, which revealed multiple regulatory sequences, composed of 1.454 kb and its association with the long non-coding RNA gene, <italic>B4GALT1-AS1</italic>. Given the redundant and compensatory roles of &#x003B2;1, 4-galactosylransferases in glycan synthesis and extracellular matrix interactions (<xref rid="b1-ijo-47-04-1393" ref-type="bibr">1</xref>), we also screened and compared the regulatory elements in other genes of the <italic>B4GALT</italic> family and compared them with the prototype <italic>B4GALT1</italic>.</p></sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title>Genomic databases</title>
<p>The genomic criteria and alternative promoters of the divergent loci, <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic> and the other <italic>B4GALT</italic> genes were searched during January-December 2014 in seven genomics databases, NCBI-GenBank, Transcriptional Regulatory Element Database (TRED), Mammalian Promoter/Enhancer Database (PEDB), Eukaryotic promoter database (EPD), Mammalian Promoter Database (MPromDb), Ensembl and UCSC Genome Browser. Several tools in the databases were used to retrieve the sequences, identify the strand (forward or reverse), flip the strand and in search for a specific sequence. NCBI-dbSNP was used to search for SNPs in the <italic>B4GALT1</italic> regulatory region.</p></sec>
<sec>
<title>Verification of map locations</title>
<p>The precise genomic map locations of the identified sequences of the <italic>B4GALT1</italic> alternative promoters were verified and updated to hg38 by using the BLAT tool, Gene Sorter and Table Browser tools of UCSC Genome Bioinformatics database.</p></sec>
<sec>
<title>Search for regulatory elements in the alternative promoters</title>
<p>The identified sequences of the <italic>B4GALT1</italic> alternative promoters were analyzed for TFBSs, namely, TATA-8 (TATAWAWR) and TATA-532 (HWHWWWWR, excluding: HTYTTTWR, CAYTTTWR, MAMAAAAR and CTYAAAAR), INR (YYANWYY), CCAAT and its inverted sequence TAACC, BRE (SSRCGCC), and DPE (RGWCGTG) binding sites (<xref rid="b23-ijo-47-04-1393" ref-type="bibr">23</xref>&#x02013;<xref rid="b26-ijo-47-04-1393" ref-type="bibr">26</xref>). The TG tandem repeats were identified by Blat and Blast tools. Identification of CGIs in alternative promoters was searched at 100-bp window (N=100) moving across the sequence at 1-bp intervals, parameter sets used to search for CGIs in the alternative promoters: Observed/Expected CpG &#x02265;0.6 and &#x00025;G+C &gt;55&#x00025; (<xref rid="b27-ijo-47-04-1393" ref-type="bibr">27</xref>,<xref rid="b28-ijo-47-04-1393" ref-type="bibr">28</xref>). The ratio Observed/Expected (O/E) CpG was calculated according to the equation reported by Gardiner-Garden and Frommer (<xref rid="b27-ijo-47-04-1393" ref-type="bibr">27</xref>).</p></sec>
<sec>
<title>Structural features of TR1-PE1 regulatory sequence</title>
<p>The dinucleotide base-stacking energy values were derived from values of dinucleotide base-stacking energy provided by Ornstein <italic>et al</italic> (<xref rid="b29-ijo-47-04-1393" ref-type="bibr">29</xref>). According to the scale, which is in kcal/mol, the range of values from &#x02212;3.82 kcal/mol (unstack easily) to &#x02212;14.59 kcal/mol (difficult to unstack), thus, the obtained values show the relative dissociation stability of the double helix structure. Whereas, GC skew is calculated as &#x0005B;(G&#x02212;C)/(G+C)&#x0005D;, where C and G denote the numbers of cytosine and guanine (<xref rid="b30-ijo-47-04-1393" ref-type="bibr">30</xref>,<xref rid="b31-ijo-47-04-1393" ref-type="bibr">31</xref>). GC skew is useful for predicting the R loop formation and the origin and terminus of replication. Programs written in R were used to analyze the sequences and to plot the data of dinucleotide base-stacking energy values (Kcal/mol/dimer) and GC skew along the length of the TR1-PE1 sequence.</p></sec>
<sec>
<title>Blast tree map tool</title>
<p>NCBI standard nucleotide BLAST tool was used to search by pairwise alignments for similar sequences to the regulatory sequences identified in the study. The distance tree of the obtained pairwise comparisons was produced to show evolutionary relatedness of regulatory sequences among species.</p></sec>
<sec>
<title>Statistical analysis</title>
<p>The regulatory sequences were analyzed and achieved using Excel software and programs written in R. The independence of each promoter element was examined using Fisher's exact probability test.</p></sec></sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title>Promoters and antisense lncRNA loci of B4GALT family members</title>
<p>Search in the databases showed that the expression of the seven genes of the <italic>B4GALT</italic> family is controlled by alternative promoters. The numbers of identified alternative promoters for each gene varied from nine to sixteen and were located on six chromosomes (<xref rid="tI-ijo-47-04-1393" ref-type="table">Table I</xref>). In addition, antisense lncRNA divergent loci were identified for four <italic>B4GALT</italic> genes, namely; <italic>B4GALT1</italic>, <italic>B4GALT4</italic>, <italic>B4GALT6</italic> and <italic>B4GALT7</italic> (<xref rid="tII-ijo-47-04-1393" ref-type="table">Table II</xref>). Data search showed five species of antisense lncRNA transcripts for <italic>B4GALT1</italic>. In contrast, the three other <italic>B4GALT</italic> genes, <italic>B4GALT4</italic>, <italic>B4GALT6</italic> and <italic>B4GALT7</italic> were each associated with a single lncRNA transcript. These observations encouraged us to perform a comprehensive search of the regulatory regions of <italic>B4GALT1</italic> locus and characterize the functional elements and structural features controlling the transcription of this gene.</p></sec>
<sec>
<title>Genomic context of B4GALT1 alternative promoters</title>
<p>The human <italic>B4GALT1</italic> (uc003zsg.2) is located on the negative strand of the short arm of chromosome 9: 33110641-33167358 at 9p13 and was found to be divergently paired head to head with <italic>B4GALT1-AS1</italic> (uc033cop.1). <italic>B4GALT1</italic> has been assumed to harbor a single promoter of 500 bp (<xref rid="b6-ijo-47-04-1393" ref-type="bibr">6</xref>,<xref rid="b13-ijo-47-04-1393" ref-type="bibr">13</xref>,<xref rid="b20-ijo-47-04-1393" ref-type="bibr">20</xref>). In this context, we were, however, surprised to find a large number of alternative promoters for <italic>B4GALT1</italic>. Our rigorous analysis revealed a total of thirteen alternative promoters for <italic>B4GALT1</italic>, although many of them were overlapping with TR1, HP2 and HP3 sequences (<xref rid="tIII-ijo-47-04-1393" ref-type="table">Table III</xref> and <xref rid="f1-ijo-47-04-1393" ref-type="fig">Fig. 1</xref>). The map location of HP1 (chr9: 33103518-33104577) indicated it is located outside of the <italic>B4GALT1</italic> locus, whereas three other promoters, HP2 (chr9: 33124856-33126364), HP3 (chr9: 33156756-33158354), and ES1 (chr9: 33128098-33128697) were found located within the space of <italic>B4GALT1</italic> locus. HP4 (chr9: 33125681-33126181) and HP5 (chr9: 33157101-33157692) sequences were located within HP2 and HP3 sequences, respectively. Our analysis also showed that one third of TR2 sequence (chr9: 3110342-33111341) was located outside the 3&#x02032; end of the gene locus. Whereas, CP1, EP1, ES2 and ES3 were located within the vicinity of TR1 sequence (<xref rid="tIII-ijo-47-04-1393" ref-type="table">Table III</xref>) and this was verified by sequence alignment analyses (data not shown). Furthermore, approximately half (458 bp) of TR1 sequence (chr9: 33166901-33167900) was located inside <italic>B4GALT1</italic> locus at 5&#x02032; side and 42 bp overlapped the sequence at 3&#x02032; side of <italic>B4GALT1-AS1</italic> reverse complement (chr9: 33167859-33179983), the remaining of TR1 sequence, 500 bp, is the space between the two divergent genes <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic> (<xref rid="f2-ijo-47-04-1393" ref-type="fig">Fig. 2</xref>). Furthermore, we observed five nucleotides at the 3&#x02032; side of PE1 (chr9: 33167354-33168354) overlapped with the <italic>B4GALT1</italic> sequence at the 5&#x02032; end and more than half of PE1 sequence overlapped TR1 sequence outside the 5&#x02032; side of <italic>B4GALT1</italic> (<xref rid="f2-ijo-47-04-1393" ref-type="fig">Fig. 2</xref>). The outcome of map location analysis of the six overlapping alternative promoter sequences (TR1, CP1, EP1, ES2, ES3 and PE1) at 5&#x02032;-end of <italic>B4GALT1</italic> revealed the presence of a complex regulatory unit, which we designated TR1-PE1 and is located at chr9: 33166901-33168354.</p></sec>
<sec>
<title>Molecular and structural characteristics of the TR1-PE1</title>
<p>Alternative promoters of <italic>B4GALT1</italic> could be divided into two groups according to &#x00025;G+C and Observed/Expected CpG values, which are indicators of CGIs (<xref rid="tIII-ijo-47-04-1393" ref-type="table">Table III</xref>). Accordingly, alternative promoters with CGIs were: TR1, CP1, EP1, ES2, ES3 and PE1, which form the TR1-PE1 regulatory complex; whereas the rest of alternative promoters lack CGIs. TR1-PE1 sequence was found rich with BRE and INR sequences. The clustering pattern of INR and BRE was unique, seven out of nine INR sequences were clustered in the 5&#x02032;-side of TR1-PE1 region and the four identified BRE sequences were located in the 3&#x02032;-TR1-PE1 side (<xref rid="f3-ijo-47-04-1393" ref-type="fig">Fig. 3</xref>). In contrast, TR1-PE1 sequence does not contain any of TATA-8 and GC-box sequences, but it does contain two TAACC in the upstream region and one TATA-532, TTCTTAAA, along with two INR sequences downstream TR1-PE1. In addition, the TR1-PE1 sequence harbored other regulatory sequences in the BRE region, such as three estrogen response elements (ERE) and a muscle actin promoter factor (MAPF) binding element, which were found within the CP1 alternative promoter reported by Choi <italic>et al</italic> (<xref rid="b6-ijo-47-04-1393" ref-type="bibr">6</xref>).</p>
<p>Our next effort was to identify the TSSs within the regulatory complex. As expected the regulatory unit TR1-PE1 with an overlapping six alternative promoter sequences contained six TSSs along a sequence of 156 bp at chr9: 33167351-33167506 (<xref rid="f3-ijo-47-04-1393" ref-type="fig">Fig. 3</xref>). This region is rich in the BRE sequences and is comparable to the reported 266 bp CGI (chr9: 33167138-33167403) associated with <italic>B4GALT1</italic> transcription (<xref rid="b22-ijo-47-04-1393" ref-type="bibr">22</xref>).</p>
<p>To find out the possible correlation between structural characteristics and distribution of TFBSs along TR1-PE1 sequence, we investigated the dinucleotide base-stacking energy values of this sequence. The INR sites were found mainly along sequences easily unstack, whereas BRE sequences were observed at sequences comparatively difficult to unstack (<xref rid="f4-ijo-47-04-1393" ref-type="fig">Fig. 4</xref>). Three regions termed A, B and C with calculated stacking energy values of &#x02212;7.661, &#x02212;8.448 and &#x02212;8.524 kcal/mol harbored INR, INR and TATA-532, respectively. Whereas, BRE sequences were clustered within region D composed of 156 bp at 33167351-33167506, which showed &#x02212;9 to &#x02212;9.75 kcal/mol. Also, we observed GC skew in the TR1-PE1 CGI region at TSS vicinity (<xref rid="f5-ijo-47-04-1393" ref-type="fig">Fig. 5</xref>). GC skew is a result of strand asymmetry down-stream TSSs and therefore it is an indication of possible formation of R-loops that are correlated with un-methylated status of CGI (see Discussion).</p>
<p>Experimental evidence supporting the validity of our results on the presence of multiple TSSs within the TR1-PE1 sequence has been obtained as a part of the FANTOM5 consortium (<xref rid="b22-ijo-47-04-1393" ref-type="bibr">22</xref>). These researchers used Cap analysis of gene expression (CAGE) to map the promoters and sets of transcripts for <italic>B4GALT1</italic> in numerous human tissues and cell lines. The CAGE patterns obtained for the 266 bases <italic>B4GALT1</italic> transcription initiation region located at chr9: 33167138-33167403, which lies within the TR1-PE1 identified here (Fig. 1 in ref. <xref rid="b22-ijo-47-04-1393" ref-type="bibr">22</xref>) clearly suggested the presence of several sites for transcript initiation, very similar to the putative start sites mapped by us (<xref rid="f3-ijo-47-04-1393" ref-type="fig">Fig. 3</xref>). Thus, the present study verifies and lends credence to the observed data and likely to be helpful in corroborating the promoter databases, some of which are being constantly revised.</p></sec>
<sec>
<title>SNPs in the TR1-PE1 regulatory region</title>
<p>Search in the NCBI-dbSNP showed ten SNPs in the TR1-PE1 regulatory region at chr9: 33166911-33167159. Map locations and the type of SNPs are: chr9: 33166911 (missense), chr9: 33166941 (missense), chr9: 33166981 (synonymous), chr9: 33167008 (synonymous), chr9: 33167049 (missense), chr9: 33167109 (missense), chr9: 33167131 (synonymous), chr9: 33167135 (missense), chr9: 33167143 (missense) and chr9: 33167159 (missense). These results highlight another layer of regulation in the TR1-PE1 axis because the SNPs have the potential for altering the binding affinities of transcription factors and changing the transcription efficiency and levels.</p></sec>
<sec>
<title>Sequence alignment of B4GALT1 and B4GALT1-AS1 transcripts</title>
<p>The possible association of <italic>B4GALT1-AS1</italic>, which encodes an lncRNA in the regulation of <italic>B4GALT1</italic> expression, motivated us to explore the sequence complementarity of divergent transcripts. As mentioned in <xref rid="tII-ijo-47-04-1393" ref-type="table">Table II</xref>, five <italic>B4GALT1-AS1</italic> transcripts were identified compared to one antisense transcript for each of <italic>B4GALT4</italic>, <italic>B4GALT6</italic> and <italic>B4GALT7</italic> loci. The sequence alignment analysis between primary <italic>B4GALT1</italic> transcript of 4124 bp (NM_001497.3) and predicted <italic>B4GALT1</italic> transcript (XM_005251440.2) with the five <italic>B4GALT1-AS1</italic> transcripts showed complementarity in the range 59.1 and 94.7&#x00025;, suggesting the potential for duplex formation. Consistent with this finding, the TR1-PE1 has all features and properties of a bidirectional promoter including a divergent or &#x02018;head-to-head&#x02019; configuration of <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic> with a 500-bp intervening sequence between them. In addition, the G+C contents, CGIs and TFBSs of TR1-PE1 confirm to the properties found in bidirectional promoters (<xref rid="b32-ijo-47-04-1393" ref-type="bibr">32</xref>&#x02013;<xref rid="b35-ijo-47-04-1393" ref-type="bibr">35</xref>). We propose that such bidirectional multi-faceted regulatory region might be present in the genomic space of the other three <italic>B4GALT</italic> family members with anti-sense divergent loci, <italic>B4GALT4</italic>, <italic>B4GALT6</italic> and <italic>B4GALT7</italic>. It is worth mentioning that the promoters of <italic>B4GALT</italic> family members are of different types and sequences, and may contain specific regulatory features in <italic>B4GALT4</italic>, <italic>B4GALT6</italic> and <italic>B4GALT7</italic> loci.</p></sec>
<sec>
<title>Identification of TG repeats in HP3 promoter close to TR1-PE1 regulatory sequence</title>
<p>We investigated the possible involvement of other regulatory sequences near the TR1-PE1 region in the expression of <italic>B4GALT1</italic>, for example TG repeats (<xref rid="b36-ijo-47-04-1393" ref-type="bibr">36</xref>). We found two (TG)<sub>18</sub> tandem repeats at chr9: 33157065-33157158 located in the intronic alternative promoter HP3 placed 10.146 kb from 3&#x02032;-TR1-PE1 side (<xref rid="f6-ijo-47-04-1393" ref-type="fig">Fig. 6</xref>). Our analysis also showed presence of a tag sequence next to (TG)<sub>18</sub> sequence. The sequence is composed of 22 bases (TAGTTCCCTATGCTGGACACCG), located at the 3&#x02032; side of both (TG)<sub>18</sub> repeats, we refer to as the TG Associated Sequence, TGAS. The two sequences were not observed in other promoters of <italic>B4GALT</italic> family members, but they were observed in other loci located in five other chromosomes (<xref rid="tIV-ijo-47-04-1393" ref-type="table">Table IV</xref>). Notably, these TG repeats and TGAS sequences were linked with cancer genes, for example SOX5, which is reported to be associated with glioma, prostate, testicular seminomas and colorectal cancer. Thus, our data suggest possible regulatory roles for the TG tandem repeats and TGAS in tumorigenesis. In support of this assumption, we identified four novel types of TFBS for SOX5, GLI1, TCF7L2 and GATA3 in the 14.151 kb (chr9: 33167360-33153209) at 5&#x02032;-side of <italic>B4GALT1</italic> regulatory region that includes TR1-PE1 bidirectional promoter, intronic HP3 promoter, TG repeats and TGAS. The number of identified TFBSs for SOX5 (five), GLI1 (one), TCF7L2 (six) and GATA3 (six); we noted that TCF sites are tied to GATA sites. These TFBSs are known to be associated with cancer and cell identity (see Discussion).</p></sec>
<sec>
<title>Enhancers, open chromatin and CTCF binding sites</title>
<p>The discovery of <italic>B4GALT1</italic> bidirectional promoter, TG repeats and TGAS sequence enthused us to search for other regulatory sequences that might regulate the expression of <italic>B4GALT1</italic> and other family members. Our search identified following sites in the regulatory regions of <italic>B4GALT</italic> family members: enhancers, open chromatin and CTCF binding sites (<xref rid="f7-ijo-47-04-1393" ref-type="fig">Fig. 7</xref>). CTCF (CCCTC-binding transcription factor) is a well-studied multifunctional protein involved not only in DNA methylation but also associated with transcriptional activation/repression, and chromatin looping (<xref rid="b37-ijo-47-04-1393" ref-type="bibr">37</xref>). While <italic>B4GALT1</italic>, <italic>B4GALT3</italic> and <italic>B4GALT4</italic> contain a single CTCF binding site, the other members, <italic>B4GALT5</italic>, <italic>B4GALT6</italic> and <italic>B4GALT7</italic> showed 3 to 5 binding sites for CTCF. We found one to three open chromatin sites in <italic>B4GALT1</italic>, <italic>B4GALT4</italic>, <italic>B4GALT5</italic> and <italic>B4GALT6</italic>, which are indicative of active transcription. On the other hand, three genes <italic>B4GALT1</italic>, <italic>B4GALT5</italic> and <italic>B4GALT6</italic> were found rich in enhancer sequences that are associated with TCF7L2 and GATA3 in unique cell identity regulatory programs for specific expression of these genes in normal and cancer cells. Our analysis showed that the majority of <italic>B4GALT1</italic> enhancers are located within and toward 5&#x02032;-<italic>B4GALT1</italic> regulatory region.</p></sec>
<sec>
<title>Evolutionary conservation of regulatory sequences identified in this study</title>
<p>To explore the evolutionary relationships of our discovered regulatory sequences, we generated blast tree maps. The results showed that among the <italic>B4GALT</italic> family, the TR1-PE1 promoter structure was exclusively found in <italic>B4GALT1</italic> and/or <italic>B4GALT1-AS1</italic>. Nevertheless, the TE1-PE1 sequences were highly prevalent and conserved in the <italic>B4GALT1</italic> gene across the animal kingdom. For example, structures similar to TR1-PE1 were observed in <italic>B4GALT1</italic> locus in primates including gorilla (Sequence ID: XM_004047921.1), common chimpanzee (sequence ID: XM_003312037.2) Sumatran orangutan, <italic>Pongo abelii</italic> (Sequence ID: XM_002819701.2), and humans (sequence ID: NG_008919).</p>
<p>TG repeats have been implicated in gene expression, but not widely investigated for their significance in evolution of gene expression. Here, we determined their abundance across the genomes of various species. The 36 bp of the (TG)<sub>18</sub> repeats were found widespread starting from protozoan (<italic>trypanosomes</italic>), helminth (<italic>Onchocerca</italic>), higher plants (<italic>Oryza</italic>, <italic>Triticum</italic>) to humans. Then, we searched for the presence and distribution of the 22 bp of TGAS. The sequence was found conserved in <italic>B4GALT1</italic> gene but not in any other <italic>B4GALT</italic> genes. Also, we noted that the 36 bp of (TG)<sub>18</sub> repeats showed genetic evolutionary genetic relatedness with <italic>B4GALT1</italic> when investigated with first 10 bp of TGAS sequence (TAGTTCCCTA), results in <xref rid="tIV-ijo-47-04-1393" ref-type="table">Table IV</xref> support this conclusion. These data indicated the critical role of this ancient 10 bp motif in evolution of <italic>B4GALT1</italic> regulatory sequence at 5&#x02032;-side. Furthermore, the (TG)<sub>18</sub> sequence showed evolutionary genetic relatedness with fewer species, ~30, when analyzed by blast tool in combination with TGAS sequence (<xref rid="f8-ijo-47-04-1393" ref-type="fig">Fig. 8</xref>) and their duplicate of 116 bp showed similar results. This remains the first report of TGAS in literature. The results imply that when the (TG)<sub>18</sub> sequence is tied to TGAS, they become highly conserved during evolution and suggest a specific mechanistic role in gene expression.</p></sec></sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p><italic>B4GALT1</italic> has emerged as a model gene for deciphering the complex transcriptional regulation underlying its cell type- and tissue-specific gene expression. As an enzyme and cell recognition molecule mediating cell adhesion and signaling, <italic>B4GALT1</italic> serves many important roles in both physiological and pathological settings. Although, a survey of databases indicated the presence of more than one promoter for <italic>B4GALT1</italic>, before the present study, there was no definitive mapping of the transcription start sites for this gene. Our extensive and careful analysis revealed a large number of alternative promoters, thirteen to be exact, involved in human <italic>B4GALT1</italic> expression. Of these six overlapped and resided within 1.454 kb forming a complex transcriptional regulatory unit referred to as the TR1-PE1, which contains six TSS and apparently mediates most of <italic>B4GALT1</italic> expression. A recent publication by the FANTOM Consortium (<xref rid="b22-ijo-47-04-1393" ref-type="bibr">22</xref>) showed multiple transcription start sites for the <italic>B4GALT1</italic> promoter at 33167138-33167403 are located within the 266 base CGI, which is part of TR1-PE1 regulatory complex sequence characterized in the present study. This region was also shown to be associated with <italic>B4GALT1</italic> cell type-specific transcription. These data support our findings that the six TSSs present in the TR1-PE1 complex harboring six overlapping alternative promoters may all be involved in the expression of <italic>B4GALT1</italic> in different tissues.</p>
<p>A most significant finding of the present study is the revelation that the <italic>B4GALT1</italic> gene is paired with the production of a long non-coding RNA, which can potentially function as an antisense transcript, called <italic>B4GALT1-AS1</italic> via the TR1-PE1 regulatory complex. More specifically, the TR1-PE1 is a bidirectional promoter that divergently directs the transcription of <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic> genes in a head to head orientation. The configuration of any paired genes can be co-directional (either &#x02190; &#x02190; or &#x02192; &#x02192;), convergent (&#x02192; &#x02190;) or divergent (&#x02190; &#x02192;). Many promoters show divergent transcription, and it is estimated that more than 10&#x00025; of the genes in the human genome are divergently transcribed wherein the genes share a single promoter with their transcription start sites separated by &lt;1,000 base pairs. Examples include the DNA repair genes, such as the BRCA1, BRCA2, CKEK1 and FANC family members (<xref rid="b32-ijo-47-04-1393" ref-type="bibr">32</xref>,<xref rid="b33-ijo-47-04-1393" ref-type="bibr">33</xref>). Many bidirectional pairs are co-expressed, but some are anti-regulated. Furthermore, the promoter segments between two bidirectional genes initiate transcription in both directions and contain shared elements that regulate both genes, thus, providing a unique mechanism of regulation for numerous genes (<xref rid="b34-ijo-47-04-1393" ref-type="bibr">34</xref>). Our characterization of TR1-PE1 regulatory complex showed that it bridges the two adjacent head to head genes, <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic>, which are transcribed in opposite directions. Several types of TFBSs and CGI identified in TR1-PE1 are indeed found in bidirectional promoters (<xref rid="b34-ijo-47-04-1393" ref-type="bibr">34</xref>,<xref rid="b35-ijo-47-04-1393" ref-type="bibr">35</xref>). In this respect, the reported promoters by Poeta <italic>et al</italic> (<xref rid="b13-ijo-47-04-1393" ref-type="bibr">13</xref>) and Kim <italic>et al</italic> (<xref rid="b20-ijo-47-04-1393" ref-type="bibr">20</xref>), which are more likely similar to one of overlapping alternative promoters composed TR1-PE1 regulatory sequence, were found to contain methylated CGIs, which highlight the role of CGIs in <italic>B4GALT1</italic> bidirectional promoter and link them to cancer and drug resistance.</p>
<p>Another salient finding here pertains to the possible generation of five lncRNA transcript species under the control of TR1-PE1 bidirectional promoter (<xref rid="tII-ijo-47-04-1393" ref-type="table">Table II</xref>); these natural and putative antisense transcripts arising from the forward strand showed significant potential for forming RNA-RNA duplexes. The lncRNAs are abundant in the human genome and the FANTOM3 project identified ~35,000 non-coding transcripts from ~10,000 distinct loci (<xref rid="b38-ijo-47-04-1393" ref-type="bibr">38</xref>) that bear many signatures of mRNAs, including 5&#x02032; capping, splicing, and poly-adenylation, but have little or no open reading frame (ORF). Most lncRNAs are &gt;200 nucleotides long and recent evidence points to a variety of functions for them in cellular processes. These include the activation or inhibition of transcription, organization of nuclear sub-structures, alteration of chromatin state, and regulation of gene expression through the interaction with effector proteins and modulation of their activity (<xref rid="b39-ijo-47-04-1393" ref-type="bibr">39</xref>&#x02013;<xref rid="b42-ijo-47-04-1393" ref-type="bibr">42</xref>). Our data highlight the complexity underlying the transcriptional and possible post-transcriptional (by the lncRNA) regulation of the <italic>B4GALT1</italic> gene in its differential expression in human tissues.</p>
<p>Also, we considered possible involvement of other elements in modulating <italic>B4GALT1</italic> expression. In the present study, we identified (TG)<sub>18</sub> tandem repeats placed 10.146 kb from TR1-PE1 regulatory sequence. It has been reported that TG tandem repeats, which are highly conserved throughout eukaryotic genome evolution, enhance transcription especially when they are located closer to the promoter (<xref rid="b36-ijo-47-04-1393" ref-type="bibr">36</xref>,<xref rid="b43-ijo-47-04-1393" ref-type="bibr">43</xref>,<xref rid="b44-ijo-47-04-1393" ref-type="bibr">44</xref>). Although, the potential roles of these repeats have been demonstrated in recombination (<xref rid="b45-ijo-47-04-1393" ref-type="bibr">45</xref>), DNA repair (<xref rid="b46-ijo-47-04-1393" ref-type="bibr">46</xref>), alternative splicing (<xref rid="b47-ijo-47-04-1393" ref-type="bibr">47</xref>) and breast cancer (<xref rid="b48-ijo-47-04-1393" ref-type="bibr">48</xref>,<xref rid="b49-ijo-47-04-1393" ref-type="bibr">49</xref>), some observations do not support a function for these repeats in gene expression (<xref rid="b36-ijo-47-04-1393" ref-type="bibr">36</xref>).</p>
<p>Our analysis of dinucleotide base-stacking energy along TR1-PE1 showed variable values, which reflect the patterns of double helix dissociation in the regulatory sequence. Additionally, we were able to correlate and co-localize the values of dinucleotide base-stacking energy with the transcription factor binding sequences. As described by Ornstein <italic>et al</italic> (<xref rid="b29-ijo-47-04-1393" ref-type="bibr">29</xref>), the relative stability of the double helix structure can be demonstrated according to the range of dinucleotide base-stacking energy values from &#x02212;3.82 kcal/mol (unstack easily) to &#x02212;14.59 kcal/mol (difficult to unstack), thus the obtained values for INR sites are at regions that dissociate easily in comparison with BRE and TSS sites. Also, we observed another interesting structural feature in the TR1-PE1 bidirectional promoter, which is related to the GC skew type at TSS sites and possible formation of R-loops. It has been reported that GC strand asymmetry downstream of TSSs is prone to R-loop formation that is correlated with unmethylated status of CGI (<xref rid="b31-ijo-47-04-1393" ref-type="bibr">31</xref>). These data on structural features of TR1-PE1 regulatory region, in addition to other criteria of alternative promoters reported in present study, intergenic or intragenic locations, types of TFBSs, CGI and potential role of (TG)<sub>18</sub> tandem repeats are critical factors and have important consequences in the transcription process and cell type specific expression of <italic>B4GALT1</italic>.</p>
<p>The identification of cell type-specific TFBS (TCF7L2 and GATA3) in the vicinity of TR1-PE1 and HP3 intronic promoter provided further functions played by <italic>B4GALT1</italic> in the development of cancer, cancer stem cell-linked expression and relapse. The TCF7L2 transcription factor is linked to a variety of human diseases, including cancer (<xref rid="b50-ijo-47-04-1393" ref-type="bibr">50</xref>). It plays a critical role in enhancer activity, especially super enhancer, which is known to be associated with cell identity and diseases (<xref rid="b50-ijo-47-04-1393" ref-type="bibr">50</xref>&#x02013;<xref rid="b54-ijo-47-04-1393" ref-type="bibr">54</xref>). Furthermore, TCF7L2 is tied to the genome by association with GATA3 (<xref rid="b50-ijo-47-04-1393" ref-type="bibr">50</xref>), which is the case shown in this study. GATA transcription factors are also involved in carcinogenesis. The roles of GATA factors in carcinogenesis <italic>vis-&#x000E0;-vis</italic> the normal functions is a result of malfunctions. GATA1 mutations are associated with megakaryoblastic leukemias in patients with Down syndrome; loss of GATA3 expression is involved in breast cancer; whereas silencing of GATA4 and GATA5 expression are reported in gastric, colorectal and lung cancer (<xref rid="b53-ijo-47-04-1393" ref-type="bibr">53</xref>,<xref rid="b54-ijo-47-04-1393" ref-type="bibr">54</xref>).</p>
<p>Another finding was the identification of the binding site for glioma-associated oncogene homolog 1 (GLI1) transcription factor adjacent to TG-TGAS sequence. GLI1 is the nuclear mediator of the Hedgehog pathway that regulates genes essential for various stages of tumor development and progression (<xref rid="b55-ijo-47-04-1393" ref-type="bibr">55</xref>). Accordingly, it is proposed that GLI1 is a potential target for cancer therapy. Recently, four natural compounds of physalins family showed dose-dependent GLI1-transcriptional inhibitory activity (<xref rid="b56-ijo-47-04-1393" ref-type="bibr">56</xref>).</p>
<p>Furthermore, we identified ten SNPs in the <italic>B4GALT1</italic> TR1-PE1 regulatory sequence; these can modify splicing and can alter <italic>B4GALT1</italic> expression. The SNPs are frequent in the human genome, the 1000 Genomes Project reported 38 million SNPs. SNPs that are located within the exons or exon-intron boundaries can modify the splicing sites and consequently the protein function resulting in development of many associated diseases including cancer (<xref rid="b57-ijo-47-04-1393" ref-type="bibr">57</xref>). Many SNPs are located within the regulatory regions of genes, which may influence their expression (<xref rid="b58-ijo-47-04-1393" ref-type="bibr">58</xref>).</p>
<p>Our studies on <italic>B4GALT1</italic> also enabled a comparison of regulatory elements present within the <italic>B4GALT1</italic> gene family. This is important, because the &#x003B2;-1, 4-galactosyltransferases perform redundant catalysis and functions, often compensating within and outside of this class of enzymes (<xref rid="b1-ijo-47-04-1393" ref-type="bibr">1</xref>,<xref rid="b4-ijo-47-04-1393" ref-type="bibr">4</xref>). For example, in <italic>B4GALT1</italic> knockout mice, there was a shift in the galactose linkages from the largely &#x003B2;-1,4 linkage to &#x003B2;-1,3 linkage, suggesting that <italic>B4GALT1</italic> deficiency was compensated for by &#x003B2;-1,3-galactosyltransferases (<xref rid="b59-ijo-47-04-1393" ref-type="bibr">59</xref>). We observed both similarities and differences in the genomic regulatory features when <italic>B4GALT1</italic> was compared with the other members of the family. The similarities included the presence of multiple promoters and some members with long non-coding RNAs. Differences were that the TR1-PE1, (TG)<sub>18</sub> and TGAS sequences were restricted to <italic>B4GALT1</italic> and the epigenetic landscape appeared to be different as well.</p>
<p>In conclusion, this study provides further insight into the regulatory features that govern <italic>B4GALT1</italic> and reveal a novel bidirectional promoter and multifaceted regulatory region at the 5&#x02032;-<italic>B4GALT1</italic> locus, which includes several genomic-epigenetic regulatory elements that control at least six transcription start sites embedded within regulatory sequence of 1.454 kb. The identified (TG)<sub>18</sub> and the TGAS sequences in the regulatory regions of <italic>B4GALT1</italic>, which were found conserved through evolution in many species, give further indication of involvement and evolution of several ancient sequences in the transcription process of specific genes. The data highlight the complexity and sophistication underlying the transcriptional and possible post-transcriptional regulation of the <italic>B4GALT1</italic> gene and its differential expression in human normal and cancer cells. The special genomic-epigenomic characteristics of <italic>B4GALT1</italic> gene expression is manifested by presence of several antisense lncRNA transcripts in comparison to other <italic>B4GALT</italic> members. There are five <italic>B4GALT1-AS1</italic> transcripts that have the potential to control the expression of two <italic>B4GALT1</italic> transcripts. The data are likely to advance and develop the role of this important enzyme in cancer patho-physiology, drug resistance and personalized therapy.</p></sec></body>
<back>
<ack>
<title>Acknowledgements</title>
<p>The present study was supported in part by a grant from the Cancer Prevention and Research Institute of Texas (RP130266) to KSS. We thank Ibtisam Ismael Alobaidi for technical assistance.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-ijo-47-04-1393"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Amado</surname><given-names>M</given-names></name><name><surname>Almeida</surname><given-names>R</given-names></name><name><surname>Schwientek</surname><given-names>T</given-names></name><name><surname>Clausen</surname><given-names>H</given-names></name></person-group><article-title>Identification and characterization of large galactosyltransferase gene families: Galactosyltransferases for all functions</article-title><source>Biochim Biophys Acta</source><volume>1473</volume><fpage>35</fpage><lpage>53</lpage><year>1999</year><pub-id pub-id-type="doi">10.1016/S0304-4165(99)00168-3</pub-id><pub-id pub-id-type="pmid">10580128</pub-id></element-citation></ref>
<ref id="b2-ijo-47-04-1393"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Qasba</surname><given-names>PK</given-names></name><name><surname>Ramakrishnan</surname><given-names>B</given-names></name><name><surname>Boeggeman</surname><given-names>E</given-names></name></person-group><article-title>Structure and function of beta -1,4-galactosyltransferase</article-title><source>Curr Drug Targets</source><volume>9</volume><fpage>292</fpage><lpage>309</lpage><year>2008</year><pub-id pub-id-type="doi">10.2174/138945008783954943</pub-id><pub-id pub-id-type="pmid">18393823</pub-id><pub-id pub-id-type="pmcid">2365515</pub-id></element-citation></ref>
<ref id="b3-ijo-47-04-1393"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Appert</surname><given-names>HE</given-names></name><name><surname>Rutherford</surname><given-names>TJ</given-names></name><name><surname>Tarr</surname><given-names>GE</given-names></name><name><surname>Wiest</surname><given-names>JS</given-names></name><name><surname>Thomford</surname><given-names>NR</given-names></name><name><surname>McCorquodale</surname><given-names>DJ</given-names></name></person-group><article-title>Isolation of a cDNA coding for human galactosyltransferase</article-title><source>Biochem Biophys Res Commun</source><volume>139</volume><fpage>163</fpage><lpage>168</lpage><year>1986</year><pub-id pub-id-type="doi">10.1016/S0006-291X(86)80094-8</pub-id><pub-id pub-id-type="pmid">3094506</pub-id></element-citation></ref>
<ref id="b4-ijo-47-04-1393"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hennet</surname><given-names>T</given-names></name></person-group><article-title>The galactosyltransferase family</article-title><source>Cell Mol Life Sci</source><volume>59</volume><fpage>1081</fpage><lpage>1095</lpage><year>2002</year><pub-id pub-id-type="doi">10.1007/s00018-002-8489-4</pub-id><pub-id pub-id-type="pmid">12222957</pub-id></element-citation></ref>
<ref id="b5-ijo-47-04-1393"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lopez</surname><given-names>LC</given-names></name><name><surname>Youakim</surname><given-names>A</given-names></name><name><surname>Evans</surname><given-names>SC</given-names></name><name><surname>Shur</surname><given-names>BD</given-names></name></person-group><article-title>Evidence for a molecular distinction between Golgi and cell surface forms of beta 1,4-galactosyltransferase</article-title><source>J Biol Chem</source><volume>266</volume><fpage>15984</fpage><lpage>15991</lpage><year>1991</year><pub-id pub-id-type="pmid">1714903</pub-id></element-citation></ref>
<ref id="b6-ijo-47-04-1393"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Choi</surname><given-names>HJ</given-names></name><name><surname>Chung</surname><given-names>TW</given-names></name><name><surname>Kim</surname><given-names>CH</given-names></name><name><surname>Jeong</surname><given-names>HS</given-names></name><name><surname>Joo</surname><given-names>M</given-names></name><name><surname>Youn</surname><given-names>B</given-names></name><name><surname>Ha</surname><given-names>KT</given-names></name></person-group><article-title>Estrogen induced &#x003B2;-1,4-galactosyltransferase 1 expression regulates proliferation of human breast cancer MCF-7 cells</article-title><source>Biochem Biophys Res Commun</source><volume>426</volume><fpage>620</fpage><lpage>625</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.bbrc.2012.08.140</pub-id><pub-id pub-id-type="pmid">22982306</pub-id></element-citation></ref>
<ref id="b7-ijo-47-04-1393"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>W</given-names></name><name><surname>Cui</surname><given-names>Z</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name><name><surname>Zhu</surname><given-names>X</given-names></name><name><surname>Fan</surname><given-names>J</given-names></name><name><surname>Bao</surname><given-names>G</given-names></name><name><surname>Qiu</surname><given-names>J</given-names></name><name><surname>Xu</surname><given-names>D</given-names></name></person-group><article-title>Elevated expression of &#x003B2;1,4-galactosyltransferase-I in cartilage and synovial tissue of patients with osteoarthritis</article-title><source>Inflammation</source><volume>35</volume><fpage>647</fpage><lpage>655</lpage><year>2012</year><pub-id pub-id-type="doi">10.1007/s10753-011-9357-x</pub-id></element-citation></ref>
<ref id="b8-ijo-47-04-1393"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>H</given-names></name><name><surname>Ma</surname><given-names>H</given-names></name><name><surname>Wei</surname><given-names>W</given-names></name><name><surname>Ji</surname><given-names>D</given-names></name><name><surname>Song</surname><given-names>X</given-names></name><name><surname>Sun</surname><given-names>J</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name><name><surname>Jia</surname><given-names>L</given-names></name></person-group><article-title>B4GALT family mediates the multidrug resistance of human leukemia cells by regulating the hedgehog pathway and the expression of p-glycoprotein and multidrug resistance-associated protein 1</article-title><source>Cell Death Dis</source><volume>4</volume><fpage>e654</fpage><year>2013</year><pub-id pub-id-type="doi">10.1038/cddis.2013.186</pub-id><pub-id pub-id-type="pmid">23744354</pub-id><pub-id pub-id-type="pmcid">3698553</pub-id></element-citation></ref>
<ref id="b9-ijo-47-04-1393"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mengle-Gaw</surname><given-names>L</given-names></name><name><surname>McCoy-Haman</surname><given-names>MF</given-names></name><name><surname>Tiemeier</surname><given-names>DC</given-names></name></person-group><article-title>Genomic structure and expression of human beta-1,4-galactosyltransferase</article-title><source>Biochem Biophys Res Commun</source><volume>176</volume><fpage>1269</fpage><lpage>1276</lpage><year>1991</year><pub-id pub-id-type="doi">10.1016/0006-291X(91)90423-5</pub-id><pub-id pub-id-type="pmid">1903938</pub-id></element-citation></ref>
<ref id="b10-ijo-47-04-1393"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shaper</surname><given-names>NL</given-names></name><name><surname>Charron</surname><given-names>M</given-names></name><name><surname>Lo</surname><given-names>NW</given-names></name><name><surname>Shaper</surname><given-names>JH</given-names></name></person-group><article-title>Beta1,4-galactosyltransferase and lactose biosynthesis: Recruitment of a housekeeping gene from the nonmammalian vertebrate gene pool for a mammary gland specific function</article-title><source>J Mammary Gland Biol Neoplasia</source><volume>3</volume><fpage>315</fpage><lpage>324</lpage><year>1998</year><pub-id pub-id-type="doi">10.1023/A:1018719612087</pub-id></element-citation></ref>
<ref id="b11-ijo-47-04-1393"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>S</given-names></name><name><surname>Cai</surname><given-names>M</given-names></name><name><surname>Zhang</surname><given-names>SW</given-names></name><name><surname>Hu</surname><given-names>Y</given-names></name><name><surname>Gu</surname><given-names>JX</given-names></name></person-group><article-title>Involvement of beta 1,4 galactosyltransferase 1 and Gal beta1--&gt;4GlcNAc groups in human hepatocarcinoma cell apoptosis</article-title><source>Mol Cell Biochem</source><volume>243</volume><fpage>81</fpage><lpage>86</lpage><year>2003</year><pub-id pub-id-type="doi">10.1023/A:1021683324548</pub-id><pub-id pub-id-type="pmid">12619892</pub-id></element-citation></ref>
<ref id="b12-ijo-47-04-1393"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname><given-names>X</given-names></name><name><surname>Jiang</surname><given-names>J</given-names></name><name><surname>Shen</surname><given-names>H</given-names></name><name><surname>Wang</surname><given-names>H</given-names></name><name><surname>Zong</surname><given-names>H</given-names></name><name><surname>Li</surname><given-names>Z</given-names></name><name><surname>Yang</surname><given-names>Y</given-names></name><name><surname>Niu</surname><given-names>Z</given-names></name><name><surname>Liu</surname><given-names>W</given-names></name><name><surname>Chen</surname><given-names>X</given-names></name><etal/></person-group><article-title>Elevated beta1,4-galactosyltransferase I in highly metastatic human lung cancer cells. Identification of E1AF as important transcription activator</article-title><source>J Biol Chem</source><volume>280</volume><fpage>12503</fpage><lpage>12516</lpage><year>2005</year><pub-id pub-id-type="doi">10.1074/jbc.M413631200</pub-id></element-citation></ref>
<ref id="b13-ijo-47-04-1393"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Poeta</surname><given-names>ML</given-names></name><name><surname>Massi</surname><given-names>E</given-names></name><name><surname>Parrella</surname><given-names>P</given-names></name><name><surname>Pellegrini</surname><given-names>P</given-names></name><name><surname>De Robertis</surname><given-names>M</given-names></name><name><surname>Copetti</surname><given-names>M</given-names></name><name><surname>Rabitti</surname><given-names>C</given-names></name><name><surname>Perrone</surname><given-names>G</given-names></name><name><surname>Muda</surname><given-names>AO</given-names></name><name><surname>Molinari</surname><given-names>F</given-names></name><etal/></person-group><article-title>Aberrant promoter methylation of beta-1,4 galactosyltransferase 1 as potential cancer-specific biomarker of colorectal tumors</article-title><source>Genes Chromosomes Cancer</source><volume>51</volume><fpage>1133</fpage><lpage>1143</lpage><year>2012</year><pub-id pub-id-type="doi">10.1002/gcc.21998</pub-id><pub-id pub-id-type="pmid">22927297</pub-id></element-citation></ref>
<ref id="b14-ijo-47-04-1393"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Radhakrishnan</surname><given-names>P</given-names></name><name><surname>Chachadi</surname><given-names>V</given-names></name><name><surname>Lin</surname><given-names>MF</given-names></name><name><surname>Singh</surname><given-names>R</given-names></name><name><surname>Kannagi</surname><given-names>R</given-names></name><name><surname>Cheng</surname><given-names>PW</given-names></name></person-group><article-title>TNF,&#x003B1; enhances the motility and invasiveness of prostatic cancer cells by stimulating the expression of selective glycosyl- and sulfotransferase genes involved in the synthesis of selectin ligands</article-title><source>Biochem Biophys Res Commun</source><volume>409</volume><fpage>436</fpage><lpage>441</lpage><year>2011</year><pub-id pub-id-type="doi">10.1016/j.bbrc.2011.05.019</pub-id><pub-id pub-id-type="pmid">21596021</pub-id><pub-id pub-id-type="pmcid">4011552</pub-id></element-citation></ref>
<ref id="b15-ijo-47-04-1393"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yamashita</surname><given-names>H</given-names></name><name><surname>Kubushiro</surname><given-names>K</given-names></name><name><surname>Ma</surname><given-names>J</given-names></name><name><surname>Fujii</surname><given-names>T</given-names></name><name><surname>Tsukazaki</surname><given-names>K</given-names></name><name><surname>Iwamori</surname><given-names>M</given-names></name><name><surname>Nozawa</surname><given-names>S</given-names></name></person-group><article-title>Alteration in the metastatic potential of ovarian cancer cells by transfection of the antisense gene of beta-1,4-galactosyltransferase</article-title><source>Oncol Rep</source><volume>10</volume><fpage>1857</fpage><lpage>1862</lpage><year>2003</year><pub-id pub-id-type="pmid">14534708</pub-id></element-citation></ref>
<ref id="b16-ijo-47-04-1393"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>H</given-names></name><name><surname>Zhang</surname><given-names>Z</given-names></name><name><surname>Liu</surname><given-names>C</given-names></name><name><surname>Jin</surname><given-names>C</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name><name><surname>Miao</surname><given-names>X</given-names></name><name><surname>Jia</surname><given-names>L</given-names></name></person-group><article-title>B4GALT1 gene knockdown inhibits the hedgehog pathway and reverses multidrug resistance in the human leukemia K562/adriamycin-resistant cell line</article-title><source>IUBMB Life</source><volume>64</volume><fpage>889</fpage><lpage>900</lpage><year>2012</year><pub-id pub-id-type="doi">10.1002/iub.1080</pub-id><pub-id pub-id-type="pmid">23024026</pub-id></element-citation></ref>
<ref id="b17-ijo-47-04-1393"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname><given-names>X</given-names></name><name><surname>Monitto</surname><given-names>CL</given-names></name><name><surname>Demokan</surname><given-names>S</given-names></name><name><surname>Kim</surname><given-names>MS</given-names></name><name><surname>Chang</surname><given-names>SS</given-names></name><name><surname>Zhong</surname><given-names>X</given-names></name><name><surname>Califano</surname><given-names>JA</given-names></name><name><surname>Sidransky</surname><given-names>D</given-names></name></person-group><article-title>Identification of hyper-methylated genes associated with cisplatin resistance in human cancers</article-title><source>Cancer Res</source><volume>70</volume><fpage>2870</fpage><lpage>2879</lpage><year>2010</year><pub-id pub-id-type="doi">10.1158/0008-5472.CAN-09-3427</pub-id><pub-id pub-id-type="pmid">20215521</pub-id><pub-id pub-id-type="pmcid">2849007</pub-id></element-citation></ref>
<ref id="b18-ijo-47-04-1393"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Helleman</surname><given-names>J</given-names></name><name><surname>Jansen</surname><given-names>MP</given-names></name><name><surname>Span</surname><given-names>PN</given-names></name><name><surname>van Staveren</surname><given-names>IL</given-names></name><name><surname>Massuger</surname><given-names>LF</given-names></name><name><surname>Meijervan Gelder</surname><given-names>ME</given-names></name><name><surname>Sweep</surname><given-names>FC</given-names></name><name><surname>Ewing</surname><given-names>PC</given-names></name><name><surname>van der Burg</surname><given-names>ME</given-names></name><name><surname>Stoter</surname><given-names>G</given-names></name><etal/></person-group><article-title>Molecular profiling of platinum resistant ovarian cancer</article-title><source>Int J Cancer</source><volume>118</volume><fpage>1963</fpage><lpage>1971</lpage><year>2006</year><pub-id pub-id-type="doi">10.1002/ijc.21599</pub-id></element-citation></ref>
<ref id="b19-ijo-47-04-1393"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname><given-names>Q</given-names></name><name><surname>Yang</surname><given-names>H</given-names></name><name><surname>Cheng</surname><given-names>C</given-names></name><name><surname>Li</surname><given-names>C</given-names></name><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Huan</surname><given-names>W</given-names></name><name><surname>Sun</surname><given-names>H</given-names></name><name><surname>Zhou</surname><given-names>Z</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name><name><surname>Zhao</surname><given-names>Y</given-names></name><etal/></person-group><article-title>&#x003B2;-1,4-Galactosyltransferase I involved in Schwann cells proliferation and apoptosis induced by tumor necrosis factor-alpha via the activation of MAP kinases signal pathways</article-title><source>Mol Cell Biochem</source><volume>365</volume><fpage>149</fpage><lpage>158</lpage><year>2012</year><pub-id pub-id-type="doi">10.1007/s11010-012-1254-6</pub-id><pub-id pub-id-type="pmid">22359038</pub-id></element-citation></ref>
<ref id="b20-ijo-47-04-1393"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>MS</given-names></name><name><surname>Louwagie</surname><given-names>J</given-names></name><name><surname>Carvalho</surname><given-names>B</given-names></name><name><surname>Terhaar Sive Droste</surname><given-names>JS</given-names></name><name><surname>Park</surname><given-names>HL</given-names></name><name><surname>Chae</surname><given-names>YK</given-names></name><name><surname>Yamashita</surname><given-names>K</given-names></name><name><surname>Liu</surname><given-names>J</given-names></name><name><surname>Ostrow</surname><given-names>KL</given-names></name><name><surname>Ling</surname><given-names>S</given-names></name><etal/></person-group><article-title>Promoter DNA methylation of oncostatin m receptor-beta as a novel diagnostic and therapeutic marker in colon cancer</article-title><source>PLoS One</source><volume>4</volume><fpage>e6555</fpage><year>2009</year><pub-id pub-id-type="doi">10.1371/journal.pone.0006555</pub-id><pub-id pub-id-type="pmid">19662090</pub-id><pub-id pub-id-type="pmcid">2717211</pub-id></element-citation></ref>
<ref id="b21-ijo-47-04-1393"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Michailidi</surname><given-names>C</given-names></name><name><surname>Soudry</surname><given-names>E</given-names></name><name><surname>Brait</surname><given-names>M</given-names></name><name><surname>Maldonado</surname><given-names>L</given-names></name><name><surname>Jaffe</surname><given-names>A</given-names></name><name><surname>Ili-Gangas</surname><given-names>C</given-names></name><name><surname>Brebi-Mieville</surname><given-names>P</given-names></name><name><surname>Perez</surname><given-names>J</given-names></name><name><surname>Kim</surname><given-names>MS</given-names></name><name><surname>Zhong</surname><given-names>X</given-names></name><etal/></person-group><article-title>Genome-wide and gene-specific epigenomic platforms for hepatocellular carcinoma biomarker development trials</article-title><source>Gastroenterol Res Pract</source><volume>2014</volume><fpage>597164</fpage><year>2014</year><pub-id pub-id-type="doi">10.1155/2014/597164</pub-id><pub-id pub-id-type="pmid">24829571</pub-id><pub-id pub-id-type="pmcid">4009191</pub-id></element-citation></ref>
<ref id="b22-ijo-47-04-1393"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Forrest</surname><given-names>AR</given-names></name><name><surname>Kawaji</surname><given-names>H</given-names></name><name><surname>Rehli</surname><given-names>M</given-names></name><name><surname>Baillie</surname><given-names>JK</given-names></name><name><surname>de Hoon</surname><given-names>MJ</given-names></name><name><surname>Haberle</surname><given-names>V</given-names></name><name><surname>Lassmann</surname><given-names>T</given-names></name><name><surname>Kulakovskiy</surname><given-names>IV</given-names></name><name><surname>Lizio</surname><given-names>M</given-names></name><name><surname>Itoh</surname><given-names>M</given-names></name><etal/></person-group><collab>FANTOM Consortium and the RIKEN PMI and CLST (DGT)</collab><article-title>A promoter-level mammalian expression atlas</article-title><source>Nature</source><volume>507</volume><fpage>462</fpage><lpage>470</lpage><year>2014</year><pub-id pub-id-type="doi">10.1038/nature13182</pub-id><pub-id pub-id-type="pmid">24670764</pub-id><pub-id pub-id-type="pmcid">4529748</pub-id></element-citation></ref>
<ref id="b23-ijo-47-04-1393"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>C</given-names></name><name><surname>Bolotin</surname><given-names>E</given-names></name><name><surname>Jiang</surname><given-names>T</given-names></name><name><surname>Sladek</surname><given-names>FM</given-names></name><name><surname>Martinez</surname><given-names>E</given-names></name></person-group><article-title>Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters</article-title><source>Gene</source><volume>389</volume><fpage>52</fpage><lpage>65</lpage><year>2007</year><pub-id pub-id-type="doi">10.1016/j.gene.2006.09.029</pub-id><pub-id pub-id-type="pmcid">1955227</pub-id></element-citation></ref>
<ref id="b24-ijo-47-04-1393"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hsu</surname><given-names>PC</given-names></name><name><surname>Chao</surname><given-names>CC</given-names></name><name><surname>Yang</surname><given-names>CY</given-names></name><name><surname>Ye</surname><given-names>YL</given-names></name><name><surname>Liu</surname><given-names>FC</given-names></name><name><surname>Chuang</surname><given-names>YJ</given-names></name><name><surname>Lan</surname><given-names>CY</given-names></name></person-group><article-title>Diverse Hap43-independent functions of the Candida albicans CCAAT-binding complex</article-title><source>Eukaryot Cell</source><volume>12</volume><fpage>804</fpage><lpage>815</lpage><year>2013</year><pub-id pub-id-type="doi">10.1128/EC.00014-13</pub-id><pub-id pub-id-type="pmid">23543673</pub-id><pub-id pub-id-type="pmcid">3675991</pub-id></element-citation></ref>
<ref id="b25-ijo-47-04-1393"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ko</surname><given-names>LJ</given-names></name><name><surname>Engel</surname><given-names>JD</given-names></name></person-group><article-title>DNA-binding specificities of the GATA transcription factor family</article-title><source>Mol Cell Biol</source><volume>13</volume><fpage>4011</fpage><lpage>4022</lpage><year>1993</year><pub-id pub-id-type="pmid">8321208</pub-id><pub-id pub-id-type="pmcid">359950</pub-id></element-citation></ref>
<ref id="b26-ijo-47-04-1393"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Blauwkamp</surname><given-names>TA</given-names></name><name><surname>Chang</surname><given-names>MV</given-names></name><name><surname>Cadigan</surname><given-names>KM</given-names></name></person-group><article-title>Novel TCF-binding sites specify transcriptional repression by Wnt signalling</article-title><source>EMBO J</source><volume>27</volume><fpage>1436</fpage><lpage>1446</lpage><year>2008</year><pub-id pub-id-type="pmid">18418383</pub-id><pub-id pub-id-type="pmcid">2396397</pub-id></element-citation></ref>
<ref id="b27-ijo-47-04-1393"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gardiner-Garden</surname><given-names>M</given-names></name><name><surname>Frommer</surname><given-names>M</given-names></name></person-group><article-title>CpG islands in vertebrate genomes</article-title><source>J Mol Biol</source><volume>196</volume><fpage>261</fpage><lpage>282</lpage><year>1987</year><pub-id pub-id-type="doi">10.1016/0022-2836(87)90689-9</pub-id><pub-id pub-id-type="pmid">3656447</pub-id></element-citation></ref>
<ref id="b28-ijo-47-04-1393"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hackenberg</surname><given-names>M</given-names></name><name><surname>Barturen</surname><given-names>G</given-names></name><name><surname>Carpena</surname><given-names>P</given-names></name><name><surname>Luque-Escamilla</surname><given-names>PL</given-names></name><name><surname>Previti</surname><given-names>C</given-names></name><name><surname>Oliver</surname><given-names>JL</given-names></name></person-group><article-title>Prediction of CpG-island function: CpG clustering vs. sliding-window methods</article-title><source>BMC Genomics</source><volume>11</volume><fpage>327</fpage><year>2010</year><pub-id pub-id-type="doi">10.1186/1471-2164-11-327</pub-id><pub-id pub-id-type="pmid">20500903</pub-id><pub-id pub-id-type="pmcid">2887419</pub-id></element-citation></ref>
<ref id="b29-ijo-47-04-1393"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ornstein</surname><given-names>RL</given-names></name><name><surname>Rein</surname><given-names>R</given-names></name><name><surname>Breen</surname><given-names>D</given-names></name><name><surname>Macelroy</surname><given-names>R</given-names></name></person-group><article-title>An optimized potential function for the calculation of nucleic acid interaction energies I. base stacking</article-title><source>Biopolymers</source><volume>17</volume><fpage>2341</fpage><lpage>2360</lpage><year>1978</year><pub-id pub-id-type="doi">10.1002/bip.1978.360171005</pub-id><pub-id pub-id-type="pmid">24624489</pub-id></element-citation></ref>
<ref id="b30-ijo-47-04-1393"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vesth</surname><given-names>T</given-names></name><name><surname>Lagesen</surname><given-names>K</given-names></name><name><surname>Acar</surname><given-names>&#x000D6;</given-names></name><name><surname>Ussery</surname><given-names>D</given-names></name></person-group><article-title>CMG-biotools, a free workbench for basic comparative microbial genomics</article-title><source>PLoS One</source><volume>8</volume><fpage>e60120</fpage><year>2013</year><pub-id pub-id-type="doi">10.1371/journal.pone.0060120</pub-id><pub-id pub-id-type="pmid">23577086</pub-id><pub-id pub-id-type="pmcid">3618517</pub-id></element-citation></ref>
<ref id="b31-ijo-47-04-1393"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ginno</surname><given-names>PA</given-names></name><name><surname>Lott</surname><given-names>PL</given-names></name><name><surname>Christensen</surname><given-names>HC</given-names></name><name><surname>Korf</surname><given-names>I</given-names></name><name><surname>Ch&#x000E9;din</surname><given-names>F</given-names></name></person-group><article-title>R-loop formation is a distinctive characteristic of unmethylated human CpG island promoters</article-title><source>Mol Cell</source><volume>45</volume><fpage>814</fpage><lpage>825</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.molcel.2012.01.017</pub-id><pub-id pub-id-type="pmid">22387027</pub-id><pub-id pub-id-type="pmcid">3319272</pub-id></element-citation></ref>
<ref id="b32-ijo-47-04-1393"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Seila</surname><given-names>AC</given-names></name><name><surname>Core</surname><given-names>LJ</given-names></name><name><surname>Lis</surname><given-names>JT</given-names></name><name><surname>Sharp</surname><given-names>PA</given-names></name></person-group><article-title>Divergent transcription: A new feature of active promoters</article-title><source>Cell Cycle</source><volume>8</volume><fpage>2557</fpage><lpage>2564</lpage><year>2009</year><pub-id pub-id-type="doi">10.4161/cc.8.16.9305</pub-id><pub-id pub-id-type="pmid">19597342</pub-id></element-citation></ref>
<ref id="b33-ijo-47-04-1393"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>MQ</given-names></name><name><surname>Koehly</surname><given-names>LM</given-names></name><name><surname>Elnitski</surname><given-names>LL</given-names></name></person-group><article-title>Comprehensive annotation of bidirectional promoters identifies co-regulation among breast and ovarian cancer genes</article-title><source>PLOS Comput Biol</source><volume>3</volume><fpage>e72</fpage><year>2007</year><pub-id pub-id-type="doi">10.1371/journal.pcbi.0030072</pub-id><pub-id pub-id-type="pmid">17447839</pub-id><pub-id pub-id-type="pmcid">1853124</pub-id></element-citation></ref>
<ref id="b34-ijo-47-04-1393"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Orekhova</surname><given-names>AS</given-names></name><name><surname>Rubtsov</surname><given-names>PM</given-names></name></person-group><article-title>Bidirectional promoters in the transcription of mammalian genomes</article-title><source>Biochemistry (Mosc)</source><volume>78</volume><fpage>335</fpage><lpage>341</lpage><year>2013</year><pub-id pub-id-type="doi">10.1134/S0006297913040020</pub-id></element-citation></ref>
<ref id="b35-ijo-47-04-1393"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>MQ</given-names></name><name><surname>Elnitski</surname><given-names>LL</given-names></name></person-group><article-title>Diversity of core promoter elements comprising human bidirectional promoters</article-title><source>BMC Genomics</source><volume>9</volume><issue>Suppl 2</issue><fpage>S3</fpage><year>2008</year><pub-id pub-id-type="pmid">18831794</pub-id><pub-id pub-id-type="pmcid">2559893</pub-id></element-citation></ref>
<ref id="b36-ijo-47-04-1393"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>W</given-names></name><name><surname>He</surname><given-names>L</given-names></name><name><surname>Liu</surname><given-names>W</given-names></name><name><surname>Sun</surname><given-names>C</given-names></name><name><surname>Ratain</surname><given-names>MJ</given-names></name></person-group><article-title>Exploring the relationship between polymorphic (TG/CA)n repeats in intron 1 regions and gene expression</article-title><source>Hum Genomics</source><volume>3</volume><fpage>236</fpage><lpage>245</lpage><year>2009</year><pub-id pub-id-type="pmid">19403458</pub-id><pub-id pub-id-type="pmcid">2735212</pub-id></element-citation></ref>
<ref id="b37-ijo-47-04-1393"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>De La Rosa-Vel&#x000E1;zquez</surname><given-names>IA</given-names></name><name><surname>Rinc&#x000F3;n-Arano</surname><given-names>H</given-names></name><name><surname>Ben&#x000ED;tez-Bribiesca</surname><given-names>L</given-names></name><name><surname>Recillas-Targa</surname><given-names>F</given-names></name></person-group><article-title>Epigenetic regulation of the human retinoblastoma tumor suppressor gene promoter by CTCF</article-title><source>Cancer Res</source><volume>67</volume><fpage>2577</fpage><lpage>2585</lpage><year>2007</year><pub-id pub-id-type="doi">10.1158/0008-5472.CAN-06-2024</pub-id><pub-id pub-id-type="pmid">17363576</pub-id></element-citation></ref>
<ref id="b38-ijo-47-04-1393"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Carninci</surname><given-names>P</given-names></name><name><surname>Kasukawa</surname><given-names>T</given-names></name><name><surname>Katayama</surname><given-names>S</given-names></name><name><surname>Gough</surname><given-names>J</given-names></name><name><surname>Frith</surname><given-names>MC</given-names></name><name><surname>Maeda</surname><given-names>N</given-names></name><name><surname>Oyama</surname><given-names>R</given-names></name><name><surname>Ravasi</surname><given-names>T</given-names></name><name><surname>Lenhard</surname><given-names>B</given-names></name><name><surname>Wells</surname><given-names>C</given-names></name><etal/></person-group><collab>RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group)</collab><article-title>The transcriptional landscape of the mammalian genome</article-title><source>Science</source><volume>309</volume><fpage>1559</fpage><lpage>1563</lpage><year>2005</year><pub-id pub-id-type="doi">10.1126/science.1112014</pub-id><pub-id pub-id-type="pmid">16141072</pub-id></element-citation></ref>
<ref id="b39-ijo-47-04-1393"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Uesaka</surname><given-names>M</given-names></name><name><surname>Nishimura</surname><given-names>O</given-names></name><name><surname>Go</surname><given-names>Y</given-names></name><name><surname>Nakashima</surname><given-names>K</given-names></name><name><surname>Agata</surname><given-names>K</given-names></name><name><surname>Imamura</surname><given-names>T</given-names></name></person-group><article-title>Bidirectional promoters are the major source of gene activation-associated non-coding RNAs in mammals</article-title><source>BMC Genomics</source><volume>15</volume><fpage>35</fpage><year>2014</year><pub-id pub-id-type="doi">10.1186/1471-2164-15-35</pub-id><pub-id pub-id-type="pmid">24438357</pub-id><pub-id pub-id-type="pmcid">3898825</pub-id></element-citation></ref>
<ref id="b40-ijo-47-04-1393"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lepoivre</surname><given-names>C</given-names></name><name><surname>Belhocine</surname><given-names>M</given-names></name><name><surname>Bergon</surname><given-names>A</given-names></name><name><surname>Griffon</surname><given-names>A</given-names></name><name><surname>Yammine</surname><given-names>M</given-names></name><name><surname>Vanhille</surname><given-names>L</given-names></name><name><surname>Zacarias-Cabeza</surname><given-names>J</given-names></name><name><surname>Garibal</surname><given-names>MA</given-names></name><name><surname>Koch</surname><given-names>F</given-names></name><name><surname>Maqbool</surname><given-names>MA</given-names></name><etal/></person-group><article-title>Divergent transcription is associated with promoters of transcriptional regulators</article-title><source>BMC Genomics</source><volume>14</volume><fpage>914</fpage><year>2013</year><pub-id pub-id-type="doi">10.1186/1471-2164-14-914</pub-id><pub-id pub-id-type="pmid">24365181</pub-id><pub-id pub-id-type="pmcid">3882496</pub-id></element-citation></ref>
<ref id="b41-ijo-47-04-1393"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Magistri</surname><given-names>M</given-names></name><name><surname>Faghihi</surname><given-names>MA</given-names></name><name><surname>St Laurent</surname><given-names>G</given-names><suffix>III</suffix></name><name><surname>Wahlestedt</surname><given-names>C</given-names></name></person-group><article-title>Regulation of chromatin structure by long noncoding RNAs: Focus on natural antisense transcripts</article-title><source>Trends Genet</source><volume>28</volume><fpage>389</fpage><lpage>396</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.tig.2012.03.013</pub-id><pub-id pub-id-type="pmid">22541732</pub-id><pub-id pub-id-type="pmcid">3768148</pub-id></element-citation></ref>
<ref id="b42-ijo-47-04-1393"><label>42</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kung</surname><given-names>JT</given-names></name><name><surname>Colognori</surname><given-names>D</given-names></name><name><surname>Lee</surname><given-names>JT</given-names></name></person-group><article-title>Long noncoding RNAs: Past, present, and future</article-title><source>Genetics</source><volume>193</volume><fpage>651</fpage><lpage>669</lpage><year>2013</year><pub-id pub-id-type="doi">10.1534/genetics.112.146704</pub-id><pub-id pub-id-type="pmid">23463798</pub-id><pub-id pub-id-type="pmcid">3583990</pub-id></element-citation></ref>
<ref id="b43-ijo-47-04-1393"><label>43</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hamada</surname><given-names>H</given-names></name><name><surname>Petrino</surname><given-names>MG</given-names></name><name><surname>Kakunaga</surname><given-names>T</given-names></name></person-group><article-title>A novel repeated element with Z-DNA-forming potential is widely found in evolutionarily diverse eukaryotic genomes</article-title><source>Proc Natl Acad Sci USA</source><volume>79</volume><fpage>6465</fpage><lpage>6469</lpage><year>1982</year><pub-id pub-id-type="doi">10.1073/pnas.79.21.6465</pub-id><pub-id pub-id-type="pmid">6755470</pub-id><pub-id pub-id-type="pmcid">347147</pub-id></element-citation></ref>
<ref id="b44-ijo-47-04-1393"><label>44</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hamada</surname><given-names>H</given-names></name><name><surname>Seidman</surname><given-names>M</given-names></name><name><surname>Howard</surname><given-names>BH</given-names></name><name><surname>Gorman</surname><given-names>CM</given-names></name></person-group><article-title>Enhanced gene expression by the poly(dT-dG).poly(dC-dA) sequence</article-title><source>Mol Cell Biol</source><volume>4</volume><fpage>2622</fpage><lpage>2630</lpage><year>1984</year><pub-id pub-id-type="pmid">6098815</pub-id><pub-id pub-id-type="pmcid">369269</pub-id></element-citation></ref>
<ref id="b45-ijo-47-04-1393"><label>45</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dutreix</surname><given-names>M</given-names></name></person-group><article-title>(GT)n repetitive tracts affect several stages of RecA-promoted recombination</article-title><source>J Mol Biol</source><volume>273</volume><fpage>105</fpage><lpage>113</lpage><year>1997</year><pub-id pub-id-type="doi">10.1006/jmbi.1997.1293</pub-id><pub-id pub-id-type="pmid">9367750</pub-id></element-citation></ref>
<ref id="b46-ijo-47-04-1393"><label>46</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>W</given-names></name><name><surname>Zheng</surname><given-names>J</given-names></name><name><surname>He</surname><given-names>Y</given-names></name><name><surname>Luo</surname><given-names>C</given-names></name></person-group><article-title>Tandem repeat modification during double-strand break repair induced by an engineered TAL effector nuclease in zebrafish genome</article-title><source>PLoS One</source><volume>8</volume><fpage>e84176</fpage><year>2013</year><pub-id pub-id-type="doi">10.1371/journal.pone.0084176</pub-id></element-citation></ref>
<ref id="b47-ijo-47-04-1393"><label>47</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hui</surname><given-names>J</given-names></name><name><surname>Hung</surname><given-names>L-H</given-names></name><name><surname>Heiner</surname><given-names>M</given-names></name><name><surname>Schreiner</surname><given-names>S</given-names></name><name><surname>Neum&#x000FC;ller</surname><given-names>N</given-names></name><name><surname>Reither</surname><given-names>G</given-names></name><name><surname>Haas</surname><given-names>SA</given-names></name><name><surname>Bindereif</surname><given-names>A</given-names></name></person-group><article-title>Intronic CA-repeat and CA-rich elements: A new class of regulators of mammalian alternative splicing</article-title><source>EMBO J</source><volume>24</volume><fpage>1988</fpage><lpage>1998</lpage><year>2005</year><pub-id pub-id-type="doi">10.1038/sj.emboj.7600677</pub-id><pub-id pub-id-type="pmid">15889141</pub-id><pub-id pub-id-type="pmcid">1142610</pub-id></element-citation></ref>
<ref id="b48-ijo-47-04-1393"><label>48</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zenklusen</surname><given-names>JC</given-names></name><name><surname>Bi&#x000E8;che</surname><given-names>I</given-names></name><name><surname>Lidereau</surname><given-names>R</given-names></name><name><surname>Conti</surname><given-names>CJ</given-names></name></person-group><article-title>(C-A)n micro-satellite repeat D7S522 is the most commonly deleted region in human primary breast cancer</article-title><source>Proc Natl Acad Sci USA</source><volume>91</volume><fpage>12155</fpage><lpage>12158</lpage><year>1994</year><pub-id pub-id-type="doi">10.1073/pnas.91.25.12155</pub-id></element-citation></ref>
<ref id="b49-ijo-47-04-1393"><label>49</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mukherjee</surname><given-names>B</given-names></name><name><surname>Zhao</surname><given-names>H</given-names></name><name><surname>Parashar</surname><given-names>B</given-names></name><name><surname>Sood</surname><given-names>BM</given-names></name><name><surname>Mahadevia</surname><given-names>PS</given-names></name><name><surname>Klinger</surname><given-names>HP</given-names></name><name><surname>Vikram</surname><given-names>B</given-names></name><name><surname>Achary</surname><given-names>MP</given-names></name></person-group><article-title>Microsatellite dinucleotide (T-G) repeat: A candidate DNA marker for breast metastasis</article-title><source>Cancer Detect Prev</source><volume>27</volume><fpage>19</fpage><lpage>23</lpage><year>2003</year><pub-id pub-id-type="doi">10.1016/S0361-090X(02)00177-0</pub-id><pub-id pub-id-type="pmid">12600413</pub-id></element-citation></ref>
<ref id="b50-ijo-47-04-1393"><label>50</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Frietze</surname><given-names>S</given-names></name><name><surname>Wang</surname><given-names>R</given-names></name><name><surname>Yao</surname><given-names>L</given-names></name><name><surname>Tak</surname><given-names>YG</given-names></name><name><surname>Ye</surname><given-names>Z</given-names></name><name><surname>Gaddis</surname><given-names>M</given-names></name><name><surname>Witt</surname><given-names>H</given-names></name><name><surname>Farnham</surname><given-names>PJ</given-names></name><name><surname>Jin</surname><given-names>VX</given-names></name></person-group><article-title>Cell type-specific binding patterns reveal that TCF7L2 can be tethered to the genome by association with GATA3</article-title><source>Genome Biol</source><volume>13</volume><fpage>R52</fpage><year>2012</year><pub-id pub-id-type="doi">10.1186/gb-2012-13-9-r52</pub-id><pub-id pub-id-type="pmid">22951069</pub-id><pub-id pub-id-type="pmcid">3491396</pub-id></element-citation></ref>
<ref id="b51-ijo-47-04-1393"><label>51</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hnisz</surname><given-names>D</given-names></name><name><surname>Abraham</surname><given-names>BJ</given-names></name><name><surname>Lee</surname><given-names>TI</given-names></name><name><surname>Lau</surname><given-names>A</given-names></name><name><surname>Saint-Andr&#x000E9;</surname><given-names>V</given-names></name><name><surname>Sigova</surname><given-names>AA</given-names></name><name><surname>Hoke</surname><given-names>HA</given-names></name><name><surname>Young</surname><given-names>RA</given-names></name></person-group><article-title>Super-enhancers in the control of cell identity and disease</article-title><source>Cell</source><volume>155</volume><fpage>934</fpage><lpage>947</lpage><year>2013</year><pub-id pub-id-type="doi">10.1016/j.cell.2013.09.053</pub-id><pub-id pub-id-type="pmid">24119843</pub-id><pub-id pub-id-type="pmcid">3841062</pub-id></element-citation></ref>
<ref id="b52-ijo-47-04-1393"><label>52</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pott</surname><given-names>S</given-names></name><name><surname>Lieb</surname><given-names>JD</given-names></name></person-group><article-title>What are super-enhancers?</article-title><source>Nat Genet</source><volume>47</volume><fpage>8</fpage><lpage>12</lpage><year>2015</year><pub-id pub-id-type="doi">10.1038/ng.3167</pub-id></element-citation></ref>
<ref id="b53-ijo-47-04-1393"><label>53</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Akiyama</surname><given-names>Y</given-names></name><name><surname>Watkins</surname><given-names>N</given-names></name><name><surname>Suzuki</surname><given-names>H</given-names></name><name><surname>Jair</surname><given-names>KW</given-names></name><name><surname>van Engeland</surname><given-names>M</given-names></name><name><surname>Esteller</surname><given-names>M</given-names></name><name><surname>Sakai</surname><given-names>H</given-names></name><name><surname>Ren</surname><given-names>CY</given-names></name><name><surname>Yuasa</surname><given-names>Y</given-names></name><name><surname>Herman</surname><given-names>JG</given-names></name><etal/></person-group><article-title>GATA-4 and GATA-5 transcription factor genes and potential downstream antitumor target genes are epigenetically silenced in colorectal and gastric cancer</article-title><source>Mol Cell Biol</source><volume>23</volume><fpage>8429</fpage><lpage>8439</lpage><year>2003</year><pub-id pub-id-type="doi">10.1128/MCB.23.23.8429-8439.2003</pub-id><pub-id pub-id-type="pmid">14612389</pub-id><pub-id pub-id-type="pmcid">262684</pub-id></element-citation></ref>
<ref id="b54-ijo-47-04-1393"><label>54</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname><given-names>R</given-names></name><name><surname>Blobel</surname><given-names>GA</given-names></name></person-group><article-title>GATA transcription factors and cancer</article-title><source>Genes Cancer</source><volume>1</volume><fpage>1178</fpage><lpage>1188</lpage><year>2010</year><pub-id pub-id-type="doi">10.1177/1947601911404223</pub-id></element-citation></ref>
<ref id="b55-ijo-47-04-1393"><label>55</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Carpenter</surname><given-names>RL</given-names></name><name><surname>Lo</surname><given-names>HW</given-names></name></person-group><article-title>Hedgehog pathway and GLI1 isoforms in human cancer</article-title><source>Discov Med</source><volume>13</volume><fpage>105</fpage><lpage>113</lpage><year>2012</year><pub-id pub-id-type="pmid">22369969</pub-id><pub-id pub-id-type="pmcid">3632644</pub-id></element-citation></ref>
<ref id="b56-ijo-47-04-1393"><label>56</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Arai</surname><given-names>MA</given-names></name><name><surname>Uchida</surname><given-names>K</given-names></name><name><surname>Sadhu</surname><given-names>SK</given-names></name><name><surname>Ahmed</surname><given-names>F</given-names></name><name><surname>Ishibashi</surname><given-names>M</given-names></name></person-group><article-title>Physalin H from Solanum nigrum as an Hh signaling inhibitor blocks GLI1-DNA-complex formation</article-title><source>Beilstein J Org Chem</source><volume>10</volume><fpage>134</fpage><lpage>140</lpage><year>2014</year><pub-id pub-id-type="doi">10.3762/bjoc.10.10</pub-id><pub-id pub-id-type="pmid">24454566</pub-id><pub-id pub-id-type="pmcid">3896228</pub-id></element-citation></ref>
<ref id="b57-ijo-47-04-1393"><label>57</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Faber</surname><given-names>K</given-names></name><name><surname>Glatting</surname><given-names>KH</given-names></name><name><surname>Mueller</surname><given-names>PJ</given-names></name><name><surname>Risch</surname><given-names>A</given-names></name><name><surname>Hotz-Wagenblatt</surname><given-names>A</given-names></name></person-group><article-title>Genome-wide prediction of splice-modifying SNPs in human genes using a new analysis pipeline called AASsites</article-title><source>BMC Bioinformatics</source><volume>12</volume><issue>Suppl 4</issue><fpage>S2</fpage><year>2011</year><pub-id pub-id-type="doi">10.1186/1471-2105-12-S4-S2</pub-id><pub-id pub-id-type="pmid">21992029</pub-id><pub-id pub-id-type="pmcid">3194194</pub-id></element-citation></ref>
<ref id="b58-ijo-47-04-1393"><label>58</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname><given-names>Y</given-names></name><name><surname>Jamison</surname><given-names>DC</given-names></name></person-group><article-title>The distribution of SNPs in human gene regulatory regions</article-title><source>BMC Genomics</source><volume>6</volume><fpage>140</fpage><year>2005</year><pub-id pub-id-type="doi">10.1186/1471-2164-6-140</pub-id><pub-id pub-id-type="pmid">16209714</pub-id><pub-id pub-id-type="pmcid">1260019</pub-id></element-citation></ref>
<ref id="b59-ijo-47-04-1393"><label>59</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kotani</surname><given-names>N</given-names></name><name><surname>Asano</surname><given-names>M</given-names></name><name><surname>Iwakura</surname><given-names>Y</given-names></name><name><surname>Takasaki</surname><given-names>S</given-names></name></person-group><article-title>Knockout of mouse beta 1,4-galactosyltransferase-1 gene results in a dramatic shift of outer chain moieties of N-glycans from type 2 to type 1 chains in hepatic membrane and plasma glycoproteins</article-title><source>Biochem J</source><volume>357</volume><fpage>827</fpage><lpage>834</lpage><year>2001</year><pub-id pub-id-type="doi">10.1042/bj3570827</pub-id><pub-id pub-id-type="pmid">11463354</pub-id><pub-id pub-id-type="pmcid">1222013</pub-id></element-citation></ref></ref-list></back>
<floats-group>
<fig id="f1-ijo-47-04-1393" position="float">
<label>Figure 1</label>
<caption>
<p>Map locations of alternative promoters of divergently-paired head to head genes, <italic>B4GALT1</italic> (uc003zsg.2) and <italic>B4GALT1-AS1</italic> (uc033cop.1). (A) Chromosomal locations of the two paired genes at 9p13 are shown. (B) Intergenic and intragenic locations of alternative promoters along <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic> space at chromosome 9-NC_000009.12 is displayed.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g00.gif"/></fig>
<fig id="f2-ijo-47-04-1393" position="float">
<label>Figure 2</label>
<caption>
<p>Map location of TR1-PE1 regulatory complex compared with locations of <italic>B4GALT1</italic> and <italic>B4GALT1-AS1</italic> reverse complement (B4-AS1 RC), TR1 and PE1.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g01.gif"/></fig>
<fig id="f3-ijo-47-04-1393" position="float">
<label>Figure 3</label>
<caption>
<p>Distribution of TFBS and TSS along the TR1-PE1 regulatory complex. (A) The TR1-PE1 sequence, the underlined sequences and highlighted bases indicate TFBS and TSS, respectively. (B) Schematic presentation of BRE, INR rich regions and the six TSS (bent arrows) along TR1-PE1 regulatory sequence.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g02.gif"/></fig>
<fig id="f4-ijo-47-04-1393" position="float">
<label>Figure 4</label>
<caption>
<p>Plot of the dinucleotide base-stacking energy values (y-axis) along the TR1-PE1 regulatory sequence. A, B, C and D in the inset represent the locations of INR, INR, TATA-532 and BRE, respectively.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g03.gif"/></fig>
<fig id="f5-ijo-47-04-1393" position="float">
<label>Figure 5</label>
<caption>
<p>The GC skew (y-axis) along the <italic>B4GALT1</italic> TR1-PE1 promoter sequence (x-axis). The two bent arrows represent six transcription start sites in the bidirectional promoter.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g04.gif"/></fig>
<fig id="f6-ijo-47-04-1393" position="float">
<label>Figure 6</label>
<caption>
<p>The two (TG)<sub>18</sub> tandem repeats and TGAS sequence at the 5&#x02032; regulatory region of <italic>B4GALT1</italic>. The two (TG)<sub>18</sub> sequences are identified in HP3 intronic alternative promoter located at chr9: 33156756-33158354 close to TR1-PE1 (chr9: 33166901-33168354). Highlighted region is overlapping HP5 sequence, the two TGAS sequences are underlined, the numbers in square brackets show map location of part of the HP3 sequence, which harbor the TG repeats and TGAS sequence.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g05.gif"/></fig>
<fig id="f7-ijo-47-04-1393" position="float">
<label>Figure 7</label>
<caption>
<p>Enhancers, open chromatin and CTCF binding sites in the regulatory regions of <italic>B4GALT</italic> family. The data were extracted from the Ensembl-regulation database of multi-cell regulatory features.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g06.gif"/></fig>
<fig id="f8-ijo-47-04-1393" position="float">
<label>Figure 8</label>
<caption>
<p>Blast tree map of 58-bp sequence composed from (TG)<sub>18</sub> and the TGAS of 22 bp. The data imply that the two sequences (TG)<sub>18</sub> and the TGAS are highly conserved during evolution.</p></caption>
<graphic xlink:href="IJO-47-04-1393-g07.gif"/></fig>
<table-wrap id="tI-ijo-47-04-1393" position="float">
<label>Table I</label>
<caption>
<p>Map locations and number of promoters of seven <italic>B4GALT</italic> genes.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left"><italic>B4GALT</italic> genes</th>
<th valign="bottom" align="center">GenBank ID</th>
<th valign="bottom" align="center">Map location</th>
<th valign="bottom" align="center">Promoters</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="right">2683</td>
<td valign="top" align="left">chr9: 33,104,08233,167,356 Reverse strand</td>
<td valign="top" align="right">13</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT2</italic></td>
<td valign="top" align="right">8704</td>
<td valign="top" align="left">chr1: 43,978,94343,991,170 Forward strand</td>
<td valign="top" align="right">16</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT3</italic></td>
<td valign="top" align="right">8703</td>
<td valign="top" align="left">chr1: 161,171,310161,177,968 Reverse strand</td>
<td valign="top" align="right">11</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT4</italic></td>
<td valign="top" align="right">8702</td>
<td valign="top" align="left">chr3: 119,211,732119,241,103 Reverse strand</td>
<td valign="top" align="right">11</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT5</italic></td>
<td valign="top" align="right">9334</td>
<td valign="top" align="left">chr20: 49,632,94549,713,878 Reverse strand</td>
<td valign="top" align="right">11</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT6</italic></td>
<td valign="top" align="right">9331</td>
<td valign="top" align="left">chr18: 31,622,24731,685,836 Reverse strand</td>
<td valign="top" align="right">9</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT7</italic></td>
<td valign="top" align="right">11285</td>
<td valign="top" align="left">chr5: 177,600,100177,610,347 Forward strand</td>
<td valign="top" align="right">9</td></tr></tbody></table></table-wrap>
<table-wrap id="tII-ijo-47-04-1393" position="float">
<label>Table II</label>
<caption>
<p>Genomic contexts of <italic>B4GALT1-AS1</italic>, <italic>B4GALT4-AS1</italic>, <italic>B4GALT6-AS1</italic> and <italic>B4GALT7-AS1</italic> transcripts.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left"><italic>B4GALT</italic> genes</th>
<th valign="bottom" align="center">Location</th>
<th valign="bottom" align="center"><italic>B4GALT</italic> antisense</th>
<th valign="bottom" align="center">Location</th>
<th valign="bottom" align="center">Locus space (bp)</th>
<th valign="bottom" align="center">Transcript ID</th>
<th valign="bottom" align="center">Length (bp)</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="center">Chr9: 33110641-33167358<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT1-AS1</italic></td>
<td valign="top" align="center">Chr9: 33166946-33179981<break/>Forward strand</td>
<td valign="top" align="center">13036</td>
<td valign="top" align="center">NR_108110.1</td>
<td valign="top" align="center">808</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="center">Chr9: 33110641-33167358<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT1-AS1</italic></td>
<td valign="top" align="center">Chr9: 33166946-33179981<break/>Forward strand</td>
<td valign="top" align="center">13036</td>
<td valign="top" align="center">NR_108109.1</td>
<td valign="top" align="center">932</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="center">Chr9: 33110641-33167358<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT1-AS1</italic></td>
<td valign="top" align="center">Chr9: 33167857-33179981<break/>Forward strand</td>
<td valign="top" align="center">12125</td>
<td valign="top" align="center">NR_108108.1</td>
<td valign="top" align="center">1185</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="center">Chr9: 33,104,082-33,167,356<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT1-AS</italic>1</td>
<td valign="top" align="center">Chr9: 33166975-33179710<break/>Forward strand</td>
<td valign="top" align="center">12736</td>
<td valign="top" align="center">ENST00000426270</td>
<td valign="top" align="center">508</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="center">Chr9: 33104082-33167356<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT1-AS1</italic></td>
<td valign="top" align="center">Chr9: 33166975-33179983<break/>Forward strand</td>
<td valign="top" align="center">13009</td>
<td valign="top" align="center">ENST00000442432</td>
<td valign="top" align="center">843</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT4</italic></td>
<td valign="top" align="center">Chr3: 119211732-119241103<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT4-AS</italic></td>
<td valign="top" align="center">Chr3: 119226486-119290666<break/>Forward strand</td>
<td valign="top" align="center">64181</td>
<td valign="top" align="center">ENST00000470790</td>
<td valign="top" align="center">544</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT6</italic></td>
<td valign="top" align="center">Chr18: 31622247-31685836<break/>Reverse strand</td>
<td valign="top" align="center"><italic>B4GALT6-AS</italic></td>
<td valign="top" align="center">Chr18: 31685655-31686823<break/>Forward strand</td>
<td valign="top" align="center">1169</td>
<td valign="top" align="center">ENSG00000259985</td>
<td valign="top" align="center">1169</td></tr>
<tr>
<td valign="top" align="left"><italic>B4GALT7</italic></td>
<td valign="top" align="center">Chr5: 177600100-177610347<break/>Forward strand</td>
<td valign="top" align="center"><italic>B4GALT7-AS</italic></td>
<td valign="top" align="center">Chr5: 177611253-177619754<break/>Forward strand</td>
<td valign="top" align="center">8502</td>
<td valign="top" align="center">ENST00000499900</td>
<td valign="top" align="center">1751</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-ijo-47-04-1393">
<p>The data were mined from NCBI-Gene, NCBI-Nucleotide, UCSC and Ensembl databases.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="tIII-ijo-47-04-1393" position="float">
<label>Table III</label>
<caption>
<p>Map locations and CGIs of thirteen <italic>B4GALT1</italic> alternative promoters.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left">Alternative promoter source</th>
<th valign="bottom" align="center">Symbol</th>
<th valign="bottom" align="center">Map locations</th>
<th valign="bottom" align="center">Span (bp)</th>
<th valign="bottom" align="center">Obs/Exp CpG</th>
<th valign="bottom" align="center">&#x00025;G+C</th>
<th valign="bottom" align="center">CGI</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left">TRED-42895</td>
<td valign="top" align="center">TR1</td>
<td valign="top" align="center">chr9: 33166901-33167900</td>
<td valign="top" align="right">1000</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="left">73.3</td>
<td valign="top" align="center">+</td></tr>
<tr>
<td valign="top" align="left">TRED-113955</td>
<td valign="top" align="center">TR2</td>
<td valign="top" align="center">chr9: 33110342-33111341</td>
<td valign="top" align="right">1000</td>
<td valign="top" align="center">0.22</td>
<td valign="top" align="left">45.3</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">MPromDB- HG_ACW: 80083</td>
<td valign="top" align="center">HP1</td>
<td valign="top" align="center">chr9: 33103518-33104577</td>
<td valign="top" align="right">1060</td>
<td valign="top" align="center">0.13</td>
<td valign="top" align="left">49.57</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">MPromDB-HG_ACW: 80090</td>
<td valign="top" align="center">HP2</td>
<td valign="top" align="center">chr9: 33124856-33126364</td>
<td valign="top" align="right">1509</td>
<td valign="top" align="center">0.19</td>
<td valign="top" align="left">49.86</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">MPromDB-HG_KWN: 62900</td>
<td valign="top" align="center">HP3</td>
<td valign="top" align="center">chr9: 33156756-33158354</td>
<td valign="top" align="right">1599</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="left">39.59</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">MPromDB-HG_ACW:80090</td>
<td valign="top" align="center">HP4</td>
<td valign="top" align="center">chr9: 33125681-33126181</td>
<td valign="top" align="right">501</td>
<td valign="top" align="center">0.13</td>
<td valign="top" align="left">50.8</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">MPromDB-HG_KWN:62900</td>
<td valign="top" align="center">HP5</td>
<td valign="top" align="center">chr9: 33157101-33157692</td>
<td valign="top" align="right">592</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="left">37.4</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">Choi <italic>et al</italic> (<xref rid="b6-ijo-47-04-1393" ref-type="bibr">6</xref>)</td>
<td valign="top" align="center">CP1</td>
<td valign="top" align="center">chr9: 33167181-33167680</td>
<td valign="top" align="right">500</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="left">76.6</td>
<td valign="top" align="center">+</td></tr>
<tr>
<td valign="top" align="left">EPD</td>
<td valign="top" align="center">EP1</td>
<td valign="top" align="center">chr9: 33167235-33167834</td>
<td valign="top" align="right">600</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="left">74.2</td>
<td valign="top" align="center">+</td></tr>
<tr>
<td valign="top" align="left">PEPED</td>
<td valign="top" align="center">PE1</td>
<td valign="top" align="center">chr9: 33167354-33168354</td>
<td valign="top" align="right">1001</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="left">69.2</td>
<td valign="top" align="center">+</td></tr>
<tr>
<td valign="top" align="left">Ensembl</td>
<td valign="top" align="center">ES1</td>
<td valign="top" align="center">chr9: 33128098-33128697</td>
<td valign="top" align="right">600</td>
<td valign="top" align="center">0.11</td>
<td valign="top" align="left">49.3</td>
<td valign="top" align="center">&#x02212;</td></tr>
<tr>
<td valign="top" align="left">Ensembl</td>
<td valign="top" align="center">ES2</td>
<td valign="top" align="center">chr9: 33167085-33167684</td>
<td valign="top" align="right">600</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="left">76</td>
<td valign="top" align="center">+</td></tr>
<tr>
<td valign="top" align="left">Ensembl</td>
<td valign="top" align="center">ES3</td>
<td valign="top" align="center">chr9: 33167255-33167854</td>
<td valign="top" align="right">600</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="left">74.2</td>
<td valign="top" align="center">+</td></tr>
<tr>
<td valign="top" align="left">(This study)</td>
<td valign="top" align="center">TR1-PE1</td>
<td valign="top" align="center">chr9: 33166901-33168354</td>
<td valign="top" align="right">1454</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="left">67.9</td>
<td valign="top" align="center">+</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn2-ijo-47-04-1393">
<p>+ and &#x02212; indicate presence and absence of CGI, respectively, in the alternative promoters.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="tIV-ijo-47-04-1393" position="float">
<label>Table IV</label>
<caption>
<p>The map locations of 116-bp sequence<xref rid="tfn3-ijo-47-04-1393" ref-type="table-fn">a</xref> composed of two (TG)<sub>18</sub> repeats and two TGAS sequences in six human chromosomes.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="bottom" align="left">Chr</th>
<th valign="bottom" align="center">Strand</th>
<th valign="bottom" align="center">Map location</th>
<th valign="bottom" align="center">Identified sequences<xref rid="tfn4-ijo-47-04-1393" ref-type="table-fn">b</xref></th>
<th valign="bottom" align="center">Span</th>
<th valign="bottom" align="center">Score</th></tr></thead>
<tbody>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">Reverse</td>
<td valign="top" align="left">chr9: 33157043-33157158</td>
<td valign="top" align="left"><bold>TGTGTGTGTGTGTGTGTGTGTGTGTGTGT</bold><break/><bold>GTGTGTG</bold><underline>TAGTTCCCTATGCTGGACACC</underline><underline><italic>G</italic></underline><break/><bold>TGTGTGTGTGTGTGTGTGTGTGTGTGTGT</bold><break/><bold>GTGTGTG</bold><underline>TAGTTCCCTATGCTGGACACC</underline><underline><italic>A</italic></underline></td>
<td valign="top" align="right">116</td>
<td valign="top" align="center">116/116</td></tr>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">Forward</td>
<td valign="top" align="left">chr1: 5387048-5387087</td>
<td valign="top" align="left"><bold>TGTG</bold><italic>G</italic><bold>TGT</bold><bold><italic>C</italic></bold><bold>TGTGT</bold><bold><italic>G</italic></bold><bold>GTGTGTGTG</bold><bold><italic>G</italic></bold><bold>GTGT</bold><break/><bold>GTGTGTG</bold><underline>TAGT</underline></td>
<td valign="top" align="right">40</td>
<td valign="top" align="center">33/116</td></tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">Forward</td>
<td valign="top" align="left">chr5: 26315000-26315046</td>
<td valign="top" align="left"><bold>GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT</bold><break/><bold>GTGTGTG</bold><underline>TAGTTCCCTA</underline></td>
<td valign="top" align="right">47</td>
<td valign="top" align="center">47/116</td></tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="left">Reverse</td>
<td valign="top" align="left">chr12: 96164336-96164418</td>
<td valign="top" align="left"><bold>TGTG</bold><bold><italic>G</italic></bold><bold>TGTGTGTGTGTG</bold><bold><italic>G</italic></bold><bold>TGTGTGTG</bold><bold><italic>AA</italic></bold><bold>TG</bold><break/><bold><italic>CG</italic></bold><bold>GTGTGTGTGTGT</bold><bold><italic>AG</italic></bold><bold>TGTGTGTG</bold><bold><italic>AA</italic></bold><bold>TGTG</bold><break/><bold><italic>G</italic></bold><bold>TGTGTGTGTGTGTGTGTG</bold><underline>TAGT</underline></td>
<td valign="top" align="right">83</td>
<td valign="top" align="center">67/116</td></tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="left">Reverse</td>
<td valign="top" align="left">chr12: 23831951-23831984</td>
<td valign="top" align="left"><bold>TGTGTGTGTGTGTGTGTGTGTGTGTG</bold><underline>TAGTTCCC</underline></td>
<td valign="top" align="right">34</td>
<td valign="top" align="center">34/116</td></tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="left">Reverse</td>
<td valign="top" align="left">chr12: 96164379-96164418</td>
<td valign="top" align="left"><bold>TGTG</bold><bold><italic>G</italic></bold><bold>TGTGTGTGTGTG</bold><bold><italic>G</italic></bold><bold>TGTGTGTG</bold><bold><italic>AA</italic></bold><bold>TG</bold><bold><italic>CG</italic></bold><bold>GTGTGTGT</bold></td>
<td valign="top" align="right">40</td>
<td valign="top" align="center">29/116</td></tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="left">Reverse</td>
<td valign="top" align="left">chr13: 20498045-20498093</td>
<td valign="top" align="left"><bold>TGTGTGT</bold><bold><italic>A</italic></bold><bold>TGTGTG</bold><bold><italic>G</italic></bold><bold>TGTGTGTGT</bold><bold><italic>AG</italic></bold><bold>TGTG</bold><bold><italic>TATAG</italic></bold><break/><bold><italic>GG</italic></bold><bold>TGTGTGTG</bold><underline>TAGTT</underline></td>
<td valign="top" align="right">49</td>
<td valign="top" align="center">37/116</td></tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="left">Forward</td>
<td valign="top" align="left">chr14:90274778-90274824</td>
<td valign="top" align="left"><bold>TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG</bold><break/><bold>TGTGTGTGTGTGTGTG</bold><underline>TAGTTCCCTAT</underline></td>
<td valign="top" align="right">61</td>
<td valign="top" align="center">61/116</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn3-ijo-47-04-1393">
<label>a</label>
<p>The 116-bp sequence was originally identified and located at 5&#x02032;-<italic>B4GALT1</italic> regulatory region in reverse strand of chromosome 9.</p></fn><fn id="tfn4-ijo-47-04-1393">
<label>b</label>
<p>Blat tool was used to search for the 116 nucleotides sequence in the human genome hg38. The (TG)<sub>18</sub> repeats are shown in bold letters, the TGAS sequences are underlined, the italicezed letters show the polymorphisms, insertion and gaps in the sequences.</p></fn></table-wrap-foot></table-wrap></floats-group></article>
