<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="review-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Molecular Medicine Reports</journal-id>
<journal-title-group>
<journal-title>Molecular Medicine Reports</journal-title>
</journal-title-group>
<issn pub-type="ppub">1791-2997</issn>
<issn pub-type="epub">1791-3004</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/mmr.2018.9092</article-id>
<article-id pub-id-type="publisher-id">mmr-18-02-1225</article-id>
<article-categories>
<subj-group>
<subject>Review</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Computational approaches for predicting key transcription factors in targeted cell reprogramming</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Guerrero-Ramirez</surname><given-names>Guillermo-Issac</given-names></name>
<xref rid="af1-mmr-18-02-1225" ref-type="aff"/></contrib>
<contrib contrib-type="author"><name><surname>Valdez-Cordoba</surname><given-names>Cesar-Miguel</given-names></name>
<xref rid="af1-mmr-18-02-1225" ref-type="aff"/></contrib>
<contrib contrib-type="author"><name><surname>Islas-Cisneros</surname><given-names>Jose-Francisco</given-names></name>
<xref rid="af1-mmr-18-02-1225" ref-type="aff"/></contrib>
<contrib contrib-type="author"><name><surname>Trevino</surname><given-names>Victor</given-names></name>
<xref rid="af1-mmr-18-02-1225" ref-type="aff"/>
<xref rid="c1-mmr-18-02-1225" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-mmr-18-02-1225">Tecnol&#x00F3;gico de Monterrey, Escuela de Medicina, Monterrey, Nuevo Le&#x00F3;n 64710, M&#x00E9;xico</aff>
<author-notes>
<corresp id="c1-mmr-18-02-1225"><italic>Correspondence to</italic>: Dr Victor Trevino, Tecnol&#x00F3;gico de Monterrey, Escuela de Medicina, 3000 Av Morones Prieto, Colonia Los Doctores, Monterrey, Nuevo Le&#x00F3;n 64710, M&#x00E9;xico, E-mail: <email>vtrevino@itesm.mx</email></corresp>
</author-notes>
<pub-date pub-type="ppub"><month>08</month><year>2018</year></pub-date>
<pub-date pub-type="epub"><day>29</day><month>05</month><year>2018</year></pub-date>
<volume>18</volume>
<issue>2</issue>
<fpage>1225</fpage>
<lpage>1237</lpage>
<history>
<date date-type="received"><day>26</day><month>09</month><year>2017</year></date>
<date date-type="accepted"><day>27</day><month>02</month><year>2018</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Guerrero-Ramirez et al.</copyright-statement>
<copyright-year>2018</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>There is a need for specific cell types in regenerative medicine and biological research. Frequently, specific cell types may not be easily obtained or the quantity obtained is insufficient for study. Therefore, reprogramming by the direct conversion (transdifferentiation) or re-induction of induced pluripotent stem cells has been used to obtain cells expressing similar profiles to those of the desired types. For this purpose, a specific cocktail of transcription factors (TFs) is required for induction. Nevertheless, identifying the correct combination of TFs is difficult. Although certain computational approaches have been proposed for this task, their methods are complex, and corresponding implementations are difficult to use and generalize for specific source or target cell types. In the present review, four computational approaches that have been proposed to obtain likely TFs were compared and discussed. A simplified view of the computational complexity of these methods is provided that consists of three basic ideas: i) The definition of target and non-target cell types; ii) the estimation of candidate TFs; and iii) filtering candidates. This simplified view was validated by analyzing a well-documented cardiomyocyte differentiation. Subsequently, these reviewed methods were compared when applied to an unknown differentiation of corneal endothelial cells. The generated results may provide important insights for laboratory assays. Data and computer scripts that may assist with direct conversions in other cell types are also provided.</p>
</abstract>
<kwd-group>
<kwd>transcription factors</kwd>
<kwd>cell reprogramming</kwd>
<kwd>computational methods</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>In tissue engineering and regenerative medicine, there is a need for large quantities of specific cell types (<xref rid="b1-mmr-18-02-1225" ref-type="bibr">1</xref>,<xref rid="b2-mmr-18-02-1225" ref-type="bibr">2</xref>). For example, in corneal disease the use of transplants is essential, although access to corneal tissues is difficult given the shortage of tissue donors. Therefore, an alternative for generating specific corneal cells is needed (<xref rid="b3-mmr-18-02-1225" ref-type="bibr">3</xref>). Furthermore, specific cell types are required in research for characterization, including studies on responses to treatment or genetic regulatory networks (<xref rid="b4-mmr-18-02-1225" ref-type="bibr">4</xref>&#x2013;<xref rid="b7-mmr-18-02-1225" ref-type="bibr">7</xref>). For these needs, stem cell technologies hold the promise of providing a sufficient number of cells of specialized lineages (<xref rid="b2-mmr-18-02-1225" ref-type="bibr">2</xref>). Such promise is based on certain factors, including the fact that cell differentiation may be reversed, that somatic cells may be induced to be pluripotent, or that cells may be forced to alter their identity or to transdifferentiate (<xref rid="b8-mmr-18-02-1225" ref-type="bibr">8</xref>).</p>
<p>In this context, cell identity or cell state is thought to be a highly regulated process that depends on their epigenetic and transcriptional programming (<xref rid="b9-mmr-18-02-1225" ref-type="bibr">9</xref>). The cell state is defined as the transcriptional output of a gene regulatory network (<xref rid="b10-mmr-18-02-1225" ref-type="bibr">10</xref>). Thus, the cell state is principally controlled by the expression of transcription factors (TFs) forming specific network modules to ensure stable gene expression (<xref rid="b7-mmr-18-02-1225" ref-type="bibr">7</xref>). However, genome analyses have identified approximately 2,000 TFs, and it is known that approximately one-half are expressed in a given cell (<xref rid="b11-mmr-18-02-1225" ref-type="bibr">11</xref>). Thus, there is a requirement to elucidate which and how many TFs define specific cell states. The majority of the current literature in stem cells suggests that only a few TFs are required to maintain cell identity (<xref rid="b7-mmr-18-02-1225" ref-type="bibr">7</xref>,<xref rid="b12-mmr-18-02-1225" ref-type="bibr">12</xref>&#x2013;<xref rid="b14-mmr-18-02-1225" ref-type="bibr">14</xref>). For example, only four TFs (MYC proto-oncogene bHLH transcription factor, Kruppel like factor 4, SRY-box 2 and POU class 5 homeobox 1) are required to maintain the pluripotency state (<xref rid="b8-mmr-18-02-1225" ref-type="bibr">8</xref>,<xref rid="b15-mmr-18-02-1225" ref-type="bibr">15</xref>). These factors were identified from serial rounds of gene inclusion and withdrawal from a pool of 24 potential genes selected from studies performed on isolated genes. From this seminal work, other research groups identified several TFs for direct conversion (<xref rid="b16-mmr-18-02-1225" ref-type="bibr">16</xref>&#x2013;<xref rid="b18-mmr-18-02-1225" ref-type="bibr">18</xref>). 
For example, glutamic-oxaloacetic acid transaminase 1 was used to convert fibroblasts into functional neurons (<xref rid="b16-mmr-18-02-1225" ref-type="bibr">16</xref>) while GATA-binding protein 4 (<italic>GATA4</italic>), myocyte enhancer factor 2C (<italic>MEF2C</italic>) and T-box 5 (<italic>TBX5</italic>) were used to convert fibroblasts into cardiomyocytes (<xref rid="b17-mmr-18-02-1225" ref-type="bibr">17</xref>). Moreover, alternative combinations of TFs may lead to very similar cell types (<xref rid="b18-mmr-18-02-1225" ref-type="bibr">18</xref>), suggesting that redundancy exists in which the genetic regulatory networks characteristic of the cell identity may be established by similar or equivalent combinations of TFs.</p>
<p>Thus, if a cell state can be defined by a combination of TFs, in theory, any source cell type may be converted into any target cell type by establishing the expression of those TFs. If the differences in expression between the source and target cells are very small, one may consider subtle methods based on stimulating or blocking connected pathways. If the differences are large, as is commonly the case in converting fibroblasts to a lineage-distant cell type, one may opt to force expression by transdifferentiation or direct conversion (<xref rid="b19-mmr-18-02-1225" ref-type="bibr">19</xref>&#x2013;<xref rid="b21-mmr-18-02-1225" ref-type="bibr">21</xref>) or via the generation of induced pluripotent stem cells (iPSCs) followed by the induction of the target cell type (<xref rid="b13-mmr-18-02-1225" ref-type="bibr">13</xref>).</p>
<p>For other specific cell types, it is necessary to identify how candidate TFs may be obtained to begin with or how alternative TFs may be obtained. In the present review, the focus is on providing simplified views of the computational approaches that have been proposed to identify a set or sets of putative TFs likely to control the cell state of the desired cell type. This proposed view may be highly illustrative for non-bioinformatics specialists for a number of reasons. Firstly, previously proposed computational methods are complex. Secondly, the literature accompanying the computational methods is highly technical. Thirdly, the descriptions of certain methods may appear vague for non-specialists. Fourthly, certain data (specifically, the networks) or computer scripts and tools described in the algorithms are currently unavailable, complicating re-implementations. Finally, the majority of approaches were proposed using ad-hoc parameters and specific datasets. In addition, for bioinformatics specialists, the present review provides a succinct starting point for novel implementations. To overcome the aforementioned difficulties, a simplified and unified view of current methods is provided, which may be summarized thus: i) The establishment of the population of cell types; ii) the estimation of candidate TFs from cell populations; and iii) the filtering of TF pre-candidates (the most challenging element). Derived from these summarized concepts, clues as to how the methods work are provided, in addition to knowledge as to how to overcome or approach difficulties. Possible ways in which these computational methods may be re-implemented and adapted to provide a preliminary list of TFs are additionally provided.</p>
</sec>
<sec>
<label>2.</label>
<title>Identifying key cell-state transcription factors</title>
<p>The idea that cell states are associated with the binary decision of cell fates has long been proposed (<xref rid="b22-mmr-18-02-1225" ref-type="bibr">22</xref>). However, computational approaches to identify key TFs governing cell states are more recent. In practice, an aim may be to directly convert a specific source cell type into a target cell type; therefore, the most important component is the estimation of the target cell state, since the state of the source cell type may be forced to change. The source cell type is important to be able to estimate those TFs that may be redundant and perhaps do not require manipulation; this may be easily performed by comparing expression levels. Therefore, the majority of methods primarily focus on the estimation of TFs controlling the target cell state. The following sections consider the approaches of recent studies (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>&#x2013;<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>), which are accordingly referred to as Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>), D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>), Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>) and Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>).</p>
<sec>
<title/>
<sec>
<title>Identification of TFs via differential expression</title>
<p>Under the assumption that the cell identity is controlled by the gene expression level of a specific set of TFs, it follows that the identity of cell types is controlled by either different levels of the same set of TFs or a different set of TFs (<xref rid="b7-mmr-18-02-1225" ref-type="bibr">7</xref>). In any case, the same operation is needed: The identification of the characteristic and distinct gene expression levels. This is best known as differential expression. Since this operation involves the comparison between at least two populations assumed to be distinct, the target cell type population and the &#x2018;background&#x2019; population require careful selection. In theory, if these populations are well defined and the available data are highly representative and precise, it ought to be possible to create a small list of TFs. However, even today, the available data are scarce, highly noisy and contaminated with different populations of cells; the data from <italic>in vitro</italic> assays may not reflect genuine <italic>in vivo</italic> properties; and the computational and statistical tools may be imperfect. Therefore, the differential expression analysis between the defined cell populations usually generates large lists of pre-candidate TFs.</p>
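<p>As a minimal sketch of this operation, and not the procedure of any of the reviewed methods, each TF may be scored by its difference in mean expression and by a Welch t-statistic between the target and background populations. The expression values are assumed to be on a log2 scale, and the function names, the <monospace>TF_X</monospace> label and all numbers are hypothetical illustrations.</p>
<preformat>
```python
from math import sqrt
from statistics import mean, variance


def welch_t(target, background):
    """Welch t-statistic for two lists of log2 expression values."""
    se = sqrt(variance(target) / len(target) + variance(background) / len(background))
    return (mean(target) - mean(background)) / se


def rank_tfs(expr_target, expr_background):
    """Return (TF, log2 fold-change, t) tuples sorted by decreasing absolute t."""
    rows = []
    for tf in expr_target:
        t_vals, b_vals = expr_target[tf], expr_background[tf]
        rows.append((tf, mean(t_vals) - mean(b_vals), welch_t(t_vals, b_vals)))
    return sorted(rows, key=lambda r: abs(r[2]), reverse=True)


# Hypothetical log2 expression values (not taken from any dataset).
target = {
    "GATA4": [9.1, 8.8, 9.4],
    "TBX5": [7.9, 8.2, 8.0],
    "TF_X": [5.0, 5.3, 4.9],
}
background = {
    "GATA4": [4.2, 4.5, 4.0, 4.4],
    "TBX5": [4.9, 5.1, 5.3, 5.0],
    "TF_X": [5.1, 4.8, 5.2, 5.0],
}
ranking = rank_tfs(target, background)
for tf, lfc, t in ranking:
    print(f"{tf}\tlog2FC={lfc:.2f}\tt={t:.2f}")
```
</preformat>
<p>In this toy example the unchanged TF is ranked last. With real data, a P-value and a multiple-testing correction would additionally be required before a pre-candidate list is drawn.</p>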
</sec>
<sec>
<title>Filtering problem</title>
<p>Assuming that the number of TFs controlling the cell identity is small, this large list of pre-candidate TFs ought to be highly contaminated with false-positive calls representing cell-state-irrelevant TFs that require filtering out. Although certain irrelevant TFs may be easily identified by expert researchers and available biological knowledge in the literature, this process is time-consuming and may be prone to misinterpretations, errors and omissions. In addition, certain TFs may not be well studied or studied at all. Furthermore, manual filtering of the list causes difficulties in the scoring or ranking of TFs according to the scientific literature. Therefore, the systematic filtering and ranking of pre-candidate TFs is a challenging issue. This filtering process is obscured in original research articles due to the complexity of their implementations. The majority of the considered methods perform this filtering procedure by analyzing the TFs within the context of biological networks. Although this may be considered to be a drawback by non-bioinformatics specialists, this step need not be very complicated to help to reduce large lists. In particular, within the examples provided, even when no filtering is used, sensible results may be obtained if target and non-target cell populations are well defined.</p>
<p>In summary, the proposed view of the process to identify TFs likely controlling a cell state is shown in <xref rid="f1-mmr-18-02-1225" ref-type="fig">Fig. 1</xref> and is discussed in the following sections. In practice, it may be advisable to start with a specific source cell type for induction to a target cell type, whereas the majority of methods focus on the target cell type to identify TFs associated with the cell state (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>&#x2013;<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>). Thus, once the cell types have been identified, as depicted in <xref rid="f1-mmr-18-02-1225" ref-type="fig">Fig. 1</xref>, the TF expression profile of the source cell type is compared with the target to identify those TFs required to induce from that particular source.</p>
</sec>
<sec>
<title>Defining the populations of cell types</title>
<p>The first step consists of defining at least two populations of cell types (<xref rid="f1-mmr-18-02-1225" ref-type="fig">Fig. 1A</xref>), which are referred to as target and non-target cell types. A comparison of the conceptual definitions used by the authors is shown in <xref rid="f2-mmr-18-02-1225" ref-type="fig">Fig. 2</xref> and discussed in the following paragraphs.</p>
</sec>
<sec>
<title>Datasets used</title>
<p>Gene expression data are required to be uniformly annotated for target and non-target cell types. Therefore, the majority of methods utilize information from the vast collections of microarray gene expression data available from the Gene Expression Omnibus (GEO) (<xref rid="b27-mmr-18-02-1225" ref-type="bibr">27</xref>,<xref rid="b28-mmr-18-02-1225" ref-type="bibr">28</xref>) and ArrayExpress (<xref rid="b29-mmr-18-02-1225" ref-type="bibr">29</xref>,<xref rid="b30-mmr-18-02-1225" ref-type="bibr">30</xref>), or from more recent next-generation sequence repositories in ENCODE (<xref rid="b31-mmr-18-02-1225" ref-type="bibr">31</xref>) or FANTOM (<xref rid="b6-mmr-18-02-1225" ref-type="bibr">6</xref>). The repositories used are detailed in <xref rid="tI-mmr-18-02-1225" ref-type="table">Table I</xref>. The majority of the studies discussed in the present review used GEO microarray data, except Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>), who used FANTOM5. They studied human data, although Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) additionally included murine data. The majority of the studies included numerous cell types; however, Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) used progenitor and daughter cell types from specific third-party authors.</p>
</sec>
<sec>
<title>Target cell type</title>
<p>For the target population, which is generally the easiest to delimit, a number of considerations are noteworthy. First, the target cell type is required to be well represented. From the authors reviewed herein, various experiments were performed <italic>in vitro</italic>, while others have been obtained from tissue samples. The experiments performed <italic>in vitro</italic> have the advantage of a well-defined cell type, whilst the tissue samples may represent a mixture of distinct cell types generating an average cell state that may not properly represent the desired target. Second, the gene expression data may reflect the cell state of an individual donor instead of a population-generalizable cell state. Thus, it is desirable to include as many individuals as possible. Third, repetition is desirable as gene expression data are noisy, which is worsened by the technology used to acquire the data (particularly microarrays). In summary, the targets used for each method are mentioned in <xref rid="tI-mmr-18-02-1225" ref-type="table">Table I</xref>. Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>) used a hierarchical ontology definition of cell types from FANTOM5 to define a particular target cell type; they ignored closely associated cell types (<xref rid="f2-mmr-18-02-1225" ref-type="fig">Fig. 2</xref>). In this way, they favored the purity of the cell state. However, they lost generality as closely associated cell types may help to eliminate non-specific TFs, leading to larger lists of pre-candidate TFs if their implementation is not followed thoroughly. On the contrary, Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) used a specific cell type contrasted with the closest associated cell types (daughter cell types; <xref rid="f2-mmr-18-02-1225" ref-type="fig">Fig. 2</xref>). 
This has a number of advantages since the comparison of close, although distinct, cell types may lead to the clear identification of controlling TFs. However, this method is unable to be generalized as the type of experimental setting (well-defined progenitor and daughter cell types) required to run this approach is not as common in the data repositories and must be performed in advance to generate the data. D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) and Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) first defined a number of classes of tissues or cell types and compared each class against the remaining classes (<xref rid="f2-mmr-18-02-1225" ref-type="fig">Fig. 2</xref>). In each class, they used numerous samples, avoiding individual and noise effects.</p>
</sec>
<sec>
<title>Non-target cell types</title>
<p>Following removal of the target data, the non-target data are commonly obtained from the remaining tissue or cell types of the defined datasets (<xref rid="tI-mmr-18-02-1225" ref-type="table">Table I</xref>). Nevertheless, Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>) removed distantly related samples, probably due to a highly-curated cell lineage ontology. This has the advantage of removing false differentially expressed TFs that may control specialized functions in distant and target cell types, presumably via an upregulated TF. Nevertheless, this concept is only useful if the TF differential scoring depends on downregulated TFs, as in Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>). Therefore, the removal of distant cell types may be redundant if only upregulated TFs are considered and there are no large combinational effects in TFs. In addition, the threshold required to determine distance is hard to define, complicating further tests in diverse scenarios. In Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>), the target was one of the daughter cell types, and therefore the non-target was formed by the progenitor and the sister cell type. An issue with using large collections of samples in the non-target is that the number of samples per cell type may be highly disproportionate, so that certain cell types are overrepresented. To avoid this overrepresentation, in D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>), the non-target dataset was balanced by selecting a representative sample from the collection of samples of each cell type.</p>
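<p>The balancing idea may be sketched as follows. The selection criterion used here, the sample closest to the per-gene median profile of its cell type, is an assumption made for illustration only; the criterion actually used by D&#x0027;Alessio <italic>et al</italic> is not described in the present text. All sample names and values are hypothetical.</p>
<preformat>
```python
from statistics import median


def representative_sample(samples):
    """Pick the sample closest (squared Euclidean distance) to the per-gene median profile."""
    n_genes = len(next(iter(samples.values())))
    centre = [median(profile[g] for profile in samples.values()) for g in range(n_genes)]

    def distance(name):
        return sum((x - c) ** 2 for x, c in zip(samples[name], centre))

    # Sorting the names first makes tie-breaking deterministic.
    return min(sorted(samples), key=distance)


def balance_background(collection):
    """Keep one representative sample name per non-target cell type."""
    return {cell_type: representative_sample(s) for cell_type, s in collection.items()}


# Hypothetical expression profiles (three genes per sample).
collection = {
    "fibroblast": {"s1": [5.0, 2.0, 7.0], "s2": [5.2, 2.1, 7.1], "s3": [9.0, 6.0, 1.0]},
    "hepatocyte": {"s4": [1.0, 8.0, 3.0], "s5": [1.2, 8.2, 3.2], "s6": [1.1, 8.1, 3.1]},
}
balanced = balance_background(collection)
print(balanced)
```
</preformat>
<p>Each cell type then contributes a single profile to the non-target population, regardless of how many samples were deposited for it.</p>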
</sec>
</sec>
</sec>
<sec>
<label>3.</label>
<title>Identification of pre-candidate TFs</title>
<p>The four computational methods proposed used different approaches, which are conceptually summarized in <xref rid="f3-mmr-18-02-1225" ref-type="fig">Fig. 3</xref>. Theoretically, however, the identification of putative TFs may be obtained by identifying TFs whose expression is statistically different. Therefore, parametric, non-parametric or permutation tests may provide similar results (<xref rid="b32-mmr-18-02-1225" ref-type="bibr">32</xref>&#x2013;<xref rid="b34-mmr-18-02-1225" ref-type="bibr">34</xref>). Statistical tests provide a P-value that is useful, although it does not represent the magnitude of the difference between two average expression levels and is sensitive to the variance and number of samples (<xref rid="b35-mmr-18-02-1225" ref-type="bibr">35</xref>). Alternatively, these issues may be solved by using combinations of the P-value and fold-change, for example in Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>), where the score per TF is based on the absolute magnitude of the fold-change multiplied by the (negative) logarithm of the P-value. Nevertheless, certain of the methods reviewed demonstrate a preference for other strategies (<xref rid="tII-mmr-18-02-1225" ref-type="table">Table II</xref>). For example, D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) used Jensen-Shannon Divergence (JSD), which is a measure of the discrepancy between distributions. JSD was used to score differences between the observed TF expression profiles and idealized ones. These idealized profiles are formed by combining high expression in the corresponding target cell type and no expression in the remaining cell types.</p>
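<p>The two single-TF scores described above may be sketched as follows, under stated assumptions. The JSD is computed in base 2 between the normalized expression profile of a TF and an idealized profile with all expression in the target cell type; the combined score assumes a log2 fold-change and a base-10 logarithm of the P-value. The function names and numbers are hypothetical, and the exact normalizations used by D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) and Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>) may differ.</p>
<preformat>
```python
from math import log2, log10


def jsd(p, q):
    """Jensen-Shannon divergence (base 2, range 0..1) between two discrete distributions."""
    def kl(a, b):
        # Terms with a zero probability in `a` contribute nothing.
        return sum(x * log2(x / y) for x, y in zip(a, b) if x > 0)

    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def divergence_from_ideal(expression, target_index):
    """JSD between a TF's normalized profile and an idealized target-only profile."""
    total = sum(expression)
    observed = [x / total for x in expression]
    ideal = [1.0 if i == target_index else 0.0 for i in range(len(expression))]
    return jsd(observed, ideal)


def combined_score(log2_fold_change, p_value):
    """Absolute log2 fold-change weighted by -log10(P)."""
    return abs(log2_fold_change) * -log10(p_value)


# Hypothetical expression of two TFs across four cell types (target is index 1).
specific = divergence_from_ideal([0.0, 12.0, 0.0, 0.0], target_index=1)
ubiquitous = divergence_from_ideal([3.0, 3.0, 3.0, 3.0], target_index=1)
score = combined_score(3.0, 1e-4)
print(specific, ubiquitous, score)
```
</preformat>
<p>A divergence close to zero indicates a profile close to the idealized one, whereas a uniformly expressed TF yields a larger divergence.</p>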
<p>Instead of comparing one TF across cell types, Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) and Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) compared pairs of TFs (<xref rid="tII-mmr-18-02-1225" ref-type="table">Table II</xref> and <xref rid="f3-mmr-18-02-1225" ref-type="fig">Fig. 3</xref>). The comparison of pairs is based on the concept that balanced expression between two TFs is associated with cell identity (<xref rid="b36-mmr-18-02-1225" ref-type="bibr">36</xref>&#x2013;<xref rid="b38-mmr-18-02-1225" ref-type="bibr">38</xref>). Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) proposed the normalized ratio difference (NRD) to score all pairs of TFs that are similarly expressed in a progenitor cell type, and highly different in and between daughter cell types. Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) additionally compared pairs of TFs, using the metric of the context likelihood of relatedness (CLR). The CLR is a measure that favors TFs that are highly correlated (by mutual information) and whose correlations are within the top ranked to increase the probability of genuine associations (<xref rid="b39-mmr-18-02-1225" ref-type="bibr">39</xref>). Notably, while Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) favored pairs of TFs whose expression was different in daughter cells, Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) favored TFs that were co-expressed and whose expression levels were cell-type specific (as explained in more detail in the following section). 
These opposing views are associated with the input data: Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) used cell types that were extremely close in the lineage, whereas Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) used tissue types that are more distant (<xref rid="f2-mmr-18-02-1225" ref-type="fig">Fig. 2</xref>). By definition, these methods will generate much larger lists of pre-candidates compared with those comparing one TF at a time. For example, assuming that there are ~2,000 TFs, there would be 1,999,000 pairwise comparisons vs. 2,000 when only one TF is assessed at a time. Thus, these methods require extensive filtering.</p>
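<p>The count of pairwise comparisons follows from the number of unordered pairs, n(n-1)/2, and may be checked with a short calculation. The variable names are illustrative.</p>
<preformat>
```python
from math import comb

n_tfs = 2000
single_tests = n_tfs           # one comparison per TF
pair_tests = comb(n_tfs, 2)    # unordered pairs: n * (n - 1) / 2
print(single_tests, pair_tests)  # prints: 2000 1999000
```
</preformat>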
</sec>
<sec>
<label>4.</label>
<title>Filtering the pre-candidate TF list</title>
<p>The objective of this step is to further filter the pre-candidate list to end up with a short list of candidate TFs whose overexpression will likely control the desired cell state. This step is frequently the most complex and time-consuming; it depends on the length of the pre-candidate TF list and the rules defined in the filters. In assays of one TF, including in D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) and Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>), if 5&#x0025; of the 1,000 expressed TFs are differential, a list of ~50 TFs is expected. This estimate is not far from reality, supposing that few TFs control the cell state by means of regulating further TFs, which thus regulate the downstream effector genes. Furthermore, for methods that compare pairs of TFs, including in Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) and Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>), and even optimistically estimating that only 0.1&#x0025; of pairs are of interest, ~1,000 pairs of TFs would have to be analyzed from the ~1,000,000 TF pairs generated. Unless a shorter pre-candidate list is obtained, analyzing the TF list manually by reading scientific literature or browsing databases by hand may be arduous, prone to errors and time-consuming. Therefore, the filtering procedures proposed are focused on setting sensible rules that are approachable with current databases.</p>
<p>Thus far, the focus has been on differential TFs; however, other non-TF genes, which are involved in signal propagation or provide cell type-specific functions, also require consideration. Therefore, to completely explain the observations, the rules must be based on maximizing the control over all observed differentially expressed genes (DEGs), irrespective of the gene function (TF or not). Thus, the rules may be easily stated as &#x2018;show all TFs directly or indirectly controlling all DEGs.&#x2019; If all regulatory associations between TFs and other types of genes were known, this statement could be implemented more easily than the current methods. Nevertheless, the current databases are far from being complete, are context-specific (by culture or tissue) and are likely to include errors. Therefore, in the following paragraphs, how these rules were implemented in each method is explained and an overview is illustrated in <xref rid="f4-mmr-18-02-1225" ref-type="fig">Fig. 4</xref>. The majority of the methods make use of networks, databases and other tools to integrate information and connect the TFs with themselves and with other DEGs.</p>
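<p>Under the idealized assumption that a complete and correct directed regulatory network is available, the rule stated above reduces to a reachability query. The following sketch ranks TFs by the fraction of DEGs that they reach directly or indirectly; the network, gene names and function names are hypothetical, and none of the reviewed methods is claimed to be implemented in this way.</p>
<preformat>
```python
from collections import deque


def reachable(network, start):
    """Genes regulated directly or indirectly by `start` (breadth-first search)."""
    seen, queue = set(), deque([start])
    while queue:
        for target in network.get(queue.popleft(), ()):
            if target not in seen and target != start:
                seen.add(target)
                queue.append(target)
    return seen


def rank_by_deg_coverage(network, tfs, degs):
    """Rank TFs by the fraction of DEGs they reach (ties broken by name)."""
    coverage = {
        tf: len(reachable(network, tf).intersection(degs)) / len(degs) for tf in tfs
    }
    return sorted(coverage.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical directed edges: regulator -> regulated genes.
network = {"TF_A": ["TF_B", "g1"], "TF_B": ["g2", "g3"], "TF_C": ["g3"]}
degs = {"g1", "g2", "g3", "g4"}
ranking = rank_by_deg_coverage(network, ["TF_A", "TF_B", "TF_C"], degs)
print(ranking)
```
</preformat>
<p>In this toy network no TF reaches every DEG, which mirrors the incompleteness of current databases; in practice, the edges would additionally be weighted by confidence and by distance from the TF.</p>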
<p>In Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>), a genetic regulatory network was built upon the significantly correlated pairs of genes using the CLR. As these networks are frequently large, the InfoMap tool was used to split this large network into smaller, highly connected sub-networks (<xref rid="b40-mmr-18-02-1225" ref-type="bibr">40</xref>). Furthermore, each sub-network was evaluated using gene set enrichment analysis (GSEA) (<xref rid="b41-mmr-18-02-1225" ref-type="bibr">41</xref>). GSEA generates a score depending on the position of the genes in the sub-network relative to all genes. If the genes are randomly distributed, the GSEA score is low, whilst if the expression levels are more concentrated in closer positions, the GSEA score increases. If the GSEA score of a sub-network obtained from tissue A is higher compared with other tissues, this sub-network is defined as specific for tissue A. Subsequent to executing this procedure in all sub-networks present in all tissues, Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) ended up with ~76 tissue-specific sub-networks. Thus, on average, approximately five sub-networks were expected in each of the 15&#x2013;20 tissues or cell types. From this, the study aimed to identify which sub-networks and which genes within the sub-networks were more likely to be manipulated, starting from a source cell type. To evaluate the former, the expression of each gene within the target cell type sub-network was compared against that of the source cell type; if the expression levels were similar, no larger alterations were required, whilst if the expression levels were very different, the sub-network had to be re-established and was therefore a target for manipulation. To assess the genes within the sub-networks, a network influence score (NIS) was estimated. 
This NIS depends on the difference in TF expression between the source and the target, the differences in the expression of the predicted genes regulated by that TF, and the number of regulated genes. In brief, a large network was split into sub-networks, which were filtered for tissue specificity and then for differences in expression between the source and the target; finally, TFs were ranked within the resultant sub-networks. Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) demonstrated acceptable predictions in a number of conversion systems and suggested that cells obtained by direct conversion are less similar to the <italic>in vivo</italic> tissues than those derived from iPSCs.</p>
<p>Elsewhere, D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) used the JSD metric against an idealized profile to evaluate each TF between the target and non-target cell types in 233 cell types. This procedure yielded 503 TFs across these cell types. A total of ~60&#x0025; of the TFs were considered to be pre-candidates in fewer than four cell types, demonstrating that most were cell-type specific. For the experiments, the study of D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) focused on the top 10 TFs for induction. This approach was validated by comparing the predictions with well-known conversion systems, including iPSCs, neural precursor cells, cardiomyocytes, hepatocytes, motor neurons, pancreatic islet cells and melanocytes. Furthermore, D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) predicted and experimentally validated their approach in the conversion of fibroblasts to retinal pigment epithelial-like cells.</p>
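<p>As an illustration only, the idea of scoring a TF by its divergence from an idealized profile may be sketched as follows. This is not the implementation of D&#x0027;Alessio <italic>et al</italic>; the function names, the toy expression values and the choice of an ideal profile with all expression in the target cell type are assumptions made for this sketch. Lower divergence indicates a more target-specific TF.</p>
<preformat><![CDATA[
```python
import math


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence (log base 2) between two discrete distributions."""
    def kl(a, b):
        # Terms with a == 0 contribute nothing; b is the mixture, so it is
        # never 0 where a > 0.
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)

    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def specificity_divergence(expression, target_index):
    """Divergence between a TF's observed profile and an idealized profile.

    `expression` holds non-negative expression values of one TF across cell
    types; the idealized profile (an assumption of this sketch) puts all of
    the expression in the target cell type.
    """
    total = sum(expression)
    if total <= 0:
        raise ValueError("expression profile must contain a positive value")
    observed = [v / total for v in expression]
    ideal = [0.0] * len(expression)
    ideal[target_index] = 1.0
    return jensen_shannon_divergence(observed, ideal)


# Toy values (hypothetical): one TF restricted to the target, one ubiquitous.
specific = specificity_divergence([9.0, 0.5, 0.5], target_index=0)
ubiquitous = specificity_divergence([3.0, 3.0, 3.0], target_index=0)
```
]]></preformat>
<p>In this toy case the TF restricted to the target cell type obtains a smaller divergence than the ubiquitously expressed TF, which is the ordering the ranking relies upon.</p>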
<p>In Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>), the pre-candidate list of TFs was generated using a combination of a tissue-specific P-value and the magnitude of the difference. For the filtering, two additional network influence scores were used for each TF, which were estimated from MARA (<xref rid="b42-mmr-18-02-1225" ref-type="bibr">42</xref>) and STRING (<xref rid="b43-mmr-18-02-1225" ref-type="bibr">43</xref>) networks. These network scores depend on how many genes are connected to each TF, how distant the connection is (number of nodes), and the score of the regulated gene (P-value and magnitude). Subsequently, the ranks of these three scores were added and the sums were ranked to provide a final rank. The first filter retained only the TFs within the top 100 final ranks. The second filter removed the TFs that were already expressed in both the source and target cell types. The third filter removed redundant TFs that shared the majority of their targets with other TFs regulating more genes. A fourth filter retained only the top eight TFs. The approach was validated in at least five systems, involving conversions from fibroblasts to iPSCs, myoblasts, hepatocytes and cardiac cells, and from B cells to macrophages. Finally, two novel conversions were predicted and tested experimentally, converting fibroblasts to keratinocytes and keratinocytes to microvascular endothelial cells.</p>
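<p>The rank-combination step described above may be sketched as follows. This is a minimal sketch and not the Mogrify code: the three score tables, the TF names and the helper functions are hypothetical, and the redundancy filter is omitted because it requires the regulatory network itself.</p>
<preformat><![CDATA[
```python
def rank_descending(scores):
    """Map each TF to its rank, where rank 1 is the highest score."""
    ordered = sorted(scores, key=lambda tf: (-scores[tf], tf))
    return {tf: position + 1 for position, tf in enumerate(ordered)}


def combine_ranks(score_tables, top_n=100):
    """Add the per-score ranks of each TF and return the best `top_n` TFs."""
    ranks = [rank_descending(table) for table in score_tables]
    tfs = set(score_tables[0])
    rank_sum = {tf: sum(r[tf] for r in ranks) for tf in tfs}
    # Smaller rank sums are better; ties are broken alphabetically.
    return sorted(tfs, key=lambda tf: (rank_sum[tf], tf))[:top_n]


def drop_expressed(ranked_tfs, already_expressed):
    """Remove TFs flagged as already expressed, keeping the rank order."""
    return [tf for tf in ranked_tfs if tf not in already_expressed]


# Hypothetical scores: differential expression and two network-based scores.
differential = {"TF_A": 5.0, "TF_B": 3.0, "TF_C": 1.0}
network_one = {"TF_A": 2.0, "TF_B": 9.0, "TF_C": 1.0}
network_two = {"TF_A": 7.0, "TF_B": 6.0, "TF_C": 0.0}

final_rank = combine_ranks([differential, network_one, network_two], top_n=2)
candidates = drop_expressed(final_rank, already_expressed={"TF_B"})
```
]]></preformat>
<p>Adding ranks rather than raw scores keeps the three measures comparable even though they are expressed on different scales.</p>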
<p>Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) used the NRD metric to evaluate and select pairs of TFs. Subsequently, the MetaCore network database (<xref rid="b44-mmr-18-02-1225" ref-type="bibr">44</xref>) was used first to retain only the TFs with more than seven connections. This was based on the observation that important TFs are highly connected in MetaCore. The next filter removed unnecessary nodes of the network, based on the assumption that a cell type may be stabilized by a gene regulatory network that was additionally stable in the two daughter cell types. For this, the study re-implemented a sub-network-finding optimization algorithm combined with Boolean networks (<xref rid="b45-mmr-18-02-1225" ref-type="bibr">45</xref>). Boolean network modeling is able to identify attractor states (<xref rid="b46-mmr-18-02-1225" ref-type="bibr">46</xref>). These attractors were interpreted in biological cells as stable states that may be compared with the states of the daughter cell types in order to select the matching sub-networks. Subsequent to running the algorithm numerous times, the following filter looked for all sub-network solutions that contained at least one upregulated TF. Subsequently, the sub-networks were ranked based on the number of NRD pairs present, the number of NRD pairs directly connected and fewer regulatory connections. This approach was validated in five stem cell systems, including mouse embryonic stem cells, mouse and human hematopoietic stem cells, mouse neural stem cells and mouse mesenchymal stem cells. Furthermore, the induction of neuronal and astrocyte differentiation was predicted and experimentally confirmed in a mouse neuronal stem cell system.</p>
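<p>To make the notion of attractor states concrete, a toy Boolean network may be explored exhaustively as sketched below. This is not the algorithm of Okawa <italic>et al</italic>; the two-gene mutual-inhibition network, the synchronous update scheme and all names are assumptions chosen for brevity.</p>
<preformat><![CDATA[
```python
from itertools import product


def find_attractors(rules, genes):
    """Enumerate the attractors of a Boolean network under synchronous update.

    `rules` maps each gene to a function that takes the current state (a dict
    of gene -> bool) and returns the next value of that gene. Each attractor
    is returned as a tuple of states, rotated so its smallest state is first.
    """
    def step(state):
        current = dict(zip(genes, state))
        return tuple(rules[gene](current) for gene in genes)

    attractors = set()
    for start in product((False, True), repeat=len(genes)):
        seen = []
        state = start
        while state not in seen:  # follow the trajectory until it repeats
            seen.append(state)
            state = step(state)
        cycle = seen[seen.index(state):]
        first = cycle.index(min(cycle))
        attractors.add(tuple(cycle[first:] + cycle[:first]))
    return attractors


# Toy network (hypothetical): two TFs that repress each other.
toy_rules = {
    "TF_A": lambda s: not s["TF_B"],
    "TF_B": lambda s: not s["TF_A"],
}
toy_attractors = find_attractors(toy_rules, ["TF_A", "TF_B"])
fixed_points = {a[0] for a in toy_attractors if len(a) == 1}
```
]]></preformat>
<p>The two fixed points of this toy network (one TF on, the other off) are analogous to two stable daughter states; the remaining attractor is an oscillation produced by the synchronous update.</p>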
</sec>
<sec>
<label>5.</label>
<title>Finding key TFs in practice</title>
<p>In this section, the focus is on how to estimate the key TFs for the target cell type of interest in an easy and practical way, while commenting on each approach. Ideally, the prediction would indicate how to manipulate a source cell type to achieve a target cell type. However, the majority of methods are restricted to specific sources, targets, or both. A summary is provided in <xref rid="tIII-mmr-18-02-1225" ref-type="table">Table III</xref>, and details are provided in the following paragraphs. It was assumed that estimates of gene expression values and their annotation for the target cell type were available, either from microarrays or from RNA-Seq. An overview of the available tools is provided, followed by a practical example.</p>
</sec>
<sec>
<title>Overview of available tools</title>
<p>Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) provided a web interface (<uri xlink:href="http://cellnet.hms.harvard.edu">cellnet.hms.harvard.edu</uri>) and an R package (pcahan1.github.io/cellnetr) termed CellNet, which accept gene expression data of the source cell type or of already manipulated cell types. The output is composed of three main sections. The first is a classification of the input samples into the cell types used in CellNet. The second shows how well each cell type-specific genetic regulatory network is established across the input samples, which helps to identify the networks that need to be manipulated to achieve a cell type. The third shows the TFs with the largest differences within networks, indicating which TFs require manipulation. For the web version, only Affymetrix (Thermo Fisher Scientific, Inc., Waltham, MA, USA) microarrays may be used. For the R package version, Illumina (Illumina, Inc., San Diego, CA, USA) microarray data may additionally be used.</p>
<p>For D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>), if the target cell type is already in the list of the 233 cell types processed [available in the supplementary information of the study (Table SI)], the top-ranked TFs listed (~10) may be used. If the cell type is not listed, and to avoid reconstructing the entire study, a JSD or JSD-like value for the target cell type may be estimated with spreadsheet software using the predictions available for the 233 cell types. First, the TF expression of the target cell type is required to provide a rank of expression. Second, for each TF, the number of times this TF appears in the top 10 of the other cell types is counted. Third, for each TF, the minimum rank of the TF across all other cell types is obtained. Fourth, scatter plots of the target rank of the TFs against the values from the second and third steps are displayed. Fifth, the TFs that are top ranked in the target and have low counts in the first scatter plot and/or are top ranked in the target and have higher ranks in the second scatter plot are selected. These steps provide an easy approximation of the process followed by D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) rather than an accurate calculation, although they may be used as an easy starting point.</p>
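<p>The first three spreadsheet steps, together with a simple selection rule standing in for the visual inspection of the scatter plots, may be sketched in code as follows. The input structures, the cell type and TF names, and the thresholds in <monospace>select_candidates</monospace> are hypothetical; they are not taken from the supplementary information of D&#x0027;Alessio <italic>et al</italic>.</p>
<preformat><![CDATA[
```python
def summarize_tfs(target_expression, ranks_in_other_cell_types, top_k=10):
    """Return one row per TF: (TF, target rank, top-k count, minimum rank).

    `target_expression` maps TF -> expression in the target cell type.
    `ranks_in_other_cell_types` maps cell type -> {TF: rank} as published
    for the already processed cell types (assumed input format).
    """
    ordered = sorted(target_expression,
                     key=lambda tf: (-target_expression[tf], tf))
    rows = []
    for position, tf in enumerate(ordered):
        elsewhere = [ranks[tf]
                     for ranks in ranks_in_other_cell_types.values()
                     if tf in ranks]
        count_top = sum(1 for rank in elsewhere if rank <= top_k)
        min_rank = min(elsewhere) if elsewhere else None
        rows.append((tf, position + 1, count_top, min_rank))
    return rows


def select_candidates(rows, max_target_rank=2, max_count=0):
    """Keep TFs ranked high in the target and rarely top-ranked elsewhere."""
    return [tf for tf, target_rank, count_top, _ in rows
            if target_rank <= max_target_rank and count_top <= max_count]


# Hypothetical toy data.
target = {"TF_1": 0.9, "TF_2": 0.8, "TF_3": 0.1}
others = {
    "liver": {"TF_1": 50, "TF_2": 3, "TF_3": 1},
    "lung": {"TF_1": 200, "TF_2": 8, "TF_3": 40},
}
table = summarize_tfs(target, others)
chosen = select_candidates(table)
```
]]></preformat>
<p>In practice the thresholds would be chosen by inspecting the scatter plots described in the fourth and fifth steps.</p>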
<p>For Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>), a web interface is available (<uri xlink:href="http://www.mogrify.net">www.mogrify.net</uri>) in which the source and target cell types are selected from those already considered. The top eight ranked TFs are returned in a few seconds. Unfortunately, to estimate the possible TFs for a non-listed cell type, it is necessary to reconstruct the study of Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>) since no datasets are provided.</p>
<p>Notably, for Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>), neither an implementation nor supplementary information was available. Thus, it is necessary to reconstruct the study to make predictions using this method. The MetaCore regulatory network used is not currently available; thus, a different network database or another estimation method may be used, and the results may therefore differ.</p>
</sec>
<sec>
<label>6.</label>
<title>Validation of target cell state TFs via different approaches in a well-known system</title>
<p>To demonstrate that this simplified view was able to generate sensible TFs, the first two concepts were applied in a well-known system, transdifferentiation towards cardiomyocytes (CM).</p>
</sec>
<sec>
<title>Target and non-target datasets</title>
<p>The target CM data were obtained from the GEO/NCBI, with accession no. GSE45878, for the 62 samples annotated as &#x2018;Heart.&#x2019; The dataset consists of 837 samples from diverse tissues. The non-target dataset was obtained from the remaining 775 samples and the number of probes was 22,704.</p>
</sec>
<sec>
<title>Data pre-processing</title>
<p>The two datasets were quantile normalized and scaled to a uniform distribution between 0 and 1, representing no expression and maximum expression. To recognize TFs, the words &#x2018;transcription&#x2019; and &#x2018;factor&#x2019; were searched for in the annotated description. Additionally, AnimalTFDB was used for the TF annotation (<xref rid="b47-mmr-18-02-1225" ref-type="bibr">47</xref>). Thus, 1,392 TFs were considered. The target and the non-target datasets, in addition to tissue annotation, are available at bioinformatica.mty.itesm.mx/CEC-TF-Example.</p>
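<p>A minimal sketch of this pre-processing is given below. It is not the script distributed with this review: the function names are hypothetical, ties are broken arbitrarily in the quantile normalization, and the scaling to the 0&#x2013;1 range is implemented here as a simple minimum-maximum rescaling, which is an assumption.</p>
<preformat><![CDATA[
```python
def quantile_normalize(samples):
    """Give every sample the same distribution (mean of the sorted samples).

    `samples` is a list of equally long lists, one list per sample.
    """
    n_genes = len(samples[0])
    sorted_samples = [sorted(sample) for sample in samples]
    reference = [sum(s[i] for s in sorted_samples) / len(samples)
                 for i in range(n_genes)]
    normalized = []
    for sample in samples:
        order = sorted(range(n_genes), key=lambda i: sample[i])
        values = [0.0] * n_genes
        for rank, gene_index in enumerate(order):
            values[gene_index] = reference[rank]
        normalized.append(values)
    return normalized


def scale_to_unit_interval(samples):
    """Rescale all values so that the minimum is 0 and the maximum is 1."""
    low = min(min(sample) for sample in samples)
    high = max(max(sample) for sample in samples)
    if high == low:
        raise ValueError("cannot rescale constant data")
    return [[(v - low) / (high - low) for v in sample] for sample in samples]


def looks_like_tf(description):
    """Keyword match used to flag TFs from an annotated description."""
    text = description.lower()
    return "transcription" in text and "factor" in text


# Toy data (hypothetical): two samples, three genes.
normalized = quantile_normalize([[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]])
scaled = scale_to_unit_interval(normalized)
```
]]></preformat>
<p>A keyword match of this kind may miss TFs whose description lacks either word, which is one reason to complement it with a curated resource such as AnimalTFDB.</p>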
</sec>
<sec>
<title>Score implementations</title>
<p>A total of five scores were used, two taken from basic concepts of differential expression, and three inspired by the scores used by the methods reviewed here. &#x2018;<italic>Delta</italic>&#x2019; is the difference in mean expression values between the target and non-target cell types. &#x2018;t-test&#x2019; is the P-value of the unequal variance t-test applied to target and non-target cell types. &#x2018;<italic>Rackham</italic>&#x2019; is -log<sub>10</sub>(P<sub>t-test</sub>) &#x00D7; |<italic>Delta</italic>|, as in Rackham <italic>et al</italic> (<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>). &#x2018;<italic>D&#x0027;Alessio</italic>&#x2019; is the sum of 100 JSD scores between the observed and the ideal profile, where each observed profile was built from the average target expression together with k=3 random samples from non-targets (increasing values of k did not increase similarity to other scores). This process was similar, although not identical, to that implemented in D&#x0027;Alessio <italic>et al</italic> (<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>) (details of the algorithm in the supplementary information were not clear). &#x2018;<italic>Okawa</italic>&#x2019; was an adaptation of the Okawa <italic>et al</italic> (<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>) metric to cell types different from progenitor-daughter. It was estimated as (TF<sub>Ti</sub>-TF<sub>Tk</sub>)-(TF<sub>Ni</sub>-TF<sub>Nk</sub>), where the T and N sub-indexes refer to the mean expression values of the target and non-target cell types, respectively, i is a particular TF, and k runs over all TFs. This metric generated very similar results to the NRD (which involves ratios that are more unstable, although the script provided includes the NRD estimation). To generate a single score per TF, the number of times a TF was included in the top 1&#x0025; of pair differences was counted. 
The score of Cahan <italic>et al</italic> (<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>) was not implemented since its tissue specificity is obtained only after extensive network operations (the scripts and data are available at bioinformatica.mty.itesm.mx/CEC-TF-Example).</p>
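<p>Three of these scores may be sketched as follows; this is an illustrative re-statement and not the R scripts distributed with this review. The function names and toy values are hypothetical, the P-value is taken as an input (for example from an unequal variance t-test computed elsewhere), and ranking the pair differences by absolute value is an assumption of this sketch.</p>
<preformat><![CDATA[
```python
import math
from itertools import combinations
from statistics import mean


def delta(target_values, nontarget_values):
    """Difference in mean expression between target and non-target samples."""
    return mean(target_values) - mean(nontarget_values)


def rackham_like(p_value, delta_value):
    """-log10(P) x |Delta|; the P-value is supplied by the caller."""
    return -math.log10(p_value) * abs(delta_value)


def okawa_like_counts(target_mean, nontarget_mean, top_fraction=0.01):
    """Count how often each TF occurs among the largest pair differences.

    The pair difference is (T_i - T_k) - (N_i - N_k) for TFs i and k.
    """
    tfs = sorted(target_mean)
    pairs = []
    for i, k in combinations(tfs, 2):
        difference = ((target_mean[i] - target_mean[k])
                      - (nontarget_mean[i] - nontarget_mean[k]))
        pairs.append((abs(difference), i, k))
    pairs.sort(reverse=True)
    n_top = max(1, int(len(pairs) * top_fraction))
    counts = {tf: 0 for tf in tfs}
    for _, i, k in pairs[:n_top]:
        counts[i] += 1
        counts[k] += 1
    return counts


# Toy mean expression values (hypothetical).
t_mean = {"TF_A": 0.9, "TF_B": 0.5, "TF_C": 0.5, "TF_D": 0.1}
n_mean = {"TF_A": 0.1, "TF_B": 0.5, "TF_C": 0.5, "TF_D": 0.9}
counts = okawa_like_counts(t_mean, n_mean, top_fraction=0.2)
```
]]></preformat>
<p>Rearranging the pair difference gives (TF<sub>Ti</sub>-TF<sub>Ni</sub>)-(TF<sub>Tk</sub>-TF<sub>Nk</sub>), that is, the difference between the <italic>Delta</italic> values of the two TFs. In the toy data, the strongly overexpressed and the strongly underexpressed TF are therefore both counted.</p>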
</sec>
<sec>
<title>Summary of the results</title>
<p><xref rid="tIV-mmr-18-02-1225" ref-type="table">Table IV</xref> illustrates the results of the top 20 genes generated by the five scoring methods. The table summarizes the most frequently mentioned TFs and previously reported experimental findings. All of the top seven TFs listed have already been used experimentally for the conversion of different cell types to cardiomyocytes, including T-box 20 (<xref rid="b48-mmr-18-02-1225" ref-type="bibr">48</xref>,<xref rid="b49-mmr-18-02-1225" ref-type="bibr">49</xref>), <italic>GATA4, TBX5</italic> (<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>&#x2013;<xref rid="b52-mmr-18-02-1225" ref-type="bibr">52</xref>), NK2 homeobox 5, and heart and neural crest derivatives expressed 2 (<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>,<xref rid="b52-mmr-18-02-1225" ref-type="bibr">52</xref>). However, a widely used TF in this conversion, <italic>MEF2C</italic> (<xref rid="b17-mmr-18-02-1225" ref-type="bibr">17</xref>,<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>,<xref rid="b51-mmr-18-02-1225" ref-type="bibr">51</xref>,<xref rid="b53-mmr-18-02-1225" ref-type="bibr">53</xref>), was not present in the list. On inspection, this gene was not annotated as a TF in the databases used. Even when <italic>MEF2C</italic> was added as a TF, it was not included in the top 20 of any scoring method. This TF appears to be important, since removing it from the overexpressed factors did not generate cells expressing important cardiac markers (<xref rid="b17-mmr-18-02-1225" ref-type="bibr">17</xref>). A recent meta-analysis specific for CM differentiation did not identify <italic>MEF2C</italic>, although it did identify a gene of the same family, <italic>MEF2A</italic> (<xref rid="b54-mmr-18-02-1225" ref-type="bibr">54</xref>). Although this result may give some clues regarding <italic>MEF2C</italic>, it is difficult to conclude the extent of its importance from these data alone. 
On the other hand, this example demonstrates that the majority of TFs may be obtained via straightforward application of simple concepts, as depicted in detail for the top 20 TFs identified in <xref rid="tIV-mmr-18-02-1225" ref-type="table">Table IV</xref> (<xref rid="b55-mmr-18-02-1225" ref-type="bibr">55</xref>&#x2013;<xref rid="b61-mmr-18-02-1225" ref-type="bibr">61</xref>), but also highlights that it is possible that not all of the required factors are obtained with the current methods.</p>
</sec>
<sec>
<label>7.</label>
<title>Estimation of target cell state TFs via different approaches in a novel system</title>
<p>To provide a practical and easily reproducible example, and a comparison of different approaches as a starting point, corneal endothelial cells (CEC) were used as a target cell type. The tools and data available (Mogrify, CellNet and the D&#x0027;Alessio <italic>et al</italic> supplementary information) did not include CEC and therefore were not used. As the datasets represented in these tools are limited, this example represents a likely scenario for specific cell types. Re-implemented scores inspired by the reviewed methods are compared, together with the resulting pre-candidate lists. This demonstrates that the first two steps are highly useful and relatively easy to implement. The data were processed in R (<uri xlink:href="http://cran.r-project.org">cran.r-project.org</uri>). The scripts and the data required to reproduce the results are available at bioinformatica.mty.itesm.mx/CEC-TF-Example.</p>
<sec>
<title/>
<sec>
<title>Target and non-target datasets</title>
<p>The target CEC data were obtained from GEO/NCBI with accession no. GSE58315 (<xref rid="b62-mmr-18-02-1225" ref-type="bibr">62</xref>). The dataset consisted of 11 corneal endothelial cell samples from adults, adolescents and preschoolers. The non-target dataset was obtained from a preliminary study on gene co-expression networks (<xref rid="b63-mmr-18-02-1225" ref-type="bibr">63</xref>). This dataset consisted of 445 samples representing &#x003E;136 tissues from the two most popular Affymetrix platforms (HG-U133) extracted from the GEO/NCBI. The number of probes was &#x003E;50,000; however, due to the different versions of Affymetrix microarrays, certain samples provided data for only 22,000 probes.</p>
</sec>
<sec>
<title>Data pre-processing</title>
<p>The two datasets were quantile normalized and scaled to a uniform distribution between 0 and 1, representing no expression and maximum expression. For the non-target dataset, the JetSet package was used to identify a representative probe for each gene (<xref rid="b64-mmr-18-02-1225" ref-type="bibr">64</xref>). For the target dataset, when a gene was represented by several probes, the probe with the highest standard deviation was selected. To recognize TFs, the Affymetrix annotation of the platform GPL570 was used to look for &#x2018;transcription&#x2019; and &#x2018;factor&#x2019; in the annotated description. Additionally, the TFs annotated in AnimalTFDB were used (<xref rid="b47-mmr-18-02-1225" ref-type="bibr">47</xref>). Thus, 1,478 TFs were considered. Only the 16,098 genes matching in the two datasets (by gene symbol) were used, of which 1,408 were annotated as TFs. The target and the non-target datasets along with tissue annotation are available at bioinformatica.mty.itesm.mx/CEC-TF-Example.</p>
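<p>The selection of one probe per gene by highest standard deviation, followed by matching of the two datasets by gene symbol, may be sketched as follows. The probe identifiers, gene symbols and function names are hypothetical, and the population standard deviation is used for simplicity.</p>
<preformat><![CDATA[
```python
from statistics import pstdev


def select_probe_per_gene(probe_values, probe_to_gene):
    """For each gene, keep the probe whose values vary the most.

    `probe_values` maps probe id -> list of expression values across samples;
    `probe_to_gene` maps probe id -> gene symbol.
    """
    best = {}
    for probe, values in probe_values.items():
        gene = probe_to_gene[probe]
        spread = pstdev(values)
        if gene not in best or spread > best[gene][1]:
            best[gene] = (probe, spread)
    return {gene: probe for gene, (probe, _) in best.items()}


def shared_genes(target_genes, nontarget_genes):
    """Gene symbols present in both datasets, in sorted order."""
    return sorted(set(target_genes).intersection(nontarget_genes))


# Toy data (hypothetical): two probes for GENE_X, one probe for GENE_Y.
values = {
    "probe_1": [1.0, 1.0, 1.0],
    "probe_2": [0.0, 5.0, 10.0],
    "probe_3": [2.0, 3.0, 4.0],
}
annotation = {"probe_1": "GENE_X", "probe_2": "GENE_X", "probe_3": "GENE_Y"}
selected = select_probe_per_gene(values, annotation)
common = shared_genes(selected, ["GENE_Y", "GENE_Z"])
```
]]></preformat>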
</sec>
<sec>
<title>Score implementations</title>
<p>A total of five scores were used, as described in the aforementioned cardiomyocyte analysis.</p>
</sec>
<sec>
<title>Comparison of resultant scores</title>
<p>It was investigated whether the re-implemented scores were similar to each other. <xref rid="f5-mmr-18-02-1225" ref-type="fig">Fig. 5A</xref> illustrates the results for the 1,408 annotated TFs. It is clear that <italic>Delta</italic>, a measure of differences in the averages between target and non-target expression, correlated with all other scores. <italic>Rackham</italic>, as expected, was associated with t-test (<italic>Delta</italic> and t-test are part of its calculation). <italic>D&#x0027;Alessio</italic> was negatively correlated with <italic>Delta</italic>, although it was highly variable (lower <italic>D&#x0027;Alessio</italic> scores tended to correspond to high <italic>Delta</italic> scores). The <italic>Okawa</italic> score seemed to be a proxy of <italic>Delta</italic> irrespective of the sign. Overall, these results suggested that the scores are associated with differential expression, supporting the summarized view.</p>
</sec>
<sec>
<title>Comparison of the generated TF list of pre-candidates</title>
<p>To provide an overview of the top selected genes per score, the identity of the top 20 TFs per score was examined (<xref rid="f5-mmr-18-02-1225" ref-type="fig">Fig. 5B</xref>). It is clear that, apart from <italic>D&#x0027;Alessio</italic>, the majority of the genes were frequently in the top TFs, irrespective of the score. In <italic>Delta</italic> and t-test, there was no selection for overexpressed genes and therefore some underexpressed TFs appeared, including meis homeobox 2 (<italic>MEIS2</italic>) and zinc finger protein 208. Similarly, in <italic>Okawa</italic>, the metric implemented did not favor overexpression in the target and certain selected genes were underexpressed, including interferon regulatory factor 8 and <italic>MEIS2</italic> (the available script is commented so that this can be altered easily). The lack of similarity of the <italic>D&#x0027;Alessio</italic> TFs (2 out of 20) likely reflected an inadequate re-implementation or insufficient detail in the original description for reproduction.</p>
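<p>Counting how often each TF appears among the top-ranked TFs of the different scores may be sketched as follows. The score names, TF names and values are hypothetical, and each score is assumed to be oriented so that larger values are better (for example, a P-value would first be transformed to -log<sub>10</sub>P).</p>
<preformat><![CDATA[
```python
from collections import Counter


def top_tfs(scores, n=20):
    """Return the n TFs with the largest scores (ties broken by name)."""
    return sorted(scores, key=lambda tf: (-scores[tf], tf))[:n]


def count_mentions(score_tables, n=20):
    """Count in how many top-n lists each TF appears.

    `score_tables` maps a score name to a dict of TF -> score.
    """
    mentions = Counter()
    for scores in score_tables.values():
        mentions.update(top_tfs(scores, n))
    return mentions


# Toy data (hypothetical): three scores, top 2 TFs per score.
tables = {
    "delta": {"TF_A": 0.8, "TF_B": 0.6, "TF_C": 0.1},
    "rackham": {"TF_A": 3.2, "TF_B": 0.4, "TF_C": 2.9},
    "okawa": {"TF_A": 12, "TF_B": 7, "TF_C": 3},
}
mentions = count_mentions(tables, n=2)
most_frequent = [tf for tf, _ in mentions.most_common(1)]
```
]]></preformat>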
</sec>
<sec>
<title>Specificity of TF expression</title>
<p><xref rid="f5-mmr-18-02-1225" ref-type="fig">Fig. 5C</xref> demonstrates the expression of the 20 most frequent TFs, as listed in the column <italic>Mentions</italic> in <xref rid="f5-mmr-18-02-1225" ref-type="fig">Fig. 5B</xref>. It is clear that the expression of all TFs was high in CEC. Only subsets of these genes, however, exhibited high expression in other cell types. This result suggested that the combined profile is highly specific for CEC. Lim homeobox transcription factor 1&#x03B2;, for instance, is essential for the correct development of the cornea and other eye structures in mice (<xref rid="b65-mmr-18-02-1225" ref-type="bibr">65</xref>), POU class 6 homeobox 2 is required for retinal regeneration in zebrafish (<xref rid="b66-mmr-18-02-1225" ref-type="bibr">66</xref>), transcription factor AP-2&#x03B2; has been demonstrated to control differentiated CEC markers (<xref rid="b67-mmr-18-02-1225" ref-type="bibr">67</xref>), TSC22 domain family member 1 is downregulated in dry eye syndrome (<xref rid="b68-mmr-18-02-1225" ref-type="bibr">68</xref>), and GLIS family zinc finger 3 has been associated with glaucoma (<xref rid="b69-mmr-18-02-1225" ref-type="bibr">69</xref>). This small literature analysis suggests that the observed list of TFs is important in CEC. To select more specific TFs, however, it is necessary to perform a network analysis (summarized in <xref rid="f4-mmr-18-02-1225" ref-type="fig">Fig. 4</xref>), a literature review, a comparison of this profile with the source cell type, and an analysis of the gene expression levels of these TFs and other differentially expressed genes (non-TF genes).</p>
</sec>
</sec>
</sec>
<sec sec-type="conclusions">
<label>8.</label>
<title>Conclusions</title>
<p>In conclusion, there is a need for specific cell types in regenerative medicine and biological research. An interesting proposal is the direct conversion of easy-to-obtain cells, which requires a specific cocktail of TFs to induce alterations in the cell state. Despite the complexity of the computational methods proposed for this task, it was demonstrated that the strategies to identify the TFs involved in the molecular state maintenance of a cell type are relatively simple: i) Define cell populations representing diverse cell types; ii) identify differences in TF expression; and iii) apply rules to remove unlikely TFs. The present review found that the principal complexity in the computational methods lies in the third of these points. It was demonstrated in a well-known cardiomyocyte example and a novel corneal endothelial cell example that applying the first two easy-to-implement ideas is likely to provide useful results, which may provide important insights and a starting point for laboratory assays. The present review may additionally inspire novel computational methods to identify TFs associated with cell identity and direct cell conversions.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding</title>
<p>The present study was funded by CONACyT Ciencia B&#x00E1;sica (grant no. 255747) and the Grupos de Investigaci&#x00F3;n con Enfoque Estrat&#x00E9;gico en Bioinform&#x00E1;tica para el Diagn&#x00F3;stico Cl&#x00ED;nico from Tecnol&#x00F3;gico de Monterrey including a scholarship for GIGR and sponsorship of JFIC.</p>
</sec>
<sec>
<title>Availability of data and materials</title>
<p>The datasets generated and/or analyzed during the current study are available from: <uri xlink:href="http://bioinformatica.mty.itesm.mx/CEC-TF-Example">http://bioinformatica.mty.itesm.mx/CEC-TF-Example</uri>.</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>GIGR, CMVC, JFIC and VT made analyses of particular methods and participated in the overall conceptualization. GIGR selected and preprocessed the gene expression omnibus data. GIGR, CMVC and JFIC drafted the initial manuscript. VT conceptualized and supervised the study, wrote the R scripts and performed the computational analyses. GIGR and VT participated in writing the final manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="b1-mmr-18-02-1225"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Meguid</surname><given-names>Abdel E</given-names></name><name><surname>Ke</surname><given-names>Y</given-names></name><name><surname>Ji</surname><given-names>J</given-names></name><name><surname>El-Hashash</surname><given-names>AHK</given-names></name></person-group><article-title>Stem cells applications in bone and tooth repair and regeneration: New insights, tools and hopes</article-title><source>J Cell Physiol</source><volume>233</volume><fpage>1825</fpage><lpage>1835</lpage><year>2018</year><pub-id pub-id-type="doi">10.1002/jcp.25940</pub-id><pub-id pub-id-type="pmid">28369866</pub-id></element-citation></ref>
<ref id="b2-mmr-18-02-1225"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tabar</surname><given-names>V</given-names></name><name><surname>Studer</surname><given-names>L</given-names></name></person-group><article-title>Pluripotent stem cells in regenerative medicine: Challenges and recent progress</article-title><source>Nat Rev Genet</source><volume>15</volume><fpage>82</fpage><lpage>92</lpage><year>2014</year><pub-id pub-id-type="doi">10.1038/nrg3563</pub-id><pub-id pub-id-type="pmid">24434846</pub-id><pub-id pub-id-type="pmcid">4539940</pub-id></element-citation></ref>
<ref id="b3-mmr-18-02-1225"><label>3</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Valdez-Garcia</surname><given-names>JE</given-names></name><name><surname>Zavala</surname><given-names>J</given-names></name><name><surname>Trevino</surname><given-names>V</given-names></name></person-group><chapter-title>Current state and future perspectives in corneal endothelium differentiation</chapter-title><source>Frontiers in Stem Cell and Regenerative Medicine Research</source><person-group person-group-type="editor"><name><surname>Atta-ur-Rahman</surname></name><name><surname>Anjum</surname><given-names>S</given-names></name></person-group><publisher-loc>Bentham</publisher-loc><year>2017</year><pub-id pub-id-type="doi">10.2174/9781681084756117050007</pub-id></element-citation></ref>
<ref id="b4-mmr-18-02-1225"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mora</surname><given-names>C</given-names></name><name><surname>Serzanti</surname><given-names>M</given-names></name><name><surname>Consiglio</surname><given-names>A</given-names></name><name><surname>Memo</surname><given-names>M</given-names></name><name><surname>Dell&#x0027;Era</surname><given-names>P</given-names></name></person-group><article-title>Clinical potentials of human pluripotent stem cells</article-title><source>Cell Biol Toxicol</source><volume>33</volume><fpage>351</fpage><lpage>360</lpage><year>2017</year><pub-id pub-id-type="doi">10.1007/s10565-017-9384-y</pub-id><pub-id pub-id-type="pmid">28176010</pub-id></element-citation></ref>
<ref id="b5-mmr-18-02-1225"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>L&#x00F3;pez-Gonz&#x00E1;lez</surname><given-names>R</given-names></name><name><surname>Velasco</surname><given-names>I</given-names></name></person-group><article-title>Therapeutic potential of motor neurons differentiated from embryonic stem cells and induced pluripotent stem cells</article-title><source>Arch Med Res</source><volume>43</volume><fpage>1</fpage><lpage>10</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.arcmed.2012.01.007</pub-id><pub-id pub-id-type="pmid">22293229</pub-id></element-citation></ref>
<ref id="b6-mmr-18-02-1225"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lizio</surname><given-names>M</given-names></name><name><surname>Harshbarger</surname><given-names>J</given-names></name><name><surname>Shimoji</surname><given-names>H</given-names></name><name><surname>Severin</surname><given-names>J</given-names></name><name><surname>Kasukawa</surname><given-names>T</given-names></name><name><surname>Sahin</surname><given-names>S</given-names></name><name><surname>Abugessaisa</surname><given-names>I</given-names></name><name><surname>Fukuda</surname><given-names>S</given-names></name><name><surname>Hori</surname><given-names>F</given-names></name><name><surname>Ishikawa-Kato</surname><given-names>S</given-names></name><etal/></person-group><article-title>Gateways to the FANTOM5 promoter level mammalian expression atlas</article-title><source>Genome Biol</source><volume>16</volume><fpage>22</fpage><year>2015</year><pub-id pub-id-type="doi">10.1186/s13059-014-0560-6</pub-id><pub-id pub-id-type="pmid">25723102</pub-id><pub-id pub-id-type="pmcid">4310165</pub-id></element-citation></ref>
<ref id="b7-mmr-18-02-1225"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>M</given-names></name><name><surname>Belmonte</surname><given-names>JC</given-names></name></person-group><article-title>Ground rules of the pluripotency gene regulatory network</article-title><source>Nat Rev Genet</source><volume>18</volume><fpage>180</fpage><lpage>191</lpage><year>2017</year><pub-id pub-id-type="doi">10.1038/nrg.2016.156</pub-id><pub-id pub-id-type="pmid">28045100</pub-id></element-citation></ref>
<ref id="b8-mmr-18-02-1225"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Takahashi</surname><given-names>K</given-names></name><name><surname>Yamanaka</surname><given-names>S</given-names></name></person-group><article-title>Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors</article-title><source>Cell</source><volume>126</volume><fpage>663</fpage><lpage>676</lpage><year>2006</year><pub-id pub-id-type="doi">10.1016/j.cell.2006.07.024</pub-id><pub-id pub-id-type="pmid">16904174</pub-id></element-citation></ref>
<ref id="b9-mmr-18-02-1225"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>M</given-names></name><name><surname>Liu</surname><given-names>G</given-names></name><name><surname>Belmonte</surname><given-names>Izpisua JC</given-names></name></person-group><article-title>Navigating the epigenetic landscape of pluripotent stem cells</article-title><source>Nat Rev Mol Cell Biol</source><volume>13</volume><fpage>524</fpage><lpage>535</lpage><year>2012</year><pub-id pub-id-type="doi">10.1038/nrm3393</pub-id><pub-id pub-id-type="pmid">22820889</pub-id></element-citation></ref>
<ref id="b10-mmr-18-02-1225"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moris</surname><given-names>N</given-names></name><name><surname>Pina</surname><given-names>C</given-names></name><name><surname>Martinez Arias</surname><given-names>A</given-names></name></person-group><article-title>Transition states and cell fate decisions in epigenetic landscapes</article-title><source>Nat Rev Genet</source><volume>17</volume><fpage>693</fpage><lpage>703</lpage><year>2016</year><pub-id pub-id-type="doi">10.1038/nrg.2016.98</pub-id><pub-id pub-id-type="pmid">27616569</pub-id></element-citation></ref>
<ref id="b11-mmr-18-02-1225"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vaquerizas</surname><given-names>JM</given-names></name><name><surname>Kummerfeld</surname><given-names>SK</given-names></name><name><surname>Teichmann</surname><given-names>SA</given-names></name><name><surname>Luscombe</surname><given-names>NM</given-names></name></person-group><article-title>A census of human transcription factors: Function, expression and evolution</article-title><source>Nat Rev Genet</source><volume>10</volume><fpage>252</fpage><lpage>263</lpage><year>2009</year><pub-id pub-id-type="doi">10.1038/nrg2538</pub-id><pub-id pub-id-type="pmid">19274049</pub-id></element-citation></ref>
<ref id="b12-mmr-18-02-1225"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Frum</surname><given-names>T</given-names></name><name><surname>Ralston</surname><given-names>A</given-names></name></person-group><article-title>Cell signaling and transcription factors regulating cell fate during formation of the mouse blastocyst</article-title><source>Trends Genet</source><volume>31</volume><fpage>402</fpage><lpage>410</lpage><year>2015</year><pub-id pub-id-type="doi">10.1016/j.tig.2015.04.002</pub-id><pub-id pub-id-type="pmid">25999217</pub-id><pub-id pub-id-type="pmcid">4490046</pub-id></element-citation></ref>
<ref id="b13-mmr-18-02-1225"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Morris</surname><given-names>SA</given-names></name></person-group><article-title>Direct lineage reprogramming via pioneer factors; a detour through developmental gene regulatory networks</article-title><source>Development</source><volume>143</volume><fpage>2696</fpage><lpage>2705</lpage><year>2016</year><pub-id pub-id-type="doi">10.1242/dev.138263</pub-id><pub-id pub-id-type="pmid">27486230</pub-id><pub-id pub-id-type="pmcid">5004913</pub-id></element-citation></ref>
<ref id="b14-mmr-18-02-1225"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Iwafuchi-Doi</surname><given-names>M</given-names></name><name><surname>Zaret</surname><given-names>KS</given-names></name></person-group><article-title>Cell fate control by pioneer transcription factors</article-title><source>Development</source><volume>143</volume><fpage>1833</fpage><lpage>1837</lpage><year>2016</year><pub-id pub-id-type="doi">10.1242/dev.133900</pub-id><pub-id pub-id-type="pmid">27246709</pub-id></element-citation></ref>
<ref id="b15-mmr-18-02-1225"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Takahashi</surname><given-names>K</given-names></name><name><surname>Tanabe</surname><given-names>K</given-names></name><name><surname>Ohnuki</surname><given-names>M</given-names></name><name><surname>Narita</surname><given-names>M</given-names></name><name><surname>Ichisaka</surname><given-names>T</given-names></name><name><surname>Tomoda</surname><given-names>K</given-names></name><name><surname>Yamanaka</surname><given-names>S</given-names></name></person-group><article-title>Induction of pluripotent stem cells from adult human fibroblasts by defined factors</article-title><source>Cell</source><volume>131</volume><fpage>861</fpage><lpage>872</lpage><year>2007</year><pub-id pub-id-type="doi">10.1016/j.cell.2007.11.019</pub-id><pub-id pub-id-type="pmid">18035408</pub-id></element-citation></ref>
<ref id="b16-mmr-18-02-1225"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vierbuchen</surname><given-names>T</given-names></name><name><surname>Ostermeier</surname><given-names>A</given-names></name><name><surname>Pang</surname><given-names>ZP</given-names></name><name><surname>Kokubu</surname><given-names>Y</given-names></name><name><surname>S&#x00FC;dhof</surname><given-names>TC</given-names></name><name><surname>Wernig</surname><given-names>M</given-names></name></person-group><article-title>Direct conversion of fibroblasts to functional neurons by defined factors</article-title><source>Nature</source><volume>463</volume><fpage>1035</fpage><lpage>1041</lpage><year>2010</year><pub-id pub-id-type="doi">10.1038/nature08797</pub-id><pub-id pub-id-type="pmid">20107439</pub-id><pub-id pub-id-type="pmcid">2829121</pub-id></element-citation></ref>
<ref id="b17-mmr-18-02-1225"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ieda</surname><given-names>M</given-names></name><name><surname>Fu</surname><given-names>JD</given-names></name><name><surname>Delgado-Olguin</surname><given-names>P</given-names></name><name><surname>Vedantham</surname><given-names>V</given-names></name><name><surname>Hayashi</surname><given-names>Y</given-names></name><name><surname>Bruneau</surname><given-names>BG</given-names></name><name><surname>Srivastava</surname><given-names>D</given-names></name></person-group><article-title>Direct reprogramming of fibroblasts into functional cardiomyocytes by defined factors</article-title><source>Cell</source><volume>142</volume><fpage>375</fpage><lpage>386</lpage><year>2010</year><pub-id pub-id-type="doi">10.1016/j.cell.2010.07.002</pub-id><pub-id pub-id-type="pmid">20691899</pub-id><pub-id pub-id-type="pmcid">2919844</pub-id></element-citation></ref>
<ref id="b18-mmr-18-02-1225"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Protze</surname><given-names>S</given-names></name><name><surname>Khattak</surname><given-names>S</given-names></name><name><surname>Poulet</surname><given-names>C</given-names></name><name><surname>Lindemann</surname><given-names>D</given-names></name><name><surname>Tanaka</surname><given-names>EM</given-names></name><name><surname>Ravens</surname><given-names>U</given-names></name></person-group><article-title>A new approach to transcription factor screening for reprogramming of fibroblasts to cardiomyocyte-like cells</article-title><source>J Mol Cell Cardiol</source><volume>53</volume><fpage>323</fpage><lpage>332</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.yjmcc.2012.04.010</pub-id><pub-id pub-id-type="pmid">22575762</pub-id></element-citation></ref>
<ref id="b19-mmr-18-02-1225"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bonilla-Porras</surname><given-names>AR</given-names></name><name><surname>Velez-Pardo</surname><given-names>C</given-names></name><name><surname>Jimenez-Del-Rio</surname><given-names>M</given-names></name></person-group><article-title>Fast transdifferentiation of human Wharton&#x0027;s jelly mesenchymal stem cells into neurospheres and nerve-like cells</article-title><source>J Neurosci Methods</source><volume>282</volume><fpage>52</fpage><lpage>60</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.jneumeth.2017.03.005</pub-id><pub-id pub-id-type="pmid">28286110</pub-id></element-citation></ref>
<ref id="b20-mmr-18-02-1225"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Abad</surname><given-names>M</given-names></name><name><surname>Hashimoto</surname><given-names>H</given-names></name><name><surname>Zhou</surname><given-names>H</given-names></name><name><surname>Morales</surname><given-names>MG</given-names></name><name><surname>Chen</surname><given-names>B</given-names></name><name><surname>Bassel-Duby</surname><given-names>R</given-names></name><name><surname>Olson</surname><given-names>EN</given-names></name></person-group><article-title>Notch inhibition enhances cardiac reprogramming by increasing MEF2C transcriptional activity</article-title><source>Stem Cell Reports</source><volume>8</volume><fpage>548</fpage><lpage>560</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.stemcr.2017.01.025</pub-id></element-citation></ref>
<ref id="b21-mmr-18-02-1225"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Islas</surname><given-names>JF</given-names></name><name><surname>Liu</surname><given-names>Y</given-names></name><name><surname>Weng</surname><given-names>KC</given-names></name><name><surname>Robertson</surname><given-names>MJ</given-names></name><name><surname>Zhang</surname><given-names>S</given-names></name><name><surname>Prejusa</surname><given-names>A</given-names></name><name><surname>Harger</surname><given-names>J</given-names></name><name><surname>Tikhomirova</surname><given-names>D</given-names></name><name><surname>Chopra</surname><given-names>M</given-names></name><name><surname>Iyer</surname><given-names>D</given-names></name><etal/></person-group><article-title>Transcription factors ETS2 and MESP1 transdifferentiate human dermal fibroblasts into cardiac progenitors</article-title><source>Proc Natl Acad Sci USA</source><volume>109</volume><fpage>13016</fpage><lpage>13021</lpage><year>2012</year><pub-id pub-id-type="doi">10.1073/pnas.1120299109</pub-id><pub-id pub-id-type="pmid">22826236</pub-id></element-citation></ref>
<ref id="b22-mmr-18-02-1225"><label>22</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Waddington</surname><given-names>CH</given-names></name></person-group><source>The strategy of the genes</source><publisher-name>Routledge</publisher-name><year>1957</year></element-citation></ref>
<ref id="b23-mmr-18-02-1225"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cahan</surname><given-names>P</given-names></name><name><surname>Li</surname><given-names>H</given-names></name><name><surname>Morris</surname><given-names>SA</given-names></name><name><surname>Lummertz da Rocha</surname><given-names>E</given-names></name><name><surname>Daley</surname><given-names>GQ</given-names></name><name><surname>Collins</surname><given-names>JJ</given-names></name></person-group><article-title>CellNet: Network biology applied to stem cell engineering</article-title><source>Cell</source><volume>158</volume><fpage>903</fpage><lpage>915</lpage><year>2014</year><pub-id pub-id-type="doi">10.1016/j.cell.2014.07.020</pub-id><pub-id pub-id-type="pmid">25126793</pub-id><pub-id pub-id-type="pmcid">4233680</pub-id></element-citation></ref>
<ref id="b24-mmr-18-02-1225"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>D&#x0027;Alessio</surname><given-names>AC</given-names></name><name><surname>Fan</surname><given-names>ZP</given-names></name><name><surname>Wert</surname><given-names>KJ</given-names></name><name><surname>Baranov</surname><given-names>P</given-names></name><name><surname>Cohen</surname><given-names>MA</given-names></name><name><surname>Saini</surname><given-names>JS</given-names></name><name><surname>Cohick</surname><given-names>E</given-names></name><name><surname>Charniga</surname><given-names>C</given-names></name><name><surname>Dadon</surname><given-names>D</given-names></name><name><surname>Hannett</surname><given-names>NM</given-names></name><etal/></person-group><article-title>A systematic approach to identify candidate transcription factors that control cell identity</article-title><source>Stem Cell Reports</source><volume>5</volume><fpage>763</fpage><lpage>775</lpage><year>2015</year><pub-id pub-id-type="doi">10.1016/j.stemcr.2015.09.016</pub-id><pub-id pub-id-type="pmid">26603904</pub-id><pub-id pub-id-type="pmcid">4649293</pub-id></element-citation></ref>
<ref id="b25-mmr-18-02-1225"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rackham</surname><given-names>OJL</given-names></name><name><surname>Firas</surname><given-names>J</given-names></name><name><surname>Fang</surname><given-names>H</given-names></name><name><surname>Oates</surname><given-names>ME</given-names></name><name><surname>Holmes</surname><given-names>ML</given-names></name><name><surname>Knaupp</surname><given-names>AS</given-names></name><collab collab-type="corp-author">FANTOM Consortium</collab><name><surname>Suzuki</surname><given-names>H</given-names></name><name><surname>Nefzger</surname><given-names>CM</given-names></name><name><surname>Daub</surname><given-names>CO</given-names></name><etal/></person-group><article-title>A predictive computational framework for direct reprogramming between human cell types</article-title><source>Nat Genet</source><volume>48</volume><fpage>331</fpage><lpage>335</lpage><year>2016</year><pub-id pub-id-type="doi">10.1038/ng.3487</pub-id><pub-id pub-id-type="pmid">26780608</pub-id></element-citation></ref>
<ref id="b26-mmr-18-02-1225"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Okawa</surname><given-names>S</given-names></name><name><surname>Nicklas</surname><given-names>S</given-names></name><name><surname>Zickenrott</surname><given-names>S</given-names></name><name><surname>Schwamborn</surname><given-names>JC</given-names></name><name><surname>Del Sol</surname><given-names>A</given-names></name></person-group><article-title>A generalized gene-regulatory network model of stem cell differentiation for predicting lineage specifiers</article-title><source>Stem Cell Reports</source><volume>7</volume><fpage>307</fpage><lpage>315</lpage><year>2016</year><pub-id pub-id-type="doi">10.1016/j.stemcr.2016.07.014</pub-id></element-citation></ref>
<ref id="b27-mmr-18-02-1225"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Barrett</surname><given-names>T</given-names></name><name><surname>Suzek</surname><given-names>TO</given-names></name><name><surname>Troup</surname><given-names>DB</given-names></name><name><surname>Wilhite</surname><given-names>SE</given-names></name><name><surname>Ngau</surname><given-names>WC</given-names></name><name><surname>Ledoux</surname><given-names>P</given-names></name><name><surname>Rudnev</surname><given-names>D</given-names></name><name><surname>Lash</surname><given-names>AE</given-names></name><name><surname>Fujibuchi</surname><given-names>W</given-names></name><name><surname>Edgar</surname><given-names>R</given-names></name></person-group><article-title>NCBI GEO: Mining millions of expression profiles-database and tools</article-title><source>Nucleic Acids Res</source><volume>33</volume><comment>(Database Issue)</comment><fpage>D562</fpage><lpage>D566</lpage><year>2005</year><pub-id pub-id-type="doi">10.1093/nar/gki022</pub-id><pub-id pub-id-type="pmid">15608262</pub-id></element-citation></ref>
<ref id="b28-mmr-18-02-1225"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Barrett</surname><given-names>T</given-names></name><name><surname>Wilhite</surname><given-names>SE</given-names></name><name><surname>Ledoux</surname><given-names>P</given-names></name><name><surname>Evangelista</surname><given-names>C</given-names></name><name><surname>Kim</surname><given-names>IF</given-names></name><name><surname>Tomashevsky</surname><given-names>M</given-names></name><name><surname>Marshall</surname><given-names>KA</given-names></name><name><surname>Phillippy</surname><given-names>KH</given-names></name><name><surname>Sherman</surname><given-names>PM</given-names></name><name><surname>Holko</surname><given-names>M</given-names></name><etal/></person-group><article-title>NCBI GEO: Archive for functional genomics data sets-update</article-title><source>Nucleic Acids Res</source><volume>41</volume><comment>(Database Issue)</comment><fpage>D991</fpage><lpage>D995</lpage><year>2013</year><pub-id pub-id-type="doi">10.1093/nar/gks1193</pub-id><pub-id pub-id-type="pmid">23193258</pub-id></element-citation></ref>
<ref id="b29-mmr-18-02-1225"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Parkinson</surname><given-names>H</given-names></name><name><surname>Kapushesky</surname><given-names>M</given-names></name><name><surname>Shojatalab</surname><given-names>M</given-names></name><name><surname>Abeygunawardena</surname><given-names>N</given-names></name><name><surname>Coulson</surname><given-names>R</given-names></name><name><surname>Farne</surname><given-names>A</given-names></name><name><surname>Holloway</surname><given-names>E</given-names></name><name><surname>Kolesnykov</surname><given-names>N</given-names></name><name><surname>Lilja</surname><given-names>P</given-names></name><name><surname>Lukk</surname><given-names>M</given-names></name><etal/></person-group><article-title>ArrayExpress-a public database of microarray experiments and gene expression profiles</article-title><source>Nucleic Acids Res</source><volume>35</volume><comment>(Database Issue)</comment><fpage>D747</fpage><lpage>D750</lpage><year>2007</year><pub-id pub-id-type="doi">10.1093/nar/gkl995</pub-id><pub-id pub-id-type="pmid">17132828</pub-id></element-citation></ref>
<ref id="b30-mmr-18-02-1225"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kolesnikov</surname><given-names>N</given-names></name><name><surname>Hastings</surname><given-names>E</given-names></name><name><surname>Keays</surname><given-names>M</given-names></name><name><surname>Melnichuk</surname><given-names>O</given-names></name><name><surname>Tang</surname><given-names>YA</given-names></name><name><surname>Williams</surname><given-names>E</given-names></name><name><surname>Dylag</surname><given-names>M</given-names></name><name><surname>Kurbatova</surname><given-names>N</given-names></name><name><surname>Brandizi</surname><given-names>M</given-names></name><name><surname>Burdett</surname><given-names>T</given-names></name><etal/></person-group><article-title>ArrayExpress update-simplifying data submissions</article-title><source>Nucleic Acids Res</source><volume>43</volume><comment>(Database Issue)</comment><fpage>D1113</fpage><lpage>D1116</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/nar/gku1057</pub-id><pub-id pub-id-type="pmid">25361974</pub-id></element-citation></ref>
<ref id="b31-mmr-18-02-1225"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rosenbloom</surname><given-names>KR</given-names></name><name><surname>Sloan</surname><given-names>CA</given-names></name><name><surname>Malladi</surname><given-names>VS</given-names></name><name><surname>Dreszer</surname><given-names>TR</given-names></name><name><surname>Learned</surname><given-names>K</given-names></name><name><surname>Kirkup</surname><given-names>VM</given-names></name><name><surname>Wong</surname><given-names>MC</given-names></name><name><surname>Maddren</surname><given-names>M</given-names></name><name><surname>Fang</surname><given-names>R</given-names></name><name><surname>Heitner</surname><given-names>SG</given-names></name><etal/></person-group><article-title>ENCODE Data in the UCSC genome browser: Year 5 update</article-title><source>Nucleic Acids Res</source><volume>41</volume><comment>(Database Issue)</comment><fpage>D56</fpage><lpage>D63</lpage><year>2013</year><pub-id pub-id-type="doi">10.1093/nar/gks1172</pub-id><pub-id pub-id-type="pmid">23193274</pub-id></element-citation></ref>
<ref id="b32-mmr-18-02-1225"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>SY</given-names></name><name><surname>Lee</surname><given-names>JW</given-names></name><name><surname>Sohn</surname><given-names>IS</given-names></name></person-group><article-title>Comparison of various statistical methods for identifying differential gene expression in replicated microarray data</article-title><source>Stat Methods Med Res</source><volume>15</volume><fpage>3</fpage><lpage>20</lpage><year>2006</year><pub-id pub-id-type="doi">10.1191/0962280206sm423oa</pub-id><pub-id pub-id-type="pmid">16477945</pub-id></element-citation></ref>
<ref id="b33-mmr-18-02-1225"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>HC</given-names></name><name><surname>Niu</surname><given-names>Y</given-names></name><name><surname>Qin</surname><given-names>LX</given-names></name></person-group><article-title>Differential expression analysis for RNA-Seq: An overview of statistical methods and computational software</article-title><source>Cancer Inform</source><volume>14</volume><supplement>Suppl 1</supplement><fpage>S57</fpage><lpage>S67</lpage><year>2015</year></element-citation></ref>
<ref id="b34-mmr-18-02-1225"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Seyednasrollah</surname><given-names>F</given-names></name><name><surname>Laiho</surname><given-names>A</given-names></name><name><surname>Elo</surname><given-names>LL</given-names></name></person-group><article-title>Comparison of software packages for detecting differential expression in RNA-seq studies</article-title><source>Brief Bioinform</source><volume>16</volume><fpage>59</fpage><lpage>70</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/bib/bbt086</pub-id><pub-id pub-id-type="pmid">24300110</pub-id><pub-id pub-id-type="pmcid">4293378</pub-id></element-citation></ref>
<ref id="b35-mmr-18-02-1225"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sullivan</surname><given-names>GM</given-names></name><name><surname>Feinn</surname><given-names>R</given-names></name></person-group><article-title>Using effect size-or why the P value is not enough</article-title><source>J Grad Med Educ</source><volume>4</volume><fpage>279</fpage><lpage>282</lpage><year>2012</year><pub-id pub-id-type="doi">10.4300/JGME-D-12-00156.1</pub-id><pub-id pub-id-type="pmid">23997866</pub-id><pub-id pub-id-type="pmcid">3444174</pub-id></element-citation></ref>
<ref id="b36-mmr-18-02-1225"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>S</given-names></name><name><surname>Guo</surname><given-names>YP</given-names></name><name><surname>May</surname><given-names>G</given-names></name><name><surname>Enver</surname><given-names>T</given-names></name></person-group><article-title>Bifurcation dynamics in lineage-commitment in bipotent progenitor cells</article-title><source>Dev Biol</source><volume>305</volume><fpage>695</fpage><lpage>713</lpage><year>2007</year><pub-id pub-id-type="doi">10.1016/j.ydbio.2007.02.036</pub-id><pub-id pub-id-type="pmid">17412320</pub-id></element-citation></ref>
<ref id="b37-mmr-18-02-1225"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jacob</surname><given-names>F</given-names></name><name><surname>Monod</surname><given-names>J</given-names></name></person-group><article-title>Genetic regulatory mechanisms in the synthesis of proteins</article-title><source>J Mol Biol</source><volume>3</volume><fpage>318</fpage><lpage>356</lpage><year>1961</year><pub-id pub-id-type="doi">10.1016/S0022-2836(61)80072-7</pub-id><pub-id pub-id-type="pmid">13718526</pub-id></element-citation></ref>
<ref id="b38-mmr-18-02-1225"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Roeder</surname><given-names>I</given-names></name><name><surname>Glauche</surname><given-names>I</given-names></name></person-group><article-title>Towards an understanding of lineage specification in hematopoietic stem cells: A mathematical model for the interaction of transcription factors GATA-1 and PU.1</article-title><source>J Theor Biol</source><volume>241</volume><fpage>852</fpage><lpage>865</lpage><year>2006</year><pub-id pub-id-type="doi">10.1016/j.jtbi.2006.01.021</pub-id><pub-id pub-id-type="pmid">16510158</pub-id></element-citation></ref>
<ref id="b39-mmr-18-02-1225"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Faith</surname><given-names>JJ</given-names></name><name><surname>Hayete</surname><given-names>B</given-names></name><name><surname>Thaden</surname><given-names>JT</given-names></name><name><surname>Mogno</surname><given-names>I</given-names></name><name><surname>Wierzbowski</surname><given-names>J</given-names></name><name><surname>Cottarel</surname><given-names>G</given-names></name><name><surname>Kasif</surname><given-names>S</given-names></name><name><surname>Collins</surname><given-names>JJ</given-names></name><name><surname>Gardner</surname><given-names>TS</given-names></name></person-group><article-title>Large-scale mapping and validation of <italic>Escherichia coli</italic> transcriptional regulation from a compendium of expression profiles</article-title><source>PLoS Biol</source><volume>5</volume><fpage>e8</fpage><year>2007</year><pub-id pub-id-type="doi">10.1371/journal.pbio.0050008</pub-id><pub-id pub-id-type="pmid">17214507</pub-id><pub-id pub-id-type="pmcid">1764438</pub-id></element-citation></ref>
<ref id="b40-mmr-18-02-1225"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rosvall</surname><given-names>M</given-names></name><name><surname>Bergstrom</surname><given-names>CT</given-names></name></person-group><article-title>Maps of random walks on complex networks reveal community structure</article-title><source>Proc Natl Acad Sci USA</source><volume>105</volume><fpage>1118</fpage><lpage>1123</lpage><year>2008</year><pub-id pub-id-type="doi">10.1073/pnas.0706851105</pub-id><pub-id pub-id-type="pmid">18216267</pub-id></element-citation></ref>
<ref id="b41-mmr-18-02-1225"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Subramanian</surname><given-names>A</given-names></name><name><surname>Tamayo</surname><given-names>P</given-names></name><name><surname>Mootha</surname><given-names>VK</given-names></name><name><surname>Mukherjee</surname><given-names>S</given-names></name><name><surname>Ebert</surname><given-names>BL</given-names></name><name><surname>Gillette</surname><given-names>MA</given-names></name><name><surname>Paulovich</surname><given-names>A</given-names></name><name><surname>Pomeroy</surname><given-names>SL</given-names></name><name><surname>Golub</surname><given-names>TR</given-names></name><name><surname>Lander</surname><given-names>ES</given-names></name><name><surname>Mesirov</surname><given-names>JP</given-names></name></person-group><article-title>Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles</article-title><source>Proc Natl Acad Sci USA</source><volume>102</volume><fpage>15545</fpage><lpage>15550</lpage><year>2005</year><pub-id pub-id-type="doi">10.1073/pnas.0506580102</pub-id><pub-id pub-id-type="pmid">16199517</pub-id></element-citation></ref>
<ref id="b42-mmr-18-02-1225"><label>42</label><element-citation publication-type="journal"><collab collab-type="corp-author">FANTOM Consortium</collab><person-group person-group-type="author"><name><surname>Suzuki</surname><given-names>H</given-names></name><name><surname>Forrest</surname><given-names>AR</given-names></name><name><surname>van Nimwegen</surname><given-names>E</given-names></name><name><surname>Daub</surname><given-names>CO</given-names></name><name><surname>Balwierz</surname><given-names>PJ</given-names></name><name><surname>Irvine</surname><given-names>KM</given-names></name><name><surname>Lassmann</surname><given-names>T</given-names></name><name><surname>Ravasi</surname><given-names>T</given-names></name><name><surname>Hasegawa</surname><given-names>Y</given-names></name><etal/></person-group><article-title>The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line</article-title><source>Nat Genet</source><volume>41</volume><fpage>553</fpage><lpage>562</lpage><year>2009</year><pub-id pub-id-type="doi">10.1038/ng.375</pub-id><pub-id pub-id-type="pmid">19377474</pub-id></element-citation></ref>
<ref id="b43-mmr-18-02-1225"><label>43</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Szklarczyk</surname><given-names>D</given-names></name><name><surname>Franceschini</surname><given-names>A</given-names></name><name><surname>Wyder</surname><given-names>S</given-names></name><name><surname>Forslund</surname><given-names>K</given-names></name><name><surname>Heller</surname><given-names>D</given-names></name><name><surname>Huerta-Cepas</surname><given-names>J</given-names></name><name><surname>Simonovic</surname><given-names>M</given-names></name><name><surname>Roth</surname><given-names>A</given-names></name><name><surname>Santos</surname><given-names>A</given-names></name><name><surname>Tsafou</surname><given-names>KP</given-names></name><etal/></person-group><article-title>STRING v10: Protein-protein interaction networks, integrated over the tree of life</article-title><source>Nucleic Acids Res</source><volume>43</volume><comment>(Database Issue)</comment><fpage>D447</fpage><lpage>D452</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/nar/gku1003</pub-id><pub-id pub-id-type="pmid">25352553</pub-id></element-citation></ref>
<ref id="b44-mmr-18-02-1225"><label>44</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nikolsky</surname><given-names>Y</given-names></name><name><surname>Ekins</surname><given-names>S</given-names></name><name><surname>Nikolskaya</surname><given-names>T</given-names></name><name><surname>Bugrim</surname><given-names>A</given-names></name></person-group><article-title>A novel method for generation of signature networks as biomarkers from complex high throughput data</article-title><source>Toxicol Lett</source><volume>158</volume><fpage>20</fpage><lpage>29</lpage><year>2005</year><pub-id pub-id-type="doi">10.1016/j.toxlet.2005.02.004</pub-id><pub-id pub-id-type="pmid">15871913</pub-id></element-citation></ref>
<ref id="b45-mmr-18-02-1225"><label>45</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Crespo</surname><given-names>I</given-names></name><name><surname>Del Sol</surname><given-names>A</given-names></name></person-group><article-title>A general strategy for cellular reprogramming: The importance of transcription factor cross-repression</article-title><source>Stem Cells</source><volume>31</volume><fpage>2127</fpage><lpage>2135</lpage><year>2013</year><pub-id pub-id-type="doi">10.1002/stem.1473</pub-id><pub-id pub-id-type="pmid">23873656</pub-id></element-citation></ref>
<ref id="b46-mmr-18-02-1225"><label>46</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kauffman</surname><given-names>SA</given-names></name></person-group><article-title>Homeostasis and differentiation in random genetic control networks</article-title><source>Nature</source><volume>224</volume><fpage>177</fpage><lpage>178</lpage><year>1969</year><pub-id pub-id-type="doi">10.1038/224177a0</pub-id><pub-id pub-id-type="pmid">5343519</pub-id></element-citation></ref>
<ref id="b47-mmr-18-02-1225"><label>47</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>HM</given-names></name><name><surname>Liu</surname><given-names>T</given-names></name><name><surname>Liu</surname><given-names>CJ</given-names></name><name><surname>Song</surname><given-names>S</given-names></name><name><surname>Zhang</surname><given-names>X</given-names></name><name><surname>Liu</surname><given-names>W</given-names></name><name><surname>Jia</surname><given-names>H</given-names></name><name><surname>Xue</surname><given-names>Y</given-names></name><name><surname>Guo</surname><given-names>AY</given-names></name></person-group><article-title>AnimalTFDB 2.0: A resource for expression, prediction and functional study of animal transcription factors</article-title><source>Nucleic Acids Res</source><volume>43</volume><comment>(Database Issue)</comment><fpage>D76</fpage><lpage>D81</lpage><year>2015</year><pub-id pub-id-type="doi">10.1093/nar/gku887</pub-id><pub-id pub-id-type="pmid">25262351</pub-id></element-citation></ref>
<ref id="b48-mmr-18-02-1225"><label>48</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xiang</surname><given-names>FL</given-names></name><name><surname>Guo</surname><given-names>M</given-names></name><name><surname>Yutzey</surname><given-names>KE</given-names></name></person-group><article-title>Overexpression of Tbx20 in adult cardiomyocytes promotes proliferation and improves cardiac function after myocardial infarction</article-title><source>Circulation</source><volume>133</volume><fpage>1081</fpage><lpage>1092</lpage><year>2016</year><pub-id pub-id-type="doi">10.1161/CIRCULATIONAHA.115.019357</pub-id><pub-id pub-id-type="pmid">26841808</pub-id><pub-id pub-id-type="pmcid">4792775</pub-id></element-citation></ref>
<ref id="b49-mmr-18-02-1225"><label>49</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chakraborty</surname><given-names>S</given-names></name><name><surname>Yutzey</surname><given-names>KE</given-names></name></person-group><article-title>Tbx20 regulation of cardiac cell proliferation and lineage specialization during embryonic and fetal development in vivo</article-title><source>Dev Biol</source><volume>363</volume><fpage>234</fpage><lpage>246</lpage><year>2012</year><pub-id pub-id-type="doi">10.1016/j.ydbio.2011.12.034</pub-id><pub-id pub-id-type="pmid">22226977</pub-id></element-citation></ref>
<ref id="b50-mmr-18-02-1225"><label>50</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Addis</surname><given-names>RC</given-names></name><name><surname>Ifkovits</surname><given-names>JL</given-names></name><name><surname>Pinto</surname><given-names>F</given-names></name><name><surname>Kellam</surname><given-names>LD</given-names></name><name><surname>Esteso</surname><given-names>P</given-names></name><name><surname>Rentschler</surname><given-names>S</given-names></name><name><surname>Christoforou</surname><given-names>N</given-names></name><name><surname>Epstein</surname><given-names>JA</given-names></name><name><surname>Gearhart</surname><given-names>JD</given-names></name></person-group><article-title>Optimization of direct fibroblast reprogramming to cardiomyocytes using calcium activity as a functional measure of success</article-title><source>J Mol Cell Cardiol</source><volume>60</volume><fpage>97</fpage><lpage>106</lpage><year>2013</year><pub-id pub-id-type="doi">10.1016/j.yjmcc.2013.04.004</pub-id><pub-id pub-id-type="pmid">23591016</pub-id><pub-id pub-id-type="pmcid">3679282</pub-id></element-citation></ref>
<ref id="b51-mmr-18-02-1225"><label>51</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname><given-names>JD</given-names></name><name><surname>Stone</surname><given-names>NR</given-names></name><name><surname>Liu</surname><given-names>L</given-names></name><name><surname>Spencer</surname><given-names>CI</given-names></name><name><surname>Qian</surname><given-names>L</given-names></name><name><surname>Hayashi</surname><given-names>Y</given-names></name><name><surname>Delgado-Olguin</surname><given-names>P</given-names></name><name><surname>Ding</surname><given-names>S</given-names></name><name><surname>Bruneau</surname><given-names>BG</given-names></name><name><surname>Srivastava</surname><given-names>D</given-names></name></person-group><article-title>Direct reprogramming of human fibroblasts toward a cardiomyocyte-like state</article-title><source>Stem Cell Reports</source><volume>1</volume><fpage>235</fpage><lpage>247</lpage><year>2013</year><pub-id pub-id-type="doi">10.1016/j.stemcr.2013.07.005</pub-id><pub-id pub-id-type="pmid">24319660</pub-id><pub-id pub-id-type="pmcid">3849259</pub-id></element-citation></ref>
<ref id="b52-mmr-18-02-1225"><label>52</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>O</given-names></name><name><surname>Qian</surname><given-names>L</given-names></name></person-group><article-title>Direct cardiac reprogramming: Advances in cardiac regeneration</article-title><source>Biomed Res Int</source><volume>2015</volume><fpage>580406</fpage><year>2015</year><pub-id pub-id-type="pmid">26176012</pub-id><pub-id pub-id-type="pmcid">4484844</pub-id></element-citation></ref>
<ref id="b53-mmr-18-02-1225"><label>53</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ieda</surname><given-names>M</given-names></name><name><surname>Tsuchihashi</surname><given-names>T</given-names></name><name><surname>Ivey</surname><given-names>KN</given-names></name><name><surname>Ross</surname><given-names>RS</given-names></name><name><surname>Hong</surname><given-names>TT</given-names></name><name><surname>Shaw</surname><given-names>RM</given-names></name><name><surname>Srivastava</surname><given-names>D</given-names></name></person-group><article-title>Cardiac fibroblasts regulate myocardial proliferation through beta1 integrin signaling</article-title><source>Dev Cell</source><volume>16</volume><fpage>233</fpage><lpage>244</lpage><year>2009</year><pub-id pub-id-type="doi">10.1016/j.devcel.2008.12.007</pub-id><pub-id pub-id-type="pmid">19217425</pub-id><pub-id pub-id-type="pmcid">2664087</pub-id></element-citation></ref>
<ref id="b54-mmr-18-02-1225"><label>54</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rastegar-Pouyani</surname><given-names>S</given-names></name><name><surname>Khazaei</surname><given-names>N</given-names></name><name><surname>Wee</surname><given-names>P</given-names></name><name><surname>Yaqubi</surname><given-names>M</given-names></name><name><surname>Mohammadnia</surname><given-names>A</given-names></name></person-group><article-title>Meta-analysis of transcriptome regulation during induction to cardiac myocyte fate from mouse and human fibroblasts</article-title><source>J Cell Physiol</source><volume>232</volume><fpage>2053</fpage><lpage>2062</lpage><year>2017</year><pub-id pub-id-type="doi">10.1002/jcp.25580</pub-id><pub-id pub-id-type="pmid">27579918</pub-id></element-citation></ref>
<ref id="b55-mmr-18-02-1225"><label>55</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kamaraj</surname><given-names>US</given-names></name><name><surname>Gough</surname><given-names>J</given-names></name><name><surname>Polo</surname><given-names>JM</given-names></name><name><surname>Petretto</surname><given-names>E</given-names></name><name><surname>Rackham</surname><given-names>OJ</given-names></name></person-group><article-title>Computational methods for direct cell conversion</article-title><source>Cell Cycle</source><volume>15</volume><fpage>3343</fpage><lpage>3354</lpage><year>2016</year><pub-id pub-id-type="doi">10.1080/15384101.2016.1238119</pub-id><pub-id pub-id-type="pmid">27736295</pub-id><pub-id pub-id-type="pmcid">5224461</pub-id></element-citation></ref>
<ref id="b56-mmr-18-02-1225"><label>56</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ebrahimi</surname><given-names>B</given-names></name></person-group><article-title>Biological computational approaches: New hopes to improve (re)programming robustness, regenerative medicine and cancer therapeutics</article-title><source>Differentiation</source><volume>92</volume><fpage>35</fpage><lpage>40</lpage><year>2016</year><pub-id pub-id-type="doi">10.1016/j.diff.2016.03.001</pub-id><pub-id pub-id-type="pmid">27056282</pub-id></element-citation></ref>
<ref id="b57-mmr-18-02-1225"><label>57</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Risebro</surname><given-names>CA</given-names></name><name><surname>Searles</surname><given-names>RG</given-names></name><name><surname>Melville</surname><given-names>AA</given-names></name><name><surname>Ehler</surname><given-names>E</given-names></name><name><surname>Jina</surname><given-names>N</given-names></name><name><surname>Shah</surname><given-names>S</given-names></name><name><surname>Pallas</surname><given-names>J</given-names></name><name><surname>Hubank</surname><given-names>M</given-names></name><name><surname>Dillard</surname><given-names>M</given-names></name><name><surname>Harvey</surname><given-names>NL</given-names></name><etal/></person-group><article-title>Prox1 maintains muscle structure and growth in the developing heart</article-title><source>Development</source><volume>136</volume><fpage>495</fpage><lpage>505</lpage><year>2009</year><pub-id pub-id-type="doi">10.1242/dev.030007</pub-id><pub-id pub-id-type="pmid">19091769</pub-id></element-citation></ref>
<ref id="b58-mmr-18-02-1225"><label>58</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>Q</given-names></name><name><surname>Jiang</surname><given-names>C</given-names></name><name><surname>Xu</surname><given-names>J</given-names></name><name><surname>Zhao</surname><given-names>MT</given-names></name><name><surname>Van Bortle</surname><given-names>K</given-names></name><name><surname>Cheng</surname><given-names>X</given-names></name><name><surname>Wang</surname><given-names>G</given-names></name><name><surname>Chang</surname><given-names>HY</given-names></name><name><surname>Wu</surname><given-names>JC</given-names></name><name><surname>Snyder</surname><given-names>MP</given-names></name></person-group><article-title>Genome-wide temporal profiling of transcriptome and open chromatin of early cardiomyocyte differentiation derived from hiPSCs and hESCs</article-title><source>Circ Res</source><volume>121</volume><fpage>376</fpage><lpage>391</lpage><year>2017</year><pub-id pub-id-type="doi">10.1161/CIRCRESAHA.116.310456</pub-id><pub-id pub-id-type="pmid">28663367</pub-id></element-citation></ref>
<ref id="b59-mmr-18-02-1225"><label>59</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shekhar</surname><given-names>A</given-names></name><name><surname>Lin</surname><given-names>X</given-names></name><name><surname>Liu</surname><given-names>FY</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name><name><surname>Mo</surname><given-names>H</given-names></name><name><surname>Bastarache</surname><given-names>L</given-names></name><name><surname>Denny</surname><given-names>JC</given-names></name><name><surname>Cox</surname><given-names>NJ</given-names></name><name><surname>Delmar</surname><given-names>M</given-names></name><name><surname>Roden</surname><given-names>DM</given-names></name><etal/></person-group><article-title>Transcription factor ETV1 is essential for rapid conduction in the heart</article-title><source>J Clin Invest</source><volume>126</volume><fpage>4444</fpage><lpage>4459</lpage><year>2016</year><pub-id pub-id-type="doi">10.1172/JCI87968</pub-id><pub-id pub-id-type="pmid">27775552</pub-id><pub-id pub-id-type="pmcid">5127680</pub-id></element-citation></ref>
<ref id="b60-mmr-18-02-1225"><label>60</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Koizumi</surname><given-names>A</given-names></name><name><surname>Sasano</surname><given-names>T</given-names></name><name><surname>Kimura</surname><given-names>W</given-names></name><name><surname>Miyamoto</surname><given-names>Y</given-names></name><name><surname>Aiba</surname><given-names>T</given-names></name><name><surname>Ishikawa</surname><given-names>T</given-names></name><name><surname>Nogami</surname><given-names>A</given-names></name><name><surname>Fukamizu</surname><given-names>S</given-names></name><name><surname>Sakurada</surname><given-names>H</given-names></name><name><surname>Takahashi</surname><given-names>Y</given-names></name><etal/></person-group><article-title>Genetic defects in a His-Purkinje system transcription factor, IRX3, cause lethal cardiac arrhythmias</article-title><source>Eur Heart J</source><volume>37</volume><fpage>1469</fpage><lpage>1475</lpage><year>2016</year><pub-id pub-id-type="doi">10.1093/eurheartj/ehv449</pub-id><pub-id pub-id-type="pmid">26429810</pub-id></element-citation></ref>
<ref id="b61-mmr-18-02-1225"><label>61</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nam</surname><given-names>YS</given-names></name><name><surname>Kim</surname><given-names>Y</given-names></name><name><surname>Joung</surname><given-names>H</given-names></name><name><surname>Kwon</surname><given-names>DH</given-names></name><name><surname>Choe</surname><given-names>N</given-names></name><name><surname>Min</surname><given-names>HK</given-names></name><name><surname>Kim</surname><given-names>YS</given-names></name><name><surname>Kim</surname><given-names>HS</given-names></name><name><surname>Kim</surname><given-names>DK</given-names></name><name><surname>Cho</surname><given-names>YK</given-names></name><etal/></person-group><article-title>Small heterodimer partner blocks cardiac hypertrophy by interfering with GATA6 signaling</article-title><source>Circ Res</source><volume>115</volume><fpage>493</fpage><lpage>503</lpage><year>2014</year><pub-id pub-id-type="doi">10.1161/CIRCRESAHA.115.304388</pub-id><pub-id pub-id-type="pmid">25015078</pub-id></element-citation></ref>
<ref id="b62-mmr-18-02-1225"><label>62</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Frausto</surname><given-names>RF</given-names></name><name><surname>Wang</surname><given-names>C</given-names></name><name><surname>Aldave</surname><given-names>AJ</given-names></name></person-group><article-title>Transcriptome analysis of the human corneal endothelium</article-title><source>Invest Ophthalmol Vis Sci</source><volume>55</volume><fpage>7821</fpage><lpage>7830</lpage><year>2014</year><pub-id pub-id-type="doi">10.1167/iovs.14-15021</pub-id><pub-id pub-id-type="pmid">25377225</pub-id><pub-id pub-id-type="pmcid">4258927</pub-id></element-citation></ref>
<ref id="b63-mmr-18-02-1225"><label>63</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Trevino</surname><given-names>V</given-names></name></person-group><article-title>Chi-Co-Express: A database of human co-expression networks from global cell states</article-title><source>Manuscr Prep</source><year>2017</year></element-citation></ref>
<ref id="b64-mmr-18-02-1225"><label>64</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>Q</given-names></name><name><surname>Birkbak</surname><given-names>NJ</given-names></name><name><surname>Gyorffy</surname><given-names>B</given-names></name><name><surname>Szallasi</surname><given-names>Z</given-names></name><name><surname>Eklund</surname><given-names>AC</given-names></name></person-group><article-title>Jetset: Selecting the optimal microarray probe set to represent a gene</article-title><source>BMC Bioinformatics</source><volume>12</volume><fpage>474</fpage><year>2011</year><pub-id pub-id-type="doi">10.1186/1471-2105-12-474</pub-id><pub-id pub-id-type="pmid">22172014</pub-id><pub-id pub-id-type="pmcid">3266307</pub-id></element-citation></ref>
<ref id="b65-mmr-18-02-1225"><label>65</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pressman</surname><given-names>CL</given-names></name><name><surname>Chen</surname><given-names>H</given-names></name><name><surname>Johnson</surname><given-names>RL</given-names></name></person-group><article-title>LMX1B, a LIM homeodomain class transcription factor, is necessary for normal development of multiple tissues in the anterior segment of the murine eye</article-title><source>Genesis</source><volume>26</volume><fpage>15</fpage><lpage>25</lpage><year>2000</year><pub-id pub-id-type="doi">10.1002/(SICI)1526-968X(200001)26:1&#x003C;15::AID-GENE5&#x003E;3.0.CO;2-V</pub-id><pub-id pub-id-type="pmid">10660670</pub-id></element-citation></ref>
<ref id="b66-mmr-18-02-1225"><label>66</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Powell</surname><given-names>C</given-names></name><name><surname>Cornblath</surname><given-names>E</given-names></name><name><surname>Goldman</surname><given-names>D</given-names></name></person-group><article-title>Zinc-binding domain-dependent, deaminase-independent actions of apolipoprotein B mRNA-editing enzyme, catalytic polypeptide 2 (Apobec2), mediate its effect on zebrafish retina regeneration</article-title><source>J Biol Chem</source><volume>289</volume><fpage>28924</fpage><lpage>28941</lpage><year>2014</year><pub-id pub-id-type="doi">10.1074/jbc.M114.603043</pub-id><pub-id pub-id-type="pmid">25190811</pub-id><pub-id pub-id-type="pmcid">4200251</pub-id></element-citation></ref>
<ref id="b67-mmr-18-02-1225"><label>67</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>L</given-names></name><name><surname>Martino</surname><given-names>V</given-names></name><name><surname>Dombkowski</surname><given-names>A</given-names></name><name><surname>Williams</surname><given-names>T</given-names></name><name><surname>West-Mays</surname><given-names>J</given-names></name><name><surname>Gage</surname><given-names>PJ</given-names></name></person-group><article-title>AP-2&#x03B2; is a downstream effector of PITX2 required to specify endothelium and establish angiogenic privilege during corneal development</article-title><source>Invest Ophthalmol Vis Sci</source><volume>57</volume><fpage>1072</fpage><lpage>1081</lpage><year>2016</year><pub-id pub-id-type="doi">10.1167/iovs.15-18103</pub-id></element-citation></ref>
<ref id="b68-mmr-18-02-1225"><label>68</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bradley</surname><given-names>JL</given-names></name><name><surname>Edwards</surname><given-names>CS</given-names></name><name><surname>Fullard</surname><given-names>RJ</given-names></name></person-group><article-title>Adaptation of impression cytology to enable conjunctival surface cell transcriptome analysis</article-title><source>Curr Eye Res</source><volume>39</volume><fpage>31</fpage><lpage>41</lpage><year>2014</year><pub-id pub-id-type="doi">10.3109/02713683.2013.823213</pub-id><pub-id pub-id-type="pmid">24047118</pub-id></element-citation></ref>
<ref id="b69-mmr-18-02-1225"><label>69</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Khor</surname><given-names>CC</given-names></name><name><surname>Do</surname><given-names>T</given-names></name><name><surname>Jia</surname><given-names>H</given-names></name><name><surname>Nakano</surname><given-names>M</given-names></name><name><surname>George</surname><given-names>R</given-names></name><name><surname>Abu-Amero</surname><given-names>K</given-names></name><name><surname>Duvesh</surname><given-names>R</given-names></name><name><surname>Chen</surname><given-names>LJ</given-names></name><name><surname>Li</surname><given-names>Z</given-names></name><name><surname>Nongpiur</surname><given-names>ME</given-names></name><etal/></person-group><article-title>Genome-wide association study identifies five new susceptibility loci for primary angle closure glaucoma</article-title><source>Nat Genet</source><volume>48</volume><fpage>556</fpage><lpage>562</lpage><year>2016</year><pub-id pub-id-type="doi">10.1038/ng.3540</pub-id><pub-id pub-id-type="pmid">27064256</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-mmr-18-02-1225" position="float">
<label>Figure 1.</label>
<caption><p>Simplified view of TF identification for cell conversion. (A) Process of defining at least two cell populations. (B) Differential expression analysis of TFs between defined populations to identify pre-candidate TFs. (C) Filtering process of pre-candidates in order to generate a short list of TFs whose overexpression will likely control the desired cell state. TF, transcription factor.</p></caption>
<graphic xlink:href="MMR-18-02-1225-g00.tif"/>
</fig>
<fig id="f2-mmr-18-02-1225" position="float">
<label>Figure 2.</label>
<caption><p>Comparison of the definition of cell populations.</p></caption>
<graphic xlink:href="MMR-18-02-1225-g01.tif"/>
</fig>
<fig id="f3-mmr-18-02-1225" position="float">
<label>Figure 3.</label>
<caption><p>Comparison of conceptual definitions to identify TF differences. TF, transcription factor; mag, magnitude.</p></caption>
<graphic xlink:href="MMR-18-02-1225-g02.tif"/>
</fig>
<fig id="f4-mmr-18-02-1225" position="float">
<label>Figure 4.</label>
<caption><p>Comparison of the generation of candidate TFs. TF, transcription factor; CLR, context likelihood of relatedness; JSD, Jensen-Shannon divergence; NRD, normalized ratio difference; GSEA, gene set enrichment analysis.</p></caption>
<graphic xlink:href="MMR-18-02-1225-g03.tif"/>
</fig>
<fig id="f5-mmr-18-02-1225" position="float">
<label>Figure 5.</label>
<caption><p>Results for the CEC example. (A) Comparison of the five scores. The t-test P-value is indicated as -Log10. (B) Table of the top 20 genes by each criterion, including those most frequently arising (Mentions column). Genes were assigned specific colors. Genes in italics were repeated, although not in the top 20. Genes in black were specific to each score. (C) Comparison of the expression of the genes listed in the Mentions column of panel (B) across CEC and non-target cell types. CEC, corneal endothelial cells.</p></caption>
<graphic xlink:href="MMR-18-02-1225-g04.tif"/>
</fig>
<table-wrap id="tI-mmr-18-02-1225" position="float">
<label>Table I.</label>
<caption><p>Definition of cell type populations by each method.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Author, year</th>
<th align="center" valign="bottom">Data</th>
<th align="center" valign="bottom">Target</th>
<th align="center" valign="bottom">Non-targets</th>
<th align="center" valign="bottom">(Refs.)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Cahan <italic>et al</italic>, 2014</td>
<td align="left" valign="top">GEO, queried datasets, 16&#x2013;20 cell types</td>
<td align="left" valign="top">Several samples of the same cell or tissue type</td>
<td align="left" valign="top">Remaining cell types</td>
<td align="center" valign="top">(<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">D&#x0027;Alessio <italic>et al</italic>, 2015</td>
<td align="left" valign="top">GEO, 504 datasets, 233 cell types</td>
<td align="left" valign="top">Several samples of the same cell or tissue type</td>
<td align="left" valign="top">Remaining cell types (balanced)</td>
<td align="center" valign="top">(<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Rackham <italic>et al</italic>, 2016</td>
<td align="left" valign="top">FANTOM5, &#x003E;700 datasets (CAGE-Seq)</td>
<td align="left" valign="top">Samples of the same cell type</td>
<td align="left" valign="top">Remaining cell types, excluding closely and distantly related ones</td>
<td align="center" valign="top">(<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Okawa <italic>et al</italic>, 2016</td>
<td align="left" valign="top">GEO, specific data</td>
<td align="left" valign="top">A daughter cell type</td>
<td align="left" valign="top">The progenitor and sister cell types</td>
<td align="center" valign="top">(<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-mmr-18-02-1225"><p>GEO, Gene Expression Omnibus.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tII-mmr-18-02-1225" position="float">
<label>Table II.</label>
<caption><p>Identification of differentially expressed TFs.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Author, year</th>
<th align="center" valign="bottom">Method</th>
<th align="center" valign="bottom">Comparison</th>
<th align="center" valign="bottom">(Refs.)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Cahan <italic>et al</italic>, 2014</td>
<td align="left" valign="top">Tissue-Specific Context Likelihood of Relatedness</td>
<td align="left" valign="top">Pairs of co-expressed TFs</td>
<td align="center" valign="top">(<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">D&#x0027;Alessio <italic>et al</italic>, 2015</td>
<td align="left" valign="top">Jensen-Shannon Divergence</td>
<td align="left" valign="top">Per TF</td>
<td align="center" valign="top">(<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Rackham <italic>et al</italic>, 2016</td>
<td align="left" valign="top">Combines P-values and fold-change</td>
<td align="left" valign="top">Per TF</td>
<td align="center" valign="top">(<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Okawa <italic>et al</italic>, 2016</td>
<td align="left" valign="top">Normalized Ratio Difference</td>
<td align="left" valign="top">Pairs of swap-expressed TFs</td>
<td align="center" valign="top">(<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-mmr-18-02-1225"><p>TF, transcription factor.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tIII-mmr-18-02-1225" position="float">
<label>Table III.</label>
<caption><p>Resources available for finding key TFs.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Author, year</th>
<th align="center" valign="bottom">Resources and limitations</th>
<th align="center" valign="bottom">(Refs.)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Cahan <italic>et al</italic>, 2014</td>
<td align="left" valign="top">CellNet: Web interface and R package. Accepts any source cell type as input, but only from certain Affymetrix arrays and Illumina arrays (in R). Only specific target cell types are available.</td>
<td align="center" valign="top">(<xref rid="b23-mmr-18-02-1225" ref-type="bibr">23</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">D&#x0027;Alessio <italic>et al</italic>, 2015</td>
<td align="left" valign="top">File for 233 cell type predictions. Manual estimations are possible for a target. Source is not used.</td>
<td align="center" valign="top">(<xref rid="b24-mmr-18-02-1225" ref-type="bibr">24</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Rackham <italic>et al</italic>, 2016</td>
<td align="left" valign="top">Mogrify: Web interface. Limited to source and target cell types that are already cataloged.</td>
<td align="center" valign="top">(<xref rid="b25-mmr-18-02-1225" ref-type="bibr">25</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Okawa <italic>et al</italic>, 2016</td>
<td align="left" valign="top">None available.</td>
<td align="center" valign="top">(<xref rid="b26-mmr-18-02-1225" ref-type="bibr">26</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn3-mmr-18-02-1225"><p>TF, transcription factor.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tIV-mmr-18-02-1225" position="float">
<label>Table IV.</label>
<caption><p>Top 20 genes per method for cardiomyocyte differentiation.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th align="left" valign="bottom" colspan="7">Method</th>
<th/>
</tr>
<tr>
<th/>
<th align="left" valign="bottom" colspan="7"><hr/></th>
<th/>
</tr>
<tr>
<th align="left" valign="bottom">Author, year</th>
<th align="center" valign="bottom">Delta</th>
<th align="center" valign="bottom">t-test</th>
<th align="center" valign="bottom">Rackham</th>
<th align="center" valign="bottom">D&#x0027;Alessio</th>
<th align="center" valign="bottom">Okawa</th>
<th align="center" valign="bottom">Mentions, n</th>
<th align="center" valign="bottom">TF comments (Refs.)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Kamaraj <italic>et al</italic>, 2016</td>
<td align="left" valign="top">TBX20</td>
<td align="left" valign="top">TBX20</td>
<td align="left" valign="top">TBX20</td>
<td align="left" valign="top">ZNF705A</td>
<td align="left" valign="top">GATA4</td>
<td align="left" valign="top">HAND1, 5</td>
<td align="left" valign="top">Computational prediction (<xref rid="b55-mmr-18-02-1225" ref-type="bibr">55</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Ieda <italic>et al</italic>, 2010; Ieda <italic>et al</italic>, 2009;</td>
<td align="left" valign="top">GATA4</td>
<td align="left" valign="top">GATA4</td>
<td align="left" valign="top">GATA4</td>
<td align="left" valign="top">ZNF283</td>
<td align="left" valign="top">TBX20</td>
<td align="left" valign="top">HAND2, 5</td>
<td align="left" valign="top">First described in (<xref rid="b17-mmr-18-02-1225" ref-type="bibr">17</xref>,<xref rid="b53-mmr-18-02-1225" ref-type="bibr">53</xref>), confirmed in mouse models and increased</td>
</tr>
<tr>
<td align="left" valign="top">Addis <italic>et al</italic>, 2013; Chen <italic>et al</italic>, 2015</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td align="left" valign="top">efficiency of CM expression markers (<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>,<xref rid="b52-mmr-18-02-1225" ref-type="bibr">52</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Ieda <italic>et al</italic>, 2010; Ieda <italic>et al</italic>, 2009;</td>
<td align="left" valign="top">HAND1</td>
<td align="left" valign="top">HAND1</td>
<td align="left" valign="top">TBX5</td>
<td align="left" valign="top">ZSCAN4</td>
<td align="left" valign="top">HAND1</td>
<td align="left" valign="top">GATA4, 4</td>
<td align="left" valign="top">Key TF first described in (<xref rid="b17-mmr-18-02-1225" ref-type="bibr">17</xref>,<xref rid="b53-mmr-18-02-1225" ref-type="bibr">53</xref>), confirmed experimentally (<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>&#x2013;<xref rid="b52-mmr-18-02-1225" ref-type="bibr">52</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Addis <italic>et al</italic>, 2013; Chen <italic>et al</italic>, 2015;</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td align="left" valign="top">and computationally (<xref rid="b56-mmr-18-02-1225" ref-type="bibr">56</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Ebrahimi <italic>et al</italic>, 2016</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Ieda <italic>et al</italic>, 2010; Ieda <italic>et al</italic>, 2009;</td>
<td align="left" valign="top">TBX5</td>
<td align="left" valign="top">TBX5</td>
<td align="left" valign="top">GATA6</td>
<td align="left" valign="top">LIN28B</td>
<td align="left" valign="top">TBX5</td>
<td align="left" valign="top">TBX5, 4</td>
<td align="left" valign="top">Key TF first described in (<xref rid="b17-mmr-18-02-1225" ref-type="bibr">17</xref>,<xref rid="b53-mmr-18-02-1225" ref-type="bibr">53</xref>), confirmed experimentally (<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>&#x2013;<xref rid="b52-mmr-18-02-1225" ref-type="bibr">52</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Addis <italic>et al</italic>, 2013; Chen <italic>et al</italic>, 2015;</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td align="left" valign="top">and computationally (<xref rid="b56-mmr-18-02-1225" ref-type="bibr">56</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Ebrahimi <italic>et al</italic>, 2016</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Addis <italic>et al</italic>, 2013; Chen <italic>et al</italic>, 2015</td>
<td align="left" valign="top">HAND2</td>
<td align="left" valign="top">HAND2</td>
<td align="left" valign="top">HAND1</td>
<td align="left" valign="top">HAND2</td>
<td align="left" valign="top">HAND2</td>
<td align="left" valign="top">NKX2.5, 4</td>
<td align="left" valign="top">Increased efficiency of CM expression markers (<xref rid="b50-mmr-18-02-1225" ref-type="bibr">50</xref>,<xref rid="b52-mmr-18-02-1225" ref-type="bibr">52</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Xiang <italic>et al</italic>, 2016; Chakraborty <italic>et al</italic>, 2012</td>
<td align="left" valign="top">ESRRG</td>
<td align="left" valign="top">ESRRG</td>
<td align="left" valign="top">CSDC2</td>
<td align="left" valign="top">HAND1</td>
<td align="left" valign="top">ESRRG</td>
<td align="left" valign="top">TBX20, 4</td>
<td align="left" valign="top">Implicated in CM proliferation and cardiac function in mice (<xref rid="b48-mmr-18-02-1225" ref-type="bibr">48</xref>,<xref rid="b49-mmr-18-02-1225" ref-type="bibr">49</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Fu <italic>et al</italic>, 2013</td>
<td align="left" valign="top">NKX2.5</td>
<td align="left" valign="top">NKX2.5</td>
<td align="left" valign="top">NKX2.5</td>
<td align="left" valign="top">TFDP3</td>
<td align="left" valign="top">CSDC2</td>
<td align="left" valign="top">ESRRG, 4</td>
<td align="left" valign="top">Improved CM phenotype (<xref rid="b51-mmr-18-02-1225" ref-type="bibr">51</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Kamaraj <italic>et al</italic>, 2016</td>
<td align="left" valign="top">CSDC2</td>
<td align="left" valign="top">CSDC2</td>
<td align="left" valign="top">HAND2</td>
<td align="left" valign="top">POU1F1</td>
<td align="left" valign="top">NKX2.5</td>
<td align="left" valign="top">HEY2, 4</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup>Computational prediction (<xref rid="b55-mmr-18-02-1225" ref-type="bibr">55</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Rastegar-Pouyani <italic>et al</italic>, 2017</td>
<td align="left" valign="top">PROX1</td>
<td align="left" valign="top">PROX1</td>
<td align="left" valign="top">ESRRG</td>
<td align="left" valign="top">E2F8</td>
<td align="left" valign="top">PROX1</td>
<td align="left" valign="top">TCF21, 4</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup>Computational prediction in humans (<xref rid="b54-mmr-18-02-1225" ref-type="bibr">54</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Kamaraj <italic>et al</italic>, 2016</td>
<td align="left" valign="top">TCF21</td>
<td align="left" valign="top">TCF21</td>
<td align="left" valign="top">PROX1</td>
<td align="left" valign="top">HNF4G</td>
<td align="left" valign="top">TCF21</td>
<td align="left" valign="top">GATA6, 4</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup>Computational prediction (<xref rid="b55-mmr-18-02-1225" ref-type="bibr">55</xref>)</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">HEY2</td>
<td align="left" valign="top">HEY2</td>
<td align="left" valign="top">HEY2</td>
<td align="left" valign="top">ZNF20</td>
<td align="left" valign="top">HEY2</td>
<td align="left" valign="top">CSDC2, 4</td>
<td align="left" valign="top"><sup><xref rid="tfn5-mmr-18-02-1225" ref-type="table-fn">b</xref></sup>Highly expressed in the heart</td>
</tr>
<tr>
<td align="left" valign="top">Risebro <italic>et al</italic>, 2009</td>
<td align="left" valign="top">GATA6</td>
<td align="left" valign="top">GATA6</td>
<td align="left" valign="top">NPAS2</td>
<td align="left" valign="top">NR1H4</td>
<td align="left" valign="top">GATA6</td>
<td align="left" valign="top">PROX1, 4</td>
<td align="left" valign="top">Muscle structure maintenance (<xref rid="b57-mmr-18-02-1225" ref-type="bibr">57</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Kamaraj <italic>et al</italic>, 2016</td>
<td align="left" valign="top">NR0B2</td>
<td align="left" valign="top">NR0B2</td>
<td align="left" valign="top">TEAD2</td>
<td align="left" valign="top">RFX6</td>
<td align="left" valign="top">NR0B2</td>
<td align="left" valign="top">EBF2, 4</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup>Computational prediction (<xref rid="b55-mmr-18-02-1225" ref-type="bibr">55</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Liu <italic>et al</italic>, 2017</td>
<td align="left" valign="top">EBF2</td>
<td align="left" valign="top">EBF2</td>
<td align="left" valign="top">PPARA</td>
<td align="left" valign="top">CDX4</td>
<td align="left" valign="top">EBF2</td>
<td align="left" valign="top">MEIS2, 4</td>
<td align="left" valign="top">May be important in CM (<xref rid="b58-mmr-18-02-1225" ref-type="bibr">58</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Rastegar-Pouyani <italic>et al</italic>, 2017</td>
<td align="left" valign="top">IRX3</td>
<td align="left" valign="top">IRX3</td>
<td align="left" valign="top">MEIS2</td>
<td align="left" valign="top">ESX1</td>
<td align="left" valign="top">ID4</td>
<td align="left" valign="top">TEAD2, 4</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup>Similar computational prediction (<xref rid="b54-mmr-18-02-1225" ref-type="bibr">54</xref>)</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">ETV1</td>
<td align="left" valign="top">ETV1</td>
<td align="left" valign="top">EBF2</td>
<td align="left" valign="top">ZFP42</td>
<td align="left" valign="top">IRX3</td>
<td align="left" valign="top">EBF3, 3</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup></td>
</tr>
<tr>
<td align="left" valign="top">Shekhar <italic>et al</italic>, 2016</td>
<td align="left" valign="top">MEIS2</td>
<td align="left" valign="top">MEIS2</td>
<td align="left" valign="top">TCF21</td>
<td align="left" valign="top">X.2878</td>
<td align="left" valign="top">ETV1</td>
<td align="left" valign="top">ETV1, 3</td>
<td align="left" valign="top">Involved in rapid impulse conduction (<xref rid="b59-mmr-18-02-1225" ref-type="bibr">59</xref>)</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">TEAD2</td>
<td align="left" valign="top">TEAD2</td>
<td align="left" valign="top">TEAD1</td>
<td align="left" valign="top">SRY</td>
<td align="left" valign="top">MEIS2</td>
<td align="left" valign="top">IRF6, 3</td>
<td align="left" valign="top"><sup><xref rid="tfn4-mmr-18-02-1225" ref-type="table-fn">a</xref></sup></td>
</tr>
<tr>
<td align="left" valign="top">Koizumi <italic>et al</italic>, 2016</td>
<td align="left" valign="top">IRF6</td>
<td align="left" valign="top">IRF6</td>
<td align="left" valign="top">IRX4</td>
<td align="left" valign="top">FOXR2</td>
<td align="left" valign="top">TEAD2</td>
<td align="left" valign="top">IRX3, 3</td>
<td align="left" valign="top">Involved in cardiac rhythm (<xref rid="b60-mmr-18-02-1225" ref-type="bibr">60</xref>)</td>
</tr>
<tr>
<td align="left" valign="top">Nam <italic>et al</italic>, 2014</td>
<td align="left" valign="top">EBF3</td>
<td align="left" valign="top">EBF3</td>
<td align="left" valign="top">EBF3</td>
<td align="left" valign="top">RFX8</td>
<td align="left" valign="top">IRF6</td>
<td align="left" valign="top">NR0B2, 3</td>
<td align="left" valign="top">Involved in cardiac hypertrophy (<xref rid="b61-mmr-18-02-1225" ref-type="bibr">61</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn4-mmr-18-02-1225"><label>a</label><p>TF not tested for differentiation.</p></fn>
<fn id="tfn5-mmr-18-02-1225"><label>b</label><p>GeneCards Human Gene Database, <uri xlink:href="http://www.genecards.org/cgi-bin/carddisp.pl?gene=CSDC2">www.genecards.org/cgi-bin/carddisp.pl?gene=CSDC2</uri>.</p><p>The top 20 genes by each criterion are listed, including those appearing most frequently (Mentions column). TF, transcription factor; CM, cardiomyocyte; HAND1, heart and neural crest derivatives expressed 1; HAND2, heart and neural crest derivatives expressed 2; GATA4, GATA binding protein 4; TBX5, T-box 5; NKX2.5, NK2 homeobox 5; TBX20, T-box 20; ESRRG, estrogen related receptor &#x03B3;; HEY2, hes related family bHLH transcription factor with YRPW motif 2; TCF21, transcription factor 21; GATA6, GATA binding protein 6; CSDC2, cold shock domain containing C2; PROX1, prospero homeobox 1; EBF2, early B cell factor 2; MEIS2, meis homeobox 2; TEAD2, TEA domain transcription factor 2; EBF3, early B cell factor 3; ETV1, ETS variant 1; IRF6, interferon regulatory factor 6; IRX3, iroquois homeobox 3; NR0B2, nuclear receptor subfamily 0 group B member 2.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>