<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">OL</journal-id>
<journal-title-group>
<journal-title>Oncology Letters</journal-title>
</journal-title-group>
<issn pub-type="ppub">1792-1074</issn>
<issn pub-type="epub">1792-1082</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/ol.2024.14832</article-id>
<article-id pub-id-type="publisher-id">OL-29-2-14832</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>A novel algorithm for the detection of microsatellite instability in endometrial cancer using next‑generation sequencing data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Zhou</surname><given-names>Bing</given-names></name>
<xref rid="af1-ol-29-2-14832" ref-type="aff">1</xref>
<xref rid="fn1-ol-29-2-14832" ref-type="author-notes">&#x002A;</xref></contrib>
<contrib contrib-type="author"><name><surname>Wang</surname><given-names>Yu</given-names></name>
<xref rid="af1-ol-29-2-14832" ref-type="aff">1</xref>
<xref rid="fn1-ol-29-2-14832" ref-type="author-notes">&#x002A;</xref></contrib>
<contrib contrib-type="author"><name><surname>Ding</surname><given-names>Lu</given-names></name>
<xref rid="af2-ol-29-2-14832" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Tian</surname><given-names>Xiaolei</given-names></name>
<xref rid="af1-ol-29-2-14832" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Sun</surname><given-names>Wu</given-names></name>
<xref rid="af1-ol-29-2-14832" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Zhang</surname><given-names>Wei</given-names></name>
<xref rid="af1-ol-29-2-14832" ref-type="aff">1</xref>
<xref rid="c1-ol-29-2-14832" ref-type="corresp"/></contrib>
<contrib contrib-type="author"><name><surname>Liu</surname><given-names>Yin-Hua</given-names></name>
<xref rid="af1-ol-29-2-14832" ref-type="aff">1</xref>
<xref rid="c1-ol-29-2-14832" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-ol-29-2-14832"><label>1</label>Department of Pathology, Yijishan Hospital, The First Affiliated Hospital of Wannan Medical College, Wuhu, Anhui 241001, P.R. China</aff>
<aff id="af2-ol-29-2-14832"><label>2</label>Department of Gynecology, Yijishan Hospital, The First Affiliated Hospital of Wannan Medical College, Wuhu, Anhui 241001, P.R. China</aff>
<author-notes>
<corresp id="c1-ol-29-2-14832"><italic>Correspondence to</italic>: Dr Yin-Hua Liu or Dr Wei Zhang, Department of Pathology, Yijishan Hospital, The First Affiliated Hospital of Wannan Medical College, 2 ZheShanXi Road, Wuhu, Anhui 241001, P.R. China, E-mail: <email>liuyinhua@yjsyy.com zhangwei1981@wnmc.edu.cn </email></corresp>
<fn id="fn1-ol-29-2-14832"><label>&#x002A;</label><p>Contributed equally</p></fn></author-notes>
<pub-date pub-type="collection">
<month>02</month>
<year>2025</year></pub-date>
<pub-date pub-type="epub">
<day>03</day>
<month>12</month>
<year>2024</year></pub-date>
<volume>29</volume>
<issue>2</issue>
<elocation-id>86</elocation-id>
<history>
<date date-type="received"><day>02</day><month>07</month><year>2024</year></date>
<date date-type="accepted"><day>05</day><month>11</month><year>2024</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2024 Zhou et al.</copyright-statement>
<copyright-year>2024</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>The molecular-based detection of microsatellite instability (MSI) in endometrial cancer is complex, due to the low sensitivity of PCR and a lack of standardization in next-generation sequencing (NGS) methods. In the present study, sequenced data were obtained from an NGS panel following the addition of five commonly used microsatellite loci. Subsequently, a novel algorithm, namely MSIPeak, was developed for data analysis. Results of the present study demonstrated that MSI data obtained using MSIPeak were presented in a peak, using a threshold of 1.10 to distinguish stable and unstable loci. MSIPeak was further validated using synthetic DNA samples and endometrial cancer tissue and the results were compared with the immunohistochemical analysis-determined mismatch repair status. The PCR results demonstrated a 3-base-pair (bp) deletion in synthetic DNA samples, compared with 1- and 2-bp deletion controls. Results obtained using MSIPeak demonstrated notable differences in peak profiles and positive scores in synthetic DNA samples with 1-, 2- and 3-bp deletions, compared with controls. Thus, the results of the present study demonstrated that NGS-based MSI detection exhibited a higher sensitivity compared with PCR. In addition, NGS-based MSI detection exhibited higher levels of repeatability and applicability compared with other MSI-NGS-based methods, such as MSISensor2 and MANTIS. Collectively, the results of the present study highlighted that the combination of MSIPeak and NGS exhibits potential in the detection of cancer.</p>
</abstract>
<kwd-group>
<kwd>microsatellite loci</kwd>
<kwd>microsatellite instability</kwd>
<kwd>algorithm</kwd>
<kwd>peak</kwd>
<kwd>endometrial cancer</kwd>
</kwd-group>
<funding-group>
<award-group>
<funding-source>Key Project of Wannan Medical College</funding-source>
<award-id>WK2022ZF02</award-id>
</award-group>
<award-group>
<funding-source>Natural Science Foundation of Anhui Provincial Education Department</funding-source>
<award-id>2023AH051779</award-id>
</award-group>
<award-group>
<funding-source>The First Affiliated Hospital/Yijishan Hospital of Wannan Medical College</funding-source>
<award-id>KY27530533</award-id>
<award-id>YR202214</award-id>
<award-id>CX2023018</award-id>
<award-id>GF2019G19</award-id>
</award-group>
<award-group>
<funding-source>Anhui New Era Education Quality Project</funding-source>
<award-id>2022zyxwjxalk158</award-id>
</award-group>
<funding-statement>The present study was supported by the Key Project of Wannan Medical College (grant no. WK2022ZF02), Natural Science Foundation of Anhui Provincial Education Department (grant no. 2023AH051779), The First Affiliated Hospital/Yijishan Hospital of Wannan Medical College (grant nos. KY27530533, YR202214, CX2023018 and GF2019G19) and Anhui New Era Education Quality Project (Postgraduate Education; grant no. 2022zyxwjxalk158).</funding-statement>
</funding-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Microsatellites are short tandem repetitive DNA sequences with repeating units of 1 to 6 bases that are spread throughout the human genome. Notably, replication errors commonly occur in microsatellites during cell division (<xref rid="b1-ol-29-2-14832" ref-type="bibr">1</xref>,<xref rid="b2-ol-29-2-14832" ref-type="bibr">2</xref>). For the maintenance of homeostasis, the majority of replication errors are recognized and repaired by the DNA mismatch repair (MMR) system, which includes the mutL homolog 1 (MLH1), mutS Homolog 2 (MSH2), MSH6 and PMS1 homolog 2, mismatch repair system component (PMS2) proteins. A deficiency in MMR (dMMR) leads to an increase in microsatellite instability (MSI) (<xref rid="b3-ol-29-2-14832" ref-type="bibr">3</xref>). In such a state, MMR protein expression levels reflect the status of MSI. Notably, MSI promotes carcinogenesis and serves a major role in mechanisms underlying malignant transformation (<xref rid="b4-ol-29-2-14832" ref-type="bibr">4</xref>), which is a sensitive indicator of genetic instability in various types of cancer, including endometrial and colorectal cancer (<xref rid="b5-ol-29-2-14832" ref-type="bibr">5</xref>&#x2013;<xref rid="b7-ol-29-2-14832" ref-type="bibr">7</xref>).</p>
<p>At present, MSI is detected in clinical practice using immunohistochemical (IHC) analysis of the impaired DNA MMR proteins, and PCR is used for the analysis of microsatellite sites (<xref rid="b8-ol-29-2-14832" ref-type="bibr">8</xref>). Notably, PCR-based microsatellite analysis is the gold standard for MSI detection, involving the examination of PCR product length in a limited set of informative microsatellite sites (<xref rid="b9-ol-29-2-14832" ref-type="bibr">9</xref>). The Promega Corporation MSI analysis system is one of the most widely used commercial PCR assays, consisting of 5 mononucleotide markers for MSI detection, namely BAT-25, BAT-26, NR-21, NR-24 and MONO-27 (<xref rid="b10-ol-29-2-14832" ref-type="bibr">10</xref>). Although MSI-PCR is widely used in colorectal cancer and other gastrointestinal tumors, MMR-IHC is recommended in endometrial cancer due to the relatively low sensitivity of MSI-PCR (<xref rid="b11-ol-29-2-14832" ref-type="bibr">11</xref>,<xref rid="b12-ol-29-2-14832" ref-type="bibr">12</xref>). Tumors with dMMR often exhibit high MSI (MSI-H) that is detected using DNA-based testing (<xref rid="b13-ol-29-2-14832" ref-type="bibr">13</xref>); however, results of a previous study reported a 1&#x2013;10&#x0025; discrepancy between MMR protein and MSI status in numerous types of cancer (<xref rid="b11-ol-29-2-14832" ref-type="bibr">11</xref>). In addition, previous studies reported high levels of discrepancy between these factors (<xref rid="b6-ol-29-2-14832" ref-type="bibr">6</xref>,<xref rid="b14-ol-29-2-14832" ref-type="bibr">14</xref>). Samples with MSI-H may exhibit MMR proficiency (pMMR) as a result of MMR gene methylation (<xref rid="b15-ol-29-2-14832" ref-type="bibr">15</xref>) and MMR proteins may exhibit abnormal functions with an expected antigen structure (<xref rid="b16-ol-29-2-14832" ref-type="bibr">16</xref>). Notably, MMR may exert effects on factors other than the four common proteins, MLH1, MSH2, MSH6 and PMS2, detected using IHC analysis (<xref rid="b17-ol-29-2-14832" ref-type="bibr">17</xref>). A previous study reported that &#x003E;20&#x0025; of patients with endometrial cancer exhibit dMMR/MSI-H status, and accurate identification of this type is crucial for treatment optimization and the assessment of prognosis. Thus, the use of IHC analysis alone in the detection of dMMR may lead to inaccurate diagnoses of pMMR in patients with MSI-H (<xref rid="b18-ol-29-2-14832" ref-type="bibr">18</xref>). The development of a novel MSI detection method with high levels of sensitivity is required.</p>
<p>Next-generation sequencing (NGS) is used for the comprehensive analysis of genomic profiles and MSI status, and simultaneous analysis may decrease the number of tissue samples required and increase the efficiency of examination. NGS-based algorithms demonstrate a comparable accuracy to PCR-based MSI detection (<xref rid="b19-ol-29-2-14832" ref-type="bibr">19</xref>,<xref rid="b20-ol-29-2-14832" ref-type="bibr">20</xref>). Notably, existing algorithms, such as MSIsensor (<xref rid="b21-ol-29-2-14832" ref-type="bibr">21</xref>) and MANTIS (<xref rid="b22-ol-29-2-14832" ref-type="bibr">22</xref>), measure MSI levels using the read-count distribution of microsatellites with different repeat lengths. The aforementioned algorithms require the analysis of &#x003E;10 (or even &#x2265;40) loci for accurate MSI evaluation (<xref rid="b22-ol-29-2-14832" ref-type="bibr">22</xref>). NGS-based microsatellite testing selects mononucleotide repeats with stable repeat lengths among samples with microsatellite stability (MSS) (<xref rid="b9-ol-29-2-14832" ref-type="bibr">9</xref>,<xref rid="b23-ol-29-2-14832" ref-type="bibr">23</xref>). At present, various loci and numerous methods of MSI detection are used in research, leading to low levels of reliability and a lack of consistency.</p>
<p>In the present study, a novel algorithm was developed for the detection of MSI status, using NGS for the analysis of five mononucleotide repeats, namely BAT-25, BAT-26, NR-21, NR-24 and MONO-27. Notably, the aforementioned loci are often analyzed using MSI-PCR in clinical settings, with the ability to represent the MSI status of a sample. NGS was integrated into the algorithm to improve the sensitivity of the MSI detection, which may lead to improved detection of pMMR in patients with endometrial cancer and MSI-H.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Patients</title>
<p>A total of 181 patients aged 37 to 86 years (median, 56) with endometrial cancer were retrospectively enrolled from the First Affiliated Hospital of Wannan Medical College (Wuhu, China). Inclusion criteria were as: i) Female; pathologically diagnosed as endometrial cancer in the past 3 years; no other malignant tumors nor serious chronic diseases; can be contacted and agree to participate in the project and sign an informed consent form. Exclusion criteria were set as: tissue sample retained in pathology department was too small; tumor cells in the sample was less than 10&#x0025;; patients lost contact or were unwilling to participate in the research project.</p>
<p>These patients were diagnosed with endometrial cancer from April 2021 to November 2022 and tissues were collected. This was performed between November 2022 and June 2023. The present study was approved by the Ethics Committee of the First Affiliated Hospital of Wannan Medical College (approval no. 2022-110) and each patient provided written informed consent for their clinical information as well as their genomics data (from PCR and NGS) to be reported in the journal. Tumor and matched adjacent non-tumor tissues were collected from all patients and the MMR status was verified using IHC analysis of MSH2, MSH6, PMS2 and MLH1 protein expression levels. All IHC results were tested and reported by pathologists in the pathology department of the hospital. Antibodies including MLH1 (cat. no. ZM-0154, ZSGB-bio), MSH2 (clone FE11, cat. no. ZA-0622, ZSGB-bio, China), MSH6 (clone EP49, cat. no. ZA-0541, ZSGB-bio, China), and PMS2 (clone EP51, cat. no. ZA-0542, ZSGB-bio, China), were stained using Dako&#x0027;s automated staining system (LINK48, Dako, CA, USA) with 1:1,000 dilutions. All staining procedures were performed according to the manufacturer&#x0027;s recommendations and previous study (<xref rid="b24-ol-29-2-14832" ref-type="bibr">24</xref>).</p>
<p>Surgically specimens were fixed in 10&#x0025; neutral-buffered formalin for 24&#x2013;72 h at room temperature and embedded in paraffin. DNA was extracted from 10-&#x00B5;m-thick sections of formalin-fixed paraffin-embedded (FFPE) tumor tissue blocks using the GeneRead DNA FFPE kit (Qiagen, GmbH), according to the manufacturer&#x0027;s instructions. Samples were analyzed using MSI-PCR and NGS. After extraction, DNA quality was evaluated by 1&#x0025; agarose gel electrophoresis and the concentration of all samples was quantifed using the Qubit dsDNA HS Assay kit (Termo Fisher Scientifc, Waltham, MA, USA) with a Qubit 3.0 Fluorometer.</p>
</sec>
<sec>
<title>Spike-in samples with synthetic DNA</title>
<p>For each of the five microsatellite loci, namely BAT-25, BAT-26, NR-21, NR-24 and MONO-27, four plasmids were synthesized by Sangon Biotech (China). These included a wild-type fragment and deletions of the wild-type, consisting of 1-, 2- and 3-bp deletions. Plasmids were utilized as spike-in fragments and mixed into the DNA of noncancerous endometrial tissue at a ratio of 1:1.</p>
</sec>
<sec>
<title>MSI-PCR analysis</title>
<p>MSI-PCR analysis was performed using the MSI Analysis System (Promega Corporation) (<xref rid="b25-ol-29-2-14832" ref-type="bibr">25</xref>) as previously described (<xref rid="b26-ol-29-2-14832" ref-type="bibr">26</xref>,<xref rid="b27-ol-29-2-14832" ref-type="bibr">27</xref>). Briefly, the five microsatellite loci, namely BAT-25, BAT-26, NR-21, NR-24 and MONO-27, and two pentanucleotide repeats PENTAC and PENTAD, were amplified in a single multiplex 25 &#x00B5;l PCR reaction. PENTAC and PENTAD were used as reference genes to detect potential contamination. Fluorescently labeled primers used for MSI-PCR analysis are supplied by Sangon Biotech and listed in <xref rid="SD1-ol-29-2-14832" ref-type="supplementary-material">Table SI</xref>. The following thermocycling conditions were used for the PCR: Initial denaturation at 95&#x00B0;C for 11 min and 96&#x00B0;C for 1 min; 10 cycles of 94&#x00B0;C for 30 sec, ramp 68 sec to 58&#x00B0;C, hold for 30 sec, ramp 50 sec to 70&#x00B0;C and hold for 1 min; 20 cycles at 90&#x00B0;C for 30 sec, ramp 60 sec to 58&#x00B0;C, hold for 30 sec, ramp 50 sec to 70&#x00B0;C and hold for 1 min; 60&#x00B0;C for 30 min; hold at 4&#x00B0;C. PCR products were analyzed using a 3500 Genetic Analyzer (Thermo Fisher Scientific, Inc.). GeneMapper 6.0 (Thermo Fisher Scientific, Inc.) was used to determine the size differences between tumor samples and adjacent tissues. A tumor was defined as exhibiting MSI-H if &#x2265;2 markers were unstable, and MSS was defined according to the presence of &#x2264;1 unstable mononucleotide markers in the tumor sample. The term &#x2018;unstable&#x2019; was used for markers with a shift of &#x2265;2 bp, or if the shoulder pattern extended the range of the smallest peak by &#x2265;2 bp in the tumor allele.</p>
</sec>
<sec>
<title>MSI detection using NGS</title>
<p>NGS was performed on a NextSeq 500 or Novaseq 6000 (Illumina, Inc.) using a custom amplicon-based gene panel that comprised five microsatellite loci included in the Promega Corporation MSI kit. Initially, libraries were generated with the Hieff NGS&#x2122; OnePot Pro DNA Library Prep Kit (Shanghai Yeasen Biotechnology Co., Ltd.) according to the manufacturer&#x0027;s protocol. Briefly, 20 ng fragmented genomic DNA was used to amplify the target regions and amplified products were purified (<xref rid="SD2-ol-29-2-14832" ref-type="supplementary-material">Table SII</xref>). Subsequent rounds of PCR were carried out through the addition of sequencing adapters and barcodes to amplicons. Following the purification of the library, quantification of the DNA library was performed using Labchip GX Touch (PerkinElmer). The libraries with 1 pM concentration were then sequenced using the Novaseq 6000 NGS (Illumina, Inc.) platforms and NovaSeq 6000 SP reagent kit (100 cycles; cat. no. 2002746; Illumina Inc.), according to the manufacturer&#x0027;s instructions using 2X150 bp paired-end reads at an average depth of 5,000&#x00D7; for tissue.</p>
<p>A novel algorithm, MSIPeak, was developed in the present study to determine the MSI status of all samples using NGS read-count distribution. MSIPeak program flow was divided into four main steps, as follows (<xref rid="f1-ol-29-2-14832" ref-type="fig">Fig. 1</xref>).</p>
<p><italic>Step I</italic>: The sequencing data of each tumor tissue and matched adjacent tissues were read in FastQ format files. For each MSI locus, reads coverage information, including reads count, was extracted from FastQ files.</p>
<p><italic>Step II</italic>: Minimum-maximum normalization was performed on the reads count of each microsatellite locus. Values were scaled to the range (0,1) for subsequent data processing.</p>
<disp-formula>
<alternatives>
<mml:math id="umml1" display="block"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mi>e</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mo>min</mml:mo></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mo>max</mml:mo></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mtext>min</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g01.tif"/>
</alternatives>
</disp-formula>
<p>Here, i represents a single microsatellite locus, x<sub>i</sub> represents the reads count prior to normalization, x<sub>new</sub> represents the reads count value following normalization, and x<sub>min</sub> and x<sub>max</sub> represent the minimum and maximum values of the reads count for each locus, respectively.</p>
<p>Reads count values were smoothed using the sliding window. Following normalization and smoothing, peak data of the microsatellite loci of tumor and matched adjacent tissues were analyzed. The local maximum values of each repeat were compared with the values of neighboring points.</p>
<p><italic>Step III</italic>: For each peak determined in the tumor and adjacent tissues, peak shift size, peak area difference and Shannon coefficient difference were calculated to score the MSI status of each locus (<xref rid="f2-ol-29-2-14832" ref-type="fig">Fig. 2</xref>):</p>
<disp-formula>
<alternatives>
<mml:math id="umml2" display="block"><mml:mrow><mml:mi>s</mml:mi><mml:mi>h</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>t</mml:mi><mml:mo>_</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>z</mml:mi><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mi>a</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>l</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>l</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g02.tif"/>
</alternatives>
</disp-formula>
<p>Here, i represents a single microsatellite locus, l<sub>x</sub> represents the peak value of the microsatellite locus in the tumor sample and l<sub>y</sub> represents the peak value of the microsatellite locus in the matched adjacent tissue.</p>
<disp-formula>
<alternatives>
<mml:math id="umml3" display="block"><mml:mrow><mml:mi>H</mml:mi><mml:mo>_</mml:mo><mml:mi>d</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>f</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:msqrt></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g03.tif"/>
</alternatives>
</disp-formula>
<p>Here, H_diff represents the area difference of each microsatellite locus peak, &#x00D7; and y refer to the vectors of area values of MSI loci in tumor and adjacent tissues, respectively, i represents a single microsatellite locus, and x<sub>i</sub> and y<sub>i</sub> represent the area values of the ith MSI locus in tumor and adjacent tissues, respectively.</p>
<disp-formula>
<alternatives>
<mml:math id="umml4" display="block"><mml:mrow><mml:mi>H</mml:mi><mml:mo>=</mml:mo><mml:mi>a</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g05.tif"/>
</alternatives>
</disp-formula>
<disp-formula>
<alternatives>
<mml:math id="umml5" display="block"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi>&#x2009;</mml:mi><mml:mo>.</mml:mo><mml:mi>&#x2009;</mml:mi><mml:mtext>ln</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>]</mml:mo></mml:mrow></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g06.tif"/>
</alternatives>
</disp-formula>
<disp-formula>
<alternatives>
<mml:math id="umml6" display="block"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msubsup><mml:mi>p</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x0027;</mml:mo></mml:msubsup><mml:mi>&#x2009;</mml:mi><mml:mo>&#x00B7;</mml:mo><mml:mi>&#x2009;</mml:mi><mml:mtext>ln</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:msubsup><mml:mi>p</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x0027;</mml:mo></mml:msubsup><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>]</mml:mo></mml:mrow></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g07.tif"/>
</alternatives>
</disp-formula>
<p>Here, H represents the Shannon coefficient difference between tumor and adjacent tissues. H<sub>x</sub> represents the Shannon-Wiener diversity index of the tumor sample, H<sub>y</sub> represents the Shannon-Wiener diversity index of the adjacent sample, i represents a single microsatellite locus, p<sub>i</sub> represents the relative abundance of the ith microsatellite locus in the tumor sample and p&#x0027;<sub>i</sub> represents the relative abundance of the ith microsatellite locus in the adjacent sample.</p>
<p><italic>Step IV</italic>: The final score for each MSI locus was calculated using the following equation:</p>
<disp-formula>
<alternatives>
<mml:math id="umml7" display="block"><mml:mrow><mml:mi>s</mml:mi><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>h</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>t</mml:mi><mml:mo>_</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>z</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mtext>max</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:mi>H</mml:mi><mml:mo>_</mml:mo><mml:mi>d</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>f</mml:mi><mml:mo>,</mml:mo><mml:mi>H</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:math>
<graphic xlink:href="ol-29-02-14832-g08.tif"/>
</alternatives>
</disp-formula>
<p>When the score was &#x2265;1.10, the MSI status of this locus was considered unstable. After the stability of all five markers had been determined, the MSI status of the patient was evaluated. Samples with two or more unstable markers were considered MSI-H and samples with &#x003C;2 unstable markers were considered MSS.</p>
</sec>
<sec>
<title>Comparison of MSIPeak with MSIsensor and MANTIS</title>
<p>Among previously published NGS-based MSI studies, MSIsensor (<xref rid="b21-ol-29-2-14832" ref-type="bibr">21</xref>) and MANTIS (<xref rid="b22-ol-29-2-14832" ref-type="bibr">22</xref>) were widely used analytical methods (<xref rid="b2-ol-29-2-14832" ref-type="bibr">2</xref>,<xref rid="b5-ol-29-2-14832" ref-type="bibr">5</xref>,<xref rid="b28-ol-29-2-14832" ref-type="bibr">28</xref>,<xref rid="b29-ol-29-2-14832" ref-type="bibr">29</xref>). The calculation principles of these two algorithms are markedly different from MSIPeak (<xref rid="tI-ol-29-2-14832" ref-type="table">Table I</xref>). Therefore, MSIPeak was compared with the MSIsensor and MANTIS algorithms. MSISensor2 (<xref rid="b30-ol-29-2-14832" ref-type="bibr">30</xref>), an upgraded version of MSIsensor, and Mantis were run according to their manuscript, for the analysis of in-house whole-exome sequencing (WES) data from 25 endometrial cancer samples. The WES library was constructed using the commercial Hi-Exon 35 Panel and supporting library construction kit (cat. no. P10016-96, Shanghai HeYin Biotechnology Co., LTd.). A total of 50 ng fragmented genomic DNA was used for a capture-based library (<xref rid="SD3-ol-29-2-14832" ref-type="supplementary-material">Table SIII</xref>) according to the manufacturer&#x0027;s protocol of the library construction kit. After the quantification of the DNA library by the Labchip GX Touch (PerkinElmer), WES library with 1 pM concentration was performed on the same Novaseq 6000 NGS (Illumina, Inc.) platforms and NovaSeq 6000 SP reagent kit (100 cycles; cat. no. 2002746; Illumina Inc.), according to the manufacturer&#x0027;s instructions using 2&#x00D7;150 bp paired-end reads at an average depth of 150&#x00D7;. To obtain clean reads, FASTQ files from tumor tissue and white blood samples were done by fastp (<uri xlink:href="https://github.com/OpenGene/fastp">https://github.com/OpenGene/fastp</uri>, version 0.19.3). Clean reads were mapped to the reference genome (hg38/GRCh38) by Burrows-Wheeler aligner (BWA, <uri xlink:href="https://github.com/lh3/bwa">https://github.com/lh3/bwa</uri>, version 0.7.12-r1039) and perform alignment processing by SAMtools (https://github.com/samtools/samtools, version 0.1.19&#x2013;96b5f2294a). The quality score was recalibrated using GATK (https://github.com/broadinstitute/gatk, version 4.1.0.0) to generate the final binary SAM (BAM) files used for subsequent analyses. Lastly MSI status was detected using the MANTIS (version v1.0.5) (<xref rid="b22-ol-29-2-14832" ref-type="bibr">22</xref>) and MSISensor2 (Version 0.1) (<xref rid="b30-ol-29-2-14832" ref-type="bibr">30</xref>).</p>
</sec>
<sec>
<title>Statistical analysis</title>
<p>The chi-square test was used to compare the frequencies of MSI-H and MSS tumors identified through PCR and NGS, with the dMMR and pMMR status determined by IHC. For chi-square test analysis, P&#x003C;0.001 was considered to indicate a statistically significant difference in PCR and NGS in ability to detect MSI-H. Cohen&#x0027;s &#x03BA; was calculated to evaluate the level of agreement between IHC-based and molecular-based methods, PCR and NGS. A Cohen&#x0027;s &#x03BA; of P&#x003C;0.001 was considered to indicate a statistically significant difference between methods. All data presented in figures and tables are reported as percentages for categorical comparisons. P&#x003C;0.05 was considered to indicate a statistically significant difference. The statistical analyses were performed using R software (version 4.3.2; RStudio).</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>MSI-PCR and MSI-NGS using synthetic DNA samples</title>
<p>The present study demonstrated that spike-in DNA sample profiles with 3-bp deletions displayed notable left-shifts, which could be distinguished from those of the wild-type fragments (<xref rid="f3-ol-29-2-14832" ref-type="fig">Fig. 3A</xref>). Spike-in DNA sample profiles with 2-bp deletions exhibited ambiguous extensions on the left shoulder of BAT25 and NR24, compared with the wild-type fragments. However, there were no notable shifts in BAT26, NR21 and MONO27 with 2-bp deletions or in the five markers with 1-bp deletions (<xref rid="f3-ol-29-2-14832" ref-type="fig">Fig. 3A and B</xref>).</p>
<p>Results obtained using MSI-NGS are presented as peaks, which were comparable with those obtained using MSI-PCR (<xref rid="f3-ol-29-2-14832" ref-type="fig">Fig. 3C</xref>). Peaks of the spike-in DNA samples with 1-bp deletions exhibited subtle shifts compared with those of the wild-type fragments. However, the score was more than the threshold of 1.10 (<xref rid="f3-ol-29-2-14832" ref-type="fig">Fig. 3D</xref>). Spike-in DNA samples with 1- or 2-bp deletions exhibited shifts and scores that were indicative of MSI (<xref rid="f3-ol-29-2-14832" ref-type="fig">Fig. 3C and D</xref>).</p>
</sec>
<sec>
<title>MMR/MSI detection in FFPE samples</title>
<p>A total of 39 endometrial cancer samples were identified as dMMR and the remaining 142 samples were identified as pMMR (<xref rid="tII-ol-29-2-14832" ref-type="table">Table II</xref>). Within the 39 dMMR samples, 16 were classified as MSI-H using PCR testing and 36 were identified as MSI-H using NGS (<xref rid="tII-ol-29-2-14832" ref-type="table">Table II</xref>). The concordance between IHC and NGS was significantly higher compared with that between IHC and PCR (Cohen&#x0027;s &#x03BA;=0.492 vs. 0.872; P&#x003C;0.001; <xref rid="tII-ol-29-2-14832" ref-type="table">Table II</xref>). All 16 dMMR/MSI-H samples confirmed using IHC and PCR were also defined as MSI-H using NGS (<xref rid="f4-ol-29-2-14832" ref-type="fig">Fig. 4</xref>; <xref rid="SD4-ol-29-2-14832" ref-type="supplementary-material">Tables SIV</xref> and <xref rid="SD5-ol-29-2-14832" ref-type="supplementary-material">SV</xref>). In addition, a further 20 MSI-H samples were identified using NGS alone, with the profiles exhibiting minor shifts that did not meet the criteria for MSI-H based on PCR analysis (data not shown).</p>
</sec>
<sec>
<title>Comparison with MSISensor2 and MANTIS</title>
<p>Among the 25 samples with available WES data, 8 dMMR and 17 pMMR cases were identified using IHC analysis. MSIPeak, MSISensor2 and MANTIS consistently classified the 17 pMMR samples as MSS (<xref rid="f5-ol-29-2-14832" ref-type="fig">Fig. 5A</xref>). However, MSIPeak was the only algorithm to identify all 8 dMMR samples as MSI-H, while MSISensor2 and MANTIS classified 2 and 3 dMMR samples as MSS, respectively (<xref rid="f5-ol-29-2-14832" ref-type="fig">Fig. 5A and B</xref>; <xref rid="SD6-ol-29-2-14832" ref-type="supplementary-material">Table SVI</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>In endometrial cancer, MMR detection using IHC analysis has been recommended over MSI detection using PCR. Notably, results obtained using MSI and MMR exhibited low levels of concordance in gynecologic tumors compared with gastrointestinal tumors (<xref rid="b11-ol-29-2-14832" ref-type="bibr">11</xref>). The subtle leftward shifts in endometrial cancer were 1&#x2013;3 bp, whereas those observed in colorectal cancer were &#x003E;6 bp (<xref rid="b31-ol-29-2-14832" ref-type="bibr">31</xref>,<xref rid="b32-ol-29-2-14832" ref-type="bibr">32</xref>). The results of the present study obtained using MSI-PCR showed that synthetic DNA fragments with 1&#x2013;2 bp differences displayed ambiguous shifts that could not be distinguished from the matched adjacent tissue samples. In addition, results obtained using MSI-PCR demonstrated that numerous samples could not be classified as MSI based on the shifts of their peaks. These ambiguous shifts imply that endometrial cancer samples with 1&#x2013;2 bp shifts cannot be differentiated from MSS samples, contributing to the low concordance between MSI and MMR in endometrial cancer.</p>
<p>Limitations of IHC analysis for the detection of MMR (<xref rid="b15-ol-29-2-14832" ref-type="bibr">15</xref>&#x2013;<xref rid="b18-ol-29-2-14832" ref-type="bibr">18</xref>,<xref rid="b33-ol-29-2-14832" ref-type="bibr">33</xref>) have led to the requirement for detecting specific microsatellite repeats. Thus, numerous NGS-based MSI detection methods have been introduced (<xref rid="b20-ol-29-2-14832" ref-type="bibr">20</xref>,<xref rid="b34-ol-29-2-14832" ref-type="bibr">34</xref>&#x2013;<xref rid="b37-ol-29-2-14832" ref-type="bibr">37</xref>), and these have detected a greater number of microsatellite markers compared with the 5 to 6 markers detected using PCR. Microsatellite markers analyzed using NGS technology have varied among studies and have only been demonstrated in specific cohorts or tumor types. However, the five markers in the Promega Corporation system (<xref rid="b8-ol-29-2-14832" ref-type="bibr">8</xref>) have been widely used in clinical practice for a number of tumor types (<xref rid="b3-ol-29-2-14832" ref-type="bibr">3</xref>&#x2013;<xref rid="b7-ol-29-2-14832" ref-type="bibr">7</xref>). Notably, these markers are used to represent the status of MMR proteins. In the present study, all samples defined as MSI-H using MSI-PCR were also defined as MSI-H using MSI-NGS. These results suggested that the novel algorithm developed in the present study exhibits the capability to identify relatively large shifts in endometrial cancer samples. Samples with IHC analysis-verified dMMR and PCR-verified MSS were further categorized into two groups according to shift using NGS combined with MSIPeak. These results suggest that the novel algorithm may exhibit potential in the identification of samples with sublet shifts.</p>
<p>MSIPeak uses the same markers as PCR, but levels of sensitivity are improved compared with PCR. Potential reasons include that, first, the interpretation of PCR results relies on the analysis of capillary electrophoresis patterns, which has a certain degree of subjectivity. Independent investigators may interpret electrophoresis patterns differently, potentially resulting in poor reproducibility of the results. In cases where endometrial MSI is offset by 1&#x2013;3 bp, errors may occur (<xref rid="b32-ol-29-2-14832" ref-type="bibr">32</xref>). By contrast, MSIPeak performs minimum-maximum normalization and data smoothing during the data preprocessing, which may reduce the impact of sequencing depth and data fluctuations on MSI detection. Furthermore, MSIPeak analyzes the differences in peak values between tumor samples and matched adjacent tissues from multiple dimensions, including the peak shift, peak area difference and Shannon coefficient difference (<xref rid="b38-ol-29-2-14832" ref-type="bibr">38</xref>,<xref rid="b39-ol-29-2-14832" ref-type="bibr">39</xref>). Thus, the MSI status was comprehensively evaluated to provide more accurate detection results.</p>
<p>MSIPeak sorts loci from small to large based on the distribution of different ploymer repetitions at each locus. Subsequently, the peak of each locus is identified, and shift size, area difference and Shannon-Wiener diversity index are evaluated in tumor and adjacent tissues (<xref rid="b38-ol-29-2-14832" ref-type="bibr">38</xref>,<xref rid="b39-ol-29-2-14832" ref-type="bibr">39</xref>). Results of the present study demonstrated that MSIPeak exhibited higher levels of accuracy compared with alternate NGS-based algorithms, such as MSISensor2 and MANTIS. However, further analyses using a larger number of samples are required to verify the results. Notably, MSISensor2 and MANTIS are designed to be performed using microsatellite loci across the entire genome or exon ranges (<xref rid="b21-ol-29-2-14832" ref-type="bibr">21</xref>,<xref rid="b22-ol-29-2-14832" ref-type="bibr">22</xref>,<xref rid="b30-ol-29-2-14832" ref-type="bibr">30</xref>). Loci derived from different batches may vary, which may affect the results. In addition, MSISensor2 and MANTIS have been extensively applied in the context of colorectal cancer (<xref rid="b2-ol-29-2-14832" ref-type="bibr">2</xref>,<xref rid="b5-ol-29-2-14832" ref-type="bibr">5</xref>,<xref rid="b28-ol-29-2-14832" ref-type="bibr">28</xref>,<xref rid="b29-ol-29-2-14832" ref-type="bibr">29</xref>); however, these algorithms are not widely used in endometrial cancer. Certain parameters and/or thresholds of these algorithms may require further refinement for effective MSI detection in endometrial cancer.</p>
<p>MSIPeak demonstrated high levels of reproducibility and adaptability for MSI detection in endometrial cancer. The five common loci detected using MSIPeak are small in size, and these can be integrated into other NGS sequencing panels, using associated amplicons for amplicon-based panels or associated probes for capture-based panels. Notably, this integration does not require specifically designed sequencing panels, and is inexpensive compared with WES or whole genome sequencing. Thus, NGS-based MSI detection exhibits potential in patient diagnosis, with high levels of flexibility and cost effectiveness.</p>
<p>In conclusion, a novel algorithm was developed for the detection of MSI in the present study, namely MSIPeak. This algorithm was designed to detect only five commonly used microsatellite loci, allowing it to be easily integrated into existing NGS panels, which could thereby lead to potential reductions in experimental costs. Results obtained using MSIPeak are presented in peak form for intuitive and convenient identification, which is comparable with MSI-PCR. However, MSIPeak demonstrated higher levels of accuracy and objectivity compared with PCR. In addition, MSIPeak may exhibit potential in detecting MSI in endometrial cancer. Further investigations with increased sample sizes are required to validate the present results and to explore the utility of this algorithm in other types of cancer, such as colorectal cancer. Future investigations should focus on refining and developing a widely applicable NGS-based MSI detection algorithm that could be effectively used across various types of cancer.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="SD1-ol-29-2-14832" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="pdf" xlink:href="Supplementary_Data1.pdf"/>
</supplementary-material>
<supplementary-material id="SD2-ol-29-2-14832" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data2.xlsx"/>
</supplementary-material>
<supplementary-material id="SD3-ol-29-2-14832" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data3.xlsx"/>
</supplementary-material>
<supplementary-material id="SD4-ol-29-2-14832" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data4.xlsx"/>
</supplementary-material>
<supplementary-material id="SD5-ol-29-2-14832" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data5.xlsx"/>
</supplementary-material>
<supplementary-material id="SD6-ol-29-2-14832" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data6.xlsx"/>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec sec-type="data-availability">
<title>Availability of data and materials</title>
<p>The data generated in the present study may be found in The National Center for Biotechnology Information Sequence Read Archive repository under accession number PRJNA1100268 or at the following URL: <uri xlink:href="https://www.ncbi.nlm.nih.gov/sra/PRJNA1100268">https://www.ncbi.nlm.nih.gov/sra/PRJNA1100268</uri>.</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>BZ, YW, LD, XT, WS, WZ and YL acquired the data. BZ and YW carried out the molecular experiments and drafted the manuscript. LD analyzed and interpreted the data. XT and WS performed histological examination of the tissue samples. WZ and YL designed the research aims and acquired financial support for the project. BZ and YL confirm the authenticity of all the raw data. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>The present study was approved by the Ethics Committee of the First Affiliated Hospital of Wannan Medical College (approval no. 2022-110). Each patient provided written informed consent to participate.</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>MSI-H</term><def><p>high microsatellite instability</p></def></def-item>
<def-item><term>MSS</term><def><p>microsatellite stability</p></def></def-item>
<def-item><term>PCR</term><def><p>polymerase chain reaction</p></def></def-item>
<def-item><term>NGS</term><def><p>next-generation sequencing</p></def></def-item>
<def-item><term>IHC</term><def><p>immunohistochemistry</p></def></def-item>
<def-item><term>MMR</term><def><p>mismatch repair</p></def></def-item>
<def-item><term>dMMR</term><def><p>mismatch repair deficiency</p></def></def-item>
<def-item><term>pMMR</term><def><p>MMR proficiency</p></def></def-item>
<def-item><term>FFPE</term><def><p>formalin-fixed paraffin-embedded</p></def></def-item>
<def-item><term>WES</term><def><p>whole-exome sequencing</p></def></def-item>
<def-item><term>WGS</term><def><p>whole-genome sequencing</p></def></def-item>
<def-item><term>Bp</term><def><p>base pair</p></def></def-item>
</def-list>
</glossary>
<ref-list>
<title>References</title>
<ref id="b1-ol-29-2-14832"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lynch</surname><given-names>HT</given-names></name><name><surname>Snyder</surname><given-names>CL</given-names></name><name><surname>Shaw</surname><given-names>TG</given-names></name><name><surname>Heinen</surname><given-names>CD</given-names></name><name><surname>Hitchins</surname><given-names>MP</given-names></name></person-group><article-title>Milestones of lynch syndrome: 1895&#x2013;2015</article-title><source>Nat Rev Cancer</source><volume>15</volume><fpage>181</fpage><lpage>194</lpage><year>2015</year><pub-id pub-id-type="doi">10.1038/nrc3878</pub-id><pub-id pub-id-type="pmid">25673086</pub-id></element-citation></ref>
<ref id="b2-ol-29-2-14832"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>K</given-names></name><name><surname>Luo</surname><given-names>H</given-names></name><name><surname>Huang</surname><given-names>L</given-names></name><name><surname>Luo</surname><given-names>H</given-names></name><name><surname>Zhu</surname><given-names>X</given-names></name></person-group><article-title>Microsatellite instability: A review of what the oncologist should know</article-title><source>Cancer Cell Int</source><volume>20</volume><fpage>16</fpage><year>2020</year><pub-id pub-id-type="doi">10.1186/s12935-019-1091-8</pub-id><pub-id pub-id-type="pmid">31956294</pub-id></element-citation></ref>
<ref id="b3-ol-29-2-14832"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sinicrope</surname><given-names>FA</given-names></name><name><surname>Sargent</surname><given-names>DJ</given-names></name></person-group><article-title>Molecular pathways: Microsatellite instability in colorectal cancer: prognostic, predictive, and therapeutic implications</article-title><source>Clin Cancer Res</source><volume>18</volume><fpage>1506</fpage><lpage>1512</lpage><year>2012</year><pub-id pub-id-type="doi">10.1158/1078-0432.CCR-11-1469</pub-id><pub-id pub-id-type="pmid">22302899</pub-id></element-citation></ref>
<ref id="b4-ol-29-2-14832"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Aaltonen</surname><given-names>LA</given-names></name><name><surname>Peltom&#x00E4;ki</surname><given-names>P</given-names></name><name><surname>Leach</surname><given-names>FS</given-names></name><name><surname>Sistonen</surname><given-names>P</given-names></name><name><surname>Pylkk&#x00E4;nen</surname><given-names>L</given-names></name><name><surname>Mecklin</surname><given-names>JP</given-names></name><name><surname>J&#x00E4;rvinen</surname><given-names>H</given-names></name><name><surname>Powell</surname><given-names>SM</given-names></name><name><surname>Jen</surname><given-names>J</given-names></name><name><surname>Hamilton</surname><given-names>SR</given-names></name><etal/></person-group><article-title>Clues to the pathogenesis of familial colorectal cancer</article-title><source>Science</source><volume>260</volume><fpage>812</fpage><lpage>816</lpage><year>1993</year><pub-id pub-id-type="doi">10.1126/science.8484121</pub-id><pub-id pub-id-type="pmid">8484121</pub-id></element-citation></ref>
<ref id="b5-ol-29-2-14832"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bonneville</surname><given-names>R</given-names></name><name><surname>Krook</surname><given-names>MA</given-names></name><name><surname>Kautto</surname><given-names>EA</given-names></name><name><surname>Miya</surname><given-names>J</given-names></name><name><surname>Wing</surname><given-names>MR</given-names></name><name><surname>Chen</surname><given-names>HZ</given-names></name><name><surname>Reeser</surname><given-names>JW</given-names></name><name><surname>Yu</surname><given-names>L</given-names></name><name><surname>Roychowdhury</surname><given-names>S</given-names></name></person-group><article-title>Landscape of microsatellite instability across 39 cancer types</article-title><source>JCO Precis Oncol</source><year>2017</year><comment>PO.17.00073</comment><year>2017</year><pub-id pub-id-type="doi">10.1200/PO.17.00073</pub-id><pub-id pub-id-type="pmid">29850653</pub-id></element-citation></ref>
<ref id="b6-ol-29-2-14832"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>N&#x00E1;dorv&#x00E1;ri</surname><given-names>ML</given-names></name><name><surname>Kenessey</surname><given-names>I</given-names></name><name><surname>Kiss</surname><given-names>A</given-names></name><name><surname>Barbai</surname><given-names>T</given-names></name><name><surname>Kulka</surname><given-names>J</given-names></name><name><surname>R&#x00E1;s&#x00F3;</surname><given-names>E</given-names></name><name><surname>T&#x00ED;m&#x00E1;r</surname><given-names>J</given-names></name></person-group><article-title>Comparison of standard mismatch repair deficiency and microsatellite instability tests in a large cancer series</article-title><source>J Transl Med</source><volume>22</volume><fpage>150</fpage><year>2024</year><pub-id pub-id-type="doi">10.1186/s12967-024-04960-y</pub-id><pub-id pub-id-type="pmid">38350968</pub-id></element-citation></ref>
<ref id="b7-ol-29-2-14832"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chung</surname><given-names>Y</given-names></name><name><surname>Nam</surname><given-names>SK</given-names></name><name><surname>Chang</surname><given-names>HE</given-names></name><name><surname>Lee</surname><given-names>C</given-names></name><name><surname>Kang</surname><given-names>GH</given-names></name><name><surname>Lee</surname><given-names>HS</given-names></name><name><surname>Park</surname><given-names>KU</given-names></name></person-group><article-title>Evaluation of an eight marker-panel including long mononucleotide repeat markers to detect microsatellite instability in colorectal, gastric, and endometrial cancers</article-title><source>BMC Cancer</source><volume>23</volume><fpage>1100</fpage><year>2023</year><pub-id pub-id-type="doi">10.1186/s12885-023-11607-6</pub-id><pub-id pub-id-type="pmid">37953261</pub-id></element-citation></ref>
<ref id="b8-ol-29-2-14832"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McConechy</surname><given-names>MK</given-names></name><name><surname>Talhouk</surname><given-names>A</given-names></name><name><surname>Li-Chang</surname><given-names>HH</given-names></name><name><surname>Leung</surname><given-names>S</given-names></name><name><surname>Huntsman</surname><given-names>DG</given-names></name><name><surname>Gilks</surname><given-names>CB</given-names></name><name><surname>McAlpine</surname><given-names>JN</given-names></name></person-group><article-title>Detection of DNA mismatch repair (MMR) deficiencies by immunohistochemistry can effectively diagnose the microsatellite instability (MSI) phenotype in endometrial carcinomas</article-title><source>Gynecol Oncol</source><volume>137</volume><fpage>306</fpage><lpage>310</lpage><year>2015</year><pub-id pub-id-type="doi">10.1016/j.ygyno.2015.01.541</pub-id><pub-id pub-id-type="pmid">25636458</pub-id></element-citation></ref>
<ref id="b9-ol-29-2-14832"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Boyarskikh</surname><given-names>U</given-names></name><name><surname>Kechin</surname><given-names>A</given-names></name><name><surname>Khrapov</surname><given-names>E</given-names></name><name><surname>Fedyanin</surname><given-names>M</given-names></name><name><surname>Raskin</surname><given-names>G</given-names></name><name><surname>Mukhina</surname><given-names>M</given-names></name><name><surname>Kravtsova</surname><given-names>E</given-names></name><name><surname>Tsukanov</surname><given-names>A</given-names></name><name><surname>Achkasov</surname><given-names>S</given-names></name><name><surname>Filipenko</surname><given-names>M</given-names></name></person-group><article-title>Detecting microsatellite instability in endometrial, colon, and stomach cancers using targeted NGS</article-title><source>Cancers (Basel)</source><volume>15</volume><fpage>5065</fpage><year>2023</year><pub-id pub-id-type="doi">10.3390/cancers15205065</pub-id><pub-id pub-id-type="pmid">37894432</pub-id></element-citation></ref>
<ref id="b10-ol-29-2-14832"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname><given-names>KM</given-names></name><name><surname>Zhang</surname><given-names>S</given-names></name><name><surname>Geiger</surname><given-names>T</given-names></name><name><surname>Hafez</surname><given-names>MJ</given-names></name><name><surname>Bacher</surname><given-names>J</given-names></name><name><surname>Berg</surname><given-names>KD</given-names></name><name><surname>Eshleman</surname><given-names>JR</given-names></name></person-group><article-title>Comparison of the microsatellite instability analysis system and the Bethesda panel for the determination of microsatellite instability in colorectal cancers</article-title><source>J Mol Diagn</source><volume>8</volume><fpage>305</fpage><lpage>311</lpage><year>2006</year><pub-id pub-id-type="doi">10.2353/jmoldx.2006.050092</pub-id><pub-id pub-id-type="pmid">16825502</pub-id></element-citation></ref>
<ref id="b11-ol-29-2-14832"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bartley</surname><given-names>AN</given-names></name><name><surname>Mills</surname><given-names>AM</given-names></name><name><surname>Konnick</surname><given-names>E</given-names></name><name><surname>Overman</surname><given-names>M</given-names></name><name><surname>Ventura</surname><given-names>CB</given-names></name><name><surname>Souter</surname><given-names>L</given-names></name><name><surname>Colasacco</surname><given-names>C</given-names></name><name><surname>Stadler</surname><given-names>ZK</given-names></name><name><surname>Kerr</surname><given-names>S</given-names></name><name><surname>Howitt</surname><given-names>BE</given-names></name><etal/></person-group><article-title>Mismatch repair and microsatellite instability testing for immune checkpoint inhibitor therapy: Guideline from the college of American pathologists in collaboration with the association for molecular pathology and fight colorectal cancer</article-title><source>Arch Pathol Lab Med</source><volume>146</volume><fpage>1194</fpage><lpage>1210</lpage><year>2022</year><pub-id pub-id-type="doi">10.5858/arpa.2021-0632-CP</pub-id><pub-id pub-id-type="pmid">35920830</pub-id></element-citation></ref>
<ref id="b12-ol-29-2-14832"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dedeurwaerdere</surname><given-names>F</given-names></name><name><surname>Claes</surname><given-names>KB</given-names></name><name><surname>Van Dorpe</surname><given-names>J</given-names></name><name><surname>Rottiers</surname><given-names>I</given-names></name><name><surname>Van der Meulen</surname><given-names>J</given-names></name><name><surname>Breyne</surname><given-names>J</given-names></name><name><surname>Swaerts</surname><given-names>K</given-names></name><name><surname>Martens</surname><given-names>G</given-names></name></person-group><article-title>Comparison of microsatellite instability detection by immunohistochemistry and molecular techniques in colorectal and endometrial cancer</article-title><source>Sci Rep</source><volume>11</volume><fpage>12880</fpage><year>2021</year><pub-id pub-id-type="doi">10.1038/s41598-021-91974-x</pub-id><pub-id pub-id-type="pmid">34145315</pub-id></element-citation></ref>
<ref id="b13-ol-29-2-14832"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lindor</surname><given-names>NM</given-names></name><name><surname>Burgart</surname><given-names>LJ</given-names></name><name><surname>Leontovich</surname><given-names>O</given-names></name><name><surname>Goldberg</surname><given-names>RM</given-names></name><name><surname>Cunningham</surname><given-names>JM</given-names></name><name><surname>Sargent</surname><given-names>DJ</given-names></name><name><surname>Walsh-Vockley</surname><given-names>C</given-names></name><name><surname>Petersen</surname><given-names>GM</given-names></name><name><surname>Walsh</surname><given-names>MD</given-names></name><name><surname>Leggett</surname><given-names>BA</given-names></name><etal/></person-group><article-title>Immunohistochemistry versus microsatellite instability testing in phenotyping colorectal tumors</article-title><source>J Clin Oncol</source><volume>20</volume><fpage>1043</fpage><lpage>1048</lpage><year>2002</year><pub-id pub-id-type="doi">10.1200/JCO.2002.20.4.1043</pub-id><pub-id pub-id-type="pmid">11844828</pub-id></element-citation></ref>
<ref id="b14-ol-29-2-14832"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lorenzi</surname><given-names>M</given-names></name><name><surname>Amonkar</surname><given-names>M</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name><name><surname>Mehta</surname><given-names>S</given-names></name><name><surname>Liaw</surname><given-names>KL</given-names></name></person-group><article-title>Epidemiology of microsatellite instability high (MSI-H) and deficient mismatch repair (dMMR) in solid tumors: A structured literature review</article-title><source>J Oncol</source><volume>2020</volume><fpage>1807929</fpage><year>2020</year><pub-id pub-id-type="doi">10.1155/2020/1807929</pub-id></element-citation></ref>
<ref id="b15-ol-29-2-14832"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McCarthy</surname><given-names>AJ</given-names></name><name><surname>Capo-Chichi</surname><given-names>JM</given-names></name><name><surname>Spence</surname><given-names>T</given-names></name><name><surname>Grenier</surname><given-names>S</given-names></name><name><surname>Stockley</surname><given-names>T</given-names></name><name><surname>Kamel-Reid</surname><given-names>S</given-names></name><name><surname>Serra</surname><given-names>S</given-names></name><name><surname>Sabatini</surname><given-names>P</given-names></name><name><surname>Chetty</surname><given-names>R</given-names></name></person-group><article-title>Heterogenous loss of mismatch repair (MMR) protein expression: A challenge for immunohistochemical interpretation and microsatellite instability (MSI) evaluation</article-title><source>J Pathol Clin Res</source><volume>5</volume><fpage>115</fpage><lpage>129</lpage><year>2019</year><pub-id pub-id-type="doi">10.1002/cjp2.120</pub-id><pub-id pub-id-type="pmid">30387329</pub-id></element-citation></ref>
<ref id="b16-ol-29-2-14832"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stelloo</surname><given-names>E</given-names></name><name><surname>Jansen</surname><given-names>AML</given-names></name><name><surname>Osse</surname><given-names>EM</given-names></name><name><surname>Nout</surname><given-names>RA</given-names></name><name><surname>Creutzberg</surname><given-names>CL</given-names></name><name><surname>Ruano</surname><given-names>D</given-names></name><name><surname>Church</surname><given-names>DN</given-names></name><name><surname>Morreau</surname><given-names>H</given-names></name><name><surname>Smit</surname><given-names>VTHBM</given-names></name><name><surname>van Wezel</surname><given-names>T</given-names></name><name><surname>Bosse</surname><given-names>T</given-names></name></person-group><article-title>Practical guidance for mismatch repair-deficiency testing in endometrial cancer</article-title><source>Ann Oncol</source><volume>28</volume><fpage>96</fpage><lpage>102</lpage><year>2017</year><pub-id pub-id-type="doi">10.1093/annonc/mdw542</pub-id><pub-id pub-id-type="pmid">27742654</pub-id></element-citation></ref>
<ref id="b17-ol-29-2-14832"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shia</surname><given-names>J</given-names></name></person-group><article-title>Immunohistochemistry versus microsatellite instability testing for screening colorectal cancer patients at risk for hereditary nonpolyposis colorectal cancer syndrome. Part I. The utility of immunohistochemistry</article-title><source>J Mol Diagn</source><volume>10</volume><fpage>293</fpage><lpage>300</lpage><year>2008</year><pub-id pub-id-type="doi">10.2353/jmoldx.2008.080031</pub-id><pub-id pub-id-type="pmid">18556767</pub-id></element-citation></ref>
<ref id="b18-ol-29-2-14832"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hechtman</surname><given-names>JF</given-names></name><name><surname>Rana</surname><given-names>S</given-names></name><name><surname>Middha</surname><given-names>S</given-names></name><name><surname>Stadler</surname><given-names>ZK</given-names></name><name><surname>Latham</surname><given-names>A</given-names></name><name><surname>Benayed</surname><given-names>R</given-names></name><name><surname>Soslow</surname><given-names>R</given-names></name><name><surname>Ladanyi</surname><given-names>M</given-names></name><name><surname>Yaeger</surname><given-names>R</given-names></name><name><surname>Zehir</surname><given-names>A</given-names></name><name><surname>Shia</surname><given-names>J</given-names></name></person-group><article-title>Retained mismatch repair protein expression occurs in approximately 6&#x0025; of microsatellite instability-high cancers and is associated with missense mutations in mismatch repair genes</article-title><source>Mod Pathol</source><volume>33</volume><fpage>871</fpage><lpage>879</lpage><year>2020</year><pub-id pub-id-type="doi">10.1038/s41379-019-0414-6</pub-id><pub-id pub-id-type="pmid">31857677</pub-id></element-citation></ref>
<ref id="b19-ol-29-2-14832"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>T</given-names></name><name><surname>Chen</surname><given-names>L</given-names></name><name><surname>Guo</surname><given-names>J</given-names></name><name><surname>Zhang</surname><given-names>M</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Cao</surname><given-names>S</given-names></name><name><surname>Lou</surname><given-names>F</given-names></name><name><surname>Wang</surname><given-names>H</given-names></name></person-group><article-title>MSIFinder: A python package for detecting MSI status using random forest classifier</article-title><source>BMC Bioinformatics</source><volume>22</volume><fpage>185</fpage><year>2021</year><pub-id pub-id-type="doi">10.1186/s12859-021-03986-z</pub-id><pub-id pub-id-type="pmid">33845765</pub-id></element-citation></ref>
<ref id="b20-ol-29-2-14832"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Salipante</surname><given-names>SJ</given-names></name><name><surname>Scroggins</surname><given-names>SM</given-names></name><name><surname>Hampel</surname><given-names>HL</given-names></name><name><surname>Turner</surname><given-names>EH</given-names></name><name><surname>Pritchard</surname><given-names>CC</given-names></name></person-group><article-title>Microsatellite instability detection by next generation sequencing</article-title><source>Clin Chem</source><volume>60</volume><fpage>1192</fpage><lpage>1199</lpage><year>2014</year><pub-id pub-id-type="doi">10.1373/clinchem.2014.223677</pub-id><pub-id pub-id-type="pmid">24987110</pub-id></element-citation></ref>
<ref id="b21-ol-29-2-14832"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Niu</surname><given-names>B</given-names></name><name><surname>Ye</surname><given-names>K</given-names></name><name><surname>Zhang</surname><given-names>Q</given-names></name><name><surname>Lu</surname><given-names>C</given-names></name><name><surname>Xie</surname><given-names>M</given-names></name><name><surname>McLellan</surname><given-names>MD</given-names></name><name><surname>Wendl</surname><given-names>MC</given-names></name><name><surname>Ding</surname><given-names>L</given-names></name></person-group><article-title>MSIsensor: Microsatellite instability detection using paired tumor-normal sequence data</article-title><source>Bioinformatics</source><volume>30</volume><fpage>1015</fpage><lpage>1016</lpage><year>2014</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btt755</pub-id><pub-id pub-id-type="pmid">24371154</pub-id></element-citation></ref>
<ref id="b22-ol-29-2-14832"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kautto</surname><given-names>EA</given-names></name><name><surname>Bonneville</surname><given-names>R</given-names></name><name><surname>Miya</surname><given-names>J</given-names></name><name><surname>Yu</surname><given-names>L</given-names></name><name><surname>Krook</surname><given-names>MA</given-names></name><name><surname>Reeser</surname><given-names>JW</given-names></name><name><surname>Roychowdhury</surname><given-names>S</given-names></name></person-group><article-title>Performance evaluation for rapid detection of pan-cancer microsatellite instability with MANTIS</article-title><source>Oncotarget</source><volume>8</volume><fpage>7452</fpage><lpage>7463</lpage><year>2017</year><pub-id pub-id-type="doi">10.18632/oncotarget.13918</pub-id><pub-id pub-id-type="pmid">27980218</pub-id></element-citation></ref>
<ref id="b23-ol-29-2-14832"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname><given-names>L</given-names></name><name><surname>Huang</surname><given-names>Y</given-names></name><name><surname>Fang</surname><given-names>X</given-names></name><name><surname>Liu</surname><given-names>C</given-names></name><name><surname>Deng</surname><given-names>W</given-names></name><name><surname>Zhong</surname><given-names>C</given-names></name><name><surname>Xu</surname><given-names>J</given-names></name><name><surname>Xu</surname><given-names>D</given-names></name><name><surname>Yuan</surname><given-names>Y</given-names></name></person-group><article-title>A novel and reliable method to detect microsatellite instability in colorectal cancer by next-generation sequencing</article-title><source>J Mol Diagn</source><volume>20</volume><fpage>225</fpage><lpage>231</lpage><year>2018</year><pub-id pub-id-type="doi">10.1016/j.jmoldx.2017.11.007</pub-id><pub-id pub-id-type="pmid">29277635</pub-id></element-citation></ref>
<ref id="b24-ol-29-2-14832"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yoshida</surname><given-names>H</given-names></name><name><surname>Takigawa</surname><given-names>W</given-names></name><name><surname>Kobayashi-Kato</surname><given-names>M</given-names></name><name><surname>Nishikawa</surname><given-names>T</given-names></name><name><surname>Shiraishi</surname><given-names>K</given-names></name><name><surname>Ishikawa</surname><given-names>M</given-names></name></person-group><article-title>Mismatch repair protein expression in endometrial cancer: Assessing concordance and unveiling pitfalls in two different immunohistochemistry assays</article-title><source>J Pers Med</source><volume>13</volume><fpage>1260</fpage><year>2023</year><pub-id pub-id-type="doi">10.3390/jpm13081260</pub-id><pub-id pub-id-type="pmid">37623510</pub-id></element-citation></ref>
<ref id="b25-ol-29-2-14832"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bacher</surname><given-names>JW</given-names></name><name><surname>Flanagan</surname><given-names>LA</given-names></name><name><surname>Smalley</surname><given-names>RL</given-names></name><name><surname>Nassif</surname><given-names>NA</given-names></name><name><surname>Burgart</surname><given-names>LJ</given-names></name><name><surname>Halberg</surname><given-names>RB</given-names></name><name><surname>Megid</surname><given-names>WM</given-names></name><name><surname>Thibodeau</surname><given-names>SN</given-names></name></person-group><article-title>Development of a fluorescent multiplex assay for detection of MSI-high tumors</article-title><source>Dis Markers</source><volume>20</volume><fpage>237</fpage><lpage>250</lpage><year>2004</year><pub-id pub-id-type="doi">10.1155/2004/136734</pub-id><pub-id pub-id-type="pmid">15528789</pub-id></element-citation></ref>
<ref id="b26-ol-29-2-14832"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nakagomi</surname><given-names>T</given-names></name><name><surname>Goto</surname><given-names>T</given-names></name><name><surname>Hirotsu</surname><given-names>Y</given-names></name><name><surname>Shikata</surname><given-names>D</given-names></name><name><surname>Yokoyama</surname><given-names>Y</given-names></name><name><surname>Higuchi</surname><given-names>R</given-names></name><name><surname>Amemiya</surname><given-names>K</given-names></name><name><surname>Okimoto</surname><given-names>K</given-names></name><name><surname>Oyama</surname><given-names>T</given-names></name><name><surname>Mochizuki</surname><given-names>H</given-names></name><name><surname>Omata</surname><given-names>M</given-names></name></person-group><article-title>New therapeutic targets for pulmonary sarcomatoid carcinomas based on their genomic and phylogenetic profiles</article-title><source>Oncotarget</source><volume>9</volume><fpage>10635</fpage><lpage>10649</lpage><year>2018</year><pub-id pub-id-type="doi">10.18632/oncotarget.24365</pub-id><pub-id pub-id-type="pmid">29535832</pub-id></element-citation></ref>
<ref id="b27-ol-29-2-14832"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Takaoka</surname><given-names>S</given-names></name><name><surname>Hirotsu</surname><given-names>Y</given-names></name><name><surname>Ohyama</surname><given-names>H</given-names></name><name><surname>Mochizuki</surname><given-names>H</given-names></name><name><surname>Amemiya</surname><given-names>K</given-names></name><name><surname>Oyama</surname><given-names>T</given-names></name><name><surname>Ashizawa</surname><given-names>H</given-names></name><name><surname>Yoshimura</surname><given-names>D</given-names></name><name><surname>Nakagomi</surname><given-names>K</given-names></name><name><surname>Hosoda</surname><given-names>K</given-names></name><etal/></person-group><article-title>Molecular subtype switching in early-stage gastric cancers with multiple occurrences</article-title><source>J Gastroenterol</source><volume>54</volume><fpage>674</fpage><lpage>686</lpage><year>2019</year><pub-id pub-id-type="doi">10.1007/s00535-019-01547-z</pub-id><pub-id pub-id-type="pmid">30666417</pub-id></element-citation></ref>
<ref id="b28-ol-29-2-14832"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Johansen</surname><given-names>AFB</given-names></name><name><surname>Kassentoft</surname><given-names>CG</given-names></name><name><surname>Knudsen</surname><given-names>M</given-names></name><name><surname>Laursen</surname><given-names>MB</given-names></name><name><surname>Madsen</surname><given-names>AH</given-names></name><name><surname>Iversen</surname><given-names>LH</given-names></name><name><surname>Sunesen</surname><given-names>KG</given-names></name><name><surname>Rasmussen</surname><given-names>MH</given-names></name><name><surname>Andersen</surname><given-names>CL</given-names></name></person-group><article-title>Validation of computational determination of microsatellite status using whole exome sequencing data from colorectal cancer patients</article-title><source>BMC Cancer</source><volume>19</volume><fpage>971</fpage><year>2019</year><pub-id pub-id-type="doi">10.1186/s12885-019-6227-7</pub-id><pub-id pub-id-type="pmid">31638937</pub-id></element-citation></ref>
<ref id="b29-ol-29-2-14832"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname><given-names>F</given-names></name><name><surname>Makrigiorgos</surname><given-names>A</given-names></name><name><surname>Leong</surname><given-names>KW</given-names></name><name><surname>Makrigiorgos</surname><given-names>GM</given-names></name></person-group><article-title>Sensitive detection of microsatellite instability in tissues and liquid biopsies: Recent developments and updates</article-title><source>Comput Struct Biotechnol J</source><volume>19</volume><fpage>4931</fpage><lpage>4940</lpage><year>2021</year><pub-id pub-id-type="doi">10.1016/j.csbj.2021.08.037</pub-id><pub-id pub-id-type="pmid">34527197</pub-id></element-citation></ref>
<ref id="b30-ol-29-2-14832"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jia</surname><given-names>P</given-names></name><name><surname>Yang</surname><given-names>X</given-names></name><name><surname>Guo</surname><given-names>L</given-names></name><name><surname>Liu</surname><given-names>B</given-names></name><name><surname>Lin</surname><given-names>J</given-names></name><name><surname>Liang</surname><given-names>H</given-names></name><name><surname>Sun</surname><given-names>J</given-names></name><name><surname>Zhang</surname><given-names>C</given-names></name><name><surname>Ye</surname><given-names>K</given-names></name></person-group><article-title>MSIsensor-pro: Fast, accurate, and matched-normal-sample-free detection of microsatellite instability</article-title><source>Genomics Proteomics Bioinformatics</source><volume>18</volume><fpage>65</fpage><lpage>71</lpage><year>2020</year><pub-id pub-id-type="doi">10.1016/j.gpb.2020.02.001</pub-id><pub-id pub-id-type="pmid">32171661</pub-id></element-citation></ref>
<ref id="b31-ol-29-2-14832"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>Y</given-names></name><name><surname>Shi</surname><given-names>C</given-names></name><name><surname>Eisenberg</surname><given-names>R</given-names></name><name><surname>Vnencak-Jones</surname><given-names>CL</given-names></name></person-group><article-title>Differences in microsatellite instability profiles between endometrioid and colorectal cancers: A potential cause for false-negative results?</article-title><source>J Mol Diagn</source><volume>19</volume><fpage>57</fpage><lpage>64</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.jmoldx.2016.07.008</pub-id><pub-id pub-id-type="pmid">27810331</pub-id></element-citation></ref>
<ref id="b32-ol-29-2-14832"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>X</given-names></name><name><surname>Snir</surname><given-names>O</given-names></name><name><surname>Rottmann</surname><given-names>D</given-names></name><name><surname>Wong</surname><given-names>S</given-names></name><name><surname>Buza</surname><given-names>N</given-names></name><name><surname>Hui</surname><given-names>P</given-names></name></person-group><article-title>Minimal microsatellite shift in microsatellite instability high endometrial cancer: A significant pitfall in diagnostic interpretation</article-title><source>Mod Pathol</source><volume>32</volume><fpage>650</fpage><lpage>658</lpage><year>2019</year><pub-id pub-id-type="doi">10.1038/s41379-018-0179-3</pub-id><pub-id pub-id-type="pmid">30443012</pub-id></element-citation></ref>
<ref id="b33-ol-29-2-14832"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname><given-names>WCC</given-names></name><name><surname>Nerurkar</surname><given-names>SN</given-names></name><name><surname>Cai</surname><given-names>HY</given-names></name><name><surname>Ng</surname><given-names>HHM</given-names></name><name><surname>Wu</surname><given-names>D</given-names></name><name><surname>Wee</surname><given-names>YTF</given-names></name><name><surname>Lim</surname><given-names>JCT</given-names></name><name><surname>Yeong</surname><given-names>J</given-names></name><name><surname>Lim</surname><given-names>TKH</given-names></name></person-group><article-title>Overview of multiplex immunohistochemistry/immunofluorescence techniques in the era of cancer immunotherapy</article-title><source>Cancer Commun (Lond)</source><volume>40</volume><fpage>135</fpage><lpage>153</lpage><year>2020</year><pub-id pub-id-type="doi">10.1002/cac2.12023</pub-id><pub-id pub-id-type="pmid">32301585</pub-id></element-citation></ref>
<ref id="b34-ol-29-2-14832"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ali</surname><given-names>AS</given-names></name><name><surname>Alalem</surname><given-names>LS</given-names></name></person-group><article-title>Next-generation sequencing and immunohistochemistry approaches for microsatellite instability detection in endometrial cancer</article-title><source>Cell Mol Biol (Noisy-le-grand)</source><volume>69</volume><fpage>237</fpage><lpage>242</lpage><year>2023</year><pub-id pub-id-type="doi">10.14715/cmb/2023.69.14.40</pub-id><pub-id pub-id-type="pmid">38279425</pub-id></element-citation></ref>
<ref id="b35-ol-29-2-14832"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bonneville</surname><given-names>R</given-names></name><name><surname>Krook</surname><given-names>MA</given-names></name><name><surname>Chen</surname><given-names>HZ</given-names></name><name><surname>Smith</surname><given-names>A</given-names></name><name><surname>Samorodnitsky</surname><given-names>E</given-names></name><name><surname>Wing</surname><given-names>MR</given-names></name><name><surname>Reeser</surname><given-names>JW</given-names></name><name><surname>Roychowdhury</surname><given-names>S</given-names></name></person-group><article-title>Detection of microsatellite instability biomarkers via next-generation sequencing</article-title><source>Methods Mol Biol</source><volume>2055</volume><fpage>119</fpage><lpage>132</lpage><year>2020</year><pub-id pub-id-type="doi">10.1007/978-1-4939-9773-2_5</pub-id><pub-id pub-id-type="pmid">31502149</pub-id></element-citation></ref>
<ref id="b36-ol-29-2-14832"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Evrard</surname><given-names>C</given-names></name><name><surname>Cortes</surname><given-names>U</given-names></name><name><surname>Ndiaye</surname><given-names>B</given-names></name><name><surname>Bonnemort</surname><given-names>J</given-names></name><name><surname>Martel</surname><given-names>M</given-names></name><name><surname>Aguillon</surname><given-names>R</given-names></name><name><surname>Tougeron</surname><given-names>D</given-names></name><name><surname>Karayan-Tapon</surname><given-names>L</given-names></name></person-group><article-title>An innovative and accurate next-generation sequencing-based microsatellite instability detection method for colorectal and endometrial tumors</article-title><source>Lab Invest</source><volume>104</volume><fpage>100297</fpage><year>2024</year><pub-id pub-id-type="doi">10.1016/j.labinv.2023.100297</pub-id><pub-id pub-id-type="pmid">38008183</pub-id></element-citation></ref>
<ref id="b37-ol-29-2-14832"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pang</surname><given-names>J</given-names></name><name><surname>Gindin</surname><given-names>T</given-names></name><name><surname>Mansukhani</surname><given-names>M</given-names></name><name><surname>Fernandes</surname><given-names>H</given-names></name><name><surname>Hsiao</surname><given-names>S</given-names></name></person-group><article-title>Microsatellite instability detection using a large next-generation sequencing cancer panel across diverse tumour types</article-title><source>J Clin Pathol</source><volume>73</volume><fpage>83</fpage><lpage>89</lpage><year>2020</year><pub-id pub-id-type="doi">10.1136/jclinpath-2019-206136</pub-id><pub-id pub-id-type="pmid">31530574</pub-id></element-citation></ref>
<ref id="b38-ol-29-2-14832"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Felinger</surname><given-names>A</given-names></name></person-group><article-title>8 Peak detection</article-title><source>Data Handl Sci Technol</source><volume>21</volume><fpage>183</fpage><lpage>190</lpage><year>1998</year></element-citation></ref>
<ref id="b39-ol-29-2-14832"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Spellerberg</surname><given-names>IF</given-names></name><name><surname>Fedor</surname><given-names>PJ</given-names></name></person-group><article-title>A tribute to claude shannon (1916&#x2013;2001) and a plea for more rigorous use of species richness, species diversity and the &#x2018;shannon-wiener&#x2019; index</article-title><source>Glob Ecol Biogeogr</source><volume>12</volume><fpage>177</fpage><lpage>179</lpage><year>2003</year><pub-id pub-id-type="doi">10.1046/j.1466-822X.2003.00015.x</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-ol-29-2-14832" position="float">
<label>Figure 1.</label>
<caption><p>Program flow of MSIPeak for each microsatellite locus in each sample. MSI, microsatellite instability.</p></caption>
<graphic xlink:href="ol-29-02-14832-g00.tif"/>
</fig>
<fig id="f2-ol-29-2-14832" position="float">
<label>Figure 2.</label>
<caption><p>Schematic of microsatellite positioning point parameters calculation, including shift_size, H_diff and H.</p></caption>
<graphic xlink:href="ol-29-02-14832-g04.tif"/>
</fig>
<fig id="f3-ol-29-2-14832" position="float">
<label>Figure 3.</label>
<caption><p>MSI detection using spike-in DNA samples. (A) MSI-PCR detection of spike-in DNA samples with wild-type fragments (control), 1-, 2- and 3-bp deletions, from top to bottom. The polymorphic pentanucleotide repeat markers PENTAC and PENTAD were used as quality controls for sample authentication. (B) Fragment sizes of peaks for each microsatellite locus in spike-in DNA samples. (C) MSI-NGS detection of spike-in DNA samples with 1-, 2- and 3-bp deletions, displayed from top to bottom. The red line represents wild-type samples and the blue line represents spike-in samples with 1&#x2013;3-bp deletions. Polymer, number of base repeats size of each microsatellite locus; normalized-value, minimum-maximum normalization value of each microsatellite locus at each polymer. (D) MSIPeak score of each microsatellite locus in spike-in DNA samples. MSI, microsatellite instability; bp, base pair; NGS, next-generation sequencing.</p></caption>
<graphic xlink:href="ol-29-02-14832-g09.tif"/>
</fig>
<fig id="f4-ol-29-2-14832" position="float">
<label>Figure 4.</label>
<caption><p>Venn diagram of dMMR/MSI-H samples defined using IHC analysis, MSI-PCR and MSI-NGS. dMMR, mismatch repair deficiency; MSI, microsatellite instability; MSI-H, high MSI; IHC, immunohistochemical; NGS, next-generation sequencing; MMR, mismatch repair; dMMR, MMR deficiency.</p></caption>
<graphic xlink:href="ol-29-02-14832-g10.tif"/>
</fig>
<fig id="f5-ol-29-2-14832" position="float">
<label>Figure 5.</label>
<caption><p>Accuracy of MSIPeak was compared with that of MSISensor2 and MANTIS. (A) Bar chart presenting cases where MMR and MSI status were detected using different methods. (B) Venn diagram presenting the consistency between the different detection methods in all samples. The accuracy of the three algorithms was assessed and compared based on results obtained from MMR-IHC. MMR, mismatch repair; MSI, microsatellite instability; IHC, immunohistochemical.</p></caption>
<graphic xlink:href="ol-29-02-14832-g11.tif"/>
</fig>
<table-wrap id="tI-ol-29-2-14832" position="float">
<label>Table I.</label>
<caption><p>Comparison of MSIPeak with the published algorithms based on next-generation sequencing data.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Parameters</th>
<th align="center" valign="bottom">MSIPeak</th>
<th align="center" valign="bottom">MSISensor</th>
<th align="center" valign="bottom">MANTIS</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">No. of loci</td>
<td align="left" valign="top">Five</td>
<td align="left" valign="top">Tens to thousands</td>
<td align="left" valign="top">Dozens to thousands</td>
</tr>
<tr>
<td align="left" valign="top">Origin of the loci</td>
<td align="left" valign="top">Fixed</td>
<td align="left" valign="top">Genome-wide or target screening</td>
<td align="left" valign="top">Genome wide or target screening</td>
</tr>
<tr>
<td align="left" valign="top">Data preprocessing</td>
<td align="left" valign="top">Obtainment of coverage information of loci, and perform normalization and data smoothing processing.</td>
<td align="left" valign="top">Calculation of the coverage of each locus without mentioning normalization and data smoothing steps.</td>
<td align="left" valign="top">Calculation of the coverage of each locus and data normalization.</td>
</tr>
<tr>
<td align="left" valign="top">Comparison between tumor and normal samples</td>
<td align="left" valign="top">Peak shift, Peak area difference and Shannon coefficient. difference</td>
<td align="left" valign="top">Number of repetitions and allele distribution for each locus.</td>
<td align="left" valign="top">The repeat length distribution and stability level of each locus.</td>
</tr>
<tr>
<td align="left" valign="top">Scoring criteria of each locus</td>
<td align="left" valign="top">The final score of each MSI locus is obtained by the peak shift, peak area difference and Shannon coefficient difference. Loci with a score &#x2265;1.10 are rated as unstable.</td>
<td align="left" valign="top">Calculation of the proportion of unstable positioning points, and if the proportion exceeds a threshold, it is rated as unstable. The threshold is determined by the cumulative distribution of this indicator on a set of samples.</td>
<td align="left" valign="top">The average L1 norm of all loci is the MSI score of the sample. If the score exceeds the threshold, it is rated as unstable.</td>
</tr>
<tr>
<td align="left" valign="top">Criteria of MSI</td>
<td align="left" valign="top">&#x2265;2 of 5 loci are unstable</td>
<td align="left" valign="top">Default 20&#x0025;</td>
<td align="left" valign="top">Default 0.4</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-ol-29-2-14832"><p>MSI, microsatellite instability.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tII-ol-29-2-14832" position="float">
<label>Table II.</label>
<caption><p>Concordance between MMR-IHC and PCR- or NGS-based methods.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom" colspan="4">A, Tumors with dMMR identified by IHC technology</th>
</tr>
<tr>
<th align="left" valign="bottom" colspan="4"><hr/></th>
</tr>
<tr>
<th align="left" valign="bottom">Type</th>
<th align="center" valign="bottom">PCR, n (&#x0025;)</th>
<th align="center" valign="bottom">NGS, n (&#x0025;)</th>
<th align="center" valign="bottom">Chi-square P-value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">MSI-H</td>
<td align="center" valign="top">16 (8.84)</td>
<td align="center" valign="top">36 (19.89)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">MSS</td>
<td align="center" valign="top">23 (12.71)</td>
<td align="center" valign="top">3 (1.66)</td>
<td align="center" valign="top">0.444</td>
</tr>
<tr>
<td align="left" valign="top" colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" colspan="4"><bold>B, Tumors with pMMR identified by IHC technology</bold></td>
</tr>
<tr>
<td align="left" valign="top" colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Type</bold></td>
<td align="center" valign="top"><bold>PCR, n (&#x0025;)</bold></td>
<td align="center" valign="top"><bold>NGS, n (&#x0025;)</bold></td>
<td align="center" valign="top"><bold>Chi-square P-value</bold></td>
</tr>
<tr>
<td align="left" valign="top" colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">MSI-H</td>
<td align="center" valign="top">2 (1.10)</td>
<td align="center" valign="top">5 (2.76)</td>
<td align="center" valign="top">&#x003C;0.001</td>
</tr>
<tr>
<td align="left" valign="top">MSS</td>
<td align="center" valign="top">140 (77.35)</td>
<td align="center" valign="top">137 (75.69)</td>
<td align="center" valign="top">0.251</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-ol-29-2-14832"><p>MSI, microsatellite instability; MSI-H, high MSI; MSS, microsatellite stability; MMR, mismatch repair; dMMR, MMR deficiency; IHC, immunohistochemical; NGS, next-generation sequencing. The concordance between IHC and NGS was significantly higher compared with that between IHC and PCR (Cohen&#x0027;s &#x03BA;=0.492 vs. 0.872; P&#x003C;0.001).</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>
