<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Molecular Medicine Reports</journal-id>
<journal-title-group>
<journal-title>Molecular Medicine Reports</journal-title>
</journal-title-group>
<issn pub-type="ppub">1791-2997</issn>
<issn pub-type="epub">1791-3004</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/mmr.2019.9896</article-id>
<article-id pub-id-type="publisher-id">mmr-19-04-2837</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Simultaneous detection of target CNVs and SNVs of thalassemia by multiplex PCR and next-generation sequencing</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Fan</surname><given-names>Dong-Mei</given-names></name>
<xref rid="af1-mmr-19-04-2837" ref-type="aff">1</xref>
<xref rid="fn1-mmr-19-04-2837" ref-type="author-notes">&#x002A;</xref></contrib>
<contrib contrib-type="author"><name><surname>Yang</surname><given-names>Xu</given-names></name>
<xref rid="af2-mmr-19-04-2837" ref-type="aff">2</xref>
<xref rid="fn1-mmr-19-04-2837" ref-type="author-notes">&#x002A;</xref></contrib>
<contrib contrib-type="author"><name><surname>Huang</surname><given-names>Li-Min</given-names></name>
<xref rid="af1-mmr-19-04-2837" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Ouyang</surname><given-names>Guo-Jun</given-names></name>
<xref rid="af3-mmr-19-04-2837" ref-type="aff">3</xref></contrib>
<contrib contrib-type="author"><name><surname>Yang</surname><given-names>Xue-Xi</given-names></name>
<xref rid="af1-mmr-19-04-2837" ref-type="aff">1</xref>
<xref rid="c1-mmr-19-04-2837" ref-type="corresp"/></contrib>
<contrib contrib-type="author"><name><surname>Li</surname><given-names>Ming</given-names></name>
<xref rid="af1-mmr-19-04-2837" ref-type="aff">1</xref>
<xref rid="c1-mmr-19-04-2837" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-mmr-19-04-2837"><label>1</label>Institute of Antibody Engineering, School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou, Guangdong 510515, P.R. China</aff>
<aff id="af2-mmr-19-04-2837"><label>2</label>Clinical Innovation and Research Center, Shenzhen Hospital of Southern Medical University, Shenzhen, Guangdong 518110, P.R. China</aff>
<aff id="af3-mmr-19-04-2837"><label>3</label>Guangzhou Darui Biotechnology Co., Ltd., Guangzhou, Guangdong 510663, P.R. China</aff>
<author-notes>
<corresp id="c1-mmr-19-04-2837"><italic>Correspondence to</italic>: Dr Xue-Xi Yang or Dr Ming Li, Institute of Antibody Engineering, School of Laboratory Medicine and Biotechnology, Southern Medical University, 1838 North Guangzhou Road, Guangzhou, Guangdong 510515, P.R. China, E-mail: <email>yxxzb@sohu.com</email>, E-mail: <email>13318868107@126.com</email></corresp>
<fn id="fn1-mmr-19-04-2837"><label>&#x002A;</label><p>Contributed equally</p></fn>
</author-notes>
<pub-date pub-type="ppub"><month>04</month><year>2019</year></pub-date>
<pub-date pub-type="epub"><day>24</day><month>01</month><year>2019</year></pub-date>
<volume>19</volume>
<issue>4</issue>
<fpage>2837</fpage>
<lpage>2848</lpage>
<history>
<date date-type="received"><day>16</day><month>05</month><year>2018</year></date>
<date date-type="accepted"><day>03</day><month>12</month><year>2018</year></date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2019, Spandidos Publications</copyright-statement>
<copyright-year>2019</copyright-year>
</permissions>
<abstract>
<p>Thalassemia is caused by complex mechanisms, including copy number variants (CNVs) and single nucleotide variants (SNVs). The CNV types of &#x03B1;-thalassemia are typically detected by gap-polymerase chain reaction (PCR), whereas the SNV types are detected by Sanger sequencing. In the present study, a novel method was developed that simultaneously detects CNVs and SNVs by multiplex PCR and next-generation sequencing (NGS). To detect CNVs, 33 normal samples were used as a cluster of control values to build a baseline, and the A, B, C, and D ratios were developed to evaluate -<sup>SEA</sup>, -&#x03B1;<sup>4.2</sup>, -&#x03B1;<sup>3.7</sup>, and compound or homozygous CNVs, respectively. To detect SNVs, sequencing data were analyzed using the system&#x0027;s software and annotated using Annovar software. In a test of performance, samples from 128 patients with thalassemia were analyzed using the developed method, and the results were confirmed by Sanger sequencing and gap-PCR. Four different CNV types were clearly distinguished by the developed algorithm: -<sup>SEA</sup>, -&#x03B1;<sup>3.7</sup>, -&#x03B1;<sup>4.2</sup>, and compound or homozygous deletions. The sensitivities for each CNV type were 96.72&#x0025; (59/61), 97.37&#x0025; (37/38), 83.33&#x0025; (10/12) and 95&#x0025; (19/20), and the specificities were 93.94&#x0025; (31/33), 93.94&#x0025; (31/33), 100&#x0025; (33/33) and 100&#x0025; (33/33), respectively. The SNVs detected were consistent with those of the Sanger sequencing.</p>
</abstract>
<kwd-group>
<kwd>thalassemia</kwd>
<kwd>copy number variant</kwd>
<kwd>single nucleotide variant</kwd>
<kwd>next-generation sequencing</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Thalassemia is caused by copy number variants (CNVs) and single nucleotide variants (SNVs) in the &#x03B1;-globin (HBA) or &#x03B2;-globin (HBB) genes that result in absent or reduced synthesis of &#x03B1;- or &#x03B2;-globin chains, and ultimately in hemolytic anemia. It is estimated that ~7&#x0025; of the world population carries the gene for the disease (<xref rid="b1-mmr-19-04-2837" ref-type="bibr">1</xref>), and the birth rate of children with hemoglobin (Hb) disorders is &#x2265;2.4&#x0025; per year (<xref rid="b2-mmr-19-04-2837" ref-type="bibr">2</xref>). Thalassemia is most prevalent in the Mediterranean region, Southeast Asia, the Indian subcontinent, and South China (<xref rid="b2-mmr-19-04-2837" ref-type="bibr">2</xref>). At present, the primary treatment methods are blood transfusion and iron removal. Bone marrow transplantation is also used but is expensive (<xref rid="b3-mmr-19-04-2837" ref-type="bibr">3</xref>). Thalassemia primarily includes &#x03B1;- and &#x03B2;-thalassemia. &#x03B1;-thalassemia is most often caused by CNVs or SNVs in the HBA gene. The most common SNV types in South China are Hb Constant Spring (HBA2:c.427T&#x003E;C), Hb Quong Sze (HBA2:c.377T&#x003E;C) and Hb Westmead (HBA2:c.369C&#x003E;G). The most common CNV types are the Southeast Asian type (&#x2212;<sup>SEA</sup>), the right deletion type (&#x2212;&#x03B1;<sup>3.7</sup>), and the left deletion type (&#x2212;&#x03B1;<sup>4.2</sup>). The -<sup>SEA</sup>/&#x03B1;&#x03B1;, -&#x03B1;<sup>3.7</sup>/&#x03B1;&#x03B1;, -&#x03B1;<sup>4.2</sup>/&#x03B1;&#x03B1;, &#x03B1;<sup>CS</sup>&#x03B1;/&#x03B1;&#x03B1;, and &#x03B1;<sup>QS</sup>&#x03B1;/&#x03B1;&#x03B1; types account for ~90&#x0025; of all &#x03B1;-thalassemia cases in this population (<xref rid="b4-mmr-19-04-2837" ref-type="bibr">4</xref>). &#x03B2;-thalassemia is primarily caused by SNVs in the HBB gene; few cases are caused by CNVs. 
At present, 889 SNV types have been found (<uri xlink:href="http://globin.3se.psu.edu/">http://globin.3se.psu.edu/</uri>). In China, &#x003E;60 SNVs have been identified (<xref rid="b5-mmr-19-04-2837" ref-type="bibr">5</xref>); the most common types are CD41-42 (-TCTT) (HBB:c.126_129delCTTT), CD17(A&#x003E;T) (HBB:c.52A&#x003E;T), IVS-II-654(C&#x003E;T) (HBB:c.316-197C&#x003E;T), &#x2212;28(A&#x003E;G) (HBB:c.-78A&#x003E;G), CD71/72(&#x002B;A) (HBB:c.216_217insA), &#x2212;29(A&#x003E;G) (HBB:c.-79A&#x003E;G), and CD26(G&#x003E;A) (HBB:c.79G&#x003E;A). These variants account for &#x003E;90&#x0025; of all &#x03B2;-thalassemia cases in China (<xref rid="b5-mmr-19-04-2837" ref-type="bibr">5</xref>).</p>
<p>The primary process for detecting thalassemia is routine blood examination of hematological parameters, including Hb content, mean corpuscular volume and mean corpuscular Hb, in addition to Hb electrophoresis of HbA2 and abnormal Hb (<xref rid="b6-mmr-19-04-2837" ref-type="bibr">6</xref>,<xref rid="b7-mmr-19-04-2837" ref-type="bibr">7</xref>). The molecular techniques used to diagnose thalassemia are primarily gap-polymerase chain reaction (PCR) and reverse dot blot (RDB) detection technology for target gene SNVs (<xref rid="b8-mmr-19-04-2837" ref-type="bibr">8</xref>). These two methods are used in clinical studies; however, they detect only ~20 known variants. Sanger sequencing technology can detect unknown SNVs; however, the data analysis is complicated and the throughput is low. Fluorescence quantitative PCR (qPCR) analysis can determine CNVs but cannot determine the breakpoint location. Multiplex ligation-dependent probe amplification, which involves designing specific probes for the globin gene cluster only, can detect 26 CNVs; however, the accuracy and precision of the results are affected by the limited number of fixed probes.</p>
<p>With the development of next-generation sequencing (NGS) technologies, including the Roche 454 system, Illumina MiSeq and HiSeq systems, and Life Technologies Ion Torrent PGM and Proton systems, numerous reports have described the concurrent detection of germline SNVs associated with a variety of monogenic diseases, or of somatic mutations associated with various types of cancer, including non-small cell lung cancer (<xref rid="b9-mmr-19-04-2837" ref-type="bibr">9</xref>) and colorectal cancer (<xref rid="b10-mmr-19-04-2837" ref-type="bibr">10</xref>). Several studies have reported that a single testing method can simultaneously detect CNVs and SNVs (<xref rid="b11-mmr-19-04-2837" ref-type="bibr">11</xref>&#x2013;<xref rid="b15-mmr-19-04-2837" ref-type="bibr">15</xref>). However, there are no related reports on the simultaneous detection of CNVs and SNVs of thalassemia.</p>
<p>In the present study, a method was established to simultaneously detect &#x03B1;- and &#x03B2;-thalassemia using a multiplex PCR of 82 primer pairs, NGS, and two analysis algorithms for the CNV and SNV types in the target genes (HBA and HBB). The CNV type of each sample was confirmed by gap-PCR, and the SNV type was confirmed by Sanger sequencing.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Blood sample collection and DNA extraction</title>
<p>A total of 128 blood samples of known thalassemia genotypes were collected from Fujian Medical University Union Hospital (Fujian, China) and the First Affiliated Hospital of Sun Yat-sen University (Guangzhou, China). The samples were collected from October 2016 to January 2017. There were 79 female and 49 male patients, with an age range of 4 months to 86 years (mean age, 26 years). Peripheral blood samples (~5 ml) were collected into tubes that contained ethylenediaminetetraacetic acid. For each sample, genomic DNA was extracted from 100 &#x00B5;l whole blood using the DNeasy Blood and Tissue kit (Qiagen, Inc., Germantown, MD, USA) according to the manufacturer&#x0027;s protocol. Briefly, the blood samples were mixed with 200 &#x00B5;l Buffer AL and 20 &#x00B5;l proteinase K and incubated for 10 min at 56&#x00B0;C. Following the addition of 200 &#x00B5;l ethanol (96&#x2013;100&#x0025;), the contents were transferred to a DNeasy Mini Spin Column placed in a 2-ml collection tube. The samples were centrifuged at 6,000 &#x00D7; g for 1 min at room temperature, again at 6,000 &#x00D7; g for 1 min at room temperature following the addition of Buffer AW1, and finally at 20,000 &#x00D7; g for 3 min at room temperature following the addition of Buffer AW2. The DNA was then eluted with 200 &#x00B5;l Buffer AE, quantified on a Qubit<sup>&#x00AE;</sup> fluorometer (Life Technologies; Thermo Fisher Scientific, Inc.), and stored at &#x2212;20&#x00B0;C prior to use.</p>
</sec>
<sec>
<title>Primer design</title>
<p>The primer sequences were designed using reference sequences of the HBA2 and HBB gene loci [accession nos. NC_000016.9 (222846..223709) and NC_000011.9 (5246696..5248301)] from the NCBI database (<uri xlink:href="http://www.ncbi.nlm.nih.gov/nuccore/NC_000016.9">http://www.ncbi.nlm.nih.gov/nuccore/NC_000016.9</uri> and <uri xlink:href="http://www.ncbi.nlm.nih.gov/nuccore/NC_000011.9">http://www.ncbi.nlm.nih.gov/nuccore/NC_000011.9</uri>) with Ion AmpliSeq&#x2122; Designer (<uri xlink:href="https://www.ampliseq.com/">https://www.ampliseq.com/</uri>). The Ion AmpliSeq&#x2122; Thalassemia Panel, which consists of two primer pools made up of 82 pairs of primers (72 pairs of primers for HBA2 and 10 pairs of primers for HBB), was designed by Life Technologies; Thermo Fisher Scientific, Inc.</p>
</sec>
<sec>
<title>Library construction</title>
<p>A library was constructed from each sample using the Ion AmpliSeq&#x2122; Library Kit 2.0 (Life Technologies; Thermo Fisher Scientific, Inc.). In brief, 10 ng genomic DNA, 4 &#x00B5;l 5X Ion AmpliSeq&#x2122; HiFi mix, 10 &#x00B5;l 2X Ion AmpliSeq&#x2122; primer pool, and 4 &#x00B5;l nuclease-free water were mixed to amplify the target regions. Subsequently, 2 &#x00B5;l FuPa reagent was added to each amplified sample to partially digest the primer sequences, and each library was ligated to a unique barcode and a universal adapter provided in the Ion Xpress&#x2122; barcode adapters (Life Technologies; Thermo Fisher Scientific, Inc.). Each library was purified using AMPure XP beads (Beckman Coulter, Inc., Brea, CA, USA). The purified libraries were quantified on a Qubit<sup>&#x00AE;</sup> 3.0 fluorometer. The size distributions of the libraries were verified using the Agilent High Sensitivity DNA kit on a 2100 Bioanalyzer (Agilent Technologies, Inc., Palo Alto, CA, USA).</p>
</sec>
<sec>
<title>Template preparation and enrichment</title>
<p>Each library was diluted to 100 pM according to its concentration as determined on the Qubit<sup>&#x00AE;</sup> 3.0 fluorometer. Subsequently, a pool of 14 or 15 libraries, each at 100 pM, was amplified by emulsion PCR with Ion PGM&#x2122; Hi-Q&#x2122; ion sphere particles (ISPs) using the Ion OneTouch&#x2122; 2 Instrument (Life Technologies; Thermo Fisher Scientific, Inc.) according to the manufacturer&#x0027;s protocol. The template-positive ISPs were enriched using the Ion OneTouch&#x2122; ES instrument (Life Technologies; Thermo Fisher Scientific, Inc.) according to the manufacturer&#x0027;s protocol.</p>
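<p>Diluting a library to 100 pM requires converting the mass concentration reported by the fluorometer into a molar concentration. The following is a minimal sketch of that conversion, assuming the commonly used approximation of 660 g/mol per base pair of double-stranded DNA; the function names and the example fragment length are hypothetical and are not taken from the present study.</p>
<preformat preformat-type="code"><![CDATA[
```python
def library_molarity_pm(conc_ng_per_ul, mean_length_bp, g_per_mol_per_bp=660.0):
    """Convert a dsDNA mass concentration (ng/ul) to a picomolar concentration.

    Assumption: an average mass of 660 g/mol per base pair of dsDNA.
    """
    if conc_ng_per_ul < 0 or mean_length_bp <= 0:
        raise ValueError("concentration must be >= 0 and length must be > 0")
    grams_per_litre = conc_ng_per_ul * 1e-3  # 1 ng/ul equals 1e-3 g/l
    mol_per_litre = grams_per_litre / (g_per_mol_per_bp * mean_length_bp)
    return mol_per_litre * 1e12  # mol/l to pmol/l


def dilution_factor(conc_ng_per_ul, mean_length_bp, target_pm=100.0):
    """Fold dilution needed to bring a library down to target_pm."""
    return library_molarity_pm(conc_ng_per_ul, mean_length_bp) / target_pm
```
]]></preformat>
<p>Under these assumptions, a 1 ng/&#x00B5;l library with a mean fragment length of 250 bp corresponds to ~6,061 pM and would need an ~61-fold dilution to reach 100 pM.</p>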
</sec>
<sec>
<title>NGS</title>
<p>The enriched templates were loaded onto one Ion 318&#x2122; chip V2 and sequenced on the Ion Torrent Personal Genome Machine (PGM; Life Technologies; Thermo Fisher Scientific, Inc.), a semiconductor sequencing platform.</p>
</sec>
<sec>
<title>Variant detection</title>
<p>Sequencing data were mapped to the human reference sequence hg19 (Genome Reference Consortium GRCh37). The variants were called in Torrent Suite (v.4.4.3; Life Technologies; Thermo Fisher Scientific, Inc.) using variant calling software with parameters optimized for the thalassemia panel. The variants were annotated using Annovar (<xref rid="b16-mmr-19-04-2837" ref-type="bibr">16</xref>) and the system&#x0027;s software. The detected variants were subjected to a rigorous manual curation process, which comprised a literature review and queries of variant databases, including the SNP database (<uri xlink:href="http://www.ncbi.nlm.nih.gov/snp/">www.ncbi.nlm.nih.gov/snp/</uri>), Exome Aggregation Consortium (<uri xlink:href="http://exac.broadinstitute.org/">exac.broadinstitute.org/</uri>), 1000 Genomes database (<uri xlink:href="http://www.internationalgenome.org/1000-genomes-browsers">www.internationalgenome.org/1000-genomes-browsers</uri>) and Clinvar database (<uri xlink:href="http://www.ncbi.nlm.nih.gov/clinvar/">www.ncbi.nlm.nih.gov/clinvar/</uri>).</p>
</sec>
<sec>
<title>Alignment of sequencing reads</title>
<p>CNVs were evaluated by counting the reads with a mapping quality (MAQ) &#x003E;10 in each amplicon. Reads that mapped uniquely to the hg19 sequence were first taken from the RAW.bam file, and the CIGAR string of each read was then trimmed. A read was counted for an amplicon when &#x003E;50&#x0025; of the read mapped uniquely within that amplicon. The MAQ&#x003E;10 filter was required for certain amplicons, including those in the HBA1 and HBA2 genes, as reads with low mapping quality in these regions would otherwise produce multiple hits. The same protocol was performed again using MAQ&#x003E;0 as the control group.</p>
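<p>The read-counting step can be illustrated with a short sketch. It is not the pipeline used in the present study: reads are represented as hypothetical (start, end, mapping quality) tuples on a single chromosome with half-open coordinates, and the &#x003E;50&#x0025; rule is interpreted as more than half of the read length lying inside the amplicon, which is an assumption.</p>
<preformat preformat-type="code"><![CDATA[
```python
def count_reads_per_amplicon(reads, amplicons, min_mapq=10, min_fraction=0.5):
    """Count reads per amplicon after a mapping-quality filter.

    reads:     iterable of (start, end, mapq) tuples (hypothetical layout).
    amplicons: dict mapping an amplicon name to its (start, end) interval.
    A read is assigned to the first amplicon that contains more than
    min_fraction of the read length; reads with mapq <= min_mapq are skipped.
    """
    counts = {name: 0 for name in amplicons}
    for start, end, mapq in reads:
        read_length = end - start
        if mapq <= min_mapq or read_length <= 0:
            continue
        for name, (amp_start, amp_end) in amplicons.items():
            overlap = min(end, amp_end) - max(start, amp_start)
            if overlap / read_length > min_fraction:
                counts[name] += 1
                break
    return counts
```
]]></preformat>
<p>Calling the function a second time with min_mapq=0 reproduces the control condition, so the two sets of counts can be compared amplicon by amplicon.</p>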
</sec>
<sec>
<title>Statistical analysis for CNV detection</title>
<p>A novel algorithm was developed to identify the CNV types of &#x03B1;-thalassemia. In these cases, the target amplicons were related to different types of &#x03B1;-thalassemia regions, as described above. The algorithm consisted predominantly of four tests: A ratio, which revealed the &#x03B1;-thalassemia-<sup>SEA</sup> deletion type; B ratio, which revealed the &#x03B1;-thalassemia-&#x03B1;<sup>4.2</sup> deletion type; C ratio, which revealed the &#x03B1;-thalassemia-&#x03B1;<sup>3.7</sup> deletion type; and D ratio, which represented the compound heterozygous or homozygous deletion.</p>
<p>Initially, several basic parameters were defined, including the number of sequence reads of the ith reference amplicon (ref-reads-i) and of the ith target amplicon (AMPL reads-i). The ref-reads value was defined as the average number of reads of the five reference amplicons: ref-reads=&#x03A3;(ref-reads-i)/5 (i=3, 4, 8, 9 and 10). The reads ratio of the ith target amplicon was defined as AMPL-i=(AMPL reads-i/ref-reads). The control reads ratios of 28 amplicons were obtained from 33 normal samples as a baseline, and the test reads ratio of each test sample was calculated in the same manner. Each of the four ratios was then defined as the median value of (test reads ratio/control reads ratio) over the following amplicons: A ratio, i=4, 7, 8, 9, 10, 12, 13, 15, 44, 45, 48, 49, 50, 51, 55, 57, 58, 60, 65, 66 and 67; B ratio, i=20, 21 and 22; C ratio, i=32, 33 and 35; and D ratio, i=27. GraphPad Prism (v. 5.0; GraphPad Software, Inc., La Jolla, CA, USA) software was used for all statistical analyses. Data are expressed as the mean &#x00B1; standard error of the mean and were analyzed using an unpaired Student&#x0027;s t-test (two tailed).</p>
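<p>The ratio definitions can be written out as a short sketch. The amplicon indices are those listed in the text; the function and variable names are hypothetical, and the per-amplicon control reads ratio is taken as an input because the way in which the normal samples are aggregated into a single baseline value is not specified here.</p>
<preformat preformat-type="code"><![CDATA[
```python
from statistics import mean, median

# Reference amplicons (ref-03, ref-04, ref-08, ref-09 and ref-10).
REFERENCE_AMPLICONS = (3, 4, 8, 9, 10)

# Target amplicons (AMPL-i) that enter each ratio, as listed in the text.
RATIO_AMPLICONS = {
    "A": (4, 7, 8, 9, 10, 12, 13, 15, 44, 45, 48, 49, 50,
          51, 55, 57, 58, 60, 65, 66, 67),
    "B": (20, 21, 22),
    "C": (32, 33, 35),
    "D": (27,),
}


def reads_ratios(ampl_reads, ref_reads):
    """Reads ratio per target amplicon: AMPL reads-i / ref-reads.

    ref-reads is the average read count of the five reference amplicons.
    """
    ref = mean(ref_reads[i] for i in REFERENCE_AMPLICONS)
    return {i: n / ref for i, n in ampl_reads.items()}


def cnv_ratios(test_ratios, control_ratios):
    """A, B, C and D ratios: median of test/control over each amplicon set."""
    return {
        name: median(test_ratios[i] / control_ratios[i] for i in indices)
        for name, indices in RATIO_AMPLICONS.items()
    }
```
]]></preformat>
<p>In this sketch, a sample in which every A-ratio amplicon has half of the baseline depth returns an A ratio of 0.5, whereas a sample with no reads in AMPL-27 returns a D ratio of 0.</p>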
</sec>
<sec>
<title>Gap-PCR validation</title>
<p>All samples were amplified using the &#x03B1;-Thalassemia Genetic Diagnostic kit (gap-PCR method; DaAN Gene Co., Ltd., Sun Yat-sen University, Guangzhou, China). The target products were detected by agarose gel electrophoresis.</p>
</sec>
<sec>
<title>Sanger sequencing validation</title>
<p>The HBA2 and HBB target genes were amplified using specific primers, and the target products were sequenced by Sanger sequencing. New primers were designed using Primer Premier 5.0 software. The primer sequences for HBA2 were: Forward 5&#x2032;-CCCCACATCCCCTCACCTACATTC-3&#x2032; and reverse 5&#x2032;-CGGGCAGGAGGAACGGCTAC-3&#x2032;; the primer sequences for HBB were: Forward 5&#x2032;-CAGAAGAGCCAAGGACAGGTACGGCT-3&#x2032; and reverse 5&#x2032;-AAGGGCCTAGCTTGGACTCAGAATAATCC-3&#x2032;.</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>Sequenced bases and mean read length of the 128 samples</title>
<p>The present study aimed to establish a method of simultaneously detecting CNVs and SNVs of thalassemia that can be applied to other diseases, including autism spectrum disorder (ASD), spinal muscular atrophy (SMA), and Duchenne muscular dystrophy (DMD). The samples for the study were selected with the aim of including as many different types of thalassemia as possible. Samples with sequencing reads ranging between 100 and 500 M were selected for analysis. The average number of total raw bases was 45,236,536 (range: 22,632,244&#x2013;100,007,680). The average read length was 155 bp. The mean percentage of sequencing reads mapped to the reference hg19 genome was 98&#x0025;. Following filtering of the low-quality reads, polyclonal reads and primer dimer reads, the number of sequenced bases with &#x2265;Q20 values ranged between 21,042,640 and 94,594,388 (<xref rid="f1-mmr-19-04-2837" ref-type="fig">Fig. 1</xref>).</p>
</sec>
<sec>
<title>SNV spectrum in HBA2 and HBB</title>
<p>In the present study, SNVs were identified in the target region. A total of 11 SNVs were identified according to the reference human genome hg19, including eight SNVs with clear definition of pathogenic alleles recorded in the Clinvar database, of which five SNVs were located in HBB exons, one in HBA2 exons, and two in introns or upstream of the HBB gene. The most frequently mutated gene locus was NM_000517.4:c.427T&#x003E;C (HBA2), which changes the termination codon of the HBA2 gene to a glutamine codon, so that synthesis of the polypeptide chain continues until the next stop codon. Two samples carried a nonsynonymous variant causing p.Val114Glu, and three samples carried frameshift variants. The results were consistent with the results of Sanger sequencing (<xref rid="f2-mmr-19-04-2837" ref-type="fig">Figs. 2A and B</xref> and <xref rid="f3-mmr-19-04-2837" ref-type="fig">3</xref>).</p>
<p>Three other nonsynonymous variants were identified in exons that were not recorded in the Clinvar database. The SIFT score (<xref rid="b17-mmr-19-04-2837" ref-type="bibr">17</xref>&#x2013;<xref rid="b19-mmr-19-04-2837" ref-type="bibr">19</xref>), Polyphen2_HDIV_score, Polyphen2_HVAR_score (<xref rid="b17-mmr-19-04-2837" ref-type="bibr">17</xref>,<xref rid="b18-mmr-19-04-2837" ref-type="bibr">18</xref>), and PROVEAN score (<xref rid="b20-mmr-19-04-2837" ref-type="bibr">20</xref>), which are used to predict whether an amino acid substitution or indel affects the biological function of a protein, were calculated to evaluate the possible adverse effects (i.e., deleterious or possibly damaging nature) of the nonsynonymous variants on protein function. The SNV carriers exhibited symptoms of thalassemia, supporting the prediction results (<xref rid="tI-mmr-19-04-2837" ref-type="table">Tables I</xref> and <xref rid="tII-mmr-19-04-2837" ref-type="table">II</xref>).</p>
<p>An additional 66 SNVs were identified that did not result in an amino acid change and were located in an intron, an intergenic region, or upstream or downstream of genes. The minor allele frequency (MAF) value of 20 SNVs in the 1000 Genomes database was &#x003C;0.01, indicating that these SNVs occur less frequently in the normal population. However, their potential adverse effects require further evaluation. The MAF value of 32 additional SNVs in the 1000 Genomes database was &#x003E;0.01, indicating a probable polymorphism, and each of these SNVs was carried by 1&#x2013;94 of the 128 samples in the present study. These SNVs may represent polymorphisms characteristic of Chinese individuals, providing evidence for gene haplotypes and the population distribution of these sites. The remaining 14 SNVs had no information in the 1000 Genomes database or other databases (<xref rid="tIII-mmr-19-04-2837" ref-type="table">Table III</xref>).</p>
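<p>The grouping of these SNVs by MAF can be expressed as a small sketch. The 0.01 cut-off is the one quoted above; the function names and the use of None for an SNV without a database entry are illustrative assumptions.</p>
<preformat preformat-type="code"><![CDATA[
```python
from collections import Counter


def classify_by_maf(maf, threshold=0.01):
    """Group an SNV by its minor allele frequency (None means no record)."""
    if maf is None:
        return "no frequency data"
    if maf < threshold:
        return "rare"
    if maf > threshold:
        return "probable polymorphism"
    return "at threshold"


def tally_by_maf(mafs, threshold=0.01):
    """Count how many SNVs fall into each MAF group."""
    return Counter(classify_by_maf(maf, threshold) for maf in mafs)
```
]]></preformat>
<p>Applied to 20 SNVs below the cut-off, 32 above it and 14 without a record, the tally accounts for all 66 SNVs.</p>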
</sec>
<sec>
<title>Determination of the quality of the sequencing reads</title>
<p>The human HBA gene cluster, located on chromosome 16, spans ~30 kb and includes seven loci: 5&#x2032;-zeta-pseudo-zeta-mu-pseudo-alpha-1-alpha-2-alpha-1-theta-3&#x2032;. The &#x03B1;-2 (HBA2) and &#x03B1;-1 (HBA1) coding sequences are identical, and the overall similarity of the two gene sequences is almost 97&#x0025;. They differ only marginally in their 5&#x2032;-untranslated region and introns and differ significantly in their 3&#x2032;-untranslated region. The detection of target CNVs of HBA2 therefore depends on an accurate alignment algorithm to avoid ambiguity between HBA2 and HBA1.</p>
<p>The present study introduced the concept of mapping quality, a measure of the confidence that a read actually comes from the position it is aligned to by the mapping algorithm. Align MAQ can build assemblies by mapping shotgun short reads to a reference genome using quality scores to derive genotype calls of the consensus sequence of a diploid genome (<xref rid="b21-mmr-19-04-2837" ref-type="bibr">21</xref>). In the present study, six -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup> samples and three -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup> samples were analyzed using Align MAQ=10. The aligned reads of AMPL-25, AMPL-29, AMPL-30, AMPL-37, and AMPL-38 in the -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup> samples using Align MAQ&#x003E;10 were close to 0 compared with those using Align MAQ=0 (<xref rid="f4-mmr-19-04-2837" ref-type="fig">Fig. 4A and B</xref>). Similar results were found in the -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup> samples.</p>
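<p>Mapping quality is conventionally reported on a Phred scale, Q=&#x2212;10 log<sub>10</sub>(P), where P is the probability that the read is placed at the wrong position. The sketch below assumes this conventional definition, which is not restated in the present study, and the function names are hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
import math


def mapq_to_error_probability(mapq):
    """Probability that a read is misplaced, given a Phred-scaled quality."""
    return 10 ** (-mapq / 10)


def error_probability_to_mapq(probability):
    """Phred-scaled mapping quality for a given misplacement probability."""
    if not 0 < probability <= 1:
        raise ValueError("probability must be in the interval (0, 1]")
    return -10 * math.log10(probability)
```
]]></preformat>
<p>On this scale, a quality of 10 corresponds to a 10&#x0025; chance that a read is misplaced, and a quality of 0 conveys no confidence in the placement, which is consistent with reads shared between HBA1 and HBA2 being removed by the filter.</p>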
</sec>
<sec>
<title>Evaluation of the performance of the reference gene amplicons by NGS</title>
<p>Applying reference amplicons is key to constructing an algorithm to detect CNVs. For an algorithm to be accurate, the reference gene region should be a stable diploid with minimal variation in the amplicon sequencing depth of different samples. Among the thalassemia-associated genes, regions of the HBB gene (ref-03-chr11: 5246753&#x2013;5246986, ref-04-chr11: 5246976&#x2013;5247184, ref-08-chr11: 5248047&#x2013;5248296, ref-09-chr11: 5248286&#x2013;5248485, and ref-10-chr11: 5248475&#x2013;5248641, hg19), which encodes &#x03B2;-globin, were selected as reference amplicons. The HBB gene was used as the endogenous reference gene as &#x03B2;-thalassemia is predominantly caused by SNVs in the HBB gene, rather than a CNV. For thalassemia of the HBB CNV types, other genes require selection as the reference gene. The sequencing depth at each base pair position in these five regions was counted in all 128 samples, which were divided into seven groups. Normalized ref-reads were generated and are shown for each sample. The values varied between 1,136 and 13,282, with no significant differences in the amplicon sequencing depth among the samples in the seven groups, with the exception of -<sup>SEA</sup>/&#x03B1;&#x03B1; (<xref rid="f5-mmr-19-04-2837" ref-type="fig">Fig. 5A</xref>). The abnormal value in the -<sup>SEA</sup>/&#x03B1;&#x03B1; group may have been caused by deletions in the HBA2 gene region, although without influence on the final results.</p>
<p>The next step was to investigate the consistency of the samples. A cluster of reference reads ratios of 28 amplicons was built as a baseline across 33 normal samples (<xref rid="f5-mmr-19-04-2837" ref-type="fig">Fig. 5B</xref>). The reads ratio was defined as the ratio of the target region reads to the reference region reads of each sample. Examination of the coefficient of variation (CV) of the reference samples revealed that 24 of the 28 amplicons had CVs with values &#x003C;41.1&#x0025; (<xref rid="f5-mmr-19-04-2837" ref-type="fig">Fig. 5C</xref>).</p>
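<p>The CV of each baseline amplicon can be computed as sketched below. The use of the sample standard deviation is an assumption, as the estimator is not specified in the text, and the function names and example values are hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """CV in percent: sample standard deviation divided by the mean."""
    values = list(values)
    if len(values) < 2:
        raise ValueError("at least two values are required")
    centre = mean(values)
    if centre == 0:
        raise ValueError("the mean must be non-zero")
    return stdev(values) / centre * 100.0


def amplicons_below(baseline, max_cv):
    """Names of amplicons whose reads ratios vary by less than max_cv percent.

    baseline maps an amplicon name to its reads ratios across normal samples.
    """
    return sorted(
        name for name, ratios in baseline.items()
        if coefficient_of_variation(ratios) < max_cv
    )
```
]]></preformat>
<p>For instance, reads ratios of 0.8, 1.0 and 1.2 across three samples give a CV of 20&#x0025;, which would fall below the 41.1&#x0025; value reported above.</p>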
</sec>
<sec>
<title>CNV detection by NGS</title>
<p>To identify an indicator for CNV detection, a novel algorithm was developed based on the ratio of the median reads ratio of the target sample to that of the reference. The median ratio value, rather than the mean ratio value, was used to evaluate the CNV type as the median is less sensitive to deviations caused by sequencing errors. The A ratio, B ratio, C ratio and D ratio revealed the copy numbers of the region related to the Southeast Asia deletion, the -&#x03B1;<sup>4.2</sup> deletion, the -&#x03B1;<sup>3.7</sup> deletion, and the compound deletion type of &#x03B1;-thalassemia, respectively. The A ratio ranged between 0.741 and 1.298 in the normal group, between 0.263 and 0.899 in the -<sup>SEA</sup>/&#x03B1;&#x03B1; group, between 0.246 and 0.898 in the -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup> group, and between 0.232 and 0.707 in the -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup> group. The difference in the A ratio was significant (P&#x003C;0.0001) between the normal (&#x03B1;&#x03B1;/&#x03B1;&#x03B1;) samples and heterozygous Southeast Asia deletion type (&#x2212;<sup>SEA</sup>/&#x03B1;&#x03B1;) samples according to Student&#x0027;s t-test (<xref rid="f6-mmr-19-04-2837" ref-type="fig">Fig. 6A</xref>). Consistent with the heterozygous Southeast Asia deletion type (&#x2212;<sup>SEA</sup>/&#x03B1;&#x03B1;, -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup>, and -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup>), the fluctuations in the B ratio and C ratio associated with the -&#x03B1;<sup>4.2</sup> deletion and -&#x03B1;<sup>3.7</sup> deletion were similar to those of the A ratio. These differences were also significant (<xref rid="f6-mmr-19-04-2837" ref-type="fig">Fig. 6B and C</xref>). The D ratio was defined as the ratio of the AMPL-27 reads ratio of the target sample to the reference median reads ratio. AMPL-27 spans chr16: 223333 to chr16: 223548 in the HBA2 gene (HBA1 and HBA2 genes encode ~97&#x0025; of the total Hb). 
A homozygous deletion in this region indicates a severe type of thalassemia. AMPL-27 is a common deletion region in these three types. Therefore, the D ratios in the -&#x03B1;<sup>3.7</sup>/-&#x03B1;<sup>3.7</sup>, -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup>, and -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup> groups were close to zero (<xref rid="f6-mmr-19-04-2837" ref-type="fig">Fig. 6D</xref>).</p>
</sec>
<sec>
<title>Targeted CNVs detected by NGS</title>
<p>The Southeast Asia, -&#x03B1;<sup>4.2</sup>, and -&#x03B1;<sup>3.7</sup> deletions were identified using the following criteria: A ratio &#x003C;0.8, B ratio &#x003C;0.4, and C ratio &#x003C;0.8, respectively. Subsequently, gap-PCR was used to evaluate the sensitivity and specificity of the approach. A total of 61 heterozygous Southeast Asia deletion (&#x2212;<sup>SEA</sup>/&#x03B1;&#x03B1;, -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup>, and -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup>) samples were detected with 96.72&#x0025; (59/61) sensitivity and 93.94&#x0025; (31/33) specificity, 12 heterozygous -&#x03B1;<sup>4.2</sup> deletion (&#x2212;&#x03B1;<sup>4.2</sup>/&#x03B1;&#x03B1; and -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup>) samples were detected with 83.33&#x0025; (10/12) sensitivity and 100&#x0025; (33/33) specificity, and 38 -&#x03B1;<sup>3.7</sup> deletion (&#x2212;&#x03B1;<sup>3.7</sup>/&#x03B1;&#x03B1;, -&#x03B1;<sup>3.7</sup>/-&#x03B1;<sup>3.7</sup>, and -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup>) samples were detected with 97.37&#x0025; sensitivity (37/38) and 93.94&#x0025; (31/33) specificity. Compound homozygous thalassemia was identified using the following criterion: D ratio &#x003C;0.002. In total, 20 homozygous deletions of AMPL-27 were detected with 95&#x0025; (19/20) sensitivity and 100&#x0025; specificity (33/33).</p>
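<p>The decision rules and the performance figures can be reproduced with a brief sketch. The cut-offs are those quoted above; the function names and the plain-text genotype labels are hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
# Cut-offs quoted in the text; a call is made when a ratio falls below its cut-off.
THRESHOLDS = {"A": 0.8, "B": 0.4, "C": 0.8, "D": 0.002}
CALLS = {
    "A": "--SEA",
    "B": "-alpha4.2",
    "C": "-alpha3.7",
    "D": "compound or homozygous deletion",
}


def call_cnvs(ratios):
    """Return the deletion calls supported by a sample's A, B, C and D ratios."""
    return [CALLS[name] for name in ("A", "B", "C", "D")
            if ratios[name] < THRESHOLDS[name]]


def sensitivity(true_positives, false_negatives):
    """Fraction of gap-PCR-confirmed deletions that the ratios detect."""
    return true_positives / (true_positives + false_negatives)


def specificity(true_negatives, false_positives):
    """Fraction of normal samples that are called free of the deletion."""
    return true_negatives / (true_negatives + false_positives)
```
]]></preformat>
<p>With 59 of 61 Southeast Asia deletion samples detected, sensitivity(59, 2) gives 96.72&#x0025;, and with 31 of 33 normal samples called correctly, specificity(31, 2) gives 93.94&#x0025;.</p>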
</sec>
<sec>
<title>Correlation between target CNVs and SNVs</title>
<p>As NGS technology is able to simultaneously detect target gene CNVs and SNVs, their correlation was investigated in the present study. When one copy of a gene is deleted, only a single copy of the gene remains rather than two. If the remaining copy carries an SNV, that SNV is detected at a frequency of 100&#x0025;; such a sample is defined as a compound heterozygote for a CNV and an SNV. In the present study, certain samples had compound heterozygous CNVs and SNVs (e.g., -<sup>SEA</sup>/&#x03B1;&#x03B1; and CD122).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>Human genetic diseases are generally caused by changes in genetic material that are considered to affect performance by controlling the expression of traits. These changes include SNVs and structural variations, which are operationally defined as CNVs, inversions and translocations (<xref rid="b22-mmr-19-04-2837" ref-type="bibr">22</xref>&#x2013;<xref rid="b31-mmr-19-04-2837" ref-type="bibr">31</xref>). There are different detection methods for different diseases. SNVs are usually detected by Sanger sequencing, Southern blotting (<xref rid="b32-mmr-19-04-2837" ref-type="bibr">32</xref>), PCR-RDB (<xref rid="b33-mmr-19-04-2837" ref-type="bibr">33</xref>,<xref rid="b34-mmr-19-04-2837" ref-type="bibr">34</xref>) or matrix-assisted laser desorption ionization time-of-flight mass spectrometry (<xref rid="b35-mmr-19-04-2837" ref-type="bibr">35</xref>). Partial CNVs, including deletions and duplications, are often detected by qPCR (<xref rid="b36-mmr-19-04-2837" ref-type="bibr">36</xref>), array comparative genomic hybridization (<xref rid="b37-mmr-19-04-2837" ref-type="bibr">37</xref>) and massively parallel DNA sequencing (<xref rid="b38-mmr-19-04-2837" ref-type="bibr">38</xref>). However, the genetic profile is so complex that the concurrent detection of an SNV and a structural chromosomal abnormality is difficult. In recent years, with the development of NGS technologies, several reports have described a single testing method that can simultaneously detect an SNV and a CNV (<xref rid="b11-mmr-19-04-2837" ref-type="bibr">11</xref>&#x2013;<xref rid="b15-mmr-19-04-2837" ref-type="bibr">15</xref>). These reports provide insight into novel methods of detecting inherited diseases. However, in the majority of studies, massive probes have been used to capture target gene regions, following which the target DNA was detected by massively parallel sequencing or NGS. In other studies, the whole genome was sequenced and only the target gene region was analyzed. 
The use of massive probes or the whole genome requires higher costs and labor requirements compared with the use of multiple primers to capture target gene regions.</p>
<p>In the present study, &#x03B1;- and &#x03B2;-thalassemia were used as the study model, covering CNVs and SNVs in the HBA gene and SNVs in the HBB gene. Multiplex PCR-NGS technology can detect CNVs and SNVs in disease-specific genes. For the detection of SNVs, the coincidence rate with the gold-standard sequencing method was 100&#x0025;, indicating accuracy similar to that of Sanger sequencing. For the detection of CNVs, although 100&#x0025; accuracy was not achieved in the present study, there were few false negatives, and false positives could be reduced using a subsequent validation technique, such as Sanger sequencing and/or gap-PCR. Furthermore, the technology can detect CNVs and SNVs across the entire target region, in addition to the specific region. In the present study, a novel algorithm was developed to detect target CNVs and SNVs simultaneously using NGS data. In this algorithm, an alignment setting of MAQ=10 was used when aligning the sequencing reads at a specific position, in order to remove mismatches that may lead to the false detection of variants. The results indicated that the method was accurate, with high sensitivity and specificity, using MAQ=10.</p>
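<p>The read-filtering step described above can be illustrated with a minimal sketch. It assumes that MAQ=10 denotes a minimum mapping-quality score of 10 per aligned read; the record type, the amplicon labels and the example values are hypothetical and are not taken from the study pipeline.</p>
<preformat>
```python
from collections import Counter
from dataclasses import dataclass

# Threshold quoted in the text; treating it as a minimum
# mapping-quality score is an assumption of this sketch.
MIN_MAPQ = 10


@dataclass(frozen=True)
class AlignedRead:
    amplicon: str  # hypothetical amplicon label, e.g. "HBA" or "HBB"
    mapq: int      # mapping quality reported by the aligner


def filter_by_mapq(reads, min_mapq=MIN_MAPQ):
    """Keep only reads whose mapping quality is at least min_mapq."""
    return [read for read in reads if read.mapq >= min_mapq]


def count_per_amplicon(reads):
    """Count the retained reads for each amplicon label."""
    return Counter(read.amplicon for read in reads)


# Hypothetical reads, for illustration only.
reads = [
    AlignedRead("HBA", 60),
    AlignedRead("HBA", 3),
    AlignedRead("HBB", 10),
    AlignedRead("HBB", 42),
    AlignedRead("HBB", 0),
]

kept = filter_by_mapq(reads)
counts = count_per_amplicon(kept)
print(dict(counts))
```
</preformat>
<p>In this sketch, poorly mapped reads are discarded before any counting, so that both variant calling and read-depth ratios would be computed from the same filtered set of reads.</p>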
<p>The reference gene region was selected to normalize the PCRs for the quantity of genomic DNA added to the sequencing reactions. A ratio was calculated from the reads in the target gene relative to those in the reference gene, and the read count data were converted into a standardized normal score. In the present study, the HBB gene was used as the endogenous reference gene for detecting the &#x03B1;-thalassemia CNV type, as &#x03B2;-thalassemia is predominantly caused by an SNV in the HBB gene, not a CNV. For thalassemia of the HBB CNV type, a different gene must be selected as the reference gene. The algorithm developed in the present study was based on a previously reported relative qPCR method (<xref rid="b39-mmr-19-04-2837" ref-type="bibr">39</xref>). In that approach, however, standard housekeeping genes, including GAPDH and &#x03B2;-actin, are typically used as internal control genes (<xref rid="b40-mmr-19-04-2837" ref-type="bibr">40</xref>,<xref rid="b41-mmr-19-04-2837" ref-type="bibr">41</xref>). Suitable internal controls are necessary for building the algorithm. Some bias in the normalized ref-reads (<xref rid="f5-mmr-19-04-2837" ref-type="fig">Fig. 5A</xref>) remained in the -<sup>SEA</sup>/&#x03B1;&#x03B1; group (i.e., the normalized reads of the reference gene region in the -<sup>SEA</sup>/&#x03B1;&#x03B1; group were significantly higher than in other groups). Therefore, based on the algorithm built in the present study, the reference gene can be used instead of other housekeeping genes, and the results are likely to be more accurate.</p>
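<p>The normalization described above can be sketched as follows. This is a minimal illustration and not the algorithm of the present study: the function names, the read counts and the cut-off of 3 standard deviations are assumptions made for the example, and the baseline is taken to be the mean and standard deviation of the target-to-reference ratios in a set of control samples.</p>
<preformat>
```python
from statistics import mean, stdev


def target_ref_ratio(target_reads, ref_reads):
    """Ratio of target-amplicon reads to reference-amplicon reads."""
    if ref_reads == 0:
        raise ValueError("reference amplicon has no reads")
    return target_reads / ref_reads


def build_baseline(control_ratios):
    """Baseline (mean, standard deviation) from control-sample ratios."""
    if 2 > len(control_ratios):
        raise ValueError("at least two control samples are required")
    sd = stdev(control_ratios)
    if sd == 0:
        raise ValueError("control ratios show no variation")
    return mean(control_ratios), sd


def z_score(ratio, baseline):
    """Standardized score of one sample ratio against the baseline."""
    mu, sd = baseline
    return (ratio - mu) / sd


def call_cnv(z, cutoff=3.0):
    """Classify a z-score; the cut-off of 3.0 is an assumed value."""
    if z > cutoff:
        return "gain"
    if -cutoff > z:
        return "loss"
    return "normal"


# Hypothetical (target reads, reference reads) for control samples.
controls = [(1000, 1000), (980, 1000), (1020, 1000), (1010, 1000), (990, 1000)]
baseline = build_baseline([target_ref_ratio(t, r) for t, r in controls])

# Hypothetical test samples: one with half the target reads, one normal.
z_del = z_score(target_ref_ratio(500, 1000), baseline)
z_norm = z_score(target_ref_ratio(1005, 1000), baseline)
print(call_cnv(z_del), call_cnv(z_norm))
```
</preformat>
<p>Dividing by the reference-amplicon reads removes differences in DNA input between samples, and scoring against the control cluster expresses each ratio in units of the variation observed among controls. A real assay would require a validated cut-off for each amplicon.</p>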
<p>The present study also provides an example of CNV detection that can be applied to other CNV-related diseases. Several diseases are related to target CNVs and SNVs; these include neurological disorders, such as ASD (<xref rid="b42-mmr-19-04-2837" ref-type="bibr">42</xref>) and schizophrenia (<xref rid="b43-mmr-19-04-2837" ref-type="bibr">43</xref>), muscular disorders, such as SMA (<xref rid="b44-mmr-19-04-2837" ref-type="bibr">44</xref>) and DMD (<xref rid="b45-mmr-19-04-2837" ref-type="bibr">45</xref>), and certain types of cancer (<xref rid="b46-mmr-19-04-2837" ref-type="bibr">46</xref>&#x2013;<xref rid="b48-mmr-19-04-2837" ref-type="bibr">48</xref>). However, few diagnostic methods can detect both types of variant simultaneously. Combining CNV and SNV analyses in one method can save on labor costs.</p>
<p>In conclusion, the simultaneous detection of target CNVs and SNVs of thalassemia by multiplex PCR and next-generation sequencing is a valid strategy for thalassemia studies. Previous methods for SNV detection involve PCR-RDB or Sanger sequencing. These methods are currently used in clinical studies; however, PCR-RDB detects only known variants, and although Sanger sequencing can detect unknown SNVs, its data analysis is complicated and its throughput is low. The present study used multiplex PCR and next-generation sequencing to detect novel mutations and target SNVs. For CNV detection, the previous method of gap-PCR can detect the -<sup>SEA</sup>, -&#x03B1;<sup>4.2</sup> and -&#x03B1;<sup>3.7</sup> deletion types with good accuracy, but samples require re-testing, which increases labor. Therefore, the present study built a novel algorithm for CNV detection. The use of a cluster of control values to build a baseline, together with the ratios of the target amplicons to the reference amplicons, increased the precision of the algorithm. Overall, the present study demonstrates the feasibility of using NGS data to detect both targeted CNVs and SNVs. This strategy allows for the use of multiplex PCR and NGS as routine methods; however, further computational and technological developments are required.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding</title>
<p>This study received financial assistance from the Science and Technology Program of Guangdong (grant no. 2015A030401040), the Key Program for Health Care Collaborative Innovation of Guangzhou (grant no. 201500000004-4), the Science and Technology Program of Guangzhou (grant no. 201704020114) and the Medical Scientific Research Foundation of Guangdong Province, China (grant no. A2017518).</p>
</sec>
<sec>
<title>Availability of data and materials</title>
<p>The datasets used or analyzed during the current study are available from the corresponding author on reasonable request.</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>DMF, XY, XXY and ML conceived and designed the study. DMF and LMH performed the experiments. DMF, XY and LMH wrote the paper. XXY and ML improved the manuscript. DMF, XY and GJO analyzed the data. All authors read and approved the manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>The study protocol was approved by the Medical Ethics Committee of Shenzhen Hospital of Southern Medical University (Shenzhen, China), and the Committee on Human Research, Publications and Ethics of the School of Laboratory Medicine and Biotechnology, Southern Medical University. Prior to recruitment and sample collection, meetings were held to explain in detail the purpose and procedures of the study. The inconveniences involved, including blood sampling, were also explained to the participants. Written informed consent was obtained from each participant or the participant&#x0027;s guardian. The study was undertaken according to the principles of the Helsinki Declaration of 1975 (as revised in 2008).</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>NGS</term><def><p>next-generation sequencing</p></def></def-item>
<def-item><term>CNV</term><def><p>copy number variant</p></def></def-item>
<def-item><term>SNV</term><def><p>single nucleotide variant</p></def></def-item>
</def-list>
</glossary>
<ref-list>
<title>References</title>
<ref id="b1-mmr-19-04-2837"><label>1</label><element-citation publication-type="journal"><collab collab-type="corp-author">Modell, Bernadette and World Health Organization</collab><article-title>Hereditary Diseases Programme</article-title><source>Guidelines for the control of haemoglobin disorders/edited by Bernadette Modell</source><publisher-name>World Health Organization</publisher-name><publisher-loc>Geneva</publisher-loc><year>1994</year></element-citation></ref>
<ref id="b2-mmr-19-04-2837"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Angastiniotis</surname><given-names>M</given-names></name><name><surname>Modell</surname><given-names>B</given-names></name></person-group><article-title>Global epidemiology of hemoglobin disorders</article-title><source>Ann N Y Acad Sci</source><volume>850</volume><fpage>251</fpage><lpage>269</lpage><year>1998</year><pub-id pub-id-type="doi">10.1111/j.1749-6632.1998.tb10482.x</pub-id><pub-id pub-id-type="pmid">9668547</pub-id></element-citation></ref>
<ref id="b3-mmr-19-04-2837"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mohamed</surname><given-names>SY</given-names></name></person-group><article-title>Thalassemia Major: Transplantation or transfusion and chelation</article-title><source>Hematol Oncol Stem Cell Ther</source><volume>10</volume><fpage>290</fpage><lpage>298</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.hemonc.2017.05.022</pub-id><pub-id pub-id-type="pmid">28651066</pub-id></element-citation></ref>
<ref id="b4-mmr-19-04-2837"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname><given-names>CG</given-names></name><name><surname>Liu</surname><given-names>M</given-names></name><name><surname>Du</surname><given-names>J</given-names></name><name><surname>Chen</surname><given-names>K</given-names></name><name><surname>Yang</surname><given-names>Y</given-names></name><name><surname>Yang</surname><given-names>Z</given-names></name></person-group><article-title>Molecular spectrum of &#x03B1;- and &#x03B2;-globin gene mutations detected in the population of Guangxi Zhuang Autonomous Region, People&#x0027;s Republic of China</article-title><source>Hemoglobin</source><volume>35</volume><fpage>28</fpage><lpage>39</lpage><year>2011</year><pub-id pub-id-type="doi">10.3109/03630269.2010.547429</pub-id><pub-id pub-id-type="pmid">21250879</pub-id></element-citation></ref>
<ref id="b5-mmr-19-04-2837"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Y</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name></person-group><article-title>Research progress on thalassemia in Southern China-review</article-title><source>Zhongguo Shi Yan Xue Ye Xue Za Zhi</source><volume>25</volume><fpage>276</fpage><lpage>280</lpage><year>2017</year><comment>(In Chinese)</comment><pub-id pub-id-type="pmid">28245416</pub-id></element-citation></ref>
<ref id="b6-mmr-19-04-2837"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Batterbee</surname><given-names>H</given-names></name><name><surname>De la Salle</surname><given-names>B</given-names></name><name><surname>Wild</surname><given-names>B</given-names></name><name><surname>McTaggart</surname><given-names>P</given-names></name><name><surname>Dor&#x00E9;</surname><given-names>C</given-names></name><name><surname>Porter</surname><given-names>N</given-names></name><name><surname>Hyde</surname><given-names>K</given-names></name></person-group><article-title>Evaluation of the validity of UK NEQAS Hb A2 data for the NHS Sickle Cell and Thalassaemia Screening Programme</article-title><source>Br J Haematol</source><volume>149</volume><fpage>S1</fpage><lpage>S96</lpage><year>2010</year></element-citation></ref>
<ref id="b7-mmr-19-04-2837"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ryan</surname><given-names>K</given-names></name><name><surname>Bain</surname><given-names>BJ</given-names></name><name><surname>Worthington</surname><given-names>D</given-names></name><name><surname>James</surname><given-names>J</given-names></name><name><surname>Plews</surname><given-names>D</given-names></name><name><surname>Mason</surname><given-names>A</given-names></name><name><surname>Roper</surname><given-names>D</given-names></name><name><surname>Rees</surname><given-names>DC</given-names></name><name><surname>de la Salle</surname><given-names>B</given-names></name><name><surname>Streetly</surname><given-names>A</given-names></name><etal/></person-group><article-title>Significant haemoglobinopathies: Guidelines for screening and diagnosis</article-title><source>Br J Haematol</source><volume>149</volume><fpage>35</fpage><lpage>49</lpage><year>2010</year><pub-id pub-id-type="doi">10.1111/j.1365-2141.2009.08054.x</pub-id></element-citation></ref>
<ref id="b8-mmr-19-04-2837"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname><given-names>W</given-names></name><name><surname>Zhang</surname><given-names>C</given-names></name><name><surname>Lu</surname><given-names>F</given-names></name><name><surname>Tang</surname><given-names>J</given-names></name><name><surname>Lu</surname><given-names>Y</given-names></name><name><surname>Cui</surname><given-names>X</given-names></name><name><surname>Qin</surname><given-names>X</given-names></name><name><surname>Li</surname><given-names>S</given-names></name></person-group><article-title>Spectrum of &#x03B1;-thalassemia and &#x03B2;-thalassemia mutations in the Guilin Region of southern China</article-title><source>Clin Biochem</source><volume>48</volume><fpage>1068</fpage><lpage>1072</lpage><year>2015</year><pub-id pub-id-type="doi">10.1016/j.clinbiochem.2015.06.008</pub-id><pub-id pub-id-type="pmid">26079343</pub-id></element-citation></ref>
<ref id="b9-mmr-19-04-2837"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>K</given-names></name><name><surname>Zhang</surname><given-names>M</given-names></name><name><surname>Zhu</surname><given-names>J</given-names></name><name><surname>Hong</surname><given-names>W</given-names></name></person-group><article-title>Screening of gene mutations associated with bone metastasis in nonsmall cell lung cancer</article-title><source>J Cancer Res Ther</source><volume>12</volume><supplement>(Suppl)</supplement><fpage>C186</fpage><lpage>C190</lpage><year>2016</year><pub-id pub-id-type="doi">10.4103/0973-1482.200597</pub-id><pub-id pub-id-type="pmid">28230015</pub-id></element-citation></ref>
<ref id="b10-mmr-19-04-2837"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gallego</surname><given-names>CJ</given-names></name><name><surname>Shirts</surname><given-names>BH</given-names></name><name><surname>Bennette</surname><given-names>CS</given-names></name><name><surname>Guzauskas</surname><given-names>G</given-names></name><name><surname>Amendola</surname><given-names>LM</given-names></name><name><surname>Horike-Pyne</surname><given-names>M</given-names></name><name><surname>Hisama</surname><given-names>FM</given-names></name><name><surname>Pritchard</surname><given-names>CC</given-names></name><name><surname>Grady</surname><given-names>WM</given-names></name><name><surname>Burke</surname><given-names>W</given-names></name><etal/></person-group><article-title>Next-Generation sequencing panels for the diagnosis of colorectal cancer and polyposis syndromes: A cost-effectiveness analysis</article-title><source>J Clin Oncol</source><volume>33</volume><fpage>2084</fpage><lpage>2091</lpage><year>2015</year><pub-id pub-id-type="doi">10.1200/JCO.2014.59.3665</pub-id><pub-id pub-id-type="pmid">25940718</pub-id><pub-id pub-id-type="pmcid">4461806</pub-id></element-citation></ref>
<ref id="b11-mmr-19-04-2837"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tuononen</surname><given-names>K</given-names></name><name><surname>M&#x00E4;ki-Nevala</surname><given-names>S</given-names></name><name><surname>Sarhadi</surname><given-names>VK</given-names></name><name><surname>Wirtanen</surname><given-names>A</given-names></name><name><surname>R&#x00F6;nty</surname><given-names>M</given-names></name><name><surname>Salmenkivi</surname><given-names>K</given-names></name><name><surname>Andrews</surname><given-names>JM</given-names></name><name><surname>Telaranta-Keerie</surname><given-names>AI</given-names></name><name><surname>Hannula</surname><given-names>S</given-names></name><name><surname>Lagstr&#x00F6;m</surname><given-names>S</given-names></name><etal/></person-group><article-title>Comparison of targeted next-generation sequencing (NGS) and real-time PCR in the detection of EGFR, KRAS, and BRAF mutations on formalin-fixed, paraffin-embedded tumor material of non-small cell lung carcinoma-superiority of NGS</article-title><source>Genes Chromosomes Cancer</source><volume>52</volume><fpage>503</fpage><lpage>511</lpage><year>2013</year><pub-id pub-id-type="doi">10.1002/gcc.22047</pub-id><pub-id pub-id-type="pmid">23362162</pub-id></element-citation></ref>
<ref id="b12-mmr-19-04-2837"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shen</surname><given-names>W</given-names></name><name><surname>Szankasi</surname><given-names>P</given-names></name><name><surname>Sederberg</surname><given-names>M</given-names></name><name><surname>Schumacher</surname><given-names>J</given-names></name><name><surname>Frizzell</surname><given-names>KA</given-names></name><name><surname>Gee</surname><given-names>EP</given-names></name><name><surname>Patel</surname><given-names>JL</given-names></name><name><surname>South</surname><given-names>ST</given-names></name><name><surname>Xu</surname><given-names>X</given-names></name><name><surname>Kelley</surname><given-names>TW</given-names></name></person-group><article-title>Concurrent detection of targeted copy number variants and mutations using a myeloid malignancy next generation sequencing panel allows comprehensive genetic analysis using a single testing strategy</article-title><source>Br J Haematol</source><volume>173</volume><fpage>49</fpage><lpage>58</lpage><year>2016</year><pub-id pub-id-type="doi">10.1111/bjh.13921</pub-id><pub-id pub-id-type="pmid">26728869</pub-id></element-citation></ref>
<ref id="b13-mmr-19-04-2837"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>SY</given-names></name><name><surname>Kim</surname><given-names>JH</given-names></name><name><surname>Chung</surname><given-names>YJ</given-names></name></person-group><article-title>Effect of combining multiple CNV defining algorithms on the reliability of CNV calls from SNP genotyping data</article-title><source>Genomics Inform</source><volume>10</volume><fpage>194</fpage><lpage>199</lpage><year>2012</year><pub-id pub-id-type="doi">10.5808/GI.2012.10.3.194</pub-id><pub-id pub-id-type="pmid">23166530</pub-id><pub-id pub-id-type="pmcid">3492655</pub-id></element-citation></ref>
<ref id="b14-mmr-19-04-2837"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Marenne</surname><given-names>G</given-names></name><name><surname>Real</surname><given-names>FX</given-names></name><name><surname>Rothman</surname><given-names>N</given-names></name><name><surname>Rodr&#x00ED;guez-Santiago</surname><given-names>B</given-names></name><name><surname>P&#x00E9;rez-Jurado</surname><given-names>L</given-names></name><name><surname>Kogevinas</surname><given-names>M</given-names></name><name><surname>Garc&#x00ED;a-Closas</surname><given-names>M</given-names></name><name><surname>Silverman</surname><given-names>DT</given-names></name><name><surname>Chanock</surname><given-names>SJ</given-names></name><name><surname>G&#x00E9;nin</surname><given-names>E</given-names></name><name><surname>Malats</surname><given-names>N</given-names></name></person-group><article-title>Genome-wide CNV analysis replicates the association between GSTM1 deletion and bladder cancer: A support for using continuous measurement from SNP-array data</article-title><source>BMC Genomics</source><volume>13</volume><fpage>326</fpage><year>2012</year><pub-id pub-id-type="doi">10.1186/1471-2164-13-326</pub-id><pub-id pub-id-type="pmid">22817656</pub-id><pub-id pub-id-type="pmcid">3425254</pub-id></element-citation></ref>
<ref id="b15-mmr-19-04-2837"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Peterson</surname><given-names>RE</given-names></name><name><surname>Maes</surname><given-names>HH</given-names></name><name><surname>Lin</surname><given-names>P</given-names></name><name><surname>Kramer</surname><given-names>JR</given-names></name><name><surname>Hesselbrock</surname><given-names>VM</given-names></name><name><surname>Bauer</surname><given-names>LO</given-names></name><name><surname>Nurnberger</surname><given-names>JI</given-names><suffix>Jr</suffix></name><name><surname>Edenberg</surname><given-names>HJ</given-names></name><name><surname>Dick</surname><given-names>DM</given-names></name><name><surname>Webb</surname><given-names>BT</given-names></name></person-group><article-title>On the association of common and rare genetic variation influencing body mass index: A combined SNP and CNV analysis</article-title><source>BMC Genomics</source><volume>15</volume><fpage>368</fpage><year>2014</year><pub-id pub-id-type="doi">10.1186/1471-2164-15-368</pub-id><pub-id pub-id-type="pmid">24884913</pub-id><pub-id pub-id-type="pmcid">4035084</pub-id></element-citation></ref>
<ref id="b16-mmr-19-04-2837"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>K</given-names></name><name><surname>Li</surname><given-names>M</given-names></name><name><surname>Hakonarson</surname><given-names>H</given-names></name></person-group><article-title>ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data</article-title><source>Nucleic Acids Res</source><volume>38</volume><fpage>e164</fpage><year>2010</year><pub-id pub-id-type="doi">10.1093/nar/gkq603</pub-id><pub-id pub-id-type="pmid">20601685</pub-id><pub-id pub-id-type="pmcid">2938201</pub-id></element-citation></ref>
<ref id="b17-mmr-19-04-2837"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Galehdari</surname><given-names>H</given-names></name><name><surname>Saki</surname><given-names>N</given-names></name><name><surname>Mohammadi-Asl</surname><given-names>J</given-names></name><name><surname>Rahim</surname><given-names>F</given-names></name></person-group><article-title>Meta-analysis diagnostic accuracy of SNP-based pathogenicity detection tools: A case of UTG1A1 gene mutations</article-title><source>Int J Mol Epidemiol Genet</source><volume>4</volume><fpage>77</fpage><lpage>85</lpage><year>2013</year><pub-id pub-id-type="pmid">23875061</pub-id><pub-id pub-id-type="pmcid">3709112</pub-id></element-citation></ref>
<ref id="b18-mmr-19-04-2837"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname><given-names>P</given-names></name><name><surname>Henikoff</surname><given-names>S</given-names></name><name><surname>Ng</surname><given-names>PC</given-names></name></person-group><article-title>Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm</article-title><source>Nat Protoc</source><volume>4</volume><fpage>1073</fpage><lpage>1081</lpage><year>2009</year><pub-id pub-id-type="doi">10.1038/nprot.2009.86</pub-id><pub-id pub-id-type="pmid">19561590</pub-id></element-citation></ref>
<ref id="b19-mmr-19-04-2837"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname><given-names>PC</given-names></name><name><surname>Henikoff</surname><given-names>S</given-names></name></person-group><article-title>SIFT: Predicting amino acid changes that affect protein function</article-title><source>Nucleic Acids Res</source><volume>31</volume><fpage>3812</fpage><lpage>3814</lpage><year>2003</year><pub-id pub-id-type="doi">10.1093/nar/gkg509</pub-id><pub-id pub-id-type="pmid">12824425</pub-id><pub-id pub-id-type="pmcid">168916</pub-id></element-citation></ref>
<ref id="b20-mmr-19-04-2837"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Choi</surname><given-names>Y</given-names></name><name><surname>Sims</surname><given-names>GE</given-names></name><name><surname>Murphy</surname><given-names>S</given-names></name><name><surname>Miller</surname><given-names>JR</given-names></name><name><surname>Chan</surname><given-names>AP</given-names></name></person-group><article-title>Predicting the functional effect of amino acid substitutions and indels</article-title><source>PLoS One</source><volume>7</volume><fpage>e46688</fpage><year>2012</year><pub-id pub-id-type="doi">10.1371/journal.pone.0046688</pub-id><pub-id pub-id-type="pmid">23056405</pub-id><pub-id pub-id-type="pmcid">3466303</pub-id></element-citation></ref>
<ref id="b21-mmr-19-04-2837"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>H</given-names></name><name><surname>Ruan</surname><given-names>J</given-names></name><name><surname>Durbin</surname><given-names>R</given-names></name></person-group><article-title>Mapping short DNA sequencing reads and calling variants using mapping quality scores</article-title><source>Genome Res</source><volume>18</volume><fpage>1851</fpage><lpage>1858</lpage><year>2008</year><pub-id pub-id-type="doi">10.1101/gr.078212.108</pub-id><pub-id pub-id-type="pmid">18714091</pub-id><pub-id pub-id-type="pmcid">2577856</pub-id></element-citation></ref>
<ref id="b22-mmr-19-04-2837"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wong</surname><given-names>KK</given-names></name><name><surname>deLeeuw</surname><given-names>RJ</given-names></name><name><surname>Dosanjh</surname><given-names>NS</given-names></name><name><surname>Kimm</surname><given-names>LR</given-names></name><name><surname>Cheng</surname><given-names>Z</given-names></name><name><surname>Horsman</surname><given-names>DE</given-names></name><name><surname>MacAulay</surname><given-names>C</given-names></name><name><surname>Ng</surname><given-names>RT</given-names></name><name><surname>Brown</surname><given-names>CJ</given-names></name><name><surname>Eichler</surname><given-names>EE</given-names></name><name><surname>Lam</surname><given-names>WL</given-names></name></person-group><article-title>A comprehensive analysis of common copy-number variations in the human genome</article-title><source>Am J Hum Genet</source><volume>80</volume><fpage>91</fpage><lpage>104</lpage><year>2007</year><pub-id pub-id-type="doi">10.1086/510560</pub-id><pub-id pub-id-type="pmid">17160897</pub-id></element-citation></ref>
<ref id="b23-mmr-19-04-2837"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname><given-names>Z</given-names></name><name><surname>Ventura</surname><given-names>M</given-names></name><name><surname>She</surname><given-names>X</given-names></name><name><surname>Khaitovich</surname><given-names>P</given-names></name><name><surname>Graves</surname><given-names>T</given-names></name><name><surname>Osoegawa</surname><given-names>K</given-names></name><name><surname>Church</surname><given-names>D</given-names></name><name><surname>DeJong</surname><given-names>P</given-names></name><name><surname>Wilson</surname><given-names>RK</given-names></name><name><surname>P&#x00E4;&#x00E4;bo</surname><given-names>S</given-names></name><etal/></person-group><article-title>A genome-wide comparison of recent chimpanzee and human segmental duplications</article-title><source>Nature</source><volume>437</volume><fpage>88</fpage><lpage>93</lpage><year>2005</year><pub-id pub-id-type="doi">10.1038/nature04000</pub-id><pub-id pub-id-type="pmid">16136132</pub-id></element-citation></ref>
<ref id="b24-mmr-19-04-2837"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Conrad</surname><given-names>DF</given-names></name><name><surname>Andrews</surname><given-names>TD</given-names></name><name><surname>Carter</surname><given-names>NP</given-names></name><name><surname>Hurles</surname><given-names>ME</given-names></name><name><surname>Pritchard</surname><given-names>JK</given-names></name></person-group><article-title>A high-resolution survey of deletion polymorphism in the human genome</article-title><source>Nat Genet</source><volume>38</volume><fpage>75</fpage><lpage>81</lpage><year>2006</year><pub-id pub-id-type="doi">10.1038/ng1697</pub-id><pub-id pub-id-type="pmid">16327808</pub-id></element-citation></ref>
<ref id="b25-mmr-19-04-2837"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McCarroll</surname><given-names>SA</given-names></name><name><surname>Hadnott</surname><given-names>TN</given-names></name><name><surname>Perry</surname><given-names>GH</given-names></name><name><surname>Sabeti</surname><given-names>PC</given-names></name><name><surname>Zody</surname><given-names>MC</given-names></name><name><surname>Barrett</surname><given-names>JC</given-names></name><name><surname>Dallaire</surname><given-names>S</given-names></name><name><surname>Gabriel</surname><given-names>SB</given-names></name><name><surname>Lee</surname><given-names>C</given-names></name><name><surname>Daly</surname><given-names>MJ</given-names></name><etal/></person-group><article-title>Common deletion polymorphisms in the human genome</article-title><source>Nat Genet</source><volume>38</volume><fpage>86</fpage><lpage>92</lpage><year>2006</year><pub-id pub-id-type="doi">10.1038/ng1696</pub-id><pub-id pub-id-type="pmid">16468122</pub-id></element-citation></ref>
<ref id="b26-mmr-19-04-2837"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hinds</surname><given-names>DA</given-names></name><name><surname>Kloek</surname><given-names>AP</given-names></name><name><surname>Jen</surname><given-names>M</given-names></name><name><surname>Chen</surname><given-names>X</given-names></name><name><surname>Frazer</surname><given-names>KA</given-names></name></person-group><article-title>Common deletions and SNPs are in linkage disequilibrium in the human genome</article-title><source>Nat Genet</source><volume>38</volume><fpage>82</fpage><lpage>85</lpage><year>2006</year><pub-id pub-id-type="doi">10.1038/ng1695</pub-id><pub-id pub-id-type="pmid">16327809</pub-id></element-citation></ref>
<ref id="b27-mmr-19-04-2837"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Iafrate</surname><given-names>AJ</given-names></name><name><surname>Feuk</surname><given-names>L</given-names></name><name><surname>Rivera</surname><given-names>MN</given-names></name><name><surname>Listewnik</surname><given-names>ML</given-names></name><name><surname>Donahoe</surname><given-names>PK</given-names></name><name><surname>Qi</surname><given-names>Y</given-names></name><name><surname>Scherer</surname><given-names>SW</given-names></name><name><surname>Lee</surname><given-names>C</given-names></name></person-group><article-title>Detection of large-scale variation in the human genome</article-title><source>Nat Genet</source><volume>36</volume><fpage>949</fpage><lpage>951</lpage><year>2004</year><pub-id pub-id-type="doi">10.1038/ng1416</pub-id><pub-id pub-id-type="pmid">15286789</pub-id></element-citation></ref>
<ref id="b28-mmr-19-04-2837"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tuzun</surname><given-names>E</given-names></name><name><surname>Sharp</surname><given-names>AJ</given-names></name><name><surname>Bailey</surname><given-names>JA</given-names></name><name><surname>Kaul</surname><given-names>R</given-names></name><name><surname>Morrison</surname><given-names>VA</given-names></name><name><surname>Pertz</surname><given-names>LM</given-names></name><name><surname>Haugen</surname><given-names>E</given-names></name><name><surname>Hayden</surname><given-names>H</given-names></name><name><surname>Albertson</surname><given-names>D</given-names></name><name><surname>Pinkel</surname><given-names>D</given-names></name><etal/></person-group><article-title>Fine-scale structural variation of the human genome</article-title><source>Nat Genet</source><volume>37</volume><fpage>727</fpage><lpage>732</lpage><year>2005</year><pub-id pub-id-type="doi">10.1038/ng1562</pub-id><pub-id pub-id-type="pmid">15895083</pub-id></element-citation></ref>
<ref id="b29-mmr-19-04-2837"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Redon</surname><given-names>R</given-names></name><name><surname>Ishikawa</surname><given-names>S</given-names></name><name><surname>Fitch</surname><given-names>KR</given-names></name><name><surname>Feuk</surname><given-names>L</given-names></name><name><surname>Perry</surname><given-names>GH</given-names></name><name><surname>Andrews</surname><given-names>TD</given-names></name><name><surname>Fiegler</surname><given-names>H</given-names></name><name><surname>Shapero</surname><given-names>MH</given-names></name><name><surname>Carson</surname><given-names>AR</given-names></name><name><surname>Chen</surname><given-names>W</given-names></name><etal/></person-group><article-title>Global variation in copy number in the human genome</article-title><source>Nature</source><volume>444</volume><fpage>444</fpage><lpage>454</lpage><year>2006</year><pub-id pub-id-type="doi">10.1038/nature05329</pub-id><pub-id pub-id-type="pmid">17122850</pub-id><pub-id pub-id-type="pmcid">2669898</pub-id></element-citation></ref>
<ref id="b30-mmr-19-04-2837"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sebat</surname><given-names>J</given-names></name><name><surname>Lakshmi</surname><given-names>B</given-names></name><name><surname>Troge</surname><given-names>J</given-names></name><name><surname>Alexander</surname><given-names>J</given-names></name><name><surname>Young</surname><given-names>J</given-names></name><name><surname>Lundin</surname><given-names>P</given-names></name><name><surname>M&#x00E5;n&#x00E9;r</surname><given-names>S</given-names></name><name><surname>Massa</surname><given-names>H</given-names></name><name><surname>Walker</surname><given-names>M</given-names></name><name><surname>Chi</surname><given-names>M</given-names></name><etal/></person-group><article-title>Large-scale copy number polymorphism in the human genome</article-title><source>Science</source><volume>305</volume><fpage>525</fpage><lpage>528</lpage><year>2004</year><pub-id pub-id-type="doi">10.1126/science.1098918</pub-id><pub-id pub-id-type="pmid">15273396</pub-id></element-citation></ref>
<ref id="b31-mmr-19-04-2837"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname><given-names>AJ</given-names></name><name><surname>Locke</surname><given-names>DP</given-names></name><name><surname>McGrath</surname><given-names>SD</given-names></name><name><surname>Cheng</surname><given-names>Z</given-names></name><name><surname>Bailey</surname><given-names>JA</given-names></name><name><surname>Vallente</surname><given-names>RU</given-names></name><name><surname>Pertz</surname><given-names>LM</given-names></name><name><surname>Clark</surname><given-names>RA</given-names></name><name><surname>Schwartz</surname><given-names>S</given-names></name><name><surname>Segraves</surname><given-names>R</given-names></name><etal/></person-group><article-title>Segmental duplications and copy-number variation in the human genome</article-title><source>Am J Hum Genet</source><volume>77</volume><fpage>78</fpage><lpage>88</lpage><year>2005</year><pub-id pub-id-type="doi">10.1086/431652</pub-id><pub-id pub-id-type="pmid">15918152</pub-id><pub-id pub-id-type="pmcid">1226196</pub-id></element-citation></ref>
<ref id="b32-mmr-19-04-2837"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Southern</surname><given-names>E</given-names></name></person-group><article-title>Southern blotting</article-title><source>Nat Protoc</source><volume>1</volume><fpage>518</fpage><lpage>525</lpage><year>2006</year><pub-id pub-id-type="doi">10.1038/nprot.2006.73</pub-id><pub-id pub-id-type="pmid">17406277</pub-id></element-citation></ref>
<ref id="b33-mmr-19-04-2837"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>G</given-names></name><name><surname>Li</surname><given-names>P</given-names></name><name><surname>Li</surname><given-names>YX</given-names></name><name><surname>Ye</surname><given-names>LZ</given-names></name></person-group><article-title>Coexistence of two &#x03B2;-globin gene deletions in a Chinese girl with &#x03B2;-thalassemia minor</article-title><source>Hemoglobin</source><volume>38</volume><fpage>70</fpage><lpage>72</lpage><year>2014</year><pub-id pub-id-type="doi">10.3109/03630269.2013.853673</pub-id><pub-id pub-id-type="pmid">24200214</pub-id></element-citation></ref>
<ref id="b34-mmr-19-04-2837"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Long</surname><given-names>J</given-names></name><name><surname>Ye</surname><given-names>X</given-names></name><name><surname>Lao</surname><given-names>K</given-names></name><name><surname>Pang</surname><given-names>W</given-names></name><name><surname>Weng</surname><given-names>X</given-names></name><name><surname>Fu</surname><given-names>K</given-names></name><name><surname>Yan</surname><given-names>S</given-names></name><name><surname>Sun</surname><given-names>L</given-names></name></person-group><article-title>Detection of three common &#x03B1;-thalassemia in non-deletion types and six common thalassemia in deletion types by QF-PCR</article-title><source>Clin Biochem</source><volume>46</volume><fpage>1860</fpage><lpage>1864</lpage><year>2013</year><pub-id pub-id-type="doi">10.1016/j.clinbiochem.2013.09.013</pub-id><pub-id pub-id-type="pmid">24070774</pub-id></element-citation></ref>
<ref id="b35-mmr-19-04-2837"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Soler</surname><given-names>L</given-names></name><name><surname>Labas</surname><given-names>V</given-names></name><name><surname>Th&#x00E9;lie</surname><given-names>A</given-names></name><name><surname>Grasseau</surname><given-names>I</given-names></name><name><surname>Teixeira-Gomes</surname><given-names>AP</given-names></name><name><surname>Blesbois</surname><given-names>E</given-names></name></person-group><article-title>Intact cell MALDI-TOF MS on sperm: A molecular test for male fertility diagnosis</article-title><source>Mol Cell Proteomics</source><volume>15</volume><fpage>1998</fpage><lpage>2010</lpage><year>2016</year><pub-id pub-id-type="doi">10.1074/mcp.M116.058289</pub-id></element-citation></ref>
<ref id="b36-mmr-19-04-2837"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Plengvidhya</surname><given-names>N</given-names></name><name><surname>Chanprasert</surname><given-names>K</given-names></name><name><surname>Tangjittipokin</surname><given-names>W</given-names></name><name><surname>Thongnoppakhun</surname><given-names>W</given-names></name><name><surname>Yenchitsomanus</surname><given-names>PT</given-names></name></person-group><article-title>Detection of CAPN10 copy number variation in Thai patients with type 2 diabetes by denaturing high performance liquid chromatography and real-time quantitative polymerase chain reaction</article-title><source>J Diabetes Invest</source><volume>6</volume><fpage>632</fpage><lpage>639</lpage><year>2015</year><pub-id pub-id-type="doi">10.1111/jdi.12341</pub-id></element-citation></ref>
<ref id="b37-mmr-19-04-2837"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hussein</surname><given-names>IR</given-names></name><name><surname>Magbooli</surname><given-names>A</given-names></name><name><surname>Huwait</surname><given-names>E</given-names></name><name><surname>Chaudhary</surname><given-names>A</given-names></name><name><surname>Bader</surname><given-names>R</given-names></name><name><surname>Gari</surname><given-names>M</given-names></name><name><surname>Ashgan</surname><given-names>F</given-names></name><name><surname>Alquaiti</surname><given-names>M</given-names></name><name><surname>Abuzenadah</surname><given-names>A</given-names></name><name><surname>AlQahtani</surname><given-names>M</given-names></name></person-group><article-title>Genome wide array-CGH and qPCR analysis for the identification of genome defects in Williams&#x0027; syndrome patients in Saudi Arabia</article-title><source>Mol Cytogenet</source><volume>9</volume><fpage>65</fpage><year>2016</year><pub-id pub-id-type="doi">10.1186/s13039-016-0266-4</pub-id><pub-id pub-id-type="pmid">27525043</pub-id><pub-id pub-id-type="pmcid">4981984</pub-id></element-citation></ref>
<ref id="b38-mmr-19-04-2837"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Miyagawa</surname><given-names>M</given-names></name><name><surname>Nishio</surname><given-names>SY</given-names></name><name><surname>Hattori</surname><given-names>M</given-names></name><name><surname>Moteki</surname><given-names>H</given-names></name><name><surname>Kobayashi</surname><given-names>Y</given-names></name><name><surname>Sato</surname><given-names>H</given-names></name><name><surname>Watanabe</surname><given-names>T</given-names></name><name><surname>Naito</surname><given-names>Y</given-names></name><name><surname>Oshikawa</surname><given-names>C</given-names></name><name><surname>Usami</surname><given-names>S</given-names></name></person-group><article-title>Mutations in the MYO15A gene are a significant cause of nonsyndromic hearing loss: Massively parallel DNA sequencing-based analysis</article-title><source>Ann Otol Rhinol Laryngol</source><volume>124</volume><supplement>(Suppl 1)</supplement><fpage>158S</fpage><lpage>168S</lpage><year>2015</year><pub-id pub-id-type="doi">10.1177/0003489415575058</pub-id><pub-id pub-id-type="pmid">25792667</pub-id></element-citation></ref>
<ref id="b39-mmr-19-04-2837"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Livak</surname><given-names>KJ</given-names></name><name><surname>Schmittgen</surname><given-names>TD</given-names></name></person-group><article-title>Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) method</article-title><source>Methods</source><volume>25</volume><fpage>402</fpage><lpage>408</lpage><year>2001</year><pub-id pub-id-type="doi">10.1006/meth.2001.1262</pub-id><pub-id pub-id-type="pmid">11846609</pub-id></element-citation></ref>
<ref id="b40-mmr-19-04-2837"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Piorkowski</surname><given-names>G</given-names></name><name><surname>Baronti</surname><given-names>C</given-names></name><name><surname>de Lamballerie</surname><given-names>X</given-names></name><name><surname>de Fabritus</surname><given-names>L</given-names></name><name><surname>Bichaud</surname><given-names>L</given-names></name><name><surname>Pastorino</surname><given-names>BA</given-names></name><name><surname>Bessaud</surname><given-names>M</given-names></name></person-group><article-title>Development of generic Taqman PCR and RT-PCR assays for the detection of DNA and mRNA of &#x03B2;-actin-encoding sequences in a wide range of animal species</article-title><source>J Virol Methods</source><volume>202</volume><fpage>101</fpage><lpage>105</lpage><year>2014</year><pub-id pub-id-type="doi">10.1016/j.jviromet.2014.02.026</pub-id><pub-id pub-id-type="pmid">24642236</pub-id></element-citation></ref>
<ref id="b41-mmr-19-04-2837"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>J</given-names></name><name><surname>Lin</surname><given-names>Q</given-names></name><name><surname>Lin</surname><given-names>J</given-names></name><name><surname>Ye</surname><given-names>X</given-names></name></person-group><article-title>Selection and validation of reference genes for quantitative real-time polymerase chain reaction studies in mossy maze polypore, Cerrena unicolor (Higher Basidiomycetes)</article-title><source>Int J Med Mushrooms</source><volume>18</volume><fpage>165</fpage><lpage>175</lpage><year>2016</year><pub-id pub-id-type="doi">10.1615/IntJMedMushrooms.v18.i2.70</pub-id><pub-id pub-id-type="pmid">27279538</pub-id></element-citation></ref>
<ref id="b42-mmr-19-04-2837"><label>42</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Merikangas</surname><given-names>AK</given-names></name><name><surname>Segurado</surname><given-names>R</given-names></name><name><surname>Heron</surname><given-names>EA</given-names></name><name><surname>Anney</surname><given-names>RJ</given-names></name><name><surname>Paterson</surname><given-names>AD</given-names></name><name><surname>Cook</surname><given-names>EH</given-names></name><name><surname>Pinto</surname><given-names>D</given-names></name><name><surname>Scherer</surname><given-names>SW</given-names></name><name><surname>Szatmari</surname><given-names>P</given-names></name><name><surname>Gill</surname><given-names>M</given-names></name><etal/></person-group><article-title>The phenotypic manifestations of rare genic CNVs in autism spectrum disorder</article-title><source>Mol Psychiatry</source><volume>20</volume><fpage>1366</fpage><lpage>1372</lpage><year>2015</year><pub-id pub-id-type="doi">10.1038/mp.2014.150</pub-id><pub-id pub-id-type="pmid">25421404</pub-id></element-citation></ref>
<ref id="b43-mmr-19-04-2837"><label>43</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rudd</surname><given-names>DS</given-names></name><name><surname>Axelsen</surname><given-names>M</given-names></name><name><surname>Epping</surname><given-names>EA</given-names></name><name><surname>Andreasen</surname><given-names>NC</given-names></name><name><surname>Wassink</surname><given-names>TH</given-names></name></person-group><article-title>A genome-wide CNV analysis of schizophrenia reveals a potential role for a multiple-hit model</article-title><source>Am J Med Genet B Neuropsychiatr Genet</source><volume>165B</volume><fpage>619</fpage><lpage>626</lpage><year>2014</year><pub-id pub-id-type="doi">10.1002/ajmg.b.32266</pub-id></element-citation></ref>
<ref id="b44-mmr-19-04-2837"><label>44</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wain</surname><given-names>LV</given-names></name><name><surname>Pedroso</surname><given-names>I</given-names></name><name><surname>Landers</surname><given-names>JE</given-names></name><name><surname>Breen</surname><given-names>G</given-names></name><name><surname>Shaw</surname><given-names>CE</given-names></name><name><surname>Leigh</surname><given-names>PN</given-names></name><name><surname>Brown</surname><given-names>RH</given-names></name><name><surname>Tobin</surname><given-names>MD</given-names></name><name><surname>Al-Chalabi</surname><given-names>A</given-names></name></person-group><article-title>The role of copy number variation in susceptibility to amyotrophic lateral sclerosis: Genome-wide association study and comparison with published loci</article-title><source>PLoS One</source><volume>4</volume><fpage>e8175</fpage><year>2009</year><pub-id pub-id-type="doi">10.1371/journal.pone.0008175</pub-id><pub-id pub-id-type="pmid">19997636</pub-id><pub-id pub-id-type="pmcid">2780722</pub-id></element-citation></ref>
<ref id="b45-mmr-19-04-2837"><label>45</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>White</surname><given-names>SJ</given-names></name><name><surname>den Dunnen</surname><given-names>JT</given-names></name></person-group><article-title>Copy number variation in the genome; the human DMD gene as an example</article-title><source>Cytogenet Genome Res</source><volume>115</volume><fpage>240</fpage><lpage>246</lpage><year>2006</year><pub-id pub-id-type="doi">10.1159/000095920</pub-id><pub-id pub-id-type="pmid">17124406</pub-id></element-citation></ref>
<ref id="b46-mmr-19-04-2837"><label>46</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>W</given-names></name><name><surname>Ding</surname><given-names>J</given-names></name><name><surname>Long</surname><given-names>J</given-names></name><name><surname>Liu</surname><given-names>Z</given-names></name><name><surname>Zhou</surname><given-names>X</given-names></name><name><surname>Shi</surname><given-names>D</given-names></name></person-group><article-title>DNA copy number profiling in microsatellite-stable and microsatellite-unstable hereditary non-polyposis colorectal cancers by targeted CNV array</article-title><source>Funct Integr Genomics</source><volume>17</volume><fpage>85</fpage><lpage>96</lpage><year>2017</year><pub-id pub-id-type="doi">10.1007/s10142-016-0532-x</pub-id><pub-id pub-id-type="pmid">27896456</pub-id></element-citation></ref>
<ref id="b47-mmr-19-04-2837"><label>47</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>L</given-names></name><name><surname>Liu</surname><given-names>B</given-names></name><name><surname>Qiu</surname><given-names>F</given-names></name><name><surname>Huang</surname><given-names>B</given-names></name><name><surname>Li</surname><given-names>Y</given-names></name><name><surname>Huang</surname><given-names>D</given-names></name><name><surname>Yang</surname><given-names>R</given-names></name><name><surname>Yang</surname><given-names>X</given-names></name><name><surname>Deng</surname><given-names>J</given-names></name><name><surname>Jiang</surname><given-names>Q</given-names></name><etal/></person-group><article-title>The effect of functional MAPKAPK2 copy number variation CNV-30450 on elevating nasopharyngeal carcinoma risk is modulated by EBV infection</article-title><source>Carcinogenesis</source><volume>35</volume><fpage>46</fpage><lpage>52</lpage><year>2014</year><pub-id pub-id-type="doi">10.1093/carcin/bgt314</pub-id><pub-id pub-id-type="pmid">24056810</pub-id></element-citation></ref>
<ref id="b48-mmr-19-04-2837"><label>48</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>Y</given-names></name><name><surname>Tan</surname><given-names>X</given-names></name><name><surname>Ding</surname><given-names>Y</given-names></name><name><surname>Mai</surname><given-names>B</given-names></name><name><surname>Huang</surname><given-names>X</given-names></name><name><surname>Hu</surname><given-names>G</given-names></name><name><surname>Luo</surname><given-names>X</given-names></name></person-group><article-title>WWOX CNV-67048 functions as a risk factor for epithelial ovarian cancer in Chinese women by negatively interacting with oral contraceptive use</article-title><source>Biomed Res Int</source><volume>2016</volume><fpage>6594039</fpage><year>2016</year></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-mmr-19-04-2837" position="float">
<label>Figure 1.</label>
<caption><p>Number of sequenced bases and mean read length for the 128 samples.</p></caption>
<graphic xlink:href="MMR-19-04-2837-g00.tif"/>
</fig>
<fig id="f2-mmr-19-04-2837" position="float">
<label>Figure 2.</label>
<caption><p>DNA sequences, obtained by Sanger sequencing, of the samples carrying single nucleotide variants in the HBB gene. HBB, &#x03B2;-globin.</p></caption>
<graphic xlink:href="MMR-19-04-2837-g01.tif"/>
</fig>
<fig id="f3-mmr-19-04-2837" position="float">
<label>Figure 3.</label>
<caption><p>DNA sequences, obtained by Sanger sequencing, of the samples carrying single nucleotide variants in the HBA2 and HBQ1 genes. HBA2, &#x03B1;2-globin; HBQ1, hemoglobin subunit &#x03B8;1.</p></caption>
<graphic xlink:href="MMR-19-04-2837-g02.tif"/>
</fig>
<fig id="f4-mmr-19-04-2837" position="float">
<label>Figure 4.</label>
<caption><p>Comparison of aligned reads obtained with Align MAQ&#x003E;0 and Align MAQ&#x003E;10. The red dashed boxes indicate where the number of aligned reads was close to 0 with Align MAQ&#x003E;10, in contrast to Align MAQ&#x003E;0. (A) Aligned reads of six samples of -<sup>SEA</sup>/-&#x03B1;<sup>3.7</sup>. (B) Aligned reads of three samples of -<sup>SEA</sup>/-&#x03B1;<sup>4.2</sup>.</p></caption>
<graphic xlink:href="MMR-19-04-2837-g03.tif"/>
</fig>
<fig id="f5-mmr-19-04-2837" position="float">
<label>Figure 5.</label>
<caption><p>Evaluation of reference amplicons. (A) Normalized ref-reads of the reference amplicons. &#x002A;&#x002A;&#x002A;P&#x003C;0.001. (B) Baseline built from a cluster of reference read ratios. (C) Coefficient of variation of each amplicon.</p></caption>
<graphic xlink:href="MMR-19-04-2837-g04.tif"/>
</fig>
<fig id="f6-mmr-19-04-2837" position="float">
<label>Figure 6.</label>
<caption><p>A, B, C, D read ratios. (A) A, (B) B, (C) C and (D) D read ratios were obtained for each sample in seven groups. &#x002A;&#x002A;&#x002A;P&#x003C;0.001. n.s., not significant.</p></caption>
<graphic xlink:href="MMR-19-04-2837-g05.tif"/>
</fig>
<table-wrap id="tI-mmr-19-04-2837" position="float">
<label>Table I.</label>
<caption><p>Pathogenic or likely pathogenic alleles detected by next-generation sequencing.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">dbSNP ID</th>
<th align="center" valign="bottom">cDNA change</th>
<th align="center" valign="bottom">Amino acid change</th>
<th align="center" valign="bottom">Function</th>
<th align="center" valign="bottom">Gene</th>
<th align="center" valign="bottom">Exonic function</th>
<th align="center" valign="bottom">Clinical significance</th>
<th align="center" valign="bottom">1000g2015aug_all</th>
<th align="center" valign="bottom">ExAC_ALL</th>
<th align="center" valign="bottom">Sample number</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">rs34484056</td>
<td align="left" valign="top">NM_000518.4:c.341T&#x003E;A</td>
<td align="left" valign="top">NP_000509.1:p.Val114Glu</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">Nonsynonymous</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">0.0000165</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs34451549</td>
<td align="left" valign="top">NM_000518.4:c.316-197C&#x003E;T</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBB</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">With pathogenic allele</td>
<td/>
<td/>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs33969853</td>
<td align="left" valign="top">NM_000518.4:c.216_217insA</td>
<td align="left" valign="top">NP_000509.1:p.Ser73Lysfs</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">Frameshift insertion</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs281864900</td>
<td align="left" valign="top">NM_000518.4:c.126_129delCTTT</td>
<td align="left" valign="top">NP_000509.1:p.Phe42Leufs</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">Frameshift deletion</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">0.0010</td>
<td align="left" valign="top">0.0003</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs33986703</td>
<td align="left" valign="top">NM_000518.4:c.52A&#x003E;T</td>
<td align="left" valign="top">NP_000509.1:p.Lys18Ter</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">Stopgain</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">0.0012</td>
<td align="left" valign="top">0.0000165</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs35383398</td>
<td align="left" valign="top">NM_000518.4:c.45_46insG</td>
<td align="left" valign="top">NP_000509.1:p.Trp16Valfs</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">Frameshift insertion</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs33931746</td>
<td align="left" valign="top">NM_000518.4:c.-78A&#x003E;G</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBB</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;5</td>
</tr>
<tr>
<td align="left" valign="top">rs41464951</td>
<td align="left" valign="top">NM_000517.4:c.427T&#x003E;C</td>
<td align="left" valign="top">NP_000508.1:p.Ter143Gln</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">Stoploss</td>
<td align="left" valign="top">With pathogenic allele</td>
<td align="center" valign="top">0.0002</td>
<td/>
<td align="center" valign="top">14</td>
</tr>
<tr>
<td align="left" valign="top">rs41397847</td>
<td align="left" valign="top">NM_000517.4:c.377T&#x003E;C</td>
<td align="left" valign="top">NP_000508.1:p.Leu126Pro</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">Nonsynonymous</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x2013;</td>
<td align="left" valign="top">0.0001</td>
<td align="center" valign="top">&#x00A0;&#x00A0;5</td>
</tr>
<tr>
<td align="left" valign="top">rs41479347</td>
<td align="left" valign="top">NM_000517.4:c.369C&#x003E;G</td>
<td align="left" valign="top">NP_000508.1:p.His123Gln</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">Nonsynonymous</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">0.0002</td>
<td align="left" valign="top">0.0001</td>
<td align="center" valign="top">&#x00A0;&#x00A0;3</td>
</tr>
<tr>
<td align="left" valign="top">rs184435680</td>
<td align="left" valign="top">NM_005331.4:c.239C&#x003E;T</td>
<td align="left" valign="top">NP_005322.1:p.Ala80Val</td>
<td align="left" valign="top">Exonic</td>
<td align="left" valign="top">HBQ1</td>
<td align="left" valign="top">Nonsynonymous</td>
<td align="left" valign="top">NA</td>
<td align="center" valign="top">0.0026</td>
<td align="left" valign="top">0.0013</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-mmr-19-04-2837"><p>NA, not applicable.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tII-mmr-19-04-2837" position="float">
<label>Table II.</label>
<caption><p>Predicted effects of the amino acid changes on protein function for the likely pathogenic alleles.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">dbSNP ID</th>
<th align="center" valign="bottom">SIFT_score</th>
<th align="center" valign="bottom">Polyphen2_HDIV_score</th>
<th align="center" valign="bottom">Polyphen2_HVAR_score</th>
<th align="center" valign="bottom">PROVEAN_score</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">rs184435680</td>
<td align="center" valign="top">0.001</td>
<td align="center" valign="top">0.979</td>
<td align="center" valign="top">0.162</td>
<td align="center" valign="top">&#x2212;3.47</td>
</tr>
<tr>
<td align="left" valign="top">rs41397847</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">0.997</td>
<td align="center" valign="top">&#x2212;5.01</td>
</tr>
<tr>
<td align="left" valign="top">rs41479347</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">0.866</td>
<td align="center" valign="top">0.76</td>
<td align="center" valign="top">&#x2212;5.74</td>
</tr>
<tr>
<td align="left" valign="top">Categorical prediction</td>
<td align="center" valign="top">D: deleterious (sift&#x003C;=0.05); T: tolerated (sift&#x003E;0.05)</td>
<td align="center" valign="top">D: probably damaging (&#x003E;=0.957), P: possibly damaging (0.453&#x003C;=pp2_hdiv&#x003C;=0.956); B: benign (pp2_hdiv&#x003C;=0.452)</td>
<td align="center" valign="top">D: probably damaging (&#x003E;=0.909), P: possibly damaging (0.447&#x003C;=pp2_hdiv&#x003C;=0.909); B: benign (pp2_hdiv&#x003C;=0.446)</td>
<td align="center" valign="top">D: deleterious (provean&#x003C;=&#x2212;2.5); T: tolerated (provean&#x003E;&#x2212;2.5)</td>
</tr>
</tbody>
</table>
</table-wrap>
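<boxed-text position="float">
<p>The categorical cut-offs listed in the last row of Table II can be applied to the scores programmatically. The following is a minimal illustrative sketch in Python and is not the annotation pipeline used in this study. The function names (classify_sift, classify_polyphen2, classify_provean, classify_variant) and the TABLE_II_SCORES dictionary are hypothetical names introduced here for illustration; the thresholds and scores are those shown in Table II.</p>
<preformat preformat-type="code">
```python
from typing import Dict, Tuple


def classify_sift(score: float) -> str:
    """Return 'D' (deleterious) when the score is at most 0.05, else 'T'."""
    return "T" if score > 0.05 else "D"


def classify_polyphen2(score: float, d_cutoff: float, p_cutoff: float) -> str:
    """Return 'D' (probably damaging), 'P' (possibly damaging) or 'B' (benign).

    Scores at or above d_cutoff are 'D'; scores at or above p_cutoff are 'P';
    everything else is 'B'.
    """
    if score >= d_cutoff:
        return "D"
    if score >= p_cutoff:
        return "P"
    return "B"


def classify_provean(score: float) -> str:
    """Return 'D' (deleterious) when the score is at most -2.5, else 'T'."""
    return "T" if score > -2.5 else "D"


def classify_variant(sift: float, hdiv: float, hvar: float,
                     provean: float) -> Dict[str, str]:
    """Apply the Table II cut-offs to one variant's four scores."""
    return {
        "SIFT": classify_sift(sift),
        "Polyphen2_HDIV": classify_polyphen2(hdiv, 0.957, 0.453),
        "Polyphen2_HVAR": classify_polyphen2(hvar, 0.909, 0.447),
        "PROVEAN": classify_provean(provean),
    }


# Scores copied from Table II: (SIFT, HDIV, HVAR, PROVEAN).
TABLE_II_SCORES: Dict[str, Tuple[float, float, float, float]] = {
    "rs184435680": (0.001, 0.979, 0.162, -3.47),
    "rs41397847": (0.0, 1.0, 0.997, -5.01),
    "rs41479347": (0.0, 0.866, 0.76, -5.74),
}

if __name__ == "__main__":
    for rsid, scores in TABLE_II_SCORES.items():
        print(rsid, classify_variant(*scores))
```
</preformat>
<p>Under these cut-offs, each of the three variants is classed as deleterious by SIFT and PROVEAN, whereas the two PolyPhen-2 models can disagree for the same variant, as they do for rs184435680.</p>
</boxed-text>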
<table-wrap id="tIII-mmr-19-04-2837" position="float">
<label>Table III.</label>
<caption><p>Alleles with unclear clinical significance or polymorphisms.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">dbSNP ID</th>
<th align="center" valign="bottom">Location</th>
<th align="center" valign="bottom">Gene</th>
<th align="center" valign="bottom">MAF (1000g2015aug_all)</th>
<th align="center" valign="bottom">Sample number</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">rs184435680</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBQ1</td>
<td align="left" valign="top">T=0.0026/13</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs2541669</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">T=0.3423/1714</td>
<td align="center" valign="top">&#x00A0;&#x00A0;3</td>
</tr>
<tr>
<td align="left" valign="top">rs281864524</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">T=0.0006/3</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs565600725</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">T=0.0002/1</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs180783444</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">A=0.0016/8</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs551376957</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBA1</td>
<td align="left" valign="top">C=0.0002/1</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs571103784</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">C=0.0002/1</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs75154897</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.0014/7</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs570069684</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">C=0.0004/2</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs529931134</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">G=0.0006/3</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs556749777</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.0006/3</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs14010613</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">A=0.0012/6</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs75154897</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.0014/7</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs76306358</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">C=0.0018/9</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs189144293</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBQ1</td>
<td align="left" valign="top">A=0.0024/12</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs181879924</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBQ1</td>
<td align="left" valign="top">A=0.0024/12</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs376289816</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">T=0.0036/18</td>
<td align="center" valign="top">&#x00A0;&#x00A0;2</td>
</tr>
<tr>
<td align="left" valign="top">rs200410739</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">&#x2212;=0.0046/23</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs181734727</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBA2, HBA1</td>
<td align="left" valign="top">A=0.0068/34</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs193110122</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.0080/40</td>
<td align="center" valign="top">11</td>
</tr>
<tr>
<td align="left" valign="top">chr11:5247070G&#x003E;T</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBB</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs11431675</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBM</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">81</td>
</tr>
<tr>
<td align="left" valign="top">rs377158360</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">chr16:220861delC</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs373693318</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBA2</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;3</td>
</tr>
<tr>
<td align="left" valign="top">chr16:223997C&#x003E;G</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBA2</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">chr16:228779A&#x003E;C</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBA1, HBQ1</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">chr16:229068T&#x003E;C</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBA1, HBQ1</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs117470710</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBQ1</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">chr16:230614C&#x003E;A</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBQ1</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs5018713</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">126</td>
</tr>
<tr>
<td align="left" valign="top">chr16:233238G&#x003E;C</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">chr16:233605C&#x003E;T</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs67113805</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="center" valign="top">&#x2013;</td>
<td align="center" valign="top">41</td>
</tr>
<tr>
<td align="left" valign="top">rs3760046</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBA1</td>
<td align="left" valign="top">C=0.0120/60</td>
<td align="center" valign="top">&#x00A0;&#x00A0;7</td>
</tr>
<tr>
<td align="left" valign="top">rs75368786</td>
<td align="left" valign="top">3&#x2032; UTR</td>
<td align="left" valign="top">HBM</td>
<td align="left" valign="top">A=0.0198/99</td>
<td align="center" valign="top">13</td>
</tr>
<tr>
<td align="left" valign="top">rs2238370</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">A=0.0304/152</td>
<td align="center" valign="top">13</td>
</tr>
<tr>
<td align="left" valign="top">rs72763686</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.0389/195</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs72763688</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">T=0.0413/207</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs72763685</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBA2, HBA1</td>
<td align="left" valign="top">A=0.0425/213</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs72763684</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBA2, HBA1</td>
<td align="left" valign="top">T=0.0447/224</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs28444102</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">T=0.0561/281</td>
<td align="center" valign="top">&#x00A0;&#x00A0;1</td>
</tr>
<tr>
<td align="left" valign="top">rs78502923</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">T=0.0405/203</td>
<td align="center" valign="top">15</td>
</tr>
<tr>
<td align="left" valign="top">rs12574989</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">T=0.0465/233</td>
<td align="center" valign="top">25</td>
</tr>
<tr>
<td align="left" valign="top">rs1203834</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBQ1</td>
<td align="left" valign="top">T=0.0703/352</td>
<td align="center" valign="top">15</td>
</tr>
<tr>
<td align="left" valign="top">rs7946748</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">A=0.0992/497</td>
<td align="center" valign="top">&#x00A0;&#x00A0;6</td>
</tr>
<tr>
<td align="left" valign="top">rs2685118</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.1645/824</td>
<td align="center" valign="top">21</td>
</tr>
<tr>
<td align="left" valign="top">rs11639532</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBA2, HBA1</td>
<td align="left" valign="top">A=0.1975/989</td>
<td align="center" valign="top">21</td>
</tr>
<tr>
<td align="left" valign="top">rs1203833</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">C=0.2196/1100</td>
<td align="center" valign="top">15</td>
</tr>
<tr>
<td align="left" valign="top">rs2858016</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">T=0.2466/1235</td>
<td align="center" valign="top">13</td>
</tr>
<tr>
<td align="left" valign="top">rs10837631</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">A=0.2480/1242</td>
<td align="center" valign="top">54</td>
</tr>
<tr>
<td align="left" valign="top">rs2541677</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBM</td>
<td align="left" valign="top">A=0.2943/1474</td>
<td align="center" valign="top">&#x00A0;&#x00A0;4</td>
</tr>
<tr>
<td align="left" valign="top">rs2858935</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBM</td>
<td align="left" valign="top">C=0.3181/1593</td>
<td align="center" valign="top">19</td>
</tr>
<tr>
<td align="left" valign="top">rs3859140</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">C=0.3379/1692</td>
<td align="center" valign="top">65</td>
</tr>
<tr>
<td align="left" valign="top">rs2238369</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBA2</td>
<td align="left" valign="top">C=0.3550/1778</td>
<td align="center" valign="top">57</td>
</tr>
<tr>
<td align="left" valign="top">rs78928216</td>
<td align="left" valign="top">Downstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">C=0.3614/1810</td>
<td align="center" valign="top">33</td>
</tr>
<tr>
<td align="left" valign="top">rs7480526</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">C=0.3690/1848</td>
<td align="center" valign="top">52</td>
</tr>
<tr>
<td align="left" valign="top">rs56308933</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">T=0.4077/2042</td>
<td align="center" valign="top">94</td>
</tr>
<tr>
<td align="left" valign="top">rs3859139</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">C=0.4225/2116</td>
<td align="center" valign="top">63</td>
</tr>
<tr>
<td align="left" valign="top">rs57397665</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">T=0.4637/2322</td>
<td align="center" valign="top">50</td>
</tr>
<tr>
<td align="left" valign="top">rs28673162</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBQ1, LUC7L</td>
<td align="left" valign="top">A=0.4858/2433</td>
<td align="center" valign="top">38</td>
</tr>
<tr>
<td align="left" valign="top">rs2974771</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">T=0.4748/2378</td>
<td align="center" valign="top">85</td>
</tr>
<tr>
<td align="left" valign="top">rs10742583</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBB</td>
<td align="left" valign="top">G=0.2817/1411</td>
<td align="center" valign="top">94</td>
</tr>
<tr>
<td align="left" valign="top">rs2858942</td>
<td align="left" valign="top">Upstream</td>
<td align="left" valign="top">HBA1</td>
<td align="left" valign="top">A=0.2616/1310</td>
<td align="center" valign="top">69</td>
</tr>
<tr>
<td align="left" valign="top">rs11863726</td>
<td align="left" valign="top">Intronic</td>
<td align="left" valign="top">HBQ1</td>
<td align="left" valign="top">G=0.2039/1021</td>
<td align="center" valign="top">11</td>
</tr>
<tr>
<td align="left" valign="top">rs2541675</td>
<td align="left" valign="top">Intergenic</td>
<td align="left" valign="top">HBM, HBA2</td>
<td align="left" valign="top">A=0.2560/1282</td>
<td align="center" valign="top">70</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-mmr-19-04-2837"><p>MAF, minor allele frequency, given as minor allele=frequency/allele count; &#x2013;, no frequency reported.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>