<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "journalpublishing3.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<?release-delay 0|0?>
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">OL</journal-id>
<journal-title-group>
<journal-title>Oncology Letters</journal-title>
</journal-title-group>
<issn pub-type="ppub">1792-1074</issn>
<issn pub-type="epub">1792-1082</issn>
<publisher>
<publisher-name>D.A. Spandidos</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3892/ol.2022.13637</article-id>
<article-id pub-id-type="publisher-id">OL-25-02-13637</article-id>
<article-categories>
<subj-group>
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Genetic variants in African-American and Hispanic patients with breast cancer</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Dutta</surname><given-names>Pranabananda</given-names></name>
<xref rid="af1-ol-25-02-13637" ref-type="aff">1</xref>
<xref rid="c1-ol-25-02-13637" ref-type="corresp"/></contrib>
<contrib contrib-type="author"><name><surname>Keung</surname><given-names>Man Y.</given-names></name>
<xref rid="af1-ol-25-02-13637" ref-type="aff">1</xref></contrib>
<contrib contrib-type="author"><name><surname>Wu</surname><given-names>Yanyuan</given-names></name>
<xref rid="af1-ol-25-02-13637" ref-type="aff">1</xref>
<xref rid="af2-ol-25-02-13637" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author"><name><surname>Vadgama</surname><given-names>Jaydutt V.</given-names></name>
<xref rid="af1-ol-25-02-13637" ref-type="aff">1</xref>
<xref rid="af2-ol-25-02-13637" ref-type="aff">2</xref>
<xref rid="c1-ol-25-02-13637" ref-type="corresp"/></contrib>
</contrib-group>
<aff id="af1-ol-25-02-13637"><label>1</label>Division of Cancer Research and Training, Charles R. Drew University of Medicine and Science, Los Angeles, CA 90059, USA</aff>
<aff id="af2-ol-25-02-13637"><label>2</label>David Geffen UCLA School of Medicine, Jonsson Comprehensive Cancer Center, University of California at Los Angeles, Los Angeles, CA 90095, USA</aff>
<author-notes>
<corresp id="c1-ol-25-02-13637"><italic>Correspondence to:</italic> Dr Pranabananda Dutta or Dr Jaydutt V. Vadgama, Division of Cancer Research and Training, Charles R. Drew University of Medicine and Science, 1748 E 118th St, Los Angeles, CA 90059, USA, E-mail: <email>pranabandutta@cdrewu.edu</email>, E-mail: <email>jayvadgama@cdrewu.edu</email>; <email>jvadgama@ucla.edu</email></corresp>
</author-notes>
<pub-date pub-type="collection">
<month>02</month>
<year>2023</year></pub-date>
<pub-date pub-type="epub">
<day>16</day>
<month>12</month>
<year>2022</year></pub-date>
<volume>25</volume>
<issue>2</issue>
<elocation-id>51</elocation-id>
<history>
<date date-type="received"><day>23</day><month>08</month><year>2022</year></date>
<date date-type="accepted"><day>31</day><month>10</month><year>2022</year></date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; Dutta et al.</copyright-statement>
<copyright-year>2022</copyright-year>
<license license-type="open-access">
<license-p>This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">Creative Commons Attribution-NonCommercial-NoDerivs License</ext-link>, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.</license-p></license>
</permissions>
<abstract>
<p>Breast cancer is a disease with significant health disparity affecting mortality in minority women. The present study examined the genetic makeup of breast cancers in African-American and Hispanic/Latinx patients to determine specific genetic mutations associated with breast cancer in the minority population from South Los Angeles, United States. Whole-exome sequencing was performed on DNA extracted from breast cancer tumor biopsies collected from 13 African-American and 15 Hispanic women and 8 matched-normal samples for each ethnic category. The results were analyzed using Ensemble Variant Effect Predictor and Mutation Significance. Additionally, a comparative analysis with The Cancer Genome Atlas data was provided. Our data revealed somatic mutations in genes such as SET domain containing (lysine methyltransferase) 8, serine protease 1 and AT-rich interaction domain 1B (<italic>ARID1B</italic>) and known breast cancer genes, such as <italic>BRCA1/2, TP53</italic> and the DNA damage response genes across all ethnicities. Additionally, Hispanic patients had BRCA1 associated RING domain 1B (<italic>BARD1</italic>) variants, while African-American patients had higher numbers of nonsynonymous variants in the RAD51 paralog B (<italic>RAD51B</italic>), <italic>ARID1B</italic> and X-ray repair cross complementing 3 (<italic>XRCC3</italic>) genes. In addition, our patients exhibited mutational signature enrichment that indicated DNA homologous recombination repair deficiencies. Therefore, African-American and Hispanic breast cancer samples showed considerable overlap in breast cancer genetic mutations. However, there are differences in specific genetic variants in <italic>TP53, BRCA1/2, BARD1</italic> or <italic>ARID1B</italic>, which will require further study of their role in tumorigenesis.</p>
</abstract>
<kwd-group>
<kwd>breast cancer</kwd>
<kwd>variants</kwd>
<kwd>health disparity</kwd>
<kwd>exome</kwd>
<kwd>African-American</kwd>
<kwd>Hispanic</kwd>
</kwd-group>
<funding-group>
<award-group>
<funding-source>NIH/NCI/NIMHD</funding-source>
<award-id>1U54CA14393</award-id>
<award-id>U54MD007598</award-id>
</award-group>
<award-group>
<funding-source>National Institute on Minority Health and Health Disparities of the National Institutes of Health</funding-source>
<award-id>S21 MD000103</award-id>
</award-group>
<funding-statement>The research was funded by NIH/NCI/NIMHD (grant nos. 1U54CA14393 and U54MD007598). Research reported in this publication was supported by the National Institute on Minority Health and Health Disparities of the National Institutes of Health (grant no. S21 MD000103).</funding-statement>
</funding-group>
</article-meta>
</front>
<body>
<sec sec-type="intro">
<title>Introduction</title>
<p>Breast cancer is one of the leading cancers in women, with 1 in 8 women showing a lifetime risk of developing it (<xref rid="b1-ol-25-02-13637" ref-type="bibr">1</xref>). Breast cancer is heterogeneous, and patient demographics show considerable health disparities (<xref rid="b2-ol-25-02-13637" ref-type="bibr">2</xref>,<xref rid="b3-ol-25-02-13637" ref-type="bibr">3</xref>). For example, African-American women are more likely to develop breast cancer at a younger age and suffer from an aggressive sub-type called triple-negative breast cancer more often than their white counterparts (<xref rid="b4-ol-25-02-13637" ref-type="bibr">4</xref>). Next-generation sequencing technologies have been used to study cancer genomics to determine causative mutations leading to the disease. Whole-Genome (WGS) and Whole-Exome sequencing (WES) technologies are used for this purpose. Many large consortia made an effort to understand cancer biology using these technologies. The Cancer Genome Atlas (TCGA) is one of the most significant multicenter initiatives that use exome and genome sequencing for all cancers. The data from TCGA showed that mutations in <italic>PIK3C, PTEN, TP53</italic>, and <italic>CDH1</italic> are highly enriched in breast cancer samples with an increased risk or less overall survival (<xref rid="b5-ol-25-02-13637" ref-type="bibr">5</xref>). However, the TCGA dataset primarily has samples from white patients, even with contributions from African-American or Hispanic patients. Thus, a significant effort is underway to understand the effect of ancestry/ethnicity on breast cancer (<xref rid="b6-ol-25-02-13637" ref-type="bibr">6</xref>,<xref rid="b7-ol-25-02-13637" ref-type="bibr">7</xref>).</p>
<p>Recent studies show that the overall breast cancer incidence rates are similar among white and black patients. However, Black patients are more like to be diagnosed with larger tumor size (&#x003E;5 cm (12&#x0025;) and with high-grade (42&#x0025;) breast cancer (<xref rid="b8-ol-25-02-13637" ref-type="bibr">8</xref>). There is also a noticeable disparity in Triple-Negative Breast Cancer (TNBC), with black women showing 19&#x0025; of all breast tumors as TNBC compared to 9&#x0025; in white patients. Utilizing exome technologies will be crucial to understanding the genetic aspects of this health disparity. We report our analysis on WES of breast tumor biopsies from African-American and Hispanic patients from South Los Angeles, a region with significant health disparity. Our work highlights the overall mutational landscape and specific genetic mutations that might provide insight into the biological aspects of breast cancer in minority patients.</p>
</sec>
<sec sec-type="materials|methods">
<title>Materials and methods</title>
<sec>
<title/>
<sec>
<title>Tumor sample and patient ethnicity</title>
<p>Breast cancer patient samples from minority patients were collected (<xref rid="tI-ol-25-02-13637" ref-type="table">Table I</xref>). Patients belong to the Los Angeles SPA6 (Service Planning Area 6) region, which traditionally shows significant health disparity. A total of thirteen African-American and fifteen Hispanic patients were analyzed for the study with confirmed invasive ductal carcinoma, except for one African-American sample with a Ductal Carcinoma in situ (DCIS) diagnosis. We also utilized matched normal (tumor-adjacent) for eight samples from each ethnicity. Patient samples were used from our ongoing Breast Cancer Study in the Division of Cancer Research and Training at the Charles R. Drew University of Medicine and Science in collaboration with Martin Luther King Ambulatory Care Center (formerly known as King-Drew Medical Center, #IRB 00-06-041) and the protocol has been approved since 1999 and continuing review approved annually (recent continuing review approval was August 18, 2021. The patient demographics and breast cancer subtypes are listed in <xref rid="tI-ol-25-02-13637" ref-type="table">Table I</xref>.</p>
</sec>
<sec>
<title>Illumina nextera exome library preparation</title>
<p>Total DNA was isolated from fresh-frozen breast cancer biopsies and matched normal tissues using the QIAGEN QiaAmp DNA purification kit and Quantified using Nanodrop (Thermo Fischer Scientific, USA). 250 to 500 ng DNA was used to prepare the library using Nextera Flex for Enrichment (Illumina, USA, Cat No 20025524). Libraries were run on a Bioanalyzer DNA 1000 chip (Agilent, USA) to assess quality. Library quantitation was done using the qubit 3.0 fluorometer (Thermo Fischer Scientific, USA) using the dsDNA high sensitivity kit (ThermoFisher, USA, Cat No. Q32851). The exome capture probes cover about 45 Mb of the primarily protein-coding region of the human genome (hg19 assembly). The bed file with the chromosomal coordinates is available at: (<uri xlink:href="https://support.illumina.com/content/dam/illumina-support/documents/downloads/productfiles/nextera/nextera-flex-for-enrichment/TruSeq_Exome_TargetedRegions_v1.2.bed">https://support.illumina.com/content/dam/illumina-support/documents/downloads/productfiles/nextera/nextera-flex-for-enrichment/TruSeq_Exome_TargetedRegions_v1.2.bed</uri>).</p>
</sec>
<sec>
<title>Whole exome nextera flex from enrichment workflow</title>
<p>250 to 500 ng DNA was subjected to library preparation using the Nextera Flex for Enrichment (Currently, DNA Prep for Enrichment, Illumina, USA) following manufacturer-recommended protocol. Briefly, DNA was &#x2018;tagmented&#x2019; Tagmentation is the initial step in library prep where genomic DNA is cleaved and tagged for analysis, cut into small pieces of 300&#x2013;400 bp length by a transposase, and bead-linked transposes ligated adaptor for sequencing in the same process. The tagmentation was followed by captured by biotinylated oligonucleotides covering approximately 45 Mb of human genomic exons (sequence version UCSC hg19). Finally, exome library capture hybridization was performed for 1.5 h. Twelve patient pools were run on a NextSeq 550 sequencer (Illumina, USA) with paired-end 70 bp reads.</p>
</sec>
<sec>
<title>Bioinformatics analysis of exome sequencing data</title>
<p>Fastq generated by the NextSeq 550 run was mapped to the hg19 human genome using the Burrows-Wheeler Aligner (BWA, BWA mem), and variants were identified using the Genome Analysis Toolkit (GATK). To this end, we utilized Illumina BaseSpace &#x2018;BWA Enrichment&#x2019; pipeline (<uri xlink:href="https://www.illumina.com/products/by-type/informatics-products/basespace-sequence-hub/apps/bwa-enrichment.html">https://www.illumina.com/products/by-type/informatics-products/basespace-sequence-hub/apps/bwa-enrichment.html</uri>). Mapping was restricted to the chromosomal regions mentioned in &#x2018;TruSeq_Exome_TargetedRegions_v1.2.bed&#x2019;. This &#x2018;bed&#x2019; file specifies the coordinates of the regions used to generate probes targeting all known protein-coding genes in the human genome. Variants were annotated using Illumina Variant Annotator. For the figures shown below, the variant call format (VCF) file generated by the Illumina BWA enrichment pipeline was filtered for the common/germline variants in the QIAGEN QCI software platform using the following strategy. The following criteria were used to exclude variants from the cancer exome VCFs. Variants that are present &#x003E;=1&#x0025; of Allele Frequency Community (QIAGEN Database) or &#x003E;=3&#x0025; in the following: the Genome Aggregation Database (gnomAD <uri xlink:href="https://gnomad.broadinstitute.org">http://gnomad.broadinstitute.org</uri>) or ExAC (<uri xlink:href="https://exac.broadinstitute.org">https://exac.broadinstitute.org</uri>, currently merged with gnomAD) or of NHLBI GO Exome Sequencing Project (NHLBI ESP Exomes: <uri xlink:href="https://evs.gs.washington.edu/EVS/">http://evs.gs.washington.edu/EVS/</uri>) Or 1000 Genomes project (<uri xlink:href="https://www.internationalgenome.org">https://www.internationalgenome.org</uri>). Variants were also excluded if present in the dbSNP database. However, common germline variants were kept for analysis if established as pathogenic variants with support from literature published. Matched normal samples were also used to filter out germline and common variants. However, variants with known pathogenicity in any disease were included in the analysis. The QIAGEN Clinical Insight contains gnomAD (v2.1.1), Exome Variant Server (EVS, vESP6500SI&#x2013;V2), 1000 Genome Frequency (phase3v5b), Single Nucleotide Polymorphism Database (dbSNP v154), Combined Annotation Dependent Depletion (CADD v1.6), Sorting Intolerant from Tolerant (SIFT4G v2016-02-23), BSIFT (2016-02-23), Polymorphism Phenotyping (PolyPhen-2 v2.2.2) versions (PhyloP (2009&#x2013;11). The version information was obtained from the release notes available at the <uri xlink:href="https://variants.ingenuity.com/qci/website">https://variants.ingenuity.com/qci/website</uri>. The filtered VCF was annotated using Ensemble Variant Effect Predicted (VEP) (<xref rid="b9-ol-25-02-13637" ref-type="bibr">9</xref>), which was then converted to Mutation Annotation Format (MAF) using vcf2maf script (<uri xlink:href="https://github.com/mskcc/vcf2maf">https://github.com/mskcc/vcf2maf</uri>). All figures were generated using the Maftools R package [R version 4.10 (2021-05-18) and maftools 2.8.0] (<xref rid="b10-ol-25-02-13637" ref-type="bibr">10</xref>). Mutational signatures were determined by maftools &#x2018;extractSignature&#x2019; and Plot Signature functions. Finally, mutational profiles were compared with the current Single base substitutions (SBS) signature from the Catalogue Of Somatic Mutations In Cancer (COSMIC) database (<uri xlink:href="https://cancer.sanger.ac.uk/signatures/sbs/">https://cancer.sanger.ac.uk/signatures/sbs/</uri>). The overall survival analysis was conducted using maftools mafSurvival function and comparison between patients with high or low DFS was carried out with mafCompare function for Fisher&#x0027;s exact test. The survival analysis utilized Cox proportional hazard function and correlated gene mutations individually or in groups on the overall survival of the patients.</p>
</sec>
<sec>
<title>Analysis of significantly mutated genes in the patient sample</title>
<p>Tumor Mutational Burden (TMB) was calculated as total nonsynonymous mutations per mb (megabases, log2 transformed, per mb is calculated from the capture size of the exome capture baits). The statistical test was performed with Graphpad Prism 9 for the Mann-Whitney U test or maftools for the pairwise t-test. All differences in variant comparison between groups of the sample were calculated using 2&#x00D7;2 Fisher&#x0027;s Exact Test in maftools. In all cases, P-value &#x003C;=0.05 was considered significant. We have used MutSig v1.4 to analyze significantly mutated genes in African-American and Hispanic samples. The genes with a q-value less than 0.0001 were analyzed for enrichment using the ShinyGO v.0741 web portal (<uri xlink:href="https://bioinformatics.sdstate.edu/go/">http://bioinformatics.sdstate.edu/go/</uri>) against the Gene Ontology Biological pathways and the Hallmark Dataset form The Molecular Signatures Database (MSigDB) (<xref rid="b11-ol-25-02-13637" ref-type="bibr">11</xref>,<xref rid="b12-ol-25-02-13637" ref-type="bibr">12</xref>). For MutSig cancer driver identification, we used variants with the &#x2018;PASS&#x2019; criteria attached to the variants of interest in the final filtered variant list for our samples utilizing the maftools prepareMutsig function. For the TCGA cohorts, we directly exported the MutSig Compatible variant list file for analysis from the GDC mc3 maf file. We did the comparative analysis in the &#x2018;Ingenuity QCI&#x2019; (app.ingenuity.com/). The result includes SIFT (<uri xlink:href="https://sift.bii.a-star.edu.sg/">https://sift.bii.a-star.edu.sg/</uri>) and Polyphen (<uri xlink:href="https://genetics.bwh.harvard.edu/pph2/">http://genetics.bwh.harvard.edu/pph2/</uri>) scores (<xref rid="b13-ol-25-02-13637" ref-type="bibr">13</xref>,<xref rid="b14-ol-25-02-13637" ref-type="bibr">14</xref>). These scores predict the functional consequences of a variation for a protein. We filter the variants to predict detrimental &#x2018;damaging&#x2019; mutations and &#x2018;activating&#x2019; mutations.</p>
</sec>
<sec>
<title>Analysis of publicly available COSMIC and TCGA data</title>
<p>TCGA was accessed via cBioPortal using the general web interface (cbioportal.org). We also downloaded the publicly available &#x2018;mc3.v0.2.8.PUBLIC.maf.gz &#x2018;from the <uri xlink:href="https://gdc.cancer.gov/about-data/publications/mc3-2017">https://gdc.cancer.gov/about-data/publications/mc3-2017</uri> website and used clinical data available from cBioPortal for the TCGA Breast Cancer (TCGA-BRCA) cohort to subset the maf file into African-American and Hispanic categories. We used the clinical category &#x2018;Race&#x2019; as &#x2018;Black or African-American&#x2019; (Total 162 extracted) and the &#x2018;Ethnicity&#x2019; category &#x2018;Hispanic&#x2019; (total 33 extracted) to create the ethnicity-specific mafs using the subset Maf function in maftools. We analyzed these ethnic categories for comparative analysis with our cohort of patients. COSMIC Census genes were downloaded from Sanger&#x0027;s COSMIC site (<uri xlink:href="https://cancer.sanger.ac.uk/census">https://cancer.sanger.ac.uk/census</uri>). Venn diagram comparison of somatic mutations was conducted using the web portal <uri xlink:href="https://genevenn.sourceforge.net/vennresults.php">http://genevenn.sourceforge.net/vennresults.php</uri>.</p>
</sec>
<sec>
<title>Statistical analysis</title>
<p>We analyzed variants in total of 13 African-American and 15 Hispanic samples. We also utilized TCGA breast cancer data set with 163 African-American and 33 Hispanic breast cancer samples. The comparative analysis was conducted using an unpaired Mann-Whitney U test, one-way ANOVA with Tukey&#x0027;s multiple comparison or Fisher&#x0027;s exact test (comparison between high and low disease-free survival groups). The P-value generated by MutSig for a gene is a combination of three P-values for the mutation abundance, location in the genome and the conservation of genetic sequence across species. Details of the MutSig algorithm are available at <uri xlink:href="https://www.broadinstitute.org/cancer/cga/mutsig">https://www.broadinstitute.org/cancer/cga/mutsig</uri>. The data used for ANOVA and Mann-Whitney U test are available at the synapse project page (project ID, syn42137028). GraphPad Prism version 9, GraphPad Inc., San Diego, USA was used for one-way ANOVA with Tukey&#x0027;s multiple comparison and Mann-Whitney U test. Fisher exact test was carried out using maftools in R [Maftools R package (R version 4.10 (2021-05-18, <uri xlink:href="https://cran.r-project.org/">http://cran.r-project.org/</uri>) and maftools 2.8.0, <uri xlink:href="https://github.com/PoisonAlien/maftools/">https://github.com/PoisonAlien/maftools/</uri>]. Kaplan-Meier plots were generated with univariate cox proportional hazard model analysis using maftools. In all cases, P-value &#x003C;=0.05 was considered significant.</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<sec>
<title/>
<sec>
<title>Significant genetic variants in the African-American and Hispanic cohorts</title>
<p>Our current study examines patient samples from a small cohort of ethnic minority groups, with an overall of 13 African-American (AA) and 16 Hispanic/Latinx Tumor samples (<xref rid="tI-ol-25-02-13637" ref-type="table">Table I</xref> Method section, 15 samples for the Hispanic group analyzed). Overall, our exome analysis recovered single nucleotide variations (SNVs) and a small number of insertions and deletions. The average coverage for African-American and Hispanic tumor samples was &#x007E;52X and &#x007E;48X, respectively, excluding one Hispanic sample with low coverage (excluded from analysis). The median Tumor mutational burden (calculated a log2 (missense mutations/Mb of capture size) were 0.98 and 0.89 in African-American and Hispanic samples, respectively (<xref rid="f1-ol-25-02-13637" ref-type="fig">Fig. 1</xref>). The TMB values were slightly lower than the TCGA cohorts. However, we did not observe any statistically significant difference between our and TCGA samples. After filtering for common variants, the African-American samples had 647 mutations (SNVs including insertion and deletion). On the other hand, Hispanic samples had 594 mutations (<xref rid="SD2-ol-25-02-13637" ref-type="supplementary-material">Table SI</xref>: Summary of mutation after applying common variant filters).</p>
<p>In either case, while including known pathogenic mutations, the significantly mutated genes were <italic>F5</italic> (Coagulation Factor V, p.Q534R) and <italic>PRSS1</italic> (Serine Protease 1) in the African-American samples. On the other hand, Both African-American and Hispanic samples had <italic>MTHFR</italic> (methylenetetrahydrofolate reductase, p.E470A, and p.A263V) variants. Both samples showed variants in histone lysine methyltransferase gene SETD8. <italic>SETD8</italic> had two in-frame deletions, namely p.A20_A21del and p.L181Hfs&#x002A;20. One Hispanic and 3 AA samples showed insertion in the ARID1B (p.H1534Q and p.Q130_Q131dup). <italic>PRSS1</italic> mutations were observed in 11 Hispanic and 8 AA samples (p.N29I). The p.N29I polymorphism in <italic>PRSS1</italic> is possibly pathogenic (e.g., ClinVar Accession RCV000012652.31). Due to the higher frequencies observed in our samples and associated dbSNP identifiers, these genetic variants could be due to germline contribution and are predicted to be benign.</p>
</sec>
<sec>
<title>Variants in breast cancer-related and DNA damage response genes</title>
<p>We examined single nucleotide variants in our dataset for known breast cancer genes. These genes are reported to be frequently mutated in breast cancer [Online Mendelian Inheritance of Man (OMIM) entry for breast carcinoma: <uri xlink:href="https://omim.org/entry/114480">https://omim.org/entry/114480</uri> and Human phenotype ontology (HPO): <uri xlink:href="https://hpo.jax.org/app/browse/term/HP:0003002">https://hpo.jax.org/app/browse/term/HP:0003002</uri>] (<xref rid="b15-ol-25-02-13637" ref-type="bibr">15</xref>). We also compared the gene list we obtained for each cohort with the COSMIC Cancer Gene Census (CGC) genes, which in many cases, have experimental evidence as oncogenes and tumor suppressors (Tier 1) (<xref rid="b16-ol-25-02-13637" ref-type="bibr">16</xref>).</p>
<p>The cancer genome atlas lists <italic>PIK3CA</italic> (45&#x0025;) as the most frequently mutated gene, followed by <italic>MAP3K1, GATA3, TP53, CDH1</italic>, and <italic>MAP2K</italic> [<xref rid="b5-ol-25-02-13637" ref-type="bibr">5</xref>]. Although <italic>TP53</italic> was the most frequently mutated gene in African-American breast cancer patients in our cohort, we found missense mutations in <italic>ARID1B</italic> (5 samples), <italic>BRCA1</italic> (4 samples), <italic>BRCA2</italic> (4 samples), and <italic>RAD51B</italic> (3 samples) (<xref rid="f2-ol-25-02-13637" ref-type="fig">Fig. 2</xref>, <xref rid="tII-ol-25-02-13637" ref-type="table">Table II</xref>). The details of example genes from the top 50 frequently mutated genes [Online Mendelian Inheritance of Man (<uri xlink:href="https://omim.org/">https://omim.org/</uri>) and Human Phenotype Ontology] and the COSMIC Census (updated as of February 2022) are shown in <xref rid="tII-ol-25-02-13637" ref-type="table">Table II</xref>. To capture genetic variants in these genes, we included variants with dbSNP ids while comparing the tumor with matched normal (<xref rid="tIII-ol-25-02-13637" ref-type="table">Table III</xref>). As a result, <italic>BRCA2</italic> (6 samples), ARID1B (AT-rich interactive domain, 5 samples), and <italic>BARD1</italic> (BRCA1 Associated RING Domain 1B, 2 samples) showed variants in this cohort. In addition, a single sample in each cohort showed variants in the <italic>TP53, ERBB2</italic>, and <italic>HELQ</italic> (Helicase, POLQ Like, a single-stranded DNA-dependent ATPase, and DNA helicase) genes. On the other hand, Hispanic patients also exhibited similar mutational profiles in the top frequently mutated breast cancer or DNA damage response genes (<xref rid="f2-ol-25-02-13637" ref-type="fig">Fig. 2</xref>, <xref rid="tII-ol-25-02-13637" ref-type="table">Table II</xref>).</p>
<p>As mentioned earlier, we compared the somatic mutation data from our cohort with the COSMIC CGCs, for variants causally linked with cancer. Overall, 47 genes from the African-American and 52 in the Hispanic cohorts overlap with the CGC lists (<xref rid="SD3-ol-25-02-13637" ref-type="supplementary-material">Table SII</xref>: COSMIC Census genes found in our patient cohort). Most of the genes were found to be mutated in a single sample. In the African-American cohort, both <italic>KDM6A</italic> [Lysine (K)-specific demethylase 6A, c.2703-5dup/del, Intronic] and <italic>PABPC1</italic> [poly (A) binding protein cytoplasmic 1, p.K254Nfs&#x002A;24] showed variants in 4 samples. In the Hispanic group, <italic>KMT2C</italic> [lysine (K)-specific methyltransferase 2C, p.A30P; p.M1774T], <italic>EIF1AX</italic> (eukaryotic translation initiation factor 1A; X-linked, c.337&#x002B;1G&#x003E;C; C256-3A&#x003E;C, Splice Site), and <italic>SGK1</italic> (serum/glucocorticoid regulated kinase 1 c.285&#x002B;50T&#x003E;C; c.362-3230T&#x003E;G; Intronic) each showed variants in 2 samples. <italic>ZNF384</italic> (p.Q547del) showed the same variants in 3 Hispanic samples.</p>
</sec>
<sec>
<title>Mutations in significant genes in breast cancer patients in comparison to the TCGA data</title>
<p>The Cancer Genome Atlas (TCGA, <uri xlink:href="https://www.cancer.gov/tcga.">http://www.cancer.gov/tcga.</uri>) provides a large dataset on various cancer types to querying for mutations and gene expression with samples from multiple ethnicities. We queried the TCGA data set via cBioPortal (cbioportal.org) (<xref rid="b17-ol-25-02-13637" ref-type="bibr">17</xref>). We utilized the TCGA Pan-Can Atlas 2018 dataset to this end as a maf (mutation annotation format) file from the NCI Genomic Data Common (GDC). We compared the mutational profile of African-American (Race Category) and Hispanic patients (Ethnicity Category) with our cohorts. According to the mutations listed in the TCGA data, the most frequently mutated gene is <italic>TP53</italic> (accessed via cBioPortal). TP53 was the most significantly mutated gene in African-American samples (42.7&#x0025;, 72 patients out of 182 on cBioPortal), followed by <italic>FBXW7</italic> (F-box and WD repeat domain containing 7) at 8&#x0025;. This finding is in concordance with other reported studies, which found <italic>TP53</italic> mutation to be highly enriched in African-American patients (<xref rid="b18-ol-25-02-13637" ref-type="bibr">18</xref>). The GDC maf file also listed <italic>PIK3CA, TTN, GATA3</italic>, and <italic>KMT2C, MAP3K1</italic> as frequently mutated in the Black or African-American cohort. TP53 mutations were found in our patient set in two African-American and only one Hispanic sample. A somatic FBXW7 variant was not observed in our cohorts (Filtered) except for one Hispanic patient (p.I394&#x002A;). In white patients, on the other hand, the mutation level of <italic>TP53</italic> was 29.23&#x0025; (216 patients out of 752). <italic>PIK3CA</italic> was the most frequently mutated gene in white patients (34.64&#x0025; compared to 19.66&#x0025; in African-American patients).</p>
<p>The genetic variants in the TCGA cohort are in well-known tumor suppressors or oncogenes. However, excluding variants in databases such as gnomAD or dbSNP and applying the &#x2018;PASS&#x2019; filter to the vcf files, the variants in genes such as TP53 and FBXW7 were filtered out from most of our samples. We only observed <italic>TP53</italic> p.R175H in one African-American and p.R213&#x002A; in one Hispanic patient. Therefore, we examined the tumor and normal samples for known variants in those genes to capture the possible contribution of tumor matched-normal tissues for <italic>TP53</italic> variants and other gene mutations in our samples. In our patient samples, <italic>TP53</italic> mutations that were most frequent were p.P72R (COSMIC id: COSV52666208), p.R273H (COSMIC id: COSV52660980), p.R342&#x002A; (stop codon, COSMIC id: COSV52665487). The Hispanic group only showed the P72R mutations (<xref rid="f3-ol-25-02-13637" ref-type="fig">Fig. 3</xref>). All of these <italic>TP53</italic> variants are implicated in cancer (<xref rid="b19-ol-25-02-13637" ref-type="bibr">19</xref>).</p>
<p>On the other hand, one additional <italic>FBXW7</italic> mutation was found in 2 African-American patients. The <italic>FBWX7</italic> p.P160L missense mutations might be a loss-of-function implicated in cancer (COSMIC ID: COSV55920521). In addition, the Hispanic breast cancer cohort also showed p.D600N mutation, which might be a somatic loss of function of the protein (COSMIC ID: COSV55951044).</p>
</sec>
<sec>
<title>Mutational signatures in minority breast cancer patients</title>
<p>In recent years, it has become apparent that certain mutational processes are causative for a specific type of single nucleotide variations in the genome. These processes involving the APOBEC3 (Apolipoprotein B Editing Complex) enzyme or DNA damage response protein will leave a &#x2018;signature&#x2019; behind, which can be revealed by WES or WGS (<xref rid="b20-ol-25-02-13637" ref-type="bibr">20</xref>). This mutational signature is displayed using a 96-mutational profile classification. The signature is calculated using the substitution class (A&#x003E;G, T&#x003E;C) and three nucleotides 3&#x2032; downstream or 5&#x2032; upstream to the mutated base. We applied the signature algorithms on the exome profile of our patient data. We did not find APOBEC mutational pattern enrichment in our patient samples. However, all breast cancer patients in our cohort showed similar mutational profiles. Furthermore, we discovered that Signatures 3 and 5 are enriched in African-American and Hispanic samples (<xref rid="f4-ol-25-02-13637" ref-type="fig">Fig. 4</xref>). Signature 3 is associated with DNA double-strand Homologous Recombination repair. We compared the signature profile with TCGA African-American and Hispanic cohorts. Signature 3 was found to be over-represented in all samples. The TCGA cohorts showed Spontaneous or Enzymatic deamination of 5-methylcytosine (SBS1) and APOBEC Cytidine Deaminase (SBS2). The cytosine deaminase APOBEC3 mediated C to T mutation is prevalent in breast cancer, and generally, multiple cancer shows mutational patterns indicative of APOBEC activity (<xref rid="b20-ol-25-02-13637" ref-type="bibr">20</xref>,<xref rid="b21-ol-25-02-13637" ref-type="bibr">21</xref>). However, we did not observe these signatures in our datasets. Only one African-American patient and three Hispanic patients had APOBEC enrichment (&#x003E;2) in our cohort (<xref rid="SD4-ol-25-02-13637" ref-type="supplementary-material">Table SIII</xref>: APOBEC Enrichment scores for patient samples). APOBEC enrichment score profiles are shown as a boxplot in <xref rid="f4-ol-25-02-13637" ref-type="fig">Fig. 4D</xref>. The TCGA-AA group has a higher APOBEC value than our cohort (DCRT-AA) (Mann-Whitney U test, Median TCGA-AA=1.439 and DCRT-0.954, two-tailed P=0.039). However, all other comparisons between AA and Hispanic categories did not significantly differ.</p>
</sec>
<sec>
<title>Oncogenic pathways in African-American and Hispanic samples</title>
<p>We also examined somatic variants in the signaling pathways associated with cancer. Ten most affected pathways were examined using maftools. The pathways are derived from Sanchez-Vega <italic>et al</italic> (<xref rid="b22-ol-25-02-13637" ref-type="bibr">22</xref>). In analyzing African-American and Hispanic tumor samples, we found that both have similar profiles regarding affected genes in the pathways. For example, African-American and Hispanic tumors have RTK-RAS, NOTCH, and WNT as the top three pathways, although each category&#x0027;s number of affected genes varies. For example, the RTK-RAS pathway showed 5 and 3 genes mutated for African-American and Hispanic patients, respectively (<xref rid="f5-ol-25-02-13637" ref-type="fig">Fig. 5</xref>). Hispanic patients had more variants in The TGF&#x03B2; signaling pathway-associated genes than African-American patients. Similarly, the Notch pathway showed variants in 5 and 4 genes out of 71 in African-American and Hispanic patients. The TCGA data for African-American and Hispanic cohorts showed similar patterns in the oncogenic pathways. Hippo and PI3K pathways showed similar profiles in all cohorts analyzed.</p>
<p>However, individual subtype-based analysis shows other genes and the genes described above. For example, we compared the tumor of three African-American patients with three Hispanic patients with triple-negative (TNBC) subtype in QCI interpretation. In this case, we observed enrichment for variants in <italic>IGSF3</italic> (p.R456C, p.D254N), <italic>ZNF717</italic> (p.E370Q; p.Y499&#x002A;, p.K622fs&#x002A;79, p.K790&#x002A;, p.S861fs&#x002A;?) KIR3DL3 (p.V324A) and <italic>KMT2C</italic> (p.W858L, p.P860S) while selecting pathogenic or likely pathogenic variants. A <italic>G6PD</italic> p.V68M (known loss of function) was found in one African-American sample. However, this variant has a high prevalence in the African-American population (gnomAD 11.64&#x0025; in Africans, <xref rid="SD5-ol-25-02-13637" ref-type="supplementary-material">Table SIV</xref>: Details of all genetic variants).</p>
</sec>
<sec>
<title>Effect of genetic variants on overall survival</title>
<p>The challenge with survival analysis is the low frequency of somatic mutations, significantly reducing the number of patients carrying a mutation. Therefore, we conducted the overall survival analysis on the TCGA cohorts due to the higher available number. In addition, we chose the genes that are frequently mutated in our cohorts. In our survival analysis for the TCGA African-American cohort, we observed a statically significant (P&#x003C;0.05) reduction in the probability of overall survival with a mutation in F5 (Median Survival 174.5 days vs. 414.5 days in the WT, HR=2.96) or BRCA2 (HR=5.2). We also conducted survival analysis with gene sets instead of individual genes. We found that patients with mutations in at least two genes among XRCC3, TP53, BRCA1 and BRCA2 have reduced overall survival (Median survival 176 days vs. 413 in patients with Wt alleles, HR=2.37).</p>
<p>Additionally, patients with at least one mutation in either SETD8, PRSS1, ARID1B, F5, or CDC27 genes showed reduced survival to 176 days compared to 426 days in Wt patients (Wt patients) HR=3.06) (<xref rid="SD1-ol-25-02-13637" ref-type="supplementary-material">Fig. S1</xref>). Additionally, we also utilized the Disease-Free Survival (DFS) data (<xref rid="SD6-ol-25-02-13637" ref-type="supplementary-material">Table SV</xref>) for our patients (median=38 months) and compared the genetic variants in patients with Low DFS with patients who had higher DFS (Median Split). Even though not statistically significant, African-American patients with lower DFS had more PRSS1 and CDC27 frequently mutated. On the other hand, Hispanic patients had F5 and MTHFR more frequently mutated along with SETD8 and PRSS1 (<xref rid="SD7-ol-25-02-13637" ref-type="supplementary-material">Table SVI</xref>).</p>
</sec>
<sec>
<title>Novel driver gene mutation and differential mutations in African American and Hispanic samples</title>
<p>We wanted to understand the differentially mutated genes in African-American and Hispanic breast cancer patient samples. To this end, we utilized MutSig v1.4 to determine the novel driver genes in the patients in both African-American and Hispanic samples (<xref rid="b23-ol-25-02-13637" ref-type="bibr">23</xref>). MutSig algorithm frequently determines mutated genes while considering gene expression and chromatin state and has been tested to find novel driver mutations across 21 different tumor types (<xref rid="b24-ol-25-02-13637" ref-type="bibr">24</xref>). The somatic mutation rate is calculated with respect to a background mutation rate. Among the frequently mutated somatic genes, 97 genes are shared between African-American and Hispanic samples among the top 250 genes as determined by MutSig [<xref rid="SD8-ol-25-02-13637" ref-type="supplementary-material">Table SVII</xref>: MutSig scores for genes in the African-American and Hispanic sample and <xref rid="SD9-ol-25-02-13637" ref-type="supplementary-material">Table SVIII</xref>: Top mutation significance (MutSig) genes across various ethnicities]. After Filtering the dataset with P-values (&#x003C;0.05) in African-American samples, the top ten mutated genes are <italic>SETD8, PRSS1, TMIE, PABPC1, OR8D4, OR6P1, EPHB6, BMP2K, MEF2A</italic>, and <italic>TPP1</italic>. However, in the TCGA African-American cohort, <italic>GATA3, TP53, PIK3CA, CDH1, PTEN, MAP3K1, MAP2K4, MUC4, RUNX1</italic>, and FBXW7 were the top ten candidate genes. We also conducted GO (Gene Ontology) analysis on the significant driver genes for both our and TCGA cohorts. Our data set did not show enrichment in any particular category. The TCGA African-American cohorts showed enrichment in the protein stabilization and intrinsic apoptotic signaling pathways (<xref rid="f6-ol-25-02-13637" ref-type="fig">Fig. 6</xref>). Hallmark MSigDB Hypoxia, P53 Pathway, and E2F targets were also enriched in the candidate driver gene profiles from the TCGA African-American cohort set.</p>
<p>In the TCGA Hispanic group, the top ten genes found by MutSig were <italic>TP53, PIK3CA, GATA3, MAP3K1, TYW3, KHDC1, CEACAM8, TCP10L2, GPS2</italic>, and <italic>SNAP29</italic>. However, GO ontology analysis failed to show enrichment in any specific category. Our Hispanic patient cohort showed a similar candidate driver gene profile, with the top ten being <italic>SETD8, MTHFR, PRSS1, KIAA2018, CDC27, ZNF384, MEF2A, F5</italic>, and <italic>NUDT15</italic>. As a candidate driver gene, we observed <italic>SETD18</italic>, a histone lysine methyltransferase. <italic>SETD8</italic> is known to play a role in breast cancer metabolism by stabilizing hypoxia-inducible factor 1&#x03B1; (HIF1&#x03B1;) and is also implicated in DNA damage response maintaining genomic integrity (<xref rid="b25-ol-25-02-13637" ref-type="bibr">25</xref>,<xref rid="b26-ol-25-02-13637" ref-type="bibr">26</xref>).</p>
<p>As with all pathways, the mutated genes in African-American samples were similar to Hispanic samples. 11 genes (including <italic>ATXN1, CMTM5, EIF1AX, GPRC6A, MEF2A, PRSS1</italic>, and <italic>SETD8</italic>) were found to be candidate drivers in both categories. However, some genes were only enriched in one or the other sample (72 in African-American, 42 in the Hispanic cohort, <xref rid="f6-ol-25-02-13637" ref-type="fig">Fig. 6</xref> Venn Diagram). There were only two genes common between our Hispanic and TCGA cohorts. Our analysis with the Hispanic samples observed significant mutations in genes such as <italic>CDC27</italic> (cell division cycle 27), making it a candidate driver gene. CDC27 protein levels and polymorphism are associated with breast cancer mortality and risk (<xref rid="b27-ol-25-02-13637" ref-type="bibr">27</xref>,<xref rid="b28-ol-25-02-13637" ref-type="bibr">28</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>Efforts are underway which utilize various omics approaches to understand cancer health disparity. Next-generation sequencing like whole genome and exome technologies are paving the way to understanding the mutational burden and discovering driver mutations in various tumor types. With technologies like WES, it is possible to elucidate the biological aspects of health disparity. The current study shows WES results on African-American and Hispanic patients, two minority demographics not significantly represented in large databases like TCGA. We wanted to determine somatic mutations in our cohort with a tumor-normal comparison and explore the variants in genes implicated in breast cancer from publicly available somatic and germline variants databases such as COSMIC and OMIM (<xref rid="SD5-ol-25-02-13637" ref-type="supplementary-material">Table SIV</xref>: Details of all variants in our patient samples). However, we observed multiple breast cancer-related genetic variants in the germline after relaxing filtering criteria to include dbSNP variants or variants with known pathogenicity, which might be due to tumor heterogeneity or purity.</p>
<p>The DNA Damage Response (DDR) pathways are essential for affecting critical biological processes. The proteins involved in these pathways can result in mutations in DNA sequences due to error-prone repair. In addition, multiple proteins affecting this pathway are implicated in cancer (<xref rid="b29-ol-25-02-13637" ref-type="bibr">29</xref>). Thus, we examined the DNA damage response pathway genes and signatures in our cohort of patients and examined potential driver mutations. We also utilized the TCGA database to explore the mutational landscaper in African-American and Hispanic cohorts and did a comparative analysis with our cohort.</p>
<p>In African-American and Hispanic samples from TCGA, we overserved that known breast cancer-related genes, including <italic>TP53, PIK3CA, GATA3</italic>, and <italic>MAP3K1</italic>, were significantly more variants. Even though samples from both ethnic categories of patients had variants in the genes mentioned, they showed different frequencies with which these genes were mutated. For example, <italic>TP53</italic> is the most frequently mutated in the African-American TCGA sample as opposed to <italic>PIK3CA</italic> in Hispanic samples. Our cohorts found TP53 variants in a few selected tumor samples, including tumor-adjacent normal tissue. The variants (<xref rid="f3-ol-25-02-13637" ref-type="fig">Fig. 3</xref>) were also found in the TCGA cohort except for <italic>TP53</italic> p.<italic>P72R</italic>. Among genes that showed a higher frequency in our cohort were <italic>F5</italic> and <italic>MTHFR</italic>, with possible germline contributions. However, F5 is a potential candidate gene for breast cancer and a marker for immune cell infiltration in breast cancer (<xref rid="b30-ol-25-02-13637" ref-type="bibr">30</xref>,<xref rid="b31-ol-25-02-13637" ref-type="bibr">31</xref>). <italic>PRSS1, SETD8</italic>, and <italic>CDC27</italic> were frequently mutated in African-American and Hispanic samples.</p>
<p>We observed that the DDR pathway genes are mutated in the African-American and Hispanic samples. These findings explain the observed mutational signatures (Current SBS), namely &#x2018;Signature 3 and 5&#x2019;. Signature 3. These signatures are associated with DNA Homologous Recombination repair. However, we did not observe any SBS1/2 associated with APOBEC activity in our breast cancer samples compared to the TCGA cohorts. Instead, we observed Signature 5, which may be due to environmental exposure or other unknown factors.</p>
<p>DNA damage signature can also modify the tumor microenvironment and affect immune gene expression (<xref rid="b32-ol-25-02-13637" ref-type="bibr">32</xref>). A &#x2018;DNA damage response-deficient&#x2019; subtype shows up-regulation of Programmed Death-Ligand 1 (PD-L1) in a cyclic GMP-AMP synthase and signaling effector stimulator of interferon genes (cGAS-STING) dependent manner. The cGAS-STING pathway is a foreign DNA sensing mechanism associated with multiple inflammatory responses (<xref rid="b33-ol-25-02-13637" ref-type="bibr">33</xref>). Thus, minority breast cancer patients might benefit from checkpoint inhibitor therapy when multiple genes in the DNA damage response and homologous recombination pathway have variants with functional implications (<xref rid="tII-ol-25-02-13637" ref-type="table">Table II</xref>). Other genomic stability pathways, such as Microsatellite Instability (MSI), are prevalent in all cancers, with variability across cancer types (<xref rid="b34-ol-25-02-13637" ref-type="bibr">34</xref>). It will be interesting to study the effect of MSI on breast cancer susceptibility and its occurrence in African-American and Hispanic patients to understand the contribution of mismatch repair in breast cancer.</p>
<p>Primarily, the somatic and germline variants show similarities with some differences between two minority breast cancer populations that can be further studied in a sub-type-specific manner. Further studies could help understand this disparity in our minority breast cancer patients with more extensive cohort studies. Our exome analysis found variants in the polymorphic genes in our patient samples, particularly <italic>CDC27</italic>. These genes potentially involve cell division and adipocyte metabolisms (<xref rid="b35-ol-25-02-13637" ref-type="bibr">35</xref>). In addition, low <italic>CDC27</italic> expression and <italic>CDC27</italic> polymorphisms are associated with worse breast cancer outcomes (<xref rid="b27-ol-25-02-13637" ref-type="bibr">27</xref>,<xref rid="b28-ol-25-02-13637" ref-type="bibr">28</xref>). Thus, despite being polymorphic in the general population, the variants in these genes could also have functional implications for cancer.</p>
<p>Our findings can have clinical implications in determining therapy in patients with specific genetic mutations. For example, MTHFR is associated with drug metabolism and can affect the patient&#x0027;s ability to respond to chemotherapy. In colon and breast cancers, 5-Fluorouracila sensitivity is associated with variants in the MTHFR gene (<xref rid="b36-ol-25-02-13637" ref-type="bibr">36</xref>,<xref rid="b37-ol-25-02-13637" ref-type="bibr">37</xref>). F5 (Coagulation factor V) is an estrogen response gene associated with CD8&#x002B; T cell in cancer immunity (<xref rid="b30-ol-25-02-13637" ref-type="bibr">30</xref>,<xref rid="b38-ol-25-02-13637" ref-type="bibr">38</xref>). These genes are also associated with the TGF-&#x03B2; pathway. Thus, using inhibitors when the pathway is activated because of mutations can be a potential therapeutic option. SETD8 variants can affect epigenetic pathways due to their role as lysine methyltransferases. SETD8 is also involved in DNA damage repair, thus making it a potential target via small-molecule inhibition (<xref rid="b38-ol-25-02-13637" ref-type="bibr">38</xref>). Additionally, PRSS1 is associated with drug resistance in cancer and higher cancer risk along with SETD8 and ARID1B (<xref rid="b39-ol-25-02-13637" ref-type="bibr">39</xref>&#x2013;<xref rid="b41-ol-25-02-13637" ref-type="bibr">41</xref>). Thus, our results on genetic variants can potentially be used as predictors of cancer risk in minority women.</p>
<p>In our study, we conducted WES on African-American and Hispanic breast cancer samples to elucidate the genetic makeup of breast cancer in these patients. We found overlapping genetic variants in both ethnicities that are potentially causative such as <italic>PRSS1</italic> and <italic>SETD8</italic>. However, there are significant differences in the specific genetic variants that belong to DNA damage response to transcription factors such as <italic>BRCA1/2, XRCC3, HELQ</italic>, and <italic>ARID1B</italic>. In our study, variants shown to be potentially damaging will need to be further studied to understand their molecular mechanisms concerning cancer initiation or progression. In addition, it will be beneficial to validate our findings in larger cohorts, which could lead to biomarker discovery towards the goal of alleviating health disparity. Overall, WES and other next-generation sequencing technologies will be crucial in our efforts to understand breast cancer health disparity.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="SD1-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="pdf" xlink:href="Supplementary_Data1.pdf"/>
</supplementary-material>
<supplementary-material id="SD2-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data2.xlsx"/>
</supplementary-material>
<supplementary-material id="SD3-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data3.xlsx"/>
</supplementary-material>
<supplementary-material id="SD4-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data4.xlsx"/>
</supplementary-material>
<supplementary-material id="SD5-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data5.xlsx"/>
</supplementary-material>
<supplementary-material id="SD6-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data6.xlsx"/>
</supplementary-material>
<supplementary-material id="SD7-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data7.xlsx"/>
</supplementary-material>
<supplementary-material id="SD8-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data8.xlsx"/>
</supplementary-material>
<supplementary-material id="SD9-ol-25-02-13637" content-type="local-data">
<caption>
<title>Supporting Data</title>
</caption>
<media mimetype="application" mime-subtype="xlsx" xlink:href="Supplementary_Data9.xlsx"/>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>Not applicable.</p>
</ack>
<sec sec-type="data-availability">
<title>Availability of data and materials</title>
<p>The datasets generated and/or analyzed during the current study are available in the <uri xlink:href="https://Synapse.org">Synapse.org</uri> repository, <uri xlink:href="https://doi.org/10.7303/syn42137028">https://doi.org/10.7303/syn42137028</uri> (project ID, syn42137028).</p>
</sec>
<sec>
<title>Authors&#x0027; contributions</title>
<p>PD was responsible for the conceptualization, methodology, formal analysis, data curation and original draft preparation. MYK was involved in the methodology and formal analysis. YW was involved in obtaining resources, study design, analysis, writing, review and editing. JVV was responsible for the conceptualization, supervision, review and editing, and funding acquisition. PD and JVV confirm the authenticity of all the raw data. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<title>Ethics approval and consent to participate</title>
<p>The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board of Charles R. Drew University of Medicine and Science (#IRB 00-06-041; Los Angeles, USA), and the protocol has been approved since 1999 and is reviewed annually for continuation (recent continuing review approval was August 18, 2021). Written informed consent for participation was obtained from all subjects involved in the study.</p>
</sec>
<sec>
<title>Patient consent for publication</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="b1-ol-25-02-13637"><label>1</label><element-citation publication-type="book"><person-group person-group-type="editor"><name><surname>Howlader</surname><given-names>N</given-names></name><name><surname>Noone</surname><given-names>AM</given-names></name><name><surname>Krapcho</surname><given-names>M</given-names></name><name><surname>Miller</surname><given-names>D</given-names></name><name><surname>Brest</surname><given-names>A</given-names></name><name><surname>Yu</surname><given-names>M</given-names></name><name><surname>Ruhl</surname><given-names>J</given-names></name><name><surname>Tatalovich</surname><given-names>Z</given-names></name><name><surname>Mariotto</surname><given-names>A</given-names></name><name><surname>Lewis</surname><given-names>DR</given-names></name><etal/></person-group><article-title>SEER cancer statistics review, 1975&#x2013;2017</article-title><publisher-name>National Cancer Institute</publisher-name><publisher-loc>Bethesda, MD</publisher-loc><year>2020</year><uri xlink:href="https://seer.cancer.gov/csr/1975_2017/">https://seer.cancer.gov/csr/1975_2017/</uri></element-citation></ref>
<ref id="b2-ol-25-02-13637"><label>2</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Polyak</surname><given-names>K</given-names></name></person-group><article-title>Heterogeneity in breast cancer</article-title><source>J Clin Invest</source><volume>121</volume><fpage>3786</fpage><lpage>3788</lpage><year>2011</year><pub-id pub-id-type="doi">10.1172/JCI60534</pub-id><pub-id pub-id-type="pmid">21965334</pub-id></element-citation></ref>
<ref id="b3-ol-25-02-13637"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yedjou</surname><given-names>CG</given-names></name><name><surname>Sims</surname><given-names>JN</given-names></name><name><surname>Miele</surname><given-names>L</given-names></name><name><surname>Noubissi</surname><given-names>F</given-names></name><name><surname>Lowe</surname><given-names>L</given-names></name><name><surname>Fonseca</surname><given-names>DD</given-names></name><name><surname>Alo</surname><given-names>RA</given-names></name><name><surname>Payton</surname><given-names>M</given-names></name><name><surname>Tchounwou</surname><given-names>PB</given-names></name></person-group><article-title>Health and racial disparity in breast cancer</article-title><source>Adv Exp Med Biol</source><volume>1152</volume><fpage>31</fpage><lpage>49</lpage><year>2019</year><pub-id pub-id-type="doi">10.1007/978-3-030-20301-6_3</pub-id><pub-id pub-id-type="pmid">31456178</pub-id></element-citation></ref>
<ref id="b4-ol-25-02-13637"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chlebowski</surname><given-names>RT</given-names></name><name><surname>Chen</surname><given-names>Z</given-names></name><name><surname>Anderson</surname><given-names>GL</given-names></name><name><surname>Rohan</surname><given-names>T</given-names></name><name><surname>Aragaki</surname><given-names>A</given-names></name><name><surname>Lane</surname><given-names>D</given-names></name><name><surname>Dolan</surname><given-names>NC</given-names></name><name><surname>Paskett</surname><given-names>ED</given-names></name><name><surname>McTiernan</surname><given-names>A</given-names></name><name><surname>Hubbell</surname><given-names>FA</given-names></name><etal/></person-group><article-title>Ethnicity and breast cancer: Factors influencing differences in incidence and outcome</article-title><source>J Natl Cancer Inst</source><volume>97</volume><fpage>439</fpage><lpage>448</lpage><year>2005</year><pub-id pub-id-type="doi">10.1093/jnci/dji064</pub-id><pub-id pub-id-type="pmid">15770008</pub-id></element-citation></ref>
<ref id="b5-ol-25-02-13637"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Koboldt</surname><given-names>DC</given-names></name><name><surname>Fulton</surname><given-names>RS</given-names></name><name><surname>McLellan</surname><given-names>MD</given-names></name><name><surname>Schmidt</surname><given-names>H</given-names></name><name><surname>Kalicki-Veizer</surname><given-names>J</given-names></name><name><surname>McMichael</surname><given-names>JF</given-names></name><name><surname>Fulton</surname><given-names>LL</given-names></name><name><surname>Dooling</surname><given-names>DJ</given-names></name><name><surname>Ding</surname><given-names>J</given-names></name><name><surname>Mardis</surname><given-names>ER</given-names></name><etal/></person-group><article-title>Comprehensive molecular portraits of human breast tumours</article-title><source>Nature</source><volume>490</volume><fpage>61</fpage><lpage>70</lpage><year>2012</year><pub-id pub-id-type="doi">10.1038/nature11412</pub-id><pub-id pub-id-type="pmid">23000897</pub-id></element-citation></ref>
<ref id="b6-ol-25-02-13637"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Carrot-Zhang</surname><given-names>J</given-names></name><name><surname>Chambwe</surname><given-names>N</given-names></name><name><surname>Damrauer</surname><given-names>JS</given-names></name><name><surname>Knijnenburg</surname><given-names>TA</given-names></name><name><surname>Robertson</surname><given-names>AG</given-names></name><name><surname>Yau</surname><given-names>C</given-names></name><name><surname>Zhou</surname><given-names>W</given-names></name><name><surname>Berger</surname><given-names>AC</given-names></name><name><surname>Huang</surname><given-names>KL</given-names></name><name><surname>Newberg</surname><given-names>JY</given-names></name><etal/></person-group><article-title>Comprehensive analysis of genetic ancestry and its molecular correlates in cancer</article-title><source>Cancer Cell</source><volume>37</volume><fpage>639</fpage><lpage>654.e6</lpage><year>2020</year><pub-id pub-id-type="doi">10.1016/j.ccell.2020.04.012</pub-id><pub-id pub-id-type="pmid">32396860</pub-id></element-citation></ref>
<ref id="b7-ol-25-02-13637"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huo</surname><given-names>D</given-names></name><name><surname>Hu</surname><given-names>H</given-names></name><name><surname>Rhie</surname><given-names>SK</given-names></name><name><surname>Gamazon</surname><given-names>ER</given-names></name><name><surname>Cherniack</surname><given-names>AD</given-names></name><name><surname>Liu</surname><given-names>J</given-names></name><name><surname>Yoshimatsu</surname><given-names>TF</given-names></name><name><surname>Pitt</surname><given-names>JJ</given-names></name><name><surname>Hoadley</surname><given-names>KA</given-names></name><name><surname>Troester</surname><given-names>M</given-names></name><etal/></person-group><article-title>Comparison of breast cancer molecular features and survival by african and european ancestry in the cancer genome atlas</article-title><source>JAMA Oncol</source><volume>3</volume><fpage>1654</fpage><lpage>1662</lpage><year>2017</year><pub-id pub-id-type="doi">10.1001/jamaoncol.2017.0595</pub-id><pub-id pub-id-type="pmid">28472234</pub-id></element-citation></ref>
<ref id="b8-ol-25-02-13637"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>DeSantis</surname><given-names>CE</given-names></name><name><surname>Ma</surname><given-names>J</given-names></name><name><surname>Gaudet</surname><given-names>MM</given-names></name><name><surname>Newman</surname><given-names>LA</given-names></name><name><surname>Miller</surname><given-names>KD</given-names></name><name><surname>Goding Sauer</surname><given-names>A</given-names></name><name><surname>Jemal</surname><given-names>A</given-names></name><name><surname>Siegel</surname><given-names>RL</given-names></name></person-group><article-title>Breast cancer statistics, 2019</article-title><source>CA Cancer J Clin</source><volume>69</volume><fpage>438</fpage><lpage>451</lpage><year>2019</year><pub-id pub-id-type="doi">10.3322/caac.21555</pub-id><pub-id pub-id-type="pmid">31577379</pub-id></element-citation></ref>
<ref id="b9-ol-25-02-13637"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McLaren</surname><given-names>W</given-names></name><name><surname>Gil</surname><given-names>L</given-names></name><name><surname>Hunt</surname><given-names>SE</given-names></name><name><surname>Riat</surname><given-names>HS</given-names></name><name><surname>Ritchie</surname><given-names>GR</given-names></name><name><surname>Thormann</surname><given-names>A</given-names></name><name><surname>Flicek</surname><given-names>P</given-names></name><name><surname>Cunningham</surname><given-names>F</given-names></name></person-group><article-title>The ensembl variant effect predictor</article-title><source>Genome Biol</source><volume>17</volume><fpage>122</fpage><year>2016</year><pub-id pub-id-type="doi">10.1186/s13059-016-0974-4</pub-id><pub-id pub-id-type="pmid">27268795</pub-id></element-citation></ref>
<ref id="b10-ol-25-02-13637"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mayakonda</surname><given-names>A</given-names></name><name><surname>Lin</surname><given-names>DC</given-names></name><name><surname>Assenov</surname><given-names>Y</given-names></name><name><surname>Plass</surname><given-names>C</given-names></name><name><surname>Koeffler</surname><given-names>HP</given-names></name></person-group><article-title>Maftools: Efficient and comprehensive analysis of somatic variants in cancer</article-title><source>Genome Res</source><volume>28</volume><fpage>1747</fpage><lpage>1756</lpage><year>2018</year><pub-id pub-id-type="doi">10.1101/gr.239244.118</pub-id><pub-id pub-id-type="pmid">30341162</pub-id></element-citation></ref>
<ref id="b11-ol-25-02-13637"><label>11</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ge</surname><given-names>SX</given-names></name><name><surname>Jung</surname><given-names>D</given-names></name><name><surname>Yao</surname><given-names>R</given-names></name></person-group><article-title>ShinyGO: A graphical gene-set enrichment tool for animals and plants</article-title><source>Bioinformatics</source><volume>36</volume><fpage>2628</fpage><lpage>2629</lpage><year>2020</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btz931</pub-id><pub-id pub-id-type="pmid">31882993</pub-id></element-citation></ref>
<ref id="b12-ol-25-02-13637"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liberzon</surname><given-names>A</given-names></name><name><surname>Subramanian</surname><given-names>A</given-names></name><name><surname>Pinchback</surname><given-names>R</given-names></name><name><surname>Thorvaldsd&#x00F3;ttir</surname><given-names>H</given-names></name><name><surname>Tamayo</surname><given-names>P</given-names></name><name><surname>Mesirov</surname><given-names>JP</given-names></name></person-group><article-title>Molecular signatures database (MSigDB) 3.0</article-title><source>Bioinformatics</source><volume>27</volume><fpage>1739</fpage><lpage>1740</lpage><year>2011</year><pub-id pub-id-type="doi">10.1093/bioinformatics/btr260</pub-id><pub-id pub-id-type="pmid">21546393</pub-id></element-citation></ref>
<ref id="b13-ol-25-02-13637"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Adzhubei</surname><given-names>IA</given-names></name><name><surname>Schmidt</surname><given-names>S</given-names></name><name><surname>Peshkin</surname><given-names>L</given-names></name><name><surname>Ramensky</surname><given-names>VE</given-names></name><name><surname>Gerasimova</surname><given-names>A</given-names></name><name><surname>Bork</surname><given-names>P</given-names></name><name><surname>Kondrashov</surname><given-names>AS</given-names></name><name><surname>Sunyaev</surname><given-names>SR</given-names></name></person-group><article-title>A method and server for predicting damaging missense mutations</article-title><source>Nat Methods</source><volume>7</volume><fpage>248</fpage><lpage>249</lpage><year>2010</year><pub-id pub-id-type="doi">10.1038/nmeth0410-248</pub-id><pub-id pub-id-type="pmid">20354512</pub-id></element-citation></ref>
<ref id="b14-ol-25-02-13637"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vaser</surname><given-names>R</given-names></name><name><surname>Adusumalli</surname><given-names>S</given-names></name><name><surname>Leng</surname><given-names>SN</given-names></name><name><surname>Sikic</surname><given-names>M</given-names></name><name><surname>Ng</surname><given-names>PC</given-names></name></person-group><article-title>SIFT missense predictions for genomes</article-title><source>Nat Protoc</source><volume>11</volume><fpage>1</fpage><lpage>9</lpage><year>2016</year><pub-id pub-id-type="doi">10.1038/nprot.2015.123</pub-id><pub-id pub-id-type="pmid">26633127</pub-id></element-citation></ref>
<ref id="b15-ol-25-02-13637"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>K&#x00F6;hler</surname><given-names>S</given-names></name><name><surname>Gargano</surname><given-names>M</given-names></name><name><surname>Matentzoglu</surname><given-names>N</given-names></name><name><surname>Carmody</surname><given-names>LC</given-names></name><name><surname>Lewis-Smith</surname><given-names>D</given-names></name><name><surname>Vasilevsky</surname><given-names>NA</given-names></name><name><surname>Danis</surname><given-names>D</given-names></name><name><surname>Balagura</surname><given-names>G</given-names></name><name><surname>Baynam</surname><given-names>G</given-names></name><name><surname>Brower</surname><given-names>AM</given-names></name><etal/></person-group><article-title>The human phenotype ontology in 2021</article-title><source>Nucleic Acids Res</source><volume>49</volume><issue>D1</issue><fpage>D1207</fpage><lpage>D1217</lpage><year>2021</year><pub-id pub-id-type="doi">10.1093/nar/gkaa1043</pub-id><pub-id pub-id-type="pmid">33264411</pub-id></element-citation></ref>
<ref id="b16-ol-25-02-13637"><label>16</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sondka</surname><given-names>Z</given-names></name><name><surname>Bamford</surname><given-names>S</given-names></name><name><surname>Cole</surname><given-names>CG</given-names></name><name><surname>Ward</surname><given-names>SA</given-names></name><name><surname>Dunham</surname><given-names>I</given-names></name><name><surname>Forbes</surname><given-names>SA</given-names></name></person-group><article-title>The COSMIC cancer gene census: Describing genetic dysfunction across all human cancers</article-title><source>Nat Rev Cancer</source><volume>18</volume><fpage>696</fpage><lpage>705</lpage><year>2018</year><pub-id pub-id-type="doi">10.1038/s41568-018-0060-1</pub-id><pub-id pub-id-type="pmid">30293088</pub-id></element-citation></ref>
<ref id="b17-ol-25-02-13637"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cerami</surname><given-names>E</given-names></name><name><surname>Gao</surname><given-names>J</given-names></name><name><surname>Dogrusoz</surname><given-names>U</given-names></name><name><surname>Gross</surname><given-names>BE</given-names></name><name><surname>Sumer</surname><given-names>SO</given-names></name><name><surname>Aksoy</surname><given-names>BA</given-names></name><name><surname>Jacobsen</surname><given-names>A</given-names></name><name><surname>Byrne</surname><given-names>CJ</given-names></name><name><surname>Heuer</surname><given-names>ML</given-names></name><name><surname>Larsson</surname><given-names>E</given-names></name><etal/></person-group><article-title>The cBio cancer genomics portal: An open platform for exploring multidimensional cancer genomics data</article-title><source>Cancer Discov</source><volume>2</volume><fpage>401</fpage><lpage>404</lpage><year>2012</year><pub-id pub-id-type="doi">10.1158/2159-8290.CD-12-0095</pub-id><pub-id pub-id-type="pmid">22588877</pub-id></element-citation></ref>
<ref id="b18-ol-25-02-13637"><label>18</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Keenan</surname><given-names>T</given-names></name><name><surname>Moy</surname><given-names>B</given-names></name><name><surname>Mroz</surname><given-names>EA</given-names></name><name><surname>Ross</surname><given-names>K</given-names></name><name><surname>Niemierko</surname><given-names>A</given-names></name><name><surname>Rocco</surname><given-names>JW</given-names></name><name><surname>Isakoff</surname><given-names>S</given-names></name><name><surname>Ellisen</surname><given-names>LW</given-names></name><name><surname>Bardia</surname><given-names>A</given-names></name></person-group><article-title>Comparison of the genomic landscape between primary breast cancer in African American versus white women and the association of racial differences with tumor recurrence</article-title><source>J Clin Oncol</source><volume>33</volume><fpage>3621</fpage><lpage>3627</lpage><year>2015</year><pub-id pub-id-type="doi">10.1200/jco.2015.33.15_suppl.6553</pub-id><pub-id pub-id-type="pmid">26371147</pub-id></element-citation></ref>
<ref id="b19-ol-25-02-13637"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Olivier</surname><given-names>M</given-names></name><name><surname>Hollstein</surname><given-names>M</given-names></name><name><surname>Hainaut</surname><given-names>P</given-names></name></person-group><article-title>TP53 mutations in human cancers: Origins, consequences, and clinical use</article-title><source>Cold Spring Harb Perspect Biol</source><volume>2</volume><fpage>a001008</fpage><year>2010</year><pub-id pub-id-type="doi">10.1101/cshperspect.a001008</pub-id><pub-id pub-id-type="pmid">20182602</pub-id></element-citation></ref>
<ref id="b20-ol-25-02-13637"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname><given-names>SA</given-names></name><name><surname>Lawrence</surname><given-names>MS</given-names></name><name><surname>Klimczak</surname><given-names>LJ</given-names></name><name><surname>Grimm</surname><given-names>SA</given-names></name><name><surname>Fargo</surname><given-names>D</given-names></name><name><surname>Stojanov</surname><given-names>P</given-names></name><name><surname>Kiezun</surname><given-names>A</given-names></name><name><surname>Kryukov</surname><given-names>GV</given-names></name><name><surname>Carter</surname><given-names>SL</given-names></name><name><surname>Saksena</surname><given-names>G</given-names></name><etal/></person-group><article-title>An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers</article-title><source>Nat Genet</source><volume>45</volume><fpage>970</fpage><lpage>976</lpage><year>2013</year><pub-id pub-id-type="doi">10.1038/ng.2702</pub-id><pub-id pub-id-type="pmid">23852170</pub-id></element-citation></ref>
<ref id="b21-ol-25-02-13637"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Burns</surname><given-names>MB</given-names></name><name><surname>Lackey</surname><given-names>L</given-names></name><name><surname>Carpenter</surname><given-names>MA</given-names></name><name><surname>Rathore</surname><given-names>A</given-names></name><name><surname>Land</surname><given-names>AM</given-names></name><name><surname>Leonard</surname><given-names>B</given-names></name><name><surname>Refsland</surname><given-names>EW</given-names></name><name><surname>Kotandeniya</surname><given-names>D</given-names></name><name><surname>Tretyakova</surname><given-names>N</given-names></name><name><surname>Nikas</surname><given-names>JB</given-names></name><etal/></person-group><article-title>APOBEC3B is an enzymatic source of mutation in breast cancer</article-title><source>Nature</source><volume>494</volume><fpage>366</fpage><lpage>370</lpage><year>2013</year><pub-id pub-id-type="doi">10.1038/nature11881</pub-id><pub-id pub-id-type="pmid">23389445</pub-id></element-citation></ref>
<ref id="b22-ol-25-02-13637"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sanchez-Vega</surname><given-names>F</given-names></name><name><surname>Mina</surname><given-names>M</given-names></name><name><surname>Armenia</surname><given-names>J</given-names></name><name><surname>Chatila</surname><given-names>WK</given-names></name><name><surname>Luna</surname><given-names>A</given-names></name><name><surname>La</surname><given-names>KC</given-names></name><name><surname>Dimitriadoy</surname><given-names>S</given-names></name><name><surname>Liu</surname><given-names>DL</given-names></name><name><surname>Kantheti</surname><given-names>HS</given-names></name><name><surname>Saghafinia</surname><given-names>S</given-names></name><etal/></person-group><article-title>Oncogenic signaling pathways in the cancer genome atlas</article-title><source>Cell</source><volume>173</volume><fpage>321</fpage><lpage>337.e10</lpage><year>2018</year><pub-id pub-id-type="doi">10.1016/j.cell.2018.03.035</pub-id><pub-id pub-id-type="pmid">29625050</pub-id></element-citation></ref>
<ref id="b23-ol-25-02-13637"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lawrence</surname><given-names>MS</given-names></name><name><surname>Stojanov</surname><given-names>P</given-names></name><name><surname>Polak</surname><given-names>P</given-names></name><name><surname>Kryukov</surname><given-names>GV</given-names></name><name><surname>Cibulskis</surname><given-names>K</given-names></name><name><surname>Sivachenko</surname><given-names>A</given-names></name><name><surname>Carter</surname><given-names>SL</given-names></name><name><surname>Stewart</surname><given-names>C</given-names></name><name><surname>Mermel</surname><given-names>CH</given-names></name><name><surname>Roberts</surname><given-names>SA</given-names></name><etal/></person-group><article-title>Mutational heterogeneity in cancer and the search for new cancer-associated genes</article-title><source>Nature</source><volume>499</volume><fpage>214</fpage><lpage>218</lpage><year>2013</year><pub-id pub-id-type="doi">10.1038/nature12213</pub-id><pub-id pub-id-type="pmid">23770567</pub-id></element-citation></ref>
<ref id="b24-ol-25-02-13637"><label>24</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lawrence</surname><given-names>MS</given-names></name><name><surname>Stojanov</surname><given-names>P</given-names></name><name><surname>Mermel</surname><given-names>CH</given-names></name><name><surname>Robinson</surname><given-names>JT</given-names></name><name><surname>Garraway</surname><given-names>LA</given-names></name><name><surname>Golub</surname><given-names>TR</given-names></name><name><surname>Meyerson</surname><given-names>M</given-names></name><name><surname>Gabriel</surname><given-names>SB</given-names></name><name><surname>Lander</surname><given-names>ES</given-names></name><name><surname>Getz</surname><given-names>G</given-names></name></person-group><article-title>Discovery and saturation analysis of cancer genes across 21 tumour types</article-title><source>Nature</source><volume>505</volume><fpage>495</fpage><lpage>501</lpage><year>2014</year><pub-id pub-id-type="doi">10.1038/nature12912</pub-id><pub-id pub-id-type="pmid">24390350</pub-id></element-citation></ref>
<ref id="b25-ol-25-02-13637"><label>25</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>R</given-names></name><name><surname>Yu</surname><given-names>Y</given-names></name><name><surname>Zong</surname><given-names>X</given-names></name><name><surname>Li</surname><given-names>X</given-names></name><name><surname>Ma</surname><given-names>L</given-names></name><name><surname>Zheng</surname><given-names>Q</given-names></name></person-group><article-title>Monomethyltransferase SETD8 regulates breast cancer metabolism via stabilizing hypoxia-inducible factor 1&#x03B1;</article-title><source>Cancer Lett</source><volume>390</volume><fpage>1</fpage><lpage>10</lpage><year>2017</year><pub-id pub-id-type="doi">10.1016/j.canlet.2016.12.038</pub-id><pub-id pub-id-type="pmid">28089831</pub-id></element-citation></ref>
<ref id="b26-ol-25-02-13637"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Paulsen</surname><given-names>RD</given-names></name><name><surname>Soni</surname><given-names>DV</given-names></name><name><surname>Wollman</surname><given-names>R</given-names></name><name><surname>Hahn</surname><given-names>AT</given-names></name><name><surname>Yee</surname><given-names>MC</given-names></name><name><surname>Guan</surname><given-names>A</given-names></name><name><surname>Hesley</surname><given-names>JA</given-names></name><name><surname>Miller</surname><given-names>SC</given-names></name><name><surname>Cromwell</surname><given-names>EF</given-names></name><name><surname>Solow-Cordero</surname><given-names>DE</given-names></name><etal/></person-group><article-title>A genome-wide siRNA screen reveals diverse cellular processes and pathways that mediate genome stability</article-title><source>Mol Cell</source><volume>35</volume><fpage>228</fpage><lpage>239</lpage><year>2009</year><pub-id pub-id-type="doi">10.1016/j.molcel.2009.06.021</pub-id><pub-id pub-id-type="pmid">19647519</pub-id></element-citation></ref>
<ref id="b27-ol-25-02-13637"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname><given-names>H</given-names></name><name><surname>Chen</surname><given-names>W</given-names></name><name><surname>Ming</surname><given-names>J</given-names></name><name><surname>Zhong</surname><given-names>R</given-names></name><name><surname>Yi</surname><given-names>P</given-names></name><name><surname>Zhu</surname><given-names>B</given-names></name><name><surname>Miao</surname><given-names>X</given-names></name><name><surname>Huang</surname><given-names>T</given-names></name></person-group><article-title>Association between polymorphisms in cdc27 and breast cancer in a Chinese population</article-title><source>Tumour Biol</source><volume>36</volume><fpage>5299</fpage><lpage>5304</lpage><year>2015</year><pub-id pub-id-type="doi">10.1007/s13277-014-2912-y</pub-id><pub-id pub-id-type="pmid">25680405</pub-id></element-citation></ref>
<ref id="b28-ol-25-02-13637"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Talvinen</surname><given-names>K</given-names></name><name><surname>Karra</surname><given-names>H</given-names></name><name><surname>Pitk&#x00E4;nen</surname><given-names>R</given-names></name><name><surname>Ahonen</surname><given-names>I</given-names></name><name><surname>Nyk&#x00E4;nen</surname><given-names>M</given-names></name><name><surname>Lintunen</surname><given-names>M</given-names></name><name><surname>S&#x00F6;derstr&#x00F6;m</surname><given-names>M</given-names></name><name><surname>Kuopio</surname><given-names>T</given-names></name><name><surname>Kronqvist</surname><given-names>P</given-names></name></person-group><article-title>Low cdc27 and high securin expression predict short survival for breast cancer patients</article-title><source>APMIS</source><volume>121</volume><fpage>945</fpage><lpage>953</lpage><year>2013</year><pub-id pub-id-type="doi">10.1111/apm.12110</pub-id><pub-id pub-id-type="pmid">23755904</pub-id></element-citation></ref>
<ref id="b29-ol-25-02-13637"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jackson</surname><given-names>SP</given-names></name><name><surname>Bartek</surname><given-names>J</given-names></name></person-group><article-title>The DNA-damage response in human biology and disease</article-title><source>Nature</source><volume>461</volume><fpage>1071</fpage><lpage>1078</lpage><year>2009</year><pub-id pub-id-type="doi">10.1038/nature08467</pub-id><pub-id pub-id-type="pmid">19847258</pub-id></element-citation></ref>
<ref id="b30-ol-25-02-13637"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Andresen</surname><given-names>MS</given-names></name><name><surname>Sletten</surname><given-names>M</given-names></name><name><surname>Sandset</surname><given-names>PM</given-names></name><name><surname>Iversen</surname><given-names>N</given-names></name><name><surname>Stavik</surname><given-names>B</given-names></name><name><surname>Tinholt</surname><given-names>M</given-names></name></person-group><article-title>Coagulation factor V (F5) is an estrogen-responsive gene in breast cancer cells</article-title><source>Thromb Haemost</source><volume>122</volume><fpage>1288</fpage><lpage>1295</lpage><year>2022</year><pub-id pub-id-type="doi">10.1055/a-1707-2130</pub-id><pub-id pub-id-type="pmid">34826880</pub-id></element-citation></ref>
<ref id="b31-ol-25-02-13637"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tinholt</surname><given-names>M</given-names></name><name><surname>Stavik</surname><given-names>B</given-names></name><name><surname>Tekpli</surname><given-names>X</given-names></name><name><surname>Garred</surname><given-names>&#x00D8;</given-names></name><name><surname>Borgen</surname><given-names>E</given-names></name><name><surname>Kristensen</surname><given-names>V</given-names></name><name><surname>Sahlberg</surname><given-names>KK</given-names></name><name><surname>Sandset</surname><given-names>PM</given-names></name><name><surname>Iversen</surname><given-names>N</given-names></name></person-group><article-title>Coagulation factor V is a marker of tumor-infiltrating immune cells in breast cancer</article-title><source>Oncoimmunology</source><volume>9</volume><fpage>1824644</fpage><year>2020</year><pub-id pub-id-type="doi">10.1080/2162402X.2020.1824644</pub-id><pub-id pub-id-type="pmid">33457104</pub-id></element-citation></ref>
<ref id="b32-ol-25-02-13637"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Parkes</surname><given-names>EE</given-names></name><name><surname>Walker</surname><given-names>SM</given-names></name><name><surname>Taggart</surname><given-names>LE</given-names></name><name><surname>McCabe</surname><given-names>N</given-names></name><name><surname>Knight</surname><given-names>LA</given-names></name><name><surname>Wilkinson</surname><given-names>R</given-names></name><name><surname>McCloskey</surname><given-names>KD</given-names></name><name><surname>Buckley</surname><given-names>NE</given-names></name><name><surname>Savage</surname><given-names>KI</given-names></name><name><surname>Salto-Tellez</surname><given-names>M</given-names></name><etal/></person-group><article-title>Activation of STING-dependent innate immune signaling by S-phase-specific DNA damage in breast cancer</article-title><source>J Natl Cancer Inst</source><volume>109</volume><fpage>djw199</fpage><year>2016</year><pub-id pub-id-type="doi">10.1093/jnci/djw199</pub-id><pub-id pub-id-type="pmid">27707838</pub-id></element-citation></ref>
<ref id="b33-ol-25-02-13637"><label>33</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Motwani</surname><given-names>M</given-names></name><name><surname>Pesiridis</surname><given-names>S</given-names></name><name><surname>Fitzgerald</surname><given-names>KA</given-names></name></person-group><article-title>DNA sensing by the cGAS-STING pathway in health and disease</article-title><source>Nat Rev Genet</source><volume>20</volume><fpage>657</fpage><lpage>674</lpage><year>2019</year><pub-id pub-id-type="doi">10.1038/s41576-019-0151-1</pub-id><pub-id pub-id-type="pmid">31358977</pub-id></element-citation></ref>
<ref id="b34-ol-25-02-13637"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cortes-Ciriano</surname><given-names>I</given-names></name><name><surname>Lee</surname><given-names>S</given-names></name><name><surname>Park</surname><given-names>WY</given-names></name><name><surname>Kim</surname><given-names>TM</given-names></name><name><surname>Park</surname><given-names>PJ</given-names></name></person-group><article-title>A molecular portrait of microsatellite instability across multiple cancers</article-title><source>Nat Commun</source><volume>8</volume><fpage>15180</fpage><year>2017</year><pub-id pub-id-type="doi">10.1038/ncomms15180</pub-id><pub-id pub-id-type="pmid">28585546</pub-id></element-citation></ref>
<ref id="b35-ol-25-02-13637"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vernochet</surname><given-names>C</given-names></name><name><surname>Peres</surname><given-names>SB</given-names></name><name><surname>Davis</surname><given-names>KE</given-names></name><name><surname>McDonald</surname><given-names>ME</given-names></name><name><surname>Qiang</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>H</given-names></name><name><surname>Scherer</surname><given-names>PE</given-names></name><name><surname>Farmer</surname><given-names>SR</given-names></name></person-group><article-title>C/EBPalpha and the corepressors CtBP1 and CtBP2 regulate repression of select visceral white adipose genes during induction of the brown phenotype in white adipocytes by peroxisome proliferator-activated receptor gamma agonists</article-title><source>Mol Cell Biol</source><volume>29</volume><fpage>4714</fpage><lpage>4728</lpage><year>2009</year><pub-id pub-id-type="doi">10.1128/MCB.01899-08</pub-id><pub-id pub-id-type="pmid">19564408</pub-id></element-citation></ref>
<ref id="b36-ol-25-02-13637"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>YI</given-names></name></person-group><article-title>Role of the MTHFR polymorphisms in cancer risk modification and treatment</article-title><source>Future Oncol</source><volume>5</volume><fpage>523</fpage><lpage>542</lpage><year>2009</year><pub-id pub-id-type="doi">10.2217/fon.09.26</pub-id><pub-id pub-id-type="pmid">19450180</pub-id></element-citation></ref>
<ref id="b37-ol-25-02-13637"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sohn</surname><given-names>KJ</given-names></name><name><surname>Croxford</surname><given-names>R</given-names></name><name><surname>Yates</surname><given-names>Z</given-names></name><name><surname>Lucock</surname><given-names>M</given-names></name><name><surname>Kim</surname><given-names>YI</given-names></name></person-group><article-title>Effect of the methylenetetrahydrofolate reductase C677T polymorphism on chemosensitivity of colon and breast cancer cells to 5-fluorouracil and methotrexate</article-title><source>J Natl Cancer Inst</source><volume>96</volume><fpage>134</fpage><lpage>144</lpage><year>2004</year><pub-id pub-id-type="doi">10.1093/jnci/djh015</pub-id><pub-id pub-id-type="pmid">14734703</pub-id></element-citation></ref>
<ref id="b38-ol-25-02-13637"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Guan</surname><given-names>Y</given-names></name><name><surname>Xu</surname><given-names>B</given-names></name><name><surname>Sui</surname><given-names>Y</given-names></name><name><surname>Chen</surname><given-names>Z</given-names></name><name><surname>Luan</surname><given-names>Y</given-names></name><name><surname>Jiang</surname><given-names>Y</given-names></name><name><surname>Wei</surname><given-names>L</given-names></name><name><surname>Long</surname><given-names>W</given-names></name><name><surname>Zhao</surname><given-names>S</given-names></name><name><surname>Han</surname><given-names>L</given-names></name><etal/></person-group><article-title>Pan-cancer analysis and validation reveals that D-dimer-related genes are prognostic and downregulate CD8<sup>&#x002B;</sup> T cells via TGF-Beta signaling in gastric cancer</article-title><source>Front Mol Biosci</source><volume>9</volume><fpage>790706</fpage><year>2022</year><pub-id pub-id-type="doi">10.3389/fmolb.2022.790706</pub-id><pub-id pub-id-type="pmid">35274004</pub-id></element-citation></ref>
<ref id="b39-ol-25-02-13637"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname><given-names>Z</given-names></name><name><surname>Gao</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name><name><surname>Xu</surname><given-names>J</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name></person-group><article-title>PRSS contributes to cetuximab resistance in colorectal cancer</article-title><source>Sci Adv</source><volume>6</volume><fpage>eaax5576</fpage><year>2020</year><pub-id pub-id-type="doi">10.1126/sciadv.aax5576</pub-id><pub-id pub-id-type="pmid">31911942</pub-id></element-citation></ref>
<ref id="b40-ol-25-02-13637"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Weiss</surname><given-names>FU</given-names></name></person-group><article-title>Pancreatic cancer risk in hereditary pancreatitis</article-title><source>Front Physiol</source><volume>5</volume><fpage>70</fpage><year>2014</year><pub-id pub-id-type="doi">10.3389/fphys.2014.00070</pub-id><pub-id pub-id-type="pmid">24600409</pub-id></element-citation></ref>
<ref id="b41-ol-25-02-13637"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Milite</surname><given-names>C</given-names></name><name><surname>Feoli</surname><given-names>A</given-names></name><name><surname>Viviano</surname><given-names>M</given-names></name><name><surname>Rescigno</surname><given-names>D</given-names></name><name><surname>Cianciulli</surname><given-names>A</given-names></name><name><surname>Balzano</surname><given-names>AL</given-names></name><name><surname>Mai</surname><given-names>A</given-names></name><name><surname>Castellano</surname><given-names>S</given-names></name><name><surname>Sbardella</surname><given-names>G</given-names></name></person-group><article-title>The emerging role of lysine methyltransferase SETD8 in human diseases</article-title><source>Clin Epigenetics</source><volume>8</volume><fpage>102</fpage><year>2016</year><pub-id pub-id-type="doi">10.1186/s13148-016-0268-4</pub-id><pub-id pub-id-type="pmid">27688818</pub-id></element-citation></ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-ol-25-02-13637" position="float">
<label>Figure 1.</label>
<caption><p>Overall sample summary in the African-American and Hispanic cohorts. The figure shows top mutated genes after filtering for all variants with the Single Nucleotide Polymorphism Database identifiers. (A) Log2 trans-formed TMB for the samples is shown (mutation/exome capture size). There was no significant difference between African-American and Hispanic samples (P=0.69; Mann Whitney U test; unpaired). Oncoplot showing the top 20 significant mutated genes in (B) African-American and (C) Hispanic samples. (B and C) Top: TMB displaying total somatic mutations in each sample. The bottom legend shows each cohort sample&#x0027;s subtype (Luminal A/B, Her2 and TNBC). Bottom right: Color key showing the different categories of mutations shown. AA, African-American; Del, deletion; Hisp, Hispanic; Ins, insertion; TMB, tumor mutational burden; TNBC, triple-negative breast cancer.</p></caption>
<graphic xlink:href="ol-25-02-13637-g00.tif"/>
</fig>
<fig id="f2-ol-25-02-13637" position="float">
<label>Figure 2.</label>
<caption><p>Summary of mutations in breast cancer driver genes and genes associated with DNA damage response across all tumor samples. Tumor samples from African American (top) and Hispanic (bottom) patients are shown. Tumor samples include variants listed in the Single Nucleotide Polymorphism Database. Variants also include known pathogenic variants. Each column is an individual sample. The tables underneath the images show subtypes: Luminal (including both A and B), Her2-positive and TNBC. TNBC, triple-negative breast cancer.</p></caption>
<graphic xlink:href="ol-25-02-13637-g01.tif"/>
</fig>
<fig id="f3-ol-25-02-13637" position="float">
<label>Figure 3.</label>
<caption><p>Mutated amino acid residues in the TP53 protein combining African-American and Hispanic patients are shown in a lollipop plot. In addition, both missense (green) and nonsense (red) mutations are shown. The domain of TP53 is indicated as follows: TAD (green), P53 (red) and P53_tetramer (blue). The amino acid changes are shown using the single-letter codes. The asterisks in the nonsense mutations indicate stop codons. TAD, trans activation domain. P53, P53 DNA binding domain; P53_tetramer, tetramerization domain.</p></caption>
<graphic xlink:href="ol-25-02-13637-g02.tif"/>
</fig>
<fig id="f4-ol-25-02-13637" position="float">
<label>Figure 4.</label>
<caption><p>Mutational signatures are shown for our and TCGA African-American and Hispanic cohorts. The mutational signature was calculated with two significant signatures with the best similarity with the COSMIC SBS signature. The TCGA African-American and Hispanic data were used to find the top four and three signatures, respectively. (A) Our cohort. (B) TCGA African American. (C) TCGA Hispanic. The result was plotted with the current SBS signature from the COSMIC database for African-American and Hispanic samples. (D) APOBEC enrichment score. No significant difference was observed among the four groups based on one-way ANOVA and Tukey&#x0027;s multiple comparisons. AA, African-American; APOBEC, apolipoprotein B mRNA editing catalytic polypeptide-like; DCRT, Division of Cancer Research and Training, Charles R. Drew University; HR, homologous recombination; Hisp, Hispanic; SBS, single base substitution; TCGA, The Cancer Genome Atlas.</p></caption>
<graphic xlink:href="ol-25-02-13637-g03.tif"/>
</fig>
<fig id="f5-ol-25-02-13637" position="float">
<label>Figure 5.</label>
<caption><p>Affected oncogenic pathways in patients with breast cancer. The total number of genes and the number of genes affected (with variants) are shown for each oncogenic pathway. Left: African-American breast cancer data. Right: Hispanic breast cancer data. Oncogenic pathways in the corresponding TCGA cohorts are shown at the bottom. NRF2, nuclear factor erythroid 2-related factor 2; RTK, receptor tyrosine kinase; TCGA, The Cancer Genome Atlas.</p></caption>
<graphic xlink:href="ol-25-02-13637-g04.tif"/>
</fig>
<fig id="f6-ol-25-02-13637" position="float">
<label>Figure 6.</label>
<caption><p>Frequently mutated genes and pathway enrichment of MutSig significantly mutated candidate driver genes. Left: MutSig candidate driver genes in each group and the shared genes are shown. The gene symbols show 11 genes common between African-American and Hispanic cohorts in our dataset. Right: Gene Ontology pathway enrichment for frequently mutated genes (identified by MutSig) in the TCGA African-American category. Fold enrichment is shown in the columns. The FDR was &#x003C;0.05 for each category. Venn diagram from <uri xlink:href="https://bioinformatics.psb.ugent.be/webtools/Venn/">https://bioinformatics.psb.ugent.be/webtools/Venn/</uri>. FDR, false discovery rate; MutSig, mutation significance; TCGA, The Cancer Genome Atlas.</p></caption>
<graphic xlink:href="ol-25-02-13637-g05.tif"/>
</fig>
<table-wrap id="tI-ol-25-02-13637" position="float">
<label>Table I.</label>
<caption><p>Description of the breast cancer samples used in the present study.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Variable</th>
<th align="center" valign="bottom">No. (&#x0025;)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Ethnicity</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;African-American</td>
<td align="center" valign="top">13 (44.8)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Hispanic</td>
<td align="center" valign="top">15 (55.2)</td>
</tr>
<tr>
<td align="left" valign="top">Subtype</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Luminal</td>
<td align="center" valign="top">15 (55.2)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;AA</td>
<td align="center" valign="top">8 (61.5)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;Hisp</td>
<td align="center" valign="top">7 (46.7)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;Her2 enriched</td>
<td align="center" valign="top">5 (17.2)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;AA</td>
<td align="center" valign="top">1 (7.7)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;Hisp</td>
<td align="center" valign="top">4 (26.6)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;TNBC</td>
<td align="center" valign="top">6 (17.2)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;AA</td>
<td align="center" valign="top">3 (23.1)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;Hisp</td>
<td align="center" valign="top">3 (23.1)</td>
</tr>
<tr>
<td align="left" valign="top">Age, years</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;30-50</td>
<td align="center" valign="top">10 (35.7)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;AA</td>
<td align="center" valign="top">3 (23.1)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;Hisp</td>
<td align="center" valign="top">7 (46.7)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x003E;50</td>
<td align="center" valign="top">18 (64.3)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;AA</td>
<td align="center" valign="top">10 (76.9)</td>
</tr>
<tr>
<td align="left" valign="top">&#x00A0;&#x00A0;&#x00A0;&#x00A0;Hisp</td>
<td align="center" valign="top">8 (53.3)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-ol-25-02-13637"><p>AA, African-American; Hisp, Hispanic; TNBC, triple-negative breast cancer.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tII-ol-25-02-13637" position="float">
<label>Table II.</label>
<caption><p>Details of mutations in the African-American and Hispanic patients.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Symbol</th>
<th align="center" valign="bottom">Approved name</th>
<th align="center" valign="bottom">Mutations</th>
<th align="center" valign="bottom">Total samples, n</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">KDM6A</td>
<td align="left" valign="top">Lysine demethylase 6A</td>
<td align="left" valign="top">c.2703-5dup/del, intronic</td>
<td align="center" valign="top">4</td>
</tr>
<tr>
<td align="left" valign="top">PABPC1</td>
<td align="left" valign="top">Poly(A) binding protein cytoplasmic 1</td>
<td align="left" valign="top">p.K254Nfs&#x002A;24</td>
<td align="center" valign="top">4</td>
</tr>
<tr>
<td align="left" valign="top">ARID1B</td>
<td align="left" valign="top">AT-rich interaction domain 1B</td>
<td align="left" valign="top">p.Q130_Q131dup; p.Q131dup</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">MUC16</td>
<td align="left" valign="top">Mucin 16, cell surface associated</td>
<td align="left" valign="top">p.M2786V; P13559N</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">AKT3</td>
<td align="left" valign="top">AKT serine/threonine kinase 3</td>
<td align="left" valign="top">p.L208&#x002A;</td>
<td align="center" valign="top">1</td>
</tr>
<tr>
<td align="left" valign="top">ZNF384</td>
<td align="left" valign="top">Zinc finger protein 384</td>
<td align="left" valign="top">p.Q547del</td>
<td align="center" valign="top">3</td>
</tr>
<tr>
<td align="left" valign="top">EIF1AX</td>
<td align="left" valign="top">Eukaryotic translation initiation factor 1A X-linked</td>
<td align="left" valign="top">c.337&#x002B;1G&#x003E;C; C256-3A&#x003E;C, splice site</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">KMT2C</td>
<td align="left" valign="top">Lysine methyltransferase 2C</td>
<td align="left" valign="top">p.A30P; p.M1774T</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">MUC4</td>
<td align="left" valign="top">Mucin 4, cell surface associated</td>
<td align="left" valign="top">p.V3635F; p.G4028S</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">SGK1</td>
<td align="left" valign="top">Serum/glucocorticoid regulated kinase 1</td>
<td align="left" valign="top">c.285&#x002B;50T&#x003E;C; c.362-3230T&#x003E;G; intron</td>
<td align="center" valign="top">2</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-ol-25-02-13637"><p>Example genes from the top 50 frequently mutated genes [Online Mendelian Inheritance of Man (<uri xlink:href="https://omim.org/">https://omim.org/</uri>) and Human Phenotype Ontology] and the COSMIC Census (updated as of February 2022) are shown. The top 5 are from African-American patients and the bottom 5 are from Hispanic patients. The protein changes are shown with single letter amino acid codes. The asterisks indicate stop codons.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="tIII-ol-25-02-13637" position="float">
<label>Table III.</label>
<caption><p>Breast cancer and DNA damage response-related genetic variants.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom" colspan="5">A, African-American</th>
</tr>
<tr>
<th align="left" valign="bottom" colspan="5"><hr/></th>
</tr>
<tr>
<th align="left" valign="bottom">Gene symbol</th>
<th align="center" valign="bottom">Protein variant</th>
<th align="center" valign="bottom">dbSNP ID</th>
<th align="center" valign="bottom">gnomAD, &#x0025;</th>
<th align="center" valign="bottom">COSMIC ID</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.Y600H</td>
<td align="center" valign="top">75419644</td>
<td align="center" valign="top">0.051</td>
<td align="center" valign="top">7349601</td>
</tr>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.G715G</td>
<td align="center" valign="top">112566179</td>
<td align="center" valign="top">0.015</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.L929S</td>
<td align="center" valign="top">2227943</td>
<td align="center" valign="top">0.097</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.N987I</td>
<td align="center" valign="top">2227944</td>
<td align="center" valign="top">0.096</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.D1902N</td>
<td align="center" valign="top">4987048</td>
<td align="center" valign="top">0.195</td>
<td align="center" valign="top">9269275</td>
</tr>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.H2116R</td>
<td align="center" valign="top">55953736</td>
<td align="center" valign="top">0.134</td>
<td align="center" valign="top">4985277</td>
</tr>
<tr>
<td align="left" valign="top">BRCA2</td>
<td align="left" valign="top">p.R2502C</td>
<td align="center" valign="top">55716624</td>
<td align="center" valign="top">0.033</td>
<td align="center" valign="top">6958612</td>
</tr>
<tr>
<td align="left" valign="top">BRCA1</td>
<td align="left" valign="top">p.T826K</td>
<td align="center" valign="top">28897683</td>
<td align="center" valign="top">0.018</td>
<td align="center" valign="top">7343747</td>
</tr>
<tr>
<td align="left" valign="top">BRCA1</td>
<td align="left" valign="top">p.N723D; p.N676D</td>
<td align="center" valign="top">4986845</td>
<td align="center" valign="top">0.058</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">RAD51B</td>
<td align="left" valign="top">p.S212A; p.S131A; p.S250A</td>
<td align="center" valign="top">33929366</td>
<td align="center" valign="top">0.284</td>
<td align="center" valign="top">9494712</td>
</tr>
<tr>
<td align="left" valign="top">XRCC3</td>
<td align="left" valign="top">p.R243H</td>
<td align="center" valign="top">77381814</td>
<td align="center" valign="top">0.198</td>
<td align="center" valign="top">8488089</td>
</tr>
<tr>
<td align="left" valign="top">BARD1</td>
<td align="left" valign="top">p.I738V</td>
<td align="center" valign="top">61754118</td>
<td align="center" valign="top">0.747</td>
<td align="center" valign="top">7349100</td>
</tr>
<tr>
<td align="left" valign="top">BARD1</td>
<td align="left" valign="top">p.G184G; p.G203G</td>
<td align="center" valign="top">28997574</td>
<td align="center" valign="top">0.878</td>
<td align="center" valign="top">9494804</td>
</tr>
<tr>
<td align="left" valign="top">RAD52</td>
<td align="left" valign="top">p.Q377&#x002A;; p.Q300&#x002A;</td>
<td align="center" valign="top">1024866946</td>
<td align="center" valign="top">0.001</td>
<td/>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><bold>B, Hispanic</bold></td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Gene symbol</bold></td>
<td align="left" valign="top"><bold>Protein variant</bold></td>
<td align="center" valign="top"><bold>dbSNP ID</bold></td>
<td align="center" valign="top"><bold>gnomAD, &#x0025;</bold></td>
<td align="center" valign="top"><bold>COSMIC ID</bold></td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">BRCA1</td>
<td align="left" valign="top">p.I1275V; p.I1228V</td>
<td align="center" valign="top">80357280</td>
<td align="center" valign="top">0.015</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">HELQ</td>
<td align="left" valign="top">p.L802V; p.L372V; p.L325V; p.L869V</td>
<td align="center" valign="top">1344701424</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">PALB2</td>
<td align="left" valign="top">p.S524S; p.S229S</td>
<td align="center" valign="top">45472400</td>
<td align="center" valign="top">0.319</td>
<td align="center" valign="top">9494341</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn3-ol-25-02-13637"><p>The specific protein variants are shown with single-letter amino acid code. dbSNP IDs and COSMIC IDs with the allele frequencies in the gnomAD database are shown when available. The asterisks indicate stop codons. dbSNP, Single Nucleotide Polymorphism Database; gnomAD, Genome Aggregation Database.</p></fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</article>
