Application of mixsep software package: Performance verification of male-mixed DNA analysis
- Authors:
- Na Hu
- Bin Cong
- Tao Gao
- Yu Chen
- Junyi Shen
- Shujin Li
- Chunling Ma
View Affiliations
Affiliations: Department of Forensic Medicine, Hebei Key Laboratory of Forensic Medicine, Hebei Medical University, Shijiazhuang, Hebei 050017, P.R. China, Department of Statistics, Institute of Statistics, Renmin University of China, Beijing 100872, P.R. China, Laboratory of Forensic DNA Testing, Institute of Forensic Science, Public Security Department of Shanxi, Taiyuan, Shanxi 030001, P.R. China
- Published online on: April 30, 2015 https://doi.org/10.3892/mmr.2015.3710
-
Pages:
2431-2442
-
Copyright: © Hu
et al. This is an open access article distributed under the
terms of Creative
Commons Attribution License [CC BY_NC 3.0].
Metrics: Total
Views: 0 (Spandidos Publications: | PMC Statistics: )
Metrics: Total PDF Downloads: 0 (Spandidos Publications: | PMC Statistics: )
This article is mentioned in:
Abstract
An experimental model of male-mixed DNA (n=297) was constructed according to the mixed DNA construction principle. This comprised the use of the Applied Biosystems (ABI) 7500 quantitative polymerase chain reaction system, with scientific validation of mixture proportion (Mx; root‑mean‑square error ≤0.02). Statistical analysis was performed on locus separation accuracy using mixsep, a DNA mixture separation R‑package, and the analytical performance of mixsep was assessed by examining the data distribution pattern of different mixed gradients, short tandem repeat (STR) loci and mixed DNA types. The results showed that locus separation accuracy had a negative linear correlation with the mixed gradient (R2=‑0.7121). With increasing mixed gradient imbalance, locus separation accuracy first increased and then decreased, with the highest value detected at a gradient of 1:3 (≥90%). The mixed gradient, which is the theoretical Mx, was one of the primary factors that influenced the success of mixed DNA analysis. Among the 16 STR loci detected by Identifiler®, the separation accuracy was relatively high (>88%) for loci D5S818, D8S1179 and FGA, whereas the median separation accuracy value was lowest for the D7S820 locus. STR loci with relatively large numbers of allelic drop‑out (ADO; >15) were all located in the yellow and red channels, including loci D18S51, D19S433, FGA, TPOX and vWA. These five loci featured low allele peak heights, which was consistent with the low sensitivity of the ABI 3130xl Genetic Analyzer to yellow and red fluorescence. The locus separation accuracy of the mixsep package was substantially different with and without the inclusion of ADO loci; inclusion of ADO significantly reduced the analytical performance of the mixsep package, which was consistent with the lack of an ADO functional module in this software. The present study demonstrated that the mixsep software had a number of advantages and was recommended for analysis of mixed DNA. This software was easy to operate and produced understandable results with a degree of controllability.
View Figures |
Figure 1
|
|
Figure 2
|
|
Figure 3
|
|
Figure 4
|
|
Figure 5
|
|
Figure 6
|
|
Figure 7
|
|
Figure 8
|
|
Figure 9
|
|
Figure 10
|
View References
1
|
R Development Core Team (2013): R: A
language and environment for statistical computing. R Foundation
for Statistical Computing; Vienna, Austria: URL: http://www.R-project.org.
2013
|
2
|
RStudio (2012): RStudio: Integrated
development environment for R (Version 0.96.122). Computer
software. Boston, MA, USA: URL: http://www.rstudio.org/.
2012
|
3
|
Gentleman RC, Carey VJ, Bates DM, et al:
Bioconductor: open software development for computational biology
and bioinformatics. Genome Biol. 5:R802004. View Article : Google Scholar : PubMed/NCBI
|
4
|
Smyth GK: Limma: linear models for
microarray data. Bioinformatics and computational biology solutions
using R and Bioconductor. Springer; Berlin, Germany: pp. 397–420.
2005
|
5
|
Gautier L, Cope L, Bolstad BM and Irizarry
RA: affy - analysis of Affymetrix GeneChip data at the probe level.
Bioinformatics. 20:307–315. 2004. View Article : Google Scholar : PubMed/NCBI
|
6
|
Ellis B, Haaland P, Hahne F and Le Meur N:
basic structures for flow cytometry data. R package version 1.24.2.
Computer software. 2009.
|
7
|
Pages H, Carlson M, Falcon S, Li N and
Maintainer MBP: Package ‘AnnotationDbi’: Annotation Database
Interface. R package version 1.20.7. Computer software. 2014.
|
8
|
Carlson M: hgu95av2.db: Affymetrix human
genome U95 set annotation data (chip hgu95av2). R package version
2.8.0. Computer software. 2012
|
9
|
Carlson M, Falcon S, Pages H, et al: A set
of annotation maps describing the entire Gene Ontology. R package
version 2.8.0. Computer software. 2010
|
10
|
Shannon P: MotifDb: An annotated
collection of protein-DNA binding sequence motifs. R package
version 1.0.0. [Computer software], URL: http://bioconductor.org/biocLite.R.
2012
|
11
|
Pages H, Aboyoun P, Gentleman R and DebRoy
S: String objects representing biological sequences, and matching
algorithms. R package version 2.26.3. [Computer software], URL:
http://bioconductor.org/biocLite.R.
2009
|
12
|
Tvedebrink T: mixsep: DNA mixture
separation. R package version 0.2.1-2. Computer software. 2013
|
13
|
Tvedebrink T: mixsep: An R-package for DNA
mixture separation. Forensic Sci Int: Genet Supple Seri.
3:e486–e488. 2011.
|
14
|
Tvedebrink T, Eriksen PS, Mogensen HS and
Morling N: Identifying contributors of DNA mixtures by means of
quantitative information of STR typing. J Comput Biol. 19:887–902.
2012. View Article : Google Scholar
|
15
|
Tvedebrink T, Eriksen PS, Mogensen HS and
Morling N: Evaluating the weight of evidence by using quantitative
short tandem repeat data in DNA mixtures. J R Stat Soc Ser C Appl
Stat. 59:855–874. 2010. View Article : Google Scholar
|