|Home | About | Journals | Submit | Contact Us | Français|
Conceived and designed the experiments: JB. Performed the experiments: EH. Analyzed the data: JB EH. Wrote the paper: JB EH.
In forensic casework, Y chromosome short tandem repeat markers (Y-STRs) are often used to identify a male donor DNA profile in the presence of excess quantities of female DNA, such as is found in many sexual assault investigations. Commercially available Y-STR multiplexes incorporating 12–17 loci are currently used in forensic casework (Promega's PowerPlex® Y and Applied Biosystems' AmpFlSTR® Yfiler®). Despite the robustness of these commercial multiplex Y-STR systems and the ability to discriminate two male individuals in most cases, the coincidence match probabilities between unrelated males are modest compared with the standard set of autosomal STR markers. Hence there is still a need to develop new multiplex systems to supplement these for those cases where additional discriminatory power is desired or where there is a coincidental Y-STR match between potential male participants. Over 400 Y-STR loci have been identified on the Y chromosome. While these have the potential to increase the discrimination potential afforded by the commercially available kits, many have not been well characterized. In the present work, 91 loci were tested for their relative ability to increase the discrimination potential of the commonly used ‘core’ Y-STR loci. The result of this extensive evaluation was the development of an ultra high discrimination (UHD) multiplex DNA typing system that allows for the robust co-amplification of 14 non-core Y-STR loci. Population studies with a mixed African American and American Caucasian sample set (n=572) indicated that the overall discriminatory potential of the UHD multiplex was superior to all commercial kits tested. The combined use of the UHD multiplex and the Applied Biosystems' AmpFlSTR® Yfiler® kit resulted in 100% discrimination of all individuals within the sample set, which presages its potential to maximally augment currently available forensic casework markers. It could also find applications in human evolutionary genetics and genetic genealogy.
The unique biology of the Y chromosome has led to the widespread use in forensic and evolutionary studies of genetic markers thereon in determining patrilineal relationships within and between population groups –. A subset of these markers, minisatellites or short tandem repeats (Y-STRs), are now used routinely in certain forensic casework situations –. Their intended use is not to supplant the current battery of autosomal STR loci but to apply them to certain defined casework situations whereby the traditional autosomal loci would not be expected to yield sufficient probative information. Y-STRs are particularly beneficial for the analysis of the male donor in a male/female DNA admixture when the female DNA component is present in vast excess (e.g. ≥100×) and when traditional autosomal STR analysis fails or is not possible –.
Autosomal STR analysis may not be possible if the sample contains an admixture of body fluids other than semen, such as in saliva/saliva mixtures, saliva/vaginal secretion mixtures, or fingernail scrapings comprising cells from the (female) victim and cells from the perpetrator. In these types of samples a differential extraction to separate the male and female cells is not possible with current technology and the male component is often not detectable with the autosomal STR multiplex systems routinely used due to a titration of critical PCR reagents by the major contributor in the sample . Autosomal STR analysis may also fail with some semen containing samples in which the sperm are present in very low copy number, or are present in an extremely fragile state, such as in extended interval (i.e. >48 h) post-coital samples. Differential extraction of these particular samples may yield no profile from the male donor due to a combination of premature lysis of the sperm's cellular constituents into the non-sperm fraction and to sperm loss during the physical manipulations required of the DNA isolation process. Therefore, the use of Y-STRs, which target only the male fraction, eliminate the need for a differential extraction process and lessen the potential to lose the trace amounts of male DNA that may be present.
There are several additional benefits of Y-STR analysis in forensic casework. Y-STR analysis allows for the facile determination of the number of male contributors in mixtures. Y-STR profiles are hemizygous in nature, with one allele being found at most loci (the exception being a small number of multi-copy loci). Multiple alleles at single copy loci give a clear indication of the number of male contributors , , , . Y-STR analysis can also aid in the identification of haplotypes of missing persons. While the X and Y chromosomes exhibit a degree of homology, they do not undergo genetic recombination during meiosis (except for the pseudoautosomal regions at the chromosome tip) . As a result, most of the Y chromosome is inherited paternally as a block of linked haplotype markers from generation to generation. Thus, males in the same lineage will possess identical Y-STR haplotypes allowing for the determination of a missing person's male haplotype by typing male relatives. Finally, Y-STR analysis can provide additional discriminatory power when combined with (often partial) autosomal STR profile results –.
A set of nine Y-STR loci, commonly referred to as the “minimal haplotype loci” set (MHL), have been recommended for use in the forensic community . Subsequently, several additional loci were reported that, in conjunction with the MHL loci, proffered increased discriminatory capacity , –. A number of casework-validated commercial Y-STR kits are available and most incorporate 12–17 markers into single multiplex analysis systems , , . These commercial kits incorporate all twelve of the Y-STR ‘core’ markers that were recommended for forensic use by the US Scientific Working Group on DNA Analysis Methods (SWGDAM) . Despite the robustness of these commercial multiplex Y-STR systems and the ability to discriminate two male individuals in most cases, the coincidence match probabilities between unrelated males are modest compared with the standard set of autosomal STR markers. Hence there is still a need to develop new multiplex systems to supplement these for those cases where additional discriminatory power is desired or where there is a coincidental Y-STR match between potential male participants.
A large number of Y-STR loci (>400) have been identified by various groups and deposited into public databases such as the GDB Human Genome Database (www.gdb.org). A comprehensive annotated STR physical map of the human Y chromosome details the precise location of each locus along the chromosome . Despite the identification and positioning of hundreds of currently known Y-STR loci, a majority of these markers have not been fully characterized with respect to their utility in forensic casework.
A small number of studies have been published involving an in-depth evaluation of a significant numbers of these novel markers , , , . A seminal study by Kayser and co-workers describes an extensive survey of human Y-chromosome microsatellites . While providing an extensive overview, less detail is provided on specific loci, with a small number of loci being identified as the “most variable”. However the gene diversity values used in classifying these loci as variable are based on a population size of only eight male samples. Current forensic literature is replete with small population data studies involving a few novel markers , , –. However, simply describing individual allele frequencies in numerous populations is not sufficient to determine if a novel Y-STR could be used in forensic casework. While some of these markers may exhibit individually high gene diversity values, few studies have demonstrated the ability of these markers to aid in resolving coincidental matches.
Only a few novel non-‘core’ markers have been incorporated into multiplex PCR systems that have undergone the extensive developmental validation studies ,  required, for example, by US national standards . Without such developmental validation studies it is not possible to evaluate whether the loci are sufficiently robust for use with degraded and limiting quantities of sample in a multiplex analysis format, or can provide sufficient additional discrimination potential when used in combination with other core Y-STR markers.
We have extensively evaluated one hundred and thirty three Y-STR loci (33% of all known loci) for possible use in forensic casework –, , , . Twenty five of these loci were rejected due to poor diversity values or amplification inefficiencies. The remaining one hundred and eight loci have been incorporated into ten multiplex systems that have undergone full developmental validations as required by industry standards (, , unpublished data). However, due to the limited amount of sample and limited resources of operational crime laboratories, it is unlikely that all ten multiplexes would be employed simultaneously to produce 108-locus profiles. Additionally, not every locus included in these multiplex systems would be beneficial in helping to distinguish between male individuals. Therefore in the present work, attempts were made to construct a non-core loci containing Y-STR multiplex system that, based upon empirical data, offered an extremely high discrimination potential and was robust enough for forensic use.
The result of these efforts was the development of an ultra-high discrimination (UHD) multiplex system incorporating fourteen novel Y-STR loci with a discriminatory capacity greater than that achieved with any commercial kit.
Body fluids were collected from volunteers using procedures approved by the University of Central Florida's Institutional Review Board. Informed written consent was obtained from each donor. Buccal samples were collected from donors using sterile swabs by swabbing the inside of the donor's mouth. Neat semen samples were provided in sealed plastic tubes and stored frozen until they were dried onto sterile cotton swabs. Post-coital cervicovaginal swabs were taken from female participants at various specified time periods subsequent to sexual intercourse. Blood samples were collected by venipuncture and 50 µl aliquots were placed onto cotton cloth and dried at room temperature. Population samples (bloodstains) for gene diversity studies were obtained from the Virginia Division of Forensic Science, Richmond, VA, the South Dakota State Forensic Laboratory, and the Alabama Department of Forensic Sciences. All samples were stored at −47°C until needed.
DNA was extracted from the buccal swabs, the vaginal swabs, and the semen swabs using a standard phenol: chloroform method . Stains or swabs were cut into small pieces and placed into a Spin-Ease tube (Gibco-BRL, Grand Island NY). The tubes were incubated overnight in a 56°C water bath using 400 µl DNA Extraction Buffer (100 mM NaCl, 10 mM Tris-HCl, pH 8.0, 25 mM EDTA, 0.5% SDS), 0.1 mg/mL Proteinase K, and 10% 0.39 M DTT (added to semen containing samples). After the overnight incubation, swab or stain fabric was placed into a Spin-Ease basket, the basket inserted back into the original tube, and the samples centrifuged at 14,000 rpm for 5 minutes to remove the absorbed fluid from the swab material. A volume of phenol/chloroform/isoamyl alcohol equal to the volume of the crude extract was added and vigorously intermixed by shaking. The aqueous layer, containing the DNA, was removed. Precipitation of the DNA was accomplished by the addition of cold absolute ethanol (two and a half times the volume of the aqueous layer extract) and allowed to progress overnight at −20°C. The DNA was pelleted by centrifugation, washed once using 70% ethanol and re-solubilized with 100 µl of TE−4 (10 mM Tris-HCl, 0.1 mM EDTA, pH 7.5) overnight at 56°C.
Sperm and non-sperm cells were separated using a standard differential lysis protocol, with minor modifications . Post-coital cervicovaginal swabs were incubated overnight at 37°C in 400 µl of DNA extraction buffer (100 mM NaCl, 10 mM Tris-HCl, 25 mM EDTA, 0.5% SDS) and 0.1 mg/mL Proteinase K. Swab remnants were removed to a Spin-ease basket, the basket inserted back into the original tube, and centrifuged at 14,000 g for 5 min. the resulting supernatant, containing the non-sperm DNA fraction, was removed into a separate tube for further analysis. The sperm pellet was re-suspended in 400 µl of DNA extraction buffer, 0.1 mg/mL Proteinase K, and 40 µl of 0.39 M DTT and incubated for 1 h at 56°C. DNA from both the sperm and non-sperm fractions was isolated and purified using the phenolchloroform method described above.
The dried bloodstains were incubated overnight at 56°C in 400 µl of DNA extraction buffer (100 mM NaCl, 10 mM Tris-HCl, 25 mM EDTA, 0.5% SDS) and 0.1 mg/mL Proteinase K. The swab pieces were placed into a spin ease basket and centrifuged at 14,000 g for 5 minutes. An equal volume of phenol/chloroform/isoamyl alcohol was added to the crude extract. The aqueous phase extracts containing the DNA were purified using Centricon 100TM concentrators (Millipore, Bedford, MA) according to the manufacturer's instructions.
DNA was quantitated using ethidium bromide induced fluorescence on a 1% agarose yield gel, using a reference set of DNA standards of known concentration .
All locus characteristics, including repeat unit structure and size and general chromosome location, were obtained from published sources or by sequencing.
Candidate loci were selected based on diversity value, performance in multiplex systems with forensic samples, and allele size range obtained through previous studies involving developmental validations of novel Y-STR multiplexes , , , , .
Optimization of the UHD multiplex system resulted in a set of standard conditions. The 25 µl reaction mix contained: 1ng of template DNA, 0.08–1.6 µM primers (see below), 1 mM dNTPs, 1X PCR Buffer II (10 mM Tris-HCl, pH 8.3, 50 mM KCl), 1.75 mM MgCl2, 10 µg of non-acetylated bovine serum albumin (Sigma-Aldrich, St. Louis, MO), and 2.0 units of AmpliTaq Gold DNA Polymerase (Applied Biosystems).
Primer sequences were obtained from published sources, the Human Genome Database or re-designed using Oligo 6 Primer Analysis Software (Lifescience Software Resource, Long Lake, MN). The primers were tested initially in previously developed multiplexes, first as singleplexes and then in a multiplex system. The primer concentrations for the UHD Multiplex were as follows: DYS444 – 1.04 µM; DYS446 – 0.40 µM; DYS449 – 1.6 µM; DYS459 – 0.28 µM; DYS481 – 0.12 µM; DYS508 – 0.20 µM; DYS522 – 1.6 µM; DYS527 – 1.12 µM; DYS549 – 0.24 µM; DYS552 – 0.56 µM; DYS570 – 0.12 µM; DYS576 – 0.08 µM; DYS607 – 0.24 µM; DYS627 – 1.12 µM; (Invitrogen, Grand Island, NY; Applied Biosystems, Foster City, CA).
(1) 95°C 11 min, (2) 32 cycles of 96°C 30 sec, 59°C 1 min 30 sec, 72°C 1 min 30 sec, (3) and final extension 72°C for 45 min.
A 0.75 µl aliquot of the amplified product was added to 8.7 µl of deionized formamide and 0.3 µl GeneScan 500 LIZ internal lane standard. Tubes containing the above were heated at 95°C for 3 min and snap-cooled on ice for 3 min. Samples were injected onto a Applied Biosystems 3130 Genetic Analyzer, using Module G5 and analyzed with GeneMapper Analysis Software v3.7 using Filter Set G5. A peak detection threshold of 50 RFUs was used for allele designation.
Different input quantities of template male DNA were tested using the standard UHD multiplex reaction conditions. The amounts tested were: 25 pg, 50 pg, 100 pg, 125 pg, 150 pg, 200 pg, 250 pg, 500 pg, 1 ng, 3 ng, 5 ng, and 10 ng.
To evaluate possible female DNA cross-reactivity, DNA from female volunteers was tested using the following amounts: 3 ng, 30 ng, 300 ng, and 1 µg.
DNA from two individual males was combined in the following ratios: 1/2 (1.5 ng male DNA/1.5 ng male DNA), 1/3 (1.0 ng male DNA/2.0 ng male DNA), 1/6 (0.5 ng male DNA/ 2.5 ng male DNA), 1/12 (0.25 ng male DNA/2.75 ng male DNA), 1/20 (0.15 ng male DNA/2.85 ng male DNA), and 1/30 (0.10 ng male DNA/2.90 ng male DNA). In each case 3 ng of the admixed DNA was tested.
3 ng of male DNA was co-amplified with increasing amounts of female DNA in the following ratios: 1/2, 1/10, 1/100, 1/500, 1/1000 and 1/5000.
50 µl aliquots of human blood were dried onto cotton cloth. These samples were exposed to different environmental conditions including various temperatures, light sources, and environmental influences including humidity and rain. The environmental conditions were as follows: RTED (room temperature, envelope dried), HLHR (placed outside for exposure to heat, light, humidity, and rain), HLH (placed outside and covered exposure to heat, light, and humidity). Samples from all sets of conditions were collected at varying lengths of time, including 3 days, 1 week, 1 month, 3 months, 6 months, and 1 year. Samples were also exposed to short wave UV at room temperature for 3 days, 1 week, 1 month, 3 months, 6 months and 18 months. For the UV-exposed and room temperature stored samples, 3 ng of input DNA was amplified. For the HLH and HLHR samples, no product was observed during quantitation. Therefore, a 5 µl aliquot of the DNA extract (5%) was used for amplification.
Post-coital cervicovaginal swabs were taken from a female participant at various time intervals after sexual intercourse (immediately (0 h), 12 h, 24 h, 36 h, and 48 h). Only one set of swabs was taken after each individual act of sexual intercourse to ensure that the amount of semen present at the time was not affected by prior removal of sample. Two swabs were taken at each time interval and DNA isolated as described above. One of the swabs was extracted using a differential extraction method to separate the sperm and non-sperm fractions.
In order to determine if the multiplex could accommodate low copy number forensic samples, swabs of a beverage container lid (plastic coffee cup lid) were collected after being deposited by a male volunteer. The DNA was extracted using the standard phenol:chloroform method described previously. A 5 µl aliquot of the DNA extract (5%) was used for amplification as no product was observed during quantitation.
The following formulae were used: (1) discriminatory capacity=no. of individuals/no. of different haplotypes observed; (2) gene diversity (h), equivalent to the expected frequency of heterozygotes with autosomal diploid loci, was calculated as (n/(n−1))*(1−Σpi2), where pi=allele frequency at the ith locus .
In forensic casework, two commercially available Y chromosome STR multiplex amplification kits, the Promega PowerPlex® Y kit and the Applied Biosystems AmpFlSTR® Yfiler® kit, are widely used. These commercial kits employ 12 and 17 markers respectively, and both include the SWGDAM core loci. While the kits have the ability to distinguish most male individuals in a particular case, the occurrence of coincidental matches between two unrelated individuals can still occur. A number of other non-core Y-STR loci have been identified on the Y chromosome that, hypothetically, could provide the ability to resolve a majority of these coincidental matches. We previously evaluated and characterized 133 of the ~400 known Y-STR loci (33%). This resulted in the development of ten novel Y-STR multiplex systems encompassing 91 non-core Y-STR loci in addition to the loci contained in the commercial kits –, , . The design and developmental validation of the these multiplex systems resulted in a determination of each locus' performance within a multiplex system format ,  as well as its gene diversity , , , , .
Initial UHD design specifications were: (i) that the number of loci would not exceed 15 since our previous experience with multiplex development indicated that the most robust systems contain 15 or fewer loci (data not shown), and (ii) that only loci that were not already incorporated into commercial kits were considered. A number of non core candidate loci were rejected as UHD candidates due to performance limitations, low or inaccurate (due to sampling effects) gene diversity values and this resulted in a ‘first pass’ candidate list of 65 loci. The candidate loci and their associated gene diversities are listed in Table 1.
The forensic usefulness of a Y-STR locus in distinguishing individuals is not solely determined by its gene diversity values but is also related to how well it complements the other markers in the typing set used. A person's Y-STR haplotype consists of a set of physically and genetically linked STR markers that are co-typed subsequent to multiplex amplification with a standard set of well characterized core loci. The extent to which an additional locus adds to the discriminatory capacity of the multiplex system depends not only upon the inherent genetic diversity of the locus (the number and frequencies of alleles) but also upon the evolutionary history of the locus in question within different haplogroups. A multi-locus Y-STR haplotype will often be very predictive of (or even define) the Y-SNP haplogroup to which an individual belongs and thus the utility of adding another Y-STR marker to the typing set depends upon the extent to which it is able to further differentiate individuals within that particular haplogroup. Thus a locus displaying low variance in allele frequencies within the particular haplogroup would be less useful in discriminating individuals than one that had a high variance. It is thus conceivable that a particular Y-STR locus that demonstrates high gene diversity in the general population (that comprises individuals belonging to multiple haplogroups) would not add, due to its low intra-haplogroup variance, a significant amount of additional discrimination power to the multiplex.
In order to incorporate the most informative loci from the non core 65 candidates into the UHD multiplex, the relative ability of each of the candidates in improving the discriminatory power of the core loci set was empirically determined. For this, a large mixed African American and Caucasian population (n=560) expected to contain most of the major haplogroup classes present in these two populations was used. Firstly the haplotype diversity of the SWGDAM core loci was determined. Then each of the 65 candidate loci was added to the SWGDAM core set and the increase in haplotype diversity determined. For each locus, the single locus diversity value was plotted against the increase in multi-locus haplotype diversity achieved by that locus when used in combination with the SWGDAM loci (Figure 1). In general there was a correlation between the single locus diversity and its ability to enhance the multi-locus diversity of the SWGDAM multiplex (r2=0.51). Loci with low individual diversity values (e.g. DYS425 (0.11); DYS434 (0.12); DYS435 (0.05); DYS436 (0.08); DYS476 (0.12); DYS575 (0.05); DYS590 (0.09); DYS641 (0.16)) produced a small increase in multi-locus haplotype diversity for the SWGDAM core loci. Many of the loci with high diversity values (e.g. DYS449 (0.81); DYS456 (0.67); DYS464 (0.99); DYS527 (0.92); DYS570 (0.79); DYS576 (0.83); DYS627 (0.86) resulted in a larger increase in haplotype diversity. However several of the loci resulted in little or no increase in haplotype diversity, despite a moderate diversity value (DYS426-0.48; DYS445-0.41; DYS453-0.40; DYS462-0.54; DYS494-0.51; DYS578-0.46; DYS594-0.56; DYS598-0.59; YAP-0.50) and illustrates the need for empirical testing of the relative efficacy of loci for complementing a set of multiplexed loci.
Loci exhibiting a high diversity value and a large increase in haplotype diversity (upper right portion of the plot above the regression line)) were considered for inclusion into the UHD multiplex. Multi-copy loci with >2 copies were excluded due to complications with their analysis. Based on compatible allele size ranges, the availability of dye labels, and annealing temperatures, fourteen novel Y-STR loci were selected for inclusion into the UHD multiplex and: DYS444, DYS446, DYS449, DYS459, DYS481, DYS508, DYS522, DYS527, DYS549, DYS552, DYS570, DYS576, DYS607, and DYS627 (Figure 1). Loci characteristics (repeat motif, size range, number of alleles, average percent stutter, and primer sequences) are provided in Table 2.
Multiplex PCR reaction conditions were optimized by altering the concentration of critical PCR reagents and the thermocycling conditions and using both male (3 ng) and female (300 ng) DNA as input template. This resulted in the determination of a set of standard reaction conditions described in the Methods Section.
A forensic DNA typing system needs to be able to work with sub-nanogram quantities of template DNA. The sensitivity of the UHD multiplex was tested using varying amounts of input template male DNA (25 pg–10 ng). Although 500 pg (Figure 2A) to 2 ng of DNA gave optimal signal intensity and inter-locus balance, a full 14-locus male Y-STR haplotype was still obtained with 25 pg of DNA (Figure 2B), which is equivalent to ~4–5 diploid cells.
A significant benefit of using Y-STR analysis is the specific amplification of male DNA in male/female DNA admixtures, a situation that arises often in biological stains recovered from sexual assault investigations. Although there is no genetic recombination between the X and Y chromosomes, the presence of homologous sequences on the X chromosome could confound Y-STR analysis by titrating out critical reagents or producing pseudo-Y alleles. Thus empirical testing of Y-STR primer sets for female DNA cross reactivity is essential. The UHD was tested with varying amounts of female DNA (3 ng–1 µg) and no significant amplification products were observed (Figure 2C).
Often in forensic casework there is more than one male contributor to a sample. With standard autosomal STR analysis is often difficult to determine the number of contributors in an admixture. Due to the hemizygous nature of Y-STR loci, with only one allele being found at most loci, determination of the number of male contributors is facile.
In order to evaluate the ability of the UHD multiplex to identify multiple male contributors, male/male DNA admixtures from two unrelated individuals were tested at various ratios including 1/2, 1/3, 1/6, 1/12, 1/20 and 1/30. For each admixture, 3 ng of total DNA was amplified. A full minor male component profile was obtained when it comprised 1/2, 1/3 and 1/6 of the total male DNA (Figure 3A). Partial minor male component profiles were obtained when it comprised 1/12 and 1/20 of the total DNA (data not shown).
To test the ability of the UHD multiplex to produce a male haplotype in the presence of large quantities of female DNA, a situation that is frequently encountered in actual casework, a series of male/female admixtures were prepared in which 3 ng of total male DNA was amplified, with varying quantities of female DNA. Full 14-locus profiles were obtained when the male DNA component comprised 1/2, 1/10, 1/100, 1/500 1/1000, and 1/5000 (Figure 3B) of the total DNA.
For a multiplex system to be useful in forensic casework, it must permit the recovery of DNA profiles from samples that have been exposed to various environmental conditions. To assess the ability of the UHD multiplex to recover Y-STR haplotypes from environmentally compromised samples, bloodstains originating from the same male individual were exposed to different environmental conditions including various temperatures, light sources, and environmental influences including humidity and rain. For this study, samples were grouped into four categories: 1) dried blood stains stored in envelopes at room temperature (RTED), 2) dried blood stains left outside and uncovered (HLHR), exposed to heat, light, humidity, and rain; 3) dried blood stains exposed to UV light (UV); and 4) dried blood stains left outside and covered (HLH), exposed to heat, light, and humidity. Samples were left exposed to these conditions and collected at various time intervals including three days, one week, one month, three months, six months, one year and, for the UV exposure set, eighteen months.
For the RTED samples, full profiles were recovered for all samples up to and including one year, with no observed reduction in allelic signal intensities as the time interval increased (Figure 4A). Full profiles were also obtained from the UV exposed samples, up to and including eighteen months (Figure 4B).
The HLHR and HLH samples were placed in an outdoor, un-wooded area in Orlando, Florida which has a semi-tropical climate. During the course of this study (one year), these samples were exposed to temperatures ranging from 32°F to 94°F (average high of 88°F and average low of 37°C) and the HLHR samples were additionally exposed to rain fall on 137 days out of the year (the three-day samples received one day of rain; the one-week samples received one day of rain; the six-month samples received 73 days of rain; and the one year sample received 137 days of rain). Complete 14-locus profiles were obtained from the one month exposed HLHR (Figure 4C) and HLH (Figure 4D) samples. These latter somewhat-unexpected results were encouraging from the casework standpoint, since heat, light, humidity and rain are often detrimental to DNA analysis.
A series of post-coital cervicovaginal samples were collected from a female donor 0 to 48 h (in 12 h intervals) after intercourse. Each time point sample was collected after a separate act of sexual intercourse and was preceded by a five day abstention period. As a negative control, a pre-coital swab was also recovered prior to coitus for each sampling and only data from post-coital samples that demonstrated a lack of male DNA in the associated pre-coital sample was used. The participant was instructed to carry out normal daily functions to permit the normal loss of semen constituents experienced over time and that often confounds the analysis of bona fide casework samples. Using a differential lysis DNA isolation method, the ‘sperm’ and ‘non-sperm’ DNA fractions were separated. Approximately 3 ng of the sperm or 100 ng of the non-sperm DNA fractions was analyzed by the UHD system. Complete 14-locus Y-STR profiles were recovered from the 0, 12, 24, and 48 h post coital samples from the sperm fractions (Figure 5A), all of which were consistent with the semen donor's known profile.
In a differential lysis procedure, sperm from low copy number samples (i.e. extended interval post-coital samples) experience varying degrees of loss due to the inherent limitations of the physical manipulations required to carry out the differential extraction protocol. Therefore, a standard ‘non-differential’ DNA extraction was performed on the second swab for each time interval thus eliminating the need for separation, and potential loss, of the sperm component. Again, complete male profiles consistent with the known donor were obtained from the 0, 12, 24 and 48 hour samples (Figure 5B).
Limited amounts of DNA are often recovered from evidentiary materials. A male participant provided a beverage container lid from which DNA was successfully recovered. A complete fourteen-locus UHD Y-STR profile consistent with the known donor was recovered from the beverage container lid (Figure 6).
We specifically incorporated into the UHD a number of loci that, individually, provided enhanced discriminatory power to the SWGDAM core set (Figure 1). In order to empirically determine the discriminatory capacity of the UHD system as a whole a population study was carried out. Fourteen-locus UHD haplotypes were obtained from 191 Caucasian American and 381 African American individuals and the number of unique and matching multi-locus haplotypes determined (Table 3). Among Caucasian American samples, 190 unique haplotypes were observed, and among the 381 African American samples, 380 unique haplotypes were observed (Table 3) resulting in a discriminatory capacity of 99.5% and 99.7% respectively. The discrimination capacity was 99.7% when the population samples were combined (570 unique haplotypes/572 individuals). Additional typing using non-UHD loci indicated that the multilocus haplotype shared between one pair of Caucasian American individuals and the (different) multilocus haplotype shared between one pair of African American individuals were bona fide coincidental matches between unrelated individuals (data not shown).
The same population sample set was used to determine the discriminatory power afforded by the two most discriminatory commercial Y-STR multiplex systems, the Promega PowerPlex® Y kit and the Applied Biosystems AmpFlSTR® Yfiler® kit (Table 3) , . UHD was found to possess a higher discriminatory capacity than either commercial kit. Among the Caucasian American samples, 152 unique PowerPlex® Y and 189 unique Yfiler® haplotypes were present compared with 189 unique UHD haplotypes. Among the African American samples, 328 unique PowerPlex® Y and 375 unique Yfiler® haplotypes were present compared with 379 unique UHD haplotypes. Amongst the 572 combined population samples included in this study, 500 unique haplotypes were obtained using the Promega PowerPlex® Y kit, resulting in an overall multiplex discriminatory potential of 87.4%. Only 462 haplotypes (80.8%) were observed once in the data set and the most frequent haplotype was observed ten times. Using the Applied Biosystems AmpFlSTR® Yfiler® kit, 563 unique haplotypes were obtained, resulting in an overall multiplex discriminatory potential of 98.4%, an 11% increase compared to the PowerPlex® Y system. However, using the UHD multiplex, 570 unique haplotypes were obtained, resulting in the highest discrimination potential (99.7%) of the multiplexes compared in this study (~12% increase compared to PowerPlex® Y; ~1% increase compared to AmpFlSTR® Yfiler®).
The UHD was designed to enhance the discrimination afforded by the commercially available Y-STR multiplex systems. The augmentation efficacy was evaluated by determining the extent to which the UHD loci would discriminate individuals who would otherwise be indistinguishable with the commercial kits.
For the PowerPlex® Y system, 110 of the 572 (~19%) samples did not possess a unique haplotype. These 110 individuals represented 38 distinct haplotypes, comprising 2–10 individuals per haplotype group (Table 4). All of the individuals within these 38 haplogroup classes, except for one pair, were distinguishable by the UHD system (Table 4). Thus, only two individuals (0.3%) did not possess a unique PowerPlex® Y+UHD combined haplotype, a significant reduction from the 19% with PowerPlex® Y alone.
For the more discriminating AmpFlSTR® Yfiler® system, 17 of the 572 (~3%) samples did not possess a unique haplotype. These individuals comprised 8 different haplotype groups that consisted of 2 to 3 individuals (Table 5). With the UHD all samples could be distinguished. Thus, the combination of the AmpFlSTR® Yfiler® system and the UHD multiplex was able to distinguish all 572 individuals in the sample set.
In this work we designed and developed a highly discriminating robust multiplex Y-STR typing system that uses loci that are not present in any of the commercially available Y-STR systems. Rationalization of the loci selection process means that, at least for the American-Caucasian and African-American populations, it is probably the most discriminating Y-STR system available at this time. It is principally intended for forensic casework use but could have a number of other applications, particularly in the field of human evolutionary genetics and genetic genealogy. It can be used either as a stand alone system in its own right or to augment the discriminatory power of the commercially available Y-STR multiplex systems.
Candidate loci were selected based upon empirical studies that evaluated the relative ability of each locus to augment the discriminatory potential of a set of routinely used core loci. Due to genetic admixture within the major bio-geographic-specific haplogroup sub-populations, a high single locus Y-STR haplotype diversity calculated from a mixed ‘total’ population does not necessarily translate into an equally high increase in discrimination when combined with the core loci. However we found that, in general, there was a correlation between the single locus diversity and its ability to enhance the multi-locus diversity of a core loci multiplex (r2=0.51). Nevertheless we rejected a number of candidates that, despite possessing a reasonably high diversity, provided weak augmentation power (e.g. DYS594, DYS 598).
The result of the extensive evaluation of the non-core loci was the development of an ultra high discrimination (UHD) multiplex that permits the robust co-amplification of 14 non-core Y-STR loci. Analysis of 587 Caucasian American and African American samples produced an overall multiplex discriminatory potential of 99.7%. In comparison, the two commercially available Y-STR multiplex kits, Promega's PowerPlex® Y and Applied Biosystems' AmpFlSTR® Yfiler®, produced discriminatory capacities of only 87.4% and 98.4%, respectively. Thus the ability of the UHD multiplex to distinguish individual male samples surpasses that of the two commercially available multiplex systems currently used in forensic casework. Additionally, the use of the UHD multiplex in conjunction with Applied Biosystems' AmpFlSTR® Yfiler® system resulted in the ability to distinguish all 587 samples.
The UHD multiplex has been evaluated for use with samples frequently encountered in forensic casework. Based upon a number of studies including specificity, sensitivity, discriminating capacity and performance with non-probative casework specimens, the system has demonstrated its potential forensic casework utility to augment the core loci contained in commercially available kits.
The authors would like to gratefully acknowledge the National Institute of Justice for their support and funding for the initial portions of this project. The authors would also like to thank the Virginia Division of Forensic Sciences, the South Dakota State Forensic Laboratory, and the Alabama Department of Forensic Sciences for providing the population samples.
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work was supported under Award number 1998-IJ-CX-K003 from the Office of Justice Programs, National Institute of Justice, Department of Justice. Points of view in this document are those of the authors and do not necessarily represent the official position of the U.S. Department of Justice. While this project was supported under the above mentioned award, the awarded funding is no longer available.