Modelling allelic drop-outs in STR sequencing data generated by MPS

Research output: Contribution to journalJournal articlepeer-review

We used a Poisson-gamma model to analyse the allele coverage of autosomal short tandem repeat (STR) systems obtained by massively parallel sequencing (MPS). The Poisson-gamma coverage model was created using the peak height models from capillary electrophoresis (CE) based detection of PCR products as a starting point. The CE models were modified to account for the differences between CE and MPS signals by accounting for the large marker imbalances seen for MPS data and by using the Poisson-gamma distribution instead of the normal, log-normal, or gamma distributions that were applied for CE data. We took two approaches to estimate the marker imbalance parameters by (1) using a work-flow data base, and (2) using the results of replicate investigations of the samples. The Poisson-gamma model was used to estimate the rate of drop-outs of (1) single contributor dilution series experiments and (2) the minor contributor in two-person mixture samples. We examined the predictive capabilities of the model by comparing the observed and expected Brier scores of each sample. We derived the expected Brier scores and their variances to create asymptotic confidence intervals of the Brier scores. We found that the Poisson-gamma model performed well when using the work-flow data base, but that the replicate approach is not necessarily a viable option.

Original languageEnglish
JournalForensic Science International: Genetics
Volume37
Pages (from-to)6-12
Number of pages7
ISSN1872-4973
DOIs
Publication statusPublished - 1 Nov 2018

    Research areas

  • Forensic genetics, Massively parallel sequencing, Modelling allele coverage, Poisson-gamma distribution, Probability of drop-out, Short tandem repeat

ID: 203554870