Covariance of pairwise differences on a multi-species coalescent tree and implications for FST

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 1,51 MB, PDF-dokument

The multi-species coalescent (MSC) provides a theoretical foundation for modern phylogenetics and comparative population genetics. Its theoretical properties have been heavily studied but there are still aspects of the MSC that are largely unknown, including the covariances in pairwise coalescence times, which are fundamental for understanding the properties oFSTatistics that combine data from multiple species, such as the fixation index (FST). The major contribution of this study is the derivation and implementation of exact expressions for the covariances of pairwise coalescence times under phylogenetic models with piecewise constant changes in population size, assuming no gene flow after species divergence. We use these expressions to derive the variance in average pairwise differences within and between populations. We then derive approximations for the expectation and bias of a sequence-based estimator of FST, a commonly used genetic measurement of population differentiation, when it is applied to a non-recombining region of the genome. We show that the estimator of FST is generally biased downward. A freely available software package is provided, STCov, to calculate the mean, variances and covariances in coalescence times presented here under user-defined piecewise-constant species trees. This article is part of the theme issue Celebrating 50 years since Lewontin's apportionment of human diversity'.

OriginalsprogEngelsk
Artikelnummer20200415
TidsskriftPhilosophical Transactions of the Royal Society B: Biological Sciences
Vol/bind377
Udgave nummer1852
Antal sider14
ISSN0962-8436
DOI
StatusUdgivet - 2022

Bibliografisk note

Funding Information:
This research was supported by NIH grant no. R01GM138634 to R.N. Acknowledgements

Publisher Copyright:
© 2022 Royal Society Publishing. All rights reserved.

Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk


Ingen data tilgængelig

ID: 307333945