Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
Genre
Journal ArticleDate
2020-01-01Author
Mannino, FWisotsky, S
Pond, SLK
Muse, SV
Subject
AlgorithmsAmino Acid Substitution
Evolution, Molecular
Likelihood Functions
Models, Genetic
Mutation Rate
Phylogeny
Sequence Alignment
Sequence Analysis, DNA
Permanent link to this record
http://hdl.handle.net/20.500.12613/4283
Metadata
Show full item recordDOI
10.1371/journal.pone.0229493Abstract
© 2020 Mannino et al. It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.Citation to related work
Public Library of Science (PLoS)Has part
PLoS ONEADA compliance
For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.eduae974a485f413a2113503eed53cd6c53
http://dx.doi.org/10.34944/dspace/4265