An adjacent-swap Markov chain on coalescent trees

Part of: Markov processes

Published online by Cambridge University Press: 02 September 2022

Mackenzie Simper

and

Julia A. Palacios

Show author details

Mackenzie Simper*: Affiliation:
Stanford University
Julia A. Palacios*: Affiliation:
Stanford University
*: *Postal address: Department of Mathematics, Building 380, 450 Jane Stanford Way, Stanford, CA 94305, USA. Email address: msimper@stanford.edu
**Postal address: Department of Statistics and Department of Biomedical Data Science, Sequoia Hall, 390 Jane Stanford Way, Stanford, CA 94305, USA. Email address: juliapr@stanford.edu

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

The standard coalescent is widely used in evolutionary biology and population genetics to model the ancestral history of a sample of molecular sequences as a rooted and ranked binary tree. In this paper we present a representation of the space of ranked trees as a space of constrained ordered matched pairs. We use this representation to define ergodic Markov chains on labeled and unlabeled ranked tree shapes analogously to transposition chains on the space of permutations. We show that an adjacent-swap chain on labeled and unlabeled ranked tree shapes has a mixing time at least of order $n^3$ , and at most of order $n^{4}$ . Bayesian inference methods rely on Markov chain Monte Carlo methods on the space of trees. Thus it is important to define good Markov chains which are easy to simulate and for which rates of convergence can be studied.

Keywords

Random transpositions lumped Markov chain Tajima distribution

MSC classification

Primary: 60J10: Markov chains (discrete-time Markov processes on discrete state spaces)

Type: Original Article
Information: Journal of Applied Probability , Volume 59 , Issue 4 , December 2022 , pp. 1243 - 1260

DOI: https://doi.org/10.1017/jpr.2022.15 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of Applied Probability Trust

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Aldous, D. (1983). Random walks on finite groups and rapidly mixing Markov chains. In Séminaire de Probabilités XVII 1981/82, pp. 243–297. Springer.CrossRef Google Scholar

Aldous, D. J. (2000). Mixing time for a Markov chain on cladograms. Combinatorics Prob. Comput. 9, 191–204.Google Scholar

Cappello, L. and Palacios, J. A. (2020). Sequential importance sampling for multiresolution Kingman–Tajima coalescent counting. Ann. Appl. Statist. 14, 727–751.CrossRef Google Scholar PubMed

Cappello, L., Veber, A. and Palacios, J. A. (2020). The Tajima heterochronous n-coalescent: Inference from heterochronously sampled molecular data. Available at arXiv:2004.06826.Google Scholar

Diaconis, P. W. and Holmes, S. P. (1998). Matchings and phylogenetic trees. Proc. Nat. Acad. Sci. USA 95, 14600–14602.CrossRef Google Scholar

Diaconis, P. and Holmes, S. (2002). Random walks on trees and matchings. Electron. J. Prob. 7, 1–17.CrossRef Google Scholar

Diaconis, P. and Wood, P. M. (2013). Random doubly stochastic tridiagonal matrices. Random Structures Algorithms 42, 403–437.CrossRef Google Scholar

Dinh, V., Darling, A. E. and Matsen, IV, F. A. (2018). Online Bayesian phylogenetic inference: Theoretical foundations via sequential Monte Carlo. Syst. Biol. 67, 503–517.CrossRef Google Scholar PubMed

Donaghey, R. (1975). Alternating permutations and binary increasing trees. J. Combinatorial Theory A 18, 141–148.CrossRef Google Scholar

Drummond, A., Suchard, M., Xie, D. and Rambaut, A. (2012). Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973.CrossRef Google Scholar PubMed

Durrett, R. (2003). Shuffling chromosomes. J. Theoret. Prob. 16, 725–750.Google Scholar

Felsenstein, J. (2004). Inferring Phylogenies, Vol. 2. Sinauer Associates, Sunderland, MA.Google Scholar

Friedman, N., Ninio, M., Pe’er, I. and Pupko, T. (2001). A structural EM algorithm for phylogenetic inference. In Proceedings of the Fifth Annual International Conference on Computational Biology, pp. 132–140.CrossRef Google Scholar

Frost, S. D. W. and Volz, E. M. (2013). Modelling tree shape and structure in viral phylodynamics. Phil. Trans. R. Soc. B 368, 20120208.CrossRef Google Scholar PubMed

Janson, S. and Kersting, G. (2011). On the total external length of the Kingman coalescent. Electron. J. Prob. 16, 2203–2218.CrossRef Google Scholar

Kingman, J. (1982). The coalescent. Stoch. Process. Appl. 13, 235–248.CrossRef Google Scholar

Kirkpatrick, M. and Slatkin, M. (1993). Searching for evolutionary patterns in the shape of a phylogenetic tree. Evolution 47, 1171–1181.CrossRef Google Scholar

Kuznetsov, A. G., Pak, I. M. and Postnikov, A. E. (1994). Increasing trees and alternating permutations. Russian Math. Surveys 49, 79.CrossRef Google Scholar

Lacoin, H. (2016). Mixing time and cutoff for the adjacent transposition shuffle and the simple exclusion. Ann. Prob. 44, 1426–1487.CrossRef Google Scholar

Levin, D. A. and Peres, Y. (2017). Markov Chains and Mixing Times, American Mathematical Society.CrossRef Google Scholar

Liu, S., Westbury, M. V., Dussex, N., Mitchell, K. J., Sinding, M.-H. S., Heintzman, P. D., Duchêne, D. A., Kapp, J. D., von Seth, J., Heiniger, H. et al. (2021). Ancient and modern genomes unravel the evolutionary history of the rhinoceros family. Cell 184, 4874–4885.CrossRef Google Scholar PubMed

Maliet, O., Gascuel, F. and Lambert, A. (2018). Ranked tree shapes, nonrandom extinctions, and the loss of phylogenetic diversity. Syst. Biol. 67, 1025–1040.CrossRef Google Scholar PubMed

Misra, N., Blelloch, G., Ravi, R. and Schwartz, R. (2011). An optimization-based sampling scheme for phylogenetic trees. In International Conference on Research in Computational Molecular Biology (Lecture Notes in Computer Science 6577), pp. 252–266. Springer, Berlin and Heidelberg.CrossRef Google Scholar

Mossel, E. and Vigoda, E. (2006). Limitations of Markov chain Monte Carlo algorithms for Bayesian inference of phylogeny. Ann. Appl. Prob. 16, 2215–2234.Google Scholar

Palacios, J. A., Véber, A., Cappello, L., Wang, Z., Wakeley, J. and Ramachandran, S. (2019). Bayesian estimation of population size changes by sampling Tajima’s trees. Genetics 213, 967–986.CrossRef Google Scholar PubMed

Rajanala, S. and Palacios, J. A. (2021). Statistical summaries of unlabelled evolutionary trees and ranked hierarchical clustering trees. Available at arXiv:2106.02724.Google Scholar

Sainudiin, R., Stadler, T. and Véber, A. (2015). Finding the best resolution for the Kingman–Tajima coalescent: Theory and applications. J. Math. Biology 70, 1207–1247.CrossRef Google Scholar PubMed

Schweinsberg, J. (2002). An

${O}(n^{2})$ bound for the relaxation time of a Markov chain on cladograms. Random Structures Algorithms 20, 59–70.CrossRef Google Scholar

Spade, D., Herbei, R. and Kubatko, L. (2014). A note on the relaxation time of two Markov chains on rooted phylogenetic tree spaces. Statist. Prob. Lett. 84, 247–252.CrossRef Google Scholar

Stanley, R. P. (1999). Enumerative Combinatorics, Vol. I. Wadsworth & Brooks/Cole.Google Scholar

Štefankovič, D. and Vigoda, E. (2011). Fast convergence of Markov chain Monte Carlo algorithms for phylogenetic reconstruction with homogeneous data on closely related species. SIAM J. Discrete Math. 25, 1194–1211.CrossRef Google Scholar

Volz, E. M., Koelle, K. and Bedford, T. (2013). Viral phylodynamics. PLoS Comput. Biol. 9, e1002947.CrossRef Google Scholar PubMed

Wakeley, J. (2008). Coalescent Theory: An Introduction. Roberts.Google Scholar

Wilson, D. B. (2004). Mixing times of lozenge tiling and card shuffling Markov chains. Ann. Appl. Prob. 14, 274–325.CrossRef Google Scholar

Article contents

An adjacent-swap Markov chain on coalescent trees

Abstract

Keywords

MSC classification

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests