A central limit theorem for parsimony length of trees

Type of content
Discussion / Working Papers
Degree name
Research Report
Publisher
University of Canterbury. Dept. of Mathematics
Date
1994
Authors
Steel, M. A.
Goldstein, Larry J.
Waterman, Michael S.
Abstract

In phylogenetic analysis it is useful to study the distribution of the parsimony length of a tree under the null model in which the leaves are independently assigned letters according to prescribed probabilities. Except in one special case, this distribution is difficult to describe exactly. Here we analyse this distribution by providing a recursive and readily computable description, by establishing large deviation bounds for the parsimony length of a fixed tree on a single site and for the minimum length (maximum parsimony) tree over several sites, and by showing that, under very general conditions, the former distribution converges asymptotically to the normal, thereby settling a recent conjecture. Furthermore, we show how the mean and variance of this distribution can be efficiently calculated. The proof of normality requires a number of new and recent results, as the parsimony length is not directly expressible as a sum of independent random variables, and so normality does not follow immediately from a standard central limit theorem.
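
As an illustration of the null model described in the abstract, the following minimal sketch estimates the distribution of parsimony length by simulation. It is not the recursive description developed in the report. It assumes a rooted binary tree and uses Fitch's algorithm, a standard method for computing the parsimony length of a fixed tree on a single site. The nested-tuple tree encoding and the names `fitch_length`, `leaves`, and `sample_lengths` are hypothetical choices made for this example.

```python
import random


def fitch_length(tree, states):
    """Bottom-up Fitch pass on a rooted binary tree.

    tree   : a leaf name (str) or a pair (left_subtree, right_subtree)
    states : dict mapping each leaf name to its letter
    Returns (candidate_state_set, parsimony_length) for the subtree.
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_len = fitch_length(tree[0], states)
    right_set, right_len = fitch_length(tree[1], states)
    common = left_set & right_set
    if common:
        # The children can agree on a letter: no extra change is needed.
        return common, left_len + right_len
    # Disjoint candidate sets: one additional change is required here.
    return left_set | right_set, left_len + right_len + 1


def leaves(tree):
    """List the leaf names of the tree from left to right."""
    if isinstance(tree, str):
        return [tree]
    return leaves(tree[0]) + leaves(tree[1])


def sample_lengths(tree, probs, n_samples, seed=0):
    """Draw parsimony lengths under the null model: each leaf receives a
    letter independently, according to the probabilities in `probs`."""
    rng = random.Random(seed)
    letters = list(probs)
    weights = [probs[letter] for letter in letters]
    names = leaves(tree)
    lengths = []
    for _ in range(n_samples):
        drawn = rng.choices(letters, weights=weights, k=len(names))
        lengths.append(fitch_length(tree, dict(zip(names, drawn)))[1])
    return lengths


if __name__ == "__main__":
    # Hypothetical four-leaf tree and uniform letter probabilities.
    tree = (("a", "b"), ("c", "d"))
    probs = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
    sample = sample_lengths(tree, probs, 10000)
    mean = sum(sample) / len(sample)
    variance = sum((x - mean) ** 2 for x in sample) / len(sample)
    print("estimated mean:", mean, "estimated variance:", variance)
```

A histogram of `sample` for trees with many leaves gives an informal picture of the approach to normality that the report establishes. The simulated mean and variance are only Monte Carlo estimates, whereas the report describes how to compute these quantities exactly and efficiently.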

ANZSRC fields of research
Field of Research::01 - Mathematical Sciences
Rights
Copyright M. A. Steel