Universitat Politècnica de Catalunya. Departament de Matemàtica Aplicada I
Universitat Politècnica de Catalunya. EGSA - Equacions Diferencials, Geometria, Sistemes Dinàmics i de Control, i Aplicacions
2011-10
In phylogenetic inference, an evolutionarymodel describes the substitution processes along each edge of a phylogenetic tree. Misspecification of the model has important implications for the analysis of phylogenetic data. Conventionally, however, the selection of a suitable evolutionary model is based on heuristics or relies on the choice of an approximate input tree. We introduce a method for model Selection in Phylogenetics based on linear INvariants (SPIn), which uses recent insights on linear invariants to characterize a model of nucleotide evolution for phylogenetic mixtures on any number of components. Linear invariants are constraints among the joint probabilities of the bases in the operational taxonomic units that hold irrespective of the tree topologies appearing in the mixtures. SPIn therefore requires no input tree and is designed to deal with nonhomogeneous phylogenetic data consisting ofmultiple sequence alignments showing different patterns of evolution, for example, concatenated genes, exons, and/or introns. Here, we report on the results of the proposedmethod evaluated on multiple sequence alignments simulatedunder a variety of single-tree andmixture settings for both continuous- and discretetime models. In the simulations, SPInsuccessfully recovers the underlying evolutionarymodel and is shown to performbetter than existing approaches.
Peer Reviewed
Postprint (published version)
Article
English
http://mbe.oxfordjournals.org/content/early/2011/12/23/molbev.msr259.full.pdf+html
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Restricted access - publisher's policy
Attribution-NonCommercial-NoDerivs 3.0 Spain
E-prints [72986]