Systematic investigations of gene effects on both topologies and supports: An Echinococcus illustration

Affiliation auteurs!!!! Error affiliation !!!!
TitreSystematic investigations of gene effects on both topologies and supports: An Echinococcus illustration
Type de publicationJournal Article
Year of Publication2017
AuteursGuyeux C, Chretien S, Cote NM-L, Bahi JM
JournalJOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY
Volume15
Pagination1750019
Date PublishedOCT
Type of ArticleArticle; Proceedings Paper
ISSN0219-7200
Mots-clésEchinococcus, molecular phylogeny, statistical tests
Résumé

In this paper, we propose a high performance computing toolbox implementing effcient statistical methods for the study of phylogenies. This toolbox, which implements logit models and LASSO-type penalties, gives a way to better understand, measure, and compare the impact of each gene on a global phylogeny. As an application, we study the Echinococcus phylogeny, which is often considered as a particularly diffcult example. Mitochondrial and nuclear genomes ( 19 coding sequences) of nine Echinococcus species are considered in order to investigate the molecular phylogeny of this genus. First, we check that the 19 gene trees lead to 19 totally different unsupported topologies ( a topology is the sister relationship when both branch lengths and supports are ignored in a phylogenetic tree), while using the 19 genes as a whole are not suffcient for estimating the phylogeny. In order to circumvent this issue and understand the impact of the genes, we computed 43,796 trees using combinations ranging from 13 to 19 genes. By doing so, 15 topologies are obtained. Four particular topologies, appearing more robust and frequent, are then selected for more precise investigation. Refining further our statistical analysis, a particularly robust topology is extracted. We also carefully demonstrate the influence of nuclear genes on the likelihood of the phylogeny.

DOI10.1142/S0219720017500196