Efficient Online Laplacian Eigenmap Computation for Dimensionality Reduction in Molecular Phylogeny via Optimisation on the Sphere
Affiliation auteurs | Affiliation ok |
Titre | Efficient Online Laplacian Eigenmap Computation for Dimensionality Reduction in Molecular Phylogeny via Optimisation on the Sphere |
Type de publication | Conference Paper |
Year of Publication | 2019 |
Auteurs | Chretien S, Guyeux C |
Editor | Rojas I, Valenzuela O, Rojas F, Ortuno F |
Conference Name | BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2019, PT I |
Publisher | Univ Granada, Fac Sci, Dept Comp Architecture & Comp Technol; Univ Granada, CITIC; Univ Granada |
Conference Location | GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND |
ISBN Number | 978-3-030-17938-0; 978-3-030-17937-3 |
Mots-clés | Biomolecular phylogeny, Laplacian eigenmap, Nonlinear dimentionality reduction, Online matrix completion |
Résumé | Reconstructing the phylogeny of large groups of large divergent genomes remains a difficult problem to solve, whatever the methods considered. Methods based on distance matrices are blocked due to the calculation of these matrices that is impossible in practice, when Bayesian inference or maximum likelihood methods presuppose multiple alignment of the genomes, which is itself difficult to achieve if precision is required. In this paper, we propose to calculate new distances for randomly selected couples of species over iterations, and then to map the biological sequences in a space of small dimension based on the partial knowledge of this genome similarity matrix. This mapping is then used to obtain a complete graph from which a minimum spanning tree representing the phylogenetic links between species is extracted. This new online Newton method for the computation of eigenvectors that solves the problem of constructing the Laplacian eigenmap for molecular phylogeny is finally applied on a set of more than two thousand complete chloroplasts. |
DOI | 10.1007/978-3-030-17938-0_39 |