Efficient Online Laplacian Eigenmap Computation for Dimensionality Reduction in Molecular Phylogeny via Optimisation on the Sphere

Affiliation auteursAffiliation ok
TitreEfficient Online Laplacian Eigenmap Computation for Dimensionality Reduction in Molecular Phylogeny via Optimisation on the Sphere
Type de publicationConference Paper
Year of Publication2019
AuteursChretien S, Guyeux C
EditorRojas I, Valenzuela O, Rojas F, Ortuno F
Conference NameBIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2019, PT I
PublisherUniv Granada, Fac Sci, Dept Comp Architecture & Comp Technol; Univ Granada, CITIC; Univ Granada
Conference LocationGEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
ISBN Number978-3-030-17938-0; 978-3-030-17937-3
Mots-clésBiomolecular phylogeny, Laplacian eigenmap, Nonlinear dimentionality reduction, Online matrix completion
Résumé

Reconstructing the phylogeny of large groups of large divergent genomes remains a difficult problem to solve, whatever the methods considered. Methods based on distance matrices are blocked due to the calculation of these matrices that is impossible in practice, when Bayesian inference or maximum likelihood methods presuppose multiple alignment of the genomes, which is itself difficult to achieve if precision is required. In this paper, we propose to calculate new distances for randomly selected couples of species over iterations, and then to map the biological sequences in a space of small dimension based on the partial knowledge of this genome similarity matrix. This mapping is then used to obtain a complete graph from which a minimum spanning tree representing the phylogenetic links between species is extracted. This new online Newton method for the computation of eigenvectors that solves the problem of constructing the Laplacian eigenmap for molecular phylogeny is finally applied on a set of more than two thousand complete chloroplasts.

DOI10.1007/978-3-030-17938-0_39