A Bregman-proximal point algorithm for robust non-negative matrix factorization with possible missing values and outliers - application to gene expression analysis

Affiliation auteurs!!!! Error affiliation !!!!
TitreA Bregman-proximal point algorithm for robust non-negative matrix factorization with possible missing values and outliers - application to gene expression analysis
Type de publicationJournal Article
Year of Publication2016
AuteursChretien S, Guyeux C, Conesa B, Delage-Mouroux R, Jouvenot M, Huetz P, Descotes F
JournalBMC BIOINFORMATICS
Volume17
Pagination284
Date PublishedAUG 31
Type of ArticleArticle; Proceedings Paper
ISSN1471-2105
Mots-clésfeature extraction, Gene expression analysis, Non-negative matrix factorization, Outliers and missing data
Résumé

Background: Non-Negative Matrix factorization has become an essential tool for feature extraction in a wide spectrum of applications. In the present work, our objective is to extend the applicability of the method to the case of missing and/or corrupted data due to outliers. Results: An essential property for missing data imputation and detection of outliers is that the uncorrupted data matrix is low rank, i.e. has only a small number of degrees of freedom. We devise a new version of the Bregman proximal idea which preserves nonnegativity and mix it with the Augmented Lagrangian approach for simultaneous reconstruction of the features of interest and detection of the outliers using a sparsity promoting l(1) penality. Conclusions: An application to the analysis of gene expression data of patients with bladder cancer is finally proposed.

DOI10.1186/s12859-016-1120-8