An effective and conceptually simple feature representation for off-line text-independent writer identification

Affiliation auteurs!!!! Error affiliation !!!!
TitreAn effective and conceptually simple feature representation for off-line text-independent writer identification
Type de publicationJournal Article
Year of Publication2019
AuteursChahi A, Merabet YEl, Ruichek Y, Touahni R
JournalEXPERT SYSTEMS WITH APPLICATIONS
Volume123
Pagination357-376
Date PublishedJUN 1
Type of ArticleArticle
ISSN0957-4174
Mots-clésDissimilarity measure, Feature histogram descriptors, Hamming distance, Handwritten connected components, Histogram sequence concatenation, Off-line writer identification, Text-independent, Texture features
Résumé

Feature engineering forms an important component of machine learning and pattern recognition. It is a fundamental process for off-line writer identification of handwritten documents, which continues to be an interesting subject of research in various forensic and authentication areas. In this work, we propose an efficient, yet computationally and conceptually simple framework for off-line text independent writer identification using local textural features in characterizing the writing style of each writer. These include Local Binary Patterns (LBP), Local Ternary Patterns (LIP), and Local Phase Quantization (LPQ). Our approach focuses on exploiting the writing images at small observation regions where a set of connected component sub-images are cropped and extracted from each handwriting sample (document or set of word text line images). These connected components are seen as texture images where each one of them is subjected to feature extraction using LBP, LPQ or LTP. Then, a histogram sequence concatenation is applied to the feature image after dimensionality reduction followed by image subdivision into a number of non-overlapping regions. For classification, the 1-NN (Nearest Neighbor) classifier is used to identify the writer of the questioned samples based on the dissimilarity of feature vectors computed from all components in the writing. Experiments on IFN/ENIT (411 writers/Arabic), AHTID/MW (53 writers/Arabic), CVL (309 writers/English), and IAM (657 writers/English) databases demonstrate that our proposed system outperforms old and recent state-of-the-art writer identification systems on Arabic script, and demonstrates a competitive performance on English ones. (C) 2019 Elsevier Ltd. All rights reserved.

DOI10.1016/j.eswa.2019.01.045