Efficient anomaly detection on sampled data streams with contaminated phase I data

Affiliation auteurs!!!! Error affiliation !!!!
TitreEfficient anomaly detection on sampled data streams with contaminated phase I data
Type de publicationJournal Article
Year of Publication2020
AuteursSibai REl, Abdo JBou, Jaoude CAbou, Demerjian J, Assaker J, Makhoul A
JournalINTERNET TECHNOLOGY LETTERS
Volume3
Paginatione205
Date PublishedSEP
Type of ArticleArticle
Mots-clésanomalies detection, data streams, EWMA, sampling algorithms
Résumé

Control chart algorithms aim to monitor a process over time. This process consists of two phases. Phase I, also called the learning phase, estimates the normal process parameters, then in Phase II, anomalies are detected. However, the learning phase itself can contain contaminated data such as outliers. If left undetected, they can jeopardize the accuracy of the whole chart by affecting the computed parameters, which leads to faulty classifications and defective data analysis results. This problem becomes more severe when the analysis is done on a sample of the data rather than the whole data. To avoid such a situation, Phase I quality must be guaranteed. The purpose of this paper is to introduce a new approach for applying EWMA chart to obtain accurate anomaly detection results over sampled data even if contaminations exist in Phase I. The new chart is applied to a real dataset, and its performance is evaluated on both sampled and not sampled data according to several criteria.

DOI10.1002/itl2.205