Efficient anomaly detection on sampled data streams with contaminated phase I data
Affiliation auteurs | !!!! Error affiliation !!!! |
Titre | Efficient anomaly detection on sampled data streams with contaminated phase I data |
Type de publication | Journal Article |
Year of Publication | 2020 |
Auteurs | Sibai REl, Abdo JBou, Jaoude CAbou, Demerjian J, Assaker J, Makhoul A |
Journal | INTERNET TECHNOLOGY LETTERS |
Volume | 3 |
Pagination | e205 |
Date Published | SEP |
Type of Article | Article |
Mots-clés | anomalies detection, data streams, EWMA, sampling algorithms |
Résumé | Control chart algorithms aim to monitor a process over time. This process consists of two phases. Phase I, also called the learning phase, estimates the normal process parameters, then in Phase II, anomalies are detected. However, the learning phase itself can contain contaminated data such as outliers. If left undetected, they can jeopardize the accuracy of the whole chart by affecting the computed parameters, which leads to faulty classifications and defective data analysis results. This problem becomes more severe when the analysis is done on a sample of the data rather than the whole data. To avoid such a situation, Phase I quality must be guaranteed. The purpose of this paper is to introduce a new approach for applying EWMA chart to obtain accurate anomaly detection results over sampled data even if contaminations exist in Phase I. The new chart is applied to a real dataset, and its performance is evaluated on both sampled and not sampled data according to several criteria. |
DOI | 10.1002/itl2.205 |