Anomaly detection on event logs with a scarcity of labels

Sylvio Barbon Junior, Paolo Ceravolo, Ernesto Damiani, Nicolas Jashchenko Omori, Gabriel Marques Tavares

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

23 Scopus citations

Abstract

Assuring anomaly-free business process executions is a key challenge for many organizations. Traditional techniques address this challenge using prior knowledge about anomalous cases that is seldom available in real-life. In this work, we propose the usage of word2vec encoding and One-Class Classification algorithms to detect anomalies by relying on normal behavior only. We investigated 6 different types of anomalies over 38 real and synthetics event logs, comparing the predictive performance of Support Vector Machine, One-Class Support Vector Machine, and Local Outlier Factor. Results show that our technique is viable for real-life scenarios, overcoming traditional machine learning for a wide variety of settings where only the normal behavior can be labeled.

Original languageBritish English
Title of host publicationProceedings - 2020 2nd International Conference on Process Mining, ICPM 2020
EditorsBoudewijn van Dongen, Marco Montali, Moe Thandar Wynn
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages161-168
Number of pages8
ISBN (Electronic)9781728198323
DOIs
StatePublished - Oct 2020
Event2nd International Conference on Process Mining, ICPM 2020 - Virtual, Padua, Italy
Duration: 4 Oct 20209 Oct 2020

Publication series

NameProceedings - 2020 2nd International Conference on Process Mining, ICPM 2020

Conference

Conference2nd International Conference on Process Mining, ICPM 2020
Country/TerritoryItaly
CityVirtual, Padua
Period4/10/209/10/20

Keywords

  • anomaly detection
  • encoding
  • Local Outlier Factor
  • One Class Classification
  • Support Vector Machine

Fingerprint

Dive into the research topics of 'Anomaly detection on event logs with a scarcity of labels'. Together they form a unique fingerprint.

Cite this