Please use this identifier to cite or link to this item:https://hdl.handle.net/20.500.12259/92994
Type of publication: Magistro darbas / Master thesis
Field of Science: Informatika (N009) / Informatics
Author(s): Pantechovskis, Aleksas
Title: Kriterijų nustatymas anomalijų aptikimo algoritmo parinkimui
Other Title: Determining criteria for choosing anomaly detection algorithm
Extent: 61 p.
Date: 21-May-2019
Keywords: Pašalinių aptikimas;Outlier detection;Anomalijų aptikimas;Anomaly detection;Benchmarking;Įvertinimas;Evaluation;Palyginimas;Comparison
Abstract: In today’s world there is lots of data requiring automated processing: nobody can analyze and extract useful information from it manually. One of the existing processing modes is anomaly detection: detect failures, high traffic, dangerous states and so on. However, it often requires the developer or the user of such analysis systems to have a lot of knowledge on this subject making it less accessible. One of the aspects is the choice of a suitable algorithm and its parameters. The main goal of this work is to start creating guidelines or a decision tree to simplify the process of choosing the most suitable anomaly detection algorithm depending on the dataset characteristics and other requirements. This project was proposed by SAP and inspired by works of the Dawn research team from Stanford and their MacroBase system. In this work we review MacroBase architecture and functionality, describe commonly used real datasets for anomaly detection benchmarking and synthetic dataset generation methods, anomaly detection quality metrics, and develop a benchmarking platform, evaluate anomaly detection algorithms of different types: distance-based (MCOD), density-based (LOF), statistical (MAD, FastMCD, Percentile), Isolation forest.
Internet: https://hdl.handle.net/20.500.12259/92994
Affiliation(s): Informatikos fakultetas
Taikomosios informatikos katedra
Appears in Collections:2019 m. (IF mag.)

Files in This Item:
src.zip106.89 MBUnknownView/Open

source code

datasets.zip1.32 GBUnknownView/Open

datasets

Aleksas_Pantechovskis_md.pdf2.57 MBAdobe PDFView/Open

master thesis

Show full item record

Page view(s)

84
checked on Nov 5, 2019

Download(s)

114
checked on Nov 5, 2019

Google ScholarTM

Check


This item is licensed under a Creative Commons License Creative Commons