Active Learning for Anomaly Detection in Environmental data

by Stefania Russo, Moritz Lürig, Blake Matthews, Kris Roger Elie Villez

Publication Type

Journal

Journal Name

Environmental Modelling & Software

Publication Date

December, 2020

Page Number

104869

Volume

134

Issue

View DOI Listing

Abstract

Due to the growing amount of data from in-situ sensors in environmental monitoring, it becomes necessary to automatically detect anomalous data points. Nowadays, this is mainly performed using supervised machine learning models, which need a fully labelled data set for their training process. However, the process of labelling data is typically cumbersome and, as a result, a hindrance to the adoption of machine learning methods for automated anomaly detection. In this work, we propose to address this challenge by means of active learning. This method consists of querying the domain expert for the labels of only a selected subset of the full data set. We show that this reduces the time and costs associated to labelling while delivering the same or similar anomaly detection performances. Finally, we also show that machine learning models providing a nonlinear classification boundary are to be recommended for anomaly detection in complex environmental data sets.

Active Learning for Anomaly Detection in Environmental data

Abstract

Researchers

Organizations