ALDI++: Automatic and parameter-less discord and outlier detection for building energy load profiles

03/13/2022
by   Matias Quintana, et al.
8

Data-driven building energy prediction is an integral part of the process for measurement and verification, building benchmarking, and building-to-grid interaction. The ASHRAE Great Energy Predictor III (GEPIII) machine learning competition used an extensive meter data set to crowdsource the most accurate machine learning workflow for whole building energy prediction. A significant component of the winning solutions was the pre-processing phase to remove anomalous training data. Contemporary pre-processing methods focus on filtering statistical threshold values or deep learning methods requiring training data and multiple hyper-parameters. A recent method named ALDI (Automated Load profile Discord Identification) managed to identify these discords using matrix profile, but the technique still requires user-defined parameters. We develop ALDI++, a method based on the previous work that bypasses user-defined parameters and takes advantage of discord similarity. We evaluate ALDI++ against a statistical threshold, variational auto-encoder, and the original ALDI as baselines in classifying discords and energy forecasting scenarios. Our results demonstrate that while the classification performance improvement over the original method is marginal, ALDI++ helps achieve the best forecasting error improving 6 computation time.

READ FULL TEXT
research
06/03/2020

The Building Data Genome Project 2: Hourly energy meter data from the ASHRAE Great Energy Predictor III competition

This paper describes an open data set of 3,053 energy meters from 1,636 ...
research
06/03/2020

The Building Data Genome Project 2: Energy meter data from the ASHRAE Great Energy Predictor III competition

This paper describes an open data set of 3,053 energy meters from 1,636 ...
research
07/14/2020

The ASHRAE Great Energy Predictor III competition: Overview and results

In late 2019, ASHRAE hosted the Great Energy Predictor III (GEPIII) mach...
research
08/01/2019

LoadCNN: A Efficient Green Deep Learning Model for Day-ahead Individual Resident Load Forecasting

Accurate day-ahead individual resident load forecasting is very importan...
research
06/25/2021

Limitations of machine learning for building energy prediction

Machine learning for building energy prediction has exploded in populari...
research
02/07/2022

Gradient boosting machines and careful pre-processing work best: ASHRAE Great Energy Predictor III lessons learned

The ASHRAE Great Energy Predictor III (GEPIII) competition was held in l...
research
04/03/2019

Optimized Preprocessing and Machine Learning for Quantitative Raman Spectroscopy in Biology

Raman spectroscopy's capability to provide meaningful composition predic...

Please sign up or login with your details

Forgot password? Click here to reset