Piecewise Linear Approximation in Data Streaming: Algorithmic Implementations and Experimental Analysis

08/27/2018
by   Romaric Duvignau, et al.
0

Piecewise Linear Approximation (PLA) is a well-established tool to reduce the size of the representation of time series by approximating the series by a sequence of line segments while keeping the error introduced by the approximation within some predetermined threshold. With the recent rise of edge computing, PLA algorithms find a complete new set of applications with the emphasis on reducing the volume of streamed data. In this study, we identify two scenarios set in a data-stream processing context: data reduction in sensor transmissions and datacenter storage. In connection to those scenarios, we identify several streaming metrics and propose streaming protocols as algorithmic implementations of the state of the art PLA techniques. In an experimental evaluation, we measure the quality of the reviewed meth- ods and protocols and evaluate their performance against those streaming statistics. All known methods have defi- ciencies when it comes to handling streaming-like data, e.g. inflation of the input stream, high latency or poor aver- age error. Our experimental results highlight the challenges raised when transferring those classical methods into the stream processing world and present alternative techniques to overcome them and balance the related trade-offs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/10/2022

Colocating Real-time Storage and Processing: An Analysis of Pull-based versus Push-based Streaming

Real-time Big Data architectures evolved into specialized layers for han...
research
03/11/2021

ESPBench: The Enterprise Stream Processing Benchmark

Growing data volumes and velocities in fields such as Industry 4.0 or th...
research
10/03/2020

Spiking Neural Networks Through the Lens of Streaming Algorithms

We initiate the study of biological neural networks from the perspective...
research
12/07/2020

Passive Approach for the K-means Problem on Streaming Data

Currently the amount of data produced worldwide is increasing beyond mea...
research
10/05/2018

GraphBolt: Streaming Graph Approximations on Big Data

Graphs are found in a plethora of domains, including online social netwo...
research
10/29/2021

Parallel-and-stream accelerator for computationally fast supervised learning

Two dominant distributed computing strategies have emerged to overcome t...
research
06/20/2020

Coconut Palm: Static and Streaming Data Series Exploration Now in your Palm

Many modern applications produce massive streams of data series and main...

Please sign up or login with your details

Forgot password? Click here to reset