Adapting to Skew: Imputing Spatiotemporal Urban Data with 3D Partial Convolutions and Biased Masking

01/10/2023
by   Bin Han, et al.
0

We adapt image inpainting techniques to impute large, irregular missing regions in urban settings characterized by sparsity, variance in both space and time, and anomalous events. Missing regions in urban data can be caused by sensor or software failures, data quality issues, interference from weather events, incomplete data collection, or varying data use regulations; any missing data can render the entire dataset unusable for downstream applications. To ensure coverage and utility, we adapt computer vision techniques for image inpainting to operate on 3D histograms (2D space + 1D time) commonly used for data exchange in urban settings. Adapting these techniques to the spatiotemporal setting requires handling skew: urban data tend to follow population density patterns (small dense regions surrounded by large sparse areas); these patterns can dominate the learning process and fool the model into ignoring local or transient effects. To combat skew, we 1) train simultaneously in space and time, and 2) focus attention on dense regions by biasing the masks used for training to the skew in the data. We evaluate the core model and these two extensions using the NYC taxi data and the NYC bikeshare data, simulating different conditions for missing data. We show that the core model is effective qualitatively and quantitatively, and that biased masking during training reduces error in a variety of scenarios. We also articulate a tradeoff in varying the number of timesteps per training sample: too few timesteps and the model ignores transient events; too many timesteps and the model is slow to train with limited performance gain.

READ FULL TEXT

page 1

page 2

page 7

research
01/01/2019

EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning

Over the last few years, deep learning techniques have yielded significa...
research
08/27/2019

Robust Tensor Recovery with Fiber Outliers for Traffic Events

Event detection is gaining increasing attention in smart cities research...
research
03/12/2021

Spatiotemporal Tensor Completion for Improved Urban Traffic Imputation

Effective management of urban traffic is important for any smart city in...
research
06/25/2022

Missing data patterns in runners' careers: do they matter?

Predicting the future performance of young runners is an important resea...
research
12/28/2018

Spatiotemporal Data Fusion for Precipitation Nowcasting

Precipitation nowcasting using neural networks and ground-based radars h...
research
10/26/2021

MisConv: Convolutional Neural Networks for Missing Data

Processing of missing data by modern neural networks, such as CNNs, rema...
research
02/09/2019

WarpFlow: Exploring Petabytes of Space-Time Data

WarpFlow is a fast, interactive data querying and processing system with...

Please sign up or login with your details

Forgot password? Click here to reset