Asymptotic Degradation of Linear Regression Estimates With Strategic Data Sources

06/28/2021
by Nicolas Gast, et al.

We consider the problem of linear regression from strategic data sources with a public good component, i.e., when data is provided by strategic agents who seek to minimize an individual provision cost for increasing their data's precision while benefiting from the model's overall precision. In contrast to previous work, our model tackles the case where there is uncertainty about the attributes characterizing the agents' data, a critical aspect of the problem when the number of agents is large. We provide a characterization of the game's equilibrium, which reveals an interesting connection with optimal design. Subsequently, we focus on the asymptotic behavior of the covariance of the linear regression parameters estimated via generalized least squares as the number of data sources becomes large. We provide upper and lower bounds for this covariance matrix and show that, when the agents' provision costs are superlinear, the model's covariance converges to zero, but at a slower rate than in virtually all learning problems with exogenous data. On the other hand, if the agents' provision costs are linear, this covariance fails to converge. This shows that even the basic property of consistency of generalized least squares estimators is compromised when the data sources are strategic.
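
As a rough illustration of the quantity the abstract studies, the sketch below computes the covariance of a generalized least squares estimator from attribute vectors and per-source precisions (inverse noise variances), using the standard formula (sum_i lambda_i x_i x_i^T)^{-1}. The function name `gls_covariance`, the Gaussian attributes, and the `n ** -0.5` precision schedule are assumptions made for this example only; in particular, the decaying schedule is a hypothetical stand-in and is not the equilibrium derived in the paper.

```python
import numpy as np


def gls_covariance(X, precisions):
    """Covariance of the GLS estimator for y_i = x_i^T beta + eps_i,
    where the eps_i are independent with variance 1 / precisions[i]."""
    X = np.asarray(X, dtype=float)
    lam = np.asarray(precisions, dtype=float)
    # Information matrix: sum_i lam_i * x_i x_i^T
    information = X.T @ (lam[:, None] * X)
    return np.linalg.inv(information)


rng = np.random.default_rng(0)
d = 2  # number of regression parameters (illustrative choice)

for n in (100, 1000, 10000):
    X = rng.normal(size=(n, d))  # assumed attribute distribution
    # Exogenous data: every source has the same fixed precision.
    exogenous = np.ones(n)
    # Hypothetical strategic-like schedule: each source's precision
    # shrinks as n grows. NOT the paper's equilibrium, just a stand-in.
    decaying = np.full(n, n ** -0.5)
    print(
        n,
        np.trace(gls_covariance(X, exogenous)),
        np.trace(gls_covariance(X, decaying)),
    )
```

With fixed precisions the trace shrinks roughly like 1/n, whereas under the assumed decaying schedule it shrinks more slowly, which mirrors in spirit the slower convergence the abstract describes for superlinear provision costs.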

