(Ab)Using Regression for Data Adjustment
Abstract
In various economic applications, people want to compare units with respect to certain quantities measuring their performance. The latter, however, is often influenced by certain factors which are beyond control of the units, and one would like to extract an adjusted performance from the data. Specifically, let summarize the factors of the -th unit. Then one could think of a model equation with a regression function describing the unavoidable influence of the factors and being the adjusted performance of the -th unit. Now a common proposal is to estimate via regression methods by a function depending on the current data , possibly augmented by additional past data, and to use the residuals as surrogates for the adjusted performances . In the present report we discuss this approach, its potential pitfalls and (mis)interpretation. In particular, an unavoidable property of the residuals is that they measure only parts of the adjusted performance while the remaining parts get hidden in the estimated function . Possible alternatives are mentioned briefly.
Cite
@article{arxiv.1202.1964,
title = {(Ab)Using Regression for Data Adjustment},
author = {Lutz Duembgen},
journal= {arXiv preprint arXiv:1202.1964},
year = {2016}
}
Comments
Replaces an older manuscript "On Ranks of Regression Errors and Residuals"