arXiv Analytics

arXiv:2305.17297 [cs.LG]

Generalization Error without Independence: Denoising, Linear Regression, and Transfer Learning

Chinmaya Kausik, Kashvi Srivastava, Rishi Sonthalia

Published 2023-05-26 (Version 1)

Studying the generalization abilities of linear models with real data is a central question in statistical learning. While a limited number of important prior works (Loureiro et al. 2021a, 2021b; Wei et al. 2022) do validate theory against real data, they rely on restrictive technical assumptions, such as a well-conditioned covariance matrix and independent and identically distributed (i.i.d.) data, that need not hold for real data. Additionally, prior works that address distribution shift usually impose technical assumptions on the joint distribution of the train and test data (Tripuraneni et al. 2021; Wu and Xu 2020) and do not test on real data. To address these issues and better model real data, we study data that is not i.i.d. but has low-rank structure. Further, we address distribution shift by decoupling the assumptions on the training and test distributions. We provide analytical formulas for the generalization error of the denoising problem that are asymptotically exact. These formulas are used to derive theoretical results for linear regression, data augmentation, principal component regression, and transfer learning. We validate all of our theoretical results on real data, obtaining a low relative mean squared error of around 1% between the empirical risk and our estimated risk.
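A minimal sketch of the denoising setup the abstract describes: data with low-rank structure (so rows are not i.i.d.) observed under additive noise, a linear denoiser fit by least squares, and its generalization error measured empirically on fresh data. The dimensions, rank, and noise level below are illustrative assumptions, not values from the paper, and the least-squares denoiser stands in for whatever estimator the paper analyzes.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: n samples, ambient dimension d, latent rank r << d.
n, d, r = 500, 100, 5
U = rng.normal(size=(n, r))
V = rng.normal(size=(r, d)) / np.sqrt(r)
X = U @ V                                    # clean low-rank data
A = X + rng.normal(scale=0.5, size=(n, d))   # noisy observations

# Fit a linear denoiser W minimizing ||A W - X||_F^2 via least squares.
W, *_ = np.linalg.lstsq(A, X, rcond=None)

# Empirical generalization error on fresh data from the same low-rank model.
U_test = rng.normal(size=(n, r))
X_test = U_test @ V
A_test = X_test + rng.normal(scale=0.5, size=(n, d))
risk = np.mean((A_test @ W - X_test) ** 2)
print(risk)
```

Because the signal lives in a rank-5 subspace of the 100-dimensional ambient space, the learned denoiser can project away most of the noise, so the per-entry risk comes out well below the raw noise variance of 0.25.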

Related articles:
arXiv:2006.07002 [cs.LG] (Published 2020-06-12)
Double Double Descent: On Generalization Errors in Transfer Learning between Linear Regression Tasks
arXiv:1705.07048 [cs.LG] (Published 2017-05-19)
Linear regression without correspondence
arXiv:1206.3274 [cs.LG] (Published 2012-06-13)
Small Sample Inference for Generalization Error in Classification Using the CUD Bound