arXiv:1706.04601 Abstract | arXiv Analytics

arXiv:1706.04601 [cs.LG]Abstract References Reviews Resources

Provable benefits of representation learning

Published 2017-06-14Version 1

There is general consensus that learning representations is useful for a variety of reasons, e.g. efficient use of labeled data (semi-supervised learning), transfer learning and understanding hidden structure of data. Popular techniques for representation learning include clustering, manifold learning, kernel-learning, autoencoders, Boltzmann machines, etc. To study the relative merits of these techniques, it's essential to formalize the definition and goals of representation learning, so that they are all become instances of the same definition. This paper introduces such a formal framework that also formalizes the utility of learning the representation. It is related to previous Bayesian notions, but with some new twists. We show the usefulness of our framework by exhibiting simple and natural settings -- linear mixture models and loglinear models, where the power of representation learning can be formally shown. In these examples, representation learning can be performed provably and efficiently under plausible assumptions (despite being NP-hard), and furthermore: (i) it greatly reduces the need for labeled data (semi-supervised learning) and (ii) it allows solving classification tasks when simpler approaches like nearest neighbors require too much data (iii) it is more powerful than manifold learning methods.

Comments: 22 pages

Categories: cs.LG, stat.ML

Keywords: representation learning, provable benefits, labeled data, linear mixture models, simpler approaches

Related articles: Most relevant | Search more

arXiv:1703.00854 [cs.LG] (Published 2017-03-02)

Learning the Structure of Generative Models without Labeled Data

Stephen H. Bach, Bryan He, Alexander Ratner, Christopher Ré

arXiv:1809.06473 [cs.LG] (Published 2018-09-17)

Towards Deep and Representation Learning for Talent Search at LinkedIn

Rohan Ramanath et al.

arXiv:1909.09252 [cs.LG] (Published 2019-09-19)

HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many Modalities

Devanshu Arya, Stevan Rudinac, Marcel Worring

arXiv Analytics

arXiv:1706.04601 [cs.LG]Abstract References Reviews Resources

Provable benefits of representation learning

Links

Toolbox

arXiv:1706.04601 [cs.LG]AbstractReferencesReviewsResources

Provable benefits of representation learning

Links

Toolbox

arXiv:1706.04601 [cs.LG]Abstract References Reviews Resources