
arXiv:2010.12268 [cs.LG]

A Combinatorial Perspective on Transfer Learning

Jianan Wang, Eren Sezener, David Budden, Marcus Hutter, Joel Veness

Published 2020-10-23 (Version 1)

Human intelligence is characterized not only by the capacity to learn complex skills, but also by the ability to rapidly adapt and acquire new skills within an ever-changing environment. In this work we study how learning modular solutions can allow for effective generalization to both unseen and potentially differently distributed data. Our main postulate is that the combination of task segmentation, modular learning and memory-based ensembling can give rise to generalization on an exponentially growing number of unseen tasks. We provide a concrete instantiation of this idea using a combination of: (1) the Forget-Me-Not Process, for task segmentation and memory-based ensembling; and (2) Gated Linear Networks, which, in contrast to contemporary deep learning techniques, use a modular and local learning mechanism. We demonstrate that this system exhibits a number of desirable continual learning properties: robustness to catastrophic forgetting, no negative transfer, and increasing levels of positive transfer as more tasks are seen. We show competitive performance against both offline and online methods on standard continual learning benchmarks.
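
The Gated Linear Network half of this recipe is concrete enough to sketch. Below is a minimal, illustrative Python implementation of a single GLN neuron with halfspace gating and purely local online learning on the log loss; the class name, shapes, learning rate, and clipping bounds are assumptions made for exposition, not the authors' reference code.

    import numpy as np

    def logit(p):
        # Map probabilities to log-odds; inputs are clipped by the caller.
        return np.log(p / (1.0 - p))

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    class GLNNeuron:
        def __init__(self, n_inputs, n_contexts=4, side_dim=8, lr=0.01, seed=0):
            rng = np.random.default_rng(seed)
            # Halfspace gating: each context bit comes from a fixed random
            # hyperplane applied to the side information z.
            n_bits = int(np.log2(n_contexts))
            self.hyperplanes = rng.normal(size=(n_bits, side_dim))
            # One weight vector per context, initialised to simple averaging.
            self.weights = np.full((n_contexts, n_inputs), 1.0 / n_inputs)
            self.lr = lr

        def context(self, z):
            # Which region of side-information space are we in?
            bits = (self.hyperplanes @ z > 0).astype(int)
            return int(bits @ (2 ** np.arange(len(bits))))

        def predict(self, p_in, z):
            # Mix predecessor probabilities in logit space with the weight
            # vector selected by the active context.
            x = logit(np.clip(p_in, 1e-4, 1 - 1e-4))
            return sigmoid(self.weights[self.context(z)] @ x)

        def update(self, p_in, z, y):
            # Local learning: online gradient descent on this neuron's own
            # log loss; no backpropagated error signal is needed.
            c = self.context(z)
            x = logit(np.clip(p_in, 1e-4, 1 - 1e-4))
            p = sigmoid(self.weights[c] @ x)
            self.weights[c] -= self.lr * (p - y) * x
            # Keep weights in a hypercube (illustrative bound).
            self.weights[c] = np.clip(self.weights[c], -5.0, 5.0)
            return p

    # Usage sketch: one online step, using the raw features as side info.
    neuron = GLNNeuron(n_inputs=3, side_dim=3)
    p_in = np.array([0.5, 0.7, 0.2])   # outputs of predecessor predictors
    z = np.array([0.1, -0.3, 0.8])
    neuron.update(p_in, z, y=1)

Because each update touches only the weight vector selected by the current context, inputs that gate to different contexts modify disjoint parameters; this is one intuition for the modularity the abstract credits with avoiding negative transfer.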

Related articles:
arXiv:1904.04334 [cs.LG] (Published 2019-04-08)
A Target-Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning
arXiv:2006.07002 [cs.LG] (Published 2020-06-12)
Double Double Descent: On Generalization Errors in Transfer Learning between Linear Regression Tasks
arXiv:1902.04151 [cs.LG] (Published 2019-01-26)
Evaluation of Transfer Learning for Classification of: (1) Diabetic Retinopathy by Digital Fundus Photography and (2) Diabetic Macular Edema, Choroidal Neovascularization and Drusen by Optical Coherence Tomography