arXiv:2206.10139 Abstract | arXiv Analytics

arXiv:2206.10139 [cs.LG]Abstract References Reviews Resources

Insights into Pre-training via Simpler Synthetic Tasks

Published 2022-06-21Version 1

Pre-training produces representations that are effective for a wide range of downstream tasks, but it is still unclear what properties of pre-training are necessary for effective gains. Notably, recent work shows that even pre-training on synthetic tasks can achieve significant gains in downstream tasks. In this work, we perform three experiments that iteratively simplify pre-training and show that the simplifications still retain much of its gains. First, building on prior work, we perform a systematic evaluation of three existing synthetic pre-training methods on six downstream tasks. We find the best synthetic pre-training method, LIME, attains an average of $67\%$ of the benefits of natural pre-training. Second, to our surprise, we find that pre-training on a simple and generic synthetic task defined by the Set function achieves $65\%$ of the benefits, almost matching LIME. Third, we find that $39\%$ of the benefits can be attained by using merely the parameter statistics of synthetic pre-training. We release the source code at https://github.com/felixzli/synthetic_pretraining.

Comments: 30 pages

Categories: cs.LG, cs.AI

Keywords: simpler synthetic tasks, downstream tasks, achieve significant gains, best synthetic pre-training method, generic synthetic task

Related articles: Most relevant | Search more

arXiv:2309.17002 [cs.LG] (Published 2023-09-29)

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

Hao Chen et al.

arXiv:2211.03782 [cs.LG] (Published 2022-11-07)

On minimal variations for unsupervised representation learning

Vivien Cabannes, Alberto Bietti, Randall Balestriero

arXiv:2307.08623 [cs.LG] (Published 2023-07-14)

HYTREL: Hypergraph-enhanced Tabular Data Representation Learning

Pei Chen, Soumajyoti Sarkar, Leonard Lausen, Balasubramaniam Srinivasan, Sheng Zha, Ruihong Huang, George Karypis

arXiv Analytics

arXiv:2206.10139 [cs.LG]Abstract References Reviews Resources

Insights into Pre-training via Simpler Synthetic Tasks

Links

Toolbox

arXiv:2206.10139 [cs.LG]AbstractReferencesReviewsResources

Insights into Pre-training via Simpler Synthetic Tasks

Links

Toolbox

arXiv:2206.10139 [cs.LG]Abstract References Reviews Resources