arXiv:2310.03419 Abstract | arXiv Analytics

arXiv:2310.03419 [cs.LG]

Pre-Training and Fine-Tuning Generative Flow Networks

Ling Pan, Moksh Jain, Kanika Madan, Yoshua Bengio

Published 2023-10-05, Version 1

Generative Flow Networks (GFlowNets) are amortized samplers that learn stochastic policies to sequentially generate compositional objects from a given unnormalized reward distribution. They can generate diverse sets of high-reward objects, which is an important consideration in scientific discovery tasks. However, because they are typically trained from a given extrinsic reward function, how to leverage the power of pre-training and train GFlowNets in an unsupervised fashion for efficient adaptation to downstream tasks remains an important open challenge. Inspired by recent successes of unsupervised pre-training in various domains, we introduce a novel approach for reward-free pre-training of GFlowNets. By framing the training as a self-supervised problem, we propose an outcome-conditioned GFlowNet (OC-GFN) that learns to explore the candidate space. Specifically, OC-GFN learns to reach any targeted outcome, akin to goal-conditioned policies in reinforcement learning. We show that the pre-trained OC-GFN model allows direct extraction of a policy capable of sampling from any new reward function in downstream tasks. Nonetheless, adapting OC-GFN to a downstream task-specific reward involves an intractable marginalization over possible outcomes. We propose a novel way to approximate this marginalization by learning an amortized predictor, enabling efficient fine-tuning. Extensive experimental results validate the efficacy of our approach, demonstrating the effectiveness of pre-training the OC-GFN and its ability to swiftly adapt to downstream tasks and discover modes more efficiently. This work may serve as a foundation for further exploration of pre-training strategies in the context of GFlowNets.
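
The marginalization over outcomes mentioned in the abstract can be illustrated on a toy problem. The sketch below is not the authors' implementation; it assumes a hypothetical space of length-3 bit strings built one bit at a time, a made-up reward function, and an idealized outcome-conditioned policy that always reaches its target. Under those assumptions, weighting each outcome by its reward and summing over all outcomes yields a forward policy whose terminal distribution is proportional to the reward. The sum is exact here only because the space has 8 outcomes; in realistic spaces it is the intractable quantity that the abstract says is approximated by an amortized predictor.

```python
import itertools

LENGTH = 3  # hypothetical toy setting: objects are bit strings of length 3


def reward(x):
    """Hypothetical unnormalized reward: 1 + number of ones in the string."""
    return 1.0 + sum(x)


# Every possible outcome (complete object) in the toy space.
OUTCOMES = list(itertools.product((0, 1), repeat=LENGTH))


def marginal_policy(prefix):
    """Reward-weighted mixture over outcomes: P(next bit | prefix).

    An idealized outcome-conditioned policy for target y picks the next
    bit of y with probability 1, so the mixture reduces to summing the
    reward of all outcomes consistent with each candidate next bit.
    """
    n = len(prefix)
    # Reward mass of all outcomes still reachable from this prefix.
    total = sum(reward(y) for y in OUTCOMES if y[:n] == prefix)
    probs = {}
    for bit in (0, 1):
        child = prefix + (bit,)
        mass = sum(reward(y) for y in OUTCOMES if y[: n + 1] == child)
        probs[bit] = mass / total
    return probs


def terminal_distribution():
    """Probability of generating each complete object under the mixture."""
    dist = {}
    for y in OUTCOMES:
        p = 1.0
        for i in range(LENGTH):
            p *= marginal_policy(y[:i])[y[i]]
        dist[y] = p
    return dist


Z = sum(reward(y) for y in OUTCOMES)  # normalizing constant of the toy reward
dist = terminal_distribution()
```

Comparing `dist[y]` with `reward(y) / Z` for each outcome shows that the two agree in this toy case, which is the sense in which a sampler is said to draw from an unnormalized reward distribution.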

Categories: cs.LG, cs.AI

Keywords: fine-tuning generative flow networks, pre-training, downstream tasks, predictor enabling efficient fine-tuning, generate compositional objects

Related articles:

arXiv:2309.17002 [cs.LG] (Published 2023-09-29)

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

Hao Chen et al.

arXiv:2211.03782 [cs.LG] (Published 2022-11-07)

On minimal variations for unsupervised representation learning

Vivien Cabannes, Alberto Bietti, Randall Balestriero

arXiv:2205.09357 [cs.LG] (Published 2022-05-19)

Continual Pre-Training Mitigates Forgetting in Language and Vision

Andrea Cossu, Tinne Tuytelaars, Antonio Carta, Lucia Passaro, Vincenzo Lomonaco, Davide Bacciu
