Abstract

Conventional wisdom attributes the success of Generative Flow Networks (GFlowNets) to their ability to exploit the compositional structure of the sample space for learning generalizable flow functions (Bengio et al., 2021). Despite the abundance of empirical evidence, formalizing this belief with verifiable nonvacuous statistical guarantees has remained elusive. We address this issue with the first data-dependent generalization bounds for GFlowNets. We also elucidate the negative impact of the state space size on the generalization performance of these models via Azuma-Hoeffding-type oracle PAC-Bayesian inequalities. We leverage our theoretical insights to design a novel distributed learning algorithm for GFlowNets, which we call Subgraph Asynchronous Learning (SAL). In a nutshell, SAL utilizes a divide-and-conquer strategy: multiple GFlowNets are trained in parallel on smaller subnetworks of the flow network, and then aggregated with an additional GFlowNet that allocates appropriate flow to each subnetwork. Our experiments with synthetic and real-world problems demonstrate the benefits of SAL over centralized training in terms of mode coverage and distribution matching.

Original languageEnglish
Title of host publication13th International Conference on Learning Representations, ICLR 2025
PublisherCurran Associates Inc.
Pages68205-68247
Number of pages43
ISBN (Electronic)9798331320850
Publication statusPublished - 2025
MoE publication typeA4 Conference publication
EventInternational Conference on Learning Representations - Singapore, Singapore, Singapore
Duration: 24 Apr 202528 Apr 2025
Conference number: 13
https://iclr.cc/

Conference

ConferenceInternational Conference on Learning Representations
Abbreviated titleICLR
Country/TerritorySingapore
CitySingapore
Period24/04/202528/04/2025
Internet address

Fingerprint

Dive into the research topics of 'Generalization and Distributed Learning of GFlowNets'. Together they form a unique fingerprint.
  • Science-IT

    Hakala, M. (Manager)

    School of Science

    Facility/equipment: Facility

Cite this