Abstract
StyleGAN2 is a Tensorflow-based Generative Adversarial Network (GAN) framework that represents the state-of-the-art in generative image modelling. The current release of StyleGAN2 implements multi-GPU training via Tensorflow’s device contexts which limits data parallelism to a single node. In this work, a data-parallel multi-node training capability is implemented in StyleGAN2 via Horovod which enables harnessing the compute capability of larger cluster architectures. We demonstrate that the new Horovod-based communication outperforms the previous context approach on a single node. Furthermore, we demonstrate that the multi-node training does not compromise the accuracy of StyleGAN2 for a constant effective batch size. Finally, we report strong and weak scaling of the new implementation up to 64 NVIDIA Tesla A100 GPUs distributed across eight NVIDIA DGX A100 nodes, demonstrating the utility of the approach at scale.
Original language | English |
---|---|
Title of host publication | Pattern Recognition. ICPR International Workshops and Challenges, 2021, Proceedings |
Editors | Alberto Del Bimbo, Rita Cucchiara, Stan Sclaroff, Giovanni Maria Farinella, Tao Mei, Marco Bertini, Hugo Jair Escalante, Roberto Vezzani |
Publisher | Springer |
Pages | 677-684 |
Number of pages | 8 |
ISBN (Print) | 978-3-030-68762-5 |
DOIs | |
Publication status | Published - 2021 |
MoE publication type | A4 Conference publication |
Event | International Conference on Pattern Recognition - Virtual, Online, Milan, Italy Duration: 10 Jan 2021 → 15 Jan 2021 Conference number: 25 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Publisher | Springer |
Volume | 12661 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | International Conference on Pattern Recognition |
---|---|
Abbreviated title | ICPR |
Country/Territory | Italy |
City | Milan |
Period | 10/01/2021 → 15/01/2021 |
Keywords
- GAN
- GPU
- Massively parallel architectures
- Multi-node training
- StyleGAN2