Efficient NeRF Optimization - Not All Samples Remain Equally Hard

Juuso Korhonen*, Goutham Rangu, Hamed R. Tavakoli, Juho Kannala

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

Abstract

We propose an application of online hard sample mining for efficient training of Neural Radiance Fields (NeRF). NeRF models produce state-of-the-art quality for many 3D reconstruction and rendering tasks but require substantial computational resources. The encoding of the scene information within the NeRF network parameters necessitates stochastic sampling. We observe that during the training, a major part of the compute time and memory usage is spent on processing already learnt samples, which no longer affect the model update significantly. We identify the backward pass on the stochastic samples as the computational bottleneck during the optimization. We thus perform the first forward pass in inference mode as a relatively low-cost search for hard samples. This is followed by building the computational graph and updating the NeRF network parameters using only the hard samples. To demonstrate the effectiveness of the proposed approach, we apply our method to Instant-NGP, resulting in significant improvements of the view-synthesis quality over the baseline (1 dB improvement on average per training time, or 2x speedup to reach the same PSNR level) along with 40% memory savings coming from using only the hard samples to build the computational graph. As our method only interfaces with the network module, we expect it to be widely applicable.
Original languageEnglish
Title of host publicationComputer Vision – ECCV 2024
Subtitle of host publication18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XXXVI
EditorsAleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
PublisherSpringer
Pages198-213
ISBN (Electronic)978-3-031-72764-1
ISBN (Print)978-3-031-72763-4
DOIs
Publication statusE-pub ahead of print - 25 Oct 2024
MoE publication typeA4 Conference publication
EventEuropean Conference on Computer Vision - Milano, Italy
Duration: 29 Sept 20244 Oct 2024
Conference number: 18

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume15094
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceEuropean Conference on Computer Vision
Abbreviated titleECCV
Country/TerritoryItaly
CityMilano
Period29/09/202404/10/2024

Fingerprint

Dive into the research topics of 'Efficient NeRF Optimization - Not All Samples Remain Equally Hard'. Together they form a unique fingerprint.

Cite this