TY - JOUR
T1 - Bilateral Reference for High-Resolution Dichotomous Image Segmentation
AU - Zheng, Peng
AU - Gao, Dehong
AU - Fan, Deng-Ping
AU - Liu, Li
AU - Laaksonen, Jorma
AU - Ouyang, Wanli
AU - Sebe, Nicu
PY - 2024/8/22
Y1 - 2024/8/22
N2 - We introduce a novel bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS). It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef). LM aids in object localization using global semantic information. Within the RM, we utilize BiRef for the reconstruction process, where hierarchical patches of images provide the source reference, and gradient maps serve as the target reference. These components collaborate to generate the final predicted maps. We also introduce auxiliary gradient supervision to enhance the focus on regions with finer details. In addition, we outline practical training strategies tailored for DIS to improve map quality and the training process. To validate the general applicability of our approach, we conduct extensive experiments on four tasks to evince that BiRefNet exhibits remarkable performance, outperforming task-specific cutting-edge methods across all benchmarks. Our codes are publicly available at https://github.com/ZhengPeng7/BiRefNet.
AB - We introduce a novel bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS). It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef). LM aids in object localization using global semantic information. Within the RM, we utilize BiRef for the reconstruction process, where hierarchical patches of images provide the source reference, and gradient maps serve as the target reference. These components collaborate to generate the final predicted maps. We also introduce auxiliary gradient supervision to enhance the focus on regions with finer details. In addition, we outline practical training strategies tailored for DIS to improve map quality and the training process. To validate the general applicability of our approach, we conduct extensive experiments on four tasks to evince that BiRefNet exhibits remarkable performance, outperforming task-specific cutting-edge methods across all benchmarks. Our codes are publicly available at https://github.com/ZhengPeng7/BiRefNet.
KW - dichotomous image segmentation
KW - camouflaged object detection
KW - salient object detection
KW - bilateral reference
KW - high-resolution segmentation
U2 - 10.26599/AIR.2024.9150038
DO - 10.26599/AIR.2024.9150038
M3 - Article
SN - 2097-194X
VL - 3
SP - 1
EP - 12
JO - CAAI Artificial Intelligence Research
JF - CAAI Artificial Intelligence Research
M1 - 9150038
ER -