WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors

Yi Wang; Ruili Wang; Xiangjian He; Chi Lin; Tianzhu Wang; Qi Jia; Xin Fan

doi:10.1016/j.patcog.2024.110579

WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors

Yi Wang, Ruili Wang, Xiangjian He, Chi Lin, Tianzhu Wang, Qi Jia, Xin Fan

School of Computer Science

Research output: Journal Publication › Article › peer-review

14 Citations (Scopus)

Abstract

Weakly supervised salient object detection (WSOD) methods endeavor to boost sparse labels to get more salient cues in various ways. Among them, an effective approach is using pseudo labels from multiple unsupervised self-learning methods, but inaccurate and inconsistent pseudo labels could ultimately lead to detection performance degradation. To tackle this problem, we develop a new multi-source WSOD framework, WBNet, that can effectively utilize pseudo-background (non-salient region) labels combined with scribble labels to obtain more accurate salient features. We first design a comprehensive salient pseudo-mask generator from multiple self-learning features. Then, we pioneer the exploration of generating salient pseudo-labels via point-prompted and box-prompted Segment Anything Models (SAM). Then, WBNet leverages a pixel-level Feature Aggregation Module (FAM), a mask-level Transformer-decoder (TFD), and an auxiliary Boundary Prediction Module (EPM) with a hybrid loss function to handle complex saliency detection tasks. Comprehensively evaluated with state-of-the-art methods on five widely used datasets, the proposed method significantly improves saliency detection performance. The code and results are publicly available at https://github.com/yiwangtz/WBNet.

Original language	English
Article number	110579
Journal	Pattern Recognition
Volume	154
DOIs	https://doi.org/10.1016/j.patcog.2024.110579
Publication status	Published - Oct 2024

Keywords

Neural networks
Pseudo labels
Salient object detection
Scribble labels
Transformer
Weakly supervision

ASJC Scopus subject areas

Software
Signal Processing
Computer Vision and Pattern Recognition
Artificial Intelligence

Access to Document

10.1016/j.patcog.2024.110579

Cite this

@article{e2409c23f8324690965b9f5cd6cc1397,

title = "WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors",

abstract = "Weakly supervised salient object detection (WSOD) methods endeavor to boost sparse labels to get more salient cues in various ways. Among them, an effective approach is using pseudo labels from multiple unsupervised self-learning methods, but inaccurate and inconsistent pseudo labels could ultimately lead to detection performance degradation. To tackle this problem, we develop a new multi-source WSOD framework, WBNet, that can effectively utilize pseudo-background (non-salient region) labels combined with scribble labels to obtain more accurate salient features. We first design a comprehensive salient pseudo-mask generator from multiple self-learning features. Then, we pioneer the exploration of generating salient pseudo-labels via point-prompted and box-prompted Segment Anything Models (SAM). Then, WBNet leverages a pixel-level Feature Aggregation Module (FAM), a mask-level Transformer-decoder (TFD), and an auxiliary Boundary Prediction Module (EPM) with a hybrid loss function to handle complex saliency detection tasks. Comprehensively evaluated with state-of-the-art methods on five widely used datasets, the proposed method significantly improves saliency detection performance. The code and results are publicly available at https://github.com/yiwangtz/WBNet.",

keywords = "Neural networks, Pseudo labels, Salient object detection, Scribble labels, Transformer, Weakly supervision",

author = "Yi Wang and Ruili Wang and Xiangjian He and Chi Lin and Tianzhu Wang and Qi Jia and Xin Fan",

note = "Publisher Copyright: {\textcopyright} 2024 The Author(s)",

year = "2024",

month = oct,

doi = "10.1016/j.patcog.2024.110579",

language = "English",

volume = "154",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - WBNet

T2 - Weakly-supervised salient object detection via scribble and pseudo-background priors

AU - Wang, Yi

AU - Wang, Ruili

AU - He, Xiangjian

AU - Lin, Chi

AU - Wang, Tianzhu

AU - Jia, Qi

AU - Fan, Xin

PY - 2024/10

Y1 - 2024/10

N2 - Weakly supervised salient object detection (WSOD) methods endeavor to boost sparse labels to get more salient cues in various ways. Among them, an effective approach is using pseudo labels from multiple unsupervised self-learning methods, but inaccurate and inconsistent pseudo labels could ultimately lead to detection performance degradation. To tackle this problem, we develop a new multi-source WSOD framework, WBNet, that can effectively utilize pseudo-background (non-salient region) labels combined with scribble labels to obtain more accurate salient features. We first design a comprehensive salient pseudo-mask generator from multiple self-learning features. Then, we pioneer the exploration of generating salient pseudo-labels via point-prompted and box-prompted Segment Anything Models (SAM). Then, WBNet leverages a pixel-level Feature Aggregation Module (FAM), a mask-level Transformer-decoder (TFD), and an auxiliary Boundary Prediction Module (EPM) with a hybrid loss function to handle complex saliency detection tasks. Comprehensively evaluated with state-of-the-art methods on five widely used datasets, the proposed method significantly improves saliency detection performance. The code and results are publicly available at https://github.com/yiwangtz/WBNet.

AB - Weakly supervised salient object detection (WSOD) methods endeavor to boost sparse labels to get more salient cues in various ways. Among them, an effective approach is using pseudo labels from multiple unsupervised self-learning methods, but inaccurate and inconsistent pseudo labels could ultimately lead to detection performance degradation. To tackle this problem, we develop a new multi-source WSOD framework, WBNet, that can effectively utilize pseudo-background (non-salient region) labels combined with scribble labels to obtain more accurate salient features. We first design a comprehensive salient pseudo-mask generator from multiple self-learning features. Then, we pioneer the exploration of generating salient pseudo-labels via point-prompted and box-prompted Segment Anything Models (SAM). Then, WBNet leverages a pixel-level Feature Aggregation Module (FAM), a mask-level Transformer-decoder (TFD), and an auxiliary Boundary Prediction Module (EPM) with a hybrid loss function to handle complex saliency detection tasks. Comprehensively evaluated with state-of-the-art methods on five widely used datasets, the proposed method significantly improves saliency detection performance. The code and results are publicly available at https://github.com/yiwangtz/WBNet.

KW - Neural networks

KW - Pseudo labels

KW - Salient object detection

KW - Scribble labels

KW - Transformer

KW - Weakly supervision

UR - http://www.scopus.com/inward/record.url?scp=85193683006&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2024.110579

DO - 10.1016/j.patcog.2024.110579

M3 - Article

AN - SCOPUS:85193683006

SN - 0031-3203

VL - 154

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 110579

ER -

WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this