Abstract
With the continuous exploration of marine resources, underwater artificial intelligent robots play an increasingly important role in the fish industry. However, the detection of underwater objects is a very challenging problem due to the irregular movement of underwater objects, the occlusion of sand and rocks, the diversity of water illumination, and the poor visibility and low color contrast in the underwater environment. In this article, we first propose a real-world underwater object detection dataset (UODD), which covers more than 3K images of the most common aquatic products. Then we propose Channel Sharpening Attention Module (CSAM) as a plug-and-play module to further fuse high-level image information, providing the network with the privilege of selecting feature maps. Fusion of original images through CSAM can improve the accuracy of detecting small and medium objects, thereby improving the overall detection accuracy. We also use Water-Net as a preprocessing method to remove the haze and color cast in complex underwater scenes, which shows a satisfactory detection result on small-sized objects. In addition, we use the class weighted loss as the training loss, which can accurately describe the relationship between classification and precision of bounding boxes of targets, and the loss function converges faster during the training process. Experimental results show that the proposed method reaches a maximum AP of 50.1%, outperforming other traditional and state-of-the-art detectors. In addition, our model only needs an average inference time of 25.4 ms per image, which is quite fast and might suit the real-time scenario.
| Original language | English |
|---|---|
| Title of host publication | MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia |
| Publisher | Association for Computing Machinery, Inc |
| Pages | 4259-4267 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781450386517 |
| DOIs | |
| Publication status | Published - 17 Oct 2021 |
| Externally published | Yes |
| Event | 29th ACM International Conference on Multimedia, MM 2021 - Virtual, Online, China Duration: 20 Oct 2021 → 24 Oct 2021 |
Publication series
| Name | MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia |
|---|
Conference
| Conference | 29th ACM International Conference on Multimedia, MM 2021 |
|---|---|
| Country/Territory | China |
| City | Virtual, Online |
| Period | 20/10/21 → 24/10/21 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 14 Life Below Water
Free Keywords
- attention
- classification
- image enhancement
- object detection
- underwater
ASJC Scopus subject areas
- Human-Computer Interaction
- Software
- Computer Graphics and Computer-Aided Design
Fingerprint
Dive into the research topics of 'Underwater Species Detection using Channel Sharpening Attention'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver