|
|
|
| Small-target water-floating garbage detection based on edge perception and cross-scale feature enhancement |
Baijing WU( ),Guanghui YAN*( ),Long MA,Wenxin CHENG,Yaning HUANG |
| School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China |
|
|
|
Abstract A new method for small-target water-floating garbage detection was proposed to address the issue of missed and false detections due to the limited information of small-target water-floating garbage that is prone to losing detailed features in feature extraction. After analyzing the limitations of small-target feature extraction, an edge-enhanced feature extraction network which synergistically utilized the spatial features and high-frequency features was proposed to effectively enhance the complex edges, details, and high-frequency information of small targets. A triple-branch cross-scale feature adaptive fusion module was designed to enhance the network’s ability to represent multi-dimensional small-target features by analyzing small-target features through local detail perception, global context modeling, and the large receptive field. An adaptive sparse attention-based intra-scale feature interaction module was constructed, which dynamically adjusted the interactive features by leveraging sparsity to enhance the discriminability between small targets and backgrounds. Experimental results show that, compared with the baseline model RT-DETR, the proposed method achieves improvements of 4.93, 2.46, and 3.18 percentage points in mAP, mmAP, and recall rate (R) respectively on the water-floating garbage dataset from the Lanzhou section of the Yellow River, and achieves improvements of 3.39, 1.45, and 2.23 percentage points on the FloW-Img dataset. These results indicate that the proposed method effectively enhances the detection performance for small-target water-floating garbage, thereby facilitating the efficient monitoring and management of water-floating garbage.
|
|
Received: 15 April 2025
Published: 23 May 2026
|
|
|
| Fund: 国家自然科学基金资助项目(62466032, 62366028, 62062049);甘肃省自然科学基金资助项目(24JRRA256);甘肃省水利厅省级项目(LZJT523029). |
|
Corresponding Authors:
Guanghui YAN
E-mail: 1420716156@qq.com;yanguanghui@mail.lzjtu.cn
|
边缘感知和跨尺度特征增强的小目标水漂垃圾检测
针对小目标水漂垃圾所含信息有限、在特征提取时容易丢失细节特征而造成漏检、错检的问题,提出基于边缘感知和跨尺度特征增强的小目标水漂垃圾检测方法. 在分析小目标特征提取的局限性后,提出边缘增强的特征提取网络,协同利用空间特征和高频特征,有效增强小目标复杂边缘、细节和高频信息;设计三分支跨尺度特征自适应融合模块,通过局部细节感知、全局上下文建模和大感受野解析小目标特征,提升网络对多维小目标特征的表征能力;构建基于自适应稀疏注意力的尺度内特征交互模块,利用稀疏性动态调整交互特征,强化小目标与背景的区分程度. 实验结果表明,相较于基准模型RT-DETR,所提方法的mAP、mmAP和召回率R在黄河兰州段水漂垃圾数据集上分别提升了4.93、2.46和3.18个百分点,在FloW-Img数据集上分别提升了3.39、1.45和2.23个百分点,表明所提方法能够有效提升对小目标水漂垃圾的检测性能,助力水漂垃圾的高效监测与治理.
关键词:
目标检测,
水漂垃圾,
小目标,
RT-DETR,
边缘感知,
跨尺度特征增强
|
|
| [1] |
AI P, MA L, WU B LI-DWT- and PD-FC-MSPCNN-based small-target localization method for floating garbage on water surfaces[J]. Water, 2023, 15 (12): 2302
doi: 10.3390/w15122302
|
|
|
| [2] |
JIANG Z, WU B, MA L, et al APM-YOLOv7 for small-target water-floating garbage detection based on multi-scale feature adaptive weighted fusion[J]. Sensors, 2024, 24 (1): 50
|
|
|
| [3] |
LI N, WANG M, YANG G, et al DENS-YOLOv6: a small object detection model for garbage detection on water surface[J]. Multimedia Tools and Applications, 2024, 83 (18): 55751- 55771
|
|
|
| [4] |
盘姿君, 王建华, 郑翔, 等 水面垃圾清理机器人结构及自主控制研究综述[J]. 计算机工程与应用, 2024, 60 (11): 17- 31 PAN Zijun, WANG Jianhua, ZHENG Xiang, et al Review of research on structure and autonomous control of water surface garbage cleaning robots[J]. Computer Engineering and Applications, 2024, 60 (11): 17- 31
|
|
|
| [5] |
陈清江, 李璐 基于改进YOLOv8n的遥感小目标检测算法[J]. 激光与光电子学进展, 2025, 62 (16): 368- 379 CHEN Qingjiang, LI Lu Small target detection algorithm based on improved YOLOv8n for remote sensing[J]. Laser & Optoelectronics Progress, 2025, 62 (16): 368- 379
|
|
|
| [6] |
李金沛, 孟晓林, 胡亮亮, 等 基于改进YOLOv8的桥梁小目标裂缝检测[J]. 清华大学学报: 自然科学版, 2025, 65 (7): 1260- 1271 LI Jinpei, MENG Xiaolin, HU Liangliang, et al Bridge small target crack detection based on improved YOLOv8[J]. Journal of Tsinghua University: Science and Technology, 2025, 65 (7): 1260- 1271
|
|
|
| [7] |
王浚银, 文斌, 沈艳军, 等 基于改进YOLOv7-tiny的铝型材表面缺陷检测方法[J]. 浙江大学学报: 工学版, 2025, 59 (3): 523- 534 WANG Junyin, WEN Bin, SHEN Yanjun, et al Surface defect detection method for aluminum profiles based on improved YOLOv7-tiny[J]. Journal of Zhejiang University: Engineering Science, 2025, 59 (3): 523- 534
|
|
|
| [8] |
SELVAM P, SUNDARI P S, TAMILSELVI M, et al YOLO-SAIL: attention-enhanced YOLOv5 with optimized Bi-FPN for ship target detection in SAR images[J]. IEEE Access, 2025, 13: 29523- 29540
doi: 10.1109/ACCESS.2025.3536621
|
|
|
| [9] |
BI J, LI K, ZHENG X, et al SPDC-YOLO: an efficient small target detection network based on improved YOLOv8 for drone aerial image[J]. Remote Sensing, 2025, 17 (4): 685
doi: 10.3390/rs17040685
|
|
|
| [10] |
HAN B, HE L, KE J, et al Weighted parallel decoupled feature pyramid network for object detection[J]. Neurocomputing, 2024, 593: 127809
doi: 10.1016/j.neucom.2024.127809
|
|
|
| [11] |
MA P, HE X, CHEN Y, et al ISOD: improved small object detection based on extended scale feature pyramid network[J]. The Visual Computer, 2025, 41 (1): 465- 479
doi: 10.1007/s00371-024-03341-2
|
|
|
| [12] |
蒋占军, 吴佰靖, 马龙, 等 多尺度特征和极化自注意力的Faster-RCNN水漂垃圾识别[J]. 计算机应用, 2024, 44 (3): 938- 944 JIANG Zhanjun, WU Baijing, MA Long, et al Faster-RCNN water-floating garbage recognition based on multi-scale feature and polarized self-attention[J]. Journal of Computer Applications, 2024, 44 (3): 938- 944
|
|
|
| [13] |
杜丁健, 高遵海, 陈倬 基于改进YOLOv5的枸杞虫害检测[J]. 浙江大学学报: 工学版, 2024, 58 (10): 1992- 2000 DU Dingjian, GAO Zunhai, CHEN Zhuo Wolfberry pest detection based on improved YOLOv5[J]. Journal of Zhejiang University: Engineering Science, 2024, 58 (10): 1992- 2000
|
|
|
| [14] |
LIU Y B, HUANG H Y, ZENG Y H DC-Net: a dual-channel and cross-scale feature fusion infrared small target detection network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 4708809
|
|
|
| [15] |
CHANG J, DAI H, ZHENG Y. CAG-FPN: channel self-attention guided feature pyramid network for object detection [C]// Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Seoul: IEEE, 2024: 9616–9620.
|
|
|
| [16] |
XU S, CHEN X, LI H, et al Airborne small target detection method based on multimodal and adaptive feature fusion[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 5637215
|
|
|
| [17] |
XU X, ZHAN W, JIANG Y, et al Infrared small target detection based on weak feature enhancement and target adaptive proliferation[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 18: 2829- 2850
|
|
|
| [18] |
ZHANG X, ZUO G Small target detection in UAV view based on improved YOLOv8 algorithm[J]. Scientific Reports, 2025, 15: 421
doi: 10.1038/s41598-024-84747-9
|
|
|
| [19] |
尹向雷, 屈少鹏, 解永芳, 等 基于渐进特征融合及多尺度空洞注意力的遮挡鸟巢检测[J]. 浙江大学学报: 工学版, 2025, 59 (3): 535- 545 YIN Xianglei, QU Shaopeng, XIE Yongfang, et al Occluded bird nest detection based on asymptotic feature fusion and multi-scale dilated attention[J]. Journal of Zhejiang University: Engineering Science, 2025, 59 (3): 535- 545
|
|
|
| [20] |
宋耀莲, 王粲, 李大焱, 等 基于改进YOLOv5s的无人机小目标检测算法[J]. 浙江大学学报: 工学版, 2024, 58 (12): 2417- 2426 SONG Yaolian, WANG Can, LI Dayan, et al UAV small target detection algorithm based on improved YOLOv5s[J]. Journal of Zhejiang University: Engineering Science, 2024, 58 (12): 2417- 2426
|
|
|
| [21] |
熊干, 陈慈发, 张上 QMDF-YOLO11: 复杂场景下水稻害虫检测算法[J]. 计算机工程与应用, 2025, 61 (13): 113- 123 XIONG Gan, CHEN Cifa, ZHANG Shang QMDF: YOLO11: rice pests detection algorithm in complex scenarios[J]. Computer Engineering and Applications, 2025, 61 (13): 113- 123
|
|
|
| [22] |
ZHAO Y, LV W, XU S, et al. DETRs beat YOLOs on real-time object detection [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2024: 16965–16974.
|
|
|
| [23] |
CHENG Y, ZHU J, JIANG M, et al. FloW: a dataset and benchmark for floating waste detection in inland waters [C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. Montreal: IEEE, 2021: 10933–10942.
|
|
|
| [24] |
CUI Y, REN W, CAO X, et al. Focal network for image restoration [C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. Paris: IEEE, 2023: 12955–12965.
|
|
|
| [25] |
WAN W, WANG L, WANG B, et al Space to depth convolution bundled with coordinate attention for detecting surface defects[J]. Signal, Image and Video Processing, 2024, 18 (5): 4861- 4874
doi: 10.1007/s11760-024-03122-3
|
|
|
| [26] |
CUI Y, REN W, KNOLL A Omni-kernel network for image restoration[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38 (2): 1426- 1434
doi: 10.1609/aaai.v38i2.27907
|
|
|
| [27] |
ZHOU S, CHEN D, PAN J, et al. Adapt or perish: adaptive sparse transformer with attentive feature refinement for image restoration [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2024: 2952–2963.
|
|
|
| [28] |
LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context [C]// Proceedings of the European Conference on Computer Vision. Zurich: Springer, 2014: 740–755.
|
|
|
| [29] |
KOU R, WANG C, PENG Z, et al Infrared small target segmentation networks: a survey[J]. Pattern Recognition, 2023, 143: 109788
doi: 10.1016/j.patcog.2023.109788
|
|
|
| [30] |
ZHANG Y, YE M, ZHU G, et al FFCA-YOLO for small object detection in remote sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 5611215
|
|
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
| |
Shared |
|
|
|
|
| |
Discussed |
|
|
|
|