Please wait a minute...
浙江大学学报(工学版)  2026, Vol. 60 Issue (7): 1452-1463    DOI: 10.3785/j.issn.1008-973X.2026.07.008
计算机与控制工程     
边缘感知和跨尺度特征增强的小目标水漂垃圾检测
吴佰靖(),闫光辉*(),马龙,程文鑫,黄亚宁
兰州交通大学 电子与信息工程学院,甘肃 兰州 730070
Small-target water-floating garbage detection based on edge perception and cross-scale feature enhancement
Baijing WU(),Guanghui YAN*(),Long MA,Wenxin CHENG,Yaning HUANG
School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
 全文: PDF(5452 KB)   HTML
摘要:

针对小目标水漂垃圾所含信息有限、在特征提取时容易丢失细节特征而造成漏检、错检的问题,提出基于边缘感知和跨尺度特征增强的小目标水漂垃圾检测方法. 在分析小目标特征提取的局限性后,提出边缘增强的特征提取网络,协同利用空间特征和高频特征,有效增强小目标复杂边缘、细节和高频信息;设计三分支跨尺度特征自适应融合模块,通过局部细节感知、全局上下文建模和大感受野解析小目标特征,提升网络对多维小目标特征的表征能力;构建基于自适应稀疏注意力的尺度内特征交互模块,利用稀疏性动态调整交互特征,强化小目标与背景的区分程度. 实验结果表明,相较于基准模型RT-DETR,所提方法的mAP、mmAP和召回率R在黄河兰州段水漂垃圾数据集上分别提升了4.93、2.46和3.18个百分点,在FloW-Img数据集上分别提升了3.39、1.45和2.23个百分点,表明所提方法能够有效提升对小目标水漂垃圾的检测性能,助力水漂垃圾的高效监测与治理.

关键词: 目标检测水漂垃圾小目标RT-DETR边缘感知跨尺度特征增强    
Abstract:

A new method for small-target water-floating garbage detection was proposed to address the issue of missed and false detections due to the limited information of small-target water-floating garbage that is prone to losing detailed features in feature extraction. After analyzing the limitations of small-target feature extraction, an edge-enhanced feature extraction network which synergistically utilized the spatial features and high-frequency features was proposed to effectively enhance the complex edges, details, and high-frequency information of small targets. A triple-branch cross-scale feature adaptive fusion module was designed to enhance the network’s ability to represent multi-dimensional small-target features by analyzing small-target features through local detail perception, global context modeling, and the large receptive field. An adaptive sparse attention-based intra-scale feature interaction module was constructed, which dynamically adjusted the interactive features by leveraging sparsity to enhance the discriminability between small targets and backgrounds. Experimental results show that, compared with the baseline model RT-DETR, the proposed method achieves improvements of 4.93, 2.46, and 3.18 percentage points in mAP, mmAP, and recall rate (R) respectively on the water-floating garbage dataset from the Lanzhou section of the Yellow River, and achieves improvements of 3.39, 1.45, and 2.23 percentage points on the FloW-Img dataset. These results indicate that the proposed method effectively enhances the detection performance for small-target water-floating garbage, thereby facilitating the efficient monitoring and management of water-floating garbage.

Key words: object detection    water-floating garbage    small target    RT-DETR    edge perception    cross-scale feature enhancement
收稿日期: 2025-04-15 出版日期: 2026-05-23
CLC:  TP 391.4  
基金资助: 国家自然科学基金资助项目(62466032, 62366028, 62062049);甘肃省自然科学基金资助项目(24JRRA256);甘肃省水利厅省级项目(LZJT523029).
通讯作者: 闫光辉     E-mail: 1420716156@qq.com;yanguanghui@mail.lzjtu.cn
作者简介: 吴佰靖(1997—),男,博士生,从事人工智能、智慧水利等研究. orcid.org/0009-0006-1844-1666. E-mail:1420716156@qq.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
吴佰靖
闫光辉
马龙
程文鑫
黄亚宁

引用本文:

吴佰靖,闫光辉,马龙,程文鑫,黄亚宁. 边缘感知和跨尺度特征增强的小目标水漂垃圾检测[J]. 浙江大学学报(工学版), 2026, 60(7): 1452-1463.

Baijing WU,Guanghui YAN,Long MA,Wenxin CHENG,Yaning HUANG. Small-target water-floating garbage detection based on edge perception and cross-scale feature enhancement. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1452-1463.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.07.008        https://www.zjujournals.com/eng/CN/Y2026/V60/I7/1452

图 1  复杂背景中小目标水漂垃圾示意图
图 2  不同卷积层的输出特征图
图 3  ECA-RTDETR方法的整体结构图
图 4  多尺度边缘增强模块结构图
图 5  双域选择机制的结构示意图
图 6  跨尺度选择融合模块结构图
图 7  三分支特征融合模块结构图
图 8  三分支网络结构图
图 9  自注意力和自适应稀疏注意力结构图
类别NL
训练集测试集验证集总标签
plastic5 5321 5468457 923
paper2 5247163433 583
glass57015479803
metal1 1243361571 617
fabric/fiber1 9045852812 770
nature1 3504052001 955
others679194109982
总计13 6833 9362 01419 633
表 1  YRLW数据集中各类水漂垃圾的标签数量
图 10  YRLW数据集中各个类别的大、中、小目标分布
数据集类别AP/%
RT-DETRM1M2M3
YRLWplastic92.4494.2995.7395.39
paper92.5193.7194.2794.34
glass88.3992.1893.6193.67
metal88.0591.7493.5493.18
fabric/fiber91.6694.3295.0195.10
nature91.8793.6494.8794.90
others72.2782.2783.3385.13
mAP88.1791.7492.9193.10
FloW-Imgbottle83.8785.4686.7387.26
表 2  不同改进策略下各类水漂垃圾检测的AP指标结果
数据集方法mAP/%mmAP/%R/%Para/MFLOPs/GFPS/(帧·s?1)
YRLWRT-DETR88.1747.0188.7420.0958.34164.47
M191.7448.5390.0715.9550.91184.24
M292.9149.1591.3617.4059.47159.19
M393.1049.4791.9218.1560.01157.70
FloW-ImgRT-DETR83.8740.2481.0520.0958.34164.47
M185.4641.0882.8415.9550.91184.24
M286.7341.4583.0117.4059.47159.19
M387.2641.6983.2818.1560.01157.70
表 3  ECA-RTDETR中不同改进策略的消融实验结果对比
图 11  不同改进策略的消融实验热力图结果
数据集方法mAP/%mmAP/%R/%Para/MFLOPs/GFPS/(帧·s?1)
YRLWYOLOv8m88.4947.8888.6725.9178.91128.67
YOLOv11l89.3848.1985.9125.3786.88111.24
YOLOv12l90.8849.2089.8726.4488.92102.39
RT-DETR88.1747.0188.7420.0958.34164.47
FFCA-YOLO90.5247.8690.397.1251.19185.41
SuperYOLO91.8148.0591.074.8316.61193.53
ECA-RTDETR93.1049.4791.9218.1560.01157.70
FloW-ImgYOLOv8m85.4238.9180.4525.9178.91128.67
YOLOv11l86.8040.8381.9325.3786.88111.24
YOLOv12l86.9441.1782.5926.4488.92102.39
RT-DETR83.8740.2481.0520.0958.34164.47
FFCA-YOLO86.3440.4581.987.1251.19185.41
SuperYOLO86.7940.9782.774.8316.61193.53
ECA-RTDETR87.2641.6983.2818.1560.01157.70
表 4  不同方法的水漂垃圾检测实验结果对比
图 12  不同方法在YRLW数据集上的水漂垃圾检测可视化结果对比
图 13  不同方法在FloW-Img数据集上的水漂垃圾检测可视化结果对比
1 AI P, MA L, WU B LI-DWT- and PD-FC-MSPCNN-based small-target localization method for floating garbage on water surfaces[J]. Water, 2023, 15 (12): 2302
doi: 10.3390/w15122302
2 JIANG Z, WU B, MA L, et al APM-YOLOv7 for small-target water-floating garbage detection based on multi-scale feature adaptive weighted fusion[J]. Sensors, 2024, 24 (1): 50
3 LI N, WANG M, YANG G, et al DENS-YOLOv6: a small object detection model for garbage detection on water surface[J]. Multimedia Tools and Applications, 2024, 83 (18): 55751- 55771
4 盘姿君, 王建华, 郑翔, 等 水面垃圾清理机器人结构及自主控制研究综述[J]. 计算机工程与应用, 2024, 60 (11): 17- 31
PAN Zijun, WANG Jianhua, ZHENG Xiang, et al Review of research on structure and autonomous control of water surface garbage cleaning robots[J]. Computer Engineering and Applications, 2024, 60 (11): 17- 31
5 陈清江, 李璐 基于改进YOLOv8n的遥感小目标检测算法[J]. 激光与光电子学进展, 2025, 62 (16): 368- 379
CHEN Qingjiang, LI Lu Small target detection algorithm based on improved YOLOv8n for remote sensing[J]. Laser & Optoelectronics Progress, 2025, 62 (16): 368- 379
6 李金沛, 孟晓林, 胡亮亮, 等 基于改进YOLOv8的桥梁小目标裂缝检测[J]. 清华大学学报: 自然科学版, 2025, 65 (7): 1260- 1271
LI Jinpei, MENG Xiaolin, HU Liangliang, et al Bridge small target crack detection based on improved YOLOv8[J]. Journal of Tsinghua University: Science and Technology, 2025, 65 (7): 1260- 1271
7 王浚银, 文斌, 沈艳军, 等 基于改进YOLOv7-tiny的铝型材表面缺陷检测方法[J]. 浙江大学学报: 工学版, 2025, 59 (3): 523- 534
WANG Junyin, WEN Bin, SHEN Yanjun, et al Surface defect detection method for aluminum profiles based on improved YOLOv7-tiny[J]. Journal of Zhejiang University: Engineering Science, 2025, 59 (3): 523- 534
8 SELVAM P, SUNDARI P S, TAMILSELVI M, et al YOLO-SAIL: attention-enhanced YOLOv5 with optimized Bi-FPN for ship target detection in SAR images[J]. IEEE Access, 2025, 13: 29523- 29540
doi: 10.1109/ACCESS.2025.3536621
9 BI J, LI K, ZHENG X, et al SPDC-YOLO: an efficient small target detection network based on improved YOLOv8 for drone aerial image[J]. Remote Sensing, 2025, 17 (4): 685
doi: 10.3390/rs17040685
10 HAN B, HE L, KE J, et al Weighted parallel decoupled feature pyramid network for object detection[J]. Neurocomputing, 2024, 593: 127809
doi: 10.1016/j.neucom.2024.127809
11 MA P, HE X, CHEN Y, et al ISOD: improved small object detection based on extended scale feature pyramid network[J]. The Visual Computer, 2025, 41 (1): 465- 479
doi: 10.1007/s00371-024-03341-2
12 蒋占军, 吴佰靖, 马龙, 等 多尺度特征和极化自注意力的Faster-RCNN水漂垃圾识别[J]. 计算机应用, 2024, 44 (3): 938- 944
JIANG Zhanjun, WU Baijing, MA Long, et al Faster-RCNN water-floating garbage recognition based on multi-scale feature and polarized self-attention[J]. Journal of Computer Applications, 2024, 44 (3): 938- 944
13 杜丁健, 高遵海, 陈倬 基于改进YOLOv5的枸杞虫害检测[J]. 浙江大学学报: 工学版, 2024, 58 (10): 1992- 2000
DU Dingjian, GAO Zunhai, CHEN Zhuo Wolfberry pest detection based on improved YOLOv5[J]. Journal of Zhejiang University: Engineering Science, 2024, 58 (10): 1992- 2000
14 LIU Y B, HUANG H Y, ZENG Y H DC-Net: a dual-channel and cross-scale feature fusion infrared small target detection network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 4708809
15 CHANG J, DAI H, ZHENG Y. CAG-FPN: channel self-attention guided feature pyramid network for object detection [C]// Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Seoul: IEEE, 2024: 9616–9620.
16 XU S, CHEN X, LI H, et al Airborne small target detection method based on multimodal and adaptive feature fusion[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 5637215
17 XU X, ZHAN W, JIANG Y, et al Infrared small target detection based on weak feature enhancement and target adaptive proliferation[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 18: 2829- 2850
18 ZHANG X, ZUO G Small target detection in UAV view based on improved YOLOv8 algorithm[J]. Scientific Reports, 2025, 15: 421
doi: 10.1038/s41598-024-84747-9
19 尹向雷, 屈少鹏, 解永芳, 等 基于渐进特征融合及多尺度空洞注意力的遮挡鸟巢检测[J]. 浙江大学学报: 工学版, 2025, 59 (3): 535- 545
YIN Xianglei, QU Shaopeng, XIE Yongfang, et al Occluded bird nest detection based on asymptotic feature fusion and multi-scale dilated attention[J]. Journal of Zhejiang University: Engineering Science, 2025, 59 (3): 535- 545
20 宋耀莲, 王粲, 李大焱, 等 基于改进YOLOv5s的无人机小目标检测算法[J]. 浙江大学学报: 工学版, 2024, 58 (12): 2417- 2426
SONG Yaolian, WANG Can, LI Dayan, et al UAV small target detection algorithm based on improved YOLOv5s[J]. Journal of Zhejiang University: Engineering Science, 2024, 58 (12): 2417- 2426
21 熊干, 陈慈发, 张上 QMDF-YOLO11: 复杂场景下水稻害虫检测算法[J]. 计算机工程与应用, 2025, 61 (13): 113- 123
XIONG Gan, CHEN Cifa, ZHANG Shang QMDF: YOLO11: rice pests detection algorithm in complex scenarios[J]. Computer Engineering and Applications, 2025, 61 (13): 113- 123
22 ZHAO Y, LV W, XU S, et al. DETRs beat YOLOs on real-time object detection [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2024: 16965–16974.
23 CHENG Y, ZHU J, JIANG M, et al. FloW: a dataset and benchmark for floating waste detection in inland waters [C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. Montreal: IEEE, 2021: 10933–10942.
24 CUI Y, REN W, CAO X, et al. Focal network for image restoration [C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. Paris: IEEE, 2023: 12955–12965.
25 WAN W, WANG L, WANG B, et al Space to depth convolution bundled with coordinate attention for detecting surface defects[J]. Signal, Image and Video Processing, 2024, 18 (5): 4861- 4874
doi: 10.1007/s11760-024-03122-3
26 CUI Y, REN W, KNOLL A Omni-kernel network for image restoration[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38 (2): 1426- 1434
doi: 10.1609/aaai.v38i2.27907
27 ZHOU S, CHEN D, PAN J, et al. Adapt or perish: adaptive sparse transformer with attentive feature refinement for image restoration [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2024: 2952–2963.
28 LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context [C]// Proceedings of the European Conference on Computer Vision. Zurich: Springer, 2014: 740–755.
29 KOU R, WANG C, PENG Z, et al Infrared small target segmentation networks: a survey[J]. Pattern Recognition, 2023, 143: 109788
doi: 10.1016/j.patcog.2023.109788
30 ZHANG Y, YE M, ZHU G, et al FFCA-YOLO for small object detection in remote sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 5611215
[1] 董绍江,肖涛,吕振鸣,夏浩然,罗家元,孙世政,张霞,刘超. 基于改进RT-DETR的水下色偏环境中小型生物检测[J]. 浙江大学学报(工学版), 2026, 60(7): 1404-1415.
[2] 王平,徐安之,赵洪黎,魏小源,杨富龙. 融合流体自动标注与轻量化YOLOv8n的泥石流实时检测方法[J]. 浙江大学学报(工学版), 2026, 60(7): 1416-1426.
[3] 于天河,王文龙,刘镛,杨壮壮,侯善冲. 改进的有雾图像中被遮挡车辆及行人识别算法[J]. 浙江大学学报(工学版), 2026, 60(4): 738-750.
[4] 马龙,候永琪,吴佰靖,高丽,邓建伟,闫光辉. 多尺度图卷积下的水漂垃圾轨迹预测模型[J]. 浙江大学学报(工学版), 2026, 60(4): 751-762.
[5] 宋耀莲,彭驰,唐菁敏,赵宣植,虞贵财. 基于融合注意力机制的光学遥感图像小目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(4): 763-771.
[6] 刘慧,王防修,王意,黄淄博,苏晨. 轻量级改进RT-DETR的葡萄叶片病害检测算法[J]. 浙江大学学报(工学版), 2026, 60(3): 604-613.
[7] 孟昱煜,孔垂乐,火久元,武泽宇. 重构YOLOv11的无人机小目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(2): 303-312.
[8] 肖剑,何昕泽,程鸿亮,杨小苑,胡欣. 基于多尺度特征增强的航拍小目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(1): 19-31.
[9] 包晓安,彭书友,张娜,涂小妹,张庆琪,吴彪. 基于多方位感知深度融合检测头的目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(1): 32-42.
[10] 董超群,汪战,廖平,谢帅,荣玉杰,周靖淞. 轻量化YOLOv5s-OCG的轨枕裂纹检测算法[J]. 浙江大学学报(工学版), 2025, 59(9): 1838-1845.
[11] 翟亚红,陈雅玲,徐龙艳,龚玉. 改进YOLOv8s的轻量级无人机航拍小目标检测算法[J]. 浙江大学学报(工学版), 2025, 59(8): 1708-1717.
[12] 付家瑞,李兆飞,周豪,黄惟. 基于Convnextv2与纹理边缘引导的伪装目标检测[J]. 浙江大学学报(工学版), 2025, 59(8): 1718-1726.
[13] 梁耕良,韩曙光. 基于改进RT-DETR的牛仔面料疵点检测算法[J]. 浙江大学学报(工学版), 2025, 59(6): 1169-1178.
[14] 徐慧智,王秀青. 基于车辆图像特征的前车距离与速度感知[J]. 浙江大学学报(工学版), 2025, 59(6): 1219-1232.
[15] 吴晓佳,杨金龙,赵豪豪. 优化多核相关滤波的弱小目标检测前跟踪算法[J]. 浙江大学学报(工学版), 2025, 59(5): 947-955.