Please wait a minute...
浙江大学学报(工学版)  2026, Vol. 60 Issue (7): 1416-1426    DOI: 10.3785/j.issn.1008-973X.2026.07.005
计算机与控制工程     
融合流体自动标注与轻量化YOLOv8n的泥石流实时检测方法
王平1,2(),徐安之1,赵洪黎1,魏小源1,杨富龙1
1. 兰州理工大学 微电子现代产业学院,甘肃 兰州 730050
2. 甘肃省科学院 自动化研究所,甘肃 兰州 730000
Real-time debris flow detection method combining fluid automatic annotation and lightweight YOLOv8n
Ping WANG1,2(),Anzhi XU1,Hongli ZHAO1,Xiaoyuan WEI1,Fulong YANG1
1. School of Microelectronics Industry-education Integration, Lanzhou University of Technology, Lanzhou 730050, China
2. Institute of Automation, Gansu Academy of Sciences, Lanzhou 730000, China
 全文: PDF(5551 KB)   HTML
摘要:

针对泥石流灾害实时检测中数据标注效率低和模型适应性不足问题,提出融合流体目标自动标注与轻量化YOLOv8n的泥石流检测框架. 基于流体运动连续的特性,设计多尺度动态采集框,通过自适应视频帧截取与改进的二分类筛选模型,构建包含8 064张图像的开源泥石流数据集. 在模型架构上,设计LGSA模块与C2f_GhostNetV2结构,并引入Shape-IoU损失函数,增强模型对泥石流动态特征的提取能力和不规则边界的定位精度. 实验结果表明,提出的YOLOv8-Mudslide模型的mAP@0.5指标达到86.7%,较基线模型提升了4.7%,检测速度达到230.89 帧/s,为泥石流灾害的实时监测提供了可靠的技术支持.

关键词: 自动标注流体目标目标检测泥石流灾害预警    
Abstract:

A novel detection framework integrating automatic fluid target annotation and a lightweight YOLOv8n model was proposed, aiming at addressing the challenges of low annotation efficiency and insufficient model adaptability in real-time debris flow monitoring. An open-source debris flow dataset of 8 064 images was built. This dataset was achieved using multi-scale dynamic capture frames, which leverage fluid motion continuity, along with adaptive frame extraction and an improved binary classification model. To enhance the extraction of dynamic debris flow features and improve the irregular boundary localization, the LGSA module and C2f_GhostNetV2 structure were developed. The Shape-IoU loss function was also introduced into the model architecture. Experimental results demonstrated that the proposed YOLOv8-Mudslide model achieved an mAP@0.5 of 86.7%, which was 4.7% higher than that of the baseline model, and the detection speed reached 230.89 frame/s. This case provides reliable technical support for the real-time monitoring of debris flow disasters, and its framework can be further extended to other intelligent detection fields of fluid targets.

Key words: automatic annotation    fluid target    object detection    debris flow    disaster early warning
收稿日期: 2025-06-24 出版日期: 2026-05-23
CLC:  TP 391.4  
基金资助: 国家自然科学基金资助项目(62361039, 62001198, 62173170);中国博士后科学基金资助项目(2024MD763938);甘肃省青年科技基金资助项目(24JRRA1146);甘肃省联合科研基金资助项目(25JRRA1153, 24JRRA829).
作者简介: 王平(1989—),男,副教授,从事计算机视觉及智能感知研究. orcid.org/0000-0003-2358-0020. E-mail:pingwangsky@163.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
王平
徐安之
赵洪黎
魏小源
杨富龙

引用本文:

王平,徐安之,赵洪黎,魏小源,杨富龙. 融合流体自动标注与轻量化YOLOv8n的泥石流实时检测方法[J]. 浙江大学学报(工学版), 2026, 60(7): 1416-1426.

Ping WANG,Anzhi XU,Hongli ZHAO,Xiaoyuan WEI,Fulong YANG. Real-time debris flow detection method combining fluid automatic annotation and lightweight YOLOv8n. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1416-1426.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.07.005        https://www.zjujournals.com/eng/CN/Y2026/V60/I7/1416

图 1  不同目标跟踪算法对比
图 2  流体目标自动标注流程
图 3  数据采集过程
图 4  正负样本的分类
图 5  二分类筛选模型结构
图 6  正样本筛选流程
STFH
1100100[0,0.5)
2200300[0,0.6)
3300400[0,0.7)
4400500[0,0.8)
5500600[0,0.9)
表 1  二分类筛选模型参数设置
图 7  改进模型的网络结构图
图 8  LGSA结构图
图 9  改进前和改进后的C2f网络结构图
图 10  GhostNetV2网络结构图
图 11  不同位置下锚框回归过程
图 12  预测框与真实框的关系
参数含义数值
FG_THRESH正样本IoU阈值[0.9,1.0]
BG_THRESH_LO负样本IoU阈值表1设定
NMS_THRESH非极大值抑制阈值0.9
BATCHSIZE图片目标批次大小256
NUM_SAMPLES总样本数量153 667
表 2  基准参数设置
图 13  相同数据集下的不同筛选条件结果图
图 14  采用自主标注方法的部分标注结果
注意力机制FLOPs/GParams/MmAP@0.5/%
YOLOv8n8.23.082.0
+CBAM8.33.082.7
+SE8.23.081.8
+SimAM8.23.081.3
+EMA8.33.082.5
+LGSA8.33.083.3
表 3  不同注意力机制对比实验结果
LGSAC2f_GhostNetV2Shape-IoUmAP@0.5/%FLOPs/GParams/MFPS/(帧·s?1)
82.08.23.0223.63
83.38.33.0217.92
82.57.32.6231.41
85.88.33.0216.97
84.37.22.6235.68
86.77.32.6230.89
表 4  消融对比实验结果
图 15  模型改进前后的泥石流检测热力图
模型P/%R/%mAP@0.5/%Params/MFLOPs/GFPS/(帧·s?1)
Faster R-CNN73.582.884.05108.2163.828.28
SSD83.281.286.591.6135.245.65
YOLOv5n74.183.380.22.57.1233.51
YOLOv8n71.280.782.03.18.2223.63
YOLOv8m71.380.684.225.879.192.64
YOLOv10n73.382.382.52.36.7214.32
RefineDet[17]70.276.878.22610.256.27
RT-DETR-R18[18]78.381.483.819.856.9113.64
YOLOv8-Mudslide72.382.886.72.67.3230.89
表 5  相同数据集下的不同模型对比实验结果
12 ZHANG Ye, XU Ting, FENG Dingzhong, et al Research on faster RCNN object detection based on hard example mining[J]. Journal of Electronics and Information Technology, 2019, 41 (6): 1496- 1502
13 TANG Y, HAN K, GUO J, et al GhostNetv2: Enhance cheap operation with long-range attention[J]. Advances in Neural Information Processing Systems, 2022, 35: 9969- 9982
14 HAN K, WANG Y, GUO J, et al. ParameterNet: Parameters are all you need for large-scale visual pretraining of mobile networks [C]// 2024 IEEE Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2024: 15751–15761.
15 ZHANG H, ZHANG S. Shape-iou: more accurate metric considering bounding box shape and scale [EB/OL]. [2025-06-01]. https://arxiv.org/pdf/2312.17663.pdf.
16 CHEN B, LIU Y, ZHANG Z, et al Transattunet: multi-level attention-guided u-net with transformer for medical image segmentation[J]. IEEE Transactions on Emerging Topics in Computational Intelligence, 2023, 8 (1): 55- 68
17 ZHANG S, WEN L, BIAN X, et al. Single-shot refinement neural network for object detection [C]// 2018 IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 4203–4212.
1 CABRAL V, REIS F, VELOSO V, et al The consequences of debris flows in Brazil: a historical analysis based on recorded events in the last 100 years[J]. Landslides, 2023, 20 (3): 511- 529
doi: 10.1007/s10346-022-01984-7
2 DU Y, LIU H, LI H, et al Exploring the initiating mechanism, monitoring equipment and warning indicators of gully-type debris flow for disaster reduction: a review[J]. Natural Hazards, 2024, 120 (15): 13667- 13692
doi: 10.1007/s11069-024-06742-7
3 PHAM M V, KIM Y T Debris flow detection and velocity estimation using deep convolutional neural network and image processing[J]. Landslides, 2022, 19 (10): 2473- 2488
doi: 10.1007/s10346-022-01931-6
4 何明杰, 刘德方, 张猛, 等 融合YOLOX和ASFF的高原山地灾害检测模型[J]. 防灾减灾工程学报, 2023, 43 (6): 1215- 1223
HE Mingjie, LIU Defang, ZHANG Meng, et al A plateau mountain disaster detection model by integrating YOLOX and ASFF[J]. Journal of Disaster Prevention and Mitigation Engineering, 2023, 43 (6): 1215- 1223
doi: 10.13409/j.cnki.jdpme.20230105002
5 张领先, 景嘉平, 李淑菲, 等 基于图像自动标注与改进YOLO v5的番茄病害识别系统[J]. 农业机械学报, 2023, 54 (11): 198- 207
18 ZHAO Y, LV W, XU S, et al. Detrs beat yolos on real-time object detection [C]// 2024 IEEE Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2024: 16965–16974.
5 ZHANG Lingxian, JING Jiaping, LI Shufei, et al Tomato disease recognition system based on image automatic labeling and improved YOLOv5[J]. Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (11): 198- 207
6 陈庆林, 谷雨, 宋忠浩, 等 融合检测与跟踪的半自动视频目标标注[J]. 计算机工程与应用, 2021, 57 (14): 223- 230
CHEN Qinglin, GU Yu, SONG Zhonghao, et al Semi-automatic video target annotation by combining detection and tracking[J]. Computer Engineering and Applications, 2021, 57 (14): 223- 230
7 CUI B, CRÉPUT J C A systematic algorithm for moving object detection with application in real-time surveillance[J]. SN Computer Science, 2020, 1 (2): 106
doi: 10.1007/s42979-020-0118-5
8 KALAL Z, MIKOLAJCZYK K, MATAS J Tracking-learning-detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 34 (7): 1409- 1422
9 YADAV S, PAYANDEH S Critical overview of visual tracking with kernel correlation filter[J]. Technologies, 2021, 9 (4): 93
doi: 10.3390/technologies9040093
10 OKOYE O C, BOLAJI B O. Physics of fluid motion [M]// Applications of Heat, Mass and Fluid Boundary Layers. Cambridge: Woodhead Publishing, 2020: 1–22.
11 HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition [C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 770–778.
[1] 董绍江,肖涛,吕振鸣,夏浩然,罗家元,孙世政,张霞,刘超. 基于改进RT-DETR的水下色偏环境中小型生物检测[J]. 浙江大学学报(工学版), 2026, 60(7): 1404-1415.
[2] 于天河,王文龙,刘镛,杨壮壮,侯善冲. 改进的有雾图像中被遮挡车辆及行人识别算法[J]. 浙江大学学报(工学版), 2026, 60(4): 738-750.
[3] 宋耀莲,彭驰,唐菁敏,赵宣植,虞贵财. 基于融合注意力机制的光学遥感图像小目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(4): 763-771.
[4] 孟昱煜,孔垂乐,火久元,武泽宇. 重构YOLOv11的无人机小目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(2): 303-312.
[5] 肖剑,何昕泽,程鸿亮,杨小苑,胡欣. 基于多尺度特征增强的航拍小目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(1): 19-31.
[6] 包晓安,彭书友,张娜,涂小妹,张庆琪,吴彪. 基于多方位感知深度融合检测头的目标检测算法[J]. 浙江大学学报(工学版), 2026, 60(1): 32-42.
[7] 董超群,汪战,廖平,谢帅,荣玉杰,周靖淞. 轻量化YOLOv5s-OCG的轨枕裂纹检测算法[J]. 浙江大学学报(工学版), 2025, 59(9): 1838-1845.
[8] 翟亚红,陈雅玲,徐龙艳,龚玉. 改进YOLOv8s的轻量级无人机航拍小目标检测算法[J]. 浙江大学学报(工学版), 2025, 59(8): 1708-1717.
[9] 付家瑞,李兆飞,周豪,黄惟. 基于Convnextv2与纹理边缘引导的伪装目标检测[J]. 浙江大学学报(工学版), 2025, 59(8): 1718-1726.
[10] 徐慧智,王秀青. 基于车辆图像特征的前车距离与速度感知[J]. 浙江大学学报(工学版), 2025, 59(6): 1219-1232.
[11] 李沈崇,曾新华,林传渠. 基于轴向注意力的多任务自动驾驶环境感知算法[J]. 浙江大学学报(工学版), 2025, 59(4): 769-777.
[12] 王浚银,文斌,沈艳军,张俊,王子豪. 基于改进YOLOv7-tiny的铝型材表面缺陷检测方法[J]. 浙江大学学报(工学版), 2025, 59(3): 523-534.
[13] 董红召,林少轩,佘翊妮. 交通目标YOLO检测技术的研究进展[J]. 浙江大学学报(工学版), 2025, 59(2): 249-260.
[14] 何永福,谢世维,于佳禄,陈思宇. 考虑跨层特征融合的抛洒风险车辆检测方法[J]. 浙江大学学报(工学版), 2025, 59(2): 300-309.
[15] 陈江浩,杨军. 结合深度可分离卷积的多源遥感融合影像目标检测[J]. 浙江大学学报(工学版), 2025, 59(12): 2545-2555.