|
|
|
| Real-time vehicle detection algorithm based on UAV aerial images |
Yuyu MENG( ),Yinbao MA,Jiuyuan HUO*( ) |
| School of Electronics and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China |
|
|
|
Abstract Multi-scale targets in unmanned aerial vehicle (UAV) aerial images, especially small targets, have low detection accuracy in complex scenarios such as dense scenes, occlusions, and low illumination. Thus, a convolution-wavelet dual-domain downsampling module (RDWTConv) was proposed to preserve fine details of small targets. Additionally, a three-layer cross-scale residual fusion module (RCDFM) was designed to enhance multi-scale feature interactions. Furthermore, a scale-shape loss function (TSSIoU) was introduced to improve bounding box localization accuracy for varying object scales and shapes under aerial perspectives. On this basis, a series of CF-YOLO models, namely CF-YOLOn, CF-YOLOs, and CF-YOLOm, were constructed based on YOLOv8 to meet diverse computational requirements. Experimental results demonstrated that on the VisDrone dataset, CF-YOLOn achieved a 23.7% reduction in parameters and only a 22.5% increase in computational cost, while improving mAP@0.5 and mAP@0.5:0.95 by 5.5 and 4.0 percentage points, respectively, compared with the baseline YOLOv8n, as well as maintaining a frame rate of 169.1 frames per second. The s and m variants also achieved the highest accuracy within the same frame rate range. After retraining on the Drone-Vehicle dataset, CF-YOLOn’s mAP@0.5:0.95 improved by 3.0 percentage points compared to the baseline. Through the above synergistic improvements, the proposed method not only maintains real-time detection under lightweight computational costs but also effectively enhances multi-scale target detection performance in complex scenarios, achieving state-of-the-art results among comparable methods.
|
|
Received: 29 May 2025
Published: 23 May 2026
|
|
|
| Fund: 国家自然科学基金资助项目(62262038);甘肃省技术创新指导计划-科技专家资助项目(25CXGA030);甘肃省重点研发计划-工业资助项目(25YFGA045). |
|
Corresponding Authors:
Jiuyuan HUO
E-mail: mengyuyu@mail.lzjtu.cn;huojy@mail.lzjtu.cn
|
基于无人机航拍图像的实时车辆检测算法
针对无人机(UAV)航拍图像中多尺度目标,尤其是小目标,在密集、遮挡及低光照等复杂场景下检测精度较低的问题,提出卷积-小波双域下采样器RDWTConv,以保留小目标细节;设计3层跨尺度残差融合模块RCDFM,以增强多尺度特征交互;提出尺度-形状损失TSSIoU,以提升航拍视角下目标尺度与形状的边界框定位精度. 在此基础上,基于YOLOv8构建适配不同算力需求的CF-YOLOn、CF-YOLOs与CF-YOLOm模型. 实验结果显示,在VisDrone数据集上,CF-YOLOn在参数量减少23.7%、计算量仅增加22.5%的情况下,mAP@0.5和mAP@0.5:0.95较基线YOLOv8n分别提高5.5和4.0个百分点,帧率保持169.1帧/s,且在相同帧率区间内,s、m版本取得最高精度;在Drone-Vehicle数据集上重新训练后,CF-YOLOn的mAP@0.5:0.95较基线YOLOv8n提升3.0个百分点. 通过上述协同改进,所提方法不仅在轻量计算开销下保持实时检测,而且有效提升了复杂场景下的多尺度目标检测性能,达到同类方法的先进水平.
关键词:
车辆检测,
多尺度目标,
复杂场景,
YOLOv8,
下采样
|
|
| [1] |
HEARST M A, DUMAIS S T, OSUNA E, et al Support vector machines[J]. IEEE Intelligent Systems and Their Applications, 1998, 13 (4): 18- 28
doi: 10.1109/5254.708428
|
|
|
| [2] |
BEJA-BATTAIS P. Overview of AdaBoost : reconciling its views to better understand its dynamics [EB/OL]. (2023-10-06)[2025-04-18]. https://arxiv.org/abs/2310.18323
|
|
|
| [3] |
REN S, HE K, GIRSHICK R, et al Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (6): 1137- 1149
doi: 10.1109/TPAMI.2016.2577031
|
|
|
| [4] |
REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection [C]// IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 779–788.
|
|
|
| [5] |
LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detector [C]// European Conference on Computer Vision (ECCV) 2016. Cham: Springer International Publishing, 2016: 21–37.
|
|
|
| [6] |
GUPTA P, PAREEK B, SINGAL G, et al Edge device based military vehicle detection and classification from UAV[J]. Multimedia Tools and Applications, 2022, 81 (14): 19813- 19834
doi: 10.1007/s11042-021-11242-y
|
|
|
| [7] |
史涛, 崔杰, 李松 优化改进YOLOv8实现实时无人机车辆检测的算法[J]. 计算机工程与应用, 2024, 60 (9): 79- 89 SHI Tao, CUI Jie, LI Song Algorithm for real-time vehicle detection from UAVs based on optimizing and improving YOLOv8[J]. Computer Engineering and Applications, 2024, 60 (9): 79- 89
doi: 10.3778/j.issn.1002-8331.2312-0291
|
|
|
| [8] |
SUN Y, SHAO Z, CHENG G, et al Road and car extraction using UAV images via efficient dual contextual parsing network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 5632113
|
|
|
| [9] |
HAMZENEJADI M H, MOHSENI H Fine-tuned YOLOv5 for real-time vehicle detection in UAV imagery: architectural improvements and performance boost[J]. Expert Systems with Applications, 2023, 231: 120845
doi: 10.1016/j.eswa.2023.120845
|
|
|
| [10] |
YING Z, ZHOU J, ZHAI Y, et al Large-scale high-altitude UAV-based vehicle detection via pyramid dual pooling attention path aggregation network[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25 (10): 14426- 14444
doi: 10.1109/TITS.2024.3396915
|
|
|
| [11] |
HUI Y, WANG J, LI B STF-YOLO: a small target detection algorithm for UAV remote sensing images based on improved SwinTransformer and class weighted classification decoupling head[J]. Measurement, 2024, 224: 113936
doi: 10.1016/j.measurement.2023.113936
|
|
|
| [12] |
姜贸翔, 司占军, 王晓喆 改进RT-DETR的无人机图像目标检测算法[J]. 计算机工程与应用, 2025, 61 (1): 98- 108 JIANG Maoxiang, SI Zhanjun, WANG Xiaozhe Improved target detection algorithm for UAV images with RT-DETR[J]. Computer Engineering and Applications, 2025, 61 (1): 98- 108
doi: 10.3778/j.issn.1002-8331.2405-0331
|
|
|
| [13] |
李彬, 李生林 改进YOLOv11n的无人机小目标检测算法[J]. 计算机工程与应用, 2025, 61 (7): 96- 104 LI Bin, LI Shenglin Improved YOLOv11n small object detection algorithm in UAV view[J]. Computer Engineering and Applications, 2025, 61 (7): 96- 104
doi: 10.3778/j.issn.1002-8331.2411-0072
|
|
|
| [14] |
梁燕, 何孝武, 邵凯, 等 改进YOLOv8的无人机航拍图像目标检测算法[J]. 计算机工程与应用, 2025, 61 (1): 121- 130 LIANG Yan, HE Xiaowu, SHAO Kai, et al Target detection algorithm for UAV images based on improved YOLOv8[J]. Computer Engineering and Applications, 2025, 61 (1): 121- 130
doi: 10.3778/j.issn.1002-8331.2405-0459
|
|
|
| [15] |
JOCHER G, CHAURASIA A, QIU J. Ultralytics YOLOv8 [EB/OL]. (2023-01-28)[2025-04-18]. https://github.com/ultralytics/ultralytics.
|
|
|
| [16] |
XUE Y, JIN G, SHEN T, et al SmallTrack: wavelet pooling and graph enhanced classification for UAV small object tracking[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 5618815
|
|
|
| [17] |
LI C, LI L, GENG Y, et al. YOLOv6 v3. 0: a full-scale reloading [EB/OL]. (2023-01-13)[2025-04-18]. https://arxiv.org/abs/2301.05586.
|
|
|
| [18] |
ZHANG Z Drone-YOLO: an efficient neural network method for target detection in drone images[J]. Drones, 2023, 7 (8): 526
doi: 10.3390/drones7080526
|
|
|
| [19] |
ZHANG Y F, REN W, ZHANG Z, et al Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506: 146- 157
doi: 10.1016/j.neucom.2022.07.042
|
|
|
| [20] |
YANG X, YAN J, MING Q, et al. Rethinking rotated object detection with Gaussian Wasserstein distance loss [C]// International Conference on Machine Learning (ICML). Virtual Event: PMLR, 2021: 11830–11841.
|
|
|
| [21] |
DU D, ZHU P, WEN L, et al. VisDrone-DET2019: the Vision Meets Drone Object Detection in Image Challenge Results [C]// 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). Seoul: IEEE, 2019: 213–226.
|
|
|
| [22] |
SUN Y, CAO B, ZHU P, et al Drone-based RGB-infrared cross-modality vehicle detection via uncertainty-aware learning[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32 (10): 6700- 6713
doi: 10.1109/TCSVT.2022.3168279
|
|
|
| [23] |
WANG C-Y, YEH I-H, LIAO H. YOLOv9: learning what you want to learn using programmable gradient information [EB/OL]. (2024-02-21)[2025-04-18]. https://arxiv.org/abs/2402.13616.
|
|
|
| [24] |
WANG A, CHEN H, LIU L, et al. YOLOv10: real-time end-to-end object detection [EB/OL]. (2023-05-23)[2025-04-18]. https://arxiv.org/abs/2405.14458.
|
|
|
| [25] |
CHOLLET F. Xception: deep learning with depthwise separable convolutions [C]// IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 1800–1807.
|
|
|
| [26] |
XIAO Y, XU T, XIN Y, et al. FBRT-YOLO: faster and better for real-time aerial image detection [EB/OL]. (2025-04-29)[2025-04-18]. https://arxiv.org/abs/2504.20670.
|
|
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
| |
Shared |
|
|
|
|
| |
Discussed |
|
|
|
|