Please wait a minute...
Journal of ZheJiang University (Engineering Science)  2026, Vol. 60 Issue (7): 1599-1610    DOI: 10.3785/j.issn.1008-973X.2026.07.021
    
Real-time vehicle detection algorithm based on UAV aerial images
Yuyu MENG(),Yinbao MA,Jiuyuan HUO*()
School of Electronics and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
Download: HTML     PDF(4247KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

Multi-scale targets in unmanned aerial vehicle (UAV) aerial images, especially small targets, have low detection accuracy in complex scenarios such as dense scenes, occlusions, and low illumination. Thus, a convolution-wavelet dual-domain downsampling module (RDWTConv) was proposed to preserve fine details of small targets. Additionally, a three-layer cross-scale residual fusion module (RCDFM) was designed to enhance multi-scale feature interactions. Furthermore, a scale-shape loss function (TSSIoU) was introduced to improve bounding box localization accuracy for varying object scales and shapes under aerial perspectives. On this basis, a series of CF-YOLO models, namely CF-YOLOn, CF-YOLOs, and CF-YOLOm, were constructed based on YOLOv8 to meet diverse computational requirements. Experimental results demonstrated that on the VisDrone dataset, CF-YOLOn achieved a 23.7% reduction in parameters and only a 22.5% increase in computational cost, while improving mAP@0.5 and mAP@0.5:0.95 by 5.5 and 4.0 percentage points, respectively, compared with the baseline YOLOv8n, as well as maintaining a frame rate of 169.1 frames per second. The s and m variants also achieved the highest accuracy within the same frame rate range. After retraining on the Drone-Vehicle dataset, CF-YOLOn’s mAP@0.5:0.95 improved by 3.0 percentage points compared to the baseline. Through the above synergistic improvements, the proposed method not only maintains real-time detection under lightweight computational costs but also effectively enhances multi-scale target detection performance in complex scenarios, achieving state-of-the-art results among comparable methods.



Key wordsvehicle detection      multi-scale target      complex scenario      YOLOv8      downsampling     
Received: 29 May 2025      Published: 23 May 2026
CLC:  TP 391  
Fund:  国家自然科学基金资助项目(62262038);甘肃省技术创新指导计划-科技专家资助项目(25CXGA030);甘肃省重点研发计划-工业资助项目(25YFGA045).
Corresponding Authors: Jiuyuan HUO     E-mail: mengyuyu@mail.lzjtu.cn;huojy@mail.lzjtu.cn
Cite this article:

Yuyu MENG,Yinbao MA,Jiuyuan HUO. Real-time vehicle detection algorithm based on UAV aerial images. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1599-1610.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2026.07.021     OR     https://www.zjujournals.com/eng/Y2026/V60/I7/1599


基于无人机航拍图像的实时车辆检测算法

针对无人机(UAV)航拍图像中多尺度目标,尤其是小目标,在密集、遮挡及低光照等复杂场景下检测精度较低的问题,提出卷积-小波双域下采样器RDWTConv,以保留小目标细节;设计3层跨尺度残差融合模块RCDFM,以增强多尺度特征交互;提出尺度-形状损失TSSIoU,以提升航拍视角下目标尺度与形状的边界框定位精度. 在此基础上,基于YOLOv8构建适配不同算力需求的CF-YOLOn、CF-YOLOs与CF-YOLOm模型. 实验结果显示,在VisDrone数据集上,CF-YOLOn在参数量减少23.7%、计算量仅增加22.5%的情况下,mAP@0.5和mAP@0.5:0.95较基线YOLOv8n分别提高5.5和4.0个百分点,帧率保持169.1帧/s,且在相同帧率区间内,s、m版本取得最高精度;在Drone-Vehicle数据集上重新训练后,CF-YOLOn的mAP@0.5:0.95较基线YOLOv8n提升3.0个百分点. 通过上述协同改进,所提方法不仅在轻量计算开销下保持实时检测,而且有效提升了复杂场景下的多尺度目标检测性能,达到同类方法的先进水平.


关键词: 车辆检测,  多尺度目标,  复杂场景,  YOLOv8,  下采样 
Fig.1 YOLOv8n network structure diagram
Fig.2 CF-YOLOn network structure diagram
Fig.3 Wavelet pooling layer structure diagram
Fig.4 RDWTConv structure diagram
Fig.5 Birectional Concatenate structure diagram
Fig.6 Sandwich-fusion structure diagram
Fig.7 RCDFM structure diagram
Fig.8 Target counts and size distributions in VisDrone and Drone-Vehicle datasets
MethodsP/%R/%mAP@
0.5/%
mAP@
0.5:0.95/%
Params/
106
GFLOPs/
109
CBS44.532.332.418.73.018.1
DWT43.432.332.318.72.797.6
RDWTConv45.133.233.719.63.188.5
ADown[23]43.430.831.117.82.727.4
SCDown[24]44.633.133.019.02.667.6
DWConv[25]43.131.231.217.82.627.2
RepVGGBlock[18]43.732.632.718.93.058.2
Tab.1 Performance comparison of different downsampling modules (VisDrone Dataset)
MethodsmAP@0.5/%mAP@0.5:0.95/%Params/106GFLOPs/109
Concat32.418.73.018.1
BiC[17]33.119.23.058.4
SF[18]32.819.03.028.3
RCDFM34.319.93.078.4
Tab.2 Performance comparison of RCDFM (VisDrone Dataset)
MethodsP/%R/%mAP@0.5/%mAP@0.5:0.95/%
CIoU44.532.332.418.7
DIoU42.832.431.918.5
SIoU42.532.832.218.5
GIoU44.332.232.818.9
EIoU43.232.832.518.8
TSSIoU44.732.933.219.2
Tab.3 Performance comparison of different bounding box loss functions (VisDrone Dataset)
Fig.9 Study on hyperparameter$ \gamma $
ModelsRDWTConvRCDFMTSSIoUP/%R/%mAP@0.5/%mAP@0.5:0.95/%Params/106GFLOPs/109FPS/(帧·s?1)
YOLOv8n44.532.332.418.73.018.1209.9
M145.133.233.719.63.188.5181.3
M246.935.736.421.63.1811.1178.4
M344.732.933.219.23.018.1209.9
M448.836.937.622.43.3511.5168.5
M549.237.538.322.93.3511.5168.5
CF-YOLOn49.336.937.922.72.309.9169.1
Tab.4 Model ablation studies (VisDrone Dataset)
Fig.10 Model comparison experiments (VisDrone Dataset)
ModelsmAP@0.5/%mAP@0.5:0.95/%Params/106FPS/(帧·s?1)
Drone-YOLO74.550.12.97172.1
FBRT-YOLOn[26]74.650.20.90165.3
IV-YOLO[27]74.949.64.31184.7
YOLOv8n75.750.53.01205.4
YOLOv9t77.252.21.97103.2
YOLOv10n76.250.62.70136.8
YOLO11n75.450.32.58187.5
YOLO12n76.351.42.5175.9
CF-YOLOn77.853.52.30164.7
Tab.5 Model generalization experiments  (Drone-Vehicle Dataset)
Fig.11 Visualization of results (VisDrone Dataset)
[1]   HEARST M A, DUMAIS S T, OSUNA E, et al Support vector machines[J]. IEEE Intelligent Systems and Their Applications, 1998, 13 (4): 18- 28
doi: 10.1109/5254.708428
[2]   BEJA-BATTAIS P. Overview of AdaBoost : reconciling its views to better understand its dynamics [EB/OL]. (2023-10-06)[2025-04-18]. https://arxiv.org/abs/2310.18323
[3]   REN S, HE K, GIRSHICK R, et al Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (6): 1137- 1149
doi: 10.1109/TPAMI.2016.2577031
[4]   REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection [C]// IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 779–788.
[5]   LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detector [C]// European Conference on Computer Vision (ECCV) 2016. Cham: Springer International Publishing, 2016: 21–37.
[6]   GUPTA P, PAREEK B, SINGAL G, et al Edge device based military vehicle detection and classification from UAV[J]. Multimedia Tools and Applications, 2022, 81 (14): 19813- 19834
doi: 10.1007/s11042-021-11242-y
[7]   史涛, 崔杰, 李松 优化改进YOLOv8实现实时无人机车辆检测的算法[J]. 计算机工程与应用, 2024, 60 (9): 79- 89
SHI Tao, CUI Jie, LI Song Algorithm for real-time vehicle detection from UAVs based on optimizing and improving YOLOv8[J]. Computer Engineering and Applications, 2024, 60 (9): 79- 89
doi: 10.3778/j.issn.1002-8331.2312-0291
[8]   SUN Y, SHAO Z, CHENG G, et al Road and car extraction using UAV images via efficient dual contextual parsing network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 5632113
[9]   HAMZENEJADI M H, MOHSENI H Fine-tuned YOLOv5 for real-time vehicle detection in UAV imagery: architectural improvements and performance boost[J]. Expert Systems with Applications, 2023, 231: 120845
doi: 10.1016/j.eswa.2023.120845
[10]   YING Z, ZHOU J, ZHAI Y, et al Large-scale high-altitude UAV-based vehicle detection via pyramid dual pooling attention path aggregation network[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25 (10): 14426- 14444
doi: 10.1109/TITS.2024.3396915
[11]   HUI Y, WANG J, LI B STF-YOLO: a small target detection algorithm for UAV remote sensing images based on improved SwinTransformer and class weighted classification decoupling head[J]. Measurement, 2024, 224: 113936
doi: 10.1016/j.measurement.2023.113936
[12]   姜贸翔, 司占军, 王晓喆 改进RT-DETR的无人机图像目标检测算法[J]. 计算机工程与应用, 2025, 61 (1): 98- 108
JIANG Maoxiang, SI Zhanjun, WANG Xiaozhe Improved target detection algorithm for UAV images with RT-DETR[J]. Computer Engineering and Applications, 2025, 61 (1): 98- 108
doi: 10.3778/j.issn.1002-8331.2405-0331
[13]   李彬, 李生林 改进YOLOv11n的无人机小目标检测算法[J]. 计算机工程与应用, 2025, 61 (7): 96- 104
LI Bin, LI Shenglin Improved YOLOv11n small object detection algorithm in UAV view[J]. Computer Engineering and Applications, 2025, 61 (7): 96- 104
doi: 10.3778/j.issn.1002-8331.2411-0072
[14]   梁燕, 何孝武, 邵凯, 等 改进YOLOv8的无人机航拍图像目标检测算法[J]. 计算机工程与应用, 2025, 61 (1): 121- 130
LIANG Yan, HE Xiaowu, SHAO Kai, et al Target detection algorithm for UAV images based on improved YOLOv8[J]. Computer Engineering and Applications, 2025, 61 (1): 121- 130
doi: 10.3778/j.issn.1002-8331.2405-0459
[15]   JOCHER G, CHAURASIA A, QIU J. Ultralytics YOLOv8 [EB/OL]. (2023-01-28)[2025-04-18]. https://github.com/ultralytics/ultralytics.
[16]   XUE Y, JIN G, SHEN T, et al SmallTrack: wavelet pooling and graph enhanced classification for UAV small object tracking[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 5618815
[17]   LI C, LI L, GENG Y, et al. YOLOv6 v3. 0: a full-scale reloading [EB/OL]. (2023-01-13)[2025-04-18]. https://arxiv.org/abs/2301.05586.
[18]   ZHANG Z Drone-YOLO: an efficient neural network method for target detection in drone images[J]. Drones, 2023, 7 (8): 526
doi: 10.3390/drones7080526
[19]   ZHANG Y F, REN W, ZHANG Z, et al Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506: 146- 157
doi: 10.1016/j.neucom.2022.07.042
[20]   YANG X, YAN J, MING Q, et al. Rethinking rotated object detection with Gaussian Wasserstein distance loss [C]// International Conference on Machine Learning (ICML). Virtual Event: PMLR, 2021: 11830–11841.
[21]   DU D, ZHU P, WEN L, et al. VisDrone-DET2019: the Vision Meets Drone Object Detection in Image Challenge Results [C]// 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). Seoul: IEEE, 2019: 213–226.
[22]   SUN Y, CAO B, ZHU P, et al Drone-based RGB-infrared cross-modality vehicle detection via uncertainty-aware learning[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32 (10): 6700- 6713
doi: 10.1109/TCSVT.2022.3168279
[23]   WANG C-Y, YEH I-H, LIAO H. YOLOv9: learning what you want to learn using programmable gradient information [EB/OL]. (2024-02-21)[2025-04-18]. https://arxiv.org/abs/2402.13616.
[24]   WANG A, CHEN H, LIU L, et al. YOLOv10: real-time end-to-end object detection [EB/OL]. (2023-05-23)[2025-04-18]. https://arxiv.org/abs/2405.14458.
[25]   CHOLLET F. Xception: deep learning with depthwise separable convolutions [C]// IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 1800–1807.
[26]   XIAO Y, XU T, XIN Y, et al. FBRT-YOLO: faster and better for real-time aerial image detection [EB/OL]. (2025-04-29)[2025-04-18]. https://arxiv.org/abs/2504.20670.
[1] Jian XIAO,Xiaoyuan YANG,Xinze HE,Lin CHEN,Xin HU. Lightweight rebar surface defect detection algorithm based on global information perception[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1438-1451.
[2] Wei TIAN,Linhong ZHOU,Xinyang LI,Jianming WANG,Yukang HUANG. 3D-printed concrete apparent defect detection method based on improved YOLOv8[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(4): 833-843.
[3] Binbin LI,Chao ZHANG,Tao QIN,Changsheng CHEN,Xingyan LIU,Jing YANG. Mobile-based human fall detection method for photovoltaic power plant construction[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 546-555.
[4] Weiqun LUO,Jingwei LU,Jiadi WU,Yuying LIANG,Chuanpeng SHEN,Rui ZHU. Lightweight detection model for typical environmental terrain target in Tibetan Plateau[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 594-603.
[5] Yuyu MENG,Chuile KONG,Jiuyuan HUO,Zeyu WU. UAV small target detection algorithm based on reconstruction of YOLOv11[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(2): 303-312.
[6] Jian XIAO,Xinze HE,Hongliang CHENG,Xiaoyuan YANG,Xin HU. Aerial small target detection algorithm based on multi-scale feature enhancement[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 19-31.
[7] Yahong ZHAI,Yaling CHEN,Longyan XU,Yu GONG. Improved YOLOv8s lightweight small target detection algorithm of UAV aerial image[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(8): 1708-1717.
[8] Jingyao HE,Pengfei LI,Chengzhi WANG,Zhenming LV,Ping MU. Dynamic 3D reconstruction method using binocular vision and improved YOLOv8[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(7): 1443-1450.
[9] Ming CAO,Wufeng DUAN,Mengxiao MA,Fanrong AI,Kui ZHOU. Uniformity evaluation of bio-printer based on improved YOLOv8-Seg model[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1277-1283.
[10] Liming LIANG,Pengwei LONG,Jiaxin JIN,Renjie LI,Lu ZENG. Steel surface defect detection algorithm based on improved YOLOv8s[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(3): 512-522.
[11] Yongfu HE,Shiwei XIE,Jialu YU,Siyu CHEN. Detection method for spillage risk vehicle considering cross-level feature fusion[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(2): 300-309.
[12] Lin DUO,Yu YIN,Wei DUAN,Yun ZHANG,Yong REN. Ship target detection algorithm based on improved YOLOv8[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(11): 2379-2388.
[13] Xiaochun WU,Hengjun ZHANG,Lei TAN. Corrosion detection and grade determination of tunnel bolts based on YOLOv8-HSV[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(10): 2144-2153.
[14] Tianmin DENG,Xinxin CHENG,Jinfeng LIU,Xiyue ZHANG. Small target detection algorithm for aerial images based on feature reuse mechanism[J]. Journal of ZheJiang University (Engineering Science), 2024, 58(3): 437-448.
[15] Anjing WANG,Julong YUAN,Yongjian ZHU,Cong CHEN,Jinjin WU. Drum roller surface defect detection algorithm based on improved YOLOv8s[J]. Journal of ZheJiang University (Engineering Science), 2024, 58(2): 370-380.