Real-time vehicle detection algorithm based on UAV aerial images

doi:10.3785/j.issn.1008-973X.2026.07.021

Journal of ZheJiang University (Engineering Science)

2026, Vol. 60

Issue (7): 1599-1610 DOI: 10.3785/j.issn.1008-973X.2026.07.021

Real-time vehicle detection algorithm based on UAV aerial images

Yuyu MENG(

),Yinbao MA,Jiuyuan HUO*(

)

School of Electronics and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China

Download:

HTML

PDF(4247KB) HTML
Export: BibTeX | EndNote (RIS)

Abstract

Multi-scale targets in unmanned aerial vehicle (UAV) aerial images, especially small targets, have low detection accuracy in complex scenarios such as dense scenes, occlusions, and low illumination. Thus, a convolution-wavelet dual-domain downsampling module (RDWTConv) was proposed to preserve fine details of small targets. Additionally, a three-layer cross-scale residual fusion module (RCDFM) was designed to enhance multi-scale feature interactions. Furthermore, a scale-shape loss function (TSSIoU) was introduced to improve bounding box localization accuracy for varying object scales and shapes under aerial perspectives. On this basis, a series of CF-YOLO models, namely CF-YOLOn, CF-YOLOs, and CF-YOLOm, were constructed based on YOLOv8 to meet diverse computational requirements. Experimental results demonstrated that on the VisDrone dataset, CF-YOLOn achieved a 23.7% reduction in parameters and only a 22.5% increase in computational cost, while improving mAP@0.5 and mAP@0.5:0.95 by 5.5 and 4.0 percentage points, respectively, compared with the baseline YOLOv8n, as well as maintaining a frame rate of 169.1 frames per second. The s and m variants also achieved the highest accuracy within the same frame rate range. After retraining on the Drone-Vehicle dataset, CF-YOLOn’s mAP@0.5:0.95 improved by 3.0 percentage points compared to the baseline. Through the above synergistic improvements, the proposed method not only maintains real-time detection under lightweight computational costs but also effectively enhances multi-scale target detection performance in complex scenarios, achieving state-of-the-art results among comparable methods.

Key words： vehicle detection multi-scale target complex scenario YOLOv8 downsampling

Received: 29 May 2025 Published: 23 May 2026

CLC:

TP 391

Fund: 国家自然科学基金资助项目（62262038）；甘肃省技术创新指导计划-科技专家资助项目（25CXGA030）；甘肃省重点研发计划-工业资助项目（25YFGA045）.

Corresponding Authors: Jiuyuan HUO E-mail: mengyuyu@mail.lzjtu.cn;huojy@mail.lzjtu.cn

	Service
	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Yuyu MENG
	Yinbao MA
	Jiuyuan HUO

Cite this article:

Yuyu MENG,Yinbao MA,Jiuyuan HUO. Real-time vehicle detection algorithm based on UAV aerial images. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1599-1610.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2026.07.021 OR https://www.zjujournals.com/eng/Y2026/V60/I7/1599

基于无人机航拍图像的实时车辆检测算法

针对无人机(UAV)航拍图像中多尺度目标，尤其是小目标，在密集、遮挡及低光照等复杂场景下检测精度较低的问题，提出卷积-小波双域下采样器RDWTConv，以保留小目标细节；设计3层跨尺度残差融合模块RCDFM，以增强多尺度特征交互；提出尺度-形状损失TSSIoU，以提升航拍视角下目标尺度与形状的边界框定位精度. 在此基础上，基于YOLOv8构建适配不同算力需求的CF-YOLOn、CF-YOLOs与CF-YOLOm模型. 实验结果显示，在VisDrone数据集上，CF-YOLOn在参数量减少23.7%、计算量仅增加22.5%的情况下，mAP@0.5和mAP@0.5:0.95较基线YOLOv8n分别提高5.5和4.0个百分点，帧率保持169.1帧/s，且在相同帧率区间内，s、m版本取得最高精度；在Drone-Vehicle数据集上重新训练后，CF-YOLOn的mAP@0.5:0.95较基线YOLOv8n提升3.0个百分点. 通过上述协同改进，所提方法不仅在轻量计算开销下保持实时检测，而且有效提升了复杂场景下的多尺度目标检测性能，达到同类方法的先进水平.

关键词： 车辆检测, 多尺度目标, 复杂场景, YOLOv8, 下采样

Fig.1 YOLOv8n network structure diagram

Fig.2 CF-YOLOn network structure diagram

Fig.3 Wavelet pooling layer structure diagram

Fig.4 RDWTConv structure diagram

Fig.5 Birectional Concatenate structure diagram

Fig.6 Sandwich-fusion structure diagram

Fig.7 RCDFM structure diagram

Fig.8 Target counts and size distributions in VisDrone and Drone-Vehicle datasets

Tab.1 Performance comparison of different downsampling modules (VisDrone Dataset)

Tab.2 Performance comparison of RCDFM (VisDrone Dataset)

Tab.3 Performance comparison of different bounding box loss functions (VisDrone Dataset)

Fig.9 Study on hyperparameter

$ \gamma $

Tab.4 Model ablation studies (VisDrone Dataset)

Fig.10 Model comparison experiments (VisDrone Dataset)

Tab.5 Model generalization experiments  (Drone-Vehicle Dataset)

Fig.11 Visualization of results (VisDrone Dataset)


[1]	HEARST M A, DUMAIS S T, OSUNA E, et al Support vector machines[J]. IEEE Intelligent Systems and Their Applications, 1998, 13 (4): 18- 28 doi: 10.1109/5254.708428

[2]	BEJA-BATTAIS P. Overview of AdaBoost : reconciling its views to better understand its dynamics [EB/OL]. (2023-10-06)[2025-04-18]. https://arxiv.org/abs/2310.18323

[3]	REN S, HE K, GIRSHICK R, et al Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (6): 1137- 1149 doi: 10.1109/TPAMI.2016.2577031

[4]	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection [C]// IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 779–788.

[5]	LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detector [C]// European Conference on Computer Vision (ECCV) 2016. Cham: Springer International Publishing, 2016: 21–37.

[6]	GUPTA P, PAREEK B, SINGAL G, et al Edge device based military vehicle detection and classification from UAV[J]. Multimedia Tools and Applications, 2022, 81 (14): 19813- 19834 doi: 10.1007/s11042-021-11242-y

[7]	史涛, 崔杰, 李松优化改进YOLOv8实现实时无人机车辆检测的算法[J]. 计算机工程与应用, 2024, 60 (9): 79- 89 SHI Tao, CUI Jie, LI Song Algorithm for real-time vehicle detection from UAVs based on optimizing and improving YOLOv8[J]. Computer Engineering and Applications, 2024, 60 (9): 79- 89 doi: 10.3778/j.issn.1002-8331.2312-0291

[8]	SUN Y, SHAO Z, CHENG G, et al Road and car extraction using UAV images via efficient dual contextual parsing network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 5632113

[9]	HAMZENEJADI M H, MOHSENI H Fine-tuned YOLOv5 for real-time vehicle detection in UAV imagery: architectural improvements and performance boost[J]. Expert Systems with Applications, 2023, 231: 120845 doi: 10.1016/j.eswa.2023.120845

[10]	YING Z, ZHOU J, ZHAI Y, et al Large-scale high-altitude UAV-based vehicle detection via pyramid dual pooling attention path aggregation network[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25 (10): 14426- 14444 doi: 10.1109/TITS.2024.3396915

[11]	HUI Y, WANG J, LI B STF-YOLO: a small target detection algorithm for UAV remote sensing images based on improved SwinTransformer and class weighted classification decoupling head[J]. Measurement, 2024, 224: 113936 doi: 10.1016/j.measurement.2023.113936

[12]	姜贸翔, 司占军, 王晓喆改进RT-DETR的无人机图像目标检测算法[J]. 计算机工程与应用, 2025, 61 (1): 98- 108 JIANG Maoxiang, SI Zhanjun, WANG Xiaozhe Improved target detection algorithm for UAV images with RT-DETR[J]. Computer Engineering and Applications, 2025, 61 (1): 98- 108 doi: 10.3778/j.issn.1002-8331.2405-0331

[13]	李彬, 李生林改进YOLOv11n的无人机小目标检测算法[J]. 计算机工程与应用, 2025, 61 (7): 96- 104 LI Bin, LI Shenglin Improved YOLOv11n small object detection algorithm in UAV view[J]. Computer Engineering and Applications, 2025, 61 (7): 96- 104 doi: 10.3778/j.issn.1002-8331.2411-0072

[14]	梁燕, 何孝武, 邵凯, 等改进YOLOv8的无人机航拍图像目标检测算法[J]. 计算机工程与应用, 2025, 61 (1): 121- 130 LIANG Yan, HE Xiaowu, SHAO Kai, et al Target detection algorithm for UAV images based on improved YOLOv8[J]. Computer Engineering and Applications, 2025, 61 (1): 121- 130 doi: 10.3778/j.issn.1002-8331.2405-0459

[15]	JOCHER G, CHAURASIA A, QIU J. Ultralytics YOLOv8 [EB/OL]. (2023-01-28)[2025-04-18]. https://github.com/ultralytics/ultralytics.

[16]	XUE Y, JIN G, SHEN T, et al SmallTrack: wavelet pooling and graph enhanced classification for UAV small object tracking[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 5618815

[17]	LI C, LI L, GENG Y, et al. YOLOv6 v3. 0: a full-scale reloading [EB/OL]. (2023-01-13)[2025-04-18]. https://arxiv.org/abs/2301.05586.

[18]	ZHANG Z Drone-YOLO: an efficient neural network method for target detection in drone images[J]. Drones, 2023, 7 (8): 526 doi: 10.3390/drones7080526

[19]	ZHANG Y F, REN W, ZHANG Z, et al Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506: 146- 157 doi: 10.1016/j.neucom.2022.07.042

[20]	YANG X, YAN J, MING Q, et al. Rethinking rotated object detection with Gaussian Wasserstein distance loss [C]// International Conference on Machine Learning (ICML). Virtual Event: PMLR, 2021: 11830–11841.

[21]	DU D, ZHU P, WEN L, et al. VisDrone-DET2019: the Vision Meets Drone Object Detection in Image Challenge Results [C]// 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). Seoul: IEEE, 2019: 213–226.

[22]	SUN Y, CAO B, ZHU P, et al Drone-based RGB-infrared cross-modality vehicle detection via uncertainty-aware learning[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32 (10): 6700- 6713 doi: 10.1109/TCSVT.2022.3168279

[23]	WANG C-Y, YEH I-H, LIAO H. YOLOv9: learning what you want to learn using programmable gradient information [EB/OL]. (2024-02-21)[2025-04-18]. https://arxiv.org/abs/2402.13616.

[24]	WANG A, CHEN H, LIU L, et al. YOLOv10: real-time end-to-end object detection [EB/OL]. (2023-05-23)[2025-04-18]. https://arxiv.org/abs/2405.14458.

[25]	CHOLLET F. Xception: deep learning with depthwise separable convolutions [C]// IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 1800–1807.

[26]	XIAO Y, XU T, XIN Y, et al. FBRT-YOLO: faster and better for real-time aerial image detection [EB/OL]. (2025-04-29)[2025-04-18]. https://arxiv.org/abs/2504.20670.

[1]	Jian XIAO,Xiaoyuan YANG,Xinze HE,Lin CHEN,Xin HU. Lightweight rebar surface defect detection algorithm based on global information perception[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1438-1451.

[2]	Wei TIAN,Linhong ZHOU,Xinyang LI,Jianming WANG,Yukang HUANG. 3D-printed concrete apparent defect detection method based on improved YOLOv8[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(4): 833-843.

[3]	Binbin LI,Chao ZHANG,Tao QIN,Changsheng CHEN,Xingyan LIU,Jing YANG. Mobile-based human fall detection method for photovoltaic power plant construction[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 546-555.

[4]	Weiqun LUO,Jingwei LU,Jiadi WU,Yuying LIANG,Chuanpeng SHEN,Rui ZHU. Lightweight detection model for typical environmental terrain target in Tibetan Plateau[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 594-603.

[5]	Yuyu MENG,Chuile KONG,Jiuyuan HUO,Zeyu WU. UAV small target detection algorithm based on reconstruction of YOLOv11[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(2): 303-312.

[6]	Jian XIAO,Xinze HE,Hongliang CHENG,Xiaoyuan YANG,Xin HU. Aerial small target detection algorithm based on multi-scale feature enhancement[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 19-31.

[7]	Yahong ZHAI,Yaling CHEN,Longyan XU,Yu GONG. Improved YOLOv8s lightweight small target detection algorithm of UAV aerial image[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(8): 1708-1717.

[8]	Jingyao HE,Pengfei LI,Chengzhi WANG,Zhenming LV,Ping MU. Dynamic 3D reconstruction method using binocular vision and improved YOLOv8[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(7): 1443-1450.

[9]	Ming CAO,Wufeng DUAN,Mengxiao MA,Fanrong AI,Kui ZHOU. Uniformity evaluation of bio-printer based on improved YOLOv8-Seg model[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1277-1283.

[10]	Liming LIANG,Pengwei LONG,Jiaxin JIN,Renjie LI,Lu ZENG. Steel surface defect detection algorithm based on improved YOLOv8s[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(3): 512-522.

[11]	Yongfu HE,Shiwei XIE,Jialu YU,Siyu CHEN. Detection method for spillage risk vehicle considering cross-level feature fusion[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(2): 300-309.

[12]	Lin DUO,Yu YIN,Wei DUAN,Yun ZHANG,Yong REN. Ship target detection algorithm based on improved YOLOv8[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(11): 2379-2388.

[13]	Xiaochun WU,Hengjun ZHANG,Lei TAN. Corrosion detection and grade determination of tunnel bolts based on YOLOv8-HSV[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(10): 2144-2153.

[14]	Tianmin DENG,Xinxin CHENG,Jinfeng LIU,Xiyue ZHANG. Small target detection algorithm for aerial images based on feature reuse mechanism[J]. Journal of ZheJiang University (Engineering Science), 2024, 58(3): 437-448.

[15]	Anjing WANG,Julong YUAN,Yongjian ZHU,Cong CHEN,Jinjin WU. Drum roller surface defect detection algorithm based on improved YOLOv8s[J]. Journal of ZheJiang University (Engineering Science), 2024, 58(2): 370-380.

Viewed

Full text

Abstract

Cited

Shared

Discussed