Please wait a minute...
浙江大学学报(工学版)  2026, Vol. 60 Issue (6): 1289-1298    DOI: 10.3785/j.issn.1008-973X.2026.06.016
计算机技术     
无人机辅助车联网NOMA协同缓存优化
张艺炜(),崔鑫*(),赵庆慧,陈燕
山东理工大学 计算机科学与技术学院,山东 淄博 255049
Collaborative content caching optimization in UAV-assisted internet of vehicle based on NOMA
Yiwei ZHANG(),Xin CUI*(),Qinghui ZHAO,Yan CHEN
School of Computer Science and Technology, Shandong University of Technology, Zibo 255049, China
 全文: PDF(1088 KB)   HTML
摘要:

针对车联网(IoV)高动态场景下计算密集与延迟敏感型业务的通信需求,提出基于非正交多址接入(NOMA)的无人机集群协作内容缓存方案. 引入无人机集群作为边缘节点,结合NOMA技术实现多车辆频谱共享,设计基于K-Means++的动态分簇机制以周期性划分车辆集群,优化无人机簇头的部署位置. 对无人机集群网构建图卷积网络(GCN),通过动态邻接矩阵聚合无人机集群的拓扑关系、缓存状态及内容流行度特征,增强跨节点信息共享能力. 将协同缓存问题建模为分散式部分可观测马尔可夫决策过程(Dec-POMDP),把注意力机制引入Qmix深度强化学习算法,利用注意力机制加权融合邻域无人机状态,实现长期缓存命中率最大化. 仿真结果表明,与传统正交多址接入(OMA)相比,NOMA在时延和吞吐量性能上提高了近60%,所提方案相较于其他缓存方案,在不同车辆密度场景下,缓存命中率、平均内容检索时延及能耗等性能指标均有所提高,验证了所提方案在动态车联网环境下的高效性与鲁棒性.

关键词: 内容缓存非正交多址接入(NOMA)车联网移动边缘计算深度强化学习算法图卷积网络    
Abstract:

A UAV-swarm-enabled collaborative content caching scheme based on non-orthogonal multiple access (NOMA) was proposed in order to address the communication requirement of computation-intensive and latency-sensitive service in highly dynamic internet of vehicle (IoV) scenario. NOMA technology was integrated to achieve spectrum sharing among multiple vehicle by deploying UAV swarm as edge node. A K-Means++ based dynamic clustering mechanism was designed to periodically partition vehicle cluster and optimize the deployment location of UAV cluster head. A graph convolutional network (GCN) was constructed for the UAV swarm network in order to aggregate the topological relationship, caching status and content popularity feature through a dynamic adjacency matrix. Then the capability of cross-node information sharing was enhanced. The cooperative caching problem was formulated as a decentralized partially observable Markov decision process (Dec-POMDP). An attention mechanism was introduced into the Qmix deep reinforcement learning algorithm. The attention mechanism was utilized to perform weighted fusion of neighboring UAV state in order to maximize the long-term cache hit rate. The simulation results showed that NOMA achieved nearly 60% improvement in latency and throughput performance compared with traditional orthogonal multiple access (OMA). The proposed scheme outperformed other caching schemes across various vehicle density scenario, showing enhancement in key performance metrics such as cache hit rate, average content retrieval latency and energy consumption. The efficiency and robustness of the proposed scheme in dynamic IoV environment were validated.

Key words: content caching    non-orthogonal multiple access (NOMA)    internet of vehicle    mobile edge computing    deep reinforcement learning algorithm    graph convolutional network
收稿日期: 2025-06-09 出版日期: 2026-05-06
CLC:  TN 925  
基金资助: 科技博士项目基金资助项目(4041422007).
通讯作者: 崔鑫     E-mail: zhangyiwei1014@163.com;cx@sdut.edu.cn
作者简介: 张艺炜(2001—),男,硕士生,从事智能交通系统、车联网、边缘计算技术的研究. orcid.org/0009-0006-6767-022X. E-mail:zhangyiwei1014@163.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
张艺炜
崔鑫
赵庆慧
陈燕

引用本文:

张艺炜,崔鑫,赵庆慧,陈燕. 无人机辅助车联网NOMA协同缓存优化[J]. 浙江大学学报(工学版), 2026, 60(6): 1289-1298.

Yiwei ZHANG,Xin CUI,Qinghui ZHAO,Yan CHEN. Collaborative content caching optimization in UAV-assisted internet of vehicle based on NOMA. Journal of ZheJiang University (Engineering Science), 2026, 60(6): 1289-1298.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.06.016        https://www.zjujournals.com/eng/CN/Y2026/V60/I6/1289

图 1  基于NOMA的无人机集群协同缓存示意图
算法1 GCQM无人机协同缓存算法
1) 初始化:定义图${G} $、初始全局状态$ {\boldsymbol{S}} $、每个无人机的局部观测状态$ {\boldsymbol{o}} $和邻接矩阵集$ \boldsymbol{A} $. 初始化GCN参数$ {{\boldsymbol{W}}}^{\left(l\right)} $、局部Q网络参数$ {{\boldsymbol{\theta}} }_{\text{local}} $和全局Q网络参数$ {{\boldsymbol{\theta}} }_{\text{mix}} $、批量大小$ {N}_{{\mathrm{b}}} $.
2) For each iteration do
Step1: 图卷积网络特征提取和Q网络输入
3) For each UAV agent do
4)    使用式(30)~(32)提取最终特征并输入局部Q网络
5)    使用式(33)计算每个无人机的局部Q值
6)  end
7) 将所有局部Q值输入全局混合网络$ {f}_{\text{mix}} $中,以计算全局Q
Step 2: 损失计算和参数更新
8)  使用式(35)计算全局 Q 网络的损失值
9)  更新GCN参数$ {\nabla }_{{\boldsymbol{W}}(l)}{L}_{\text{local}}={\partial {L}_{\text{local}}}/{\partial {{\boldsymbol{W}}}^{(l)}} $
10) 更新局部Q网络参数$ {{\boldsymbol{\theta}} }_{\text{local}}\leftarrow {{\boldsymbol{\theta}} }_{\text{local}}-\eta {\nabla }_{{{{\boldsymbol{\theta }}}_{\text{local}}}}{L}_{\text{local}} $
11) 更新全局Q网络参数$ {{\boldsymbol{\theta}} }_{\text{mix}}\leftarrow {{\boldsymbol{\theta }}}_{\text{mix}}-\eta {\nabla }_{{{{\boldsymbol{\theta}} }_{\text{mix}}}}{L}_{\text{local}} $
Step 3: 动作选择与经验回放
12) 每驾无人机选择动作$ {\boldsymbol{a}}_{{t}_{c}}^{u} $
13) 存储$ ({\boldsymbol{s}}_{{{{t}}_{{c}}}},{\boldsymbol{a}}_{{{{t}}_{{c}}}},{\boldsymbol{r}}_{{{{t}}_{{c}}}},{\boldsymbol{s}}_{{{{t}}_{{c}+1}}}) $到经验回放池中
14) 选择一个小批量样本并重复损失计算和参数更新过程
15) End
16) 输出:优化后的全局Q值和无人机行动策略
  
参数数值参数数值
HAP高度20 km噪声功率?174 dBm/Hz
HAP发射功率40 dBm仿真时长1080 s
HAP带宽100 MHzGCN学习率0.005
H2U链路频率3.5 GHzQmix学习率0.001
U2V载波频率2 GHz折扣因子0.9
UAV带宽20 MHz经验回放区10000
无人机发射功率30 dBm小批量样本64
本地与协作命中率权重0.67, 0.33训练轮次5000
表 1  基于无人机缓存的网络仿真参数
车辆数轮廓系数车辆数轮廓系数
200.68800.55
400.641000.54
600.63
表 2  不同车辆规模下K-Means++动态分簇轮廓系数表
车辆数吞吐量/(Gb·s?1)时延/ms
NOMAOMANOMAOMA
208.16.2120181
6012.37.5145382
10015.68.9195620
表 3  NOMA与OMA的吞吐量及时延性能对比
图 2  不同车辆规模下的命中率、时延、系统回程负载及能耗性能评估
图 3  不同无人机缓存容量下的命中率、时延、系统回程负载及能耗性能评估
图 4  GCN在无人机协作缓存收敛性能中的关键作用验证
1 GARCIA M H C, MOLINA-GALAN A, BOBAN M, et al A tutorial on 5G NR V2X communications[J]. IEEE Communications Surveys and Tutorials, 2021, 23 (3): 1972- 2026
doi: 10.1109/COMST.2021.3057017
2 SHI J, DU J, SHEN Y, et al DRL-based V2V computation offloading for blockchain-enabled vehicular networks[J]. IEEE Transactions on Mobile Computing, 2023, 22 (7): 3882- 3897
doi: 10.1109/TMC.2022.3153346
3 JIANG H, DAI X, XIAO Z, et al Joint task offloading and resource allocation for energy-constrained mobile edge computing[J]. IEEE Transactions on Mobile Computing, 2023, 22 (7): 4000- 4015
doi: 10.1109/TMC.2022.3150432
4 WU H, JIN J, MA H, et al Federation-based deep reinforcement learning cooperative cache in vehicular edge networks[J]. IEEE Internet of Things Journal, 2024, 11 (2): 2550- 2560
doi: 10.1109/JIOT.2023.3292374
5 余意, 李松, 王艳芬 车联网场景联合缓存及内容请求策略[J]. 高技术通讯, 2022, 32 (5): 502- 510
YU Yi, LI Song, WANG Yanfen Joint caching and content request strategy for Internet of vehicle[J]. Chinese High Technology Letters, 2022, 32 (5): 502- 510
doi: 10.3772/j.issn.1002-0470.2022.05.007
6 XU C, ZHANG P, XIA X, et al Digital-twin-assisted intelligent secure task offloading and caching in blockchain-based vehicular edge computing networks[J]. IEEE Internet of Things Journal, 2025, 12 (4): 4128- 4143
doi: 10.1109/JIOT.2024.3482870
7 雒江涛, 杨和平, 冉泳屹 基于参数化强化学习的车联网内容缓存和功率分配联合优化[J]. 电子与信息学报, 2023, 45 (7): 2476- 2483
LUO Jiangtao, YANG Heping, RAN Yongyi Joint optimization of content caching and power distribution for Internet of vehicles based on parametric reinforcement learning[J]. Journal of Electronics and Information Technology, 2023, 45 (7): 2476- 2483
doi: 10.11999/JEIT220857
8 崔亚平, 石宏吉, 吴大鹏, 等 内容新鲜度保障的车联网多智能体缓存分发策略[J]. 通信学报, 2025, 46 (1): 52- 66
CUI Yaping, SHI Hongji, WU Dapeng, et al Multi-agent caching distribution strategy for content freshness guarantee in IoV[J]. Journal on Communications, 2025, 46 (1): 52- 66
doi: 10.11959/j.issn.1000-436x.2025013
9 YU S, DAS A K, PARK Y RLBA-UAV: a robust and lightweight blockchain-based authentication and key agreement scheme for PUF-enabled UAVs[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25 (12): 21697- 21708
doi: 10.1109/TITS.2024.3480029
10 VAEZI M, AZARI A, KHOSRAVIRAD S R, et al Cellular, wide-area, and non-terrestrial IoT: a survey on 5G advances and the road toward 6G[J]. IEEE Communications Surveys and Tutorials, 2022, 24 (2): 1117- 1174
doi: 10.1109/COMST.2022.3151028
11 LIN Z, LIN M, DE COLA T, et al Supporting IoT with rate-splitting multiple access in satellite and aerial-integrated networks[J]. IEEE Internet of Things Journal, 2021, 8 (14): 11123- 11134
doi: 10.1109/JIOT.2021.3051603
12 张天魁, 徐瑜, 刘元玮, 等 无人机辅助MEC系统: 架构、关键技术与未来挑战[J]. 电信科学, 2022, 38 (8): 3- 16
ZHANG Tiankui, XU Yu, LIU Yuanwei, et al UAV-assisted MEC systems: architecture, key technology, and future challenges[J]. Telecommunications Science, 2022, 38 (8): 3- 16
doi: 10.11959/j.issn.1000-0801.2022237
13 ARAF S, SAHA A S, KAZI S H, et al UAV assisted cooperative caching on network edge using multi-agent actor-critic reinforcement learning[J]. IEEE Transactions on Vehicular Technology, 2023, 72 (2): 2322- 2337
doi: 10.1109/TVT.2022.3209079
14 BAI J, ZHU S, CHEN Y, et al Joint optimization of caching and content delivery in air–ground cooperation environment[J]. IEEE Internet of Things Journal, 2025, 12 (5): 6029- 6045
doi: 10.1109/JIOT.2024.3490612
15 ZHAO M, ZHANG R, HE Z, et al Joint optimization of trajectory, offloading, caching, and migration for UAV-assisted MEC[J]. IEEE Transactions on Mobile Computing, 2025, 24 (3): 1981- 1998
doi: 10.1109/TMC.2024.3486995
16 YU G, WU J, LIU R, et al Joint cooperative caching and UAV trajectory optimization based on mobility prediction in the Internet of connected vehicles[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25 (11): 17392- 17406
doi: 10.1109/TITS.2024.3429305
17 ISLAM S R, AVAZOV N, DOBRE O A, et al Power-domain non-orthogonal multiple access (NOMA) in 5G systems: potentials and challenges[J]. IEEE Communications Surveys and Tutorials, 2017, 19 (2): 721- 742
doi: 10.1109/COMST.2016.2621116
18 MURUGANATHAN S D, LIN X Q, MÄÄTTÄNEN H, et al An overview of 3GPP release-15 study on enhanced LTE support for connected drones[J]. IEEE Communications Standards Magazine, 2021, 5 (4): 140- 146
19 ZHANG H, HAN M, LIU X, et al Joint resource allocation and trajectory optimization in multi-cell UAV and sidelink heterogeneous networks[J]. IEEE Transactions on Wireless Communications, 2024, 23 (11): 16635- 16647
doi: 10.1109/TWC.2024.3445149
20 MAHANANDA I G E, YOVITA L V, NEGARA R M. Performance of homogeneous and heterogeneous cache policy for named data network [C]//Proceedings of the 10th International Conference on Information and Communication Technology. Bandung: IEEE, 2022: 120–123.
21 ROSSI D, ROSSINI G. Caching performance of content centric networks under multi-path routing (and more) [R]. Paris: Telecom ParisTech, 2011: 1-6.
22 VAN HASSELT H, GUEZ A, SILVER D. Deep reinforcement learning with double Q-learning [C]// Proceedings of the AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2016: 2094-2100.
23 BEN BEZZIANE M, HASAN S, BRIK B, et al Game theory-based UAV-cloud for service selection architecture in flying ad hoc networks[J]. IEEE Open Journal of Vehicular Technology, 2024, 5: 1692- 1711
doi: 10.1109/OJVT.2024.3430818
[1] 赵庆慧,崔鑫,张艺炜,陈燕. 融合图卷积网络与社交池化的多模态轨迹预测模型[J]. 浙江大学学报(工学版), 2026, 60(5): 989-997.
[2] 周思瑶,夏楠,江佳鸿. 姿态引导的双分支换装行人重识别网络[J]. 浙江大学学报(工学版), 2026, 60(1): 71-80.
[3] 栾宏之,李书才,李屹,钱传杰,梅琼. 5G-A车联网确定性网络中的语义通信系统[J]. 浙江大学学报(工学版), 2025, 59(5): 964-972.
[4] 李宗民,徐畅,白云,鲜世洋,戎光彩. 面向点云理解的双邻域图卷积方法[J]. 浙江大学学报(工学版), 2025, 59(5): 879-889.
[5] 张海波,王新月,王冬宇,刘富. 车联网中基于三方Stackelberg博弈的动态多媒体定价方案[J]. 浙江大学学报(工学版), 2024, 58(9): 1781-1789.
[6] 吴书晗,王丹,陈远方,贾子钰,张越棋,许萌. 融合注意力的滤波器组双视图图卷积运动想象脑电分类[J]. 浙江大学学报(工学版), 2024, 58(7): 1326-1335.
[7] 曾耀平,刘月强,关赛莘,江伟伟,夏玉婷. 能量收集下的D2D-MEC计算卸载[J]. 浙江大学学报(工学版), 2024, 58(5): 967-978.
[8] 何勇禧,韩虎,孔博. 基于多依赖图和知识融合的方面级情感分析模型[J]. 浙江大学学报(工学版), 2024, 58(4): 737-747.
[9] 向晓倩,陈璟. 基于双重注意力时空图卷积网络的行人轨迹预测[J]. 浙江大学学报(工学版), 2024, 58(12): 2586-2595.
[10] 王友卫,王炜琦,凤丽洲,朱建明,李洋. 基于广度-深度采样和图卷积网络的谣言检测方法[J]. 浙江大学学报(工学版), 2024, 58(10): 2040-2052.
[11] 申自浩,唐雨雨,王辉,刘沛骞,刘琨. 基于聚类和深度学习的车联网轨迹隐私保护机制[J]. 浙江大学学报(工学版), 2024, 58(1): 20-28.
[12] 孟闯,王慧. 多信息融合的时空图卷积交通流量预测模型[J]. 浙江大学学报(工学版), 2023, 57(8): 1541-1550.
[13] 王传华,张权,王慧敏,徐欣,麻瓯勃. 区块链架构下具有隐私保护的车联网信誉模型[J]. 浙江大学学报(工学版), 2023, 57(4): 760-772.
[14] 程艳芬,吴家俊,何凡. 基于关系门控图卷积网络的方面级情感分析[J]. 浙江大学学报(工学版), 2023, 57(3): 437-445.
[15] 张京京,张兆功,许鑫. 融合图增强和采样策略的图卷积协同过滤模型[J]. 浙江大学学报(工学版), 2023, 57(2): 243-251.