基于集成时序预测模型的视频聚合平台监测预警方法

doi:10.3785/j.issn.1008-973X.2025.06.010

浙江大学学报(工学版)

2025, Vol. 59

Issue (6): 1191-1200 DOI: 10.3785/j.issn.1008-973X.2025.06.010

计算机技术

基于集成时序预测模型的视频聚合平台监测预警方法

宋雪1(

),嵇程2,3,*(

)

1. 国家计算机网络应急技术处理协调中心山东分中心，山东济南 250002
2. 南京大学计算机学院，江苏南京 210008
3. 国家计算机网络应急技术处理协调中心江苏分中心，江苏南京 210003

Surveillance and alerting approach for video aggregation platforms predicated upon ensemble time series forecasting model

Xue SONG1(

),Cheng JI2,3,*(

)

1. Shandong Branch of National Computer Network Emergency Response Technical Team, Jinan 250002, China
2. School of Computer Science, Nanjing University, Nanjing 210008, China
3. Jiangsu Branch of National Computer Network Emergency Response Technical Team, Nanjing 210003, China

全文: PDF(2638 KB) HTML

摘要：

为了防范深度链接视频聚合平台带来的侵权风险及内容安全隐患，发现并提醒通过非法途径访问此类平台的网络用户，提出基于集成时序预测模型的视频聚合平台监测预警方法. 根据多个视频聚合平台的网络行为日志数据，以IP地址为用户维度，以天为时间维度，提取用户在平台侧和渠道侧的网络行为特征. 选择长短期时间序列网络（LSTNet）、循环神经网络（RNN）和多层感知机（MLP）3个模型作为基模型，构造Stacking集成学习模型，通过Stacking集成模型学习基模型特征从而实现对用户访问行为的预测. 进行对比实验和回测实验，结果表明，本研究方法相比于单模型预测方法，在均方误差（MSE）指标上降低0.9724，在平均绝对误差（MAE）指标上降低0.5443，在自定义平衡准确率（BAC）指标上提升0.20，能够对视频聚合平台访问情况进行预测从而实现对高风险用户行为的预警.

关键词： 视频聚合平台; 时序预测; 集成学习; 网络行为; 监测预警

Abstract:

A surveillance and alerting mechanism for video aggregation platforms based on an ensemble time series forecasting model was proposed, in order to mitigate the risks of copyright infringement and content security brought by deep linking video aggregation platforms, as well as to facilitate the prompt detection and notification of network users who engaged with such platforms through illicit means. Initially, the network behavioral log data from multiple video aggregation platforms were leveraged. The network behavior characteristics of users were then extracted with IP address as the user dimension and day as the time dimension, on both the platform side and the channel side. Subsequently, long- and short-term time-series networks (LSTNet), recurrent neural networks (RNN) and multilayer perceptron (MLP) were harnessed as foundational models to construct a Stacking ensemble learning model for predicting user access behavior by learning features from base model. Ultimately, empirical validation was conducted through comparative and backtesting experiments. Results showed that the proposed method achieved a notable decrease of 0.9724 in mean squared error (MSE), a significant reduction of 0.5443 in mean absolute error (MAE), and a moderate improvement of 0.20 in balanced accuracy (BAC). The proposed method could effectively forecast access patterns to video aggregation platforms and provide early warnings for high-risk user behavior.

Key words: video aggregation platform time series forecasting ensemble learning network behavior monitoring and warning

收稿日期: 2024-05-01 出版日期: 2025-05-30

CLC:

TP 309

基金资助: 国家自然科学基金面上资助项目(62272125) .

通讯作者: 嵇程 E-mail: 2777432504@qq.com;jicheng01@foxmail.com

作者简介: 宋雪（1995—），女，助理工程师，硕士，从事信息安全研究. orcid.org/0009-0003-2363-9069. E-mail：2777432504@qq.com

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	宋雪
	嵇程

引用本文:

宋雪,嵇程. 基于集成时序预测模型的视频聚合平台监测预警方法[J]. 浙江大学学报(工学版), 2025, 59(6): 1191-1200.

Xue SONG,Cheng JI. Surveillance and alerting approach for video aggregation platforms predicated upon ensemble time series forecasting model. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1191-1200.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2025.06.010 或 https://www.zjujournals.com/eng/CN/Y2025/V59/I6/1191

图 1 LSTNet模型结构

图 2 RNN模型结构

图 3 MLP模型结构

图 4 Stacking集成流程

表 1 视频源域名访问日志

表 2 视频聚合平台访问日志

图 5 用户通过视频聚合平台访问视频源的4种情况

表 3 视频源侧与渠道侧目标特征

图 6 重点特征与目标结果的变化趋势图

图 7 视频源侧及目标侧特征热力图

图 8 访问视频源次数与总链接数的箱型图

表 4 测试实验环境选择

表 5 集成模型部分关键参数

表 6 单一模型实验结果

表 7 基模型实验结果

表 8 集成方式实验结果

表 9 元模型实验结果

图 9 集成模型回测方式

表 10 回测试验结果

图 10 第10步预测结果的特征贡献值分布图

图 11 全局特征重要性分布图

1	刘晓庆, 万柯视频聚合平台的版权侵权责任[J]. 中国版权, 2014, (4): 44- 47 LIU Xiaoqing, WAN Ke Copyright infringement liability of video aggregation platform[J]. China Copyright, 2014, (4): 44- 47 doi: 10.3969/j.issn.1671-4717.2014.04.013
2	徐晖. 视频聚合平台深度链接行为的侵权认定标准研究 [D]. 长春: 吉林大学, 2022. XU Hui. Research on infringement identification standards of deep linking behavior of video aggregation platform [D]. Changchun: Jilin University, 2022.
3	李怡璇. 视频聚合平台的侵权责任研究: 以“盗链” 行为为例 [D]. 济南: 山东大学, 2020. LI Yixuan. Research on infringement liability of video aggregation platform: taking “hotlinking” as an example [D]. Jinan: Shandong University, 2020.
4	徐珉川论互联网“提供作品” 行为的界定[J]. 中外法学, 2020, 32 (2): 378- 401 XU Minchuan On the definition of making work available[J]. Peking University Law Journal, 2020, 32 (2): 378- 401
5	何昊天. 网络环境下著作权默示许可制度研究 [D]. 济南: 山东大学, 2022. HE Haotian. Study on the implied license of copyright in the network environment [D]. Jinan: Shandong University, 2022.
6	刘友华, 魏远山聚合分发平台与传统新闻出版者的著作权冲突及解决[J]. 新闻与传播研究, 2018, 25 (5): 69- 87,127 LIU Youhua, WEI Yuanshan Copyright conflicts and solutions between aggregation distribution platforms and traditional news publishers[J]. Journalism and Communication, 2018, 25 (5): 69- 87,127
7	黎维, 陶蔚, 周星宇, 等时空序列预测方法综述[J]. 计算机应用研究, 2020, 37 (10): 2881- 2888 LI Wei, TAO Wei, ZHOU Xingyu, et al Survey of spatio-temporal sequence prediction methods[J]. Application Research of Computers, 2020, 37 (10): 2881- 2888
8	危婷, 张宏海, 蔺小丽, 等云服务网站用户复访行为预测模型研究[J]. 数据与计算发展前沿, 2022, 4 (3): 124- 130 WEI Ting, ZHANG Honghai, LIN Xiaoli, et al Predictive model of the revisit behavior of cloud service site users[J]. Frontiers of Data and Computing, 2022, 4 (3): 124- 130
9	姚丽, 崔超然, 马乐乐, 等基于校园上网行为感知的学生成绩预测方法[J]. 计算机研究与发展, 2022, 59 (8): 1770- 1781 YAO Li, CUI Chaoran, MA Lele, et al Student performance prediction base on campus online behavior-aware[J]. Journal of Computer Research and Development, 2022, 59 (8): 1770- 1781 doi: 10.7544/issn1000-1239.20220060
10	周胜利, 徐啸炀基于网络流量的用户网络行为被害性分析模型[J]. 电信科学, 2021, 37 (2): 125- 134 ZHOU Shengli, XU Xiaoyang Victimization analysis model of user network behavior based on network traffic[J]. Telecommunications Science, 2021, 37 (2): 125- 134 doi: 10.11959/j.issn.1000-0801.2021041
11	杨晨. 基于DNS流量的用户访问行为分析研究 [D]. 广州: 广州大学, 2022. YANG Chen. Analysis and research on users’ access behavior based on DNS traffic [D]. Guangzhou: Guangzhou University, 2022.
12	魏佳代. 基于DNS日志的用户访问行为分析和研究 [D]. 北京: 北京交通大学, 2019. WEI Jiadai. Analysis and research of user access behavior based on DNS logs [D]. Beijing: Beijing Jiaotong University, 2019.
13	马艺闻视频聚合平台侵权行为的法律定性[J]. 区域治理, 2019, (38): 119- 121 MA Yiwen Legal qualification of infringement behavior of video aggregation platforms[J]. Regional Governance, 2019, (38): 119- 121 doi: 10.3969/j.issn.2096-4595.2019.38.045
14	张晨曦. 智媒体背景下新闻编辑业务创新研究: 以新闻聚合平台为例 [D]. 吉林: 吉林大学, 2021. ZHANG Chenxi. Research on news editing business in the age of intelligence media: take news aggregation platforms as an example [D]. Jilin: Jilin University, 2021.
15	刘溪视频聚合平台经营者盗链行为侵害作品信息网络传播权的司法认定[J]. 法制与社会, 2019, (20): 58- 59 LIU Xi Judicial determination of infringement of right to communicate works through information networks by operator’s theft linking behavior of video aggregation platforms[J]. Legal System and Society, 2019, (20): 58- 59
16	LAI G, CHANG W C, YANG Y, et al. Modeling long-and short-term temporal patterns with deep neural networks [C]// 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. Ann Arbor: ACM, 2018: 95–104.
17	ELMAN J L Finding structure in time[J]. Cognitive science, 1990, 14 (2): 179- 211 doi: 10.1207/s15516709cog1402_1

[1]	周欣磊,顾海挺,刘晶,许月萍,耿芳,王冲. 基于集成学习与深度学习的日供水量预测方法[J]. 浙江大学学报(工学版), 2023, 57(6): 1120-1127.
[2]	葛志辉,邢江宽,罗坤,樊建人. 基于样本优选的集成学习在脱硫优化中的应用[J]. 浙江大学学报(工学版), 2021, 55(8): 1566-1575.
[3]	王友卫,凤丽洲. 基于合群度-隶属度噪声检测及动态特征选择的改进AdaBoost算法[J]. 浙江大学学报(工学版), 2021, 55(2): 367-376.
[4]	罗娜, 魏松杰, 时召伟, 吴高翔. 采用LSTM模型的Android应用行为一致性检测[J]. 浙江大学学报(工学版), 2018, 52(6): 1097-1106.
[5]	刘如辉, 黄炜平, 王凯, 刘创, 梁军. 半监督约束集成的快速密度峰值聚类算法[J]. 浙江大学学报(工学版), 2018, 52(11): 2191-2200.
[6]	罗建宏,陈德钊. 兼顾正确率和差异性的自适应集成算法及应用[J]. J4, 2011, 45(3): 557-562.
[7]	谷雨, 李平, 韩波. 基于分层粒子滤波的地标检测与跟踪[J]. J4, 2010, 44(4): 687-691.

Viewed

Full text

Abstract

Cited

Shared

Discussed