Civil and Traffic Engineering |
|
|
|
|
Urban travel time prediction based on gradient boosting regression tress |
GONG Yue1, LUO Xiao-Qin1, WANG Dian-hai1, YANG Shao-hui2 |
1. College of Civil Engineering and Architecture, Zhejiang University, Hangzhou 310058, China;
2. China Academy of Urban Planning & Design, Beijing 100037, China |
|
|
Abstract A new method based on gradient boosting regression tress was proposed in order to improve the prediction accuracy of travel time, considering the correlation of the time series, and also took into account spatial correlation. First, massive data collected by license plate recognition equipment was preprocessed, and the missing data problem was solved by corresponding data completion algorithm. Then the travel time historical data set was established. By analyzing the correlation between the different influence factors and travel time, the feature vector was established. Moreover, in order to better understand the model, the importance of each feature vector was proposed by gradient boosting regression tress model. Finally, the actual data was used to evaluate the model, and the average absolute error percentage of the travel time is about 10.0%. Compared with SVM, ARIMA and other methods, the proposed method has higher accuracy.
|
Received: 06 May 2017
Published: 11 September 2018
|
|
基于梯度提升回归树的城市道路行程时间预测
为了提高行程时间的预测精度,在考虑时间序列相关性的同时,分析相邻路段的空间相关性对于行程时间的影响,并提出基于梯度提升回归树模型的城市道路行程时间预测方法.对车牌识别设备获取的实际数据进行预处理,并提出相应的补全算法以解决数据缺失问题,建立完整的历史数据集.通过分析各影响因素与行程时间的相关性,构建特征向量.为了能更好地理解模型,通过梯度提升回归树模型输出各变量对于预测结果的重要度.利用实际数据对模型进行评估,预测行程时间的平均绝对误差百分比,约为10.0%.与SVM、ARIMA等方法相比,所提方法具有较高的精度.
|
|
[1] 付凤杰, 龚越, 王殿海,等. 基于高清智能卡口路段行程时间的数据质量分析[J]. 浙江大学学报:工学版,2016,50(9):1761-1767. FU Feng-jie, GONG Yue, WANG Dian-hai, et al. Data quality analysis of link travel time based on HD smart gate[J]. Journal of Zhejiang University:Engineering Science, 2016,50(9):1761-1767.
[2] WILLIAMS B,DURVASULA P, BROWN D. Urban freeway traffic flow prediction:application of seasonal autoregressive integrated moving average and exponential smoothing models[J]. Transportation Research Record:Journal of the Transportation Research Board, 1998(1644):132-141.
[3] WILLIAMS B M, HOEL L A. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process:Theoretical basis and empirical results[J]. Journal of transportation engineering, 2003, 129(6):664-672.
[4] 胡继华, 李国源, 程智锋. 基于马尔科夫链的公交站间行程时间预测算法[J]. 交通信息与安全, 2014, 32(2):17-22. HU Ji-hua, LI Guo-yuan, CHEN Zhi-feng. Algorithm for predicting bus travel time between stops based on markov chain[J].Journal of Transport Information and Safety, 2014, 32(2):17-22.
[5] JI Y J, TANG D N, GUO W H, et al. Forecasting available parking space with largest Lyapunov exponents method[J]. Journal of Central South University, 2014, 21(4):1624-1632.
[6] 李进燕, 朱征宇, 刘琳, 等. 基于简化路网模型的卡尔曼滤波多步行程时间预测方法[J]. 系统工程理论与实践, 2013, 33(5):1289-1297. LI Jin-yan, ZHU Zheng-yu, LIU Ling, et al. Multi-step kalman filtering travel time estimation method based on simplified road network model[J].Systems Engineering:Theory & Practice, 2013,33(5):1289-1297.
[7] 丁宏飞, 李演洪, 刘博, 等. 基于BP神经网络与SVM的快速路行程时间组合预测研究[J]. 计算机应用研究, 2016, 33(10):2929-2932. DING Hong-fei, LI Yan-hong, LIU Bo, et al. Expressway's travel time prediction based on combined BP neural network and support vector machine approach[J]. Application Research of Computers, 2016, 33(10):2929-2932.
[8] 王翔, 陈小鸿, 杨祥妹. 基于K最近邻算法的高速公路短时行程时间预测[J]. 中国公路学报, 2015, 28(1):102-111. WANG Xiang, CHEN Xiao-hong, YANG Xiang-mei. Short term prediction of expressway travel time based on K nearest neighbor algorithm[J]. China Journal of Highway and Transport, 2015, 28(1):102-111.
[9] 张娟, 孙剑. 基于SVM的城市快速路行程时间预测研究[J]. 交通运输系统工程与信息, 2011, 11(2):174-179. ZHANG Juan, SUN Jian. Prediction of urban express way travel time based on SVM[J]. Journal of Transportation Systems Engineering and Information Technology, 2011, 11(2):174-179.
[10] AHMED M, ABDEL-ATY M. Application of stochastic gradient boosting technique to enhance reliability of real-time risk assessment:use of automatic vehicle identification and remote traffic microwave sensor data[J]. Transportation Research Record:Journal of the Transportation Research Board, 2013(2386):26-34.
[11] CHUNG Y S. Factor complexity of crash occurrence:An empirical demonstration using boosted regression trees[J]. Accident Analysis & Prevention, 2013, 61:107-118.
[12] LIU Q C, LU J, CHEN S U. Design and analysis of traffic incident detection based on random forest[J]. Journal of Southeast University, 2014, 1:88-95.
[13] ZHANG Y, HAGHANI A. A gradient boosting method to improve travel time prediction[J]. Transportation Research Part C:Emerging Technologies, 2015, 58:308-324.
[14] FRIEDMAN J H. Greedy function approximation:a gradient boosting machine[J]. Annals of Statistics, 2001:1189-1232.
[15] 范金城, 梅长林. 数据分析[M]. 北京:科学出版社, 2002:6. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|