|
|
|
| Prediction of shield tunneling-induced soil settlement based on multi-head self-attention-Bi-LSTM model |
Minghui YANG1( ),Muyuan SONG1,*( ),Daxi FU2,Yanwei GUO2,Xianzhui LU3,Wencong ZHANG1,Weilong ZHENG1 |
1. School of Architecture and Civil Engineering, Xiamen University, Xiamen 361005, China 2. Henan Zhonggong Design & Research Group Co., Ltd., Zhengzhou 451450, China 3. Geological Engineering Survey in Fujian Province, Fuzhou 350003, China |
|
|
|
Abstract To improve the prediction accuracy of soil settlement induced by shield tunnel construction, a deep learning model was proposed that combined the self-attention (SA) mechanism and multi-head self-attention (MHSA) mechanism separately with the bidirectional long short-term memory (Bi-LSTM) model, effectively capturing the spatiotemporal features and key information within the data. Using the time-series data from multiple sensors as inputs, the model employed a multi-layer bidirectional network architecture and attention mechanisms to capture the vital data features and their internal self-correlation. Based on the actual soil settlement data from a shield tunnel project, hyperparameters such as the number of hidden units and the number of attention units were optimized through cross-validation, and the predictive effects on soil settlement for the Bi-LSTM model before and after the introduction of various attention mechanisms were compared. Results show that the MHSA-Bi-LSTM model achieved optimal performance, with its total mean absolute percentage error (1.27%) showing approximately a 46% decrease over the SA-Bi-LSTM model (2.53%). Both models maintained high prediction accuracy for soil settlement across various engineering scenarios without parameter recalibration, exhibiting total mean absolute percentage errors of 9.06% for the MHSA-Bi-LSTM model and 14.82% for the SA-Bi-LSTM model, indicating strong generalization capability.
|
|
Received: 19 February 2025
Published: 03 February 2026
|
|
|
| Fund: 河南省重大科研专项项目(241111241000);自然资源部丘陵山地地质灾害防治重点实验室自主项目(KY-070000-04-2021-025). |
|
Corresponding Authors:
Muyuan SONG
E-mail: mhyang@xmu.edu.cn;mysong@stu.xmu.edu.cn
|
基于多头自注意力-Bi-LSTM模型的盾构掘进引发的土体沉降预测
为了提高盾构隧道施工引发的土体沉降预测精度,将双向长短期记忆(Bi-LSTM)模型分别结合自注意力(SA)机制和多头自注意力(MHSA)机制,提出有效捕捉数据时空特性和关键信息的深度学习模型. 该模型联合多个传感器的时序数据作为输入,利用多层双向网络架构和注意力机制捕获数据的关键特征及其内部的自相关性. 基于盾构隧道项目中土体沉降实测数据,采用交叉验证法对如隐藏层和注意力单元数量的超参数进行优化,对比引入不同注意力机制前后Bi-LSTM模型的土体沉降预测效果. 结果表明:MHSA-Bi-LSTM模型的预测效果最优,总平均绝对百分误差(1.27%)较SA-Bi-LSTM模型(2.53%)降低了约46%. 所提模型在未经参数重调的情况下对不同工程场景中的土体沉降具备较高预测精度,MHSA-Bi-LSTM和SA-Bi-LSTM的总平均绝对百分比误差分别为9.06%和14.82%,证明所提模型具备良好的泛化性.
关键词:
隧道工程,
沉降预测,
深度学习,
土体沉降,
多头自注意力机制
|
|
| [1] |
中国城市轨道交通协会. 城市轨道交通2023年度统计和分析报告[R/OL]. (2024–03–29)[2025–01–12]. https://www.camet.org.cn/xytj/tjxx/14894.shtml.
|
|
|
| [2] |
张超, 朱闽湘, 郎志雄, 等 基于深度学习的盾构机土舱压力场预测方法[J]. 岩土工程学报, 2024, 46 (2): 307- 315 ZHANG Chao, ZHU Minxiang, LANG Zhixiong, et al Deep learning-based prediction method for chamber pressure field in shield machines[J]. Chinese Journal of Geotechnical Engineering, 2024, 46 (2): 307- 315
|
|
|
| [3] |
王海涛, 苏鹏, 孙昊宇, 等 软岩地层盾构隧道施工引起的地层沉降预测[J]. 岩石力学与工程学报, 2020, 39 (Suppl.2): 3549- 3556 WANG Haitao, SU Peng, SUN Haoyu, et al Prediction of ground settlement caused by shield construction in soft rock ground[J]. Chinese Journal of Rock Mechanics and Engineering, 2020, 39 (Suppl.2): 3549- 3556
doi: 10.13722/j.cnki.jrme.2019.1192
|
|
|
| [4] |
周中, 张俊杰, 丁昊晖, 等 基于GA-Bi-LSTM的盾构隧道下穿既有隧道沉降预测模型[J]. 岩石力学与工程学报, 2023, 42 (1): 224- 234 ZHOU Zhong, ZHANG Junjie, DING Haohui, et al Settlement prediction model of shield tunnel under-crossing existing tunnel based on GA-Bi-LSTM[J]. Chinese Journal of Rock Mechanics and Engineering, 2023, 42 (1): 224- 234
|
|
|
| [5] |
江帅, 朱勇, 栗青, 等 隧道开挖地表沉降动态预测及影响因素分析[J]. 岩土力学, 2022, 43 (1): 195- 204 JIANG Shuai, ZHU Yong, LI Qing, et al Dynamic prediction and influence factors analysis of ground surface settlement during tunnel excavation[J]. Rock and Soil Mechanics, 2022, 43 (1): 195- 204
doi: 10.16285/j.rsm.2021.1201
|
|
|
| [6] |
潘秋景, 吴洪涛, 张子龙, 等 基于多域物理信息神经网络的复合地层隧道掘进地表沉降预测[J]. 岩土力学, 2024, 45 (2): 539- 551 PAN Qiujing, WU Hongtao, ZHANG Zilong, et al Prediction of tunneling-induced ground surface settlement within composite strata using multi-physics-informed neural network[J]. Rock and Soil Mechanics, 2024, 45 (2): 539- 551
|
|
|
| [7] |
陈湘生, 曾仕琪, 韩文龙, 等 机器学习方法在盾构隧道工程中的应用研究现状与展望[J]. 土木与环境工程学报(中英文), 2024, 46 (1): 1- 13 CHEN Xiangsheng, ZENG Shiqi, HAN Wenlong, et al Review and prospect of machine learning method in shield tunnel construction[J]. Journal of Civil and Environmental Engineering, 2024, 46 (1): 1- 13
doi: 10.11835/j.issn.2096-6717.2022.069
|
|
|
| [8] |
YAN K, DAI Y, XU M, et al Tunnel surface settlement forecasting with ensemble learning[J]. Sustainability, 2020, 12 (1): 232
doi: 10.3390/su12010232
|
|
|
| [9] |
YE X W, JIN T, CHEN Y M Machine learning-based forecasting of soil settlement induced by shield tunneling construction[J]. Tunnelling and Underground Space Technology, 2022, 124: 104452
doi: 10.1016/j.tust.2022.104452
|
|
|
| [10] |
SONG M, YANG M, YAO G, et al Artificial intelligence driven tunneling-induced surface settlement prediction[J]. Automation in Construction, 2024, 168: 105819
doi: 10.1016/j.autcon.2024.105819
|
|
|
| [11] |
李洛宾, 龚晓南, 甘晓露, 等 基于循环神经网络的盾构隧道引发地面最大沉降预测[J]. 土木工程学报, 2020, 53 (Suppl.1): 13- 19 LI Luobin, GONG Xiaonan, GAN Xiaolu, et al Prediction of maximum ground settlement induced by shield tunneling based on recurrent neural network[J]. China Civil Engineering Journal, 2020, 53 (Suppl.1): 13- 19
|
|
|
| [12] |
CAO Y, ZHOU X, YAN K Deep learning neural network model for tunnel ground surface settlement prediction based on sensor data[J]. Mathematical Problems in Engineering, 2021, 2021 (1): 9488892
doi: 10.1155/2021/9488892
|
|
|
| [13] |
MA K, CHEN L P, FANG Q, et al Machine learning in conventional tunnel deformation in high in situ stress regions[J]. Symmetry, 2022, 14 (3): 513
doi: 10.3390/sym14030513
|
|
|
| [14] |
LI C, LI J, SHI Z, et al Prediction of surface settlement induced by large-diameter shield tunneling based on machine-learning algorithms[J]. Geofluids, 2022, 2022 (1): 4174768
doi: 10.1155/2022/4174768
|
|
|
| [15] |
YANG M, SONG M, GUO Y, et al Prediction of shield tunneling-induced ground settlement using LSTM architecture enhanced by multi-head self-attention mechanism[J]. Tunnelling and Underground Space Technology, 2025, 161: 106536
doi: 10.1016/j.tust.2025.106536
|
|
|
| [16] |
ZHANG W S, YUAN Y, LONG M, et al Prediction of surface settlement around subway foundation pits based on spatiotemporal characteristics and deep learning models[J]. Computers and Geotechnics, 2024, 168: 106149
doi: 10.1016/j.compgeo.2024.106149
|
|
|
| [17] |
SEON P, HWAN A, JUN P, et al Convolutional neural network-based safety evaluation method for structures with dynamic responses[J]. Expert Systems with Applications, 2020, 158: 113634
doi: 10.1016/j.eswa.2020.113634
|
|
|
| [18] |
CHEN C, TANG L, LU Y, et al Reconstruction of long-term strain data for structural health monitoring with a hybrid deep-learning and autoregressive model considering thermal effects[J]. Engineering Structures, 2023, 285: 116063
doi: 10.1016/j.engstruct.2023.116063
|
|
|
| [19] |
吴伟强. 基于CNN-LSTM的采空区地表沉降预测 [D]. 绵阳: 西南科技大学, 2022: 1–67. WU Weiqiang. Prediction of surface settlement in goaf based on CNN-LSTM [D]. Mianyang: Southwest University of Science and Technology, 2022: 1–67.
|
|
|
| [20] |
洪宇超, 钱建固, 叶源新, 等 基于时空关联特征的CNN-LSTM模型在基坑工程变形预测中的应用[J]. 岩土工程学报, 2021, 43 (Suppl.2): 108- 111 HONG Yuchao, QIAN Jiangu, YE Yuanxin, et al Application of CNN-LSTM model based on spatiotemporal correlation characteristics in deformation prediction of excavation engineering[J]. Chinese Journal of Geotechnical Engineering, 2021, 43 (Suppl.2): 108- 111
|
|
|
| [21] |
LU Y, TANG L, CHEN C, et al Reconstruction of structural long-term acceleration response based on BiLSTM networks[J]. Engineering Structures, 2023, 285: 116000
doi: 10.1016/j.engstruct.2023.116000
|
|
|
| [22] |
JEONG S, FERGUSON M, HOU R, et al Sensor data reconstruction using bidirectional recurrent neural network with application to bridge monitoring[J]. Advanced Engineering Informatics, 2019, 42: 100991
doi: 10.1016/j.aei.2019.100991
|
|
|
| [23] |
VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. [S.l.]: NIPS, 2017: 6000–6010.
|
|
|
| [24] |
ZHANG W, ZHANG P, YU Y, et al Missing data repairs for traffic flow with self-attention generative adversarial imputation net[J]. IEEE Transactions on Intelligent Transportation Systems, 2022, 23 (7): 7919- 7930
doi: 10.1109/TITS.2021.3074564
|
|
|
| [25] |
FAN G, HE Z, LI J Structural dynamic response reconstruction using self-attention enhanced generative adversarial networks[J]. Engineering Structures, 2023, 276: 115334
doi: 10.1016/j.engstruct.2022.115334
|
|
|
| [26] |
高墨通, 杨维芳, 刘祖昱, 等 结合卷积神经网络和注意力机制的LSTM采空区地表沉降预测方法[J]. 测绘通报, 2024, (6): 53- 58 GAO Motong, YANG Weifang, LIU Zuyu, et al LSTM goaf surface subsidence prediction method combining convolutional neural network and attention mechanism[J]. Bulletin of Surveying and Mapping, 2024, (6): 53- 58
doi: 10.13474/j.cnki.11-2246.2024.0610
|
|
|
| [27] |
HOCHREITER S, SCHMIDHUBER J Long short-term memory[J]. Neural Computation, 1997, 9 (8): 1735- 1780
doi: 10.1162/neco.1997.9.8.1735
|
|
|
| [28] |
MAHMOODZADEH A, MOHAMMADI M, DARAEI A, et al Forecasting maximum surface settlement caused by urban tunneling[J]. Automation in Construction, 2020, 120: 103375
doi: 10.1016/j.autcon.2020.103375
|
|
|
| [29] |
SALMAN A G, HERYADI Y, ABDURAHMAN E, et al Single layer & multi-layer long short-term memory (LSTM) model with intermediate variables for weather forecasting[J]. Procedia Computer Science, 2018, 135: 89- 98
doi: 10.1016/j.procs.2018.08.153
|
|
|
| [30] |
FATHNEJAT H, AHMADI-NEDUSHAN B, HOSSEININEJAD S, et al A data-driven structural damage identification approach using deep convolutional-attention-recurrent neural architecture under temperature variations[J]. Engineering Structures, 2023, 276: 115311
doi: 10.1016/j.engstruct.2022.115311
|
|
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
| |
Shared |
|
|
|
|
| |
Discussed |
|
|
|
|