| 土木与建筑工程 |
|
|
|
|
| 微调稳定扩散模型的裂缝图像数据集扩充方法 |
吴杰1( ),韩贝林1,张舣航1,邹超2,辛莉峰3,黄仕平4,*( ) |
1. 武汉轻工大学 土木工程与建筑学院,湖北 武汉 430023 2. 广东工业大学 土木与交通工程学院,广东 广州 510006 3. 西北工业大学 力学与交通运载工程学院,陕西 西安 710072 4. 华南理工大学 土木与交通学院,广东 广州 510641 |
|
| Method for augmenting crack image datasets via fine-tuning of stable diffusion models |
Jie WU1( ),Beilin HAN1,Yihang ZHANG1,Chao ZOU2,Lifeng XIN3,Shiping HUANG4,*( ) |
1. School of Civil Engineering and Architecture, Wuhan Polytechnic University, Wuhan 430023, China 2. School of Civil and Transportation Engineering, Guangdong University of Technology, Guangzhou 510006, China 3. School of Mechanics and Transportation Engineering, Northwestern Polytechnical University, Xi’an 710072, China 4. School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510641, China |
引用本文:
吴杰,韩贝林,张舣航,邹超,辛莉峰,黄仕平. 微调稳定扩散模型的裂缝图像数据集扩充方法[J]. 浙江大学学报(工学版), 2026, 60(5): 926-934.
Jie WU,Beilin HAN,Yihang ZHANG,Chao ZOU,Lifeng XIN,Shiping HUANG. Method for augmenting crack image datasets via fine-tuning of stable diffusion models. Journal of ZheJiang University (Engineering Science), 2026, 60(5): 926-934.
链接本文:
https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.05.002
或
https://www.zjujournals.com/eng/CN/Y2026/V60/I5/926
|
| 1 |
SUN L, SHANG Z, XIA Y, et al Review of bridge structural health monitoring aided by big data and artificial intelligence: from condition assessment to damage detection[J]. Journal of Structural Engineering, 2020, 146 (5): 04020073
doi: 10.1061/(ASCE)ST.1943-541X.0002535
|
| 2 |
DENG L, SUN T, YANG L, et al Binocular video-based 3D reconstruction and length quantification of cracks in concrete structures[J]. Automation in Construction, 2023, 148: 104743
doi: 10.1016/j.autcon.2023.104743
|
| 3 |
CHA Y J, ALI R, LEWIS J, et al Deep learning-based structural health monitoring[J]. Automation in Construction, 2024, 161: 105328
doi: 10.1016/j.autcon.2024.105328
|
| 4 |
吴杰, 黄楚越, 韩贝林, 等 基于深度学习和图像处理的螺栓损伤检测[J]. 哈尔滨工程大学学报, 2025, 46 (9): 1754- 1764 WU Jie, HUANG Chuyue, HAN Beilin, et al Bolt damage detection based on deep learning and image processing[J]. Journal of Harbin Engineering University, 2025, 46 (9): 1754- 1764
doi: 10.11990/jheu.202403035
|
| 5 |
MAEDA H, KASHIYAMA T, SEKIMOTO Y, et al Generative adversarial network for road damage detection[J]. Computer-Aided Civil and Infrastructure Engineering, 2021, 36 (1): 47- 60
doi: 10.1111/mice.12561
|
| 6 |
DEEPA D, SIVASANGARI A ESSR-GAN: enhanced super and semi supervised remora resolution based generative adversarial learning framework model for smartphone based road damage detection[J]. Multimedia Tools and Applications, 2024, 83 (2): 5099- 5129
doi: 10.1007/s11042-023-15850-8
|
| 7 |
XU B, LIU C Pavement crack detection algorithm based on generative adversarial network and convolutional neural network under small samples[J]. Measurement, 2022, 196: 111219
doi: 10.1016/j.measurement.2022.111219
|
| 8 |
CRESWELL A, WHITE T, DUMOULIN V, et al Generative adversarial networks: an overview[J]. IEEE Signal Processing Magazine, 2018, 35 (1): 53- 65
doi: 10.20944/preprints202212.0191.v1
|
| 9 |
ZHONG J, JU H, ZHANG W, et al A deeper generative adversarial network for grooved cement concrete pavement crack detection[J]. Engineering Applications of Artificial Intelligence, 2023, 119: 105808
doi: 10.1016/j.engappai.2022.105808
|
| 10 |
DONG J, WANG N, FANG H, et al Innovative method for pavement multiple damages segmentation and measurement by the Road-Seg-CapsNet of feature fusion[J]. Construction and Building Materials, 2022, 324: 126719
doi: 10.1016/j.conbuildmat.2022.126719
|
| 11 |
DUNPHY K, FEKRI M N, GROLINGER K, et al Data augmentation for deep-learning-based multiclass structural damage detection using limited information[J]. Sensors, 2022, 22 (16): 6193
doi: 10.3390/s22166193
|
| 12 |
PEI L, SUN Z, XIAO L, et al Virtual generation of pavement crack images based on improved deep convolutional generative adversarial network[J]. Engineering Applications of Artificial Intelligence, 2021, 104: 104376
doi: 10.1016/j.engappai.2021.104376
|
| 13 |
程风雯, 甘进, 李星, 等 基于DCGAN的水下结构物表面缺陷图像生成[J]. 长江科学院院报, 2023, 40 (9): 155- 161 CHENG Fengwen, GAN Jin, LI Xing, et al Image generation for surface defects of underwater structures based on deep convolutional generative adversarial networks[J]. Journal of Changjiang River Scientific Research Institute, 2023, 40 (9): 155- 161
doi: 10.11988/ckyyb.20220421
|
| 14 |
赵阳, 康飞, 万刚 基于改进CycleGAN与YOLOv8s的混凝土坝水下裂缝识别方法[J]. 水电能源科学, 2025, 43 (4): 158- 162 ZHAO Yang, KANG Fei, WAN Gang Underwater crack identification for concrete dams based on improved CycleGAN and YOLOv8s[J]. Water Resources and Power, 2025, 43 (4): 158- 162
doi: 10.20040/j.cnki.1000-7709.2025.20240874
|
| 15 |
吴海鸣, 陈敬玉 基于AIGC技术的民族服饰设计研究: 以畲族为例[J]. 丝绸, 2025, 62 (1): 20- 29 WU Haiming, CHEN Jingyu Research on ethnic costume design based on AIGC technology: taking the She ethnic group as an example[J]. Silk, 2025, 62 (1): 20- 29
doi: 10.3969/j.issn.1001-7003.2025.01.003
|
| 16 |
师妹华 AIGC技术赋能江苏动画产业研究与应用[J]. 天津美术学院学报, 2024, (3): 18- 21 SHI Meihua Research and application of AIGC technology empowering animation industry in Jiangsu Province[J]. Journal of Tianjin Academy of Fine Arts, 2024, (3): 18- 21
doi: 10.3969/j.issn.1008-8822.2024.03.004
|
| 17 |
HU E J, SHEN Y, WALLIS P, et al. LoRA: low-rank adaptation of large language models [EB/OL]. (2021–10–16)[2025–07–12]. https://arxiv.org/pdf/2106.09685.
|
| 18 |
NICHOL A Q, DHARIWAL P. Improved denoising diffusion probabilistic models [C]// 38th International Conference on Machine Learning. [S.l.]: ML Research Press, 2021: 8162−8171.
|
| 19 |
ROMBACH R, BLATTMANN A, LORENZ D, et al. High-resolution image synthesis with latent diffusion models [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New Orleans. IEEE, 2022: 10674–10685.
|
| 20 |
罗义凯, 徐金华, 李昱燃, 等 基于时空关联和异构图卷积的车道级流量预测[J]. 哈尔滨工业大学学报, 2025, 57 (11): 62- 70 LUO Yikai, XU Jinhua, LI Yuran, et al Lane-level traffic flow prediction based on spatiotemporal correlation and heterogeneous graph convolution[J]. Journal of Harbin Institute of Technology, 2025, 57 (11): 62- 70
doi: 10.11918/202407040
|
| 21 |
RADFORD A, METZ L, CHINTALA S. Unsupervised representation learning with deep convolutional generative adversarial networks [EB/OL]. (2015−11−19) [2026−04−10]. https://arxiv.org/pdf/1511.06434.
|
| 22 |
GULRAJANI I, AHMED F, ARJOVSKY M, et al. Improved training of Wasserstein GANs [C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. [S.l.]: Curran Associates Inc., 2017: 5769−5779.
|
| 23 |
KARRAS T, LAINE S, AILA T. A style-based generator architecture for generative adversarial networks [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 4401−4410
|
| 24 |
RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation [C]// Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. [S.l.]: Springer, 2015: 234–241.
|
| 25 |
CHEN J, LU Y, YU Q, et al. TransUNet: transformers make strong encoders for medical image segmentation [EB/OL]. (2021–02–08)[2025–07–12]. https://arxiv.org/pdf/2102.04306.
|
| 26 |
MEHTA S, RASTEGARI M. MobileVIT: light-weight, general-purpose, and mobile-friendly vision transformer [EB/OL]. (2022–03–04)[2025–07–12]. https://arxiv.org/pdf/2110.02178.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
| |
Shared |
|
|
|
|
| |
Discussed |
|
|
|
|