Industrial image anomaly detection method based on adversarial learning of abnormal features

doi:10.3785/j.issn.1008-973X.2025.12.011

Journal of ZheJiang University (Engineering Science)

2025, Vol. 59

Issue (12): 2566-2575 DOI: 10.3785/j.issn.1008-973X.2025.12.011

Industrial image anomaly detection method based on adversarial learning of abnormal features

Tianfei WANG1(

),Wenjun ZHOU1,*(

),Sheng XIANG2,Yuhang HE1,Bo PENG1

1. School of Computer Science and Software Engineering, Southwest Petroleum University, Chengdu 610500, China
2. School of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China

Download:

HTML

PDF(2791KB) HTML
Export: BibTeX | EndNote (RIS)

Abstract

A novel anomaly detection method named EDA (enhancing anomaly detection via adversarial anomaly learning) was proposed, to address the challenges of industrial image anomaly detection, including the scarcity of anomalous samples, the complexity of annotation, and the high computational cost of deep models. The proposed approach consisted of two key stages. 1) Anomaly learning and embedding stage: a generative adversarial network (GAN) architecture was employed to learn anomalous features. The generator’s parameters were reduced to ensure lightweight design, and subpixel convolution was introduced to enhance anomalous information. Random regions were selected from normal images, refined using the SAM (segment anything) model, and then anomalous features were generated in these refined regions, providing prior anomalous features and corresponding masks for the anomaly detection stage. 2) Anomaly detection stage: a Contrast U-net network was introduced to improve sensitivity to anomalous features and enhance the accuracy of identification and localization through supervised training. Experimental results on the MVTec dataset demonstrated the superior performance of the proposed method, achieving an image-level AUROC of 98.2%, a pixel-level AUROC of 97.8%, and an AU-PR of 81.1%, showing significant advantages and outstanding performance in the field of industrial image anomaly detection and segmentation.

Key words： anomaly detection generative adversarial network anomaly generation contrast operator deep learning

Received: 18 December 2024 Published: 25 November 2025

CLC:

TP 391

Fund: 四川省自然科学基金资助项目（2023NSFSC0504）.

Corresponding Authors: Wenjun ZHOU E-mail: tianfeifeiwang@outlook.com;zhouwenjun@swpu.edu.cn

	Service
	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Tianfei WANG
	Wenjun ZHOU
	Sheng XIANG
	Yuhang HE
	Bo PENG

Cite this article:

Tianfei WANG,Wenjun ZHOU,Sheng XIANG,Yuhang HE,Bo PENG. Industrial image anomaly detection method based on adversarial learning of abnormal features. Journal of ZheJiang University (Engineering Science), 2025, 59(12): 2566-2575.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2025.12.011 OR https://www.zjujournals.com/eng/Y2025/V59/I12/2566

基于异常特征对抗学习的工业图像异常检测方法

为了解决工业图像异常检测中遇到的异常样本稀缺、标注过程复杂及深度模型计算开销大的问题，提出新的异常检测方法EDA. 该方法分为2个阶段. 1）异常学习和嵌入阶段，采用生成式对抗网络（GAN）架构来学习异常特征，通过缩减生成器参数量以保证网络轻量化，引入亚像素卷积以增强异常信息，随后在正常图像中随机选择区域，通过SAM (segment anything)模型进行区域的细化处理，在细化处理后的区域生成异常信息，为异常检测阶段提供先验异常特征及相应掩码. 2）异常检测阶段，引入Contrast U-Net网络利用有监督训练方式增强对异常特征的敏感度，并提升识别与定位的准确性. 在MVTec数据集上进行的实验结果表明，所提方法性能优异，图像级别AUROC为98.2%，像素级别AUROC为97.8%，AU-PR为81.1%，具有显著优势，在图像异常检测分割领域具有出色表现.

关键词： 异常检测, 生成对抗网络, 异常图像生成, 对比度算子, 深度学习

Tab.1 Comparison of commonly used methods for industrial anomaly detection

Fig.1 Overall structure of EDA

Fig.2 Illustration of anomaly learning and embedding

Fig.3 Structure of Bottleneck block

Fig.4 Schematic of mask region generation

Fig.5 Illustration of anomaly detection module

Fig.6 Network structure of anomaly detection module

Fig.7 Examples from MVTec dataset

Tab.2 Image- and pixel-level AUROC comparison results for different models

Tab.3 Pixel-level AU-PR comparison results for different models

Fig.8 Qualitative comparison of different models

Tab.4 Ablation experiment design

Tab.5 Image- and piexl-level AUROC results of ablation experiment

Tab.6 Ablation experiment results of GAN

Tab.7 Comparison of time complexity

Fig.9 Illustration of similarity of real and simulated abnormal image feature distribution

Fig.10 Qualitative result examples for texture image


[1]	吕承侃, 沈飞, 张正涛, 等图像异常检测研究现状综述[J]. 自动化学报, 2022, 48 (6): 1402- 1428 LV Chengkan, SHEN Fei, ZHANG Zhengtao, et al Review of image anomaly detection[J]. Acta Automatica Sinica, 2022, 48 (6): 1402- 1428

[2]	LIU J, XIE G, WANG J, et al Deep industrial image anomaly detection: a survey[J]. Machine Intelligence Research, 2024, 21 (1): 104- 135 doi: 10.1007/s11633-023-1459-z

[3]	KIRILLOV A, MINTUN E, RAVI N, et al. Segment anything [C]// IEEE/CVF International Conference on Computer Vision. Paris: IEEE, 2023: 3992–4003.

[4]	HUANG G, LIU Z, VAN DER MAATEN L, et al. Densely connected convolutional networks [C]// IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 2261–2269.

[5]	SHI W, CABALLERO J, HUSZÁR F, et al. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network [C]// IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 1874–1883.

[6]	BERGMANN P, FAUSER M, SATTLEGGER D, et al. MVTec AD: a comprehensive real-world dataset for unsupervised anomaly detection [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 9584–9592.

[7]	ZHOU W, WANG T, HE Y, et al Contrast U-Net driven by sufficient texture extraction for carotid plaque detection[J]. Mathematical Biosciences and Engineering, 2023, 20 (9): 15623- 15640 doi: 10.3934/mbe.2023697

[8]	HU J, SHEN L, SUN G. Squeeze-and-excitation networks [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132–7141.

[9]	HE Y, XIANG S, ZHOU W, et al. A novel contrast operator for robust object searching [C]// 17th International Conference on Computational Intelligence and Security. Chengdu: IEEE, 2021: 309–313.

[10]	LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection [C]// IEEE International Conference on Computer Vision. Venice: IEEE, 2017: 2999–3007.

[11]	BERMAN M, TRIKI A R, BLASCHKO M B. The lovasz-softmax loss: a tractable surrogate for the optimization of the intersection-over-union measure in neural networks [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 4413–4421.

[12]	DEFARD T, SETKOV A, LOESCH A, et al. PaDiM: a patch distribution modeling framework for anomaly detection and localization [C]// International conference on pattern recognition. Cham: Springer International Publishing, 2021: 475–489.

[13]	BERGMANN P, FAUSER M, SATTLEGGER D, et al. Uninformed students: student-teacher anomaly detection with discriminative latent embeddings [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020: 4182−4191.

[14]	BERGMANN P, LOWS S, FAUSER M, et al. Improving unsupervised defect segmentation by applying structural similarity to autoencoders.

[15]	ZAVRTANIK V, KRISTAN M, SKOČAJ D Reconstruction by inpainting for visual anomaly detection[J]. Pattern Recognition, 2021, 112: 107706 doi: 10.1016/j.patcog.2020.107706

[16]	LI C L, SOHN K, YOON J, et al. CutPaste: self-supervised learning for anomaly detection and localization [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2021: 9659-9669.

[17]	张玥, 陈锡伟, 陈梦丹, 等基于对比学习生成对抗网络的无监督工业品表面异常检测[J]. 电子测量与仪器学报, 2023, 37 (10): 193- 201 ZHANG Yue, CHEN Xiwei, CHEN Mengdan, et al Unsupervised surface anomaly detection of industrial products based on contrastive learning generative adversarial network[J]. Journal of Electronic Measurement and Instrumentation, 2023, 37 (10): 193- 201

[18]	WAN Q, GAO L, LI X, et al Unsupervised image anomaly detection and segmentation based on pretrained feature mapping[J]. IEEE Transactions on Industrial Informatics, 2023, 19 (3): 2330- 2339 doi: 10.1109/TII.2022.3182385

[19]	孔森林, 张辉, 黄镇南, 等面向工业图像异常检测的非对称师生网络模型[J]. 计算机科学, 2024, 51 (Suppl.2): 331- 337 KONG Senlin, ZHANG Hui, HUANG Zhennan, et al Asymmetric teacher-student network model for industrial image anomaly detection[J]. Computer Science, 2024, 51 (Suppl.2): 331- 337

[1]	Jizhong DUAN,Haiyuan LI. Multi-scale parallel magnetic resonance imaging reconstruction based on variational model and Transformer[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(9): 1826-1837.

[2]	Fujian WANG,Zetian ZHANG,Xiqun CHEN,Dianhai WANG. Usage prediction of shared bike based on multi-channel graph aggregation attention mechanism[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(9): 1986-1995.

[3]	Hong ZHANG,Xuecheng ZHANG,Guoqiang WANG,Panlong GU,Nan JIANG. Real-time positioning and control of soft robot based on three-dimensional vision[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(8): 1574-1582.

[4]	Shengju WANG,Zan ZHANG. Missing value imputation algorithm based on accelerated diffusion model[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(7): 1471-1480.

[5]	Dongping ZHANG,Dawei WANG,Shuji HE,Siliang TANG,Zhiyong LIU,Zhongqiu LIU. Remaining useful life prediction of aircraft engines based on cross-dimensional feature fusion[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(7): 1504-1513.

[6]	Yongqing CAI,Cheng HAN,Wei QUAN,Wudi CHEN. Visual induced motion sickness estimation model based on attention mechanism[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1110-1118.

[7]	Jian XIAO,Liangliang WU,Xinze HE,Xin HU. Image feature matching algorithm based on anomaly detection[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1140-1147.

[8]	Lihong WANG,Xinqian LIU,Jing LI,Zhiquan FENG. Network intrusion detection method based on federated learning and spatiotemporal feature fusion[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1201-1210.

[9]	Huizhi XU,Xiuqing WANG. Perception of distance and speed of front vehicle based on vehicle image features[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(6): 1219-1232.

[10]	Zan CHEN,Ran LI,Yuanjing FENG,Yongqiang LI. Video snapshot compressive imaging reconstruction based on temporal super-resolution[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(5): 956-963.

[11]	Qincheng JIANG,Jianfeng TAO,Yangyang WANG,Yulei ZHANG,Chengliang LIU. EWT-LSTM based industrial robot joint anomaly detection[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(5): 982-994.

[12]	Li MA,Yongshun WANG,Yao HU,Lei FAN. Pre-trained long-short spatiotemporal interleaved Transformer for traffic flow prediction applications[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(4): 669-678.

[13]	Qiaohong CHEN,Menghao GUO,Xian FANG,Qi SUN. Image captioning based on cross-modal cascaded diffusion model[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(4): 787-794.

[14]	Zhengyu GU,Feifei LAI,Chen GENG,Ximing WANG,Yakang DAI. Knowledge-guided infarct segmentation of ischemic stroke[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(4): 814-820.

[15]	Minghui YAO,Yueyan WANG,Qiliang WU,Yan NIU,Cong WANG. Siamese networks algorithm based on small human motion behavior recognition[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(3): 504-511.

Viewed

Full text

Abstract

Cited

Shared

Discussed