Please wait a minute...
Journal of ZheJiang University (Engineering Science)  2023, Vol. 57 Issue (8): 1487-1494    DOI: 10.3785/j.issn.1008-973X.2023.08.002
    
Image super-resolution reconstruction based on dynamic attention network
Xiao-qiang ZHAO1,2,3(),Ze WANG1,Zhao-yang SONG1,Hong-mei JIANG1,2,3
1. School of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China
2. Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou 730050, China
3. National Experimental Teaching Center of Electrical and Control Engineering, Lanzhou University of Technology, Lanzhou 730050, China
Download: HTML     PDF(1196KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

The image super-resolution algorithm adopts the same processing mode in channels and spatial domains with different importance, which leads to the failure of computing resources to concentrate on important features. Aiming at the above problem, an image super-resolution algorithm based on dynamic attention network was proposed. Firstly, the existing way of equalizing attention mechanisms was changed, and dynamic learning weights were assigned to different attention mechanisms by constructed dynamic attention modules, by which high-frequency information more needed by the network was obtained and high-quality pictures were reconstructed. Secondly, the double butterfly structure was constructed through feature reuse , which fully integrated the information from the two branches of attention and compensated for the missing feature information between the different attention mechanisms. Finally, model evaluation was conducted on Set5, Set14, BSD100, Urban100 and Manga109 datasets. Results show that the proposed algorithm has better overall performance than other mainstream super-resolution algorithms. When the amplification factor was 4, compared with the sub-optimal algorithm, the peak signal-to-noise ratio values were improved by 0.06, 0.07, 0.04, 0.15 and 0.15 dB, respectively, on the above five public test sets.



Key wordsimage processing      image super-resolution      attention mechanism      dynamic convolution      double butterfly structure     
Received: 10 October 2022      Published: 31 August 2023
CLC:  TN 391  
Fund:  国家自然科学基金资助项目(62263021);国家重点研发计划资助项目(2020YFB1713600);甘肃省科技计划资助项目(21YF5GA072, 21JR7RA206)
Cite this article:

Xiao-qiang ZHAO,Ze WANG,Zhao-yang SONG,Hong-mei JIANG. Image super-resolution reconstruction based on dynamic attention network. Journal of ZheJiang University (Engineering Science), 2023, 57(8): 1487-1494.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2023.08.002     OR     https://www.zjujournals.com/eng/Y2023/V57/I8/1487


基于动态注意力网络的图像超分辨率重建

针对图像超分辨率算法在具有不同重要性的通道和空间域上采取相同的处理方式,导致计算资源无法集中利用到重要特征上的问题,提出基于动态注意力网络的图像超分辨率算法. 该算法改变了现有均等处理注意力机制的方式,通过构建的动态注意力模块对不同的注意力机制赋予动态学习的权重,以获取网络更需要的高频信息,重建高质量图片;通过特征重用的方式构建双蝶式结构,充分融合2个注意力分支的信息,弥补不同注意力机制间所缺失的特征信息. 在Set5、Set14、BSD100、Urban100和Manga109数据集上的模型评估结果表明,相较于其他主流超分辨率算法,本研究所提算法整体性能表现更好. 当放大因子为4时,相较于次优算法,所提算法在5个公开测试集上的峰值信噪比分别提升了0.06、0.07、0.04、0.15、0.15 dB.


关键词: 图像处理,  图像超分辨率,  注意力机制,  动态卷积,  双蝶式结构 
Fig.1 Dynamic convolution framework diagram
Fig.2 Dynamic attention network structure diagram
Fig.3 Dynamic attention module structure diagram
Fig.4 Two kinds of attention structure chart
Fig.5 Double butterfly construction
Fig.6 Structure of spatial attention
方法 倍数 参数量/K PSNR(dB)/SSIM
Set5 Set14 BSD100 Urban100 Manga109
Bicubic 2 33.68/0.9304 30.24/0.8691 29.56/0.8435 26.88/0.8405 30.80/0.9339
IMDN 694 38.00/0.9605 33.63/0.9177 32.19/0.8996 32.17/0.9283 38.88/0.977 4
MemNet 677 37.78/0.9597 33.28/0.9142 32.08/0.8978 31.31/0.9195 37.72/0.9740
CARN 1592 37.76/0.9590 33.52/0.9166 32.09/0.8978 31.92/0.9256 38.36/0.9765
EDSR-baseline 1370 37.99/0.9604 33.57/0.9175 32.16/0.8994 31.98/0.9272 38.45/0.9770
SRMDNF 1513 37.79/0.9600 33.32/0.9150 32.05/0.8980 31.33/0.9200
SeaNet-baseline 2102 37.99/0.9607 33.60/0.9174 32.18/0.8995 32.08/0.9276 38.48/0.9768
Cross-SRN 1296 38.03/0.9606 33.62/0.9180 32.19/0.8997 32.28/0.9290 38.75/0.9773
DAN(本研究) 1298 38.04/0.9608 33.64/0.9180 32.20/0.8998 32.26/0.9296 38.72/0.9773
Bicubic 3 30.93/0.8682 27.55/0.7742 27.21/0.7385 24.46/0.7349 26.95/0.8556
IMDN 703 34.36/0.9270 30.32/0.8417 29.09/0.8046 28.17/0.8519 33.61/0.9445
MemNet] 677 34.09/0.9248 30.00/0.8350 28.96/0.8001 27.56/0.8376 32.51/0.9369
CARN 1592 34.29/0.9255 30.29/0.8407 29.06/0.8034 28.06/0.8493 33.50/0.9440
EDSR-baseline 1500 34.37/0.9270 30.28/0.8417 29.09/0.8052 28.15/0.8527 33.49/0.9438
SRMDNF 1530 34.12/0.9250 30.04/0.8370 28.97/0.8030 27.57/0.8400
SeaNet-baseline 2471 34.36/0.9280 30.34/0.8428 29.09/0.8053 28.17/0.8527 33.40/0.9444
Cross-SRN 1296 34.43/0.9275 30.33/0.8417 29.09/0.8050 28.23/0.8535 33.65/0.9448
DAN(本研究) 1326 34.42/0.9274 30.38/0.8429 29.10/0.8052 28.24/0.8542 33.63/0.9446
Bicubic 4 28.42/0.8104 26.00/0.7027 26.96/0.6675 23.14/0.6577 24.80/0.7866
IMDN 715 32.21/0.8948 28.58/0.7811 27.56/0.7353 26.04/0.7838 30.45/0.9075
MemNet 677 31.74/0.8893 28.26/0.7723 27.40/0.7281 25.50/0.7630 29.42/0.8942
CARN 1592 32.13/0.8937 28.60/0.7806 27.58/0.7349 26.07/0.7837 30.47/0.9087
EDSR-baseline 1500 32.09/0.8938 28.58/0.7813 27.57/0.7357 26.04/0.7849 30.45/0.9082
SRMDNF 1555 31.96/0.8930 28.35/0.7770 27.49/0.7340 25.68/0.7730
SeaNet-baseline 2397 32.18/0.8948 28.61/0.7822 27.57/0.7359 26.05/0.7896 30.44/0.9088
Cross-SRN 1296 32.24/0.8954 28.59/0.7817 27.58/0.7364 26.16/0.7881 30.53/0.9081
DAN(本研究) 1337 32.32/0.8962 28.68/0.7841 27.62/0.7381 26.31/0.7936 30.68/0.9106
Tab.1 Average PSNR and SSIM for different SR algorithms at magnifications of 2, 3 and 4
Fig.7 Comparison of visual effects at magnification of four in standard test set
[1]   SI W, HAN J, YANG Z, et al. Research on key techniques for super-resolution reconstruction of satellite remote sensing images of transmission lines [C]// Journal of Physics: Conference Series. Sanya: ICAACE, 2021: 012092.
[2]   DEEBA F, KUN S, DHAREJO F A, et al Sparse representation based computed tomography images reconstruction by coupled dictionary learning algorithm[J]. IET Image Processing, 2020, 14 (11): 2365- 2375
doi: 10.1049/iet-ipr.2019.1312
[3]   ZHANG F, LIU N, CHANG L, et al Edge-guided single facial depth map super-resolution using CNN[J]. IET Image Processing, 2020, 14 (17): 4708- 4716
doi: 10.1049/iet-ipr.2019.1623
[4]   LI W, LIAO W Stable super-resolution limit and smallest singular value of restricted Fourier matrices[J]. Applied and Computational Harmonic Analysis, 2021, 51: 118- 156
doi: 10.1016/j.acha.2020.10.004
[5]   吴世豪, 罗小华, 张建炜, 等 基于FPGA的新边缘指导插值算法硬件实现[J]. 浙江大学学报: 工学版, 2018, 52 (11): 2226- 2232
WU Shi-hao, LUO Xiao-hua, ZHANG Jian-wei, et al FPGA-based hardware implementation of new edge-directed interpolation algorithm[J]. Journal of Zhejiang University: Engineering Science, 2018, 52 (11): 2226- 2232
[6]   段然, 周登文, 赵丽娟, 等 基于多尺度特征映射网络的图像超分辨率重建[J]. 浙江大学学报: 工学版, 2019, 53 (7): 1331- 1339
DUAN Ran, ZHOU Deng-wen, ZHAO Li-juan, et al Image super-resolution reconstruction based on multi-scale feature mapping network[J]. Journal of Zhejiang University: Engineering Science, 2019, 53 (7): 1331- 1339
[7]   DONG C, LOY C C, HE K, et al. Learning a deep convolutional network for image super-resolution [C]// European Conference on Computer Vision. Zurich: ECCV, 2014: 184-199.
[8]   DONG C, LOY C C, HE K, et al Image super-resolution using deep convolutional networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 38 (2): 295- 307
[9]   LIM B, SON S, KIM H, et al. Enhanced deep residual networks for single image super-resolution [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. Honolulu: CVPRW, 2017: 136-144.
[10]   TAI Y, YANG J, LIU X, et al. Memnet: a persistent memory network for image restoration [C]// Proceedings of the IEEE International Conference on Computer Vision. Venice: ICCV, 2017: 4539-4547.
[11]   AHN N, KANG B, SOHN K A. Fast, accurate, and lightweight super-resolution with cascading residual network [C]// Proceedings of the European Conference on Computer Vision. Munich: ECCV, 2018: 252-268.
[12]   WANG C, LI Z , SHI J. Lightweight image super-resolution with adaptive weighted learning network [EB/OL]. [2019-04-04]. https://arxiv.org/abs/1904.02358.
[13]   WOO S, PARK J, LEE J Y, et al. Cbam: convolutional block attention module [C]// Proceedings of the European Conference on Computer Vision. Munich: ECCV, 2018: 3-19.
[14]   DAI T, CAI J, ZHANG Y, et al. Second-order attention network for single image super-resolution [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: CVPR, 2019: 11065-11074.
[15]   ZHANG Y, LI K, LI K, et al. Residual non-local attention networks for image restoration[EB/OL]. [2019-03-24]. https://arxiv.org/abs/1903.10082.
[16]   JIA X, BRABANDERE D B, TUYTELAARS T, et al. Dynamic filter networks for predicting unobserved views [C]// Proceedings of the European Conference on Computer Vision 2016 Workshops. Amsterdam: ECCVW, 2016: 1-2.
[17]   YANG B, BENDER G, LE Q V, et al. Condconv: conditionally parameterized convolutions for efficient inference [C]// Advances in Neural Information Processing Systems. 2019, 32: 767-779.
[18]   CHEN Y, DAI X, LIU M, et al. Dynamic convolution: attention over convolution kernels [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: CVPR, 2020: 11030-11039.
[19]   ZHANG Y, ZHANG J, WANG Q, et al. Dynet: dynamic convolution for accelerating convolutional neural networks [EB/OL]. [2020-04-22]. https://arxiv.org/abs/2004.10694.
[20]   ZHANG Y, LI K, LI K, et al. Image super-resolution using very deep residual channel attention networks [C]// Proceedings of the European Conference on Computer Vision. Munich: ECCV, 2018: 286-301.
[21]   CHEN H, GU J, ZHANG Z. Attention in attention network for image super-resolution [EB/OL]. [2021-04-19]. https://arxiv.org/abs/2104.09497.
[22]   TIMOFTE R, AGUSTSSON E, VAN G L, et al. Ntire 2017 challenge on single image super-resolution: methods and results [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. Hawaii: CVPRW, 2017: 114-125.
[23]   BEVILACQUA M, ROUMY A, GUILLEMOT C, et al. Low-complexity single-image super-resolution based on nonnegative neighbor embedding [C]// Proceedings British Machine Vision Conference. Surrey: Springer, 2012: 1-10.
[24]   ZEYDE R, ELAD M, PROTTER M. On single image scale-up using sparse-representations [C]// International Conference on Curves and Surfaces. Avignon: ICCS, 2010: 711-730.
[25]   MARTIN D, FOWLKES C, TAL D, et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics [C]// Proceedings 18th IEEE International Conference on Computer Vision. Vancouver: ICCV, 2001: 416-423.
[26]   HUANG J B, SINGH A, AHUJA N. Single image super-resolution from transformed self-exemplars [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Santiago: IEEE, 2015: 5197-5206.
[27]   MATSUI Y, ITO K, ARAMAKI Y, et al Sketch-based manga retrieval using manga109 dataset[J]. Multimedia Tools and Applications, 2017, 76 (20): 21811- 21838
doi: 10.1007/s11042-016-4020-z
[28]   FEI Y, LIAN F H, YAN Y. An improved PSNR algorithm for objective video quality evaluation [C]// 2007 Chinese Control Conference. Zhangjiajie: CCC, 2007: 376-380.
[29]   WANG Z, BOVIK A C, SHEIKH H R, et al Image quality assessment: from error visibility to structural similarity[J]. IEEE Transactions on Image Processing, 2004, 13 (4): 600- 612
doi: 10.1109/TIP.2003.819861
[30]   KINGMA D P, BA J. Adam: a method for stochastic optimization [EB/OL]. [2014-12-22]. https://arxiv.org/abs/1412.6980.
[31]   HUI Z, GAO X, YANG Y, et al. Lightweight image super-resolution with information multi-distillation network [C]// Proceedings of the 27th ACM International Conference on Multimedia. Ottawa: ACM, 2019: 2024-2032.
[32]   FANG F, LI J, ZENG T Soft-edge assisted network for single image super-resolution[J]. IEEE Transactions on Image Processing, 2020, 29: 4656- 4668
doi: 10.1109/TIP.2020.2973769
[33]   LIU Y, JIA Q, FAN X, et al Cross-srn: structure-preserving super-resolution network with cross convolution[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 32 (8): 4927- 4939
[1] Hao-ran GUO,Ji-chang GUO,Yu-dong WANG. Lightweight semantic segmentation network for underwater image[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(7): 1278-1286.
[2] Xiao-yan LI,Peng WANG,Jia GUO,Xue LI,Meng-yu SUN. Multi branch Siamese network target tracking based on double attention mechanism[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(7): 1307-1316.
[3] Wei QUAN,Yong-qing CAI,Chao WANG,Jia SONG,Hong-kai SUN,Lin-xuan LI. VR sickness estimation model based on 3D-ResNet two-stream network[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(7): 1345-1353.
[4] Jun HAN,Xiao-ping YUAN,Zhun WANG,Ye CHEN. UAV dense small target detection algorithm based on YOLOv5s[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(6): 1224-1233.
[5] Xin-dong LV,Jiao LI,Zhen-nan DENG,Hao FENG,Xin-tong CUI,Hong-xia DENG. Structured image super-resolution network based on improved Transformer[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(5): 865-874.
[6] Xue-yong XIANG,Li WANG,Wen-peng ZONG,Guang-yun LI. Point cloud instance segmentation based on attention mechanism KNN and ASIS module[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(5): 875-882.
[7] Fan XIONG,Tian CHEN,Bai-cheng BIAN,Jun LIU. Chip surface character recognition based on convolutional recurrent neural network[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(5): 948-956.
[8] Yu-ting SU,Rong-xuan LU,Wei ZHANG. Vehicle re-identification algorithm based on attention mechanism and adaptive weight[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(4): 712-718.
[9] Bai-cheng BIAN,Tian CHEN,Ru-jun WU,Jun LIU. Improved YOLOv3-based defect detection algorithm for printed circuit board[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(4): 735-743.
[10] Yan-fen CHENG,Jia-jun WU,Fan HE. Aspect level sentiment analysis based on relation gated graph convolutional network[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(3): 437-445.
[11] Fan YANG,Bo NING,Huai-qing LI,Xin ZHOU,Guan-yu LI. Multimodal image retrieval model based on semantic-enhanced feature fusion[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(2): 252-258.
[12] Chao LIU,Bing KONG,Guo-wang DU,Li-hua ZHOU,Hong-mei CHEN,Chong-ming BAO. Deep clustering via high-order mutual information maximization and pseudo-label guidance[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(2): 299-309.
[13] Lin-tao WANG,Qi MAO. Position measurement method for tunnel segment grabbing based on RGB and depth information fusion[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(1): 47-54.
[14] Li-zhou FENG,Yang YANG,You-wei WANG,Gui-jun YANG. New method for news recommendation based on Transformer and knowledge graph[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(1): 133-143.
[15] Kun HAO,Kuo WANG,Bei-bei WANG. Lightweight underwater biological detection algorithm based on improved Mobilenet-YOLOv3[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(8): 1622-1632.