Please wait a minute...
浙江大学学报(工学版)  2026, Vol. 60 Issue (7): 1392-1403    DOI: 10.3785/j.issn.1008-973X.2026.07.003
计算机与控制工程     
基于轻量高频Transformer与特征互补融合的视网膜血管分割
梁礼明(),王成斌,钟奕,陈林俊,吴健*()
江西理工大学 电气工程与自动化学院,江西 赣州 341000
Retinal vessel segmentation based on lightweight high-frequency Transformer and feature complementary fusion
Liming LIANG(),Chengbin WANG,Yi ZHONG,Linjun CHEN,Jian WU*()
School of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou 341000, China
 全文: PDF(4709 KB)   HTML
摘要:

针对视网膜血管分割中血管信息易损失、血管像素占比小及病灶区域血管分割困难的问题,构建基于轻量高频Transformer与特征互补融合的视网膜血管分割算法. 利用轻量高频Transformer模块构建双边特征提取编码结构,通过全局和局部信息的有效聚合,缓解初始编码过程中血管信息损失的问题. 在编码端设计细节增强注意力模块,准确辨别血管前景和非血管背景,提高模型对微细血管的捕捉能力. 在解码端构建双解码路径并引入特征互补融合模块,实现低层细节与高层语义信息的充分整合,改善对病灶区域的血管分割精度. 实验结果表明,在DRIVE、STARE和CHASE_DB1公共数据集上所提算法的准确率分别为97.12%、97.62%和97.65%;灵敏度分别为80.23%、80.48%和81.30%;F1分数分别为82.99%、83.73%和81.73%. 该方法的参数量和计算复杂度FLOPs分别为4.27 M和1.79 G,在实现较高计算效率的同时保持了模型的分割性能,充分展现了其在临床眼科疾病诊断中的应用潜力.

关键词: 视网膜血管分割轻量高频Transformer特征互补融合细节增强注意力全局特征    
Abstract:

A retinal vessel segmentation algorithm based on a lightweight high-frequency Transformer and feature complementary fusion was developed to address the issues in retinal vessel segmentation, including the loss of vessel information, the low proportion of vessel pixels, and the difficulties in segmenting vessels in lesion regions. A bilateral feature extraction encoding structure was constructed using the lightweight high-frequency Transformer module. Global and local information were effectively aggregated to mitigate the loss of vessel information during the initial encoding process. A detail-enhancement attention module was designed within the encoding stage to accurately distinguish the vessel foreground from the non-vessel background, thereby improving the model’s ability to capture fine vessels. A dual-decoding path was constructed at the decoding stage, and a feature complementary fusion module was introduced to fully integrate low-level details with high-level semantic information, thereby enhancing the segmentation accuracy of vessels in lesion regions. Experimental results on the public datasets of DRIVE, STARE, and CHASE_DB1 indicated that the proposed algorithm achieved accuracies of 97.12%, 97.62%, and 97.65%, sensitivities of 80.23%, 80.48%, and 81.30%, and F1-scores of 82.99%, 83.73%, and 81.73%, respectively. The parameter count and computational complexity (FLOPs) were 4.27 M and 1.79 G, respectively. While achieving high computational efficiency, this method maintains the model’s segmentation performance, fully demonstrating its potential in clinical diagnosis of ophthalmic diseases.

Key words: retinal vessel segmentation    lightweight high-frequency Transformer    feature complementary fusion    detail-enhanced attention    global feature
收稿日期: 2025-03-26 出版日期: 2026-05-23
CLC:  TP 391  
基金资助: 国家自然科学基金资助项目(51365017, 61463018);江西省自然科学基金资助项目(20192BAB205084);江西省教育厅科学技术研究资助项目(GJJ2200848).
通讯作者: 吴健     E-mail: 9119890012@jxust.edu.cn;wujian@jxust.edu.cn
作者简介: 梁礼明(1967—),男,教授,硕导,从事机器学习、模式识别与图像处理研究. orcid.org/0000-0003-4549-4414. E-mail:9119890012@jxust.edu.cn
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
梁礼明
王成斌
钟奕
陈林俊
吴健

引用本文:

梁礼明,王成斌,钟奕,陈林俊,吴健. 基于轻量高频Transformer与特征互补融合的视网膜血管分割[J]. 浙江大学学报(工学版), 2026, 60(7): 1392-1403.

Liming LIANG,Chengbin WANG,Yi ZHONG,Linjun CHEN,Jian WU. Retinal vessel segmentation based on lightweight high-frequency Transformer and feature complementary fusion. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1392-1403.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.07.003        https://www.zjujournals.com/eng/CN/Y2026/V60/I7/1392

图 1  LFF-Net网络结构
图 2  轻量高频Transformer模块
图 3  轻量高频注意力模块
图 4  细节增强注意力模块
图 5  特征互补融合模块
图 6  图像预处理流程
图 7  不同算法的视网膜血管分割结果
图 8  不同算法的视网膜血管分割局部图像
数据集模型ACC/%SE/%SP/%AUC/%F1/%
DRIVEU-Net97.0878.5798.8698.8182.53
Attention U-Net97.1079.4298.8098.8282.78
FR-UNet97.0680.1098.6998.7882.69
GT-DLA-dsHFF97.1080.1598.7398.8082.91
LFF-Net97.1280.2398.7498.8382.99
STAREU-Net97.5279.2699.0299.0182.98
Attention U-Net97.5578.8899.0199.0583.08
FR-UNet97.4980.3698.9099.0383.01
GT-DLA-dsHFF97.5679.9899.0199.0983.37
LFF-Net97.6280.4899.0299.1283.73
CHASE_DB1U-Net97.4680.9098.5798.8680.09
Attention U-Net97.6080.6598.7498.9780.96
FR-UNet97.4181.1998.5098.7579.83
GT-DLA-dsHFF97.6180.7298.7499.0080.97
LFF-Net97.6581.3098.7598.9981.37
表 1  不同算法在3个数据集上的血管分割结果对比
数据集模型ACC/%SE/%SP/%AUC/%F1/%
DRIVESFIT-Net[27]97.0781.5998.5598.7582.97
PA-Net[28]95.8282.8498.0798.3383.93
DAE-Former[29]95.9279.2898.4697.8083.73
MSM-TDE[30]96.6684.9297.2397.8079.30
BINet[31]96.0686.9297.3784.25
MSTP-Net[32]96.9183.6898.1882.58
DAU-Net[33]95.8581.5598.1598.1882.99
LFF-Net97.1280.2398.7498.8382.99
STARESFIT-Net[27]97.5082.1898.9299.1083.37
PA-Net[28]97.0988.1398.0599.0885.61
DAE-Former[29]97.0682.6698.6698.9784.78
MSM-TDE[30]97.2686.9098.2298.0983.70
BINet[31]96.1682.7697.7681.33
MSTP-Net[32]97.6186.0398.5884.68
DAU-Net[33]97.1285.8098.4399.0886.20
LFF-Net97.6280.4899.0299.1283.73
CHASE_DB1SFIT-Net[27]97.5382.1998.5698.8180.76
PA-Net[28]96.7785.7097.7998.7583.08
DAE-Former[29]96.6083.2897.9298.7081.61
MSM-TDE[30]96.6786.0297.5396.4578.05
BINet[31]96.0483.9397.3480.47
MSTP-Net[32]97.4584.8598.3080.74
DAU-Net[33]97.0083.6498.3598.9484.99
LFF-Net97.6581.3098.7598.9981.37
表 2  所提算法与先进算法在3个数据集上的血管分割结果对比
数据集FCFMLHFTDEAMACC/%SE/%SP/%AUC/%F1/%
DRIVE97.0878.5798.8698.8082.53
97.1079.6198.7798.8182.78
97.0979.9298.7598.8282.79
97.1079.7598.7798.8182.76
97.1079.7998.7798.8482.86
97.1079.7198.7898.8182.82
97.1179.7898.8098.8282.87
97.1280.2398.7498.8382.99
STARE97.5279.2699.0299.0182.98
97.5779.6299.0499.0883.30
97.5379.8899.0499.0583.41
97.5479.1899.0799.0183.20
97.6080.3599.0299.1083.62
97.5980.3399.0199.1083.57
97.6080.2299.0299.0883.58
97.6280.4899.0299.1283.73
CHASE_DB197.4680.9098.5798.8580.09
97.5981.0598.7198.9980.97
97.5481.1498.7798.9981.03
97.5780.3598.7898.9580.86
97.6481.1798.7599.0381.30
97.6180.5298.7698.9880.97
97.6281.1998.7298.9781.16
97.6581.3098.7598.9981.37
表 3  采取不同模块时模型在不同数据集上的血管分割性能
图 9  不同改进网络的血管分割结果
模型Np/MFLOPs/G
U-Net34.524.09
Attention U-Net34.874.16
FR-UNet5.723.68
GT-DLA-dsHFF26.097.40
LFF-Net4.271.79
表 4  参数量及每秒浮点运算次数对比
模型ttr/stte/s
U-Net140090
Attention U-Net160095
FR-UNet2400200
GT-DLA-dsHFF52403120
LFF-Net2040180
表 5  训练及测试时间对比
1 梁礼明, 詹涛, 雷坤, 等 多分辨率融合输入的U型视网膜血管分割算法[J]. 电子与信息学报, 2023, 45 (5): 1795- 1806
LIANG Liming, ZHAN Tao, LEI Kun, et al Multi-resolution fusion input U-shaped retinal vessel segmentation algorithm[J]. Journal of Electronics & Information Technology, 2023, 45 (5): 1795- 1806
doi: 10.11999/JEIT220470
2 MAPAYI T, VIRIRI S, TAPAMO J R. Adaptive thresholding technique for retinal vessel segmentation based on GLCM-energy information [J]. Computational and Mathematical Methods in Medicine, 2015: 597475.
3 ODSTRCILIK J, KOLAR R, BUDAI A, et al Retinal vessel segmentation by improved matched filtering: evaluation on a new high-resolution fundus image database[J]. IET Image Processing, 2013, 7 (4): 373- 383
doi: 10.1049/iet-ipr.2012.0455
4 VLACHOS M, DERMATAS E Multi-scale retinal vessel segmentation using line tracking[J]. Computerized Medical Imaging and Graphics, 2010, 34 (3): 213- 227
doi: 10.1016/j.compmedimag.2009.09.006
5 ZHAO J, YANG J, AI D, et al Automatic retinal vessel segmentation using multi-scale superpixel chain tracking[J]. Digital Signal Processing, 2018, 81: 26- 42
doi: 10.1016/j.dsp.2018.06.006
6 RELAN D, MACGILLIVRAY T, BALLERINI L, et al. Automatic retinal vessel classification using a least square-support vector machine in VAMPIRE [C]// Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Chicago: IEEE, 2014: 142–145.
7 梁礼明, 刘博文, 杨海龙, 等 基于多特征融合的有监督视网膜血管提取[J]. 计算机学报, 2018, 41 (11): 2566- 2580
LIANG Liming, LIU Bowen, YANG Hailong, et al Supervised blood vessel extraction in retinal images based on multiple feature fusion[J]. Chinese Journal of Computers, 2018, 41 (11): 2566- 2580
doi: 10.11897/SP.J.1016.2018.02566
8 王万良, 王铁军, 陈嘉诚, 等 融合多尺度和多头注意力的医疗图像分割方法[J]. 浙江大学学报: 工学版, 2022, 56 (9): 1796- 1805
WANG Wanliang, WANG Tiejun, CHEN Jiacheng, et al Medical image segmentation method combining multi-scale and multi-head attention[J]. Journal of Zhejiang University: Engineering Science, 2022, 56 (9): 1796- 1805
9 RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation [M]// Lecture notes in computer science. Cham: Springer, 2015: 234–241.
10 LI J, GAO G, LIU Y, et al MAGF-Net: a multiscale attention-guided fusion network for retinal vessel segmentation[J]. Measurement, 2023, 206: 112316
doi: 10.1016/j.measurement.2022.112316
11 ZHANG H, FANG W, LI J A microvascular segmentation network based on pyramidal attention mechanism[J]. Sensors, 2024, 24 (12): 4014
doi: 10.3390/s24124014
12 LI J, LI A, LIU Y, et al An adaptive fundus retinal vessel segmentation model capable of adapting to the complex structure of blood vessels[J]. Biomedical Signal Processing and Control, 2025, 101: 107150
doi: 10.1016/j.bspc.2024.107150
13 PAN P, ZHANG C, SUN J, et al Multi-scale conv-attention U-Net for medical image segmentation[J]. Scientific Reports, 2025, 15: 12041
doi: 10.1038/s41598-025-96101-8
14 HUANG L, MIRON A, HONE K, et al. Segmenting medical images: from UNet to res-UNet and nnUNet [C]// Proceedings of the IEEE 37th International Symposium on Computer-Based Medical Systems. Guadalajara: IEEE, 2024: 483–489.
15 VASWANI A, SHAZEER N, PARAMAR N, et al. Attention is all you need [EB/OL]. (2023-08-02) [2025-02-26]. https://arxiv.org/abs/1706.03762.
16 CHEN J, LU Y, YU Q, et al. TransUNet: Transformers make strong encoders for medical image segmentation [EB/OL]. (2021-02-08) [2025-03-12]. https://arxiv.org/abs/2102.04306.
17 SHI Z, LI Y, ZOU H, et al TCU-Net: Transformer embedded in convolutional U-shaped network for retinal vessel segmentation[J]. Sensors, 2023, 23 (10): 4897
doi: 10.3390/s23104897
18 TANG W, DENG H, HUANG Z, et al Medical image segmentation method based on full perceived dynamic network[J]. Engineering Applications of Artificial Intelligence, 2025, 142: 109867
doi: 10.1016/j.engappai.2024.109867
19 LI Y, XU L, JIN Y, et al Diffusion probabilistic learning with gate-fusion Transformer and edge-frequency attention for retinal vessel segmentation[J]. IEEE Transactions on Instrumentation and Measurement, 2024, 73: 2523513
20 ZHANG F, PANAHI A, GAO G FsaNet: frequency self-attention for semantic segmentation[J]. IEEE Transactions on Image Processing, 2023, 32: 4757- 4772
doi: 10.1109/TIP.2023.3305090
21 KARIMIJAFARBIGLOO S, AZAD R, KAZEROUNI A, et al. MS-Former: multi-scale self-guided Transformer for medical image segmentation [C]// Medical Imaging with Deep Learning. Paris: PMLR, 2024: 680–694.
22 CHEN Z, HE Z, LU Z DEA-net: single image dehazing based on detail-enhanced convolution and content-guided attention[J]. IEEE Transactions on Image Processing, 2024, 33: 1002- 1015
doi: 10.1109/TIP.2024.3354108
23 ZHANG X, ZHOU X, LIN M, et al. ShuffleNet: an extremely efficient convolutional neural network for mobile devices [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 6848–6856.
24 OKTAY O, SCHLEMPER J, FOLGOC L, et al. Attention U-Net: learning where to look for the pancreas [EB/OL]. (2018-05-20) [2025-03-14]. https://arxiv.org/abs/1804.03999.
25 LIU W, YANG H, TIAN T, et al Full-resolution network and dual-threshold iteration for retinal vessel and coronary angiograph segmentation[J]. IEEE Journal of Biomedical and Health Informatics, 2022, 26 (9): 4623- 4634
doi: 10.1109/JBHI.2022.3188710
26 LI Y, ZHANG Y, LIU J Y, et al Global Transformer and dual local attention network via deep-shallow hierarchical feature fusion for retinal vessel segmentation[J]. IEEE Transactions on Cybernetics, 2023, 53 (9): 5826- 5839
doi: 10.1109/TCYB.2022.3194099
27 LIANG L, LU B, WU J, et al SFIT-Net: spatial reconstruction feature interaction transformer retinal vessel segmentation algorithm[J]. Biomedical Signal Processing and Control, 2025, 106: 107688
doi: 10.1016/j.bspc.2025.107688
28 LUO X, PENG L, KE Z, et al PA-Net: a hybrid architecture for retinal vessel segmentation[J]. Pattern Recognition, 2025, 161: 111254
doi: 10.1016/j.patcog.2024.111254
29 AZAD R, ARIMOND R, AGHDAM E K, et al. DAE-Former: dual attention-guided efficient transformer for medical image segmentation [M]// Predictive intelligence in medicine. Cham: Springer, 2023: 83–95.
30 ZHANG H, ZHANG J, ZHONG X, et al MSM-TDE: multi-scale semantics mining and tiny details enhancement network for retinal vessel segmentation[J]. Complex & Intelligent Systems, 2025, 11 (1): 114
31 QIN L, LI Y, LIN C BINet: bio-inspired network for retinal vessel segmentation[J]. Biomedical Signal Processing and Control, 2025, 100: 107003
doi: 10.1016/j.bspc.2024.107003
32 WANG J, LI X, MA Z Multi-scale three-path network (MSTP-Net): a new architecture for retinal vessel segmentation[J]. Measurement, 2025, 250: 117100
doi: 10.1016/j.measurement.2025.117100
[1] 陈捷丰,姚金良. 基于自相似嵌入和全局特征重排序的图像检索方法[J]. 浙江大学学报(工学版), 2025, 59(6): 1130-1139.
[2] 艾青林,杨佳豪,崔景瑞. 基于自适应增殖数据增强与全局特征融合的小目标行人检测[J]. 浙江大学学报(工学版), 2023, 57(10): 1933-1944.
[3] 徐泽鑫,段立娟,王文健,恩擎. 基于上下文特征融合的代码漏洞检测方法[J]. 浙江大学学报(工学版), 2022, 56(11): 2260-2270.