基于轻量高频Transformer与特征互补融合的视网膜血管分割

doi:10.3785/j.issn.1008-973X.2026.07.003

浙江大学学报(工学版)

2026, Vol. 60

Issue (7): 1392-1403 DOI: 10.3785/j.issn.1008-973X.2026.07.003

计算机与控制工程

基于轻量高频Transformer与特征互补融合的视网膜血管分割

梁礼明(

),王成斌,钟奕,陈林俊,吴健*(

)

江西理工大学电气工程与自动化学院，江西赣州 341000

Retinal vessel segmentation based on lightweight high-frequency Transformer and feature complementary fusion

Liming LIANG(

),Chengbin WANG,Yi ZHONG,Linjun CHEN,Jian WU*(

)

School of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou 341000, China

全文: PDF(4709 KB) HTML

摘要：

针对视网膜血管分割中血管信息易损失、血管像素占比小及病灶区域血管分割困难的问题，构建基于轻量高频Transformer与特征互补融合的视网膜血管分割算法. 利用轻量高频Transformer模块构建双边特征提取编码结构，通过全局和局部信息的有效聚合，缓解初始编码过程中血管信息损失的问题. 在编码端设计细节增强注意力模块，准确辨别血管前景和非血管背景，提高模型对微细血管的捕捉能力. 在解码端构建双解码路径并引入特征互补融合模块，实现低层细节与高层语义信息的充分整合，改善对病灶区域的血管分割精度. 实验结果表明，在DRIVE、STARE和CHASE_DB1公共数据集上所提算法的准确率分别为97.12%、97.62%和97.65%；灵敏度分别为80.23%、80.48%和81.30%；F1分数分别为82.99%、83.73%和81.73%. 该方法的参数量和计算复杂度FLOPs分别为4.27 M和1.79 G，在实现较高计算效率的同时保持了模型的分割性能，充分展现了其在临床眼科疾病诊断中的应用潜力.

关键词： 视网膜血管分割; 轻量高频Transformer; 特征互补融合; 细节增强注意力; 全局特征

Abstract:

A retinal vessel segmentation algorithm based on a lightweight high-frequency Transformer and feature complementary fusion was developed to address the issues in retinal vessel segmentation, including the loss of vessel information, the low proportion of vessel pixels, and the difficulties in segmenting vessels in lesion regions. A bilateral feature extraction encoding structure was constructed using the lightweight high-frequency Transformer module. Global and local information were effectively aggregated to mitigate the loss of vessel information during the initial encoding process. A detail-enhancement attention module was designed within the encoding stage to accurately distinguish the vessel foreground from the non-vessel background, thereby improving the model’s ability to capture fine vessels. A dual-decoding path was constructed at the decoding stage, and a feature complementary fusion module was introduced to fully integrate low-level details with high-level semantic information, thereby enhancing the segmentation accuracy of vessels in lesion regions. Experimental results on the public datasets of DRIVE, STARE, and CHASE_DB1 indicated that the proposed algorithm achieved accuracies of 97.12%, 97.62%, and 97.65%, sensitivities of 80.23%, 80.48%, and 81.30%, and F1-scores of 82.99%, 83.73%, and 81.73%, respectively. The parameter count and computational complexity (FLOPs) were 4.27 M and 1.79 G, respectively. While achieving high computational efficiency, this method maintains the model’s segmentation performance, fully demonstrating its potential in clinical diagnosis of ophthalmic diseases.

Key words: retinal vessel segmentation lightweight high-frequency Transformer feature complementary fusion detail-enhanced attention global feature

收稿日期: 2025-03-26 出版日期: 2026-05-23

CLC:

TP 391

基金资助: 国家自然科学基金资助项目(51365017, 61463018)；江西省自然科学基金资助项目(20192BAB205084)；江西省教育厅科学技术研究资助项目(GJJ2200848).

通讯作者: 吴健 E-mail: 9119890012@jxust.edu.cn;wujian@jxust.edu.cn

作者简介: 梁礼明（1967—），男，教授，硕导，从事机器学习、模式识别与图像处理研究. orcid.org/0000-0003-4549-4414. E-mail：9119890012@jxust.edu.cn

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	梁礼明
	王成斌
	钟奕
	陈林俊
	吴健

引用本文:

梁礼明,王成斌,钟奕,陈林俊,吴健. 基于轻量高频Transformer与特征互补融合的视网膜血管分割[J]. 浙江大学学报(工学版), 2026, 60(7): 1392-1403.

Liming LIANG,Chengbin WANG,Yi ZHONG,Linjun CHEN,Jian WU. Retinal vessel segmentation based on lightweight high-frequency Transformer and feature complementary fusion. Journal of ZheJiang University (Engineering Science), 2026, 60(7): 1392-1403.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.07.003 或 https://www.zjujournals.com/eng/CN/Y2026/V60/I7/1392

图 1 LFF-Net网络结构

图 2 轻量高频Transformer模块

图 3 轻量高频注意力模块

图 4 细节增强注意力模块

图 5 特征互补融合模块

图 6 图像预处理流程

图 7 不同算法的视网膜血管分割结果

图 8 不同算法的视网膜血管分割局部图像

表 1 不同算法在3个数据集上的血管分割结果对比

表 2 所提算法与先进算法在3个数据集上的血管分割结果对比

表 3 采取不同模块时模型在不同数据集上的血管分割性能

图 9 不同改进网络的血管分割结果

表 4 参数量及每秒浮点运算次数对比

表 5 训练及测试时间对比

1	梁礼明, 詹涛, 雷坤, 等多分辨率融合输入的U型视网膜血管分割算法[J]. 电子与信息学报, 2023, 45 (5): 1795- 1806 LIANG Liming, ZHAN Tao, LEI Kun, et al Multi-resolution fusion input U-shaped retinal vessel segmentation algorithm[J]. Journal of Electronics & Information Technology, 2023, 45 (5): 1795- 1806 doi: 10.11999/JEIT220470
2	MAPAYI T, VIRIRI S, TAPAMO J R. Adaptive thresholding technique for retinal vessel segmentation based on GLCM-energy information [J]. Computational and Mathematical Methods in Medicine, 2015: 597475.
3	ODSTRCILIK J, KOLAR R, BUDAI A, et al Retinal vessel segmentation by improved matched filtering: evaluation on a new high-resolution fundus image database[J]. IET Image Processing, 2013, 7 (4): 373- 383 doi: 10.1049/iet-ipr.2012.0455
4	VLACHOS M, DERMATAS E Multi-scale retinal vessel segmentation using line tracking[J]. Computerized Medical Imaging and Graphics, 2010, 34 (3): 213- 227 doi: 10.1016/j.compmedimag.2009.09.006
5	ZHAO J, YANG J, AI D, et al Automatic retinal vessel segmentation using multi-scale superpixel chain tracking[J]. Digital Signal Processing, 2018, 81: 26- 42 doi: 10.1016/j.dsp.2018.06.006
6	RELAN D, MACGILLIVRAY T, BALLERINI L, et al. Automatic retinal vessel classification using a least square-support vector machine in VAMPIRE [C]// Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Chicago: IEEE, 2014: 142–145.
7	梁礼明, 刘博文, 杨海龙, 等基于多特征融合的有监督视网膜血管提取[J]. 计算机学报, 2018, 41 (11): 2566- 2580 LIANG Liming, LIU Bowen, YANG Hailong, et al Supervised blood vessel extraction in retinal images based on multiple feature fusion[J]. Chinese Journal of Computers, 2018, 41 (11): 2566- 2580 doi: 10.11897/SP.J.1016.2018.02566
8	王万良, 王铁军, 陈嘉诚, 等融合多尺度和多头注意力的医疗图像分割方法[J]. 浙江大学学报: 工学版, 2022, 56 (9): 1796- 1805 WANG Wanliang, WANG Tiejun, CHEN Jiacheng, et al Medical image segmentation method combining multi-scale and multi-head attention[J]. Journal of Zhejiang University: Engineering Science, 2022, 56 (9): 1796- 1805
9	RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation [M]// Lecture notes in computer science. Cham: Springer, 2015: 234–241.
10	LI J, GAO G, LIU Y, et al MAGF-Net: a multiscale attention-guided fusion network for retinal vessel segmentation[J]. Measurement, 2023, 206: 112316 doi: 10.1016/j.measurement.2022.112316
11	ZHANG H, FANG W, LI J A microvascular segmentation network based on pyramidal attention mechanism[J]. Sensors, 2024, 24 (12): 4014 doi: 10.3390/s24124014
12	LI J, LI A, LIU Y, et al An adaptive fundus retinal vessel segmentation model capable of adapting to the complex structure of blood vessels[J]. Biomedical Signal Processing and Control, 2025, 101: 107150 doi: 10.1016/j.bspc.2024.107150
13	PAN P, ZHANG C, SUN J, et al Multi-scale conv-attention U-Net for medical image segmentation[J]. Scientific Reports, 2025, 15: 12041 doi: 10.1038/s41598-025-96101-8
14	HUANG L, MIRON A, HONE K, et al. Segmenting medical images: from UNet to res-UNet and nnUNet [C]// Proceedings of the IEEE 37th International Symposium on Computer-Based Medical Systems. Guadalajara: IEEE, 2024: 483–489.
15	VASWANI A, SHAZEER N, PARAMAR N, et al. Attention is all you need [EB/OL]. (2023-08-02) [2025-02-26]. https://arxiv.org/abs/1706.03762.
16	CHEN J, LU Y, YU Q, et al. TransUNet: Transformers make strong encoders for medical image segmentation [EB/OL]. (2021-02-08) [2025-03-12]. https://arxiv.org/abs/2102.04306.
17	SHI Z, LI Y, ZOU H, et al TCU-Net: Transformer embedded in convolutional U-shaped network for retinal vessel segmentation[J]. Sensors, 2023, 23 (10): 4897 doi: 10.3390/s23104897
18	TANG W, DENG H, HUANG Z, et al Medical image segmentation method based on full perceived dynamic network[J]. Engineering Applications of Artificial Intelligence, 2025, 142: 109867 doi: 10.1016/j.engappai.2024.109867
19	LI Y, XU L, JIN Y, et al Diffusion probabilistic learning with gate-fusion Transformer and edge-frequency attention for retinal vessel segmentation[J]. IEEE Transactions on Instrumentation and Measurement, 2024, 73: 2523513
20	ZHANG F, PANAHI A, GAO G FsaNet: frequency self-attention for semantic segmentation[J]. IEEE Transactions on Image Processing, 2023, 32: 4757- 4772 doi: 10.1109/TIP.2023.3305090
21	KARIMIJAFARBIGLOO S, AZAD R, KAZEROUNI A, et al. MS-Former: multi-scale self-guided Transformer for medical image segmentation [C]// Medical Imaging with Deep Learning. Paris: PMLR, 2024: 680–694.
22	CHEN Z, HE Z, LU Z DEA-net: single image dehazing based on detail-enhanced convolution and content-guided attention[J]. IEEE Transactions on Image Processing, 2024, 33: 1002- 1015 doi: 10.1109/TIP.2024.3354108
23	ZHANG X, ZHOU X, LIN M, et al. ShuffleNet: an extremely efficient convolutional neural network for mobile devices [C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 6848–6856.
24	OKTAY O, SCHLEMPER J, FOLGOC L, et al. Attention U-Net: learning where to look for the pancreas [EB/OL]. (2018-05-20) [2025-03-14]. https://arxiv.org/abs/1804.03999.
25	LIU W, YANG H, TIAN T, et al Full-resolution network and dual-threshold iteration for retinal vessel and coronary angiograph segmentation[J]. IEEE Journal of Biomedical and Health Informatics, 2022, 26 (9): 4623- 4634 doi: 10.1109/JBHI.2022.3188710
26	LI Y, ZHANG Y, LIU J Y, et al Global Transformer and dual local attention network via deep-shallow hierarchical feature fusion for retinal vessel segmentation[J]. IEEE Transactions on Cybernetics, 2023, 53 (9): 5826- 5839 doi: 10.1109/TCYB.2022.3194099
27	LIANG L, LU B, WU J, et al SFIT-Net: spatial reconstruction feature interaction transformer retinal vessel segmentation algorithm[J]. Biomedical Signal Processing and Control, 2025, 106: 107688 doi: 10.1016/j.bspc.2025.107688
28	LUO X, PENG L, KE Z, et al PA-Net: a hybrid architecture for retinal vessel segmentation[J]. Pattern Recognition, 2025, 161: 111254 doi: 10.1016/j.patcog.2024.111254
29	AZAD R, ARIMOND R, AGHDAM E K, et al. DAE-Former: dual attention-guided efficient transformer for medical image segmentation [M]// Predictive intelligence in medicine. Cham: Springer, 2023: 83–95.
30	ZHANG H, ZHANG J, ZHONG X, et al MSM-TDE: multi-scale semantics mining and tiny details enhancement network for retinal vessel segmentation[J]. Complex & Intelligent Systems, 2025, 11 (1): 114
31	QIN L, LI Y, LIN C BINet: bio-inspired network for retinal vessel segmentation[J]. Biomedical Signal Processing and Control, 2025, 100: 107003 doi: 10.1016/j.bspc.2024.107003
32	WANG J, LI X, MA Z Multi-scale three-path network (MSTP-Net): a new architecture for retinal vessel segmentation[J]. Measurement, 2025, 250: 117100 doi: 10.1016/j.measurement.2025.117100

[1]	陈捷丰,姚金良. 基于自相似嵌入和全局特征重排序的图像检索方法[J]. 浙江大学学报(工学版), 2025, 59(6): 1130-1139.
[2]	艾青林,杨佳豪,崔景瑞. 基于自适应增殖数据增强与全局特征融合的小目标行人检测[J]. 浙江大学学报(工学版), 2023, 57(10): 1933-1944.
[3]	徐泽鑫,段立娟,王文健,恩擎. 基于上下文特征融合的代码漏洞检测方法[J]. 浙江大学学报(工学版), 2022, 56(11): 2260-2270.

Viewed

Full text

Abstract

Cited

Shared

Discussed