Please wait a minute...
浙江大学学报(工学版)  2026, Vol. 60 Issue (6): 1277-1288    DOI: 10.3785/j.issn.1008-973X.2026.06.015
计算机技术     
基于KAN与CKAN优化的医学图像分割模型
娄世猛1(),邵玉斌1,*(),杜庆治1,唐菁敏1,张赜涛2
1. 昆明理工大学 信息工程与自动化学院,云南 昆明 650500
2. 云南省媒体融合重点实验室,云南 昆明 650228
Medical image segmentation model based on KAN and CKAN optimization
Shimeng LOU1(),Yubin SHAO1,*(),Qingzhi DU1,Jingmin TANG1,Zetao ZHANG2
1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
2. Yunnan Province Key Laboratory for Media Integration, Kunming 650228, China
 全文: PDF(1375 KB)   HTML
摘要:

为了解决UNet模型在医学图像分割任务中的复杂特征提取与泛化能力不足的问题,提出基于Kolmogorov-Arnold网络(KAN)和卷积KAN(CKAN)的优化模型KUNet,增强UNet模型的性能. 通过用CKAN替换传统卷积层,引入KAN特征增强模块,优化跳跃连接,结合自适应基函数学习机制,在保留结构信息的同时提高特征提取的多样性与精度. 在4个不同的多模态数据集LiTS、CORN、DRIVE和Lungs上,与UNet基线模型、nnUNet模型和Swin-UNet模型进行对比实验. 结果表明,UNet基线模型与KUNet模型在4个数据集上Dice系数和IoU系数的平均最大性能差异指标(MAPG)分别为0.67990.6203,且KUNet模型相较于最优或次优模型在4个数据集上的平均提升指标为0.32130.2625. 利用KUNet模型,能够在短周期内有效提取到更多的特征,提升图像分割的准确度.

关键词: 图像分割UNetKolmogorov-Arnold network(KAN)卷积KAN(CKAN)最大性能差异指标(MAPG)    
Abstract:

An optimized model KUNet based on Kolmogorov-Arnold network (KAN) and convolutional KAN (CKAN) was proposed to enhance the performance of the UNet model in order to address the limitation of the UNet model in complex feature extraction and generalization capability for medical image segmentation task. Traditional convolutional layer was replaced with CKAN, KAN feature enhancement module was introduced, and skip connection was optimized. Then the diversity and accuracy of feature extraction were improved while preserving structural information by incorporating an adaptive basis function learning mechanism. Comparative experiments were conducted against the UNet baseline model, nnUNet model and Swin-UNet model on four different multimodal datasets: LiTS, CORN, DRIVE and Lungs. Results showed that the average maximum absolute performance gap (MAPG) between the UNet baseline model and the KUNet model across the four datasets were 0.679 9 and 0.620 3 for Dice coefficient and IoU coefficient, respectively, and the KUNet model achieved average improvement metrics of 0.3213 and 0.2625 compared with the optimal or suboptimal model across the four datasets. The KUNet model was utilized to effectively extract more feature within short training cycle and improve the accuracy of image segmentation.

Key words: image segmentation    UNet    Kolmogorov-Arnold network (KAN)    convolutional Kolmogorov-Arnold network (CKAN)    maximum absolute performance gap (MAPG)
收稿日期: 2025-06-28 出版日期: 2026-05-06
CLC:  TP 393  
基金资助: 云南省媒体融合重点实验室资助项目(220245203).
通讯作者: 邵玉斌     E-mail: 2962772160@qq.com;shaoyubin999@qq.com
作者简介: 娄世猛(2001—),男,硕士生,从事智能信息处理研究. orcid.org/0009-0002-3212-4318.E-mail:2962772160@qq.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
娄世猛
邵玉斌
杜庆治
唐菁敏
张赜涛

引用本文:

娄世猛,邵玉斌,杜庆治,唐菁敏,张赜涛. 基于KAN与CKAN优化的医学图像分割模型[J]. 浙江大学学报(工学版), 2026, 60(6): 1277-1288.

Shimeng LOU,Yubin SHAO,Qingzhi DU,Jingmin TANG,Zetao ZHANG. Medical image segmentation model based on KAN and CKAN optimization. Journal of ZheJiang University (Engineering Science), 2026, 60(6): 1277-1288.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.06.015        https://www.zjujournals.com/eng/CN/Y2026/V60/I6/1277

图 1  KUNet模型的架构图
图 2  CKAN网络模型的架构图
数据集样本数量分辨率分割目标
LiTS400512×512肝脏
CORN1238384×384角膜神经
DRIVE20565×584眼底血管
Lungs200512×512
表 1  数据集的统计表
图 3  在LiTS数据集上的可视化分割结果
图 4  在CORN数据集上的可视化分割结果
图 5  在DRIVE数据集上的可视化分割结果
图 6  在Lungs数据集上的可视化分割结果
图 7  UNet、KUNet、nnUNet、Swin-UNet模型在LiTS数据集上的性能曲线
图 8  UNet、KUNet、nnUNet、Swin-UNet模型在CORN数据集上的性能曲线
图 9  UNet、KUNet、nnUNet、Swin-UNet模型在DRIVE数据集上的性能曲线
图 10  UNet、KUNet、nnUNet、Swin-UNet模型在Lungs数据集上的性能曲线
数据集周期DiceMAPG1IoUMAPG2
UNet模型KUNet模型UNet模型KUNet模型
LiTS1000.97860.978600.95820.9582
CORN2000.06530.12640.06110.03390.06770.0338
DRIVE30000.71470.714700.55610.5561
Lungs1000.96510.965100.93320.9332
均值0.67990.6203
表 2  KUNet模型与UNet基线模型在各数据集上的最大性能差距
网络模型DiceIoU
LiTSCORNDRIVELungsLiTSCORNDRIVELungs
UNet0.98090.51330.81040.96090.96260.35310.68130.9248
nnUNet0.99170.32190.79810.97470.98370.20490.65210.9563
Swin-UNet0.98420.08150.68750.95710.96880.0430.52390.9177
KUNet0.99050.53020.93010.98420.98110.36810.86940.9689
$ \varDelta $0.13480.03530.92250.46230.14770.02350.83730.3369
$ \overline{\varDelta } $0.32130.2625
表 3  UNet、KUNet、nnUNet、SWin-UNet模型在各数据集上的最佳性能
数据集$ {E}_{\text{UNet}} $$ {E}_{\text{nnUNet}} $$ {E}_{\text{Swin-UNet}} $$ {E}_{\text{KUNet}} $$ {T}_{\text{UNet}} $/s$ {T}_{\text{nnUNet}} $/s$ {T}_{\text{Swin-UNet}} $/s$ {T}_{\text{KUNet}} $/s
LiTS100100100100557.114344544998.17
CORN20001002000200037674.0455071538057941.40
DRIVE500050005000100001630.206393216595458.41
Lungs10010010050262.875509383241.84
表 4  UNet、KUNet、nnUNet、Swin-UNet模型在各数据集上的计算效率
模型DiceIoU
完整KUNet0.99050.9811
变体A0.98860.9774
变体B0.99040.9810
变体C0.98250.9657
变体D0.98920.9786
表 5  不同模块配置下的消融实验结果比较
1 GARCIA-GARCIA A, ORTS-ESCOLANO S, OPREA S, et al. A review on deep learning techniques applied to semantic segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1704.06857.
2 MINAEE S, BOYKOV Y, PORIKLI F, et al Image segmentation using deep learning: a survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 44 (7): 3523- 3542
3 ROTH H R, LU L, FARAG A, et al. DeepOrgan: multi-level deep convolutional networks for automated pancreas segmentation [C]//Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2015: 556–564.
4 LITJENS G, KOOI T, BEJNORDI B E, et al A survey on deep learning in medical image analysis[J]. Medical Image Analysis, 2017, 42: 60- 88
doi: 10.1016/j.media.2017.07.005
5 DEVALLA S K, PHAM T H, PANDA S K, et al Towards label-free 3D segmentation of optical coherence tomography images of the optic nerve head using deep learning[J]. Biomedical Optics Express, 2020, 11 (11): 6356- 6378
doi: 10.1364/BOE.395934
6 CORDTS M, OMRAN M, RAMOS S, et al. The cityscapes dataset for semantic urban scene understanding [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 3213–3223.
7 RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation [C]//Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2015: 234–241.
8 ZHOU Z, RAHMAN SIDDIQUEE M M, TAJBAKHSH N, et al. UNet++: a nested U-Net architecture for medical image segmentation [C]//Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Cham: Springer, 2018: 3–11.
9 ZHAO H, SHI J, QI X, et al. Pyramid scene parsing network [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 6230–6239.
10 CHEN L C, PAPANDREOU G, KOKKINOS I, et al DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40 (4): 834- 848
doi: 10.1109/TPAMI.2017.2699184
11 CAO H, WANG Y, CHEN J, et al. Swin-Unet: Unet-like pure transformer for medical image segmentation [C]// European Conference on Computer Vision. Cham: Springer, 2023: 205–218.
12 ISENSEE F, JAEGER P F, KOHL S A A, et al nnU-NET: a self-configuring method for deep learning-based biomedical image segmentation[J]. Nature Methods, 2021, 18 (2): 203- 211
doi: 10.1038/s41592-020-01008-z
13 ISENSEE F, PETERSEN J, KLEIN A, et al. nnU-NET: self-adapting framework for U-Net-based medical image segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1809.10486.
14 MOU L, ZHAO Y, CHEN L, et al. CS-Net: channel and spatial attention network for curvilinear structure segmentation [C]//Medical Image Computing and Computer Assisted Intervention. Cham: Springer, 2019: 721–730.
15 STAAL J, ABRAMOFF M D, NIEMEIJER M, et al Ridge-based vessel segmentation in color images of the retina[J]. IEEE Transactions on Medical Imaging, 2004, 23 (4): 501- 509
doi: 10.1109/TMI.2004.825627
16 LIU Z, WANG Y, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2404.19756.
17 BODNER A D, TEPSICH A S, SPOLSKI J N, et al. Convolutional Kolmogorov-Arnold networks [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2406.13155.
18 LI C, LIU X, LI W, et al U-KAN makes strong backbone for medical image segmentation and generation[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39 (5): 4652- 4660
doi: 10.1609/aaai.v39i5.32491
19 MA X, WANG Z, HU Y, et al. Kolmogorov-Arnold network for remote sensing image semantic segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2501.07390.
20 AGRAWAL A, AGRAWAL A, GUPTA S, et al. KAN-Mamba FusionNet: redefining medical image segmentation with non-linear modeling [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2411.11926.
21 OKTAY O, SCHLEMPER J, FOLGOC L L, et al. Attention U-Net: learning where to look for the pancreas [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1804.03999.
22 ZHANG Z, LIU Q, WANG Y Road extraction by deep residual U-Net[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15 (5): 749- 753
doi: 10.1109/LGRS.2018.2802944
23 BILIC P, CHRIST P, LI H B, et al The Liver tumor segmentation benchmark (LiTS)[J]. Medical Image Analysis, 2023, 84: 102680
doi: 10.1016/j.media.2022.102680
24 SCHOENBERG I J. Contributions to the problem of approximation of equidistant data by analytic functions: part A. -on the problem of smoothing or graduation. a first class of analytic approximation formulae [J]. Quarterly of Applied Mathematics, 1946, 4(1): 45–99.
25 HOLLADAY J C A smoothest curve approximation[J]. Mathematical Tables and Other Aids to Computation, 1957, 11 (60): 233- 243
doi: 10.1090/s0025-5718-1957-0093894-6
26 MILLETARI F, NAVAB N, AHMADI S A. V-net: fully convolutional neural networks for volumetric medical image segmentation [C]// Proceedings of the Fourth International Conference on 3D Vision. Stanford: IEEE, 2016: 565–571.
27 EVERINGHAM M, VAN GOOL L, WILLIAMS C K I, et al The pascal visual object classes (VOC) challenge[J]. International Journal of Computer Vision, 2010, 88 (2): 303- 338
doi: 10.1007/s11263-009-0275-4
28 TAGHANAKI S A, ZHENG Y, ZHOU K S, et al Combo loss: handling input and output imbalance in multi-organ segmentation[J]. Computerized Medical Imaging and Graphics, 2019, 75: 24- 33
doi: 10.1016/j.compmedimag.2019.04.005
[1] 朱志航,闫云凤,齐冬莲. 基于扩散模型多模态提示的电力人员行为图像生成[J]. 浙江大学学报(工学版), 2026, 60(1): 43-51.
[2] 袁小平,王小倩,何祥,胡杨明. 用于遥感图像变化检测的深度监督网络[J]. 浙江大学学报(工学版), 2023, 57(10): 1966-1976.
[3] 王万良,王铁军,陈嘉诚,尤文波. 融合多尺度和多头注意力的医疗图像分割方法[J]. 浙江大学学报(工学版), 2022, 56(9): 1796-1805.
[4] 袁小平,何祥,王小倩,胡杨明. 基于多层级特征自适应融合的图像分割算法[J]. 浙江大学学报(工学版), 2022, 56(10): 1958-1966.
[5] 刘清清,周志勇,范国华,钱旭升,胡冀苏,陈光强,戴亚康. 基于3D scSE-UNet的肝脏CT图像半监督学习分割方法[J]. 浙江大学学报(工学版), 2021, 55(11): 2033-2044.
[6] 郑洲, 张学昌, 郑四鸣, 施岳定. 基于区域增长与统一化水平集的CT肝脏图像分割[J]. 浙江大学学报(工学版), 2018, 52(12): 2382-2396.
[7] 廖苗, 赵于前, 曾业战, 黄忠朝, 张丙奎, 邹北骥. 基于支持向量机和椭圆拟合的细胞图像自动分割[J]. 浙江大学学报(工学版), 2017, 51(4): 722-728.
[8] 张建廷,张立民. 新型自适应稳健双边滤波图像分割[J]. 浙江大学学报(工学版), 2016, 50(9): 1703-1710.
[9] 胡祝华, 赵瑶池, 程杰仁, 彭金莲. 基于改进DRLSE的运动目标分割方法[J]. 浙江大学学报(工学版), 2014, 48(8): 1488-1495.
[10] 刘中, 陈伟海, 吴星明, 邹宇华, 王建华. 基于双目视觉的显著性区域检测[J]. J4, 2014, 48(2): 354-359.
[11] 李光廷, 禹卫东. 马尔可夫随机场SAR图像分割的快速实现技术[J]. J4, 2012, 46(10): 1810-1815.
[12] 吴一全,张晓杰,吴诗婳,张生伟. 基于混沌PSO或分解的二维最小误差阈值分割[J]. J4, 2011, 45(7): 1198-1205.
[13] 谢强军, 侯迪波, 黄平捷, 张光新, 周泽魁. 基于半隐差分的单参数水平集快速分割[J]. J4, 2010, 44(8): 1496-1501.
[14] 蔡晋辉, 张光新, 才辉. 基于连通掩模的重构开算子及应用[J]. J4, 2010, 44(4): 675-680.
[15] 孔丁科, 汪国昭. 用于图像分割的边界保持局部拟合模型[J]. J4, 2010, 44(12): 2236-2240.