基于KAN与CKAN优化的医学图像分割模型

doi:10.3785/j.issn.1008-973X.2026.06.015

浙江大学学报(工学版)

2026, Vol. 60

Issue (6): 1277-1288 DOI: 10.3785/j.issn.1008-973X.2026.06.015

计算机技术

基于KAN与CKAN优化的医学图像分割模型

娄世猛1(

),邵玉斌1,*(

),杜庆治1,唐菁敏1,张赜涛2

1. 昆明理工大学信息工程与自动化学院，云南昆明 650500
2. 云南省媒体融合重点实验室，云南昆明 650228

Medical image segmentation model based on KAN and CKAN optimization

Shimeng LOU1(

),Yubin SHAO1,*(

),Qingzhi DU1,Jingmin TANG1,Zetao ZHANG2

1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
2. Yunnan Province Key Laboratory for Media Integration, Kunming 650228, China

全文: PDF(1375 KB) HTML

摘要：

为了解决UNet模型在医学图像分割任务中的复杂特征提取与泛化能力不足的问题，提出基于Kolmogorov-Arnold网络（KAN）和卷积KAN（CKAN）的优化模型KUNet，增强UNet模型的性能. 通过用CKAN替换传统卷积层，引入KAN特征增强模块，优化跳跃连接，结合自适应基函数学习机制，在保留结构信息的同时提高特征提取的多样性与精度. 在4个不同的多模态数据集LiTS、CORN、DRIVE和Lungs上，与UNet基线模型、nnUNet模型和Swin-UNet模型进行对比实验. 结果表明，UNet基线模型与KUNet模型在4个数据集上Dice系数和IoU系数的平均最大性能差异指标（MAPG）分别为0.6799和0.6203，且KUNet模型相较于最优或次优模型在4个数据集上的平均提升指标为0.3213和0.2625. 利用KUNet模型，能够在短周期内有效提取到更多的特征，提升图像分割的准确度.

关键词： 图像分割; UNet; Kolmogorov-Arnold network（KAN）; 卷积KAN（CKAN）; 最大性能差异指标(MAPG)

Abstract:

An optimized model KUNet based on Kolmogorov-Arnold network (KAN) and convolutional KAN (CKAN) was proposed to enhance the performance of the UNet model in order to address the limitation of the UNet model in complex feature extraction and generalization capability for medical image segmentation task. Traditional convolutional layer was replaced with CKAN, KAN feature enhancement module was introduced, and skip connection was optimized. Then the diversity and accuracy of feature extraction were improved while preserving structural information by incorporating an adaptive basis function learning mechanism. Comparative experiments were conducted against the UNet baseline model, nnUNet model and Swin-UNet model on four different multimodal datasets: LiTS, CORN, DRIVE and Lungs. Results showed that the average maximum absolute performance gap (MAPG) between the UNet baseline model and the KUNet model across the four datasets were 0.679 9 and 0.620 3 for Dice coefficient and IoU coefficient, respectively, and the KUNet model achieved average improvement metrics of 0.3213 and 0.2625 compared with the optimal or suboptimal model across the four datasets. The KUNet model was utilized to effectively extract more feature within short training cycle and improve the accuracy of image segmentation.

Key words: image segmentation UNet Kolmogorov-Arnold network (KAN) convolutional Kolmogorov-Arnold network (CKAN) maximum absolute performance gap (MAPG)

收稿日期: 2025-06-28 出版日期: 2026-05-06

CLC:

TP 393

基金资助: 云南省媒体融合重点实验室资助项目（220245203）.

通讯作者: 邵玉斌 E-mail: 2962772160@qq.com;shaoyubin999@qq.com

作者简介: 娄世猛（2001—），男，硕士生，从事智能信息处理研究. orcid.org/0009-0002-3212-4318.E-mail：2962772160@qq.com

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	娄世猛
	邵玉斌
	杜庆治
	唐菁敏
	张赜涛

引用本文:

娄世猛,邵玉斌,杜庆治,唐菁敏,张赜涛. 基于KAN与CKAN优化的医学图像分割模型[J]. 浙江大学学报(工学版), 2026, 60(6): 1277-1288.

Shimeng LOU,Yubin SHAO,Qingzhi DU,Jingmin TANG,Zetao ZHANG. Medical image segmentation model based on KAN and CKAN optimization. Journal of ZheJiang University (Engineering Science), 2026, 60(6): 1277-1288.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.06.015 或 https://www.zjujournals.com/eng/CN/Y2026/V60/I6/1277

图 1 KUNet模型的架构图

图 2 CKAN网络模型的架构图

表 1 数据集的统计表

图 3 在LiTS数据集上的可视化分割结果

图 4 在CORN数据集上的可视化分割结果

图 5 在DRIVE数据集上的可视化分割结果

图 6 在Lungs数据集上的可视化分割结果

图 7 UNet、KUNet、nnUNet、Swin-UNet模型在LiTS数据集上的性能曲线

图 8 UNet、KUNet、nnUNet、Swin-UNet模型在CORN数据集上的性能曲线

图 9 UNet、KUNet、nnUNet、Swin-UNet模型在DRIVE数据集上的性能曲线

图 10 UNet、KUNet、nnUNet、Swin-UNet模型在Lungs数据集上的性能曲线

表 2 KUNet模型与UNet基线模型在各数据集上的最大性能差距

表 3 UNet、KUNet、nnUNet、SWin-UNet模型在各数据集上的最佳性能

表 4 UNet、KUNet、nnUNet、Swin-UNet模型在各数据集上的计算效率

表 5 不同模块配置下的消融实验结果比较

1	GARCIA-GARCIA A, ORTS-ESCOLANO S, OPREA S, et al. A review on deep learning techniques applied to semantic segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1704.06857.
2	MINAEE S, BOYKOV Y, PORIKLI F, et al Image segmentation using deep learning: a survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 44 (7): 3523- 3542
3	ROTH H R, LU L, FARAG A, et al. DeepOrgan: multi-level deep convolutional networks for automated pancreas segmentation [C]//Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2015: 556–564.
4	LITJENS G, KOOI T, BEJNORDI B E, et al A survey on deep learning in medical image analysis[J]. Medical Image Analysis, 2017, 42: 60- 88 doi: 10.1016/j.media.2017.07.005
5	DEVALLA S K, PHAM T H, PANDA S K, et al Towards label-free 3D segmentation of optical coherence tomography images of the optic nerve head using deep learning[J]. Biomedical Optics Express, 2020, 11 (11): 6356- 6378 doi: 10.1364/BOE.395934
6	CORDTS M, OMRAN M, RAMOS S, et al. The cityscapes dataset for semantic urban scene understanding [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 3213–3223.
7	RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation [C]//Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2015: 234–241.
8	ZHOU Z, RAHMAN SIDDIQUEE M M, TAJBAKHSH N, et al. UNet++: a nested U-Net architecture for medical image segmentation [C]//Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Cham: Springer, 2018: 3–11.
9	ZHAO H, SHI J, QI X, et al. Pyramid scene parsing network [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 6230–6239.
10	CHEN L C, PAPANDREOU G, KOKKINOS I, et al DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40 (4): 834- 848 doi: 10.1109/TPAMI.2017.2699184
11	CAO H, WANG Y, CHEN J, et al. Swin-Unet: Unet-like pure transformer for medical image segmentation [C]// European Conference on Computer Vision. Cham: Springer, 2023: 205–218.
12	ISENSEE F, JAEGER P F, KOHL S A A, et al nnU-NET: a self-configuring method for deep learning-based biomedical image segmentation[J]. Nature Methods, 2021, 18 (2): 203- 211 doi: 10.1038/s41592-020-01008-z
13	ISENSEE F, PETERSEN J, KLEIN A, et al. nnU-NET: self-adapting framework for U-Net-based medical image segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1809.10486.
14	MOU L, ZHAO Y, CHEN L, et al. CS-Net: channel and spatial attention network for curvilinear structure segmentation [C]//Medical Image Computing and Computer Assisted Intervention. Cham: Springer, 2019: 721–730.
15	STAAL J, ABRAMOFF M D, NIEMEIJER M, et al Ridge-based vessel segmentation in color images of the retina[J]. IEEE Transactions on Medical Imaging, 2004, 23 (4): 501- 509 doi: 10.1109/TMI.2004.825627
16	LIU Z, WANG Y, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2404.19756.
17	BODNER A D, TEPSICH A S, SPOLSKI J N, et al. Convolutional Kolmogorov-Arnold networks [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2406.13155.
18	LI C, LIU X, LI W, et al U-KAN makes strong backbone for medical image segmentation and generation[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39 (5): 4652- 4660 doi: 10.1609/aaai.v39i5.32491
19	MA X, WANG Z, HU Y, et al. Kolmogorov-Arnold network for remote sensing image semantic segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2501.07390.
20	AGRAWAL A, AGRAWAL A, GUPTA S, et al. KAN-Mamba FusionNet: redefining medical image segmentation with non-linear modeling [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2411.11926.
21	OKTAY O, SCHLEMPER J, FOLGOC L L, et al. Attention U-Net: learning where to look for the pancreas [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1804.03999.
22	ZHANG Z, LIU Q, WANG Y Road extraction by deep residual U-Net[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15 (5): 749- 753 doi: 10.1109/LGRS.2018.2802944
23	BILIC P, CHRIST P, LI H B, et al The Liver tumor segmentation benchmark (LiTS)[J]. Medical Image Analysis, 2023, 84: 102680 doi: 10.1016/j.media.2022.102680
24	SCHOENBERG I J. Contributions to the problem of approximation of equidistant data by analytic functions: part A. -on the problem of smoothing or graduation. a first class of analytic approximation formulae [J]. Quarterly of Applied Mathematics, 1946, 4(1): 45–99.
25	HOLLADAY J C A smoothest curve approximation[J]. Mathematical Tables and Other Aids to Computation, 1957, 11 (60): 233- 243 doi: 10.1090/s0025-5718-1957-0093894-6
26	MILLETARI F, NAVAB N, AHMADI S A. V-net: fully convolutional neural networks for volumetric medical image segmentation [C]// Proceedings of the Fourth International Conference on 3D Vision. Stanford: IEEE, 2016: 565–571.
27	EVERINGHAM M, VAN GOOL L, WILLIAMS C K I, et al The pascal visual object classes (VOC) challenge[J]. International Journal of Computer Vision, 2010, 88 (2): 303- 338 doi: 10.1007/s11263-009-0275-4
28	TAGHANAKI S A, ZHENG Y, ZHOU K S, et al Combo loss: handling input and output imbalance in multi-organ segmentation[J]. Computerized Medical Imaging and Graphics, 2019, 75: 24- 33 doi: 10.1016/j.compmedimag.2019.04.005

[1]	朱志航,闫云凤,齐冬莲. 基于扩散模型多模态提示的电力人员行为图像生成[J]. 浙江大学学报(工学版), 2026, 60(1): 43-51.
[2]	袁小平,王小倩,何祥,胡杨明. 用于遥感图像变化检测的深度监督网络[J]. 浙江大学学报(工学版), 2023, 57(10): 1966-1976.
[3]	王万良,王铁军,陈嘉诚,尤文波. 融合多尺度和多头注意力的医疗图像分割方法[J]. 浙江大学学报(工学版), 2022, 56(9): 1796-1805.
[4]	袁小平,何祥,王小倩,胡杨明. 基于多层级特征自适应融合的图像分割算法[J]. 浙江大学学报(工学版), 2022, 56(10): 1958-1966.
[5]	刘清清,周志勇,范国华,钱旭升,胡冀苏,陈光强,戴亚康. 基于3D scSE-UNet的肝脏CT图像半监督学习分割方法[J]. 浙江大学学报(工学版), 2021, 55(11): 2033-2044.
[6]	郑洲, 张学昌, 郑四鸣, 施岳定. 基于区域增长与统一化水平集的CT肝脏图像分割[J]. 浙江大学学报(工学版), 2018, 52(12): 2382-2396.
[7]	廖苗, 赵于前, 曾业战, 黄忠朝, 张丙奎, 邹北骥. 基于支持向量机和椭圆拟合的细胞图像自动分割[J]. 浙江大学学报(工学版), 2017, 51(4): 722-728.
[8]	张建廷,张立民. 新型自适应稳健双边滤波图像分割[J]. 浙江大学学报(工学版), 2016, 50(9): 1703-1710.
[9]	胡祝华, 赵瑶池, 程杰仁, 彭金莲. 基于改进DRLSE的运动目标分割方法[J]. 浙江大学学报(工学版), 2014, 48(8): 1488-1495.
[10]	刘中, 陈伟海, 吴星明, 邹宇华, 王建华. 基于双目视觉的显著性区域检测[J]. J4, 2014, 48(2): 354-359.
[11]	李光廷, 禹卫东. 马尔可夫随机场SAR图像分割的快速实现技术[J]. J4, 2012, 46(10): 1810-1815.
[12]	吴一全,张晓杰,吴诗婳,张生伟. 基于混沌PSO或分解的二维最小误差阈值分割[J]. J4, 2011, 45(7): 1198-1205.
[13]	谢强军, 侯迪波, 黄平捷, 张光新, 周泽魁. 基于半隐差分的单参数水平集快速分割[J]. J4, 2010, 44(8): 1496-1501.
[14]	蔡晋辉, 张光新, 才辉. 基于连通掩模的重构开算子及应用[J]. J4, 2010, 44(4): 675-680.
[15]	孔丁科, 汪国昭. 用于图像分割的边界保持局部拟合模型[J]. J4, 2010, 44(12): 2236-2240.

Viewed

Full text

Abstract

Cited

Shared

Discussed