Please wait a minute...
Journal of ZheJiang University (Engineering Science)  2026, Vol. 60 Issue (6): 1277-1288    DOI: 10.3785/j.issn.1008-973X.2026.06.015
    
Medical image segmentation model based on KAN and CKAN optimization
Shimeng LOU1(),Yubin SHAO1,*(),Qingzhi DU1,Jingmin TANG1,Zetao ZHANG2
1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
2. Yunnan Province Key Laboratory for Media Integration, Kunming 650228, China
Download: HTML     PDF(1375KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

An optimized model KUNet based on Kolmogorov-Arnold network (KAN) and convolutional KAN (CKAN) was proposed to enhance the performance of the UNet model in order to address the limitation of the UNet model in complex feature extraction and generalization capability for medical image segmentation task. Traditional convolutional layer was replaced with CKAN, KAN feature enhancement module was introduced, and skip connection was optimized. Then the diversity and accuracy of feature extraction were improved while preserving structural information by incorporating an adaptive basis function learning mechanism. Comparative experiments were conducted against the UNet baseline model, nnUNet model and Swin-UNet model on four different multimodal datasets: LiTS, CORN, DRIVE and Lungs. Results showed that the average maximum absolute performance gap (MAPG) between the UNet baseline model and the KUNet model across the four datasets were 0.679 9 and 0.620 3 for Dice coefficient and IoU coefficient, respectively, and the KUNet model achieved average improvement metrics of 0.3213 and 0.2625 compared with the optimal or suboptimal model across the four datasets. The KUNet model was utilized to effectively extract more feature within short training cycle and improve the accuracy of image segmentation.



Key wordsimage segmentation      UNet      Kolmogorov-Arnold network (KAN)      convolutional Kolmogorov-Arnold network (CKAN)      maximum absolute performance gap (MAPG)     
Received: 28 June 2025      Published: 06 May 2026
CLC:  TP 393  
Fund:  云南省媒体融合重点实验室资助项目(220245203).
Corresponding Authors: Yubin SHAO     E-mail: 2962772160@qq.com;shaoyubin999@qq.com
Cite this article:

Shimeng LOU,Yubin SHAO,Qingzhi DU,Jingmin TANG,Zetao ZHANG. Medical image segmentation model based on KAN and CKAN optimization. Journal of ZheJiang University (Engineering Science), 2026, 60(6): 1277-1288.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2026.06.015     OR     https://www.zjujournals.com/eng/Y2026/V60/I6/1277


基于KAN与CKAN优化的医学图像分割模型

为了解决UNet模型在医学图像分割任务中的复杂特征提取与泛化能力不足的问题,提出基于Kolmogorov-Arnold网络(KAN)和卷积KAN(CKAN)的优化模型KUNet,增强UNet模型的性能. 通过用CKAN替换传统卷积层,引入KAN特征增强模块,优化跳跃连接,结合自适应基函数学习机制,在保留结构信息的同时提高特征提取的多样性与精度. 在4个不同的多模态数据集LiTS、CORN、DRIVE和Lungs上,与UNet基线模型、nnUNet模型和Swin-UNet模型进行对比实验. 结果表明,UNet基线模型与KUNet模型在4个数据集上Dice系数和IoU系数的平均最大性能差异指标(MAPG)分别为0.67990.6203,且KUNet模型相较于最优或次优模型在4个数据集上的平均提升指标为0.32130.2625. 利用KUNet模型,能够在短周期内有效提取到更多的特征,提升图像分割的准确度.


关键词: 图像分割,  UNet,  Kolmogorov-Arnold network(KAN),  卷积KAN(CKAN),  最大性能差异指标(MAPG) 
Fig.1 Architecture diagram of KUNet model
Fig.2 Architecture diagram of convolutional Kolmogorov-Arnold network model
数据集样本数量分辨率分割目标
LiTS400512×512肝脏
CORN1238384×384角膜神经
DRIVE20565×584眼底血管
Lungs200512×512
Tab.1 Dataset statistics table
Fig.3 Visual segmentation result on LiTS dataset
Fig.4 Visual segmentation result on CORN dataset
Fig.5 Visual segmentation result on DRIVE dataset
Fig.6 Visual segmentation result on Lungs dataset
Fig.7 Performance curve of UNet, KUNet, nnUNet and Swin-UNet model on LiTS dataset
Fig.8 Performance curve of UNet, KUNet, nnUNet and Swin-UNet model on CORN dataset
Fig.9 Performance curve of UNet, KUNet, nnUNet and Swin-UNet model on DRIVE dataset
Fig.10 Performance curve of UNet, KUNet, nnUNet and Swin-UNet model on Lungs dataset
数据集周期DiceMAPG1IoUMAPG2
UNet模型KUNet模型UNet模型KUNet模型
LiTS1000.97860.978600.95820.9582
CORN2000.06530.12640.06110.03390.06770.0338
DRIVE30000.71470.714700.55610.5561
Lungs1000.96510.965100.93320.9332
均值0.67990.6203
Tab.2 Maximum performance gap between KUNet model and UNet baseline model on each dataset
网络模型DiceIoU
LiTSCORNDRIVELungsLiTSCORNDRIVELungs
UNet0.98090.51330.81040.96090.96260.35310.68130.9248
nnUNet0.99170.32190.79810.97470.98370.20490.65210.9563
Swin-UNet0.98420.08150.68750.95710.96880.0430.52390.9177
KUNet0.99050.53020.93010.98420.98110.36810.86940.9689
$ \varDelta $0.13480.03530.92250.46230.14770.02350.83730.3369
$ \overline{\varDelta } $0.32130.2625
Tab.3 Best performance of UNet, KUNet, nnUNet and SWin-UNet model on each dataset
数据集$ {E}_{\text{UNet}} $$ {E}_{\text{nnUNet}} $$ {E}_{\text{Swin-UNet}} $$ {E}_{\text{KUNet}} $$ {T}_{\text{UNet}} $/s$ {T}_{\text{nnUNet}} $/s$ {T}_{\text{Swin-UNet}} $/s$ {T}_{\text{KUNet}} $/s
LiTS100100100100557.114344544998.17
CORN20001002000200037674.0455071538057941.40
DRIVE500050005000100001630.206393216595458.41
Lungs10010010050262.875509383241.84
Tab.4 Calculation efficiency of UNet, KUNet, nnUNet and Swin-UNet model on each dataset
模型DiceIoU
完整KUNet0.99050.9811
变体A0.98860.9774
变体B0.99040.9810
变体C0.98250.9657
变体D0.98920.9786
Tab.5 Comparison of ablation study result under different module configuration
[1]   GARCIA-GARCIA A, ORTS-ESCOLANO S, OPREA S, et al. A review on deep learning techniques applied to semantic segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1704.06857.
[2]   MINAEE S, BOYKOV Y, PORIKLI F, et al Image segmentation using deep learning: a survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 44 (7): 3523- 3542
[3]   ROTH H R, LU L, FARAG A, et al. DeepOrgan: multi-level deep convolutional networks for automated pancreas segmentation [C]//Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2015: 556–564.
[4]   LITJENS G, KOOI T, BEJNORDI B E, et al A survey on deep learning in medical image analysis[J]. Medical Image Analysis, 2017, 42: 60- 88
doi: 10.1016/j.media.2017.07.005
[5]   DEVALLA S K, PHAM T H, PANDA S K, et al Towards label-free 3D segmentation of optical coherence tomography images of the optic nerve head using deep learning[J]. Biomedical Optics Express, 2020, 11 (11): 6356- 6378
doi: 10.1364/BOE.395934
[6]   CORDTS M, OMRAN M, RAMOS S, et al. The cityscapes dataset for semantic urban scene understanding [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 3213–3223.
[7]   RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation [C]//Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2015: 234–241.
[8]   ZHOU Z, RAHMAN SIDDIQUEE M M, TAJBAKHSH N, et al. UNet++: a nested U-Net architecture for medical image segmentation [C]//Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Cham: Springer, 2018: 3–11.
[9]   ZHAO H, SHI J, QI X, et al. Pyramid scene parsing network [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 6230–6239.
[10]   CHEN L C, PAPANDREOU G, KOKKINOS I, et al DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40 (4): 834- 848
doi: 10.1109/TPAMI.2017.2699184
[11]   CAO H, WANG Y, CHEN J, et al. Swin-Unet: Unet-like pure transformer for medical image segmentation [C]// European Conference on Computer Vision. Cham: Springer, 2023: 205–218.
[12]   ISENSEE F, JAEGER P F, KOHL S A A, et al nnU-NET: a self-configuring method for deep learning-based biomedical image segmentation[J]. Nature Methods, 2021, 18 (2): 203- 211
doi: 10.1038/s41592-020-01008-z
[13]   ISENSEE F, PETERSEN J, KLEIN A, et al. nnU-NET: self-adapting framework for U-Net-based medical image segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1809.10486.
[14]   MOU L, ZHAO Y, CHEN L, et al. CS-Net: channel and spatial attention network for curvilinear structure segmentation [C]//Medical Image Computing and Computer Assisted Intervention. Cham: Springer, 2019: 721–730.
[15]   STAAL J, ABRAMOFF M D, NIEMEIJER M, et al Ridge-based vessel segmentation in color images of the retina[J]. IEEE Transactions on Medical Imaging, 2004, 23 (4): 501- 509
doi: 10.1109/TMI.2004.825627
[16]   LIU Z, WANG Y, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2404.19756.
[17]   BODNER A D, TEPSICH A S, SPOLSKI J N, et al. Convolutional Kolmogorov-Arnold networks [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2406.13155.
[18]   LI C, LIU X, LI W, et al U-KAN makes strong backbone for medical image segmentation and generation[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39 (5): 4652- 4660
doi: 10.1609/aaai.v39i5.32491
[19]   MA X, WANG Z, HU Y, et al. Kolmogorov-Arnold network for remote sensing image semantic segmentation [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2501.07390.
[20]   AGRAWAL A, AGRAWAL A, GUPTA S, et al. KAN-Mamba FusionNet: redefining medical image segmentation with non-linear modeling [EB/OL]. [2025-08-17]. https://arxiv.org/abs/2411.11926.
[21]   OKTAY O, SCHLEMPER J, FOLGOC L L, et al. Attention U-Net: learning where to look for the pancreas [EB/OL]. [2025-08-17]. https://arxiv.org/abs/1804.03999.
[22]   ZHANG Z, LIU Q, WANG Y Road extraction by deep residual U-Net[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15 (5): 749- 753
doi: 10.1109/LGRS.2018.2802944
[23]   BILIC P, CHRIST P, LI H B, et al The Liver tumor segmentation benchmark (LiTS)[J]. Medical Image Analysis, 2023, 84: 102680
doi: 10.1016/j.media.2022.102680
[24]   SCHOENBERG I J. Contributions to the problem of approximation of equidistant data by analytic functions: part A. -on the problem of smoothing or graduation. a first class of analytic approximation formulae [J]. Quarterly of Applied Mathematics, 1946, 4(1): 45–99.
[25]   HOLLADAY J C A smoothest curve approximation[J]. Mathematical Tables and Other Aids to Computation, 1957, 11 (60): 233- 243
doi: 10.1090/s0025-5718-1957-0093894-6
[26]   MILLETARI F, NAVAB N, AHMADI S A. V-net: fully convolutional neural networks for volumetric medical image segmentation [C]// Proceedings of the Fourth International Conference on 3D Vision. Stanford: IEEE, 2016: 565–571.
[27]   EVERINGHAM M, VAN GOOL L, WILLIAMS C K I, et al The pascal visual object classes (VOC) challenge[J]. International Journal of Computer Vision, 2010, 88 (2): 303- 338
doi: 10.1007/s11263-009-0275-4
[28]   TAGHANAKI S A, ZHENG Y, ZHOU K S, et al Combo loss: handling input and output imbalance in multi-organ segmentation[J]. Computerized Medical Imaging and Graphics, 2019, 75: 24- 33
doi: 10.1016/j.compmedimag.2019.04.005
[1] Zhihang ZHU,Yunfeng YAN,Donglian QI. Image generation for power personnel behaviors based on diffusion model with multimodal prompts[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 43-51.
[2] Xiao-ping YUAN,Xiao-qian WANG,Xiang HE,Yang-ming HU. Deep supervised network for change detection of remote sensing image[J]. Journal of ZheJiang University (Engineering Science), 2023, 57(10): 1966-1976.
[3] Wan-liang WANG,Tie-jun WANG,Jia-cheng CHEN,Wen-bo YOU. Medical image segmentation method combining multi-scale and multi-head attention[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(9): 1796-1805.
[4] Xiao-ping YUAN,Xiang HE,Xiao-qian WANG,Yang-ming HU. Image segmentation algorithm based on multi-level feature adaptive fusion[J]. Journal of ZheJiang University (Engineering Science), 2022, 56(10): 1958-1966.
[5] Qing-qing LIU,Zhi-yong ZHOU,Guo-hua FAN,Xu-sheng QIAN,Ji-su HU,Guang-qiang CHEN,Ya-kang DAI. Semi-supervised learning segmentation method of liver CT images based on 3D scSE-UNet[J]. Journal of ZheJiang University (Engineering Science), 2021, 55(11): 2033-2044.
[6] ZHENG Zhou, ZHANG Xue-chang, ZHENG Si-ming, SHI Yue-ding. Liver segmentation in CT images based on region-growing and unified level set method[J]. Journal of ZheJiang University (Engineering Science), 2018, 52(12): 2382-2396.
[7] LIAO Miao, ZHAO Yu-qian, ZENG Ye-zhan, HUANG Zhong-chao, ZHANG Bing-kui, ZOU Bei-ji. Automatic segmentation for cell images based on support vector machine and ellipse fitting[J]. Journal of ZheJiang University (Engineering Science), 2017, 51(4): 722-728.
[8] ZHANG Jian ting,ZHANG Li min. Image segmentation using novel adaptive robust bilateral filter[J]. Journal of ZheJiang University (Engineering Science), 2016, 50(9): 1703-1710.
[9] HU Zhu-hua, ZHAO Yao-chi, CHENG Jie-ren, PENG Jin-lian. Moving object segmentation method based on improved DRLSE[J]. Journal of ZheJiang University (Engineering Science), 2014, 48(8): 1488-1495.
[10] LIU Zhong, CHEN Wei-hai, WU Xing-ming, ZOU Yu-hua, WANG Jian-hua. Salient region detection based on stereo vision[J]. Journal of ZheJiang University (Engineering Science), 2014, 48(2): 354-359.
[11] PENG Hai, ZHAO Ju-feng, FENG Hua-jun, XU Zhi-hai, LI Qi, CHEN Yue-ting. Dual band image fusion method based on region saliency[J]. Journal of ZheJiang University (Engineering Science), 2012, 46(11): 2109-2115.
[12] LI Guang-ting, YU Wei-dong. Fast implementation of SAR image segmentation using
Markov random fields
[J]. Journal of ZheJiang University (Engineering Science), 2012, 46(10): 1810-1815.
[13] WU Yi-quan, ZHANG Xiao-jie, WU Shi-hua, ZHANG Sheng-wei. Two-dimensional minimum error thresholding based on chaotic
particle swarm optimization or decomposition
[J]. Journal of ZheJiang University (Engineering Science), 2011, 45(7): 1198-1205.
[14] XIE Jiang-Jun, HOU Di-Bei, HUANG Beng-Cha, ZHANG Guang-Xin, ZHOU Ze-Kuai. Fast single parameter level set segmentation based on
semi-implicit schemes
[J]. Journal of ZheJiang University (Engineering Science), 2010, 44(8): 1496-1501.
[15] CA Jin-Hui, ZHANG Guang-Xin, CAI Hui. Theory and application of connectivity maskbased reconstruction
opening operator
[J]. Journal of ZheJiang University (Engineering Science), 2010, 44(4): 675-680.