Please wait a minute...
浙江大学学报(工学版)  2026, Vol. 60 Issue (4): 772-781    DOI: 10.3785/j.issn.1008-973X.2026.04.009
计算机技术     
基于KAN和U-Net网络的颌面结构全景分割方法
蔡智(),周正东*(),袁晓曦,杨泽毅,袁梦瑶
南京航空航天大学 航空航天结构力学及控制全国重点实验室,江苏 南京 210016
A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks
Zhi CAI(),Zhengdong ZHOU*(),Xiaoxi YUAN,Zeyi YANG,Mengyao YUAN
State Key Laboratory of Mechanics and Control for Aerospace Structures, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
 全文: PDF(2185 KB)   HTML
摘要:

为了提升复杂颌面结构的全景分割性能,提出结合Kolmogorov-Arnold表示定理与U-Net架构的多尺度特征提取模型(MC-UKAN). 该模型通过融合位置先验、可学习非线性激活函数和多尺度特征提取,有效增强了特征表达能力. 基于该模型,设计三阶段颌面结构全景分割框架,第1阶段在低分辨率图像中利用MC-UKAN进行粗分割和牙位标定;第2阶段在原始图像上,基于粗分割结果定位目标结构,利用K-means聚类算法将目标划分为5类,采用对应网络进行精细分割;第3阶段使用轻量化网络在原始图像上分割出颌骨及咽喉. 通过融合多尺度分割结果,实现对牙齿、牙髓、颌骨、神经管等结构的精确分割. 在ToothFairy3数据集上的实验结果表明,该方法在74类口腔颌面结构的分割中达到了88.3%的Dice系数和5.04 mm的HD95,平均推理时间为27.04 s,展现出优越的性能.

关键词: 口腔分割锥形束计算机断层扫描(CBCT)Kolmogorov-Arnold表示定理ToothFairy3牙位标定    
Abstract:

To improve the performance of panoramic segmentation of complex maxillofacial structures, a multi-scale feature extraction model (MC-UKAN) was proposed, which combined the Kolmogorov-Arnold representation theorem with the U-Net architecture. The model integrated position priors, learnable nonlinear activation functions, and multi-scale feature extraction to effectively enhance feature expression ability. Based on this model, a three-stage maxillofacial structure panoramic segmentation framework was designed. In the first stage, coarse segmentation and tooth position calibration were performed on low-resolution images using MC-UKAN. In the second stage, the target structures were localized using the coarse segmentation results on the original images, the K-means clustering algorithm was then applied to categorize these targets into five classes, and corresponding networks were employed for fine-grained segmentation. In the third stage, a lightweight network was used to segment the jawbones and pharynx on the original images. By fusing multi-scale segmentation results, precise segmentation of structures such as teeth, dental pulp, jawbones, and nerve canals was achieved. Experimental results on the ToothFairy3 dataset demonstrated that the proposed method achieved an 88.3% Dice coefficient and 5.04 mm HD95 for the segmentation of 74 categories of oral and maxillofacial structures, with an average inference time of 27.04 seconds. These results fully validated the superior performance of the proposed method in complex maxillofacial structure segmentation tasks.

Key words: oral segmentation    cone beam computed tomography (CBCT)    Kolmogorov-Arnold representation theorem    ToothFairy3    dental alignment
收稿日期: 2025-08-08 出版日期: 2026-03-19
CLC:  TP 393  
基金资助: 国家自然科学基金资助项目 (52375570); 中国航空研究院首批揭榜挂帅项目(F2021109); 上海航天科技创新基金资助项目(SAST2019-121); 南京航空航天大学研究生科研与实践创新计划项目(xcxjh20240111,xcxjh20240110).
通讯作者: 周正东     E-mail: caizhi123@nuaa.edu.cn;zzd_msc@nuaa.edu.cn
作者简介: 蔡智(1998—),男,硕士生,从事医学图像处理研究. orcid.org/0009-0002-2608-4030. E-mail:caizhi123@nuaa.edu.cn
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
作者相关文章  
蔡智
周正东
袁晓曦
杨泽毅
袁梦瑶

引用本文:

蔡智,周正东,袁晓曦,杨泽毅,袁梦瑶. 基于KAN和U-Net网络的颌面结构全景分割方法[J]. 浙江大学学报(工学版), 2026, 60(4): 772-781.

Zhi CAI,Zhengdong ZHOU,Xiaoxi YUAN,Zeyi YANG,Mengyao YUAN. A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks. Journal of ZheJiang University (Engineering Science), 2026, 60(4): 772-781.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.04.009        https://www.zjujournals.com/eng/CN/Y2026/V60/I4/772

图 1  三阶段颌面分割框架
图 2  5类结构的 K-means 与 t-SNE 可视化结果图
图 3  MC-UKAN架构
图 4  MC-KAN卷积模块结构
图 5  ToothFairy3的完整注释样本
图 6  预处理图像
网络Dice/%
均值(不计牙髓)颌骨IAC上颌窦咽喉牙齿修复体牙髓
CNNsnnU-Net[20]70.9290.3171.3464.8195.6673.1729.50
nnU-Net ResEnc[20]74.1691.7773.0165.7195.2676.4837.10
Transf.TransU-Net[20]70.3290.3381.9659.6987.8972.4627.68
nnFormer[20]76.7991.5072.2875.1190.8579.3718.95
UNETR++[20]71.4392.3868.8774.5191.6073.7026.21
MambaUMamba[20]85.0590.0585.2677.0292.1886.5843.89
VMamba[20]73.1390.7560.6273.2088.2375.9914.86
Swin-UMamba[20]79.6494.2972.6487.7593.0580.7125.30
3-Stage3D UKAN_SE86.5194.9582.6276.8495.9689.4980.4986.09
PMFSNet3D87.9695.9484.9973.8095.9688.8280.2785.10
本研究模型88.8795.9584.2282.0995.9689.5181.1887.55
表 1  不同的方法Dice系数比较
网络HD95/mm
均值(不计牙髓)颌骨IAC上颌窦咽喉牙齿修复体牙髓
CNNsnnU-Net[20]17.8612.5329.1128.3919.2318.3220.95
nnU-Net ResEnc[20]14.4812.5327.8129.9917.7514.2616.32
Transf.TransU-Net[20]20.1749.0211.9959.7642.8715.0422.65
nnFormer[20]5.4520.0910.068.2224.532.9413.67
UNETR++[20]17.235.4415.1015.4818.9617.9119.70
MambaUMamba[20]5.2822.1716.234.3525.252.2313.97
VMamba[20]5.177.179.949.8923.953.3914.63
Swin-UMamba[20]2.944.252.022.382.792.5914.93
3-Stage3D UKAN_SE5.271.629.968.201.172.1824.872.86
PMFSNet3D5.481.6311.385.251.172.3729.032.80
本研究模型5.041.5910.782.331.171.9624.822.11
表 2  不同的方法HD95系数比较
方法数据集分割结构测试集样本数Dice/%
Daza等[13]ToothFairy2上下颌骨、IAC、上颌窦、咽喉、牙齿、修复体5085.90
Wodzinski等[21]ToothFairy2上颌骨、上颌窦、上颌牙齿、修复体、咽喉5073.80
下颌骨、IAC、下颌牙齿、修复体5087.80
薄士仕等[14]私有数据集牙齿(有修复体)1484.74
牙齿(无修复体)987.88
本研究模型ToothFairy3上下颌骨、IAC、上颌窦、咽喉、牙齿、修复体、牙髓10888.30
表 3  带牙位标定的分类平均Dice系数比较
图 7  上/下颌骨、咽喉、上颌窦、IAC与牙齿及其牙髓的分割结果
网络Dice/%
均值颌骨IAC上颌窦咽喉牙齿修复体
3D U-Net83.7186.9077.4695.5095.3683.3578.95
3D Unet_SE86.0088.3678.0695.4296.4885.7482.73
3D U-KAN85.8488.7179.3097.7795.6385.6280.50
3D UKAN_SE86.1988.5678.5694.4196.5786.1581.27
PMFSNet3D84.8087.8279.1688.6396.5084.7980.21
MC-UKAN(3×3×3)83.7085.4077.1583.5195.8883.6883.25
MC-UKAN(5×5×5)84.1386.5076.2181.4995.9383.9683.58
MC-UKAN(单一设备商)85.7195.90(下颌)79.3194.3385.9182.64
MC-UKAN86.8986.1077.1396.6196.0787.8783.75
表 4  粗分割模块不同网络的分类Dice系数比较
图 8  不同卷积方式下第2层编码器后横截面切片
模块结构Dice/%HD95/mm
局部高分辨细
分割模块
IAC94.690.97
上颌窦98.261.00
前牙及牙髓93.582.03
磨牙及牙髓90.693.30
修复体93.492.27
大尺度结构
分割模块
上颌骨85.7211.10
下颌骨96.452.57
咽喉95.962.60
表 5  局部高分辨细分割模块与大尺度结构分割模块的实验结果
模块耗时/s
粗分割模块3.86
大尺度结构分割模块2.46
局部高分辨细分割模块20.72
总计27.04
表 6  三阶段模块推理时间
PE3DNMC-KAN$ {N}_{\text{CNN}} $Dice%
0586.00
1482.66
2385.79
3285.54
4185.95
5086.89
×5086.06
表 7  PE3D模块与MC-KAN卷积层的消融实验结果
Model Scale(C1,C2, C3, C4, C5)Dice/%
MC-UKAN_S(16,32,64,128,256)71.28
MC-UKAN_M(32,64,128,256,512)83.79
MC-UKAN(32,64,256,512,1024)86.89
表 8  不同规模MC-UKAN模型的分割实验结果对比
噪声类型$ \sigma $$ \lambda $$ \delta $Dice/%
高斯噪声0.0286.50
泊松噪声0.587.94
脉冲噪声0.0186.68
高斯+泊松0.020.586.00
高斯+脉冲0.020.0184.42
泊松+脉冲0.50.0186.09
高斯+泊松+脉冲0.020.50.0183.98
无噪声88.30
表 9  ToothFairy3 测试集上添加不同噪声的分割实验结果对比
1 LOPES DOS SANTOS G, IKUTA C R S, SALZEDAS L M P, et al Canalis sinuosus: an anatomic repair that may prevent success of dental implants in anterior maxilla[J]. Journal of Prosthodontics, 2020, 29 (9): 751- 755
doi: 10.1111/jopr.13256
2 LIU Z, YANG D, ZHANG M, et al Inferior alveolar nerve canal segmentation on CBCT using U-Net with frequency attentions[J]. Bioengineering, 2024, 11 (4): 354
doi: 10.4274/meandros.galenos.2021.14632
3 JIANG B, ZHANG S, SHI M, et al Alternate level set evolutions with controlled switch for tooth segmentation[J]. IEEE Access, 2022, 10: 76563- 76572
doi: 10.1109/ACCESS.2022.3192411
4 SABANCı S, ŞENER E, TURHAL R I, et al Is manual segmentation the real gold standard for tooth segmentation? a preliminary in vivo study using cone-beam computed tomography images[J]. Meandros Medical and Dental Journal, 2021, 22 (3): 263- 273
doi: 10.4274/meandros.galenos.2021.14632
5 TRELENBERG-STOLL V, DRESCHER D, WOLF M, et al Automated tooth segmentation as an innovative tool to assess 3D-tooth movement and root resorption in rodents[J]. Head and Face Medicine, 2021, 17 (1): 3
doi: 10.1186/s13005-020-00254-y
6 POLIZZI A, QUINZI V, RONSIVALLE V, et al Tooth automatic segmentation from CBCT images: a systematic review[J]. Clinical Oral Investigations, 2023, 27 (7): 3363- 3378
doi: 10.1007/s00784-023-05048-5
7 ABESI F, HOZURI M, ZAMANI M Performance of artificial intelligence using cone-beam computed tomography for segmentation of oral and maxillofacial structures: a systematic review and meta-analysis[J]. Journal of Clinical and Experimental Dentistry, 2023, 15 (11): e954- e962
doi: 10.4317/jced.60287
8 CUI Z, LI C, WANG W. ToothNet: automatic tooth instance segmentation and identification from cone beam CT images [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 6361–6370.
9 LIU Y, XIE R, WANG L, et al Fully automatic AI segmentation of oral surgery-related tissues based on cone beam computed tomography images[J]. International Journal of Oral Science, 2024, 16: 34
doi: 10.1038/s41368-024-00294-z
10 WU Q, FENG B, LI W, et al Automatic segmentation and visualization of cortical and marrow bone in mandibular condyle on CBCT: a preliminary exploration of clinical application[J]. Oral Radiology, 2025, 41 (1): 88- 101
doi: 10.1007/s11282-024-00780-4
11 WEI X, WANG Y Inferior alveolar canal segmentation based on cone-beam computed tomography[J]. Medical Physics, 2021, 48 (11): 7074- 7088
doi: 10.1002/mp.15274
12 MORGAN N, VAN GERVEN A, SMOLDERS A, et al Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images[J]. Scientific Reports, 2022, 12: 7523
doi: 10.1038/s41598-022-11483-3
13 DAZA L, SCHNABEL J. DiENTeS: dynamic ENTity segmentation with local-global transformers [C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland, 2025: 21–29.
14 薄士仕, 高承志 基于卷积神经网络实现锥形束CT牙齿分割及牙位标定[J]. 北京大学学报: 医学版, 2024, 56 (4): 735- 740
BO Shishi, GAO Chengzhi Tooth segmentation and identification on cone-beam computed tomography with convolutional neural network based on spatial embedding information[J]. Journal of Peking University: Health Sciences, 2024, 56 (4): 735- 740
doi: 10.19723/j.issn.1671-167X.2024.04.030
15 HU J, SHEN L, SUN G. Squeeze-and-excitation networks [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132–7141.
16 LIU Z, WANG Y, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks [EB/OL]. (2025−02−09) [2025−10−13]. https://arxiv.org/pdf/2404.19756.
17 IMAK A, ÇELEBI A, POLAT O, et al ResMIBCU-Net: an encoder–decoder network with residual blocks, modified inverted residual block, and bi-directional ConvLSTM for impacted tooth segmentation in panoramic X-ray images[J]. Oral Radiology, 2023, 39 (4): 614- 628
doi: 10.1007/s11282-023-00677-8
18 LUMETTI L, PIPOLI V, BOLELLI F, et al Enhancing patch-based learning for the segmentation of the mandibular canal[J]. IEEE Access, 2024, 12: 79014- 79024
doi: 10.1109/ACCESS.2024.3408629
19 BOLELLI F, LUMETTI L, VINAYAHALINGAM S, et al Segmenting the inferior alveolar canal in CBCTs volumes: the ToothFairy challenge[J]. IEEE Transactions on Medical Imaging, 2025, 44 (4): 1890- 1906
doi: 10.1109/TMI.2024.3523096
20 BOLELLI F, MARCHESINI K, VAN NISTELROOIJ N, et al. Segmenting maxillofacial structures in CBCT volumes [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2025: 5238–5248.
21 WODZINSKI M, MÜLLER H. Automatic multi-structure segmentation in cone beam computed tomography volumes using deep encoder-decoder architectures [C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland, 2025: 63–71.
22 LI C, LIU X, LI W, et al U-KAN makes strong backbone for medical image segmentation and generation[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39 (5): 4652- 4660
doi: 10.1609/aaai.v39i5.32491
23 ZHONG J, TIAN W, XIE Y, et al PMFSNet: polarized multi-scale feature self-attention network for lightweight medical image segmentation[J]. Computer Methods and Programs in Biomedicine, 2025, 261: 108611
doi: 10.1016/j.cmpb.2025.108611
24 LIU Y, CASTRO M, LEDERLIN M, et al An improved nonlinear diffusion in Laplacian pyramid domain for cone beam CT denoising during image-guided vascular intervention[J]. BMC Medical Imaging, 2018, 18 (1): 25
doi: 10.1186/s12880-018-0269-1
25 ZHU L, WANG J, XING L Noise suppression in scatter correction for cone-beam CT[J]. Medical Physics, 2009, 36 (3): 741- 752
doi: 10.1118/1.3063001
[1] 陈思如,舒元超. 多模态大模型边缘部署与推理加速技术综述[J]. 浙江大学学报(工学版), 2026, 60(4): 723-737.
[2] 奎晓燕,张敏,肖伶,李钦松,陈立明,张文生,邹北骥. 数据去重与缩减技术的系统分类与性能分析[J]. 浙江大学学报(工学版), 2026, 60(2): 287-302.
[3] 王彦乐,张瑞峰,李锵. 融合全局信息和对比学习的图神经网络推荐模型[J]. 浙江大学学报(工学版), 2026, 60(2): 351-359.
[4] 陈文强,冯琳越,王东丹,顾玉磊,赵轩. 融合动态风险图与多变量注意力机制的车辆轨迹预测模型[J]. 浙江大学学报(工学版), 2026, 60(3): 455-467.
[5] 吕君陶,祁珏瑜,于淏辰,马雷,马惠敏,胡天宇. 自动驾驶综合仿真平台的现状与展望[J]. 浙江大学学报(工学版), 2026, 60(3): 513-526.
[6] 包晓安,陈恩琳,张娜,涂小妹,吴彪,张庆琪. 基于多尺度编码器融合的三维人体姿态估计算法[J]. 浙江大学学报(工学版), 2026, 60(3): 565-573.
[7] 李智慧,邓琨,许聪源. 基于关系嵌入的物联网未知攻击检测方法[J]. 浙江大学学报(工学版), 2026, 60(3): 624-632.
[8] 吴越,梁铮,高巍,杨茂达,赵培森,邓红霞,常媛媛. 基于SMPL模态分解与嵌入融合的多模态步态识别[J]. 浙江大学学报(工学版), 2026, 60(1): 52-60.
[9] 陈沪航,吕泉,苏子航,张君樵,陈筑,韩旭. 计及容量电费的供热机组运行模式选择模型[J]. 浙江大学学报(工学版), 2026, 60(1): 169-178.
[10] 何浩,舒永东,林勇刚,代富全,张举. 变负载下大功率船舶全回转推进器水动及液控仿真[J]. 浙江大学学报(工学版), 2026, 60(1): 191-198.
[11] 于翼铭,蔡巍,李毅,付玮,姚旭,张停毅,刁尚祺,李丹,林松清,陈永顺. 基于微电极阵列的环境重金属检测研究进展[J]. 浙江大学学报(工学版), 2026, 60(1): 217-230.
[12] 张凌浩,谭海波,赵赫,陈中,程昊天,马志宇. CompuDEX:基于区块链的大模型微调算力共享平台[J]. 浙江大学学报(工学版), 2026, 60(1): 1-18.
[13] 杨延璞,伍智泓,孟文昊,卓玥鸣,刘嘉玲. 基于Hammerstein模型和表面肌电的过头作业上肢肌肉疲劳评估[J]. 浙江大学学报(工学版), 2025, 59(12): 2483-2494.
[14] 刘宇轩,刘毅志,廖祝华,邹正标,汤璟昕. 面向动态交通流量预测的自适应图注意Transformer[J]. 浙江大学学报(工学版), 2025, 59(12): 2585-2592.
[15] 周雄雄,何秋江,何纪春,周璟. 考虑力学成因的高心墙坝变形解耦及参数反演[J]. 浙江大学学报(工学版), 2025, 59(12): 2616-2626.