基于KAN和U-Net网络的颌面结构全景分割方法

doi:10.3785/j.issn.1008-973X.2026.04.009

浙江大学学报(工学版)

2026, Vol. 60

Issue (4): 772-781 DOI: 10.3785/j.issn.1008-973X.2026.04.009

计算机技术

基于KAN和U-Net网络的颌面结构全景分割方法

蔡智(

),周正东*(

),袁晓曦,杨泽毅,袁梦瑶

南京航空航天大学航空航天结构力学及控制全国重点实验室，江苏南京 210016

A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks

Zhi CAI(

),Zhengdong ZHOU*(

),Xiaoxi YUAN,Zeyi YANG,Mengyao YUAN

State Key Laboratory of Mechanics and Control for Aerospace Structures, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China

全文: PDF(2185 KB) HTML

摘要：

为了提升复杂颌面结构的全景分割性能，提出结合Kolmogorov-Arnold表示定理与U-Net架构的多尺度特征提取模型（MC-UKAN）. 该模型通过融合位置先验、可学习非线性激活函数和多尺度特征提取，有效增强了特征表达能力. 基于该模型，设计三阶段颌面结构全景分割框架，第1阶段在低分辨率图像中利用MC-UKAN进行粗分割和牙位标定；第2阶段在原始图像上，基于粗分割结果定位目标结构，利用K-means聚类算法将目标划分为5类，采用对应网络进行精细分割；第3阶段使用轻量化网络在原始图像上分割出颌骨及咽喉. 通过融合多尺度分割结果，实现对牙齿、牙髓、颌骨、神经管等结构的精确分割. 在ToothFairy3数据集上的实验结果表明，该方法在74类口腔颌面结构的分割中达到了88.3%的Dice系数和5.04 mm的HD95，平均推理时间为27.04 s，展现出优越的性能.

关键词： 口腔分割; 锥形束计算机断层扫描（CBCT）; Kolmogorov-Arnold表示定理; ToothFairy3; 牙位标定

Abstract:

To improve the performance of panoramic segmentation of complex maxillofacial structures, a multi-scale feature extraction model (MC-UKAN) was proposed, which combined the Kolmogorov-Arnold representation theorem with the U-Net architecture. The model integrated position priors, learnable nonlinear activation functions, and multi-scale feature extraction to effectively enhance feature expression ability. Based on this model, a three-stage maxillofacial structure panoramic segmentation framework was designed. In the first stage, coarse segmentation and tooth position calibration were performed on low-resolution images using MC-UKAN. In the second stage, the target structures were localized using the coarse segmentation results on the original images, the K-means clustering algorithm was then applied to categorize these targets into five classes, and corresponding networks were employed for fine-grained segmentation. In the third stage, a lightweight network was used to segment the jawbones and pharynx on the original images. By fusing multi-scale segmentation results, precise segmentation of structures such as teeth, dental pulp, jawbones, and nerve canals was achieved. Experimental results on the ToothFairy3 dataset demonstrated that the proposed method achieved an 88.3% Dice coefficient and 5.04 mm HD95 for the segmentation of 74 categories of oral and maxillofacial structures, with an average inference time of 27.04 seconds. These results fully validated the superior performance of the proposed method in complex maxillofacial structure segmentation tasks.

Key words: oral segmentation cone beam computed tomography (CBCT) Kolmogorov-Arnold representation theorem ToothFairy3 dental alignment

收稿日期: 2025-08-08 出版日期: 2026-03-19

CLC:

TP 393

基金资助: 国家自然科学基金资助项目 (52375570); 中国航空研究院首批揭榜挂帅项目(F2021109); 上海航天科技创新基金资助项目(SAST2019-121); 南京航空航天大学研究生科研与实践创新计划项目(xcxjh20240111，xcxjh20240110).

通讯作者: 周正东 E-mail: caizhi123@nuaa.edu.cn;zzd_msc@nuaa.edu.cn

作者简介: 蔡智（1998—），男，硕士生，从事医学图像处理研究. orcid.org/0009-0002-2608-4030. E-mail：caizhi123@nuaa.edu.cn

	服务
	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	作者相关文章
	蔡智
	周正东
	袁晓曦
	杨泽毅
	袁梦瑶

引用本文:

蔡智,周正东,袁晓曦,杨泽毅,袁梦瑶. 基于KAN和U-Net网络的颌面结构全景分割方法[J]. 浙江大学学报(工学版), 2026, 60(4): 772-781.

Zhi CAI,Zhengdong ZHOU,Xiaoxi YUAN,Zeyi YANG,Mengyao YUAN. A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks. Journal of ZheJiang University (Engineering Science), 2026, 60(4): 772-781.

链接本文:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.04.009 或 https://www.zjujournals.com/eng/CN/Y2026/V60/I4/772

图 1 三阶段颌面分割框架

图 2 5类结构的 K-means 与 t-SNE 可视化结果图

图 3 MC-UKAN架构

图 4 MC-KAN卷积模块结构

图 5 ToothFairy3的完整注释样本

图 6 预处理图像

表 1 不同的方法Dice系数比较

表 2 不同的方法HD95系数比较

表 3 带牙位标定的分类平均Dice系数比较

图 7 上/下颌骨、咽喉、上颌窦、IAC与牙齿及其牙髓的分割结果

表 4 粗分割模块不同网络的分类Dice系数比较

图 8 不同卷积方式下第2层编码器后横截面切片

表 5 局部高分辨细分割模块与大尺度结构分割模块的实验结果

表 6 三阶段模块推理时间

表 7 PE3D模块与MC-KAN卷积层的消融实验结果

表 8 不同规模MC-UKAN模型的分割实验结果对比

表 9 ToothFairy3 测试集上添加不同噪声的分割实验结果对比

1	LOPES DOS SANTOS G, IKUTA C R S, SALZEDAS L M P, et al Canalis sinuosus: an anatomic repair that may prevent success of dental implants in anterior maxilla[J]. Journal of Prosthodontics, 2020, 29 (9): 751- 755 doi: 10.1111/jopr.13256
2	LIU Z, YANG D, ZHANG M, et al Inferior alveolar nerve canal segmentation on CBCT using U-Net with frequency attentions[J]. Bioengineering, 2024, 11 (4): 354 doi: 10.4274/meandros.galenos.2021.14632
3	JIANG B, ZHANG S, SHI M, et al Alternate level set evolutions with controlled switch for tooth segmentation[J]. IEEE Access, 2022, 10: 76563- 76572 doi: 10.1109/ACCESS.2022.3192411
4	SABANCı S, ŞENER E, TURHAL R I, et al Is manual segmentation the real gold standard for tooth segmentation? a preliminary in vivo study using cone-beam computed tomography images[J]. Meandros Medical and Dental Journal, 2021, 22 (3): 263- 273 doi: 10.4274/meandros.galenos.2021.14632
5	TRELENBERG-STOLL V, DRESCHER D, WOLF M, et al Automated tooth segmentation as an innovative tool to assess 3D-tooth movement and root resorption in rodents[J]. Head and Face Medicine, 2021, 17 (1): 3 doi: 10.1186/s13005-020-00254-y
6	POLIZZI A, QUINZI V, RONSIVALLE V, et al Tooth automatic segmentation from CBCT images: a systematic review[J]. Clinical Oral Investigations, 2023, 27 (7): 3363- 3378 doi: 10.1007/s00784-023-05048-5
7	ABESI F, HOZURI M, ZAMANI M Performance of artificial intelligence using cone-beam computed tomography for segmentation of oral and maxillofacial structures: a systematic review and meta-analysis[J]. Journal of Clinical and Experimental Dentistry, 2023, 15 (11): e954- e962 doi: 10.4317/jced.60287
8	CUI Z, LI C, WANG W. ToothNet: automatic tooth instance segmentation and identification from cone beam CT images [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 6361–6370.
9	LIU Y, XIE R, WANG L, et al Fully automatic AI segmentation of oral surgery-related tissues based on cone beam computed tomography images[J]. International Journal of Oral Science, 2024, 16: 34 doi: 10.1038/s41368-024-00294-z
10	WU Q, FENG B, LI W, et al Automatic segmentation and visualization of cortical and marrow bone in mandibular condyle on CBCT: a preliminary exploration of clinical application[J]. Oral Radiology, 2025, 41 (1): 88- 101 doi: 10.1007/s11282-024-00780-4
11	WEI X, WANG Y Inferior alveolar canal segmentation based on cone-beam computed tomography[J]. Medical Physics, 2021, 48 (11): 7074- 7088 doi: 10.1002/mp.15274
12	MORGAN N, VAN GERVEN A, SMOLDERS A, et al Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images[J]. Scientific Reports, 2022, 12: 7523 doi: 10.1038/s41598-022-11483-3
13	DAZA L, SCHNABEL J. DiENTeS: dynamic ENTity segmentation with local-global transformers [C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland, 2025: 21–29.
14	薄士仕, 高承志基于卷积神经网络实现锥形束CT牙齿分割及牙位标定[J]. 北京大学学报: 医学版, 2024, 56 (4): 735- 740 BO Shishi, GAO Chengzhi Tooth segmentation and identification on cone-beam computed tomography with convolutional neural network based on spatial embedding information[J]. Journal of Peking University: Health Sciences, 2024, 56 (4): 735- 740 doi: 10.19723/j.issn.1671-167X.2024.04.030
15	HU J, SHEN L, SUN G. Squeeze-and-excitation networks [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132–7141.
16	LIU Z, WANG Y, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks [EB/OL]. (2025−02−09) [2025−10−13]. https://arxiv.org/pdf/2404.19756.
17	IMAK A, ÇELEBI A, POLAT O, et al ResMIBCU-Net: an encoder–decoder network with residual blocks, modified inverted residual block, and bi-directional ConvLSTM for impacted tooth segmentation in panoramic X-ray images[J]. Oral Radiology, 2023, 39 (4): 614- 628 doi: 10.1007/s11282-023-00677-8
18	LUMETTI L, PIPOLI V, BOLELLI F, et al Enhancing patch-based learning for the segmentation of the mandibular canal[J]. IEEE Access, 2024, 12: 79014- 79024 doi: 10.1109/ACCESS.2024.3408629
19	BOLELLI F, LUMETTI L, VINAYAHALINGAM S, et al Segmenting the inferior alveolar canal in CBCTs volumes: the ToothFairy challenge[J]. IEEE Transactions on Medical Imaging, 2025, 44 (4): 1890- 1906 doi: 10.1109/TMI.2024.3523096
20	BOLELLI F, MARCHESINI K, VAN NISTELROOIJ N, et al. Segmenting maxillofacial structures in CBCT volumes [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2025: 5238–5248.
21	WODZINSKI M, MÜLLER H. Automatic multi-structure segmentation in cone beam computed tomography volumes using deep encoder-decoder architectures [C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland, 2025: 63–71.
22	LI C, LIU X, LI W, et al U-KAN makes strong backbone for medical image segmentation and generation[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39 (5): 4652- 4660 doi: 10.1609/aaai.v39i5.32491
23	ZHONG J, TIAN W, XIE Y, et al PMFSNet: polarized multi-scale feature self-attention network for lightweight medical image segmentation[J]. Computer Methods and Programs in Biomedicine, 2025, 261: 108611 doi: 10.1016/j.cmpb.2025.108611
24	LIU Y, CASTRO M, LEDERLIN M, et al An improved nonlinear diffusion in Laplacian pyramid domain for cone beam CT denoising during image-guided vascular intervention[J]. BMC Medical Imaging, 2018, 18 (1): 25 doi: 10.1186/s12880-018-0269-1
25	ZHU L, WANG J, XING L Noise suppression in scatter correction for cone-beam CT[J]. Medical Physics, 2009, 36 (3): 741- 752 doi: 10.1118/1.3063001

[1]	陈思如,舒元超. 多模态大模型边缘部署与推理加速技术综述[J]. 浙江大学学报(工学版), 2026, 60(4): 723-737.
[2]	奎晓燕,张敏,肖伶,李钦松,陈立明,张文生,邹北骥. 数据去重与缩减技术的系统分类与性能分析[J]. 浙江大学学报(工学版), 2026, 60(2): 287-302.
[3]	王彦乐,张瑞峰,李锵. 融合全局信息和对比学习的图神经网络推荐模型[J]. 浙江大学学报(工学版), 2026, 60(2): 351-359.
[4]	陈文强,冯琳越,王东丹,顾玉磊,赵轩. 融合动态风险图与多变量注意力机制的车辆轨迹预测模型[J]. 浙江大学学报(工学版), 2026, 60(3): 455-467.
[5]	吕君陶,祁珏瑜,于淏辰,马雷,马惠敏,胡天宇. 自动驾驶综合仿真平台的现状与展望[J]. 浙江大学学报(工学版), 2026, 60(3): 513-526.
[6]	包晓安,陈恩琳,张娜,涂小妹,吴彪,张庆琪. 基于多尺度编码器融合的三维人体姿态估计算法[J]. 浙江大学学报(工学版), 2026, 60(3): 565-573.
[7]	李智慧,邓琨,许聪源. 基于关系嵌入的物联网未知攻击检测方法[J]. 浙江大学学报(工学版), 2026, 60(3): 624-632.
[8]	吴越,梁铮,高巍,杨茂达,赵培森,邓红霞,常媛媛. 基于SMPL模态分解与嵌入融合的多模态步态识别[J]. 浙江大学学报(工学版), 2026, 60(1): 52-60.
[9]	陈沪航,吕泉,苏子航,张君樵,陈筑,韩旭. 计及容量电费的供热机组运行模式选择模型[J]. 浙江大学学报(工学版), 2026, 60(1): 169-178.
[10]	何浩,舒永东,林勇刚,代富全,张举. 变负载下大功率船舶全回转推进器水动及液控仿真[J]. 浙江大学学报(工学版), 2026, 60(1): 191-198.
[11]	于翼铭,蔡巍,李毅,付玮,姚旭,张停毅,刁尚祺,李丹,林松清,陈永顺. 基于微电极阵列的环境重金属检测研究进展[J]. 浙江大学学报(工学版), 2026, 60(1): 217-230.
[12]	张凌浩,谭海波,赵赫,陈中,程昊天,马志宇. CompuDEX：基于区块链的大模型微调算力共享平台[J]. 浙江大学学报(工学版), 2026, 60(1): 1-18.
[13]	杨延璞,伍智泓,孟文昊,卓玥鸣,刘嘉玲. 基于Hammerstein模型和表面肌电的过头作业上肢肌肉疲劳评估[J]. 浙江大学学报(工学版), 2025, 59(12): 2483-2494.
[14]	刘宇轩,刘毅志,廖祝华,邹正标,汤璟昕. 面向动态交通流量预测的自适应图注意Transformer[J]. 浙江大学学报(工学版), 2025, 59(12): 2585-2592.
[15]	周雄雄,何秋江,何纪春,周璟. 考虑力学成因的高心墙坝变形解耦及参数反演[J]. 浙江大学学报(工学版), 2025, 59(12): 2616-2626.

Viewed

Full text

Abstract

Cited

Shared

Discussed