A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks

doi:10.3785/j.issn.1008-973X.2026.04.009

Journal of ZheJiang University (Engineering Science)

2026, Vol. 60

Issue (4): 772-781 DOI: 10.3785/j.issn.1008-973X.2026.04.009

A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks

Zhi CAI(

),Zhengdong ZHOU*(

),Xiaoxi YUAN,Zeyi YANG,Mengyao YUAN

State Key Laboratory of Mechanics and Control for Aerospace Structures, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China

Download:

HTML

PDF(2185KB) HTML
Export: BibTeX | EndNote (RIS)

Abstract

To improve the performance of panoramic segmentation of complex maxillofacial structures, a multi-scale feature extraction model (MC-UKAN) was proposed, which combined the Kolmogorov-Arnold representation theorem with the U-Net architecture. The model integrated position priors, learnable nonlinear activation functions, and multi-scale feature extraction to effectively enhance feature expression ability. Based on this model, a three-stage maxillofacial structure panoramic segmentation framework was designed. In the first stage, coarse segmentation and tooth position calibration were performed on low-resolution images using MC-UKAN. In the second stage, the target structures were localized using the coarse segmentation results on the original images, the K-means clustering algorithm was then applied to categorize these targets into five classes, and corresponding networks were employed for fine-grained segmentation. In the third stage, a lightweight network was used to segment the jawbones and pharynx on the original images. By fusing multi-scale segmentation results, precise segmentation of structures such as teeth, dental pulp, jawbones, and nerve canals was achieved. Experimental results on the ToothFairy3 dataset demonstrated that the proposed method achieved an 88.3% Dice coefficient and 5.04 mm HD95 for the segmentation of 74 categories of oral and maxillofacial structures, with an average inference time of 27.04 seconds. These results fully validated the superior performance of the proposed method in complex maxillofacial structure segmentation tasks.

Key words： oral segmentation cone beam computed tomography (CBCT) Kolmogorov-Arnold representation theorem ToothFairy3 dental alignment

Received: 08 August 2025 Published: 19 March 2026

CLC:

TP 393

Fund: 国家自然科学基金资助项目 (52375570); 中国航空研究院首批揭榜挂帅项目(F2021109); 上海航天科技创新基金资助项目(SAST2019-121); 南京航空航天大学研究生科研与实践创新计划项目(xcxjh20240111，xcxjh20240110).

Corresponding Authors: Zhengdong ZHOU E-mail: caizhi123@nuaa.edu.cn;zzd_msc@nuaa.edu.cn

	Service
	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Zhi CAI
	Zhengdong ZHOU
	Xiaoxi YUAN
	Zeyi YANG
	Mengyao YUAN

Cite this article:

Zhi CAI,Zhengdong ZHOU,Xiaoxi YUAN,Zeyi YANG,Mengyao YUAN. A panoramic segmentation method for maxillofacial structures based on KAN and U-Net networks. Journal of ZheJiang University (Engineering Science), 2026, 60(4): 772-781.

URL:

https://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2026.04.009 OR https://www.zjujournals.com/eng/Y2026/V60/I4/772

基于KAN和U-Net网络的颌面结构全景分割方法

为了提升复杂颌面结构的全景分割性能，提出结合Kolmogorov-Arnold表示定理与U-Net架构的多尺度特征提取模型（MC-UKAN）. 该模型通过融合位置先验、可学习非线性激活函数和多尺度特征提取，有效增强了特征表达能力. 基于该模型，设计三阶段颌面结构全景分割框架，第1阶段在低分辨率图像中利用MC-UKAN进行粗分割和牙位标定；第2阶段在原始图像上，基于粗分割结果定位目标结构，利用K-means聚类算法将目标划分为5类，采用对应网络进行精细分割；第3阶段使用轻量化网络在原始图像上分割出颌骨及咽喉. 通过融合多尺度分割结果，实现对牙齿、牙髓、颌骨、神经管等结构的精确分割. 在ToothFairy3数据集上的实验结果表明，该方法在74类口腔颌面结构的分割中达到了88.3%的Dice系数和5.04 mm的HD95，平均推理时间为27.04 s，展现出优越的性能.

关键词： 口腔分割, 锥形束计算机断层扫描（CBCT）, Kolmogorov-Arnold表示定理, ToothFairy3, 牙位标定

Fig.1 Three-stage maxillofacial segmentation framework

Fig.2 Visualization results of K-means and t-SNE for five types of structures

Fig.3 Architecture of MC-UKAN

Fig.4 MC-KAN convolution module architecture

Fig.5 Complete annotation sample of ToothFairy3

Fig.6 Preprocessed images

Tab.1 Comparison of Dice coefficients of different methods

Tab.2 Comparison of HD95 coefficients of different methods

Tab.3 Comparison of categorical mean Dice coefficients with dental alignment

Fig.7 Segmentation results of maxilla/mandible, pharynx, maxillary sinus, IAC, teeth and their dental pulps

Tab.4 Comparison of classification Dice coefficients of different networks of coarse segmentation module

Fig.8 Cross-sectional slices after second layer encoder with different convolution methods

Tab.5 Experimental results of local high-resolution fine segmentation module and large-scale structural segmentation module

Tab.6 Three-stage modular reasoning time

Tab.7 Ablation experiment results of PE3D module and MC-KAN convolutional layer

Tab.8 Comparison of segmentation experimental results of MC-UKAN models with different scales

Tab.9 Comparison of segmentation experimental results with different noises on Toothfairy3 test set


[1]	LOPES DOS SANTOS G, IKUTA C R S, SALZEDAS L M P, et al Canalis sinuosus: an anatomic repair that may prevent success of dental implants in anterior maxilla[J]. Journal of Prosthodontics, 2020, 29 (9): 751- 755 doi: 10.1111/jopr.13256

[2]	LIU Z, YANG D, ZHANG M, et al Inferior alveolar nerve canal segmentation on CBCT using U-Net with frequency attentions[J]. Bioengineering, 2024, 11 (4): 354 doi: 10.4274/meandros.galenos.2021.14632

[3]	JIANG B, ZHANG S, SHI M, et al Alternate level set evolutions with controlled switch for tooth segmentation[J]. IEEE Access, 2022, 10: 76563- 76572 doi: 10.1109/ACCESS.2022.3192411

[4]	SABANCı S, ŞENER E, TURHAL R I, et al Is manual segmentation the real gold standard for tooth segmentation? a preliminary in vivo study using cone-beam computed tomography images[J]. Meandros Medical and Dental Journal, 2021, 22 (3): 263- 273 doi: 10.4274/meandros.galenos.2021.14632

[5]	TRELENBERG-STOLL V, DRESCHER D, WOLF M, et al Automated tooth segmentation as an innovative tool to assess 3D-tooth movement and root resorption in rodents[J]. Head and Face Medicine, 2021, 17 (1): 3 doi: 10.1186/s13005-020-00254-y

[6]	POLIZZI A, QUINZI V, RONSIVALLE V, et al Tooth automatic segmentation from CBCT images: a systematic review[J]. Clinical Oral Investigations, 2023, 27 (7): 3363- 3378 doi: 10.1007/s00784-023-05048-5

[7]	ABESI F, HOZURI M, ZAMANI M Performance of artificial intelligence using cone-beam computed tomography for segmentation of oral and maxillofacial structures: a systematic review and meta-analysis[J]. Journal of Clinical and Experimental Dentistry, 2023, 15 (11): e954- e962 doi: 10.4317/jced.60287

[8]	CUI Z, LI C, WANG W. ToothNet: automatic tooth instance segmentation and identification from cone beam CT images [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 6361–6370.

[9]	LIU Y, XIE R, WANG L, et al Fully automatic AI segmentation of oral surgery-related tissues based on cone beam computed tomography images[J]. International Journal of Oral Science, 2024, 16: 34 doi: 10.1038/s41368-024-00294-z

[10]	WU Q, FENG B, LI W, et al Automatic segmentation and visualization of cortical and marrow bone in mandibular condyle on CBCT: a preliminary exploration of clinical application[J]. Oral Radiology, 2025, 41 (1): 88- 101 doi: 10.1007/s11282-024-00780-4

[11]	WEI X, WANG Y Inferior alveolar canal segmentation based on cone-beam computed tomography[J]. Medical Physics, 2021, 48 (11): 7074- 7088 doi: 10.1002/mp.15274

[12]	MORGAN N, VAN GERVEN A, SMOLDERS A, et al Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images[J]. Scientific Reports, 2022, 12: 7523 doi: 10.1038/s41598-022-11483-3

[13]	DAZA L, SCHNABEL J. DiENTeS: dynamic ENTity segmentation with local-global transformers [C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland, 2025: 21–29.

[14]	薄士仕, 高承志基于卷积神经网络实现锥形束CT牙齿分割及牙位标定[J]. 北京大学学报: 医学版, 2024, 56 (4): 735- 740 BO Shishi, GAO Chengzhi Tooth segmentation and identification on cone-beam computed tomography with convolutional neural network based on spatial embedding information[J]. Journal of Peking University: Health Sciences, 2024, 56 (4): 735- 740 doi: 10.19723/j.issn.1671-167X.2024.04.030

[15]	HU J, SHEN L, SUN G. Squeeze-and-excitation networks [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132–7141.

[16]	LIU Z, WANG Y, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks [EB/OL]. (2025−02−09) [2025−10−13]. https://arxiv.org/pdf/2404.19756.

[17]	IMAK A, ÇELEBI A, POLAT O, et al ResMIBCU-Net: an encoder–decoder network with residual blocks, modified inverted residual block, and bi-directional ConvLSTM for impacted tooth segmentation in panoramic X-ray images[J]. Oral Radiology, 2023, 39 (4): 614- 628 doi: 10.1007/s11282-023-00677-8

[18]	LUMETTI L, PIPOLI V, BOLELLI F, et al Enhancing patch-based learning for the segmentation of the mandibular canal[J]. IEEE Access, 2024, 12: 79014- 79024 doi: 10.1109/ACCESS.2024.3408629

[19]	BOLELLI F, LUMETTI L, VINAYAHALINGAM S, et al Segmenting the inferior alveolar canal in CBCTs volumes: the ToothFairy challenge[J]. IEEE Transactions on Medical Imaging, 2025, 44 (4): 1890- 1906 doi: 10.1109/TMI.2024.3523096

[20]	BOLELLI F, MARCHESINI K, VAN NISTELROOIJ N, et al. Segmenting maxillofacial structures in CBCT volumes [C]// IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2025: 5238–5248.

[21]	WODZINSKI M, MÜLLER H. Automatic multi-structure segmentation in cone beam computed tomography volumes using deep encoder-decoder architectures [C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland, 2025: 63–71.

[22]	LI C, LIU X, LI W, et al U-KAN makes strong backbone for medical image segmentation and generation[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39 (5): 4652- 4660 doi: 10.1609/aaai.v39i5.32491

[23]	ZHONG J, TIAN W, XIE Y, et al PMFSNet: polarized multi-scale feature self-attention network for lightweight medical image segmentation[J]. Computer Methods and Programs in Biomedicine, 2025, 261: 108611 doi: 10.1016/j.cmpb.2025.108611

[24]	LIU Y, CASTRO M, LEDERLIN M, et al An improved nonlinear diffusion in Laplacian pyramid domain for cone beam CT denoising during image-guided vascular intervention[J]. BMC Medical Imaging, 2018, 18 (1): 25 doi: 10.1186/s12880-018-0269-1

[25]	ZHU L, WANG J, XING L Noise suppression in scatter correction for cone-beam CT[J]. Medical Physics, 2009, 36 (3): 741- 752 doi: 10.1118/1.3063001

[1]	Siru CHEN,Yuanchao SHU. Survey on edge deployment and inference acceleration of multimodal large language models[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(4): 723-737.

[2]	Xiaoyan KUI,Min ZHANG,Ling XIAO,Qinsong LI,Liming CHEN,Wensheng ZHANG,Beiji ZOU. Systematic classification and performance analysis of data deduplication and reduction techniques[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(2): 287-302.

[3]	Yanle WANG,Ruifeng ZHANG,Qiang LI. Graph neural network recommendation model integrating global information and contrastive learning[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(2): 351-359.

[4]	Wenqiang CHEN,Linyue FENG,Dongdan WANG,Yulei GU,Xuan ZHAO. Vehicle trajectory prediction model integrating dynamic risk map and multivariate attention mechanism[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 455-467.

[5]	Juntao LV,Jueyu QI,Haochen YU,Lei MA,Huimin MA,Tianyu HU. Current status and future prospect of integrated simulation platform for autonomous driving[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 513-526.

[6]	Xiaoan BAO,Enlin CHEN,Na ZHANG,Xiaomei TU,Biao WU,Qingqi ZHANG. 3D human pose estimation based on multi-scale encoder fusion[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 565-573.

[7]	Zhihui LI,Kun DENG,Congyuan XU. Method for detecting unknown IoT attack based on relational embedding[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(3): 624-632.

[8]	Yue WU,Zheng LIANG,Wei GAO,Maoda YANG,Peisen ZHAO,Hongxia DENG,Yuanyuan CHANG. Multi-modal gait recognition based on SMPL model decomposition and embedding fusion[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 52-60.

[9]	Huhang CHEN,Quan LV,Zihang SU,Junqiao ZHANG,Zhu CHEN,Xu HAN. Operating mode selection model for heating unit considering capacity charge[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 169-178.

[10]	Hao HE,Yongdong SHU,Yonggang LIN,Fuquan DAI,Ju ZHANG. hydrodynamic and hydraulic-control simulation of high-power ship azimuth thruster under variable load[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 191-198.

[11]	Yiming YU,Wei CAI,Yi LI,Wei FU,Xu YAO,Tingyi ZHANG,Shangqi DIAO,Dan LI,Songqing LIN,Yongshun CHEN. Review of environmental heavy metal detection based on microelectrode arrays[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 217-230.

[12]	Linghao ZHANG,Haibo TAN,He ZHAO,Zhong CHEN,Haotian CHENG,Zhiyu MA. CompuDEX: blockchain-based large model fine-tuning compute-power sharing platform[J]. Journal of ZheJiang University (Engineering Science), 2026, 60(1): 1-18.

[13]	Yanpu YANG,Zhihong WU,Wenhao MENG,Yueming ZHUO,Jialing LIU. Upper-limb muscle fatigue assessment in overhead work based on Hammerstein model and surface electromyography[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(12): 2483-2494.

[14]	Yuxuan LIU,Yizhi LIU,Zhuhua LIAO,Zhengbiao ZOU,Jingxin TANG. Adaptive graph attention Transformer for dynamic traffic flow prediction[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(12): 2585-2592.

[15]	Xiongxiong ZHOU,Qiujiang HE,Jichun HE,Jing ZHOU. Deformation decoupling and parameter inversion for high core wall dams considering mechanical causes[J]. Journal of ZheJiang University (Engineering Science), 2025, 59(12): 2616-2626.

Viewed

Full text

Abstract

Cited

Shared

Discussed