<img src="https://www.zjujournals.com/eng/images/1008-973X/images/logo.png" class="img-responsive">

图 1 采用不同方法得到的合成图像分割结果对比

Fig.1 Comparison of synthetic image segmentation results by using different methods

图 2

图 2 不同方法在MSRA数据集上的分割结果对比

Fig.2 Comparison of segmentation results of different methods on MSRA dataset

图 3

图 3 采用不同方法得到的屏幕内容图像分割结果对比

Fig.3 Comparison of screen content image segmentation results by using different methods

如表1、2所示为不同方法的性能指标. 在合成图像数据（见表1）中，利用SDGLR方法取得了最好的性能，平均准确率为99.62%，平均召回率为84.93%，平均F1指数为91.69%，SD-GFT算法的性能略低于SDGLR方法. 利用LAD、SDTVM和SR方法取得了良好的性能，LRSD方法的P、R和平均F1指数都不大于80.0%. 在MSRA数据集（见表2）中，SDGLR取得了良好的性能.利用提出的SDGLR方法，取得了92.88%的平均准确率、79.88%的平均召回率和85.21%的平均F1指数，平均召回率比LRSD方法低约5.41%. 采用GFT基函数的线性组合有效地表示背景部分，减少将背景像素误检测为前景的情况. 这使得SD-GFT和SDGLR方法相对于LRSD、LAD、SDTVM和SR方法取得了更好的性能. 提出的SDGLR方法采用GLR正则化项来惩罚前景像素中的孤立点，强制前景像素相连，减少了前景文本和图形的不连续性. 这使得SDGLR方法相较于SD-GFT方法，取得了更好的性能. 总的来说，提出的SDGLR方法较对比方法取得了相对最好的性能，这得益于用于区分前景和背景成分的图模型和灵活的分割优化公式.

表 1 不同方法在合成图像上的分割性能对比

Tab.1 Comparison of segmentation performance of different methods on synthetic image %

方法	图1的第1张测试图			图1的第2张测试图				平均值
方法	P	R	F1	P	R	F1	P	R	F1
LRSD	73.11	71.49	72.29	84.57	83.98	84.27	78.84	77.74	78.28
LAD	80.83	82.08	81.45	83.13	79.64	81.35	81.98	80.86	81.40
SDTVM	85.99	82.63	84.28	88.33	75.04	81.14	87.16	82.63	82.71
SR	72.26	77.92	74.99	85.39	88.14	86.74	78.83	83.03	80.87
SD-GFT	98.33	82.97	90.00	92.12	86.69	89.33	95.83	84.83	83.24
SDGLR	99.45	84.34	91.28	99.78	85.52	92.10	99.62	84.93	91.69

新窗口打开| 下载CSV

表 2 不同方法在MSRA数据集上的分割性能对比

Tab.2 Comparison of segmentation performance of different methods on MSRA dataset %

方法	图2的第1张测试图			图2的第2张测试图			图2的第3张测试图			图2的第4张测试图				平均值
方法	P	R	F1	P	R	F1	P	R	F1	P	R	F1	P	R	F1
LRSD	75.56	77.96	76.74	81.47	84.50	82.96	84.44	83.77	84.10	88.13	94.93	91.40	82.40	85.29	83.80
LAD	50.99	68.14	58.33	84.65	77.17	80.74	58.45	58.09	58.27	62.50	61.15	61.82	64.15	66.14	64.79
SDTVM	56.09	82.27	66.63	83.26	72.76	77.65	41.64	52.91	46.61	42.93	72.81	54.01	55.98	70.19	61.23
SR	83.02	85.71	84.35	5.85	40.86	10.24	78.57	79.25	78.91	82.39	80.05	81.20	62.46	71.47	63.68
SD-GFT	89.42	86.81	88.10	94.77	87.53	91.01	85.34	80.09	82.63	94.04	57.33	71.23	90.89	77.94	83.24
SDGLR	91.53	89.78	90.64	94.79	87.53	91.01	91.13	84.71	87.80	94.05	57.50	71.37	92.88	79.88	85.21

新窗口打开| 下载CSV

3.4. 参数分析

以图1的第1张测试图像为例，对SDGLR方法中需要调整的参数进行分析讨论.

正则项参数$ \tau $的灵敏性分析. 在SDGLR方法中，采用参数$ \tau $来权衡背景模型参数$ {{\boldsymbol{\alpha }}_r} $和前景$ {{\boldsymbol{s}}_r} $. 如图4所示为当$ \tau $为[0, 1.0]时的F1指数. 可以看出，当$ \tau = 0.15 $时，该方法的分割性能最佳.

图 4

图 4 参数τ的灵敏度分析

Fig.4 Sensitivity analysis of parameter τ

正则项参数$ \gamma $的灵敏性分析. $ \gamma $为用于惩罚前景像素连通性的参数. 如图5所示为当$ \gamma $为[0, 1.0]时的分割效果. 可以看出，当$ \gamma = 0.5 $时，分割性能最佳.

图 5

图 5 参数γ的灵敏度分析

Fig.5 Sensitivity analysis of parameter γ

GFT基函数的数量$ M $. 为了评估$ M $对最终分割结果的影响，如图6所示为使用不同数量基函数导出的分割结果图. 可以看出，当$ M = 10 $时，分割效果最佳.

图 6

图 6 不同基函数数量下提出方法的分割结果

Fig.6 Segmentation results of proposed method with different number of basis functions

图像块大小$ l $和数量. 在HEVC^[34]标准中，最大编码单元（coding units, CU）通常被设定为$ 64 \times 64 $或$ 32 \times 32 $. 选取较小的图像块可以降低运算复杂度，但是图像质量受到限制. 较大的图像块可以提高图像质量，但需要增加计算量. CU为$ 64 \times 64 $的图像块能够在计算复杂度和图像质量间取得较好的平衡点. 因为HEVC是比较成熟的标准，选择CU为$ 64 \times 64 $的图像块，可以使研究工作与已有研究工作保持一致. 如图7所示为由不同$ l $导出的分割结果. 可见，当$ l = 64 $时，分割性能最佳. 图像块的数量由原图像大小和图像块大小$ l $决定，给定$ l $的情况，原图越大，图像块数量越多，程序运行时间越长.

图 7

图 7 不同图像块大小下提出方法的分割结果

Fig.7 Segmentation results of proposed method with different image block sizes

3.5. 收敛性分析

如图8所示为实验中F1与迭代次数N_i的关系. 从图8的F1变化趋势可以看出，在一定迭代次数后，指标的变化速度逐渐减缓，最终趋于稳定的常数值. 这表明所提出的方法在一定程度上已经收敛. 特别地，在保持较高F1的情况下，该方法能够在第12次迭代后达到收敛状态. 这说明所提方法的收敛效率较高，即在相对较少的迭代次数内取得较优的分割效果.

图 8

图 8 F1与迭代次数的关系

Fig.8 F1 value versus number iterations

4. 结　语

本文提出基于图信号处理和稀疏分解的图像前景背景分割方法，旨在将图像的前景与背景分离. 在SDGLR模型中，趋于平滑的背景区域可以通过GFT基函数的线性组合有效地表示. 在目标函数中添加GLR项来惩罚前景中的孤立点，以加强前景像素的连通性. 实验结果表明，利用提出的SDGLR方法，能够更好地刻画图像像素中的相关性，在视觉和定量评估方面取得了优异的效果. 在接下来的工作中，将基于图模型的图像前景背景分割方法扩展到其他场景，如视频的前景背景分割.

参考文献

原文顺序

文献年度倒序

文中引用次数倒序

被引期刊影响因子

[1]

MUKHERJEE D, CHRYSAFIS C, SAID A. JPEG2000-matched MRC compression of compound documents [C]// Proceedings. International Conference on Image Processing . Rochester: IEEE, 2002.

[2]

WANG G, LI W, ZULUAGA M A, et al

Interactive medical image segmentation using deep learning with image-specific fine tuning

[J]. IEEE Transactions on Medical Imaging, 2018, 37 (7): 1562- 1573

DOI:10.1109/TMI.2018.2791721 [本文引用: 1]

[3]

YIN X C, ZUO Z Y, TIAN S, et al

Text detection, tracking and recognition in video: a comprehensive survey

[J]. IEEE Transactions on Image Processing, 2016, 25 (6): 2752- 2773

DOI:10.1109/TIP.2016.2554321 [本文引用: 1]

[4]

LIN T, HAO P

Compound image compression for real-time computer screen image transmission

[J]. IEEE Transactions on Image Processing, 2005, 14 (8): 993- 1005

DOI:10.1109/TIP.2005.849776 [本文引用: 1]

[5]

BOTTOU L, HAFFNER P, HOWARD P G, et al

High quality document image compression with "DjVu"

[J]. Journal of Electronic Imaging, 1998, 7 (3): 410- 425

DOI:10.1117/1.482609 [本文引用: 1]

[6]

MINAEE S, WANG Y. Screen content image segmentation using least absolute deviation fitting [C]// IEEE International Conference on Image Processing . Quebec City: IEEE, 2015: 3295-3299.

[7]

MINAEE S, WANG Y. Screen content image segmentation using sparse decomposition and total variation minimization [C]// IEEE International Conference on Image Processing . Phoenix: IEEE, 2016: 3882-3886.

[本文引用: 4]

[8]

MINAEE S, WANG Y

An ADMM approach to masked signal decomposition using subspace representation

[J]. IEEE Transactions on Image Processing, 2019, 28 (7): 3192- 3204

DOI:10.1109/TIP.2019.2894966 [本文引用: 3]

[9]

HU W, PANG J, LIU X, et al

Graph signal processing for geometric data and beyond: theory and applications

[J]. IEEE Transactions on Multimedia, 2021, 24: 3961- 3977

[10]

ORTEGA A, FROSSARD P, KOVAČEVIĆ J, et al

Graph signal processing: overview, challenges, and applications

[J]. Proceedings of the IEEE, 2018, 106 (5): 808- 828

DOI:10.1109/JPROC.2018.2820126 [本文引用: 1]

[11]

SHUMAN D I, NARANG S K, FROSSARD P, et al

The emerging field of signal processing on graphs: extending high-dimensional data analysis to networks and other irregular domains

[J]. IEEE Signal Processing Magazine, 2013, 30 (3): 83- 98

DOI:10.1109/MSP.2012.2235192 [本文引用: 1]

[12]

ABIKO K, URUMA K, SUGAWARA M, et al. Image segmentation based graph-cut approach to fast color image coding via graph Fourier transform [C]// IEEE Visual Communications and Image Processing . Sydney: IEEE, 2019: 457-460.

[13]

BOGACH I V, LUPIAK D D, IVANOV Y Y, et al. Analysis and experimental research of modifications of the image segmentation method using graph theory [C]// International Siberian Conference on Control and Communications . Tomsk: IEEE, 2019: 490-493.

[14]

HU W, CHEUNG G, ORTEGA A, et al

Multiresolution graph Fourier transform for compression of piecewise smooth images

[J]. IEEE Transactions on Image Processing, 2015, 24 (1): 419- 433

DOI:10.1109/TIP.2014.2378055

[15]

PANG J, CHEUNG G

Graph Laplacian regularization for image denoising: analysis in the continuous domain

[J]. IEEE Transactions on Image Processing, 2017, 26 (4): 1770- 1785

DOI:10.1109/TIP.2017.2651400

[16]

刘娜, 李伟, 陶然

图信号处理在高光谱图像处理领域的典型应用

[J]. 电子与信息学报, 2023, 45 (5): 1529- 1540

LIU Na, LI Wei, TAO Ran

Typical application of graph signal processing in hyperspectral image processing

[J]. Journal of Electronics and Information Technology, 2023, 45 (5): 1529- 1540

[17]

DONG X, THANOU D, TONI L, et al

Graph signal processing for machine learning: a review and new perspectives

[J]. IEEE Signal Processing Magazine, 2020, 37 (6): 117- 127

DOI:10.1109/MSP.2020.3014591 [本文引用: 1]

[18]

CAI W, JIANG J, OUYANG S

Hyperspectral image denoising using adaptive weight graph total variation regularization and low-rank matrix recovery

[J]. IEEE Geoscience and Remote Sensing Letters, 2021, 19: 1- 5

[19]

ACHANTA R, HEMAMI S, ESTRADA F, et al. Frequency-tuned salient region detection [C]// IEEE Conference on Computer Vision and Pattern Recognition . Miami: IEEE, 2009: 1597-1604.

[20]

MIN X, MA K, GU K, et al

Unified blind quality assessment of compressed natural, graphic, and screen content images

[J]. IEEE Transactions on Image Processing, 2017, 26 (11): 5462- 5474

DOI:10.1109/TIP.2017.2735192 [本文引用: 1]

[21]

JIANG J, FENG H, TAY D B, et al

Theory and design of joint time-vertex nonsubsampled filter banks

[J]. IEEE Transactions on Signal Processing, 2021, 69: 1968- 1982

DOI:10.1109/TSP.2021.3064984 [本文引用: 1]

[22]

TAY D B, JIANG J

Time-varying graph signal denoising via median filters

[J]. IEEE Transactions on Circuits and Systems II: Express Briefs, 2021, 68 (3): 1053- 1057

[23]

SANDRYHAILA A, MOURA J M F

Big data analysis with signal processing on graphs: representation and processing of massive data sets with irregular structure

[J]. IEEE Signal Processing Magazine, 2014, 31 (5): 80- 90

DOI:10.1109/MSP.2014.2329213 [本文引用: 1]

[24]

THANOU D, FROSSARD P. Multi-graph learning of spectral graph dictionaries [C]// IEEE International Conference on Acoustics, Speech and Signal Processing . South Brisbane: IEEE, 2015: 3397-3401.

[25]

QIU K, MAO X, SHEN X, et al

Time-varying graph signal reconstruction

[J]. IEEE Journal of Selected Topics in Signal Processing, 2017, 11 (6): 870- 883

DOI:10.1109/JSTSP.2017.2726969 [本文引用: 1]

[26]

QI W, GUO S, HU W

Generic reversible visible watermarking via regularized graph Fourier transform coding

[J]. IEEE Transactions on Image Processing, 2021, 31: 691- 705

[27]

CHEUNG G, MAGLI E, TANAKA Y, et al

Graph spectral image processing

[J]. Proceedings of the IEEE, 2018, 106 (5): 907- 930

DOI:10.1109/JPROC.2018.2799702 [本文引用: 1]

[28]

LIU M, WEI Y. Image denoising using graph-based frequency domain low-pass filtering [C]// IEEE 4th International Conference on Image, Vision and Computing. Xiamen: IEEE, 2019: 118-122.

[29]

KE G Y, PAN Y, YIN J, et al

Optimizing evaluation metrics for multitask learning via the alternating direction method of multipliers

[J]. IEEE Transactions on Cybernetics, 2017, 48 (3): 993- 1006

[30]

孙菲, 厉小润, 赵辽英, 等

基于FrFT变换和全变分正则化的异常检测算法

[J]. 浙江大学学报:工学版, 2022, 56 (7): 1276- 1284

SUN Fei, LI Xiaorun, ZHAO Liaoying, et al

Anomaly detection algorithm based on FrFT transform and total variation regularization

[J]. Journal of Zhejiang University: Engineering Science, 2022, 56 (7): 1276- 1284

[31]

BOYD S, PARIKH N, CHU E, et al

Distributed optimization and statistical learning via the alternating direction method of multipliers

[J]. Foundations and Trends in Machine Learning, 2011, 3 (1): 1- 122