Nonparametric RGB-D scene parsing based on Markov random field model

doi:10.3785/j.issn.1008-973X.2016.07.014

JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE)

Computer Technology, Information Engineering

Nonparametric RGB-D scene parsing based on Markov random field model

FEI Ting ting,GONG Xiao jin

Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China

Download:

PDF(2801KB) HTML
Export: BibTeX | EndNote (RIS)

Abstract

An effective nonparametric method was proposed for RGB-D scene parsing. The method is based upon the label transferring scheme, which includes label pool construction, bi-directional superpixel matching -nd label transferring stages. Compared to traditional parametric RGB-D scene parsing methods, the approach requires no tedious training stage, which makes it simple and efficient. In contrast to previous nonparametric techniques, our method not only incorporate geometric contexts at all the stages, but also propose a bi-directional scheme for superpixel matching in order to reduce mismatching. Then a collaborative representation based classification (CRC) mechanism was built for Markov random field (MRF), and parsing result was achieved through minimizing the energy function via Graph Cuts. The effectiveness of the approach was validated both on the indoor NYU Depth V1 dataset and the outdoor KITTI dataset. The approach outperformed both state-of-the-art RGB-D parsing techniques and a classical nonparametric superparsing method. The algorithm can be applied to different scenarios, having a strong practical value．

Published: 23 July 2016

CLC:

TP 391

	Service
	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors

Cite this article:

FEI Ting ting,GONG Xiao jin. Nonparametric RGB-D scene parsing based on Markov random field model. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2016, 50(7): 1322-1329.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2016.07.014 OR http://www.zjujournals.com/eng/Y2016/V50/I7/1322

基于马尔科夫随机场的非参数化RGB-D场景理解

针对RGB-D场景下的场景理解问题,提出高效的基于标签传递机制的非参数化场景理解算法.该算法主要分为标签源构建、超像素双向匹配和标签传递三个步骤.与传统的参数化RGB-D场景理解方法相比,该算法不需要繁琐的训练,具有简单高效的特点.与传统的非参数化场景理解方法不同,该算法在系统的各个设计环节都有效利用了深度图提供的三维信息,在超像素匹配环节提出双向匹配机制,以减少特征误匹配；构建基于协同表示分类（CRC）的马尔科夫随机场（MRF）,用Graph Cuts方法求出最优解,获得场景图像每个像素的语义标签.该算法分别在室内的NYU-V1数据集和室外的KITTI数据集上进行实验.实验结果表明,与现有算法相比,该算法取得了显著的性能提升, 对室内、外场景均适用.

［1］ Velodyne. Velodyne hdl64e ［EB/OL］. \[20140610\]. http:∥velodynelidar.com/lidar/．
［2］ Kinect. Microsoft kinect ［EB/OL］. \[20140610\].http:∥www.microsoft.com/enus/kinectforwindows/develop/learn.aspx．
［3］闫飞, 庄严, 王伟. 移动机器人基于多传感器信息融合的室外场景理解［J］. 控制理论与应用, 2011, 28(8):1093-1098.
YAN Fei, ZHUANG Yan, WANG Wei. Outdoor scene comprehension of mobile robot based on multisensor information fusion ［J］. Control Theory and Applications, 2011, 28(8):1093-1098．
［4］谭伦正, 夏利民, 夏胜平. 基于多级Sigmoid神经网络的城市交通场景理解［J］. 国防科技大学学报, 2012, 34(4): 1001-2486.
TAN Lunzheng, XIA Limin, XIA Shengping. Urban traffic scene understanding based on multilevel sigmoidal neural network ［J］. Journal of National University of Defense Technology, 2012, 34(4): 1001-2486．
［5］ SILBERMAN N, FERGUS R. Indoor scene segmentation using a structured light sensor ［C］∥Proceedings of ICCV. Barcelona: IEEE, 2011: 601-608．
［6］ REN Xiaofeng, BO Liefeng, FOX D. RGB(D) scene labeling: features and algorithms ［C］∥Proceedings of CVPR. Providence: IEEE, 2012: 2759-2766．
［7］ TORRALBA A, FERGUS R, FREEMAN W T. 80 million tiny images: a large dataset for nonparametric object and scene recognition ［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(11): 1958-1970．
［8］ SHOTTON J, WINN J, ROTHER C, et al. Textonboost for image understanding: multiclass object recognition and segmentation by jointly modeling texture, layout, and context ［J］. International Journal of Computer Vision, 2009, 81(1): 223．
［9］ FARABET C, COUPRIE C, NAJMAN L, et al. Learning hierarchical features for scene labeling ［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(8): 1915-1929．
［10］ STURGESS P, ALAHARI K, LADICKY L, et al. Combining appearance and structure from motion features for road scene understanding ［C］ ∥ Proceedings of BMVC. London: BMVA, 2009．
［11］ LIU Ce, YUEN J, TORRALBA A. Nonparametric scene parsing: label transfer via dense scene alignment ［C］∥ Proceedings of CVPR. Miami: IEEE, 2009: 1972-1979．
［12］ TIGHE J, LAZEBNIK S. Superparsing: scalable nonparametric image parsing with superpixels ［C］ ∥ Proceedings of ECCV. Heraklion: Springer, 2010: 352-365．
［13］ YANG J, PRICE B, COHEN S, et al. Context driven scene parsing with attention to rare classes ［C］ ∥ Proceedings of CVPR. Columbus: IEEE, 2014．
［14］ EIGEN D, FERGUS R. Nonparametric image parsing using adaptive neighbor sets ［C］ ∥ Proceedings of CVPR. Providence: IEEE, 2012: 2799-2806．
［15］ ZHANG Lie, YANG Meng, FENG Xiangchu. Sparse representation or collaborative representation: which helps face recognition？［C］ ∥ Proceedings of ICCV. Barcelona: IEEE, 2011: 471-478．
［16］ BOYKOV Y, VEKSLER O, ZABIH R. Fast approximate energy minimization via graph cuts ［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(11): 1222-1239．
［17］ OLIVA A, TORRALBA A. Building the gist of a scene: the role of global image features in recognition ［J］. Progress In Brain Research, 2006, 155: 23-36．
［18］ LEVINSHTEIN A, STERE A, KUTULAKOS N K, et al. Turbopixels: fast superpixels using geometric flows ［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(12): 2290-2297．
［19］ BO Liefeng, REN Xiaofeng, FOX D. Kernel descriptors for visual recognition ［C］ ∥ NIPS. Vancouver: Neural Information Processing Systems Foundation, 2010: 244-252．
［20］ GEIGER A, LENZ P, URTASUM R. Are we ready for autonomous driving？ the KITTI vision benchmark suite ［C］ ∥ Proceedings of CVPR. Providence: IEEE, 2012: 3354-3361．

[1]	HE Xue-jun, WANG Jin, LU Guo-dong, LIU Zhen-yu, CHEN Li, JIN Jing. 3D head portrait sculpture by industrial robot based on triangular mesh slicing and collision detection[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(6): 1104-1110.

[2]	WANG Hua, HAN Tong-yang, ZHOU Ke. KeyGraph-based community detection algorithm for public security intelligence[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(6): 1173-1180.

[3]	YOU Hai-hui, MA Zeng-yi, TANG Yi-jun, WANG Yue-lan, ZHENG Lin, YU Zhong, JI Cheng-jun. Soft measurement of heating value of burning municipal solid waste for circulating fluidized bed[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(6): 1163-1172.

[4]	BI Xiao-jun, WANG Jia-hui. Teaching-learning-based optimization algorithm with hybrid learning strategy[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(5): 1024-1031.

[5]	MU Jing-jing, ZHAO Xin-yue, HE Zai-xing, ZHANG Shu-you. Contour reconstruction of overlapped bubbles based on concave-convex transformation and circle fitting[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 714-721.

[6]	HUANG Zheng-yu, JIANG Xin-long, LIU Jun-fa, CHEN Yi-qiang, GU Yang. Fusion feature based semi-supervised manifold localization method[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 655-662.

[7]	JIANG Xin-long, CHEN Yi-qiang, LIU Jun-fa, HU Li-sha, SHEN Jian-fei. Wearable system to support proximity awareness for people with autism[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 637-647.

[8]	WANG Liang, YU Zhi-wen, GUO Bin. Moving trajectory prediction model based on double layer multi-granularity knowledge discovery[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 669-674.

[9]	LIAO Miao, ZHAO Yu-qian, ZENG Ye-zhan, HUANG Zhong-chao, ZHANG Bing-kui, ZOU Bei-ji. Automatic segmentation for cell images based on support vector machine and ellipse fitting[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(4): 722-728.

[10]	DAI Cai-yan, CHEN Ling, LI Bin, CHEN Bo-lun. Sampling-based link prediction in complex networks[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(3): 554-561.

[11]	LIU Lei, YANG Peng, LIU Zuo-jun. Locomotion-Mode recognition using multiple kernel relevance vector machine[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(3): 562-571.

[12]	GUO Meng-li, DA Fei-peng, DENG Xing, GAI Shao-yan. 3D face recognition based on keypoints and local feature[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(3): 584-589.

[13]	WANG Hai jun, GE Hong juan, ZHANG Sheng yan. Fast object tracking algorithm via kernel collaborative presentation[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(2): 399-407.

[14]	ZHANG Ya nan, CHEN De yun, WANG Ying jie, LIU Yu peng. Incremental graph pattern matching based dynamic recommendation method for cold-start user[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(2): 408-415.

[15]	LIU Yu peng, QIAO Xiu ming, ZHAO Shi lei, MA Chun guang. Deep combination of large-scale features in statistical machine translation[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(1): 46-56.

Viewed

Full text

Abstract

Cited

Shared

Discussed