Please wait a minute...
JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE)
Telecommunication Technolgy     
Incremental large scale dense semantic mapping
JIANG Wen ting1,2, GONG Xiao jin1,2, LIU Ji lin1,2
1. Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China;2. Zhejiang Provincial Key Laboratory of Information Network Technology, Hangzhou 310027, China
Download:   PDF(2045KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

In order to efficiently achieve accurate large scale scene understanding result, A new large scale dense semantic mapping system was proposed. The system constructed a map by incrementally calculating with a conditional random field model. The method used stereo visual odometry to get the motion of the camera, and used the labeled image sequences to build semantic map. The key point was to incrementally build the semantic map which detected newly built voxels, over segment the points within these voxels into supervoxels, labeled these supervoxels under the guidance of neighboring frames and used the rigid transformation matrix to fuse the newly labeled points with the already built map. A conditional random field model was constructed which took labeling results of sequential frames as the data term, took the coherent labeling constraint between neighboring supervoxels as the pairwise term and solved the model by graph cut. Experimental evaluations show that the approach can get an accurate large scale semantic map and decrease computational cost, The approach can improve the labeling results at image level.



Published: 01 February 2016
CLC:  TP 242.6  
Cite this article:

JIANG Wen ting, GONG Xiao jin, LIU Ji lin. Incremental large scale dense semantic mapping. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2016, 50(2): 385-391.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2016.02.026     OR     http://www.zjujournals.com/eng/Y2016/V50/I2/385


基于增量计算的大规模场景致密语义地图构建

为了准确而高效地进行大规模场景理解,提出基于增量计算的条件随机场下的大规模场景致密语义地图构建方法.该方法利用双目视觉估算相机运动轨迹,根据图像序列语义标注结果构建语义地图.递增的语义地图的构建过程是关键,需要检测致密化处理后的输入帧相较于前一帧的新增体素,对新增体素内部三维点过分割成超体素,利用前后多帧的标注结果指导超体素的标注,如此逐帧地将新增体素融合到语义地图中.该方法将时序上的先验信息作为条件随机场中的数据项,依据超体素的邻接关系定义平滑项,利用图割法求解新增超体素的标签.实验表明,该方法能够获取准确的大规模语义地图,有效减少对冗余点的处理,改善图像上的标注结果.

[1] BAILEY T, DURRANT WHYTE H. Simultaneous localization and mapping(SLAM):Part II [J]. Robotics &Automation Magazine, 2006, 13(3): 108-117.
[2] SENGUPTA S, GREVESON E, SHAHROKNI A, et al. Urban 3dsemantic modelling using stereo vision [C]∥ Computer Vision ICRA. Karlsruhe: IEEE, 2013: 580-585.
[3] 谭光华. 三维几何模型的形状编辑技术研究[D].杭州:浙江大学, 2009: 3-17.
TAN Guang hua. Studies on shape editing techniquesof3d geometric models. Hangzhou: Zhejiang University, 2009: 3-17.
[4] HE Hu, UPCROFT B. Nonparametric semantic segmentation for 3dstreet scenes [C]∥IEEE IROS. Tokyo: IEEE,2013: 3697-3703.
[5] KUNDU A, LI Y, DELLAERT F, LI F, et al. Joint semantic segmentation and 3d reconstruction from monocular video [C]∥Computer Vision ECCV. Zurich: Springer, 2014: 703-718.
[6] PAPON J, ABRAMOV A, SCHOELER M, WORGOTTER F. Voxel cloud connectivity segmentation supervoxels for point clouds [C]∥IEEE CVPR. Portland, OR: IEEE, 2013: 2027-2034.
[7] BOYKOV Y, JOLLY M. Interactive graph cuts for optimal boundary& region segmentation of objects in nd images [C]∥IEEE ICCV. Vancouver, BC: IEEE, 2001: 105-112.
[8] LU W, XIANG Z, LIU J. High performance visual odometry with two stage local binocular [C]∥IEEEIV. Gold Coast, QLD: IEEE, 2013: 1107-1112.
[9] TRIGGS B, MCLAUCHLAN P F, HARTLEY R I, FITZGIBBON A W. Bundle adjustment — a modern synthesis [C]∥Vision algorithms: theory and practice. Corfu, Greece: Springer, 2000: 298-372.
[10] GEIGER A, LENZ P, URTASUN R. Are we ready for autonomous driving? the kitti vision benchmark suite [C]∥IEEE CVPR. Providence, RI: Springer, 2012: 3354-3361.
[11] HUANG Wen qi, GONG Xiao jin. Fusion based holistic road scene understanding [EB/OL]. (2014 06 29) [2015 07 22] http: ∥arxiv.org/pdf/1406.7525.pdf
[12] LIU Jun yi, GONG Xiao jin. Guided depth enhancement via an isotropic diffusion[C]∥ Advances in Multimedia Information Processing(PCM). Nanjing, China: Springer, 2013: 408-417.
[13] GOULD S, FULTON R, KOLLER D. Decomposing a scene into geometric and semantically consistent regions [C]∥IEEE ICCV. Kyoto: IEEE, 2009: 1-8.
[14] HORNUNG A, WURM K M, BENNEWITZ M, STACHNISS C, et al. OctoMap: an efficient probabilistic 3d mapping framework based on octrees [J]. Autonomous Robots, 2013, 34(3): 189-206.

[1] JIA Song min, LU Ying bin, WANG Li jia, LI Xiu zhi, XU Tao. Mobile robot human tracking using hierarchical features[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2016, 50(9): 1677-1683.
[2] MA Zi ang, XIANG Zhi yu. Calibration and 3D reconstruction with omnidirectional ranging by optic flow camera[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2015, 49(9): 1651-1657.
[3] CAO Teng, XIANG Zhi-yu, LIU Ji-lin. Obstacle detection based on V-intercept in disparity space[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2015, 49(3): 409-414.
[4] WANG Li-jun, HUANG Zhong-chao, ZHAO Yu-qian. New spatial-coherent latent topic model based on super-pixel segmentation and scene classification method[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2015, 49(3): 402-408.
[5] LU Wei, XIANG Zhi-yu, YU Hai-bin, LIU Ji-lin. Object compressive tracking based on adaptive multi-feature appearance model[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2014, 48(12): 2132-2138.
[6] CHEN Ming-ya, XIANG Zhi-yu, LIU Ji-lin. Assistance localization method for mobile robot based on
monocular natural visual landmarks
[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2014, 48(2): 285-291.
[7] LIN Ying, GONG Xiao-jin, LIU Ji-lin. Calibration of fisheye cameras based on the viewing sphere[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2013, 47(8): 1500-1507.
[8] WANG Hui-fang, ZHU Shi-qiang, WU Wen-xiang. Improved adaptive robust control of servo system with harmonic drive[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2012, 46(10): 1757-1763.
[9] OUYANG Liu, XU Jin, GONG Xiao-jin, LIU Ji-lin. Optimization of visual odometry based on uncertainty analysis[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2012, 46(9): 1572-1579.
[10] MA Li-sha, ZHOU Wen-hui, GONG Xiao-jin, LIU Ji-lin. Motion constrained generalized Field D* path planning[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2012, 46(8): 1546-1552.
[11] LU Dan-hui, ZHOU Wen-hui, GONG Xiao-jin, LIU Ji-lin. Decoupled mobile robot motion estimation based on fusion of
visual and inertial measurement unit
[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2012, 46(6): 1021-1026.
[12] XU Jin, SHEN Min-yi, YANG Li, WANG Wei-qiang, LIU Ji-lin. Binocular bundle adjustment based localization
and terrain stitching for robot
[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2011, 45(7): 1141-1146.
[13] CHEN Jia-qian, LIUYu-tian, HE Yan, JIANG Jing-ping. Novel dynamic mapping method based on occupancy grid
model and sample sets
[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2011, 45(5): 794-798.
[14] CHEN Jia-Gan, HE Yan, JIANG Jing-Ping. Improved FastSLAM algorithm based on importance weight smoothing[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2010, 44(8): 1454-1459.
[15] LU Ren-Quan, WEI Jiang, XUE An-Ke. State estimation of network control system based on
linear quantization
[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2010, 44(7): 1400-1405.