J4  2009, Vol. 43 Issue (09): 1597-1603    DOI: 10.3785/j.issn.1008973X.2009.09.009
    
Threedimensional head pose tracking using registration and multiscale appearance model
 DIAO Gang-Jiang, CHEN Ling, CHEN Gen-Cai
(College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China)

Abstract  

To address head pose tracking under large head movements, a registration algorithm based on scale invariant feature transform (SIFT) local descriptors is combined with a multiscale view-based appearance model. The registration algorithm estimates the pose change between two frames by matching salient SIFT features between the two intensity images, and it still works when the scale of the head changes between the frames. The multiscale view-based appearance model reduces drift accumulation when tracking over a large range: it selects key frames online as the head undergoes different motions, and the tracker bounds the drift of the current frame by performing multiple registrations. Experimental results show that the method is accurate (RMS error of 4°) and robust, both when the subject moves about 1 m along the Z axis and when the subject returns to the camera's field of view after abruptly leaving it.
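
The feature-matching step behind the registration can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not say how candidate matches are filtered, so the nearest-neighbour ratio test used here (with a 0.8 threshold) is an assumption, and the function name `ratio_test_matches` and the toy 3-D vectors standing in for 128-D SIFT descriptors are hypothetical.

```python
import math


def ratio_test_matches(desc_a, desc_b, ratio=0.8):
    """Match descriptors of frame A to frame B with a nearest-neighbour ratio test.

    Returns a list of (index_in_a, index_in_b) pairs.
    NOTE: the ratio test and its threshold are assumptions for illustration.
    """
    matches = []
    if len(desc_b) < 2:
        return matches  # the ratio test needs at least two candidates
    for i, da in enumerate(desc_a):
        # Euclidean distance from descriptor i to every descriptor in frame B.
        dists = sorted((math.dist(da, db), j) for j, db in enumerate(desc_b))
        (best, j_best), (second, _) = dists[0], dists[1]
        # Accept only if the best candidate is clearly closer than the runner-up.
        if best < ratio * second:
            matches.append((i, j_best))
    return matches


# Toy 3-D "descriptors" (hypothetical data, not real SIFT output).
frame_a = [(0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (0.5, 0.5, 0.5)]
frame_b = [(1.0, 0.1, 0.0), (0.0, 0.1, 1.0), (0.4, 0.6, 0.5), (0.6, 0.4, 0.5)]

matches = ratio_test_matches(frame_a, frame_b)
print(matches)
```

In this toy data the third descriptor of `frame_a` is equally close to two descriptors in `frame_b`, so it is rejected as ambiguous. In a full tracker the surviving matches would feed the pose-change estimate between the two frames.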



CLC:  TP 391  
Cite this article:

DIAO Gang-Jiang, CHEN Ling, CHEN Gen-Cai. Three-dimensional head pose tracking using registration and multiscale appearance model. J4, 2009, 43(09): 1597-1603.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008973X.2009.09.009     OR     http://www.zjujournals.com/eng/Y2009/V43/I09/1597


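Three-dimensional head tracking based on registration and a multiscale appearance model

To address head pose tracking when a person moves over a large range, a three-dimensional head pose tracking method is proposed, based on scale invariant feature transform (SIFT) local descriptor registration and a multiscale appearance model. The SIFT local descriptor based registration algorithm computes the head motion between two frames by matching feature points between the two grayscale images, and still gives accurate results when the scale of the face changes to some extent between the two frames. The multiscale view-based appearance model reduces error accumulation during large-range tracking: it selects key frames with different head poses online and reduces the accumulated error of the current frame through multiple registrations. Experimental results show that the method is accurate (root mean square (RMS) error of 4°) and robust, both when the person moves forward and backward by about 1 m and when the head leaves and re-enters the camera's field of view.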

