A dimension reduction method of the histogram of oriented gradients

doi:10.3785/j.issn.1008-9497.2017.02.002

Journal of Zhejiang University (Science Edition)

2017, Vol. 44, Issue (2): 134-138

FU Hongpu^1,2, ZOU Beiji^1

1. Ministry of Education-China Mobile Joint Laboratory for Mobile Health, School of Information Science and Engineering, Central South University, Changsha 410083, China;
2. Department of Information Science and Engineering, Hunan First Normal University, Changsha 410205, China


Abstract: To describe the local appearance and shape of an object, the histogram of oriented gradients (HOG) first divides an image into small spatial regions called cells and accumulates, over the pixels of each cell, a 1-D histogram of gradient directions. The cell histograms are then concatenated over larger regions called blocks, each consisting of several adjacent cells, and normalized to form the feature vector. To weaken the regional quantization aliasing caused by blocks, the blocks are partly overlapped when the features of a detection window are computed, which greatly increases the feature dimension and hence the computation required for object detection. By enlarging the area over which pixel gradients are interpolated between neighboring cells and setting the scale of the Gaussian smoothing kernel appropriately, the overlap between blocks can be eliminated, which reduces the HOG feature dimension of a 64×128 window from 3 780 to 1 152. Experiments on the INRIA pedestrian dataset show that the method also weakens regional quantization aliasing and that its performance is almost the same as that of the original HOG, while detection is significantly faster.

Key words: histograms of oriented gradients; dimension reduction; linear interpolation; overlapping sampling; pedestrian detection
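The two dimension counts quoted in the abstract can be checked with a little arithmetic. The sketch below assumes the common HOG layout of 8×8-pixel cells, 2×2-cell blocks and 9 orientation bins; these parameters are not stated on this page, and the function name `hog_dimension` is hypothetical. Under those assumptions, a block stride of one cell (overlapping blocks) and a stride of two cells (no overlap) reproduce the figures 3 780 and 1 152.

```python
def hog_dimension(win_w, win_h, cell=8, block_cells=2, stride_cells=1, bins=9):
    """Length of the HOG descriptor of one detection window.

    Assumed (not taken from the article): square cells of `cell` pixels,
    square blocks of `block_cells` x `block_cells` cells, `bins` orientation
    bins per cell, and a block stride measured in cells.
    """
    cells_x, cells_y = win_w // cell, win_h // cell
    # Number of block positions along each axis for the given stride.
    blocks_x = (cells_x - block_cells) // stride_cells + 1
    blocks_y = (cells_y - block_cells) // stride_cells + 1
    # Each block contributes one histogram per cell it contains.
    return blocks_x * blocks_y * block_cells * block_cells * bins


# Overlapping blocks: stride of one cell -> 7 x 15 blocks, 36 values each.
overlapping = hog_dimension(64, 128, stride_cells=1)
# Non-overlapping blocks: stride of two cells -> 4 x 8 blocks, 36 values each.
non_overlapping = hog_dimension(64, 128, stride_cells=2)
print(overlapping, non_overlapping)
```

Removing the overlap cuts the number of blocks from 105 to 32, which is where the roughly threefold reduction in feature length comes from. The sketch only counts dimensions; it does not implement the interpolation or the Gaussian weighting that the article uses to keep detection performance.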

Received: 2016-07-25; Published: 2017-07-08

CLC: TP391.41

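Funding: Supported by the National Natural Science Foundation of China (61573380) and the Hunan Provincial Key Laboratory Cultivation Base Project (2015TP1017).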

About the author: FU Hongpu (1973-), ORCID: http://orcid.org/0000-0002-6376-4716, male, doctoral candidate, mainly engaged in computer vision research, E-mail: fuhpi@126.com.


Cite this article:

FU Hongpu, ZOU Beiji. A dimension reduction method of the histogram of oriented gradients[J]. Journal of Zhejiang University (Science Edition), 2017, 44(2): 134-138.

Link to this article:

https://www.zjujournals.com/sci/CN/10.3785/j.issn.1008-9497.2017.02.002 or https://www.zjujournals.com/sci/CN/Y2017/V44/I2/134


