A hardware optimizing implementation with compressing attends was proposed for the problem that the critical path of traditional Gaussian filter hardware implementation is long and the logic delay is large. In the Gaussian filter hardware optimizing implementation process, shift operation was used to achieve multiplication and division calculation in order to avoid the use of multipliers and dividers. Three kinds of circuit structure, i.e. Carry Save Adder (CSA), 4-2 compressor based on two multiplexer (MUX), and tree structure for compressing attends, were used to compress the nine attends at three levels. After optimizing, only one full adder was needed to obtain the sum of results. The experimental results show that the hardware optimizing implementation of Gaussian filter with compressing attends can shorten the critial path and reduce the logic delay. The proposed method reduces the logic delay by32.48% and occupies less Macro Statistics of Adders compared with the traditional implementation. The saved Macro Statistics of Adders can provide greater design freedom for the following image processing units.
CHEN Chao, LUO Xiao-hua, CHEN Shu-qun, YU Guo-jun. Optimizing implementation of Gaussian filter based on field programmable gate array. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2017, 51(5): 969-975.
参考文献(References):
[1] DRAPER B A, BEVERIDGE J R, BOHM A P W, ROSS C, et al. Accelerated image processing on FPGAs [J]. IEEE Transactions on Image Processing, 2003, 12(12): 1543-1551.
[2] VEGA-RODRIGUEZ M A, SANCHEZPEREZ J M, GOMEZ-PULIDO J.A. Real time image processing with reconfigurable hardware [C]∥8th IEEE International Conference on Electronics, Circuits and Systems. Malta: IEEE, 2001: 213-216.
[3] LEESER M L, MILLER S, YU H. Smart camera based on reconfigurable hardware enables diverse real time applications [C]∥ Proceedings of the 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines. Napa: IEEE,2004: 147-155.
[4] KALOMIROS J A, LYGOURAS J. Design and evaluation of a hardware/software FPGA-based system for fast image processing [J]. Microprocessors and Microsystems, 2008, 32(2): 95-106.
[5] LEE P S, LEE C S, JU H L. Development of FPGA-based digital signal processing system for radiation spectroscopy [J]. Radiation Measurements, 2013, 48(1): 12-17.
[6] 杨帆,张皓,马新文,等.基于FPGA的图像处理系统[J].华中科技大学学报:自然科学版,2015,43(2): 119-123.
YANG Fan, ZHANG Hao, MA Xin-wen, JIANG Yong, et al. Image processing system based on FPGA [J]. Journal of Huazhong University of Science & Technology :Natural Science Edition, 2015, 43(2): 119-123.
[7] 付昱强.基于FPGA的图像算法的研究与硬件实现[D].南昌:南昌大学,2006.
FU Yu-qiang. Research and hardware design of image processing algorithms based on FPGA [D]. Nanchang: Nanchang University , 2006.
[8] NIDHI, GURINDERPAL S. Efficient design of ripple carry adder and carry skip adder with low quantum cost and low power consumption [J].International Journal of Engineering Research and Applications, 2014, 4(7):247-251.
[9] EFSTATHIOU C, OWDA Z, TSIATOUHAS Y. New high-speed multioutput carry look-ahead adders [J]. IEEE Transactions on Circuits and Systems II: Express Briefs, 2013, 60(10): 667-671.
[10] ANAND V, VIJAYAKUMAR V. Optimizing area of vedic multiplier using Brent-Kung adder [J]. Research Journal of Pharmaceutical, Biological and Chemical Sciences, 2016, 7(3): 1178-1185.
[11] PRIYADHARSHINI M, THIRUVENI M. Performance evaluation of carry select adder-review [J]. International Journal of Engineering Sciences & Research Technology, 2014, 3(11): 469-475.
[12] VARUN P.S, SHIV D, MANISH R. Efficient carry skip Adder design using full adder and carry skip block based on reversible Logic [J]. American Journal of Engineering Research, 2015, 4(12): 95-100.
[13] 赵忠明,林正浩.一种改进的Wallace树型乘法器的设计[J].电子设计应用,2006: 113-116.
ZHAO Zhong-ming, LIN Zheng-hao. Optimizing multiplier design and implementation based on optimizing Wallace tree [J]. Electronic Design & Application, 2006(8): 113-116.
[14] 李军强,李东生,李奕磊,等.32×32高速乘法器的设计与实现[J].微电子学与计算机,2009,26(12): 23-26.
LI Jun-qiang, LI Dong-sheng, LI Yi-lei, et al. 32×32 High-speed Multiplier Design and Implementation [J]. Microelectronics & Computer, 2009, 26(12): 23-26.
[15] 王红君,施楠,赵辉,等.改进中值滤波方法的图像预处理技术[J].计算机系统应用,2015, 24(5): 237-240.
WANG Hong-jun, SHI Nan, ZHAO Hui, et al. Image preprocessing technology based on improved median filter [J]. Computer Systems & Applications, 2015, 24(5): 237-240.
[16] ITOH N, NAEMURA Y, MAKINO H. A 600-MHz 54×54-bit multiplier with rectangular-styled Wallace tree[J]. IEEE Journal of Solid-State Circuits, 2001,36(2): 249-257.
XIU Si-wen, LI Yan-zhe, HUANG Kai, MA De, YAN Rong-jie, YAN Xiao-lang. Cache modeling for MPSoC performance estimation[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2015, 49(7): 1367-1375.
HUANG Kai-jie, HUANG Kai, MA De, WANG Yu-bo,FENG Jiong, GE Hai-tong, YAN Xiao-la. IP-XACT standard based SoC design methodology[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2013, 47(10): 1770-1776.
ZHANG Yang, WANG Xiu-min, CHEN Hao-wei. FPGA based design of LDPC encoder[J]. JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), 2011, 45(9): 1582-1586.