J4  2010, Vol. 44 Issue (1): 75-80    DOI: 10.3785/j.issn.1008-973X.2010.01.014
Computer Science and Technology
Optimization of data forwarding based on early write-back strategy
CAI Wei-guang, YAO Qing-dong, LIU Peng, ZHANG Qi, ZHANG Yi-xiong
(Department of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China)
Abstract:

For deep pipelines and complex instruction set architectures, a data forwarding model based on operand access timing is presented, which uses five parameters to describe the instruction execution process. The model is applied to MediaDSP64, a prototype processor with a RISC/DSP architecture. Building on a distributed forwarding circuit, a forwarding optimization method based on an early write-back strategy is proposed. Without degrading instruction execution efficiency, the strategy reduces the number of forwarding sources by writing the results of auxiliary registers in DSP instructions back to the register file early. Because this method causes out-of-order instruction execution, a shadow register structure is designed to guarantee precise exception handling. Experimental results show that the hardware resource usage of the forwarding circuit is reduced by 43.8% and the critical path delay is reduced by 19.8%.
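The two ideas in the abstract, fewer forwarding sources through early write-back and a shadow copy for precise exceptions, can be illustrated with a toy model. The sketch below is not the MediaDSP64 design: the stage numbers, the function `forwarding_sources`, the class `ShadowedRegisterFile`, and the register name `a0` are all hypothetical, and the model assumes instructions commit in program order.

```python
def forwarding_sources(result_stage, write_back_stage):
    """Count pipeline stages that would have to act as a forwarding source.

    Simplifying assumption: a result produced in `result_stage` must be
    bypassed from every later stage up to and including the stage in which
    it reaches the register file.
    """
    return write_back_stage - result_stage


class ShadowedRegisterFile:
    """Toy register file with an undo log playing the role of shadow registers."""

    def __init__(self, initial):
        self.regs = dict(initial)
        self._undo = []  # (instr_id, reg, old_value), oldest first

    def early_write_back(self, instr_id, reg, value):
        # Keep the previous value so it can be restored on an exception.
        self._undo.append((instr_id, reg, self.regs[reg]))
        self.regs[reg] = value

    def commit(self, instr_id):
        # The instruction completed normally (assumed in program order),
        # so its saved values are no longer needed.
        self._undo = [e for e in self._undo if e[0] != instr_id]

    def rollback(self):
        # An exception occurred: undo all uncommitted writes, newest first.
        while self._undo:
            _, reg, old = self._undo.pop()
            self.regs[reg] = old


# Hypothetical stage numbers: result ready in stage 3, normal write-back in stage 7.
normal = forwarding_sources(3, 7)
early = forwarding_sources(3, 4)

rf = ShadowedRegisterFile({"a0": 10})
rf.early_write_back(instr_id=1, reg="a0", value=14)
after_write = rf.regs["a0"]
rf.rollback()
after_rollback = rf.regs["a0"]
```

In this model, moving the write-back from stage 7 to stage 4 leaves one stage to forward from instead of four, and the rollback restores the value that `a0` held before the uncommitted write.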

Published: 2010-02-26
CLC number: TP 302
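Funding:

Supported by the National Natural Science Foundation of China (60873112) and the National High Technology Research and Development Program of China ("863" Program) (2009AA01Z109).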

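Corresponding author: LIU Peng, male, associate professor. E-mail: liupeng@zju.edu.cn
About the author: CAI Wei-guang (1983-), male, from Xinxiang, Henan, Ph.D. candidate; research interests include audio and video processing and media SoC system design.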

Cite this article:

CAI Wei-guang, YAO Qing-dong, LIU Peng, et al. Optimization of data forwarding based on early write-back strategy [J]. J4, 2010, 44(1): 75-80.

Link to this article:

http://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2010.01.014
http://www.zjujournals.com/eng/CN/Y2010/V44/I1/75

