Electronics, Communication and Automatic Control Technology
Instruction recycling based low power branch folding
MENG Jianyi, YAN Xiaolang, GE Haitong, XU Hongming
Institute of VLSI Design, Zhejiang University, Hangzhou 310027, China