Please wait a minute...
J4  2013, Vol. 47 Issue (7): 1213-1217    DOI: 10.3785/j.issn.1008-973X.2013.07.012
    
Low power instruction cache based on adjacent line linking access
XIANG Xiao-yan, CHEN Zhi-jian, MENG Jian-yi, YAN Xiao-lang
Institute of VLSI Design, Zhejiang University, Hangzhou 310027, China
Download:   PDF(0KB) HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

 The behavior of cache accessing was analyzed. A new low power instruction cache accessing method that links the current cache line and its adjacent cache lines was proposed. When a direct jump between cache lines occurs, the adjacent cache line links are reused to get the accurate way information of the target line. Then the accesses of tag array are reduced and tag lookups are avoided to reduce the dynamic power consumption. When a cache line is evicted, only its adjacent cache line links should be checked and invalidated to keep the correctness of the links. Experiment results show that dynamic power consumption can be reduced by 6% on average with the new method compared to the traditional way memorization instruction cache.



Published: 01 July 2013
CLC:  TN 332  
  TN 47  
Cite this article:

XIANG Xiao-yan, CHEN Zhi-jian, MENG Jian-yi, YAN Xiao-lang. Low power instruction cache based on adjacent line linking access. J4, 2013, 47(7): 1213-1217.

URL:

http://www.zjujournals.com/eng/10.3785/j.issn.1008-973X.2013.07.012     OR     http://www.zjujournals.com/eng/Y2013/V47/I7/1213


基于邻行链接访问的低功耗指令高速缓存

通过分析高速缓存访问的局部性原理,提出当前高速缓存访问行与若干紧邻行链接访问的低功耗指令缓存访问方法.该方法能够在发生相对跳转时依托于相邻行之间的访问链接信息,精确获得跳转目标行的路访问信息,减少对高速缓存标志存储器的访问,达到降低动态功耗的目的.在高速缓存行发生替换时,仅需检测并清除被替换行相邻范围内的若干缓存行的链接信息,从而实现链接关系的正确性.与基于路记忆访问的高速缓存器相比,应用该方法的高速缓存器的动态功耗可以平均减少6%.

[1] GONZALEZ R, HOROWITZ M. Energy dissipation in general purpose microprocessors [J]. IEEE Journal of Solid-State Circuits, 1996, 31(9):1277-1284.
[2] 孟建熠,严晓浪,葛海通,等.基于指令回收的低功耗循环分支折合技术[J].浙江大学学报:工学版, 2010, 44(4): 632-638.
MENG Jian-yi, YAN Xiao-lang, GE Hai-tong, et al. Instruction recycling based low power branch folding [J]. Journal of Zhejiang University: Engineering Science, 2010,44(4):632-638.
[3] TSAI Y Y, CHEN C H. Energy-efficient trace reuse cache for embedded processors [J]. IEEE Transactions on Very Large Scale Integration Systems, 2011,19(9):1681-1694.
[4] HASEGAWA A, KAWASAKI I, YAMADA K, et al. SH3: high code density, low power [J]. IEEE Micro, 1995, 15(6): 11-19.
[5] INOUE K, ISHIHARA T, MURAKAMI K. Way-predicting set-associative cache for high performance and low energy consumption [C]∥ Proceedings of ISLPED. California: [s.n.], 1999: 273-275.
[6] XU C P, ZHANG G, HAO S Q. Fast way-prediction instruction cache for energy efficiency and high performance [C]∥ Proceedings of NAS. Zhang Jia Jie: [s.n.], 2009: 235-238.
[7] MA A, ZHANG M, ASANOVIC K. Way memorization to reduce fetch energy in instruction caches [C]∥ ISCA Workshop on Complexity Effective Design. Sweden: IEEE, 2001.
[8] XIE Z C, TONG D, CHENG X. WHOLE: a low energy I-cache with separate way history [C]∥ Proceedings of IEEE International Conference on Computer Design. California: IEEE, 2009:137-143.
[9] 龚帅帅,吴晓波,孟建熠,等. 基于历史链接关系的指令高速缓存低功耗方法[J].浙江大学学报:工学版,2011,45(3):467-471.
GONG Shuai-shuai, WU Xiao-bo, MENG Jian-yi, et al. Linking history based low-power instruction cache [J]. Journal of Zhejiang University: Engineering Science, 2011,45(3):467-471.
[10] PANWAR R, RENNELS D. Reducing the frequency of tag compares for low power I:cache design [C]∥ Proceedings of ISLPED. California: [s.n.], 1995:57-62.
[11] C-SKY microsystems. 32-bit high performance and low power embedded processor [EB/OL]. [2003-08-01]. http:∥ www.c-sky.com
[12] BROOKS D, TIWARI V, MARTONOSI M. Wattch: a framework for architectural-level power analysis and optimizations [C]∥ Proceedings of the 27th Annual International Symposium on Computer Architecture. Vancouver: [s.n.], 2000:83-94.

[1] CHEN Zhi-jian, MENG Jian-yi, GE Hai-tong, YAN Xiao-lang. Translation lookaside buffer  design  based on
dynamic memory page merging
[J]. J4, 2012, 46(1): 118-122.
[2] CHEN Zhi-jian, MENG Jian-yi, GE Hai-tong, YAN Xiao-lang. High performance hardware stack for seamless context switching[J]. J4, 2011, 45(9): 1587-1592.