ABSTRACT
In a multi-core system, communication across cores is managed by an on-chip interconnect called Network-on-Chip (NoC). The utilization of NoC results in limitations such as high communication delay and high network power consumption. The buffers of the NoC router consume a considerable amount of leakage power. This paper attempts to reduce leakage power consumption by using Non-Volatile Memory technology-based buffers. NVM technology has the advantage of higher density and low leakage but suffers from costly write operation, and weaker write endurance. These characteristics impact on the total network power consumption, network latency, and lifetime of the router as a whole.
In this paper, we propose a write reduction technique, which is based on dirty flits present in write-back data packets. The method also suggests a dirty flit based Virtual Channel (VC) allocation technique that distributes writes in NVM technology-based VCs to improve the lifetime of NVM buffers.
The experimental evaluation on the full system simulator shows that the proposed policy obtains a 53% reduction in write-back flits, which results in 27% lesser total network flit on average. All these results in a significant decrease in total and dynamic network power consumption. The policy also shows remarkable improvement in the lifetime.
Supplemental Material
- Sukarn Agarwal and Hemangee K Kapoor. 2017. Lifetime enhancement of non-volatile caches by exploiting dynamic associativity management techniques. In IFIP/IEEE International Conference on Very Large Scale Integration-System on a Chip. Springer, 46--71.Google Scholar
- Niket Agarwal et al. 2009. GARNET: A Detailed On-Chip Network Model inside a Full-System Simulator. In Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on. IEEE, 33--42.Google Scholar
- Dmytro Apalkov et al. 2013. Spin-transfer torque magnetic random access memory (STT-MRAM). ACM Journal on Emerging Technologies in Computing Systems (JETC) 9, 2 (2013), 13.Google Scholar
- S. Arcaro. 2014. Integration of STT-MRAM model into CACTI simulator. In Design Test Symposium (IDT), 2014 9th International. 67--72. Google ScholarCross Ref
- Nathan Binkert et al. 2011. The gem5 Simulator. ACM SIGARCH Computer Architecture News 39, 2 (2011), 1--7.Google ScholarDigital Library
- Rahul Boyapati et al. 2017. Approx-noc: A data approximation framework for network-on-chip architectures. In ACM SIGARCH Computer Architecture News, Vol. 45. ACM, 666--677.Google Scholar
- Reetuparna Das et al. 2008. Performance and power optimization through data compression in network-on-chip architectures. In 2008 IEEE 14th International Symposium on High Performance Computer Architecture. IEEE, 215--225.Google Scholar
- O Golonzka et al. 2018. MRAM as Embedded Non-Volatile Memory Solution for 22FFL FinFET Technology. In 2018 IEEE International Electron Devices Meeting (IEDM). IEEE, 18--1.Google Scholar
- John L Henning. 2006. SPEC CPU2006 Benchmark Descriptions. ACM SIGARCH Computer Architecture News 34, 4 (2006), 1--17.Google ScholarDigital Library
- Yatin Hoskote et al. 2007. A 5-GHz Mesh Interconnect for a Teraflops Processor. IEEE Micro 27, 5 (2007), 51--61.Google ScholarDigital Library
- A. Jog. 2012. Cache revive: Architecting volatile STT-RAM caches for enhanced performance in CMPs. In DAC Design Automation Conference 2012. 243--252. Google ScholarDigital Library
- Ju Sung Kim et al. 2018. Lifetime Improvement Method using Threshold-based Partial Data Compression in NoC. In 2018 International SoC Design Conference (ISOCC). IEEE, 269--270.Google Scholar
- Young-Bae Kim et al. 2011. Bi-layered RRAM with unlimited endurance and extremely uniform switching. In VLSI Technology (VLSIT), 2011 Symposium on. IEEE, 52--53.Google Scholar
- Sparsh Mittal et al. 2015. A Survey of Architectural Approaches for Managing Embedded DRAM and Non-Volatile On-Chip Caches. IEEE Transactions on Parallel and Distributed Systems (2015), 14.Google Scholar
- Moinuddin K Qureshi et al. 2011. Phase change memory: From devices to systems. Synthesis Lectures on Computer Architecture 6, 4 (2011), 1--134.Google ScholarCross Ref
- Khushboo Rani and Hemangee K Kapoor. 2019. Write-variation aware alternatives to replace SRAM buffers with non-volatile buffers in on-chip interconnects. IET Computers & Digital Techniques 13, 6 (2019), 481--492.Google ScholarCross Ref
- Khushboo Rani and Hemangee K Kapoor. 2019. Write Variation Aware Buffer Assignment for Improved Lifetime of Non-Volatile Buffers in On-Chip Interconnects. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 27, 9 (2019), 2191--2204.Google ScholarCross Ref
- Clinton W Smullen et al. 2011. Relaxing Non-Volatility for Fast and Energy-Efficient STT-RAM Caches. In High Performance Computer Architecture (HPCA), 2011 IEEE 17th International Symposium on. IEEE, 50--61.Google Scholar
- YJ Song et al. 2018. Demonstration of Highly Manufacturable STT-MRAM Embedded in 28nm Logic. In 2018 IEEE International Electron Devices Meeting (IEDM). IEEE, 18--2.Google Scholar
- Chen Sun et al. 2012. DSENT-A Tool Connecting Emerging Photonics with Electronics for Opto-Electronic Networks-on-Chip Modeling. In Networks on Chip (NoCS), 2012 Sixth IEEE/ACM International Symposium on. IEEE, 201--210.Google Scholar
- Jue Wang et al. 2013. i 2 WAP: Improving Non-Volatile Cache Lifetime by Reducing Inter-and Intra-set Write Variations. In High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th International Symposium on. IEEE, 234--245.Google Scholar
- Ying Wang et al. 2016. DISCO: A low overhead in-network data compressor for energy-efficient chip multi-processors. In 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC). IEEE, 1--6.Google Scholar
- Jia Zhan et al. 2014. NoΔ: Leveraging delta compression for end-to-end memory access in NoC based multicores. In 2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC). IEEE, 586--591.Google Scholar
- Jia Zhan et al. 2016. Hybrid Drowsy SRAM and STT-RAM Buffer Designs for Dark-Silicon-Aware NoC. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 24, 10 (2016), 3041--3054.Google ScholarDigital Library
- Ping Zhou et al. 2009. Frequent value compression in packet-based NoC architectures. In 2009 Asia and South Pacific Design Automation Conference. IEEE, 13--18.Google Scholar
Index Terms
- DidaSel: dirty data based selection of VC for effective utilization of NVM buffers in on-chip interconnects
Recommendations
ZENCO: zero-bytes based encoding for non-volatile buffers in on-chip interconnects
DAC '20: Proceedings of the 57th ACM/EDAC/IEEE Design Automation ConferenceWith multiple cores integrated on the same die, communication across cores is managed by on-chip interconnect called Network-on-Chip (NoC). Power and performance of these interconnect become a significant factor as the communication network has ...
A Survey Of Architectural Approaches for Managing Embedded DRAM and Non-Volatile On-Chip Caches
Recent trends of CMOS scaling and increasing number of on-chip cores have led to a large increase in the size of on-chip caches. Since SRAM has low density and consumes large amount of leakage power, its use in designing on-chip caches has become more ...
A NOC closed-loop performance monitor and adapter
In a NoC, the amount of buffers allocated to each communication channel has a significant impact on performance and power consumption. Moreover, since there will be changes in the application communication pattern, or even because a new application is ...
Comments