skip to main content
10.1145/1999946.1999977acmconferencesArticle/Chapter ViewAbstractPublication PagesnocsConference Proceedingsconference-collections
research-article

A low-latency adaptive asynchronous interconnection network using bi-modal router nodes

Published:01 May 2011Publication History

ABSTRACT

A new bi-modal asynchronous arbitration node is introduced for use as a building block in an adaptive asynchronous interconnection network. The target network topology is a variant Mesh-of-Trees (MoT), combining a binary fan-out network (i.e. routing) and a binary fan-in network (i.e. arbitration) for each source-sink pair. The key feature of the new arbitration node is that it dynamically reconfigures based on the traffic it receives, entering a special "single-channel-bias" mode when the other channel has no recent activity. Arbitration is totally bypassed on the critical path, resulting in significantly lower node latency and, in high-traffic scenarios, improved throughput.

The router nodes were implemented in IBM 90nm technology using ARM standard cells. SPICE simulations indicate that the bi-modal arbitration node provided significant reductions in latency (41.6%), and increased throughput (19.8%, in high-traffic single-channel scenarios), when in biased mode. Node reconfiguration required at most 338 ps. Simulations were then performed on two distinct MoT indirect networks, "baseline" and "adaptive" (the latter incorporating the new bi-modal node), on eight diverse synthetic benchmarks, using mixes of random and deterministic traffic. Improvements in system latency up to 19.8% and throughput up to 27.8% were obtained using the adaptive network. Overall end-to-end latencies, through 6 router nodes and 5 hops, of 1.8-2.8 ns (at 25% load) and throughputs of 0.27-1.8 Gigaflits/s (at saturation rate) were also observed.

References

  1. J. Bainbridge and S. Furber. Chain: a delay-insensitive chip area interconnect. IEEE Micro Magazine, 22(5):16--23, Sept.-Oct. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. O. Balkan, M. N. Horak, G. Qu, and U. Vishkin. Layout-accurate design and implementation of a high-throughput interconnection network for single-chip parallel processing. In Hot Interconnects, pages 21--28, Aug. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. O. Balkan, G. Qu, and U. Vishkin. An area-efficient high-throughput hybrid interconnection network for single-chip parallel processing. In Proc. of ACM/IEEE DAC Conf., pages 435--440, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. E. Beigné, F. Clermidy, P. Vivet, A. Clouard, and M. Renaudin. An asynchronous NOC architecture providing low latency service and its multi-level design framework. In Proc. of IEEE Async Symp., pages 54--63, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. Benini and G. De Micheli. Networks on chips: A new SoC paradigm. IEEE Computer, 35(1):70--78, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. T. Bjerregaard and J. Sparsoe. A router architecture for connection-oriented service guarantees in the MANGO clock-less network-on-chip. In Proc. Design, Automation and Test in Europe (DATE), pages 1226--1231, March 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. W. Dally and B. Towles. Principles and Practices of Interconnection Networks. Morgan Kaufmann Publishers, Inc., 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Dobkin, R. Ginosar, and C. P. Sotiriou. Data synchronization issues in GALS SoCs. In Proc. of IEEE Async Symp., pages 170--179, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  9. R. M. Fuhrer and S. M. Nowick. Sequential Optimization of Asynchronous and Synchronous Finite-State Machines: Algorithms and Tools. Kluwer Academic Publishers, 2001. {MINIMALIST download site (URL): www.cs.columbia.edu/~nowick/asynctools}. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. H. van Gageldonk, K. van Berkel, A. Peeters, D. Baumann a nd D. Gloor, and G. Stegmann. An asynchronous low-power 80C51 microcontroller. In Proc. of IEEE Async Symp., pages 96--107, Apr. 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. N. Horak. A high-throughput, low-power asynchronous mesh-of-trees interconnection network for the explicit multithreading (XMT) parallelarchitecture. Master's thesis, Univ. of Maryland, August 2008. http://hdl.handle.net/1903/8361.Google ScholarGoogle Scholar
  12. M. N. Horak, S. M. Nowick, M. Carlberg, and U. Vishkin. A low-overhead asynchronous interconnection network for GALS chip multiprocessors. In Proc. of ACM NOCS Symp., pages 43--50, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Imai, K. Takada, and T. Nanya. Fine-grain leakage power reduction method for m-out-of-n encoded circuits using multi-threshold-voltage transistors. In Proc. of IEEE Async Symp., pages 209--216, May 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. N. K. Jain, P. V. Gratz, A. Sprintson, and G. Choi. Asynchronous bypass channels: Improving performance for multi-synchronous NoCs. In Proc. of ACM NOCS Symp., pages 43--50, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Kumar, L.-S. Peh, P. Kundu, and N. K. Jha. Express virtual channels: Towards the ideal interconnection fabric. In Proc. of ISCA, pages 150--161, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Naishlos, J. Nuzman, C.-W. Tseng, and U. Vishkin. Towards a first vertical prototype of an extremely fine-grained parallel programming approach. IEEE Theory of Computing Systems (TOCS), 36(5):521--552, September 2003.Google ScholarGoogle Scholar
  17. S. M. Nowick, M. N. Horak, and M. Carlberg. Asynchronous digital circuits including arbitration and routing primitives for asynchronous and mixed-timing networks. US Patent App. PCT/US09/50561, 7/14/2009.Google ScholarGoogle Scholar
  18. U. Y. Ogras, J. Hu, and R. Marculescu. Key research problems in NoC design: A holistic perspective. In Proc. of CODES, pages 69--74, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. D. Owens, W. J. Dally, R. Ho, D. N. Jayasimha, S. W. Keckler, and L.-S. Peh. Research challenges for on-chip interconnection networks. IEEE Micro, 27(5):96--108, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Sheibanyrad, A. Greiner, and I. Miro-Panades. Multisyn-chronous and fully asynchronous NoCs for GALS. IEEE Design & Test, 25(6):572--580, Nov. 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Singh and S. M. Nowick. MOUSETRAP: High-speed transition-signaling asynchronous pipelines. IEEE Trans. VLSI, 15(6):684--697, June 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. P. Teehan, M. Greenstreet, and G. Lemieux. A survey and taxonomy of GALS design styles. IEEE Design & Test, pages 418--428, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Teifel and R. Manohar. An asynchronous dataflow FPGA architecture. IEEE Transactions on Computers, 53(11):1376--1392, Nov. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Y. Thonnart, E. Beigne, A. Valentian, and P. Vivet. Automatic power regulation based on an asynchronous activity detection and its application to ANOC node leakage reduction. In Proc. of IEEE Async Symp., pages 48--57, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A low-latency adaptive asynchronous interconnection network using bi-modal router nodes

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      NOCS '11: Proceedings of the Fifth ACM/IEEE International Symposium on Networks-on-Chip
      May 2011
      282 pages
      ISBN:9781450307208
      DOI:10.1145/1999946

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 May 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate14of44submissions,32%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader