skip to main content
research-article

Graph-Based Approaches to Placement of Processing Element Networks on FPGAs for Physical Model Simulation

Published:15 December 2014Publication History
Skip Abstract Section

Abstract

Physical models utilize mathematical equations to characterize physical systems like airway mechanics, neuron networks, or chemical reactions. Previous work has shown that field programmable gate arrays (FPGAs) execute physical models efficiently. To improve the implementation of physical models on FPGAs, this article leverages graph theoretic techniques to synthesize physical models onto FPGAs. The first phase maps physical model equations onto a structured virtual processing element (PE) graph using graph theoretic folding techniques. The second phase maps the structured virtual PE graph onto physical PE regions on an FPGA using graph embedding theory. A simulated annealing algorithm is introduced that can map any physical model onto an FPGA regardless of the model's underlying topology. We further extend the simulated annealing approach by leveraging existing graph drawing algorithms to generate the initial placement. Compared to previous work on physical model implementation on FPGAs, embedding increases clock frequency by 25% on average (for applicable topologies), whereas simulated annealing increases frequency by 13% on average. The embedding approach typically produces a circuit whose frequency is limited by the FPGA clock instead of routing. Additionally, complex models that could not previously be routed due to complexity were made routable when using placement constraints.

References

  1. Romas Aleliunas and Arnold Rosenberg. 1982. On embedding rectangular grids in square grids. IEEE Transactions on Computers 31, 9, 907--913. DOI:http://dx.doi.org/10.1109/TC.1982.1676109 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Pritha Banerjee, Susmita Sur-Kolay, Arijit Bishnu, Sandip Das, Subhas C. Nandy, and Subhasis Bhattacharjee. 2009. FPGA placement using space-filling curves: Theory meets practice. ACM Transactions on Embedded Computing Systems 9, 2, Article No. 12. DOI:http://doi.acm.org/10.1145/1596543.1596546 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Francine Berman and Lawrence Snyder. 1987. On mapping parallel algorithms into parallel architectures. Journal of Parallel and Distributed Computing 4, 5, 439--458. DOI:http://dx.doi.org/10.1016/0743--7315(87)90018--9 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Abhinav Bhatelé and Laxmikant V. Kalé. 2008. Benefits of topology aware mapping for mesh interconnects. Parallel Processing Letters 18, 4, 549--566.Google ScholarGoogle ScholarCross RefCross Ref
  5. Shahid H. Bokhari. 1981. On the mapping problem. IEEE Transactions on Computers 30, 3, 207--214. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Woei-Kae Chen and Matthias F. M. Stallmann. 1995. On embedding binary trees into hypercubes. Journal of Parallel and Distributed Computing 24, 2, 132--138. DOI:http://dx.doi.org/10.1006/jpdc.1995.1013 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. John A. Ellis. 1991. Embedding rectangular grids into square grids. IEEE Transactions on Computers 40, 1, 46--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Thomas MJ Fruchterman and Edward M. Reingold. 1991. Graph drawing by force-directed placement. Software: Practice and Experience 21, 11, 1129--1164. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Elzbieta Gabryś;, Marek Rybaczuk, and Alicja Kedzia. 2005. Fractal models of circulatory system. Symmetrical and asymmetrical approach comparison. Chaos, Solitons & Fractals 24, 3, 707--715.Google ScholarGoogle ScholarCross RefCross Ref
  10. Padmini Gopalakrishnan, Xin Li, and Lawrence Pileggi. 2006. Architecture-aware FPGA placement using metric embedding. In Proceedings of the 43rd Annual Design Automation Conference (DAC’06). ACM, New York, NY, 460--465. DOI:http://doi.acm.org/10.1145/1146909.1147033 Google ScholarGoogle Scholar
  11. Keith Horsfield, Wendy Kemp, and Sally Phillips. 1982. An asymmetrical model of the airways of the dog lung. Journal of Applied Physiology 52, 1, 21--26.Google ScholarGoogle ScholarCross RefCross Ref
  12. Chen Huang, Bailey Miller, Frank Vahid, and Tony Givargis. 2012. Synthesis of custom networks of heterogeneous processing elements for complex physical system emulation. In Proceedings of the 8th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS’12). ACM, New York, NY, 215--224. DOI:http://doi.acm.org/10.1145/2380445.2380483 Google ScholarGoogle Scholar
  13. Chen Huang, Frank Vahid, and Tony Givargis. 2011. A custom FPGA processor for physical model ordinary differential equation solving. IEEE Embedded Systems Letters 3, 4, 113--116. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. David S. Johnson, Cecilia R. Aragon, Lyle A. McGeoch, and Catherine Schevon. 1989. Optimization by simulated annealing: An experimental evaluation. Part I: Graph partitioning. Operations Research 37, 6, 865--892. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Alexander Marquardt, Vaughn Betz, and Jonathan Rose. 2000. Timing-driven placement for FPGAs. In Proceedings of the ACM/SIGDA 8th International Symposium on Field Programmable Gate Arrays (FPGA’00). ACM, New York, NY, 203--213. DOI:http://doi.acm.org/10.1145/329166.329208 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Sang-Kyu Lee and Hyeong-Ah Choi. 1996. Embedding of complete binary trees into meshes with row-column routing. IEEE Transactions on Parallel and Distributed Systems 7, 5, 493--497. DOI:http://dx.doi.org/10.1109/71.503774 Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Bailey Miller, Frank Vahid, and Tony Givargis. 2012. Digital mockups for the testing of a medical ventilator. In Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium (IHI’12). ACM, New York, NY, 859--862. DOI:http://doi.acm.org/10.1145/2110363.2110473 Google ScholarGoogle Scholar
  18. Bailey Miller, Frank Vahid, and Tony Givargis. 2013. Embedding-based placement of processing element networks on FPGAs for physical model simulation. In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA’13). ACM, New York, NY, 181--190. Google ScholarGoogle Scholar
  19. Erdem Motuk, Roger Woods, and Stefan Bilbao. 2005. Implementation of finite difference schemes for the wave equation on FPGA. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’05). IEEE, Los Alamitos, CA, 237--240. DOI:http://dx.doi.org/10.1109/ICASSP.2005.1415690Google ScholarGoogle ScholarCross RefCross Ref
  20. Jayram Moorkanikara Nageswaran, Nikil Dutt, Jeffrey L. Krichmar, Alex Nicolau, and Alexander V. Veidenbaum. 2009. A configurable simulation environment for the efficient simulation of large-scale spiking neural networks on graphics processors. Neural Networks 22, 5, 791--800. DOI:http://dx.doi.org/10.1016/j.neunet.2009.06.028 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Yaghout Nourani and Bjarne Andresen. 1998. A comparison of simulated annealing cooling strategies. Journal of Physics A: Mathematical and General 31, 41, 8373--8385.Google ScholarGoogle ScholarCross RefCross Ref
  22. Julio C. G. De Pimentel, and Y. G. Tirat-Gefen. 2006. Hardware acceleration for real time simulation of physiological systems. In Proceedings of the 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS’06). IEEE, Los Alamitos, CA, 218--223. DOI:10.1109/IEMBS.2006.260298Google ScholarGoogle ScholarCross RefCross Ref
  23. Satnam Singh. 2011. The RLOC is dead—long live the RLOC. In Proceedings of the 19th ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA’11). ACM, New York, NY, 185--188. Google ScholarGoogle Scholar
  24. Kozo Sugiyama, Shojiro Tagawa, and Mitsuhiko Toda. 1981. Methods for visual understanding of hierarchical system structures. IEEE Systems, Man, and Cybernetics 11, 2, 109--125. DOI:10.1109/TSMC.1981.4308636Google ScholarGoogle ScholarCross RefCross Ref
  25. Ilias Tagkopoulos, Charles Zukowski, German Cavelier, and Dimitris Anastassiou. 2003. A custom FPGA for the simulation of gene regulatory networks. In Proceedings of the 13th ACM Great Lakes Symposium on VLSI (GLSVLSI’03). ACM, New York, NY, 132--135. DOI:http://doi.acm.org/10.1145/764808.764843 Google ScholarGoogle Scholar
  26. David Terman, Sungwoo Ahn, Xueying Wang, and Winfried Just. 2008. Reducing neuronal networks to discrete dynamics. Physica D: Nonlinear Phenomena 237, 3, 324--338.Google ScholarGoogle ScholarCross RefCross Ref
  27. Jeffrey D. Ullman. 1984. Computational Aspects of VLSI. W. H. Freeman & Co., New York, NY. Google ScholarGoogle Scholar
  28. Stuart Ullman and Bhagirath Narahari. 1990. Mapping binary precedence trees to hypercubes and meshes. In Proceedings of the 2nd IEEE Symposium on Parallel and Distributed Processing. 838--841. DOI:10.1109/SPDP.1990.143655 Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Alan S. Wagner. 1990. Embedding All Binary Trees in the Hypercube. Technical Report. University of British Columbia, Vancouver, BC, Canada. Google ScholarGoogle Scholar
  30. Ewald Weibel. 1963. Morphometry of the Human Lung. Springer-Verlag, Berlin, Germany.Google ScholarGoogle Scholar
  31. Xilinx Inc. 2010. Virtex-6 FPGA Routing Optimization Design Techniques. Retrieved Sept 1, 2013, from http://www.xilinx.com/support/documentation/white_papers/wp381_V6_Routing_Optimization.pdf.Google ScholarGoogle Scholar
  32. Henggui Zhang, Arun V. Holden, and Mark R. Boyett. 2001. Gradient model versus mosaic model of the sinoatrial node. Circulation 103, 4, 584--588.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Graph-Based Approaches to Placement of Processing Element Networks on FPGAs for Physical Model Simulation

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Reconfigurable Technology and Systems
          ACM Transactions on Reconfigurable Technology and Systems  Volume 7, Issue 4
          January 2015
          213 pages
          ISSN:1936-7406
          EISSN:1936-7414
          DOI:10.1145/2699137
          • Editor:
          • Steve Wilton
          Issue’s Table of Contents

          Copyright © 2014 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 15 December 2014
          • Accepted: 1 February 2014
          • Revised: 1 January 2014
          • Received: 1 September 2013
          Published in trets Volume 7, Issue 4

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader