File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: GraVF-M: Graph Processing System Generation for Multi-FPGA Platforms

TitleGraVF-M: Graph Processing System Generation for Multi-FPGA Platforms
Authors
KeywordsFPGA
Graph processing
GraVF-M
Multi-FPGA architecture
Performance modelling
Vertex centric
Issue Date2019
PublisherThe Association for Computing Machinery. The Journal's web site is located at http://trets.cse.sc.edu/index.html
Citation
ACM Transactions on Reconfigurable Technology and Systems (TRETS), 2019, v. 12 n. 4, article no. 21 How to Cite?
AbstractDue to the irregular nature of connections in most graph datasets, partitioning graph analysis algorithms across multiple computational nodes that do not share a common memory inevitably leads to large amounts of interconnect traffic. Previous research has shown that FPGAs can outcompete software-based graph processing in shared memory contexts, but it remains an open question if this advantage can be maintained in distributed systems. In this work, we present GraVF-M, a framework designed to ease the implementation of FPGA-based graph processing accelerators for multi-FPGA platforms with distributed memory. Based on a lightweight description of the algorithm kernel, the framework automatically generates optimized RTL code for the whole multi-FPGA design. We exploit an aspect of the programming model to present a familiar message-passing paradigm to the user, while under the hood implementing a more efficient architecture that can reduce the necessary inter-FPGA network traffic by a factor equal to the average degree of the input graph. A performance model based on a theoretical analysis of the factors influencing performance serves to evaluate the efficiency of our implementation. With a throughput of up to 5.8GTEPS (billions of traversed edges per second) on a 4-FPGA system, the designs generated by GraVF-M compare favorably to state-of-the-art frameworks from the literature and reach 94% of the projected performance limit of the system.
Persistent Identifierhttp://hdl.handle.net/10722/288080
ISSN
2023 Impact Factor: 3.1
2023 SCImago Journal Rankings: 0.802
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorEngelhardt, N-
dc.contributor.authorSo, HKH-
dc.date.accessioned2020-10-05T12:07:35Z-
dc.date.available2020-10-05T12:07:35Z-
dc.date.issued2019-
dc.identifier.citationACM Transactions on Reconfigurable Technology and Systems (TRETS), 2019, v. 12 n. 4, article no. 21-
dc.identifier.issn1936-7406-
dc.identifier.urihttp://hdl.handle.net/10722/288080-
dc.description.abstractDue to the irregular nature of connections in most graph datasets, partitioning graph analysis algorithms across multiple computational nodes that do not share a common memory inevitably leads to large amounts of interconnect traffic. Previous research has shown that FPGAs can outcompete software-based graph processing in shared memory contexts, but it remains an open question if this advantage can be maintained in distributed systems. In this work, we present GraVF-M, a framework designed to ease the implementation of FPGA-based graph processing accelerators for multi-FPGA platforms with distributed memory. Based on a lightweight description of the algorithm kernel, the framework automatically generates optimized RTL code for the whole multi-FPGA design. We exploit an aspect of the programming model to present a familiar message-passing paradigm to the user, while under the hood implementing a more efficient architecture that can reduce the necessary inter-FPGA network traffic by a factor equal to the average degree of the input graph. A performance model based on a theoretical analysis of the factors influencing performance serves to evaluate the efficiency of our implementation. With a throughput of up to 5.8GTEPS (billions of traversed edges per second) on a 4-FPGA system, the designs generated by GraVF-M compare favorably to state-of-the-art frameworks from the literature and reach 94% of the projected performance limit of the system.-
dc.languageeng-
dc.publisherThe Association for Computing Machinery. The Journal's web site is located at http://trets.cse.sc.edu/index.html-
dc.relation.ispartofACM Transactions on Reconfigurable Technology and Systems (TRETS)-
dc.rightsACM Transactions on Reconfigurable Technology and Systems (TRETS). Copyright © The Association for Computing Machinery.-
dc.rights©ACM, 2019. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Transactions on Reconfigurable Technology and Systems (TRETS), 2019, v. 12 n. 4, article no. 21 (November 2019). http://doi.acm.org/10.1145/3357596-
dc.subjectFPGA-
dc.subjectGraph processing-
dc.subjectGraVF-M-
dc.subjectMulti-FPGA architecture-
dc.subjectPerformance modelling-
dc.subjectVertex centric-
dc.titleGraVF-M: Graph Processing System Generation for Multi-FPGA Platforms-
dc.typeArticle-
dc.identifier.emailSo, HKH: hso@eee.hku.hk-
dc.identifier.authoritySo, HKH=rp00169-
dc.description.naturepostprint-
dc.identifier.doi10.1145/3357596-
dc.identifier.scopuseid_2-s2.0-85077790628-
dc.identifier.hkuros315334-
dc.identifier.volume12-
dc.identifier.issue4-
dc.identifier.spagearticle no. 21-
dc.identifier.epagearticle no. 21-
dc.identifier.isiWOS:000535725900005-
dc.publisher.placeUnited States-
dc.identifier.issnl1936-7406-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats