Skip to content

XiongxiaoL/HitGraph

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HitGraph: High-throughput Graph Processing

Framework on FPGA HitGraph, an FPGA framework to accelerate graph processing based on the edge-centric paradigm. HitGraph takes in an edge-centric graph algorithm and hardware resource constraints, determines design parameters, and then generates a Register Transfer Level (RTL) FPGA design. This makes accelerator design for various graph analytics transparent and user-friendly by masking internal details of the accelerator design process. HitGraph enables increased data reuse and parallelism through novel algorithmic optimizations:
(1) an optimized data layout that reduces non-sequential external memory accesses
(2) an efficient update merging and filtering scheme to reduce the data communication between the FPGA and external memory
(3) a partition skipping scheme to reduce redundant edge traversals for non-stationary graph algorithms.

Based on our design methodology, we accelerate Sparse Matrix-Vector Multiplication (SpMV), PageRank (PR), Single Source Shortest Path (SSSP), and Weakly Connected Component (WCC). We use Intel Stratix 10 1SX280LH3F55I3XG to conduct our experiments.

Find the paper on https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8685122

We used "ForeGraph: Exploring Large-scale Graph Processing on Multi-FPGA Architecture (https://web.cs.ucla.edu/~chiyuze/pub/fpga17.pdf) and "GraphOps: A dataflow library for graph analytics acceleration" (https://dl.acm.org/doi/pdf/10.1145/2847263.2847337) for baseline comparisons

Hardware/Tools

  1. Targeted FPGA: Intel Stratix 10 1SX280LH3F55I3XG
  2. Tools: Intel Quartus 20.1

Definition of inputs and outputs

hit
The hitgraph core contains the implementations of graph algorithms discussed in the paper (shown in yellow). We have assumed that partial input and output data are stored in the internal memory before loaded into algorithm core (Refer to the above figure).

Directory Structure

Algorithm Processing Core:

include 4 algorithm processing cores for Sparse Matrix-Vector Multiplication (SpMV), PageRank (PR), Weakly Connected Component (WCC), Single Source Shortest Path (SSSP). Make each module as the top module while running each algorithm. Use these files for synthesis.

ip_template:

IP core templates generated by Intel Quartus 20.1. These can be used as reference while creating the ip-cores.

reference_module:

Contains all the individual modules.

test_tb:

complete_test:

contains complete testing flow for the configuration of 1 partition with 1 pipeline

UnitTests:

Contains unit test benches for the core modules.

Configuring IP Cores:

Floating point adder

Find: IP catalog => basic function => arithmetic => floating point function
Name: add
Other Info: choose Generate Enable and generate HDL

Floating point mult

Find: IP catalog => basic function => arithmetic => floating point function
Name: mult
Other Info: In Functionality choose Generate Enable and generate HDL

Template of these IP cores can be found in IP core template

Setting up the projects

  1. Create a new project using Intel Quartus 20.1 and use Intel Stratix 10 1SX280LH3F55I3XG as the targeted device
  2. Include project files in algorithm_processing_core folder
  3. Select the top module based on which algorithm to run on hardware
  4. Set up IP cores as mentioned in "IP cores Configurations"
  5. Run synthesis
  6. For simulating internal modules (Unit tests), use test benches in test_tb folder

About

Source code for "HitGraph: High-throughput Graph Processing"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Verilog 97.2%
  • Coq 2.8%