This project introduces an innovative approach to accelerate butterfly counting algorithms in bipartite graphs. By optimizing matrix-vector multiplication in Compressed Sparse Row (CSR) format and employing various parallelism techniques, our method significantly improves computational efficiency.
Before you begin, ensure you have met the following requirements:
- You should have access to the datasets located in the
/data/
directory.
We convert out.dbpedia-<dataset>
into CSR format using graph_reader_csr.c
. There is a MAX_EDGES variable need to be set before adding new dataset. The default value is 251000 and should be valid for exisiting four datasets.
To run our kernel sequentially, follow these instructions:
- Uncomment the block corresponding to the target dataset in
kernel.c
. - Update the path string to your dataset.
- Run the following commands:
make clean && make kernel && make run
To run our parallelized kernel, follow these instructions:
- Uncomment the block corresponding to the target dataset in
kernel_parallel.c
. - Update the path string to your dataset.
- Run the following commands:
make clean && make kernel_parallel && make run_parallel