Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks

Published in International Parallel and Distributed Processing Symposium (IPDPS), 2023

This work proposes fast sparse GPU kernels tailored for training graph neural networks (GNNs). By carefully optimizing memory access patterns, load balancing, and sparse computation primitives, our kernels significantly reduce training time for a variety of GNN architectures and large-scale graph datasets.

Recommended citation: **Ruibo Fan**, Wei Wang, and Xiaowen Chu, "Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks," in *Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS)*, 2023.
Download Paper | Code | Download Bibtex

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)