Published in International Parallel and Distributed Processing Symposium (IPDPS), 2023
Ruibo Fan, Wei Wang, and Xiaowen Chu, “Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks,” in Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2023.
Published in Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024
Ruibo Fan, Wei Wang, and Xiaowen Chu, “DTC-SpMM: Bridging the Gap in Accelerating General Sparse Matrix Multiplication with Tensor Cores,” in Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024.
Published in International Parallel and Distributed Processing Symposium (IPDPS), 2024
Weile Luo, Ruibo Fan, Zeyu Li, et al., “Benchmarking and Dissecting the Nvidia Hopper GPU Architecture,” in Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024.
Published in Proceedings of the Twentieth European Conference on Computer Systems (EuroSys), 2025
Ruibo Fan, et al., “SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs,” in Proceedings of the 20th European Conference on Computer Systems (EuroSys), 2025.
Published in The Thirteenth International Conference on Learning Representations (ICLR), 2025
Peng Dong, Lin Li, Yuke Zhong, Dazhen Du, Ruibo Fan, Yuxin Chen, et al., “STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs,” in Proceedings of the 13th International Conference on Learning Representations (ICLR), 2025.
ACM Transactions on Computer Systems (TOCS), invited, under review
Ruibo Fan, et al., “Exploiting Low-Level Sparsity for Efficient Large Language Model Inference on GPUs with SpInfer,” ACM Transactions on Computer Systems (TOCS), invited, under review.
ACM Transactions on Computer Systems (TOCS), under review
Weile Luo, Ruibo Fan, Zeyu Li, et al., “Dissecting the NVIDIA Hopper Architecture through Micro-benchmarking and Multiple Level Analysis,” ACM Transactions on Computer Systems (TOCS), under review.
Published in Proceedings of the 31st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2026
Weile Luo, Yuxin Chen, Xiangrui Yu, Qiang Wang, Ruibo Fan, Haibo Liu, et al., “ROME: Maximizing GPU Efficiency for All-Pairs Shortest Path via Taming Fine-Grained Irregularities,” in Proceedings of the 31st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2026.
Published in Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2026
Ruibo Fan, Xiangrui Yu, Xinglin Pan, Zeyu Li, Weile Luo, Qiang Wang, Wei Wang, and Xiaowen Chu, “ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression,” in Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS ’26), Pittsburgh, PA, USA, March 2026.
Undergraduate course, Peking University, Parallel Computing II, 2021
Teaching Assistant for Parallel Computing II at Peking University (Spring 2021).
Undergraduate course, HKUST(GZ), Introduction to Computer Science, 2024
Teaching Assistant for Introduction to Computer Science at HKUST(GZ) (Fall 2024; Summer 2025).
Undergraduate course, HKUST(GZ), Mathematics for Data Science, 2025
Teaching Assistant for Mathematics for Data Science at HKUST(GZ) (Fall 2025).