About me
Hi, my name is Jiajun Huang (黄嘉俊). I am an Assistant Professor in the Bellini College of Artificial Intelligence, Cybersecurity and Computing at the University of South Florida (USF) since Fall 2025, and am currently hiring multiple Ph.D. students. In the High-Performance Computing (HPC) area of CSRankings, USF ranks 25th in the U.S. and 40th in the world for 2021–2026. I lead the High Performance & Intelligence Lab (Pi Lab or π Lab), where we focus on building high-performance systems for large-scale artificial intelligence and scientific applications. I received my Ph.D. in Computer Science from the University of California, Riverside, advised by Dr. Zizhong Chen. I previously worked with the MPICH and SZ teams at Argonne National Laboratory, and I continue to collaborate closely with Dr. Sheng Di, Dr. Yanfei Guo, Dr. Rajeev Thakur, and Dr. Franck Cappello.
I am the founder of ZCCL.org, an initiative developed in collaboration with scientists at Argonne National Laboratory. ZCCL.org is dedicated to advancing both compression and communication and has introduced the first compression-accelerated collective communications library–ZCCL, enabling direct communication and computation with compressed data. You can explore my publications here. Feel free to reach out to me at jiajunhuang(at)usf.edu.
Job Opportunities
I am actively recruiting 2–3 highly motivated Ph.D. students with full scholarships to join my research group. Areas of focus include high-performance computing and communication, high-performance deep learning, parallel & distributed computing, and big data management & analytics.
Click here for more details before reaching out via email.
News
- [5/2026] Our ICS '26 paper, OCTANE: Breaking the Neighbor-List Bottleneck in GPU Molecular Dynamics, has received a Best Paper Nomination!
- [4/2026] We have two papers accepted to ICS '26 and one paper accepted to HPDC '26. Congratulations to all my collaborators and students on this exciting achievement!
- [12/2025] Congratulations to Lingqi on his paper, "FRUGAL: Pushing GPU Applications Beyond Memory Limits", being officially accepted to the IEEE/ACM International Symposium on Code Generation and Optimization (CGO '26)!
- [6/2025] MPICH has been awarded the 2024 ACM Software System Award! Huge congratulations to everyone!
- [6/2025] Presented our paper "ghZCCL: Advancing GPU-aware Collective Communications with Homomorphic Compression" at ICS '25 in Salt Lake City, Utah.
- [4/2025] Our HPDC '25 paper, IPComp: Interpolation-Based Progressive Lossy Compression for Scientific Applications, has been nominated as one of the three Best Paper candidates!
- [4/2025] We have two papers accepted to ICS '25 and one paper accepted to HPDC '25. Congratulations to all my collaborators and students on this exciting achievement!
- [4/2025] Our paper, GlobaZip: An Interactive, Efficient Distributed Compression-as-a-Service Platform with Optimized Data Compression Techniques, has been accepted by IEEE Transactions on Parallel and Distributed Systems (TPDS).
- [3/2025] Benjamin De Jong has been selected as an Elmhurst Jans Fellow and will be joining our group as a summer intern at Argonne National Laboratory. Congratulations, Benjamin!
- [2/2025] I have won the Dissertation Completion Fellowship Award from the University of California, Riverside. This recognition identifies me as one of the top Ph.D. graduates in my program and university.
- [11/2024] Presented our paper "hZCCL: Accelerating Collective Communication with Co-Designed Homomorphic Compression" at SC '24 in Atlanta, Georgia.
- [9/2024] Presented our paper "FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance" at CLUSTER '24 in Kobe, Japan.
- [6/2024] Presented our paper "gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters" at ICS '24 in Kyoto, Japan.
- [5/2024] Presented our paper "An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression" at IPDPS '24 in San Francisco, California.
- [1/2024] Our paper "An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression" has been accepted by IPDPS '24. Looking foward to see you in San Francisco, California.
- [11/2023] Won the First Place Award at ACM Student Research Competition (Graduate) in SC '23!
- [11/2023] Our travel grant for participating in the ACM SRC at SC '23 has been granted!
- [11/2023] Will present a poster at SC '23 in Denver, CO. Looking foward to see you at SC '23.
- [10/2023] Our NSF Travel Grant application for the IEEE CLUSTER '23 conference has been ACCEPTED!
- [10/2023] Will present a paper at CLUSTER '23 in Santa Fe, New Mexico. Looking foward to see you at CLUSTER '23.
- [6/2023] Presented a poster at HPDC '23 in Orlando, Florida. Happy to meet you at FCRC '23.
- [12/2022] Attended SC '22 in Dallas, Texas as a student volunteer. Very happy to be part of the HPC community.
- [1/2022] Started my life as a PhD student in University of California, Riverside.
Research Interests
- High-Performance Computing and Communication
- Machine Learning Systems
- High-Performance Deep Learning
- Agentic AI for Kernel Optimization and Porting
- Distributed and Parallel Computing/Systems
- Big Data Management and Reduction
Educational Background
-
University of California, Riverside, 2022-2025
Ph.D. in Computer Science
Dean’s Distinguished Fellowship -
University of Glasgow, UK, 2017-2021
BEng (Hons) Electronics and Electrical Engineering with Information Engineering
Graduated with Honors of the First Class -
UESTC (University of Electronic Science and Technology of China), China, 2017-2021
Bachelor of Engineering in Electronic Information Engineering
Research Experience
-
Assistant Professor, Bellini College of Artificial Intelligence, Cybersecurity and Computing, University of South Florida, 2025-Now
-
Visiting Student - Graduate, MPICH team, Argonne National Laboratory, 2022-2025
Mentors: Dr. Rajeev Thakur, Dr. Yanfei Guo -
Visiting Student - Graduate, SZ team, Argonne National Laboratory, 2022-2025
Mentors: Dr. Franck Cappello, Dr. Sheng Di -
Graduate Student Researcher, SuperLab, University of California, Riverside, 2022-2025
Advisor: Dr. Zizhong Chen
Selected Publications
For the full list, please see my Google Scholar profile.
- [ICS '26] Hanieh Toutouni, Suman Chakraborty, Yicheng Tu, Jiajun Huang. "OCTANE: Breaking the Neighbor-List Bottleneck in GPU Molecular Dynamics." Proceedings of the 40th ACM International Conference on Supercomputing, 2026. (Best Paper Nomination)
- [ICS '26] Ruoyu Li, Yafan Huang, Longtao Zhang, Zhuoxun Yang, Sheng Di, Boyuan Zhang, Jiajun Huang, Jinyang Liu, Jiannan Tian, Guanpeng Li, Fengguang Song, Hanqi Guo, Franck Cappello, Kai Zhao. "GPZ: GPU-Accelerated Lossy Compressor for Particle Data." Proceedings of the 40th ACM International Conference on Supercomputing, 2026.
- [HPDC '26] Longtao Zhang, Ruoyu Li, Zhuoxun Yang, Robert Underwood, Sheng Di, Daoce Wang, Jinyang Liu, Jiajun Huang, Franck Cappello, Kai Zhao. "OPAL: On-demand Progressive Accelerated Scientific Lossy Compression." Proceedings of the 35th International Symposium on High-Performance Parallel and Distributed Computing, 2026.
- [CGO '26] Lingqi Zhang*, Tengfei Wang, Jiajun Huang*, Chen Zhuang, Ivan R. Ivanov, Peng Chen, Toshio Endo, Mohamed Wahib*. "FRUGAL: Pushing GPU Applications beyond Memory Limits." Proceedings of the 24th ACM/IEEE International Symposium on Code Generation and Optimization, 2026. (*: Corresponding authors)
- [SC '25] Huangliang Dai, Shixun Wu, Jiajun Huang, Zizhe Jian, Yue Zhu, Haiyang Hu, Zizhong Chen. "FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention." Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2025.
- [SC '25] Shixun Wu, Jinwen Pan, Jinyang Liu, Jiannan Tian, Ziwei Qiu, Jiajun Huang, Kai Zhao, Xin Liang, Sheng Di, Zizhong Chen, Franck Cappello. "Boosting Scientific Error-Bounded Lossy Compression through Optimized Synergistic Lossy-Lossless Orchestration." Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2025.
- [SIGMOD '25] Longtao Zhang, Ruoyu Li, Congrong Ren, Sheng Di, Jinyang Liu, Jiajun Huang, Robert Underwood, Pascal Grosset, Dingwen Tao, Xin Liang, Hanqi Guo, Franck Cappello, Kai Zhao. "LCP: Enhancing Scientific Data Management with Lossy Compression for Particles." Proceedings of the ACM SIGMOD International Conference on Management of Data, 2025.
- [HPDC '25] Zhuoxun Yang, Sheng Di, Longtao Zhang, Ruoyu Li, Ximiao Li, Jiajun Huang, Jinyang Liu, Franck Cappello, Kai Zhao. "IPComp: Interpolation-Based Progressive Lossy Compression for Scientific Applications." Proceedings of the 34th International Symposium on High-Performance Parallel and Distributed Computing, 2025. (Best Paper Finalist)
- [ICS '25] Jiajun Huang, Sheng Di, Yafan Huang, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur. "ghZCCL: Advancing GPU-aware Collective Communications with Homomorphic Compression." Proceedings of the 39th ACM International Conference on Supercomputing, 2025.
- [ICS '25] Chen Zhuang, Lingqi Zhang, Du Wu, Peng Chen, Jiajun Huang, Xin Liu, Rio Yokota, Nikoli Dryden, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib. "Scaling Large-scale GNN Training to Thousands of Processors on CPU-based Supercomputers." Proceedings of the 39th ACM International Conference on Supercomputing, 2025.
- [PPoPP '25] Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Huangliang Dai, Sheng Di, Franck Cappello, Zizhong Chen. "TurboFFT: Co-Designed High-Performance and Fault-Tolerant Fast Fourier Transform on GPUs." Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025.
- [GPGPU '25] Lingqi Zhang, Jiajun Huang, Sheng Di, Satoshi Matsuoka, Mohamed Wahib. "Can Tensor Cores Benefit Memory-Bound Kernels?" 17th Workshop on General Purpose Processing Using GPU, in conjunction with the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025.
- [CSUR] Sheng Di, Jinyang Liu, Kai Zhao, Xin Liang, Robert Underwood, Zhaorui Zhang, Milan Shah, Yafan Huang, Jiajun Huang, Xiaodong Yu, Congrong Ren, Hanqi Guo, Grant Wilkins, Dingwen Tao, Jianan Tian, Sian Jin, Zizhe Jian, Daoce Wang, Md Hasanur Rahman, Boyuan Zhang, Shihui Song, Jon C. Calhoun, Guanpeng Li, Kazutomo Yoshii, Khalid Ayed Alharthi, Franck Cappello. "A Survey on Error-Bounded Lossy Compression for Scientific Datasets." ACM Computing Surveys, 2025.
- [TPDS] Yuanjian Liu, Sheng Di, Jiajun Huang, Zhaorui Zhang, Kyle Chard, Ian Foster. "Ocelot: An Interactive, Efficient Distributed Compression-As-a-Service Platform With Optimized Data Compression Techniques." IEEE Transactions on Parallel and Distributed Systems, 2025.
- [SC '24] Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Zizhe Jian, Xin Liang, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur. "hZCCL: Accelerating Collective Communication with Co-designed Homomorphic Compression." Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2024.
- [SC '24] Jinyang Liu*, Jiannan Tian*, Shixun Wu*, Sheng Di, Boyuan Zhang, Robert Underwood, Yafan Huang, Jiajun Huang, Kai Zhao, Guanpeng Li, Dingwen Tao, Zizhong Chen, Franck Cappello. "cuSZ-i: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation." Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2024. (*: Co-first authors)
- [Cluster '24] Shixun Wu, Yitong Ding, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Huangliang Dai, Sheng Di, Bryan Wong, Zizhong Chen, Franck Cappello. "FT K-means: A High-Performance K-means on GPU with Fault Tolerance." Proceedings of the 2024 IEEE International Conference on Cluster Computing, 2024.
- [SIGMOD '24] Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Sian Jin, Zizhe Jian, Jiajun Huang, Shixun Wu, Zizhong Chen, Franck Cappello. "High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation." Proceedings of the ACM SIGMOD International Conference on Management of Data, 2024.
- [ICS '24] Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Yafan Huang, Ken Raffenetti, Hui Zhou, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur. "gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters." Proceedings of the 38th ACM International Conference on Supercomputing, 2024.
- [IPDPS '24] Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu, Xiaoyi Lu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur. "An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression." Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium, 2024.
- [IPDPS '24] Zizhe Jian, Sheng Di, Jinyang Liu, Kai Zhao, Xin Liang, Haiying Xu, Robert Underwood, Shixun Wu, Jiajun Huang, Zizhong Chen, Franck Cappello. "CliZ: Optimizing Lossy Compression for Climate Datasets with Adaptive Fine-tuned Data Prediction." Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium, 2024.
- [DRBSD-10] Tripti Agarwal, Sheng Di, Jiajun Huang, Yafan Huang, Ganesh Gopalakrishnan, Robert Underwood, Kai Zhao, Xin Liang, Guanpeng Li, Franck Cappello. "SZOps: Scalar Operations for Error-bounded Lossy Compressor for Scientific Data." 10th International Workshop on Data Analysis and Reduction for Big Scientific Data, in conjunction with SC '24, 2024.
- [Cluster '23] Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur. "PiP-MColl: Process-in-Process-based Multi-object MPI Collectives." Proceedings of the 2023 IEEE International Conference on Cluster Computing, 2023.
- [ICS '23] Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan Wong, Zizhong Chen. "Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs." Proceedings of the 37th International Conference on Supercomputing, 2023.
- [IWBDR-4] Jiajun Huang, Jinyang Liu, Sheng Di, Yujia Zhai, Zizhe Jian, Shixun Wu, Kai Zhao, Zizhong Chen, Yanfei Guo, Franck Cappello. "Exploring Wavelet Transform Usages for Error-bounded Scientific Data Compression." 4th International Workshop on Big Data Reduction, in conjunction with the 2023 IEEE International Conference on Big Data, 2023.
- [TPDS] Yujia Zhai, Elisabeth Giem, Kai Zhao, Jinyang Liu, Jiajun Huang, Bryan Wong, Christian R. Shelton, Zizhong Chen. "FT-BLAS: A Fault Tolerant High Performance BLAS Implementation on x86 CPUs." IEEE Transactions on Parallel and Distributed Systems, 2023.
Selected Awards
- 2026: Best Paper Nomination, ICS 2026
- 2025: Best Paper Finalist, HPDC 2025
- 2025: Dissertation Completion Fellowship Award, UC Riverside
- 2023: First Place Award, ACM Student Research Competition, SC 2023
- 2023: ACM SRC Travel Grant for SC 2023
- 2023: NSF Student Travel grant for IEEE Cluster 2023
- 2022: Dean's Distinguished Fellowship, UC Riverside
- 2021: Excellent bachelor’s degree thesis (3%)
- 2018, 2019, 2020: UESTC Outstanding Student Scholarship, (5%) thrice
- 2019: James Watt Innovative Talent Scholarship, (3%)
- 2019: 1st Class Academic Scholarship, (3%)
- Outstanding Individual of UESTC Summer Camp, twice
- Glasgow Excellent Volunteer Certificate
- Outstanding Individual of Summer Social Practice
Students
- Yuhao Guo – M.Eng., UIUC; B.S., Sichuan University
- Jiefeng Zhou – B.S., Yingcai Honors College, UESTC
- Benjamin De Jong - Elmhurst Jans Fellow & Summer Intern from Argonne National Laboratory
Talks & Presentations
- [3/2025] Advancing Exascale Collective Communications with Co-Designed Compression, Invited talk, FZ+ZF Workshop, Sarasota, Florida, USA
- [2/2025] Advancing Exascale Collective Communications with Co-Designed Compression, Research Seminar, University of South Florida, Tampa, Florida, USA
- [1/2025] Advancing Exascale Collective Communications with Co-Designed Compression, Research Seminar, The University of Alabama, Tuscaloosa, Alabama, USA
- [9/2024] Accelerating Collective Communication with Error-Bounded Lossy Compression, Research Seminar, Tokyo Institute of Technology, Tokyo, Japan
- [9/2024] FT K-means: A High-Performance K-means on GPU with Fault Tolerance, Papaer presentation, Cluster '24, Kobe, Japan
- [9/2024] Codesigning Compression with Communication, Invited talk, FZ Workshop, The Ohio State University, Columbus, Ohio, USA
- [6/2024] Accelerating Collective Communication with Error-Bounded Lossy Compression, Research Seminar, RIKEN Center for Computational Science (R-CCS), Tokyo, Japan
- [6/2024] Accelerating Collective Communication with Error-Bounded Lossy Compression, Invited talk, CASS Community MPICH BoF, Online
- [6/2024] gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters, Paper presentation, ICS '24, Kyoto, Japan
- [5/2024] An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression, Paper presentation, IPDPS '24, San Francisco, California, USA
- [11/2023] Accelerating Collective Communications with Lossy Compression on GPU, Poster presentation, SC '23, Denver, Colorado, USA
- [10/2023] PiP-MColl: Process-in-Process-based Multi-object MPI Collectives, Paper presentation, CLUSTER '23, Santa Fe, New Mexico, USA
- [6/2023] Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques, Poster presentation, HPDC '23, Orlando, Florida, USA
Academic Services
- Program Committee: SC ‘26 (Programming Frameworks area), ICPP ‘26 (Software track), BDXCS ‘26, DRBSD ‘25
- Session Chair: CLUSTER ‘24
- Reviewers & Subreviewers: ACM Computing Surveys, IEEE Transactions on Computers, IEEE Transactions on Parallel and Distributed Systems, Parallel Computing, Geoscientific Model Development, IPDPS ‘24, CCGRID ‘24 ‘25, ICDCS ‘25, Cluster ‘25, GPGPU ‘25, ICS ‘26, Cloud Summit ‘26
- Student Volunteers: SC ‘21, SC ‘22
Personal Interests
- Music: 🎷 Saxophone, Cucurbit flute (Hulusi), Bamboo flute (Dizi), Vertical bamboo flute (Xiao)
- Sports: 🏓 Table tennis