Publications

2025


[PPoPP] Baixi Sun, Weijin Liu, J. Gregory Pauloski, Jiannan Tian, Jinda Jia, Daoce Wang, Mingkai Zheng, Sheng Di, Sian Jin, Zhao Zhang, Xiaodong Yu, Guangming Tan, and Dingwen Tao, “COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Opti- mizers.” In Proceedings of the 30th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages , Las Vegas, NV, USA, 2025. (acceptance rate=38/189=20.1%)


2024


[SC] Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Zizhe Jian, Xin Liang, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo, and Rajeev Thakur, “hZCC: Accelerating Collective Communication with Co-designed Operation-supported Compression.” In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pages, Atlanta, GA, USA, 2024. (acceptance rate=102/450=22.7%)

[USENIX ATC] Zhen Xie, Murali Emani, Xiaodong Yu, Dingwen Tao, Xin He, Pengfei Su, Keren Zhou, and Venkatram Vishwanath, “Centimani: Enabling Fast AI Accelerator Selection for DNN Training with a Novel Performance Predictor.” In Proceedings of the 2024 USENIX Annual Technical Conference (USENIX ATC 24), pages, Santa Clara, CA, July 10-12, 2024. (acceptance rate=77/488=15.8%)

[ICS] Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Yafan Huang, Ken Raf- fenetti, Hui Zhou, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo, and Rajeev Thakur, “gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters.” In Proceedings of the 38th International Conference on Supercomputing (ICS), pages, Kyoto, Japan, June 21-23, 2024. (acceptance rate=%)

[HPDC] Milan Shah, Xiaodong Yu, Sheng Di, Michela Becchi, and Franck Cappello, “A Portable, Fast, DCT-based Compressor for AI Accelerators.” In Proceedings of the 33rd International Symposium on High- Performance Parallel and Distributed Computing (HPDC)pages, Pisa, Italy, June 4-7, 2024. (acceptance rate=26/152=17%)

[HPDC] Shihui Song, Yafan Huang, Peng Jiang, Xiaodong Yu, Weijian Zheng, Sheng Di, Qin- glei Cao, Yunhe Feng, Zhen Xie, and Franck Cappello, “CereSZ: Enabling and Scaling Error-bounded Lossy Compression on Cerebras CS-2.” In Proceedings of the 33rd International Symposium on High- Performance Parallel and Distributed Computing (HPDC)pages, Pisa, Italy, June 3-7, 2024. (acceptance rate=26/152=17%)

[ICDE] Lyuheng Yuan, Akhlaque Ahmad, Da Yan, Jiao Han, Saugat Adhikari, Xiaodong Yu, and Yang Zhou, ““G2-AIMD: A Memory-Efficient Subgraph-Centric Framework for Efficient Subgraph Search on GPUs.” In IEEE 40th International Conference on Data Engineering (ICDE), Utrecht, Netherlands, 2024, pp.. (acceptance rate=%)

[IPDPS] Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu, Xiaoyi Lu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, and Rajeev Thakur,““An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compres- sion.”.” In Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS), San Francisco, CA, USA, 2024, pp.. (acceptance rate=%)


2023


[SC] Yafan Huang, Sheng Di, Xiaodong Yu, Guanpeng Li, and Franck Cappello, “cuSZp: An Ultra-fast GPU Error-bounded Lossy Compression Framework with Optimized End-to-End Perfor- mance.” In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pages 1-13, Denver, CO, USA, 2023. (acceptance rate=24%)

[ICS] Milan Shah, Xiaodong Yu, Sheng Di, Michela Becchi, and Franck Cappello, "Lightweight Huffman Coding for Efficient GPU Compression." In Proceedings of the 37th International Conference on Supercomputing (ICS), pages 99-110, Orlando, FL, June 21-23, 2023. (acceptance rate=29.4%)

[ICS] Chengming Zhang, Shaden Smith, Baixi Sun, Jiannan Tian, Jonathan Soifer, Xiaodong Yu, Shuaiwen Leon Song, Yuxiong He, and Dingwen Tao, "HEAT: A Highly Efficient and Affordable Training System for Collaborative Filtering Based Recommendation on CPUs." In Proceedings of the 37th International Conference on Supercomputing (ICS), pages 324–335, Orlando, FL, June 21-23, 2023. (acceptance rate=29.4%)

[ICS] Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Martin Swany, Dingwen Tao, and Franck Cappello, “GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs.” In Proceedings of the 37th International Conference on Supercomputing (ICS), pages 348-359, Orlando, FL, June 21-23, 2023. (acceptance rate=29.4%)

[HPDC] Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Ding- wen Tao, and Franck Cappello, “FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs.” In Proceedings of the 32nd International Symposium on High- Performance Parallel and Distributed Computing (HPDC), pages 129-142, Orlando, FL, June 21-23, 2023. (acceptance rate=21%)

[IPDPS] Milan Shah, Xiaodong Yu, Sheng Di, Danylo Lykov, Yuri Alexeev, Michela Becchi, and Franck Cappello, “GPU-Accelerated Error-Bounded Compression Framework for Quantum Circuit Simulations.” In Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS), St. Petersburg, FL, USA, 2023, pp. 757-767. (acceptance rate=25.7%)


2022


[HPDC] Xiaodong Yu, Sheng Di, Kai Zhao, Dingwen Tao, Xin Liang, Franck Cappello, “Ultra-fast Error-bounded Lossy Compression for Scientific Dataset.” In Proceedings of the 31th International Symposium on High-Performance Parallel and Distributed Computing (HPDC), ACM, 2022 (AR:21/108=19.4%)

[IPDPS] Cody Rivera, Sheng Di, Jiannan Tian, Xiaodong Yu, Dingwen Tao, and Franck Cappello, “Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs.” In 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2022 (AR:132/474=27.8%)


2021


[SSSDU] Guo, Yanfei, Ken Raffenetti, Hui Zhou, Travis Koehring, Sudheer Chunduri, Xiaodong Yu, and Rajeev Thakur, "“Automated Validation and Verification for Scientific Software.” In Proceedings of the 2022 Workshop on the Science of Scientific-Software Development and Use (SMC), virtual, December 13-15, 2021. (position paper)

[SMC] Tekin Bicer, Xiaodong Yu, Daniel J. Ching, Ryan Chard, Mathew J. Cherukara, Bogdan Nicolae, Rajkumar Kettimuthu, Ian T. Foster, "High-Performance Ptychographic Reconstruction with Federated Facilities,” In Smoky Mountains Computational Sciences and Engineering Conference (SMC), Springer, Cham, 2021

[CLUSTER] Xiaodong Yu, Sheng Di, Ali Murat Gok, Dingwen Tao, Franck Cappello, “cuZ-Checker: A GPU-Based Ultra-Fast Assessment System for Lossy Compressions.” In 2021 IEEE International Conference on Cluster Computing (CLUSTER), pp. 307-319. IEEE, 2021 (AR:48/163=29.4%)

[CLUSTER] Jiannan Tian, Sheng Di, Xiaodong Yu, Kai Zhao, Sian Jin, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello, “Optimizing Error-Bounded Lossy Compression for Scientific Data on GPUs.” In 2021 IEEE International Conference on Cluster Computing (CLUSTER), pp. 283-293. IEEE, 2021 (AR:48/163=29.4%)

[ICS] Xiaodong Yu, Tekin Bicer, Rajkumar Kettimuthu, Ian T. Foster, “Topology-aware Optimizations for Multi-GPU Ptychographic Image Reconstruction,” In 2021 ACM International Conference on Supercomputing (ICS), June 14 - 17, 2021. Worldwide online event (AR:38/157=24.2%)


2020


[IPDPS] Xiaodong Yu, Fengguo Wei, Xinming Ou, Michela Becchi, Tekin Bicer, Danfeng Yao, “GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting,” In 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 18 pp. 274-284 (AR:110/446=24.7%) [PDF]


Before 2019


[CSET] Xiaodong Yu, Ya Xiao, Kirk Cameron, and Danfeng Yao, “Comparative Measurement of Cache Configurations’ Impacts on Cache Timing Side-Channel Attacks,” In 12th USENIX Workshop on Cyber Security Experimentation and Test (CSET) (AR:19/61=31.1%) co-located with USENIX Security '19 [PDF]

[ACMSE] Thomas C. H. Lux, Layne T. Watson, Tyler H. Chang, Jon Bernard, Bo Li, Xiaodong Yu, Li Xu, Godmar Back, Ali R. Butt, Kirk W. Cameron, Yili Hong, Danfeng Yao, “Novel meshes for multivariate interpolation and approximation,” In Proceedings of the ACMSE 2018 Conference (ACMSE '18), Richmond, KY, USA [PDF]

[SoutheastCon] Thomas C. H. Lux, Layne T. Watson, Tyler H. Chang, Jon Bernard, Bo Li, Xiaodong Yu, Li Xu, Godmar Back, Ali R. Butt, Kirk W. Cameron, Yili Hong, Danfeng Yao, “Nonparametric Distribution Models for Predicting and Managing Computational Performance Variability” In the IEEE SoutheastCon 2018 (IEEE SoutheastCon '18), St. Petersburg, FL, USA [PDF]

[BigData] Xiaodong Yu, Kaixi Hou, Hao Wang, Wu-chun Feng, “Robotomata: A Framework for Approximate Pattern Matching of Big Data on an Automata Processor,” In Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData '17), Boston, MA, USA (AR:79/437=18.1%) [PDF]

[ICS] Marziyeh Nourian, Xiang Wang, Xiaodong Yu, Wu-chun Feng, and Michela Becchi, “Demystifying Automata Processing: GPUs, FPGAs or Micron’s AP?” In Proceedings of the International Conference on Supercomputing (ICS '17). Chicago, Illinois, USA (AR:28/177=15.8%) [PDF]

[CF] Xiaodong Yu, Hao Wang, Wu-chun Feng, Hao Gong, and Guohua Cao, “An Enhanced Image Reconstruction Tool for Computed Tomography on GPUs,” In Proceedings of the ACM International Conference on Computing Frontiers (CF '17), Siena, Italy (Full paper AR:27/76=35.5%)

[CCGrid] Xiaodong Yu, Hao Wang, Wu-chun Feng, Hao Gong, and Guohua Cao, “cuART: Fine-Grained Algebraic Reconstruction Technique for Computed Tomography Images on GPUs,” 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid '16). Cartagena, Colombia (Short paper AR:25%)

[ANCS] Xiaodong Yu, Wu-chun Feng, Danfeng Yao, and Michela Becchi, “O3FA: a Finite Automata-based Pattern Matching Engine for Out-of-Order Packets,” In Proceedings of the 2016 Symposium on Architectures for Networking and Communications Systems (ANCS '16). Santa Clara, CA, USA (AR:12/58=20.7%)

[CF] Xiaodong Yu and Michela Becchi, “GPU Acceleration of Regular Expression Matching for Large Datasets: Exploring the Implementation Space,” In Proceedings of the 10th ACM International Conference on Computing Frontiers (CF '13), Ischia, Italy, May 2013



Contacts

Office: Gateway Center N411

Email: apecs.lab@gmail.com
        xyu38@stevens.edu

Phone: (201) 216-5649

Address

1 Castle Point Terrace,
Stevens Institute of Technology,
Hoboken, NJ
07030

© Copyright 2025 by Xiaodong Yu