Next-Generation AI Hardware
LLM Pretrain on Data-Flow Architectures
TBA
Light-Weight MPI for Distributed Machine Learning
Compressor-Assisted NCCL for LLM
TBA
Federated Learning
Lossy Compression-based Privacy Protection
TBA
Communication Optimizations for FL
TBA