Title | Author(s) | Issue Date
---|---|---
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training. Proceeding/Conference: the Sixth Conference on Machine Learning and Systems (MLSys) (04/06/2023-08/06/2023, Miami) | | 6-Jun-2023
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization. Proceeding/Conference: the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP’24) (02/03/2024-06/03/2024, Edinburgh) | | 2-Mar-2024