Title | Author(s) | Proceeding/Conference | Issue Date
---|---|---|---
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training | | the Sixth Conference on Machine Learning and Systems (MLSys) (04/06/2023-08/06/2023, Miami) | 6-Jun-2023
CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs | | EuroSys 2024 (22/04/2024-25/04/2024, Athens) | 22-Apr-2024
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | | the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP’24) (02/03/2024-06/03/2024, Edinburgh) | 2-Mar-2024
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices | | the 38th IEEE International Parallel & Distributed Processing Symposium (IPDPS) (27/05/2024-31/05/2024, San Francisco) | 29-May-2024