Showing results 1 to 3 of 3
Title | Author(s) | Issue Date | |
---|---|---|---|
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training Proceeding/Conference:the Sixth Conference on Machine Learning and Systems (MLSys) (04/06/2023-08/06/2023, Miami) | 6-Jun-2023 | ||
CDMPP: A Device-Model Agnostic Framework for Latency Pre- diction of Tensor Programs Proceeding/Conference:EuroSys 2024 (22/04/2024-25/04/2024, Athens) | 22-Apr-2024 | ||
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization Proceeding/Conference:the 29th ACM SIGPLAN Annual Sympo- sium Principles and Practice of Parallel Programming (PPoPP’24) (02/03/2024-06/03/2024, Edinburgh) | 2-Mar-2024 |