Browsing by Author Zhao, Juntao

Jump to: 0-9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Showing results 1 to 4 of 4
TitleAuthor(s)Issue Date
 
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training
Proceeding/Conference:the Sixth Conference on Machine Learning and Systems (MLSys) (04/06/2023-08/06/2023, Miami)
6-Jun-2023
 
CDMPP: A Device-Model Agnostic Framework for Latency Pre- diction of Tensor Programs
Proceeding/Conference:EuroSys 2024 (22/04/2024-25/04/2024, Athens)
22-Apr-2024
 
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization
Proceeding/Conference:the 29th ACM SIGPLAN Annual Sympo- sium Principles and Practice of Parallel Programming (PPoPP’24) (02/03/2024-06/03/2024, Edinburgh)
2-Mar-2024
 
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices
Proceeding/Conference:the 38th IEEE International Parallel & Distributed Processing Symposium (IPDPS) (27/05/2024-31/05/2024, San Francisco)
29-May-2024