Browsing "Department of Computer Science" by Author zhao, juntao

Title | Author(s) | Issue Date | Views
 
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training
Proceeding/Conference: the Sixth Conference on Machine Learning and Systems (MLSys) (04/06/2023-08/06/2023, Miami)
6-Jun-2023
 
CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs
Proceeding/Conference: EuroSys 2024 (22/04/2024-25/04/2024, Athens)
22-Apr-2024
 
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization
Proceeding/Conference: the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP’24) (02/03/2024-06/03/2024, Edinburgh)
2-Mar-2024