Browsing by Author Peng, Yanghua

Jump to: 0-9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Showing results 1 to 5 of 5
TitleAuthor(s)Issue Date
 
Advisor(s):Wu, C
2020
 
CDMPP: A Device-Model Agnostic Framework for Latency Pre- diction of Tensor Programs
Proceeding/Conference:EuroSys 2024 (22/04/2024-25/04/2024, Athens)
22-Apr-2024
 
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization
Proceeding/Conference:the 29th ACM SIGPLAN Annual Sympo- sium Principles and Practice of Parallel Programming (PPoPP’24) (02/03/2024-06/03/2024, Edinburgh)
2-Mar-2024
 
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices
Proceeding/Conference:the 38th IEEE International Parallel & Distributed Processing Symposium (IPDPS) (27/05/2024-31/05/2024, San Francisco)
29-May-2024
 
15-Apr-2023