Browsing by Author Jia, Zhen

Title | Author(s) | Issue Date
 
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines
Proceeding/Conference: The Seventh Conference on Machine Learning and Systems (MLSys) (13/05/2024-16/05/2024, Santa Clara)
15-May-2024
 
DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
Proceeding/Conference: EuroSys 2024 (22/04/2024-25/04/2024, Athens)
22-Apr-2024
 
Lancet: Accelerating Mixture-of-Experts Training by Overlapping Weight Gradient Computation and All-to-All Communication
Proceeding/Conference: The Seventh Conference on Machine Learning and Systems (MLSys) (13/05/2024-16/05/2024, Santa Clara)
15-May-2024