Showing results 1 to 3 of 3
Title | Author(s) | Issue Date | |
---|---|---|---|
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Proceeding/Conference:The Seventh Conference on Machine Learning and Systems (MLSys) (13/05/2024-16/05/2024, Santa Clara) | 15-May-2024 | ||
DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines Proceeding/Conference:EuroSys 2024 (22/04/2024-25/04/2024, Athens) | 22-Apr-2024 | ||
Lancet: Accelerating Mixture-of-Experts Training by Overlapping Weight Gradient Computation and All-to-All Communication Proceeding/Conference:The Seventh Conference on Machine Learning and Systems (MLSys) (13/05/2024-16/05/2024, Santa Clara) | 15-May-2024 |