File Download

There are no files associated with this item.

Supplementary

Conference Paper: Lancet: Accelerating Mixture-of-Experts Training by Overlapping Weight Gradient Computation and All-to-All Communication

TitleLancet: Accelerating Mixture-of-Experts Training by Overlapping Weight Gradient Computation and All-to-All Communication
Authors
Issue Date15-May-2024
Persistent Identifierhttp://hdl.handle.net/10722/347385

 

DC FieldValueLanguage
dc.contributor.authorJiang, Chenyu-
dc.contributor.authorTian, Ye-
dc.contributor.authorJia, Zhen-
dc.contributor.authorWu, Chuan-
dc.contributor.authorWang, Yida-
dc.contributor.authorZheng, Shuai-
dc.date.accessioned2024-09-23T00:30:15Z-
dc.date.available2024-09-23T00:30:15Z-
dc.date.issued2024-05-15-
dc.identifier.urihttp://hdl.handle.net/10722/347385-
dc.languageeng-
dc.relation.ispartofThe Seventh Conference on Machine Learning and Systems (MLSys) (13/05/2024-16/05/2024, Santa Clara)-
dc.titleLancet: Accelerating Mixture-of-Experts Training by Overlapping Weight Gradient Computation and All-to-All Communication-
dc.typeConference_Paper-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats