File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Portable and scalable algorithms for irregular all-to-all communication

TitlePortable and scalable algorithms for irregular all-to-all communication
Authors
Issue Date1996
Citation
Proceedings - International Conference On Distributed Computing Systems, 1996, p. 428-435 How to Cite?
AbstractIn this paper, we develop portable and scalable algorithms for performing irregular all-to-all communication in High Performance Computing (HPC) systems. To minimize the communication latency, the algorithm reduces the total number of messages transmitted, reduces the variance of the lengths of these messages, and overlaps the communication with computation. The performance of the algorithm is characterized using a simple model of HPC systems. Our implementations are performed using the Message Passing Interface (MPI) standard and they can be ported to various HPC platforms. The performance of our algorithms is evaluated on CM5, T3D and SP2. The results show the effectiveness of the techniques as well as the interplay between the architectural features, the machine size, and the variance of message lengths. The experiences of our study can be applied in other HPC systems to optimize the performance of collective communication operations.
Persistent Identifierhttp://hdl.handle.net/10722/151815

 

DC FieldValueLanguage
dc.contributor.authorLiu, Wenhengen_US
dc.contributor.authorWang, ChoLien_US
dc.contributor.authorPrasanna, Viktor Ken_US
dc.date.accessioned2012-06-26T06:29:49Z-
dc.date.available2012-06-26T06:29:49Z-
dc.date.issued1996en_US
dc.identifier.citationProceedings - International Conference On Distributed Computing Systems, 1996, p. 428-435en_US
dc.identifier.urihttp://hdl.handle.net/10722/151815-
dc.description.abstractIn this paper, we develop portable and scalable algorithms for performing irregular all-to-all communication in High Performance Computing (HPC) systems. To minimize the communication latency, the algorithm reduces the total number of messages transmitted, reduces the variance of the lengths of these messages, and overlaps the communication with computation. The performance of the algorithm is characterized using a simple model of HPC systems. Our implementations are performed using the Message Passing Interface (MPI) standard and they can be ported to various HPC platforms. The performance of our algorithms is evaluated on CM5, T3D and SP2. The results show the effectiveness of the techniques as well as the interplay between the architectural features, the machine size, and the variance of message lengths. The experiences of our study can be applied in other HPC systems to optimize the performance of collective communication operations.en_US
dc.languageengen_US
dc.relation.ispartofProceedings - International Conference on Distributed Computing Systemsen_US
dc.titlePortable and scalable algorithms for irregular all-to-all communicationen_US
dc.typeConference_Paperen_US
dc.identifier.emailWang, ChoLi:clwang@cs.hku.hken_US
dc.identifier.authorityWang, ChoLi=rp00183en_US
dc.description.naturelink_to_subscribed_fulltexten_US
dc.identifier.scopuseid_2-s2.0-0029719954en_US
dc.identifier.spage428en_US
dc.identifier.epage435en_US
dc.identifier.scopusauthoridLiu, Wenheng=7407340793en_US
dc.identifier.scopusauthoridWang, ChoLi=7501646188en_US
dc.identifier.scopusauthoridPrasanna, Viktor K=7005057102en_US

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats