Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

CiteULike is a free service for managing and discovering scholarly references - click here to get started.

Sign In to gain access to subscriptions and/or personal tools.
International Journal of High Performance Computing Applications
This Article
Right arrow Abstract Freely available
Right arrow Free Full Text (Free PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Vadhiyar, S. S.
Right arrow Articles by Dongarra, J. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Towards an Accurate Model for Collective Communications

Sathish S. Vadhiyar

Graham E. Fagg

Jack J. Dongarra

COMPUTER SCIENCE DEPARTMENT UNIVERSITY OF TENNESSEE, KNOXVILLE, USA

The performance of the MPI's collective communications is critical in most MPI-based applications. A general algorithm for a given collective communication operation may not give good performance on all systems due to the differences in architectures, network parameters and the storage capacity of the underlying MPI implementation. Hence, collective communications have to be tuned for the system on which they will be executed. In order to determine the optimum parameters of collective communications on a given system in a time-efficient manner, the collective communications need to be modeled efficiently. In this paper, we discuss various techniques for modeling collective communications.

Key Words: MPI • collectives • communications • tuning • modeling/broadcast

References

  • Culler, D., Karp, R., Patterson, D., Sahay, A., Schauser, K. E., Santos, E., Subramonian, R., and von Eicken, T. May 1993. LogP: towards a realistic model of parallel computation . In Proceedings of the Symposium on Principles and Practice of Parallel Programming, San Diego, CA, pp. 1–12 .
  • Fagg, G. E. and Dongarra, J. J. 2000. FT-MPI: fault tolerant MPI, supporting dynamic applications in a dynamic world . In Proceedings of EuroPVM-MPI 2000, Lecture Notes in Computer Science Vol. 1908, Springer-Verlag, Berlin, pp. 346–353 .
  • Fagg, G. E., Vadhiyar, S. S., and Dongarra, J. J. 2000. ACCT: automatic collective communications tuning . In Proceedings of EuroPVM-MPI 2000, Lecture Notes in Computer Science Vol. 1908, Springer-Verlag, Berlin, pp. 354–361 .
  • Frigo, M. 1998. FFTW: an adaptive software architecture for the FFT . In Proceedings of the ICASSP Conference, Vol. 3, pp. 1381-1381 .
  • Hensgen, D., Finkel, R., and Manber, U. 1988. Two algorithms for barrier synchronization . International Journal of Parallel Programming 17(1): 1–17 .
  • Huse, L. P. September 1999. Collective communication on dedicated clusters of workstations . In Proceedings of the 6th European PVM/MPI Users’ Group Meeting, Barcelona, Spain, LNCS Vol. 1697, Springer-Verlag, Berlin. pp. 469–476 .
  • Kielmann, T., Bal, H. E., and Gorlatch, S. May 2000. Bandwidth-efficient collective communication for clustered wide area systems. In IPDPS 2000, Cancun, Mexico .
  • Rabenseifner, R. 1997. A new optimized MPI reduce algorithm. http://www.hlrs.de/organization/par/services/models/mpi/myreduce.html.
  • Snir, M., Otto, S., Huss-Lederman, S., Walker, D., and Dongarra, J. 1998. MPI – the complete reference. In The MPI Core, Vol. 1, 2nd edition.
  • Vadhiyar, S. S., Fagg, G. E., and Dongarra, J. J. November 2000. Automatically tuned collective communications . In Proceedings of SuperComputing2000, Dallas, TX.
  • Whaley, R. C. and Dongarra, J. 1998. Automatically tuned linear algebra software. In SC98: High Performance Networking and Computing, Orlando, FL. See .

International Journal of High Performance Computing Applications, Vol. 18, No. 1, 159-167 (2004)
DOI: 10.1177/1094342004041297


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Free Full Text (Free PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Vadhiyar, S. S.
Right arrow Articles by Dongarra, J. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?