
The MxN problem in Distributed Scientific Computing


Overview

Increasingly, software components and even complete applications in scientific computing are being composed to create new large-scale multidisciplinary simulations. High-performance software is often SPMD parallel, which leads to the MxN problem: connecting a component running on M processors to one running on N processors, where M and N may differ. In research partly funded by the Department of Energy, we are exploring the semantics of parallel remote method invocation, specification methods for data distribution templates, and dynamic data redistribution between parallel components.
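As a rough illustration of the data redistribution part of the problem, the sketch below computes which index ranges each sender must ship to each receiver when a one-dimensional array moves from an M-way block distribution to an N-way block distribution. It is a minimal example under stated assumptions, not the method used in the publications listed here: the function names `block_ranges` and `mxn_schedule` are hypothetical, the array is assumed to be one-dimensional and block-distributed on both sides, and no actual communication is performed.

```python
def block_ranges(n_elems, n_procs):
    """Split [0, n_elems) into n_procs contiguous blocks, as evenly as possible.

    Assumption: a simple block distribution; real components may use
    cyclic, block-cyclic, or irregular layouts instead.
    """
    base, extra = divmod(n_elems, n_procs)
    ranges = []
    start = 0
    for rank in range(n_procs):
        # The first `extra` ranks each take one additional element.
        size = base + (1 if rank < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


def mxn_schedule(n_elems, m, n):
    """Return (sender, receiver, lo, hi) transfers from an M-way to an N-way layout.

    Each tuple means: sender rank owns global indices [lo, hi) and must
    deliver them to receiver rank. Hypothetical helper for illustration.
    """
    schedule = []
    for s, (s_lo, s_hi) in enumerate(block_ranges(n_elems, m)):
        for r, (r_lo, r_hi) in enumerate(block_ranges(n_elems, n)):
            # Overlap of the sender's block with the receiver's block.
            lo, hi = max(s_lo, r_lo), min(s_hi, r_hi)
            if lo < hi:
                schedule.append((s, r, lo, hi))
    return schedule


if __name__ == "__main__":
    # 12 elements moving from M = 3 processes to N = 4 processes.
    for sender, receiver, lo, hi in mxn_schedule(12, 3, 4):
        print(f"sender {sender} -> receiver {receiver}: indices [{lo}, {hi})")
```

With M = 3 and N = 4, every sender talks to two receivers, so neither a one-to-one mapping nor a single gather/scatter point describes the transfer; a schedule of this kind is what a redistribution layer would have to derive from the two distribution descriptions.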

Publications

Note: Slides in PDF format were exported from the original OpenOffice (SXI) files. OpenOffice generally does a good job, but some graphics may not render correctly. If so, check the SXI version.

Data Redistribution and Remote Method Invocation in Parallel Component Architectures.
Felipe Bertrand, David Bernholdt, Randall Bramley, Kostadin Damevski, James Kohl, Jay Larson, Alan Sussman.
Accepted for publication in the Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS'05).
April, 2005.
[PS] [PDF] [BIBTEX]

A Component Architecture for High-Performance Scientific Computing.
D. E. Bernholdt, B. A. Allan, R. Armstrong, F. Bertrand, K. Chiu, T. L. Dahlgren, K. Damevski, W. R. Elwasif, T. G. W. Epperly, M. Govindaraju, D. S. Katz, J. A. Kohl, M. Krishnan, G. Kumfert, J. W. Larson, S. Lefantzi, M. J. Lewis, A. D. Malony, L. C. McInnes, J. Nieplocha, B. Norris, S. G. Parker, J. Ray, S. Shende, T. L. Windus, S. Zhou.
Intl. J. High-Perf. Computing Appl., 2005, to appear in ACTS Collection special issue.
[PDF] [BIBTEX]

DCA: A distributed CCA framework based on MPI.
Felipe Bertrand and Randall Bramley.
Proceedings of the 9th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'04).
April, 2004.
[PS] [SLIDES-SXI] [SLIDES-PDF] [BIBTEX]

An Approach to Parallel MxN Communication.
Felipe Bertrand, Yongquan Yuan, Kenneth Chiu and Randall Bramley.
Proceedings of the Los Alamos Computer Science Institute (LACSI) Symposium.
October, 2003.
[PS] [SLIDES-PPT] [BIBTEX]

Programming the Grid: Distributed Software Components, P2P and Grid Web Services for Scientific Applications.
Dennis Gannon, Randall Bramley, Geoffrey Fox, Shava Smallen, Al Rossi, Rachana Ananthakrishnan, Felipe Bertrand, Ken Chiu, Matt Farrellee, Madhu Govindaraju, Sriram Krishnan, Lavanya Ramakrishnan, Yogesh Simmhan, Alek Slominski, Yu Ma, Caroline Olariu, Nicolas Rey-Cenvaz.
Journal of Cluster Computing, Special Issue on Grid Computing.
July, 2002.
[PDF] [BIBTEX]

Projects

libPRMI: A parallel-remote method invocation library.
A library for performing parallel-remote method invocations. It is intended for use in a component-based framework as a provider of parallel, distributed, and collective calls.

Local Resources

A writeup about parallel data representations. [HTML]
Felipe Bertrand and Randall Bramley.

Notes from the CCA Data Meeting held in SLC 7/28/04. [HTML]
Randall Bramley.

A reading list regarding MxN used for my PhD qualifying exam. [HTML]
Felipe Bertrand and Randall Bramley.

Data Redistribution and Remote Method Invocation in Parallel Component Architectures. [PDF] [SLIDES-SXI] [SLIDES-PDF]
PhD thesis proposal.
Felipe Bertrand and Randall Bramley.

External Links

The CCA-Forum home page. [LINK]
The Common Component Architecture (CCA) Forum is a group of researchers from national labs and academic institutions committed to defining a standard component architecture for high performance computing.

The MPI-Forum home page. [LINK]
This location contains the official MPI (Message Passing Interface) standards documents, errata, and archives of the MPI Forum. The MPI Forum is an open group with representatives from many organizations that define and maintain the MPI standard.

MxN Parallel Data Redistribution @ ORNL. [LINK]
ORNL's effort on MxN Parallel Data Redistribution in the CCA.

Jay Larson's Model Coupling Toolkit (MCT). [LINK]
MCT is a set of software tools for coupling message-passing parallel models to create a parallel coupled model.

InterComm (University of Maryland) [LINK]
InterComm is a runtime library that achieves direct data transfers between data structures managed by multiple data parallel languages and libraries in different programs. Such programs include those that directly use a low-level message-passing library, such as MPI.

SCIRun (University of Utah) [LINK]
SCIRun is a multifunctional problem solving environment that can be best described as a computational workbench by which the user can "close the loop." All aspects of the modeling, simulation, and visualization processes are linked, controlled graphically within the context of a single application program.

People

Randall Bramley [HOME]
Felipe Bertrand [HOME] [RESUME]

Acknowledgments

Supported in part by NSF Grant EIA-0202048 and the DoE SciDAC program.


Felipe Bertrand
Last modified: Thu Jul 8 12:17:00 EST 2004