Overview of the Distributed CSL Plans and Requirements

Version 1.1

March 31, 1995


Objective

The NSF Opportunity Fund Award enables GEO and CISE to pursue our vision of enabling, for the first time, explicit collaborations between the GCRP and the HPCC in the area of distributed, high-performance computing. Specifically, we will explore a potentially powerful climate modeling capability by utilizing the following emerging GCRP and HPCC assets within NSF:

We propose to distribute the NCAR Climate System Model (CSM) between the NCAR CSL and one CISE center. We will refer to this as the Distributed Climate System Model (DCSM). All model output will be transmitted to NCAR and stored on a high-performance shared filesystem. Our primary objective will be to evaluate the potential of using all or most of the NSF metacenter high performance systems to provide the amount of computing power required to undertake climate simulations that are simply impossible on any single computing system. We plan to demonstrate this potential by carrying out a four phase project that builds on the aforementioned assets and culminates with "real science" climate simulations consuming roughly 300 Y-MP-1 hours of computing resources. These experiments will require the storage and distributed visualization of the associated model output data. The critical technological milestones that must be accomplished to realize this objective are:


Technical Overview

The DCSM consists of four principle model components: an atmospheric model, a global ocean model, a sea-ice model, and the flux coupler. The coupling of the component models in DCSM (ocean,atmosphere,sea-ice) is achieved through the exchange of flux information via PVM. This exchange is mediated via the flux coupler. The flux coupler performs the interpolations between different grids and performs the flux computations itself. Because of the physical time scales in the problem, the flux coupler communicates with the atmosphere much more often than with the ocean. For this reason current vBNS bandwidth constraints will require the flux coupler and the atmospheric model to run tightly coupled at one site, and the ocean model at the other.

Good machine-to-machine communication will be needed to support flux exchange via PVM over the vBNS. While it is desirable for the coupled machines to both integrate forward in model time at the same wall clock rate and thus achieve perfect load balance, we recognize that this will be difficult to achieve in practice. We expect the most numerically intensive model components to determine the rate at which the overall simulation proceeds and we refer to the system upon which they run as the "pacing machine". The model components on the non-pacing machine will spend some part of their execution time waiting to receive data from the pacing machine. In all cases, the pacing machine will be at NCAR. We will tune the amount of computing resources allocated to the non-pacing machine so as to achieve as close to overall load balance as possible. We will also need to transport large volumes of model output back to the high-performance shared filesystem at NCAR, via the vBNS. While this is a "background" process, there may be model and system tuning issues involved in order to avoid overflowing the data buffers at both sites. See Appendix I for a detailed description of the DCSM model components and their characteristics.

Figure 1. - Conceptual diagram of project

Distributed climate modeling introduces a number of difficulties into the traditional visualization process, most of which center around the size and number of the datasets, and their wide-area distribution on the network. As processing power continues to grow and outstrip network bandwidth, visualization processes may necessarily become distributed: instead of moving data around we instead calculate imagery, geometry, and compressed movies close to the data, transmit the information over the net and then visualize it. The visualization environment may need to construct visualizations from multiple data sources.

The overall objective of the visualization component of the DCSL is to develop and demonstrate interactive visualization and analysis of distributed very-large (.1TB) atmospheric and oceanic datasets using the compute/storage/visualization resources at two centers and the vBNS as a high-speed network interconnect. Visualization techniques will range from browsing functions to state-of-the-art environments for visualizing atmospheric and related phenomena. Focus areas will include:

Schedules permitting, first results of this collaborative effort will be demonstrated at SuperComputing 95 in San Diego at the NSF Metacenter Booth.


Appendix I: Descriptions of DCSM Model Components

1. Flux Coupler

The flux coupler performs the interpolations between different model grids and computes the fluxes that couple the model components together. The flux coupler passes these fluxes between the model components using the PVM message passing library, communicating much more often with the atmosphere than with the ocean or sea ice models. The following machine resource estimates assume that the atmospheric component is running at T42 (2.5 degree) horizontal resolution and that the ocean and sea ice component is running at a 360x180 (1 degree) resolution.

2. DCSM Candidate Ocean Models

The following machine resource estimates assume that the candidate ocean model component is running at 1 degree resolution (360 x 180 horizontal grid).

2. Sea Ice Model

The following machine resource estimates assume that the DCSM sea ice model component is running at 1 degree resolution (360 x 180 horizontal grid).

4. CSM Atmospheric Model

The following machine resource estimates assume that the atmospheric component is running at T42L18 (128 x 64 horizontal grid) with 18 levels.


Appendix II: The National Library for Applied Network Research (NLANR)

The National Laboratory for Applied Network Research (NLANR) has as one of its research roles:

"... a Metacomputer Interconnect Laboratory to develop advanced distributed applications and to transition advanced capabilities, such as those developed in gigabit testbeds, toward robust services delivered on the vBNS and on the larger Internet as bandwidth increases over time."
@(#)index.html 1.3 95/11/29 don@ncar.ucar.edu