![]() |
||
|
|
Workshop on High Performance Computing for Geosciences Research
NCAR has a reputation and a core vision of being an integrator, innovator, and community builder. NCAR has been contributing to efforts to define both the science and cyberinfrastructure requirements for the geosciences. These findings are outlined in the two Establishing a Petascale Collaboratory for the Geosciences documents: Scientific Frontiers and Technical and Budgetary Prospectus. NCAR is working to further develop the concepts and implement the vision outlined in these documents. NCAR organized two NSF-funded workshops to encourage a common goal among the geosciences: to envision and develop the cyberinfrastructure necessary to support their full range of scientific objectives well into the future. These exploratory efforts support NCAR's strategic priorities of "Engaging a broader and more diverse community" and "Developing and providing advanced services and tools," and NCAR's strategic goal of "Improving understanding of the atmosphere, the Earth system, and the Sun." "High Performance Computing in the Geosciences" is the second NCAR-organized workshop held in FY 2006 addressing the challenges of preparing to run geoscience applications on petascale computing systems and beyond. The first workshop, "Geoscience Application Requirements for Petascale Systems" (GARPA) was held June 1-2, 2006. The High Performance Computing in the Geosciences workshop was organized and hosted by NCAR on September 25-27, 2006 in Boulder, Colorado. It attracted 125 participants, including university researchers with HPC research projects as well as representatives from other national laboratories, supercomputing facilities, and the NSF. Nine prominent NSF staff attended the workshop, including Margaret Leinen, Assistant NSF Director for the Geosciences; Jarvis Moyers, NSF/Division Director of Atmospheric Sciences; Art Goldstein, NSF Acting Division Director of Earth Sciences, and Eric Itsweire, NSF/Division of Ocean Sciences (OCE) Physical Oceanography Program Director. Workshop Day 1NCAR Director Tim Killeen welcomed participants to the workshop, then presentations were given by Dr. Steve Meacham (NSF Office of Cyberinfrastructure [OCI] Program Officer) and Dr. Margaret Leinen (NSF Geoscience Assistant Director). These presentations defined the overall NSF cyberinfrastructure strategy and provided context for further discussions. Dr. Meacham covered the OCI Portfolio (data, data analysis, and visualization; HPC; virtual organizations; and learning and workforce development), the HPC spectrum (Track 1, Track 2, Track 3, etc.), and the enabling technologies for the TeraGrid. Dr. Leinen described her view of the unique cyberinfrastructure requirements of the geosciences:
Dr. Leinen raised interesting questions for the participants:
These NSF presentations were followed by three thought-provoking talks from Dr. Dale Haidvogel of the Institute of Marine and Coastal Sciences representing coastal oceanography, Dr. Thomas Jordan of the Southern California Earthquake Center (SCEC), and Dr. Robert Wilhelmson, Chief Science Officer at the National Center for Supercomputing Applications (NCSA). Each showed important scientific applications that could use petascale hardware. There were notable commonalities in their talks:
It was clear from these presentations that all three researchers were facing the same challenges. Dr. Haidvogel discussed these challenges from the perspective of petascale coastal oceanography. He noted that the last five years have seen a lot of progress called for in the 2002 Ocean Information Technology Infrastructure (OITI) report titled Information Technology Infrastructure Plan to Advance Ocean Sciences. In particular, circulation models, observing networks, and assimilation methods are in place (or soon will be). The challenges for coastal oceanography cited by Dr. Haidvogel for the next five years include:
Dr. Jordan showed how SCEC is using frameworks and grid-based technology to solve earthquake problems. He also described the software development path for moving the TeraShake model to PetaShake. The petascale drivers for computational seismology cited include the need to:
Organizationally, he commented on the need for vertical integration of the national petascale initiatives and need for a mixture of capability, capacity, and data-intensive visualization systems. Dr. Wilhelmson made the comment that many students still spend 80% of their time doing low-level work, and only 20% on science: he advocated building tools to flip that ratio. Achieving this might be as significant as deploying a petascale system. Regarding grid technology and virtual organizations, he went on to point out lessons learned from the George E. Brown, Jr. NEES grid, namely:
Dr. Tim Palmer of the European Center for Medium Range Weather Forecasting (ECMWF) gave a well-received dinner talk titled "Petaflop Computing and Reliable Climate Predictiona European Perspective." In brief, Dr. Palmer presented the argument that if there is currently no bigger problem facing society than climate change, then quantifying the threat of climate change using the best resources available (including HPC resources) must surely be of the highest priority. Workshop Days 2-3At the start of the second day, the background from the two-part report titled Establishing a Petascale Collaboratory for the Geosciences was presented by Drs. Frank Bryan and Richard Loft of NCAR. Volume 1, Scientific Frontiers, established some of the science drivers for petascale geoscience computing, while the second report, Technical and Budgetary Prospectus, developed a straw-man cost model and feasibility plan for a geoscience collaboratory. Dr. Loft's presentation emphasized that the views expressed in the report had evolved significantly to include the NSF's cyberinfrastructure strategic plan. In particular, it incorporates:
Breakout sessions were conducted after these presentations. The morning sessions organized participants by discipline. The afternoon sessions were cross-disciplinary. The questions considered in the morning included:
The breakout sessions in the afternoon addressed:
After each breakout session, the deliberations were summarized by the breakout leaders in plenary sessions. Workshop Findings and RecommendationsThe community had several important recommendations and insights for how to go build a geocollaboratory, both what they need from it and what they can contribute.
Workshop DeliverablesA consensus was reached on several points for moving the Geocollaboratory formation process forward:
Impact of the WorkshopThe workshop brought together a critical mass of geoscientists and focused them on the issue of cyberinfrastructure needs for the geosciences. The clear message coming out of the workshop is that the geoscience community needs and is ready for better cyberinfrastructure that enables collaboration and provides access to next-generation HPC and data storage resources. Beyond that, the geoscience community made it clear that common tools for creating end-to-end modeling and forecast systems is critical to their scientific progress. An organizational structure facilitating that would be welcome. As a consensus-building and community-building exercise, the workshop was a successful first step. The deliverables cited above are relatively low cost, feasible, and will build momentum for the community moving forward. |
|