There are now over 490 distinct datasets in the archive, ranging in size from less than 1 MB to over 1 TB. The total volume of data in the DSS archive was 2.4 terabytes (TB) in August 1990 and 8.5 TB in October 1997. We have been adding a lot of reanalysis data and other analyses. The change of data storage with time has been as follows:
| Data stored for Data Support and total mass store | |||||
|---|---|---|---|---|---|
| Data Support
Section | Total mass
store | ||||
| Date | Bit files | Volume (TB) | Bit files | Volume (TB) | DSS %
of mass store |
| 13 Aug 90 | 61,335 | 2.437 | -- | 14.430 | 16.9 |
| 4 Aug 91 | 65,518 | 2.689 | 715,000 | 19.400 | 13.9 |
| 3 Aug 92 | 80,538 | 3.085 | 1,060,000 | 27.270 | 11.3 |
| Aug 93 | 103,314 | 4.072 | 1,351,271 | 36.280 | 11.2 |
| 15 Sep 94 | 119,703 | 4.751 | 1,849,466 | 47.423 | 10.0 |
| 14 Feb 95 | 123,877 | 5.085 | 1,966,990 | 52.456 | 9.7 |
| 24 Jan 96 | 137,680 | 5.950 | 2,486,471 | 67.590 | 8.8 |
| 28 Aug 96 | 143,340 | 6.770 | 2,888,639 | 78.964 | 8.6 |
| 28 Feb 97 | 151,509 | 7.513 | 3,289,224 | 91.399 | 8.2 |
| 17 Oct 97 | 159,945 | 8.482 | 4,046,678 | 110.359 | 7.7 |
The DSS staff provides assistance and expertise in using the archive and help researchers locate data appropriate to their needs. Users may obtain copies of data by network access, on various tape media, or they may use data directly from the NCAR MSS. DSS staff also assist scientists by providing data access programs (to read and unpack data), other software for data manipulation, and dataset documentation. At a later point we will present more information about the use of the DSS archives.
Data requests handled during October 1996 - September 1997:
DSS staff handled many requests for information about data, data processing tools, and online access programs. Staff handled 319 requests for data to be sent offsite. These requests required data from 389 datasets. Data were selected from 22495 archive volumes (holding 2030 GB), and 1841 GB were shipped to users. Users received the data on 10 round tapes, 102 cartridges, and 620 Exabyte tapes. In addition, at least 115 users received data by electronic transfer. We shipped 34 copies of the National Meteorological Center (NMC) gridpoint Compact Disk-Read Only Memory (CD-ROM). The University of Washington sold more of these CD-ROMs.
The following table shows a time history of our data requests from 1991-1996. The amount of data that we send to users has increased from about 150 GB per year to 750 GB. The use of half-inch tapes has been decreasing a lot, while Exabyte tapes are popular. They give us a method to send large datasets at low cost, and the tape drives cost as little as $700. Many other users access the data from the NCAR computers. We help them obtain our access software, etc. Then they run their programs, and we do not count those as data requests.
| Data sent from NCAR DSS
This shows the handling of user requests at DSS | ||||||
|---|---|---|---|---|---|---|
| 1992 (8/91-7/92) | 1993 (8/92-7/93) | 1994 (9/93-8/94) | 1995 (10/94-9/95) | 1996 (9/95-8/96) | 1997 (10/96-9/97) | |
| Requests handled | 400 | 417 | 441 | 399 | 328 | 319 |
| Data from datasets (#) | 475 | 497 | -- | -- | 477 | 389 |
| Read GB for user select | 230 | 242 | 354 | 520 | 915 | 2,032 |
| On MSS files (read for users) | 6,212 | 5,099 | 7,116 | 8,268 | 9,192 | 22,495 |
| Select data to send (GB) | 150 | 154 | 258 | 382 | 750 | 1,841 |
| a. 1/2-inch tapes sent | 727 | 333 | 262 | 92 | 7 | 10 |
| b. 3480 cartridges sent | 280 | 112 | 280 | 268 | 280 | 102 |
| c. 8-mm tapes sent | 103 | 117 | 260 | 240 | 336 | 620 |
| d. PC floppy disks sent | 147 | 102 | 85 | 49 | 42 | 0 |
| FTP transfers | 16 | 80 | 89 | 190 | 120 | 115 |
| CD-ROMs sold (1946-on analyses) | 15 | 29 | 35 | 11 | 35 | 34 |
| Reanalysis CD-ROMs | -- | -- | -- | -- | -- | 755 |
A good data exchange has started with the Chinese Academy (IAP Institute, Beijing). Jenne visited Beijing in September 1996. A considerable amount of data had been exchanged by December 1996. Documents are available.
NCAR and NOAA also reached an exchange agreement with State Oceanic Administration of the Chinese National Oceanographic Data Center in Tianjin, China. We will furnish a land surface station archive in exchange for the digitization of 1.8 million ship observations from U.S. logbooks.
This project has a heavy impact on our time at NCAR. In late 1995, we had to speed up the work on all of the older datasets. We have been using 5 or 6 FTE of effort on reanalysis from September 1995 to October 1997. This is a big drain on our small group, but it is a great project.
| Reanalysis data usage | |
|---|---|
| Method | Amount of use |
| On computers at NCAR | ~3500 GB/year |
| Sent on tapes | 1931 GB by Oct. 1997 |
| Sent on CD-ROMs | 755 CD; 500 GB by Oct. 1997 |
Data first became available on the NCAR MSS in late 1994. The annual summaries for 1995, 1996, and the first half of 1997 are shown for the NCAR and University user communities. The number of unique users, numbers of MSS files, and gigabytes accessed on a read transfer are shown. We expect the dramatic growth trends shown to continue as more research activities begin on this archive.
| Reanalysis data usage on MSS | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| NCAR | University | Total | |||||||
| Year | Users | Files | GBytes | Users | Files | GBytes | Users | Files | GBytes |
| 1995 | 4 | 1,417 | 292 | 9 | 569 | 123 | 13 | 1,986 | 416 |
| 1996 | 14 | 3,753 | 811 | 45 | 8,516 | 1,869 | 59 | 12,269 | 2,679 |
| 19971 | 14 | 1,287 | 264 | 48 | 7,795 | 1,707 | 64 | 9,476 | 2,063 |
| 1 January - June 1997 only | |||||||||
Sending reanalysis data by tape. The first order was in December 1994. The cumulative orders are given below.
| Reanalysis data sent by tape | ||
|---|---|---|
| Date | Number
of orders | Cumulative
data volume (GB) |
| 1 Jan 96 | 13 | 85 |
| 1 Jan 97 | 68 | 1,187 |
| 13 Oct 97 | 117 | 1,931 |
| Reanalysis CD-ROM sales | |||
|---|---|---|---|
| Unique CDs | Orders | CD-ROMs sold | |
| 21 Apr 97 | 8 | 14 | 81 |
| 27 May 97 | 10 | 31 | 203 |
| 23 Jul 97 | 10 | 58 | 387 |
| 12 Aug 97 | 10 | 72 | 502 |
| 13 Oct 97 | 12 | 106 | 755 |
During the period September 1996 through August 1997, 15 new datasets were added to the DSS archive. New sets include the 2.5 x 2.5-degree ECMWF 15-year reanalysis, data from recent exchanges with China, the UK Marine Data Bank, additional archives from the TOGA COARE project, and arctic Sea Ice Climatology. Updates: Over 100 different datasets were updated in the past year. Six sets were updated several times each month, and eight were updated monthly.
Plans for 1997-98: The work to prepare updates will have to continue for each of these years. In fact, there may be a few more datasets that will need updates.
COADS Release 1 (April 1985) contained global marine data for the 1854-1979 period. Interim extensions to Release 1 added data for 1980 through 1991. Recent accomplishments have further extended the time series and made improvements to Release 1 and the interim extensions. Release 1a adds data to the time series for years 1980-1995. Release 1b has upgraded and replaced data for the 1950-1979 period. Along with adding observations to COADS, we have made data processing improvements that increase the overall quality of the data, e.g. upgraded the format to include more data fields, fixed some known data errors, and changed processing rules to achieve an improved mixture (from the many data sources) of observations.
The next major development for COADS will be a reprocessing of the 1854-1949 time period. This phase will include a project that will merge the U.K. Marine Data Bank with COADS as well as include newly digitized data for this early period. In parallel, we will continue to develop our online documentation, accessible via FTP and the WWW, so that the COADS data users can conveniently keep informed about our achievements.
A few current achievements and forthcoming projects are briefly described to illustrate the scope of these activities.
By responding to data service needs like these and many other smaller activities, the DSS supports a wide variety research throughout the national and international community.
Under the CEDAR program, this effort has expanded to include related ground-based measurements and model output. For example, Fabry-Perot interferometer observations, Light Detection and Ranging (LIDAR) observations by the University of Illinois, Thermospheric/Ionospheric General Circulation Model (TIGCM) output from Ray Roble (HAO), Assimilative Mapping of Ionospheric Electrodynamics (AMIE) model output from Art Richmond (HAO), and Global Scale Winds Model output from Maura Hagan (HAO). Mesosphere-troposphere radar, medium-frequency radar, and high-frequency radar data have been added. Most recently data have been added from the Japanese MU (Mesosphere and Upper Atmosphere) incoherent scatter radar.
A minicomputer is maintained at NCAR for access to this database. Batch and interactive software and documentation have been written and installed. Internet access is maintained at two levels: Documentation and data inventories may be obtained via anonymous FTP or web page, but a login is required to obtain data. Current plans are to add an interactive data selection capability to the web interface.
We assist data contributors by designing new record layouts, providing conversion software and verifying results. DSS staff will periodically archive new data contributions, fill data and software requests, consult with users, and prepare and distribute an annual catalog.
NCAR/TN-427+PPR-CEDAR DATABASE COMMITTEE REPORT. JM Holt (MIT Haystack Observatory) and B.A. Emery. June 96.
| NCAR | UCAR | NSF | NCAR FY97 ASR |