U.S. Geological Survey Data Series 757
The Data Acquisition, Processing and Storage (DAPS) group maintains an online field activity catalog for the CMGP known as InfoBank. This catalog is a structured information storage scheme of databases and software that provide access to USGS-CMGP data and metadata. Infobank can be accessed at http://walrus.wr.usgs.gov/infobank.
The raw and processed digital data files are stored in InfoBank by cruise identifier (ID) and data type in ASCII text format. The DAPS group uses a numbered system of filename extensions to distinguish between data types and processing levels. For example, bathymetry files have a .3xx filename suffix and navigation files have a .0xx, or .6xx suffix. More information about the data type numbering (.xxx) conventions used within InfoBank is provided in appendix 1. InfoBank also denotes the most processed file as the "best file" (see fig. 2).
These "best" digital data files have been processed through various USGS developed software packages depending on data type (bathymetry or navigation) and data quality. Details about these programs can be accessed at the following Websites.
Bathymetry:
http://walrus.wr.usgs.gov/infobank/programs/html/programclassification/bat.html
Navigation:
http://walrus.wr.usgs.gov/infobank/programs/html/programclassification/nav.html
The bathymetry datasets were commonly processed using three programs. (1) batred, which calculates depth from travel times; (2) nosamebathy, which eliminates identical adjacent bathymetry values; and (3) nozerobathy, which removes records with a depth value of "0.0". Not all of the datasets included in this report were processed with one or more of these programs, however. Details about which processing steps were taken by the DAPS group are provided in the header information of the bathymetry and navigation ASCII files downloaded from InfoBank.
Prior to reformatting the data into MGD77T format the "best" bathymetry and navigation files from InfoBank were edited to remove the header information and reformatted using Excel 2010 to separate the data into tab-delimited columns. Next the bathymetry and navigation files were then merged. Lastly, the combined navigation and bathymetry was saved as a .txt file.
The NGDC created a program to assist in the formatting of geophysical data into the MDG77T format. This program is called GEODAS ReFormat to MDG77T and can be downloaded for use at http://www.ngdc.noaa.gov/mgg/gdas/gx_announce.Html.
The steps used for processing our single beam bathymetry datasets were as follows:
For step number 9 above, the first four input fields selected were always the same input choices. These were:
Additional data inputs depended on the type of bathymetry data retrieved from InfoBank, which was indicated in the original data file header information.
Choices included (corrections are for sound velocity to depth):
To create the accompanying header files (.h77t), an Excel spreadsheet was supplied from the NGDC with already populated required field names.
Information about the NGDC header field requirements and definitions is available at http://www.ngdc.noaa.gov/mgg/dat/geodas/docs/mgd77.pdf (74KB PDF).
When creating a header file for each cruise, fields were populated in the header spreadsheet based on InfoBank metadata records, digital data file headers, and accompanying publications (often cruise reports) for the survey. If a field was unknown or did not apply to the dataset it was intentionally left blank.
Processed bathymetric depth values are then spot checked based on position against the Global Multi-Resolution Topography (GMRT) grid within GeoMapApp. This ensures that data depths are reasonable; datasets with depth values significantly different from the global grid bathymetric depths were excluded from this archive and not submitted to the NGDC. They were also reported to the DAPS group as possibly erroneous or poor-quality data.