USGS

DIGITAL DATA SETS OF  DEPTH-DURATION FREQUENCY OF PRECIPITATION FOR OKLAHOMA

By Alan Rea and Robert L. Tortorelli

U.S. Geological Survey Open-File Report 99-463
 
 

Prepared in cooperation with the
Oklahoma Department of Transportation

Oklahoma City, Oklahoma
1999


TABLE OF CONTENTS

Abstract
Introduction
Purpose
Error Analysis
File Formats
Data Organization
Notes
References

ABSTRACT

These geospatial data sets were produced as part of a regional precipitation frequency analysis for Oklahoma. The data sets consist of surface grids of precipitation depths for seven frequencies (expressed as recurrence intervals of 2-, 5-, 10-, 25-, 50-, 100-, and 500-years) and 12 durations (15-, 30-, and 60-minutes; 1-, 2-, 3-, 6-, 12-, and 24-hours; and 1-, 3-, and 7-days). Eighty-four depth-duration-frequency surfaces were produced from precipitation-station data. Precipitation-station data from which the surfaces were interpolated and contour lines derived from each surface also are included. Contour intervals vary from 0.05 to 0.5 inch.

Data were used from precipitation gage stations with at least 10 years of record within Oklahoma and a zone extending about 50 kilometers into bordering states. Three different rain gage networks provided the data (15-minute, 1-hour, and 1-day). Precipitation annual maxima (depths) were determined from the station data for each duration for 110 15-minute, 141 hourly, and 413 daily stations. Statistical methods were used to estimate precipitation depths for each duration-frequency at each station. These station depth-duration-frequency estimates were interpolated to produce continuous grids with grid-cell spacing of 2,000 meters. Contour lines derived from these surfaces (grids) were used to produce the maps in the "Depth-Duration Frequency of Precipitation for Oklahoma," by R.L. Tortorelli, Alan Rea, and W.H. Asquith, U.S. Geological Survey Water-Resources Investigations Report 99-4232. The geospatial data sets are presented in digital form for use with geographic information systems. These geospatial data sets may be used to determine an interpolated value of depth-duration-frequency of precipitation for any point in Oklahoma. 


INTRODUCTION

These geospatial data sets were produced as part of a regional precipitation frequency analysis. Precipitation depths for various durations and frequencies, referred to as depth-duration frequency of precipitation in this report, have many uses. A common use of depth-duration frequency of precipitation is for the design of drainage structures that control and route localized runoff--such as parking lots, storm drains, and culverts. Another use of depth-duration frequency of precipitation is for compilation of rainfall-runoff river-flow models, which incorporate precipitation characteristics. Accurate depth-duration frequency of precipitation estimates are important for economical and safe structural designs at stream crossings and for developing reliable flood prediction models.

The geospatial data sets are presented in digital form for use with geographic information systems. The data sets consist of surface grids of precipitation depths for seven frequencies (expressed as recurrence intervals of 2-, 5-, 10-, 25-, 50-, 100-, and 500-years) and 12 durations (15-, 30-, and 60-minutes; 1-, 2-, 3-, 6-, 12-, and 24-hours; and 1-, 3-, and 7-days) for Oklahoma. Eighty-four depth-duration-frequency surfaces were produced from precipitation-station data. Contour lines derived from each surface and the precipitation-station data from which the surfaces were interpolated also are included. Contour intervals vary from 0.05 to 0.5 inch. These geospatial data sets may be used to determine an interpolated value of depth-duration-frequency of precipitation for any point in Oklahoma. A geospatial data set of county boundaries for Oklahoma (Rea and Becker, 1997) is provided for spatial reference.

Data were used from precipitation gage stations with at least 10 years of record within Oklahoma and a zone extending about 50 kilometers into bordering states. Three different rain gage networks provided the data (15-minute, 1-hour, and 1-day). Precipitation annual maxima (depths) were determined from the station data for each duration for 110 15-minute, 141 hourly, and 413 daily stations. Statistical methods (Tortorelli, Rea, and Asquith, 1999) were used to estimate precipitation depths for each duration-frequency at each station. These station depth-duration-frequency estimates were interpolated to produce continuous grids with grid-cell spacing of 2,000 meters.

Length-of-record weighting for the grid interpolation was done by multiplying the depth-duration frequency of precipitation values by years of record for each station, then interpolating a surface of the multiplied values using the POINTINTERP function in ARC/INFO GRID. Then POINTINTERP was used to interpolate a second surface of the years of record. Then for each grid cell the values of the first surface were divided by the values of the second surface. This method weighted the final surface by distance and years of record. As a result the final interpolated surface is closer to the observed values at stations having more years of record than at stations with less years of record. Contour lines derived from these surfaces (grids) were used to produce the maps in the Depth-Duration Frequency of Precipitation for Oklahoma (Tortorelli, Rea, and Asquith, 1999).

Additional information about the statistical methods is in Tortorelli, Rea, and Asquith (1999). Additional information about the spatial interpolation methods is included in the metadata for the surfaces data sets.

The data sets are provided in nonproprietary and ARC/INFO  file formats. No software is provided with these data sets. Users will need geographic information system (GIS) software to use the data sets. The U.S. Geological Survey does not recommend or endorse any particular software package for use with these data sets. For links to more information on GIS software and capabilities, see: The GIS FAQ (Frequently Asked Questions), GIS Companies on the WWW, USGS GIS Information.


PURPOSE

The purpose of this report is to provide updated digital estimates of depth-duration frequency of precipitation for any location in Oklahoma. The data sets presented were used to produce the maps in the Depth-Duration Frequency of Precipitation for Oklahoma (Tortorelli, Rea, and Asquith, 1999). The digital geospatial data sets  1) document the data editing process used to remove questionable data values from the original data sets and 2) present the results of the spatial interpolations in a form usable with GIS software.

ERROR ANALYSIS

The error associated with each surface was analyzed using methods described by Asquith (1998). The error analysis was conducted only for stations within Oklahoma (statewide). The error associated with a station is defined as

                                E(i) = X(i) - S(i)
where

 E(i) = error associated with precipitation depth for a duration-frequency at station i;
 X(i) = value of the precipitation depth for a duration-frequency at the station (control point); and
 S(i) = value of the precipitation depth for a duration-frequency at the station from the surface.

The error defined is only indicative of the true error because each station was used in the surface developments. An independent means to measure error is not available.

The error analysis results for precipitation depth are listed in table 1. The table shows the comparison of  the statewide mean for the control points, the standard deviation of differences between the statewide mean and each control point (station) value, and the coefficient of variation for the control points. The mean surface (contour) error, mean absolute surface (contour) error, maximum and minimum surface errors, root mean square error, and percent change from standard deviation to root mean square error for each depth-duration frequency also are listed. Each statistic is a weighted value, except for the maximum and minimum surface errors,  which means the length of record of each station was considered in the computation. Maximum and minimum surface errors are listed to show the range of errors found in each depth-duration frequency of precipitation map.

The weighted mean surface error is the mean difference between values of the control points and values from the surface. Mean surface errors near zero are desirable, because this indicates how well the interpolation routine worked. This error reflects any potential bias in the surface.

Root mean square error (RMSE) is analogous to the standard deviation (SD), and, therefore, the RMSE is comparable to the SD of the control points (differences between the statewide mean value and individual station values). The percent change from SD to RMSE, 100*[(RMSE ­ SD) / SD], indicates of the improvement in the precipitation depth estimate by using the surfaces (or contour maps) rather than simply using the corresponding statewide mean. If the percent decrease from the statewide standard deviation to the root mean square error is not at least 15 percent (Asquith, 1998), then a contour map is not better than a single statewide mean.

Percent changes from standard deviation to root mean square error for the (1) 15-minute to 60-minute durations ranged between -34.7 to -52.0 percent; (2) 1-hour to 24-hour durations ranged between -42.1 and -70.4 percent; and (3) 1-day and 7-day durations ranged between -51.8 and -70.3 percent. Therefore, the use of the surfaces (contour maps) results in more accurate estimates of the precipitation depth than simply using a statewide mean.


FILE FORMATS


The data sets provided are available in nonproprietary and ARC/INFO file formats. See the following table for an explanation of the file formats.
 
 
File/Data-set type
Nonproprietary format
ARC/INFO format
sites
.TXT--delimited ASCII
ARC/INFO coverages
surfaces
.ASC--ASCII
ARC/INFO grids
contours
.DLG--Digital Line Graph
ARC/INFO coverages
county
.DLG--Digital Line Graph
ARC/INFO coverages and .E00 export files
metadata
.HTM--HTML
--

The sites are stored as comma-delimited ASCII text files. The first record in each file contains a field identifier. This format is compatible with many spreadsheet and data-base management systems. Additional information about the field contents may be found in the "Entity_and_Attribute_Information" section of the metadata for the sites data sets.

The surfaces are stored in an ASCII grid format. This is a simple space-delimited array of values preceded by  header information. The header information identifies the number of columns and rows in the array, the X- and Y-coordinates of the lower left corner, the grid-cell size, and the value used to indicate missing data.

The contour data sets are stored in the generic, public-domain Digital Line Graph (DLG-3) Version 3, Optional format, and have file extensions of ".DLG". Designed for data interchange, the DLG-3 format allows the simple creation of a vector polygon or line data structure. The topological linkages are explicitly encoded for node, area, and line elements. The files are composed of 8-bit ASCII characters organized into fixed logical records of 80 bytes. A detailed description of the DLG-3 Optional format may be found in the data users guide 3, Digital Line Graphs from 1:2,000,000-scale maps (U.S. Geological Survey, 1990). The major and minor codes used in this application of the DLG format are not the same as those used in many USGS standard digital data products. The "Entity_and_Attribute_Information" section of the metadata file contains more information about the major and minor codes used. A free viewer capable of reading DLG files is available at http://mcmcweb.er.usgs.gov/viewers/dlg_view.html.

The ARC/INFO data sets are stored as coverages (points and lines) or grids (surfaces). ARC/INFO export files with ".E00" file extensions also are provided for the COUNTY and COUNTYDD coverages. A free viewer called ArcExplorer is capable of reading the ARC/INFO data sets. ArcExplorer is available from http://www.esri.com.

A documentation file (known as metadata) is provided for each type (sites, surfaces, and contours) of data set. The documentation files comply with the Federal Geographic Data Committee (FGDC) Content Standards for Digital Geospatial Metadata (Federal Geographic Data Committee, 1994). The FGDC-compliant metadata files contain detailed descriptions of the data sets, and include narrative sections that describe the procedures used to produce the data sets in digital form. The metadata files are stored as HTML files, with ".HTM" file extensions.
 


DATA ORGANIZATION

The data sets and files on the CD-ROM version of this report are organized into one root directory and four main subdirectories. The root directory contains the following files:
 
 
A_NOTICE.TXT Liability disclaimer
INDEX.HTM This file
SITES.HTM Metadata for the sites data sets
SURFACES.HTM Metadata for the surface data sets
CONTOURS.HTM Metadata for the contour data sets
TABLE1.HTM Table containing results of error analysis
COUNTY.HTM Metadata for county data set (from Digital Atlas of Oklahoma)
USGS.GIF USGS logo in Graphics Interchange Format (GIF)
DIR_EXAM.GIF Example graphic of portion of directory structure (GIF)

The four subdirectories contain the following:
 
raw_data Original  storm accumulation data files (see readme.txt)
fnl_data Storm accumulation data files after review and removal of questionable data (see readme.txt)
arc_data Data sets in ARC/INFO format
pd_data Data sets in nonproprietary format

Procedures used to review the storm accumulation data files and remove questionable data are described in the "Data_Quality_Information" section of the metadata for the sites data sets.

The surfaces and contours data sets are intended to estimate precipitation depth values within Oklahoma. A geospatial data set of county boundaries for Oklahoma (Rea and Becker, 1997) is provided for spatial reference. The geospatial data set is named "COUNTY" and is provided in DLG format (\pd_data\county.dlg) and in ARC/INFO coverage format in the arc_data subdirectory. The geospatial data set also is provided in the arc_data subdirectory stored in decimal degrees and named "COUNTYDD".

The arc_data and pd_data subdirectories each contain a subdirectory for each duration. The individual data sets are stored under the duration subdirectories. An example showing the arc_data subdirectory and its 12-hour duration subdirectory is shown below:
 
Each duration subdirectory contains contours, sites, and surfaces data sets. As shown in the example, the ARC/INFO data sets each consist of  directories and a parallel "info" directory. 

A "latlong" subdirectory under each duration subdirectory in arc_data contains copies of the contour and sites data sets stored in decimal degrees. 

The pd_data subdirectory is structured similarly, except the data sets are individual files in delimited ASCII, DLG, or ASCII grid format. The pd_data subdirectory contains the county-boundary data set in DLG format, named "county.dlg". The nonproprietary-format files are provided in Albers Equal Area projection only.


GEOMETRIC REGISTRATION


The Albers Equal Area map projection (Snyder, 1987) was chosen for the data sets. This projection is appropriate for maps of the conterminous United States because of the visual presentation and equal-area characteristics, which facilitates areal analysis. The projection is cast on the North American Datum of 1983. This projection slightly distorts shapes and distances (scale) in order to maintain equal-area properties. Scale is true along the standard parallels, which are to the north and south of Oklahoma. Scale distortion in Oklahoma reaches a maximum of slightly less than 1 percent at the northern border of the state. The following table provides map projection information.

Albers Equal Area projection parameters
[GRS1980, Geodetic Reference System 1980; NAD83, North American Datum 1983]
Projection parameters:
Spheroid GRS1980
Datum NAD83
First standard parallel 29 30 00 North
Second standard parallel 45 30 00 North
Central meridian 96 00 00 West
Latitude of projection origin 23 00 00 North
Coordinate system parameters:
False easting 0
False northing 0
Planimetric units of measure  meters

A "latlong" subdirectory under each duration subdirectory in arc_data contains copies of the contour and sites data sets stored in decimal degrees. The "COUNTYDD" coverage in the arc_data subdirectory also is stored in decimal degrees. These coordinates are based on the North American Datum of 1983.


NOTES

Use of trade names is for descriptive purposes only, and does not imply endorsement by the U.S. Government.

ARC/INFO software was used in the development of the data sets. The data sets were processed using the ARC/INFO Revision 7.1.1 software package, on a SUN Enterpise 4000 running Solaris Version 2.5.1. Final processing used ARC/INFO Revision 7.2.1.
 


REFERENCES


Asquith, W.H., 1998, Depth-duration frequency of precipitation for Texas: U.S. Geological Survey Water-Resources Investigations Report 98-4044, 107 p.

Federal Geographic Data Committee, 1994, Content standards for digital geospatial metadata (June 8): Federal Geographic Data Committee, Washington, D.C., 78 p. URL: https://www.fgdc.gov/Metadata/Metadata.html

Rea, Alan and Becker, C.J., 1997, Digital atlas of Oklahoma: U.S. Geological Survey Open-File Report 97-23, 2 CD-ROMs.

Snyder, J.P., 1987, Map projections--A working manual: U.S. Geological Survey Professional Paper 1395, 383 p.

Tortorelli, R.L., Rea, Alan, and Asquith, W.H., 1999, Depth-duration frequency of precipitation for Oklahoma: U.S. Geological Survey Water-Resources Investigations Report 99-4232, 113 p.

U.S. Geological Survey, 1990, Digital line graphs from 1:2,000,000-scale maps, data users guide 3: U.S. Geological Survey National Mapping Program Technical Instruction, 70 p.



FirstGov button  Take Pride in America button