Scientific Investigations Report 2015–5075
AbstractNitrate-concentration data are used in conjunction with land-use and land-cover data to estimate median nitrate concentrations in groundwater underlying the New Jersey (NJ) Highlands Region. Sources of data on nitrate in 19,670 groundwater samples are from the U.S. Geological Survey (USGS) National Water Information System (NWIS) and the NJ Private Well Testing Act (PWTA). In a study conducted by the USGS, in cooperation with the New Jersey Department of Environmental Protection, logistic regression was used to relate measured nitrate concentrations to five explanatory variables (percent urban and agricultural land use, septic-system density, total length of streams, and number of known contaminated sites) quantified in 610-meter-square grid cells. A method for calculating the median concentrations of nitrate from a series of logistic regression models was developed. Two calibration and two validation procedures showed that the logistic-regression-based method can estimate groundwater-nitrate concentrations in the Highlands Region accurately to within 0.1 milligram per liter as nitrogen (mg/L as N). Limitations of the logistic-regression-based method include the inability to select a logistic model with exactly 0.5 probability of exceeding the threshold value and lack of an algorithm to directly calculate the median value. Quantile regression was evaluated as a suitable alternative and was slightly less accurate than the logistic-regression method in estimating median groundwater nitrate concentrations in the Highlands Region. Multiple-linear regression with log-transformed nitrate-concentration data and the same five explanatory values was less accurate than either logistic or quantile regression in estimating median nitrate concentrations. On the basis of 4,516 2000 x 2000 foot grid cells that contain wells with data stored in NWIS and the PWTA database, the estimated median nitrate concentration for the entire Highlands Region is about 1.25 mg/L as N, and estimated median concentrations range from about 1.05 to 1.78 mg/L as N among 11 smaller administratively defined areas within the Highlands Region that vary in percentages of urban land use, agricultural land use, and septic-system density. The Kaplan-Meier method of estimating summary statistics from left-censored data was applied in order to include nondetects (left-censored data) in median nitrate-concentration calculations. Median concentrations also were determined using three alternative methods of handling nondetects. Treatment of the 23 percent of samples that were nondetects had little effect on estimated median nitrate concentrations because method detection limits were mostly less than median values. |
First posted July 9, 2015
For additional information, contact: Part or all of this report is presented in Portable Document Format (PDF). For best results viewing and printing PDF documents, it is recommended that you download the documents to your computer and open them with Adobe Reader. PDF documents opened from your browser may not display or print as intended. Download the latest version of Adobe Reader, free of charge. More information about viewing, downloading, and printing report files can be found here. Part of this report is presented in Microsoft Excel format (XLSX); the latest version of Excel Viewer is required to view it. Download the latest version of Excel Viewer, free of charge. |
Baker, R.J., Chepiga, M.M., and Cauller, S.J., 2015, Median nitrate concentrations in groundwater in the New Jersey Highlands Region estimated using regression models and land-surface characteristics (ver.1.1, August 2015): U.S. Geological Survey Scientific Investigations Report 2015–5075, 27 p., http://dx.doi.org/10.3133/sir20155075.
ISSN 2328-0328 (online)
Abstract
Introduction
Method of Study
Summary and Conclusions
References Cited
Appendix 1. Example spreadsheet for calculating median nitrate concentrations with logistic-regression models
Appendix 2. Geographic and environmental characteristics evaluated as possible explanatory variables in models of median nitrate concentrations in groundwater in the NJ Highlands Region