Reconstructing missing data by comparing interpolation techniques: Applications for long-term water quality data
Links
- More information: Publisher Index Page (via DOI)
- Data Releases:
- USGS data release - Dataset from the Upper Mississippi River Restoration Program (1993-2019) to reconstruct missing data by comparing interpolation techniques
- USGS data release - Interpolating missing water quality data
- Open Access Version: Publisher Index Page
- Download citation as: RIS | Dublin Core
Abstract
Missing data are typical yet must be addressed for proper inferences or expanding datasets to guide our limnological understanding and management of aquatic systems. Interpolation methods (i.e., estimating missing values using known values within the dataset) can alleviate data gaps and common problems. We compared seven popular interpolation methods for predicting substantial missingness in a long-term water quality dataset from the Upper Mississippi River, U.S.A. The dataset included 80,000 sampling sites collected over 30 yr that had substantial missingness for total nitrogen (TN), total phosphorus (TP), and water velocity. For all three interpolated water quality variables, random forests had very high prediction accuracy and outperformed the methods of ordinary kriging, polynomial regressions, regression trees, and inverse distance weighting. TP had a mean absolute error (MAE) of 0.03 mg (L-TP)−1, TN had a MAE of 0.39 mg (L-TN)−1, and water velocity had a MAE of 0.10 m s−1. The random forests' error rates were mapped and showed low spatiotemporal variability across the riverscape, indicating high model performance across many habitat types and large spatial scales. In the current era of “big data,” interpolation becomes an imperative step prior to ecological analyses yet remains unfamiliar and underutilized. Our research briefly describes the importance of addressing missingness and provides a roadmap to conduct model intercomparisons of other big datasets. We also share adaptable data analysis scripts, which allows others to readily conduct interpolation comparisons for many limnology applications and contexts.
Study Area
Publication type | Article |
---|---|
Publication Subtype | Journal Article |
Title | Reconstructing missing data by comparing interpolation techniques: Applications for long-term water quality data |
Series title | Limnology and Oceanography: Methods |
DOI | 10.1002/lom3.10556 |
Volume | 21 |
Issue | 2 |
Year Published | 2023 |
Language | English |
Publisher | Wiley |
Contributing office(s) | Upper Midwest Environmental Sciences Center |
Description | 15 p. |
First page | 435 |
Last page | 449 |
Country | United States |
State | Illinois, Iowa, Minnesota, Missouri, Wisconsin |
Other Geospatial | Upper Mississippi River |
Google Analytic Metrics | Metrics page |