Missing data in ecology: Syntheses, clarifications, and considerations

Ecological Monographs
By: , and 

Links

Abstract

In ecology and related sciences, missing data are common and occur in a variety of different contexts. When missing data are not handled properly, subsequent statistical estimates tend to be biased, inefficient, and lack proper confidence interval coverage. Missing data are often grouped into three categories: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). We review each category and compare their benefits and drawbacks. We review several approaches to handling missing data including complete case analysis, imputation, inverse probability weighting, and data augmentation. We clarify what types of variables should accompany imputation methods and how those variables are influenced by the analysis methods. Additionally, we discuss missing data that lack a formal basis for measurement and hence are fundamentally different from MCAR, MAR, and MNAR missing data. Throughout, we introduce concepts and numeric examples using both simulated data and data from the United States Environmental Protection Agency's 2016 National Wetland Condition Assessment. We conclude by providing five considerations for ecologists and other scientists handling missing data.

Publication type Article
Publication Subtype Journal Article
Title Missing data in ecology: Syntheses, clarifications, and considerations
Series title Ecological Monographs
DOI 10.1002/ecm.70037
Volume 95
Issue 4
Publication Date November 05, 2025
Year Published 2025
Language English
Publisher Elsevier
Contributing office(s) Northern Rocky Mountain Science Center
Description e70037, 41 p.
Additional publication details