U.S. DEPARTMENT OF THE INTERIOR
U.S. GEOLOGICAL SURVEY

Text only version

Using Logistic Regression To Predict the Probability of Debris Flows Occurring in Areas Recently Burned By Wildland Fires

By Michael G. Rupert, Susan H. Cannon, and Joseph E. Gartner
Open-File Report 03-500

Abstract

Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.

Maps predicting the probability of debris flows were developed in four steps

(1) The first step was to delineate basins that did and did not produce debris flows using National Elevation Data (NED). A total of 399 basins located within 15 wildland fires in 4 western states were delineated. As an example, basins from the Missionary Ridge Fire in Colorado are shown in figure 1.

(2) The second step was to determine the burn severity (fig. 2), geology, land surface gradient, rainfall, and soil properties for each of the 15 wildland fires.

(3) The third step was to observe statistical correlations of data from all 15 wildland fires and build the logistic regression models (table 1) by evaluating all possible combinations of variables.

Univariate correlations can be a good initial indicator of which independent variables might be significant in the multivariate models. Univariate correlations are usually observed first to explore the data (table 1).

Multivariate models were constructed by evaluating every possible combination of variables shown in table 1. Several logistic regression models were constructed using different combinations of variables, and then the most effective model was selected.

Table 1. Summary of univariate and multivariate correlations between debris flow occurrence and independent variables describing burn severity,
geology, land surface gradient, rainfall, soil properties, and variable interactions using data from 15 wildland fires in Colorado, Idaho, Montana, and
New Mexico, 2000-2002.
[GE, greater than or equal to; mm, millimeters; hr, hour; >, greater than; < less than; X, denotes statistically significant variable in multivariate correlations]

(a) Spearman's rho is a nonparametric rank-order test, similar to the student's t-test. Variables and models with no correlation with debris flows have spearman's rho numbers near zero, and variables with perfect correlations have numbers near plus or minus one.

(b) The Wilcoxon test is a nonparametric test to determine if there is a significant difference between areas with and without debris flows. As the effectiveness of the models improves, the Wilcoxon p-values approach zero.

(c) McFadden's rho is calculated by logistic regression, and conceptually is similar to an r-squared in linear regression. A McFadden's rho value of zero means there is no correlation; values between 0.2 and 0.4 denote significant correlation.

(4) The fourth step was to enter the most effective model into the GIS, and construct maps predicting the probability of debris flows occurring. As an example, the probability of debris flows occurring in the Missionary Ridge Fire is shown in figure 3.

Probability of Debris Flow= e (-29.693 + 2.864*SL + 10.697*BS - 9.875*SO + 0.208*SI + 5.729*OM - 0.957*P + 9.351*D - 8.335*BS*OM + 4.669*SO*D - 0.174*SL*SI)

(-29.693 + 2.864*SL + 10.697*BS - 9.875*SO + 0.208*SI + 5.729*OM - 0.957*P + 9.351*D - 8.335*BS*OM + 4.669*SO*D - 0.174*SL*SI)1 + e

Where:
SL = Percent of basin with slope greater than or equal to 30 percent
BS = Burn severity category
SO = Partical size sorting, in Phi units
SI = Average storm intensity, in millimeters per hour
OM = Soil organic matter, in percent by weight
P = Soil permeability, in inches per hour
D = Soil drainage category

References

Hosmer, D.W., and Lemeshow, S., 1989, Applied logistic regression: New York, John Wiley & Sons, Inc., 307 p.

Schwarz, G.E. and Alexander, R.B. 1995, State Soil Geographic (STATSGO) Data base for the conterminous United States: U.S. Geological Survey Open-File Report 95-449, http://water.usgs.gov/lookup/getspatial?ussoils

Acknowledgments

Montana Bureau of Mines and Geology, U.S.D.A. Forest Service, U.S. Geological Survey

Figure and Table captions

Figure 1. Basins with and without debris flows, Missionary Ridge Fire, 2002, near Durange, Colorado.

Figure 2. Burn severity at the Missionary Ridge Fire, 2002.

Figure 3. Probability of a debris flow occurring in response to a 25-year rain event (33 mm in one hour), Missionary Ridge Fire, 2002, using model M1.

Table 1. Summary of univariate and multivariate correlations between debris flow occurrence and independent variables describing burn severity, geology, land surface gradient, rainfall, soil properties, and variable interactions using data from 15 wildland fires in Colorado, Idaho, Montana, and New Mexico, 2000-2002.


Water Resources of Colorado
Contact: webmaster_co@usgs.gov

U.S. Department of the Interior, U.S. Geological Survey
Maintainer: Publications Team
Last update: 08:57:01 Thu 15 Jan 2004
Privacy Statement || Disclaimer || Accessibility
URL: http://pubs.water.usgs.gov/ofr03-500

FirstGov button  Take Pride in America button