Scientific Investigations Report 2010–5008
Table 12. Preliminary model statistics for correlation of Escherichia coli bacteria with continuous parameters at Fanno Creek near Durham, Oregon. [No Scenario 2 data sets were available because USGS did not collect E. coli data during the study period. Regression models are of the form E. coli = a*Turb+ b*Q + c*SC + d, where a, b, and c are regression coefficients and d is the intercept, Turb, Q, and SC are the explanatory variables turbidity (in Formazin Nephelometric Units), discharge (in cubic feet per second), and specific conductance (in microsiemens per centimeter), respectively, and E. coli is the dependent variable, E. coli bacteria counts, in colonies per 100 milliliters. In some models, as indicated by the model form column, the dependent or explanatory variables were log transformed. Where E. coli is log transformed, a bias transformation factor (BCF; Duan, 1983) is multiplied by 10(logE. coli) to get the final value. RMSE values are in colonies per 100 mL. The maximum Variance Inflation Factor (VIF) indicates the largest VIF obtained for any one variable in the correlation. Abbreviations: n, number of samples; Adj.-R2, adjusted R2, a coefficient of determination which adjusts for degrees of freedom and penalizes the use of too many explanatory variables; f, a function of indicated constituents; E. coli, Escherichia coli bacteria; log, base 10 logarithm; RMSE, root mean square error; USGS, U.S. Geological Survey; –, not included in the regression]
1Exceeds a threshold VIF value, calculated as {1/(1– (Adj.-R2)} and indicates possible multicollinearity. |
First posted June 18, 2010 For additional information contact: Part or all of this report is presented in Portable Document Format (PDF); the latest version of Adobe Reader or similar software is required to view it. Download the latest version of Adobe Reader, free of charge. |