Stability and bias of classification rates in biological applications of discriminant analysis

Journal of Wildlife Management
By: , and 

Metrics

11
Crossref references
Web analytics dashboard Metrics definitions

Links

Abstract

We assessed the sampling stability of classification rates in discriminant analysis by using a factorial design with factors for multivariate dimensionality, dispersion structure, configuration of group means, and sample size. A total of 32,400 discriminant analyses were conducted, based on data from simulated populations with appropriate underlying statistical distributions. Simulation results indicated strong bias in correct classification rates when group sample sizes were small and when overlap among groups was high. We also found that stability of the correct classification rates was influenced by these factors, indicating that the number of samples required for a given level of precision increases with the amount of overlap among groups. In a review of 60 published studies, we found that 57% of the articles presented results on classification rates, though few of them mentioned potential biases in their results. Wildlife researchers should choose the total number of samples per group to be at least 2 times the number of variables to be measured when overlap among groups is low. Substantially more samples are required as the overlap among groups increases
Publication type Article
Publication Subtype Journal Article
Title Stability and bias of classification rates in biological applications of discriminant analysis
Series title Journal of Wildlife Management
DOI 10.2307/3809051
Volume 54
Issue 2
Year Published 1990
Language English
Publisher Wiley
Contributing office(s) Patuxent Wildlife Research Center
Description 11 p.
First page 331
Last page 341
Additional publication details