As a supplement to the above-described data analysis, a combination of principal components and factor analysis was used to search for relationships among chemical elements that might be useful in petrologic interpretation. Principal components and factor analysis are described by Reyment and Savazzi (1999) and Cooley and Lohnes (1962), respectively. A number of experiments using various methods of principal components and factor analysis were conducted on chemical data on sandstones of the Minturn and Sangre de Cristo Formations. It is important to realize that, in factor analysis, multiple methods, solutions, and interpretations are possible. Choices among methods, number of principal components to rotate, and alternative factor interpretations affect the outcome. However, after repeated experimentation, some consensus is generally possible. The following discussion summarizes two experiments, using the method of log-contrast principal components analysis (method of Reyment and Savazzi, 1999), followed by rotation of principal components and factor interpretation. The results are considered to be representative, although by no means identical, for the experiments conducted. The first experiment presented here utilized data on 50 sandstone samples, for which 17 quantitatively determined chemical constituents (major oxides, loss on ignition, and trace metals) were available. The second experiment utilized the same data, but added data on 12 semi-quantitatively determined trace elements. In both experiments, steps in the analysis were:
The method of log-contrast principal components analysis (Reyment and Savazzi, 1999) was used instead of standard R-mode principal components (factor) analysis (Cooley and Lohnes, 1962) for compositional data. Although the author has found it to be a useful guide to identifying petrologic processes, standard R-mode factor analysis of compositional data is not always satisfactory because the theoretical sum of values in each row (sample) of the data matrix is 100 percent. This property, referred to as the "constant sum problem" in geochemistry (Chayes, 1960), constrains the values of correlation coefficients. Correlations are not free to range between -1 and +1. Correlations for major constituents are forced toward negative values. Moreover, when the number of constituents is reduced and recalculated to 100 percent as is done, for example, during conversion to a volatile-free basis, the correlations change. Solutions to the constant sum problem have been proposed, only to be declared invalid upon subsequent investigation, and the problem does not seem to be completely resolved. The method employed here, termed "log-contrast principal components analysis" (Reyment and Savazzi, 1999), is a newly proposed solution for the constant sum problem.
Although Reyment and Savazzi (1999) provide a convenient DOS program that performs a log-contrast principal components analysis and a traditional (R-mode) principal components analysis for comparison, I also used the program "Statview" to permit analysis of large data matrices and to extend the analysis to rotation of principal components. Using either program, a new matrix of log-ratios is calculated by dividing each cell of a row by the geometric mean of that row and converting the result to its logarithm. A new covariance matrix, termed the "centered log-ratio covariance matrix," is calculated from the log-ratio matrix, and a corresponding centered log-ratio correlation matrix is computed (Table 4). The new correlation matrix is a measure of proportionality between the original variables (columns). Raw data may be expressed either as "percent," "parts per million," or both, if consistent within columns. Finally, in principal components analysis, the centered log-ratio correlation matrix is solved for its roots. Centered log-ratio correlation coefficients are not comparable to correlation coefficients calculated from raw data, but the results of the principal components analyses (latent roots and vectors that specify factor loadings) can be inspected for similarities and differences. In the present analysis, I have relied exclusively on the method of log-contrast analysis.
Selection of the number of principal components to preserve for rotation and factor interpretation is not always obvious, as in the present case. Criteria for selection are discussed by Jackson (1993). The number of principal components selected for rotation can be based on 1) eigenvalue magnitude greater than one, 2) the point on the eigenvalue distribution curve where an obvious change in magnitude occurs (root curve method), 3) eigenvalue distribution curve compared to eigenvalues calculated from random data (broken-stick distribution), 4) maximum communalities under various rotation scenarios, and 5) trial interpretation of rotated factors. Criteria (1), (4), and (5) were used here.
Six principal components were selected for rotation and interpretation. Selection of such a large number of principal components risks interpreting random effects (Jackson, 1993) but, in the present case, solutions involving fewer components yielded low communalities and posed difficulties for interpretation. (Communality is the sum of the squared factor loadings, and is a measure of the degree to which a particular solution accounts for the variance of a variable, in this case a log ratio). The first two eigenvalues account for almost half of the total variance (Table 5) and, if low communalities for many log-ratios were ignored, a simple two-factor solution could be considered. The eigenvalue curve (not shown) becomes flat after the fourth root and 67 pct of the total variance is included, but communalities of some log ratios remain low. In contrast, the first six eigenvalues account for more than 80 pct of total variance (Table 5) and all communalities exceed 0.70 (Table 6). Results of rotating 6 principal components are summarized in Table 7.
Tables summarizing a second experiment, utilizing 12 semi-quantitatively analyzed trace elements in addition to the variables included in the first experiment, are given in Tables 8-11.
Table 4.--17X17 centered log-ratio correlation matrix, log ratio data, Minturn and Sangre de Cristo Formations. N = 50; X, row geometric mean of original data matrix. Click here for Excel file.
Table 5.--Eigenvalues (roots) of 17X17 log-ratio correlation matrix.
|
|
|
|
|
| Value 1 |
|
|
|
| Value 2 |
|
|
|
| Value 3 |
|
|
|
| Value 4 |
|
|
|
| Value 5 |
|
|
|
| Value 6 |
|
|
|
| Value 7 |
|
|
|
| Value 8 |
|
|
|
Table 6.--Communalities for orthogonal rotations, 6 principal components, 17 variables.
| Proportion |
|
| Log (SiO2/X) |
|
| Log (Al2O3/X) |
|
| Log (FeTO3/X) |
|
| Log (MgO/X) |
|
| Log (CaO/X) |
|
| Log (Na2O/X) |
|
| Log (K2O/X) |
|
| Log (TiO2/X) |
|
| Log (P2O5/X) |
|
| Log (MnO/X) |
|
| Log (LOI/X) |
|
| Log (Cu/X) |
|
| Log (Mo/X) |
|
| Log (Pb/X) |
|
| Log (Th/X) |
|
| Log (U/X) |
|
| Log (Zn/X) |
|
Table 7.--Six-factor orthogonal solution, 17 variables, Varimax rotation.
| Proportion |
|
|
|
|
|
|
| Log (SiO2/X) |
|
|
|
|
|
|
| Log (Al2O3/X) |
|
|
|
|
|
|
| Log (FeTO3/X) |
|
|
|
|
|
|
| Log (MgO/X) |
|
|
|
|
|
|
| Log (CaO/X) |
|
|
|
|
|
|
| Log (Na2O/X) |
|
|
|
|
|
|
| Log (K2O/X) |
|
|
|
|
|
|
| Log (TiO2/X) |
|
|
|
|
|
|
| Log (P2O5/X) |
|
|
|
|
|
|
| Log (MnO/X) |
|
|
|
|
|
|
| Log (LOI/X) |
|
|
|
|
|
|
| Log (Cu/X) |
|
|
|
|
|
|
| Log (Mo/X) |
|
|
|
|
|
|
| Log (Pb/X) |
|
|
|
|
|
|
| Log (Th/X) |
|
|
|
|
|
|
| Log (U/X) |
|
|
|
|
|
|
| Log (Zn/X) |
|
|
|
|
|
|
Factor 1--Iron-titanium oxides (detrital grains)--FeTO3, TiO2, P2O5, and Th--placer concentration (provenance).
Factor 2--Potassium feldspar and mica (detrital grains)--SiO2, Al2O3, K2O--chemical maturity (provenance).
Factor 3--Chlorite and/or mica (detrital grains or matrix) versus plagioclase (detrital grains)--MgO, LOI vs Na2O--if detrital mica vs plagioclase destruction, then provenance; if matrix vs plagioclase destruction, then greenschist metamorphism or burial diagenesis.
Factor 4--Manganese oxides vs molybdenite--MnO and Zn vs Mo--weathering of mineralized rock.
Factor 5--Pb--Uninterpreted unique factor.
Factor 6--U--Uninterpreted unique factor.
Table 8.--29X29 centered log-ratio correlation matrix, log ratio data, Minturn and Sangre de Cristo Formations. N = 50; X, row geometric mean of original data matrix. Click here for Excel file.
Table 9.--Eigenvalues (roots) of 29X29 log-ratio correlation matrix.
|
|
|
|
|
| Value 1 |
|
|
|
| Value 2 |
|
|
|
| Value 3 |
|
|
|
| Value 4 |
|
|
|
| Value 5 |
|
|
|
| Value 6 |
|
|
|
| Value 7 |
|
|
|
| Value 8 |
|
|
|
| Value 9 |
|
|
|
| Value 10 |
|
|
|
| Value 11 |
|
|
|
| Value 12 |
|
|
|
| Value 13 |
|
|
|
| Value 14 |
|
|
|
Table 10.--Communalities for orthogonal rotation, 5-7 principal components, 29 variables. h2(5), communality for rotation of five components.
| Proportion |
|
|
|
| Log (SiO2/X) |
|
|
|
| Log (Al2O3/X) |
|
|
|
| Log (FeTO3/X) |
|
|
|
| Log (MgO/X) |
|
|
|
| Log (CaO/X) |
|
|
|
| Log (Na2O/X) |
|
|
|
| Log (K2O/X) |
|
|
|
| Log (TiO2/X) |
|
|
|
| Log (P2O5/X) |
|
|
|
| Log (MnO/X) |
|
|
|
| Log (LOI/X) |
|
|
|
| Log (B/X) |
|
|
|
| Log (Ba/X) |
|
|
|
| Log (Be/X) |
|
|
|
| Log( Co/X) |
|
|
|
| Log (Cr/X) |
|
|
|
| Log (Cu/X) |
|
|
|
| Log (La/X) |
|
|
|
| Log (Mo/X) |
|
|
|
| Log (Ni/X) |
|
|
|
| Log (Pb/X) |
|
|
|
| Log (Sc/X) |
|
|
|
| Log (Sr/X) |
|
|
|
| Log (Th/X) |
|
|
|
| Log (U/X) |
|
|
|
| Log (V/X) |
|
|
|
| Log (Y/X) |
|
|
|
| Log (Zn/X) |
|
|
|
| Log (Zr/X) |
|
|
|
Table 11.--Six-factor orthogonal solution, 29 variables, Varimax rotation.
| Proportion |
|
|
|
|
|
|
| Log (SiO2/X) |
|
|
|
|
|
|
| Log (Al2O3/X) |
|
|
|
|
|
|
| Log (FeTO3/X) |
|
|
|
|
|
|
| Log (MgO/X) |
|
|
|
|
|
|
| Log (CaO/X) |
|
|
|
|
|
|
| Log (Na2O/X) |
|
|
|
|
|
|
| Log (K2O/X) |
|
|
|
|
|
|
| Log (TiO2/X) |
|
|
|
|
|
|
| Log (P2O5/X) |
|
|
|
|
|
|
| Log MnO/X |
|
|
|
|
|
|
| Log (LOI/X) |
|
|
|
|
|
|
| Log (B/X) |
|
|
|
|
|
|
| Log (Ba/X) |
|
|
|
|
|
|
| Log (Be/X) |
|
|
|
|
|
|
| Log (Co/X) |
|
|
|
|
|
|
| Log (Cr/X) |
|
|
|
|
|
|
| Log (Cu/X) |
|
|
|
|
|
|
| Log (La/X) |
|
|
|
|
|
|
| Log (Mo/X) |
|
|
|
|
|
|
| Log (Ni/X) |
|
|
|
|
|
|
| Log (Pb/X) |
|
|
|
|
|
|
| Log (Sc/X) |
|
|
|
|
|
|
| Log (Sr/X) |
|
|
|
|
|
|
| Log (Th/X) |
|
|
|
|
|
|
| Log (U/X) |
|
|
|
|
|
|
| Log (V/X) |
|
|
|
|
|
|
| Log (Y/X) |
|
|
|
|
|
|
| Log (Zn/X) |
|
|
|
|
|
|
| Log (Zr/X) |
|
|
|
|
|
|
Factor 1--Mafic vs base trace metals--Cr and V vs Cu and Pb--uninterpreted factor, possibly representing mineralization.
Factor 2--Potassium feldspar and mica (detrital grains)--SiO2, Al2O3, K2O, Ba, and La vs Co-- chemical maturity (provenance).
Factor 3--Iron-titanium oxides and zircon (detrital grains)--FeTO3, TiO2, P2O5, Th, and Zr vs Be--near-source placer concentration (provenance).
Factor 4--Manganese oxides vs molybdenite--MnO vs Mo--weathering of mineralized rock.
Factor 5--Chlorite and/or mica--MgO, LOI, B, and Ni vs Y--if detrital mica, then provenance; if matrix, then greenschist metamorphism or burial diagenesis.
Factor 6--Th-U mineral vs plagioclase--Th, U, and Zn vs CaO, Na2O, and Sr--destruction of plagioclase (provenance or metamorphism).