next up previous
Next: About this document ... Up: Tectonic discrimination diagrams revisited Previous: List of Figures

Figures

Figure 1: Discriminant analysis of three classes with equal covariance matrices leads to linear discriminant boundaries. The ellipses mark arbitrary (e.g., 95%) confidence levels for the underlying populations.
Image discriminant

Figure 2: Similarities and differences between linear discriminant and principal component analysis. x1 and x2 are the original variables, pc1 and pc2 are the principal components and ld1 and ld2 are the linear discriminant functions.
Image PCAvsLDA

Figure 3: One of the consequences of the constant-sum constraint of compositional data is that the arithmetic mean (marked by the open square) of populations (black dots) has no physical meaning. Instead, the geometric mean should be used (open circle).
Image closure

Figure 4: X, Y and Z are uncorrelated, uniform random numbers. The strong spurious correlation of the ratios Y/Z and X/Z is an artifact of the relatively large variance of Z relative to X, Y and Z.
Image spurious2

Figure 5: 95% normal confidence regions (e.g., Weltje, 2002) for synthetic trivariate compositional data partly fall outside the ternary diagram, a nonsense result illustrating the dangers of performing ``traditional'' statistics on the simplex.
Image closure_ternary_wrong2

Figure 6: Following Aitchison (1986), the statistical problems of Figure 5 can be avoided by mapping the data from the simplex $ \Delta _2$ to R2 using the logratio transformation.
Image closure_mapping

Figure 7: Linear discriminant analysis using the crude covariance approach of Figure 5. The red-shaded contours of the first three ternary diagrams represent the posterior probabilities for the three classes. The last diagram shows the linear decision boundaries. 10% of the training data are misclassified.
Image closure_discriminant_wrong

Figure 8: The same data of Figure 7, mapped to logratio-space using the approach illustrated by Figure 6. Linear discriminant analysis of these bivariate data misclassifies only 3% of the training data.
Image closure_binary_discriminant_right

Figure 9: Mapping the results of Figure 8 back to the ternary diagram with the inverse logratio transformation shown on Figure 6 yields curved posterior densities and decision boundaries.
Image closure_ternary_discriminant_right

Figure 10: Locations of the training data: 756 Island Arc (IAB), Mid Ocean Ridge (MORB) and Ocean Island (OIB) Basalts.
Image sample_locations

Figure 11: Linear discriminant analysis (LDA) of the Ti-V system of Shervais (1982). The red-shaded contours on the first three subplots show the posterior probability of a particular ``class'' (IAB, MORB, or OIB) given the training set of 756 basalt samples and a uniform prior. The last subplot (lower-right) shows the new decision boundaries. The number of training data used and a resubstitution error estimate are given for each of the tectonic affinities. The overall resubstitution error is shown above the lower-right subplot.
Image Ti_V_lin

Figure 12: Quadratic discriminant analysis (QDA) of the Ti-V system. In contrast with the LDA of Figure 11, each tectonic ``class'' was allowed to have a different covariance matrix, resulting in slightly different decision boundaries.
Image Ti_V_quad

Figure 13: Linear discriminant analysis of the Ti-Zr system of Pearce and Cann (1973).
Image Ti_Zr_lin

Figure 14: Quadratic discriminant analysis of the Ti-Zr system.
Image Ti_Zr_quad

Figure 15: Linear discriminant analysis of the Ti-Zr-Y system of Pearce and Cann (1973). The posterior probabilities of nearly all the IAB and MORB training data are low ($ <$0.4), resulting in large misclassification rates for these affinities. As noted by Pearce and Cann (1973), the Ti-Zr-Y diagram can be used to separate OIBs from IAB/MORBs, but cannot be used to distinguish between IAB and MORB. For this purpose, the Ti-Zr diagram (Figure 13) can be used.
Image Ti_Zr_Y_lin

Figure 16: Quadratic discriminant analysis of the Ti-Zr-Y system. The OIB/IAB decision boundary (at low Y) is nearly identical to that of Figure 15, whereas there is a lot more (unstable) structure at higher Y concentrations.
Image Ti_Zr_Y_quad

Figure 17: Linear discriminant analysis of the Zr-Y-Nb system of Meschede (1986). Like in Figure 15, posterior IAB and MORB probabilities are low, resulting in high misclassification rates.
Image Nb_Zr_Y_lin

Figure 18: Quadratic discriminant analysis of the Zr-Y-Nb system.
Image Nb_Zr_Y_quad

Figure 19: Linear discriminant analysis of the Th-Ta-Hf system of Wood (1980).
Image Th_Ta_Hf_lin

Figure 20: Quadratic discriminant analysis of the Th-Ta-Hf system.
Image Th_Ta_Hf_quad

Figure 21: Linear discriminant analysis of the Ti-Zr-Y-Sr system. ld1 and ld2 are the two linear discriminant functions, given by Equation 7. They represent two projection planes that optimally separate the three tectonic affinities (IAB, MORB, and OIB) (see also Figure 2). The encircled numbers on the lower right subplot are ``anchor points'' that can be used by the user to reconstruct the decision boundaries in logratio-space. The ld1/ld2 coordinates of these anchor points are given in Table 6.
Image discrimiFunc2

Figure 22: Linear discriminant analysis of major element data (SiO$ _2$, Al$ _2$O$ _3$, TiO$ _2$, CaO, MgO, MnO, K$ _2$O, Na$ _2$O), mapped to R2 using the logratio transformation. ld1 and ld2 are given by Equation 8. Anchor points are given in Table 6.
Image discrimiFunc1

Figure 23: Visual representation of the performance of all possible bivariate linear discriminant analyses using the major element data of the training set of 756 oceanic basalts. The upper right triangular section of each matrix shows the number of samples that contained both variables. The lower left sections color-code the fraction of successfully classified training data.
Image xPlotMajor2_linear_IAB Image xPlotMajor2_linear_MORB
Image xPlotMajor2_linear_OIB Image xPlotMajor2_linear_err

Figure 24: Same as Figure 23, but for quadratic discriminant analysis.
Image xPlotMajor2_quadratic_IAB Image xPlotMajor2_quadratic_MORB
Image xPlotMajor2_quadratic_OIB Image xPlotMajor2_quadratic_err

Figure 25: Matrices showing the performance of all possible bivariate linear discriminant analyses using combinations of 45 elements.
Image xPlotTrace2_linear_IAB Image xPlotTrace2_linear_MORB
Image xPlotTrace2_linear_OIB Image xPlotTrace2_linear_err

Figure 26: Same as Figure 25, but for quadratic discriminant analysis.
Image xPlotTrace2_quadratic_IAB Image xPlotTrace2_quadratic_MORB
Image xPlotTrace2_quadratic_OIB Image xPlotTrace2_quadratic_err

Figure 27: Performance analysis of all possible ternary discriminant analyses using TiO$ _2$ and other major element oxides.
Image xPlotMajor3Ti_linear_IAB Image xPlotMajor3Ti_linear_MORB
Image xPlotMajor3Ti_linear_OIB Image xPlotMajor3Ti_linear_err

Figure 28: Same as Figure 27, but using quadratic discriminant analysis.
Image xPlotMajor3Ti_quadratic_IAB Image xPlotMajor3Ti_quadratic_MORB
Image xPlotMajor3Ti_quadratic_OIB Image xPlotMajor3Ti_quadratic_err

Figure 29: Performance analysis of all possible ternary discriminant analyses using Ti and two of 45 other elements.
Image xPlotTrace3Ti_linear_IAB Image xPlotTrace3Ti_linear_MORB
Image xPlotTrace3Ti_linear_OIB Image xPlotTrace3Ti_linear_err

Figure 30: Same as Figure 29, but using quadratic discriminant analysis.
Image xPlotTrace3Ti_quadratic_IAB Image xPlotTrace3Ti_quadratic_MORB
Image xPlotTrace3Ti_quadratic_OIB Image xPlotTrace3Ti_quadratic_err

Figure 31: The best ternary linear discriminant analysis, using Si, Ti, and Sr.
Image Si_Ti_Sr_lin

Figure 32: Linear discriminant analysis using Eu, Lu, and Sr.
Image Eu_Lu_Sr_lin

Figure 33: The best performing linear discriminant analysis using only incompatible elements (Ti, V and Sc).
Image V_Ti_Sc_lin

Figure 34: The best performing quadratic discriminant analysis, using Na, Nb and Sr.
Image Na_Nb_Sr_quad

Figure 35: The best performing quadratic discriminant analysis using only incompatible elements (Ti, V and Sm).
Image V_Ti_Sm_quad

Figure 36: Illustration of the bias-variance tradeoff in a regression context. The thick gray line is the true model (Y = X$ ^4$). The white circles are 50 samples with random normal errors. The dashed line is the interpolator, which is one of infinitely many functions that go through all the datapoints and, thus, have zero bias. The solid black line is a linear regression model, which has a large bias but small variance. In this case, the fourth order polynomial (blue) is the best predictor of future behavior. Although it has larger bias than the 50th order polynomial (red) and larger variance than the first order polynomial (straight black line), it minimizes the mean squared error (MSE = variance + bias$ ^2$).
Image biasVariance

Figure 37: The test data (116/182 used) plotted on various versions of the Ti-V diagram with: a. the original decision boundaries of Shervais (1982), drawn by eye; b. LDA on the logratio-plot, with anchor points 1-4 given in Table 6; c. QDA on the logratio-plot, d. the same LDA as in subplot b., but this time mapped back to the ``traditional'' compositional data space; e. the QDA of subplot c. mapped back to Ti-V space. An error analysis of these and subsequent diagrams is given in Tables 5 and 7.
a. Image Shervais
b. Image log_Ti_V_lin c. Image log_Ti_V_q d. Image test_Ti_V_lin e. Image test_Ti_V_q

Figure 38: The test data (89/182 used) plotted on the Ti-Zr diagram with: a. the original decision boundaries of Pearce and Cann (1973); b-e as in Figure 37.
a. Image PearceTiZr
b. Image log_Ti_Zr_lin c. Image log_Ti_Zr_q d. Image test_Ti_Zr_lin e. Image test_Ti_Zr_q

Figure 39: The test data (85/182 used) plotted on the Ti-Zr-Y diagram with: a. the original decision boundaries of Pearce and Cann (1973); b-e as in Figure 37.
a. Image PearceTiZrY
b. Image log_Ti_Zr_Y_lin c. Image log_Ti_Zr_Y_q d. Image test_Ti_Zr_Y_lin e. Image test_Ti_Zr_Y_q

Figure 40: The test data (58/182 used) plotted on the Nb-Zr-Y diagram with: a. the original decision boundaries of Meschede (1986); b-e as in Figure 37.
a. Image Meschede
b. Image log_Nb_Zr_Y_lin c. Image log_Nb_Zr_Y_q d. Image test_Nb_Zr_Y_lin e. Image test_Nb_Zr_Y_q

Figure 41: The test data (36/182 used, but no MORBs!) plotted on the Th-Ta-Hf diagram with: a. the original decision boundaries of Wood (1980); b-e as in Figure 37.
a. Image Wood
b. Image log_Th_Ta_Hf_lin c. Image log_Th_Ta_Hf_q d. Image test_Th_Ta_Hf_lin e. Image test_Th_Ta_Hf_q

Figure 42: The test data (164/182 used) plotted on the Si-Ti-Sr LDA diagram with: a. the decision boundaries and anchor points (see Table 6) in log-ratio space; b. the decision boundaries mapped back to the simplex.
a. Image log_Si_Ti_Sr_lin b. Image test_Si_Ti_Sr_lin

Figure 43: The test data (103/182 used) plotted on the Eu-Lu-Sr LDA diagram; a & b as in Figure 42.
a. Image log_Eu_Lu_Sr_lin b. Image test_Eu_Lu_Sr_lin

Figure 44: The test data (72/182 used) plotted on the Ti-V-Sc LDA diagram; a & b as in Figure 42.
a. Image log_V_Ti_Sc_lin b. Image test_V_Ti_Sc_lin

Figure 45: The test data (61/182 used) plotted on the Na-Nb-Sr QDA diagram with: a. the decision boundaries in log-ratio space; b. mapped back to $ \Delta _2$.
a. Image log_Na_Nb_Sr_q b. Image test_Na_Nb_Sr_q

Figure 46: The test data (85/182 used) plotted on the Ti-V-Sm QDA diagram; a & b as in Figure 45.
a. Image log_Ti_V_Sm_q b. Image test_Ti_V_Sm_q


next up previous
Next: About this document ... Up: Tectonic discrimination diagrams revisited Previous: List of Figures
Pieter Vermeesch 2005-11-21