Beruflich Dokumente
Kultur Dokumente
Part I
Based on
1. “Statistics and Data Analysis in Geology, J.C. Davis, New York, John Wiley & sons, 2nd ed.,1996
2. “Chemometrics: A Textbook, Amsterdam”, D.L. Massart, B.G.M. Vandeginste, S.N. Deming, Y. Michotte,
and L. Kaufman, Elsevier,1988
3. “Course note: Multivariate Data Analysis and Chemometrics”, B. Jørgensen, Department of Statistics,
University of Southern Denmark, 2003
4. “Multi- and Megavariate Data Analysis- Principles and Applications”, L. Eriksson, E. Johansson, N. Kettaneh-Wold,
and S. Wold, UMETRICS, 2001
5. “Matlab” online Manual, The MathWorks, Inc.
PART I:
General Description of Principal Component Analysis
If there exist two variables, y1 and y2 with six samples, it can be shown in 2-dimensional space.
“0” is a centroid of 6 samples after the mean centering.
y2 2
1
y2 0
3 5
2 2 2
4 the spread of data = 01 + 02 + " + 06
6
y1 y1
If 1’, 2’, …, 6’ on the line shown in below figure are the projections of each point from above graph, then
x2 2 2 2
y2 1' 2 0i = 0i ' + ii '
1 2' 3'
0
y2 x1
3 5
4'
6'
Therefore, the maximum value of the total variation is
4 5'
6
2 2 2 2 2 2
the spread of data = 01' + 02 ' + " + 06 ' + 11' + 22 ' + " + 66 '
y1 y1 to be maximized to be minimized
T
X X Covariance Matrix
S = cov( X ) =
m −1
Eigenvalue Decomposition
Sp = λ p
eigenvector
eigenvalue
SP = PΛ
Eigenvalues on the diagonal of
this diagonal matrix
Eigenvectors forming the columns of this matrix
If P is nonsingular, this become the eigenvalue decomposition
Eigenvalue Matrix
−1
S = P ΛP
Loading Matrix=Eigenvector Matrix
6. Properties of ti and pi vectors
Eigenvalue Matrix
−1
S = P ΛP or cov( X ) pi = λi pi
Loading Matrix
=Eigenvector Matrix