Beruflich Dokumente
Kultur Dokumente
Engineering Geology
j o u r n a l h o m e p a g e : w w w. e l s ev i e r. c o m / l o c a t e / e n g g e o
School of Mining Engineering, University of Tehran, P.O. Box 11365-4563, Tehran, Iran
School of Mining, Petroleum and Geophysics Engineering, Shahrood University of Technology, P.O. Box 3619995161-316, Shahrood, Iran
c
School of Electrical & Computer Engineering, Control and Intelligent Processing, Center of Excellence, University of Tehran, P.O. Box 11365-4563, Tehran, Iran
d
Department of Petroleum Engineering, Curtin University of Technology, P.O. Box U1987, WA, Australia
b
a r t i c l e
i n f o
Article history:
Received 11 April 2010
Received in revised form 26 December 2010
Accepted 15 January 2011
Available online 22 January 2011
Keywords:
Joint set
Joint properties
Parzen
K-means clustering
Principal component analysis
a b s t r a c t
Up to 10 properties of joints can be recorded in the eld, yet only two (dip and dip direction) are commonly
used to identify joint sets. This paper investigates some of the shortcomings of commonly employed methods
for joint set clustering, based on an analysis of synthetic and eld data. First, eight synthetic joint sets were
generated using a normal distribution of joint orientations. Each joint was dened in terms of four properties
(dip, dip direction, inlling material and inlling percentage). A Parzen classier was used to conrm the
importance of using all the joint properties in identifying the joint sets. To investigate the generalization
ability of this approach, the analysis was extended to 178 joints measured in the eld, with seven properties
available for each joint. Joints were clustered based on rose diagrams, stereonets, and K-means clustering
methods, yielding three, ve, and seven joint sets, respectively. Calculation of the coefcient of variation and
principal component analysis (PCA) of joint properties resulted in an improvement in clustering, provided
that a large number of joint properties are considered.
2011 Elsevier B.V. All rights reserved.
1. Introduction
One of the essential steps during the early stages of eld
investigations in most geological, mining, geotechnical, and petroleum exploration projects is to undertake a survey of joints, for which
one of the aims is to distinguish different joint sets. The dip and dip
direction are the most common geometrical properties employed in
joint clustering, with clusters of joints being displayed graphically on
rose diagrams and stereonets. Individual rose diagrams enable just
one joint property (i.e., dip direction or strike) to be plotted, whereas
two joint properties (dip and dip direction or strike) can be shown on
a stereonet. Rose diagrams and stereonets are interpreted visually,
meaning that the interpretation may be subjective, depending on the
interpreter's experience.
The shortcomings of rose diagrams and stereonets can be better
understood when we consider that on some occasions it is necessary
to consider more than two properties of joints for clustering (e.g., two
joints with similar dip and dip direction, but different apertures, have
contrasting effects on uid ow). Of course it should be mentioned
that from an engineering viewpoint, the most important attribute is 3D orientation which answers the question "Is failure kinematically
76
(a)
Table 1
Semi-quantitative coding used to describe inlling percentage and inlling material for
synthetic joints.
Inlling material
Empty
Clay
Calcite
(b)
Inlling percentage
No lling
Half lled
Fully lled
0
0.5
1
0
0.5
1
correct number of joint sets. Hammah and Curran (1998) investigated the optimal delineation of sets of discontinuities using a fuzzy
clustering algorithm, and Hammah and Curran (1999, 2000)
investigated the use of a fuzzy K-Means algorithm for joint set
clustering.
Sirat and Talbot (2001) studied the application of self-organizing
maps of articial neural networks in joint set clustering, as applied at
the Aspo Hard Rock Laboratory, Sweden, where they observed a good
match between their model results and eld observations. Zhou and
Maerz (2001, 2002) applied multivariate clustering analysis to
discontinuity data collected from various sites. The authors developed
various visualization tools, including a 3D stereonet that enables the
use of three joint properties for the identication of joint sets.
Marcotte and Henry (2002) developed an automatic procedure to the
identication of joint sets. Jimenez-Rodriguez and Sitar (2006)
proposed a spectral method for clustering sets of rock discontinuities.
The performance of their algorithm was assessed from benchmark
test cases, using data sets compiled from eld measurements. Jimenez
(2007) investigated the importance of fuzzy logic in joint set
clustering, using both synthetic and real data, and demonstrated
that a fuzzy approach is applicable in this regard. Tokhmechi et al.
(2008) used a K-means clustering algorithm to cluster real joint data,
obtaining six joint sets when seven joint properties are taken into
account, but just three sets when only dip and dip direction are
considered. Finally, Tokhmechi et al. (2009a,b) utilized Bayesian and
multi-layered perceptron neural networks to investigate shortcomings in conventional joint set clustering methods in which various
joint properties are considered.
In the present paper, a Parzen classier is adopted to classify eight
synthetically generated joint sets. Subsequently, a K-means clustering
approach is applied to the clustering of real joint set data. The aim of
this investigation is to assess the importance of using all measurable
joint properties when identifying joint sets.
S
Fig. 1. a) Rose diagram and b) contoured stereonet of pole density for synthetically
generated joints.
x i i
i
Vn = hn ;
kn
;
Nn Vn
where kn is the number of data in class n located in the hyperspace, Nn is the entire training data of class n, and Vn is the volume
of hyper-space, as found from Eq. (2).
f) x0 belongs to the class with the largest value of f x0 .
g) Repeat the above steps for the entire test data (test data must be
selected randomly).
2. Methodology
In undertaking any system of classication, classes are employed
that contain members dened in terms of their properties. The aim is
to determine whether new, undened members belong to any of the
primary classes. In clustering, data are also dened based on their
properties; however, the number of classes and the dependency of
data on the classes remain unknown. The aim of clustering is to
determine the optimum number of classes and the optimum
distribution of the data among the classes. Parzen and K-means are
examples of classication and clustering techniques, respectively, and
each are considered below.
Table 2
Range of properties of synthetic joint sets.
Join Property
15-95
20-60
1
1
20-90
35-65
1
0.5
70-130
30-60
1
1
70-130
20-70
1
0.5
90-140
30-80
0.5
1
190-270
30-55
1
1
180-270
35-65
0
0
200-300
40-70
1
1
d) Calculate the distances between
a randomly chosen joint x j and
the mean of K joint sets k :
Infilling Percentage
(a)
1
1, 2, 3 & 4
0.5
5
60
Dip
30
90
0 0
djk = x j k ;
2
Joint Set 1
Joint Set 2
Joint Set 3
Joint Set 4
Joint Set 5
Joint Set 6
Joint Set 7
Joint Set 8
6&8
0
90
270
180
360
ection
Dip Dir
djk ;
k = 1 x j wk
Joint Set 1
Joint Set 2
Joint Set 3
Joint Set 4
Joint Set 5
Joint Set 6
Joint Set 7
Joint Set 8
6&8
0.5
2&4
0
90
60
Dip
30
0
90
180
270
1
1, 3, & 5
e) Move the joint to the joint set for which the mean vector distance
to the joint is minimized.
f) Repeat the above steps for all joints.
g) Calculate the cost function (J), which shows the accumulated
distance of the mean of all joint sets with their joints, as follows:
J=
(b)
Infilling Material
77
ection
Dip Dir
k
k
min min dmin i ; j
i=1 j=1
D U =
forij;
k
max dmax l ; l
l=1
Fig. 2. Eight synthetic joint sets plotted in 3D space dened by a) dip, dip direction, and
inlling percentage, and b) dip, dip direction, and inlling material.
1
x;
Nk x i wk j
where Nk is the number of joints in the kth joint set, and wk and
x j are the kth joint set and the property vector of the jth joint in that
set, respectively.
7.6
7.2
6.8
6.4
6.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
hn
Fig. 3. Optimizing hn in a Parzen classier (all four joint properties, i.e., dip, dip
direction, inlling material and inlling percentage, are considered).
78
(a)
1
1
2
3
4
5
6
7
8
(b)
Decided Classes
2
0.92 0 0.08 0 0 0 0 0
0 0.96 0 0.04 0 0 0 0
0.04 0 0.96 0 0 0 0 0
0
0 0 0.92 0 0.06
0.02 0
0
0
0
0 0 0 1 0
0
0
0 0 0.12 0 0.88
0
0
0
0
Real Classes
0
5 0
6 0.02
0 0.92 0 0.06
0 0.12 0 0.88
7 0
8 0
(d)
(c)
Decided Classes
1 2
3
4
5
6
1 0.70 0.22 0.08 0
0
0
2 0.08 0.86 0.02 0 0.04 0
3 0.04 0 0.76 0.20 0
0
0
0
0 0.64
6 0.02 0
0
0
0
0 0.38
7 0
0
0
0
0 0.06
8 0
Decided Classes
3
4 5
6
7
8
0
0
0
0
0
0
0
0
Real Classes
0
0
0.30 0.04
0.52 0.10
0.14 0.80
Decided Classes
1
2
3
4
5
6
7
8
10.58 0.36 0.06 0
0
0
0
0
20.66 0.28 0.02 0.04 0
0
0
0
0
0
3 0 0.04 0.46 0.24 0.26 0
0
0
0 0.58 0.28 0.14 0
4 0
Real Classes
0 0.12 0.02 0.86 0
0
0
5 0
0
0
0
0 0.54 0.32 0.14
6 0
0
0
0
0 0.56 0.32 0.12
7 0
0
0
0
0 0.14 0.24 0.62
8 0
Fig. 4. Parzen confusion matrices constructed with optimum hn and considering a) four joint properties (dip, dip direction, inlling material, and inlling percentage), b) three joint
properties (dip, dip direction, and inlling percentage), c) two joint properties (dip and dip direction), and d) one joint property (dip direction).
The classier was trained using 70% of the joints in each joint set
(selected randomly) and tested using the remaining 30% of the joints.
The results of such classication methods are generally presented
using a confusion matrix. The value of each element in the matrix shows
how effectively the data in each decided class (shown in columns) are
assigned to the actual class (shown in rows). A class is perfectly classied if
the diagonal element corresponding to that class is equal to 1, which
means that other elements in that row become zero (i.e. the accuracy of
classication is 100%). The ideal classication occurs when the
corresponding confusion matrix is a unit matrix (i.e., the matrix trace is
equal to the number of classes, n). Any deviation from this ideal situation
reduces the matrix trace from n; hence, the trace of the confusion matrix
can be used as an indication of classication performance.
As mentioned in Section 2, the optimization of hn is important when
using a Parzen classier. Fig. 3 shows the optimization of hn in the case that
four properties are used in the classication. The maximum trace of the
confusion matrix (i.e., greatest accuracy) is 7.6 (95% accuracy); this occurs
when hn is equal to 0.4.
The classication was rst performed in 4D space using all the joint
properties. It was then repeated in 3D space using the properties of dip,
dip direction, and one of inlling percentage or inlling material.
Classications using dip and dip direction (in 2D space) and dip
direction (in 1D space) were carried out to investigate the capabilities of
the stereonet and rose diagram in joint set classication, respectively.
The results of classication using the Parzen algorithm are shown
in Fig. 4. The trace of the confusion matrix is reduced with decreasing
Table 4
Quantication of survey data for joints.
Property
Description
Quantity Property
Continuity
b1 m
1-3 m
3-10 m
10-30 m
1
2
3
4
Table 3
Effect of the number and choice of joint properties on the performance of the Parzen
classier.
Row No of
properties
Confusion
matrix trace
Accuracy
(%)
7.60
95.00
6.54
81.75
3
4
5
3
2
1
6.78
5.24
3.94
84.75
65.50
49.25
Ends
No ends visible
One ends visible
Two ends visible
Roughness Stepped
Rough
Moderately
Rough
Slightly Rough
Smooth
Slickenside
1
2
3
1
2
3
4
5
6
Description
Quantity
1
2
3
4
5
6
7
1
2
3
4
5
6
(a)
(b)
79
(c)
Fig. 5. Orientation data for joints within the Asmari Formation at the Seymareh Dam site. a) Rose diagram, b) poles to joints, and c) contoured poles to joints.
Table 5
Absolute and relative coefcients of variation for the properties of joints in the study area.
Coefcient of variation
Dip direction
Dip
Continuity
Ends
Roughness
Aperture
Hardness
Absolute
Relative (%)
Accumulative relative (%)
0.81
28.6
28.6
0.48
17.0
45.6
0.43
15.2
60.8
0.37
13.1
73.9
0.33
11.6
85.5
0.29
10.3
95.8
0.12
4.2
100
joint properties (dip and dip direction) is 45.6%; therefore, the use of a
stereonet does not take into account the majority of the variation in
joint properties, suggesting that the stereonet is a non-complete
approach to joint set clustering.
0.8
0.7
Optimum Index
0.6
0.5
0.4
0.3
4
10
4
iv
Continuity
ii
iii
vi
1
90
Table 6
Eigen values of the PCA covariance matrix for the surveyed joint properties.
Eigen value
Eigen value (%)
Accumulative eigen
value (%)
2.58
1.34
1.04
0.73
0.55
0.40
0.36
36.9
19.1
14.9
10.4
7.9
5.7
5.1
36.9
56.0
70.9
81.3
89.2
94.9
100
p
Di
Sorted principal
components
60
30
vii
00
90
180
270
360
Dip Direction
Fig. 7. Discrimination of the seven optimum joint sets using seven joint properties,
based on KMC results.
80
Table 7
Properties of the seven joint sets.
Joint Set
No. of Joints
Dip Direction
(degree)
Dip (degree)
Continuity
Ends
Roughness
Hardness
Aperture
i
ii
iii
iv
v
vi
vii
34
16
22
32
25
14
29
5-40
20-50
50-80
60-90
170-200
250-300
250-290
40-80
45-75
45-85
40-70
40-85
70-85
10-50
1-2
3-4
1-2
3-4
1-2
1-2
1-4
2-3
1-2
1-2
1-2
1-3
1-3
1-3
1-3
3-5
2-5
3-5
2-4
2-4
1-3
3-4
2-3
3-4
2-3
3-4
3-4
2-4
2-3
3-5
3-5
3-5
1-4
3
2-4
E W
E W
(i)
(ii)
(iii)
E W
E W
(iv)
(v)
(vi)
(vii)
Fig. 8. Contoured stereoplots of poles to joints in the seven optimum joint sets identied when applying KMC to real joint data in 7D space.
81