Sie sind auf Seite 1von 12

Original Article

Proc IMechE Part C:


J Mechanical Engineering Science
Diagnosis methodology for identifying 0(0) 1–12
! IMechE 2017

gearbox wear based on statistical time Reprints and permissions:


sagepub.co.uk/journalsPermissions.nav

feature reduction DOI: 10.1177/0954406217721727


journals.sagepub.com/home/pic

Juan Jose Saucedo-Dorantes1, Miguel Delgado-Prieto2,


Roque Alfredo Osornio-Rios1 and
Rene de Jesus Romero-Troncoso3

Abstract
Strategies for condition monitoring are relevant to improve the operation safety and to ensure the efficiency of all the
equipment used in industrial applications. The feature selection and feature extraction are suitable processing stages
considered in many condition monitoring schemes to obtain high performance. Aiming to address this issue, this work
proposes a new diagnosis methodology based on a multi-stage feature reduction approach for identifying different levels
of uniform wear in a gearbox. The proposed multi-stage feature reduction approach involves a feature selection and a
feature extraction ensuring the proper application of a high-performance signal processing over a set of acquired
measurements of vibration. The methodology is performed successively; first, the acquired vibration signals are char-
acterized by calculating a set of statistical time-based features. Second, a feature selection is done by performing an
analysis of the Fisher score. Third, a feature extraction is realized by means of the linear discriminant analysis technique.
Finally, fourth, the diagnosis of the considered faults is done by means of a fuzzy-based classifier. The effectiveness and
performance of the proposed diagnosis methodology are evaluated by considering a complete data set of experimental
test, making the proposed methodology suitable to be applied in industrial applications with power transmission systems.

Keywords
Gearbox, condition monitoring, fault diagnosis, feature reduction, vibrations

Date received: 5 April 2017; accepted: 27 June 2017

Introduction
such as wear and tear in gears which are known as
Gears are extensively used in most of the mechanical superficial faults; consequently, an overheating gener-
power transmission systems due to their robustness, ated by the friction increases between gears causing a
competitive cost, and reliability.1–3 Despite their reli- reduction of its mechanical properties, which tends to
ability, the appearance of unexpected faults in gear- accelerate the degradation of the gearbox.10 Although
boxes may occur at any time, causing unscheduled incipient faults such as wear in gears are those which
breakdowns in the elements of the associated kine- originate crucial failures in gearboxes transmission
matic chain. It has been reported that the appearance systems, this kind of fault has not been totally
of gear faults account for 80% of the breakdowns in
transmission machinery systems and 10% of the faults 1
CA Mecatronica, Facultad de Ingenieria, Universidad Autonoma de
in rotating machinery.4,5 Therefore, strategies of Queretaro, San Juan del Rio, Mexico
2
condition monitoring and fault detection related Technical University of Catalonia (UPC), Department of Electronic
Engineering, MCIA Research Center, Terrassa, Spain
to gearbox transmission systems play a key role to 3
CA Procesamiento Digital de Señales, CA Telematica, Division de
ensure the effectivity and safety of multiple industrial Ingenierias, Campus Irapuato-Salamanca, Universidad de Guanajuato,
processes.6–8 Salamanca, Mexico
Considering real situations from industrial applica-
tions, the appearance of faults in gearboxes can be Corresponding author:
Rene de Jesus Romero-Troncoso, División de Ingenierı́as, Campus
generated by different sources such as fluctuating
Irapuato-Salamanca, Universidad de Guanajuato, Carr. Salamanca-Valle
load, poor lubrication, deficient cooling, gearing and km 3.5 þ 1.8, Comunidad de Palo Blanco, 36700 Salamanca, Guanajuato
coupling inaccuracies, among others.9 Most of the 36885, Mexico.
faults in gearboxes usually start from incipient faults Email: troncoso@hspdigital.org
2 Proc IMechE Part C: J Mechanical Engineering Science 0(0)

detected until the appearance of a critical breakdown. dimensionality of data sets are principal component
In this sense, the most critical faults are related to the analysis (PCA) and linear discriminant analysis
appearance of gear teeth irregularities; such faults are: (LDA). Yet, each one of these feature reduction tech-
tooth breakage, chipped tooth, root crack, spalling, niques lies with a specific objective function, that
pitting, and tooth surface damage, and due to their is, the main objective of PCA is the identification
typical appearance, these faults have been widely of those orthogonal components that are aligned to
addressed.11–13 Hence, the appearance of faults in a those directions where maximum data dispersion is
gearbox transmission system affects its proper oper- retained, while, the objective of LDA is to maximize
ation causing mainly the occurrence of vibrations, the the separation between data sets of different
increasing of noise besides that the temperature in the classes.25,26 The main difference between PCA and
mechanical system is affected.14,15 LDA techniques is that LDA retains information of
Condition monitoring schemes that are used to the different classes, supervised approach, whereas
perform the assessment of gearbox transmission PCA does not, unsupervised approach; thus, a specific
systems involve the measurement and analysis of dif- dimensionality reduction technique is selected
ferent physical magnitudes such as stator currents, depending on the criteria of application and also
acoustic emissions, temperatures, and vibra- depends on the proposed focus for being used.
tions.8,16,17 However, due to most of the mechanical Although several methodologies related to the con-
power transmissions systems are composed of rotat- dition monitoring and fault detection in gearbox
ing parts, the occurrence of vibrations in these systems transmission systems have been presented in the last
is inevitable; in fact, vibrations are one of the main years, most of these are focused on the diagnosis and
characteristics of rotating machines, and under the detection of critical faults. In this regard, in Patel
incidence or appearance of mechanical faults their et al.,27 a multi-criterion fusion framework for feature
vibrational signatures tend to be modified.18,19 selection is proposed, where the effectiveness, correl-
Indeed, with regard to the appearance and identifica- ation, and performance of classification are taken into
tion of mechanical faults, advantageous results may account during the diagnosis of different failure
be performed through the measurement of physical modes in a planetary gearbox. Although this scheme
magnitudes which nature is purely mechanic like allows identifying different failure modes, the
vibrations.20 Thus, vibration-based schemes remain proposed strategy only involves the identification of
as one of the most accepted, reliable, and suitable critical gear faults such as root crack and missing
approaches used for condition monitoring and fault tooth. In Hua and Lam,13 a multi-stage feature selec-
identification of mechanical faults in industrial appli- tion based on genetic algorithms is proposed for
cations. Classically, the root mean square (RMS), carrying out the selection of the best set of condition
numerical indicator is estimated from vibration meas- parameters extracted from vibration signals through
urements in order to assess the general condition time, frequency, and time–frequency domains.
of the machine.21 The vibration signals can be char- The selected set of features is then used in a neural
acterized extending their analysis into frequency and network to classify different fault conditions of a gear-
time–frequency domains.16,22,23 Even though the most box. Although this approach is evaluated with a large
well-known frequency and time–frequency domain number of features, the proposed strategy is also
techniques such as fast Fourier transform, Wavelet focused on critical gear faults. Indeed, despite a
analysis or Hilbert–Huang transform among others, large number of proposed methodologies for assessing
have been satisfactory applied to condition monitor- the condition of gearbox transmission systems, most
ing schemes, the simplicity and low computational of these have been focused on critical faults, which
cost of statistical time-domain features represents a exhibit significant affectation patterns that facilitate
suitable characterization solution, mainly, for con- their detection.
sidering its capability to estimate general trends The main contribution of this work lies on the pro-
from signals.24 posal and validation of a diagnosis methodology
Commonly, the calculation of a high-dimensional based on a dimensionality feature reduction approach
set of features is considered in order to obtain large for the identification and classification of different
sets of information. Thus, the consideration of redun- levels of uniform wear as an incipient fault in a gear-
dant and nonsignificant information into the numer- box transmission system. The fault patterns of the
ical feature set proposed to characterize the physical different levels of wear considered are characterized
magnitude is inevitable. In this regard, dimensionality by calculating a statistical time set of features from
reduction procedures have been included in condition the acquired vibration signals. Then, the resulting sets
monitoring methodologies aiming to remove nonuse- of features are analyzed by means of the proposed
ful information which can lead to a posterior low multi-stage feature reduction approach. First, a
diagnosis performance.22 The purpose of such dimen- Fisher score based feature selection stage is proposed
sionality reduction is to obtain a significant represen- to determine the most representative subset of fea-
tation of the original set of features. The most tures; then, the resulting set of features is subjected
commonly applied techniques used to reduce the to a compression procedure through a linear
Saucedo-Dorantes et al. 3

discriminant analysis approach. This last feature This technique has been widely included in classic
reduction stage allows obtaining a two-dimensional feature reduction approaches to project a high-
visualization of the measurements. Finally, due to dimensional data set into a new and nonredundant
the resulting two-dimensional set of features, a set of features. In these extracted features, known as
simple neuro-fuzzy based classification algorithm is principal components, most of the data set variance is
used to obtain the gearbox condition and fault classi- represented. Although PCA technique is based on
fication. The validation of the proposed methodology statistical analysis, it is not considered as the separ-
is performed through experimental tests, where four ation of different classes. In this regard, LDA is the
different treated conditions at different operating fre- most well-known supervised feature extraction tech-
quencies are considered. The obtained results show nique used for linear dimensionality reduction in
the effectivity of detection and classification of wear problems where multiple classes are addressed. The
in a gearbox linked to a kinematic chain, making the main objective of the LDA technique lies in finding
proposed methodology suitable for industrial machin- a new lower-dimensional projection where the most
ery diagnosis. discriminative information among data points belong-
ing to different classes is maximized.25 Because the
LDA is a supervised technique, it is an appropriate
Feature reduction approaches technique to maximize the condition monitoring per-
Dealing with mechanical power transmission systems, formance dealing with the identification of an avail-
the feature set plays a key role that compromises the able set of fault conditions.
performance of the fault identification schemes. Thus, Indeed, LDA deals with multi-class problems, thus,
the use of an inappropriate set of features cannot be considering a multi-class problem with C classes com-
representative enough to describe the different condi- posed of N samples, the LDA computes the between-
tion in a rotating electromechanical system, thus, the class scatter matrix as follows29
use of a large number of features may increase the
capability of discrimination. However, such increase X
C   T
of features does not ensure the addition of relevant Sb ¼ Nj mj  m mj  m ð1Þ
j¼1
information related to the malfunctions in the work-
ing condition of mechanical power transmission
systems. For that reason, different strategies or pro- where Nj corresponds to the total number of samples
cedures related to feature reduction have been con- in the jth class Cj, taking into account all the classes m
sidered in condition monitoring schemes. In this is the mean of all the samples and mj is the mean of
regard, independently of whether condition monitor- the class Cj. Also the LDA considers the computation
ing schemes are used to assess electrical or mechanical of the within-class scatter matrix as
machines, the main included feature reduction strate-
gies are feature selection and feature extraction.24,25,28 Nj   T X
C X
X C
The feature selection is considered a strategy with Sw ¼ xij  mj xij  mj ¼ Swj ð2Þ
filtering purposes, where all the elements of a set of j¼1 i¼1 j¼1
features are individually evaluated in order to rank
them according to their individual discriminative
capabilities; and despite a specific feature does not where xij is the ith sample which belongs to the class
provide meaningful information by itself; it can Cj, resulting in Swj the corresponding covariance
provide relevant information by its combination matrix of the class Cj.
with other different features. Generally, most of the The optimal projecting vector WLDA chosen during
strategies with filtering purposes do not necessitate a the LDA allows to obtain well-separated classes since
specific learning algorithm and are very effective as the computed matrix contains orthonormal columns
well as being easy and fast to compute. Most of the which maximize the ratio of the determinant of
filtering techniques are based on general properties or the between-class matrix of the projected samples to
characteristics of the data sets, such as dependencies, the determinant of the within-class scatter matrix
distances, and consistencies among others.8,25 In gen- of the projected samples
eral, the objective of considering feature selection  T 
strategies for being applied to condition monitoring W Sb W  
WLDA ¼ arg max  T  ¼ w1 w2    wm
schemes is to retain those features with the best dis- W Sw W
criminative capabilities among conditions.13,27
ð3Þ
The feature extraction is considered a strategy with
transformation purposes, where the elements of a set
of features are combined in order to enhance a specific where fwi ji ¼ 1, 2, . . . , mg belongs to the set of gener-
characteristic. PCA is the most well-known unsuper- alized eigenvectors also known as discriminant vec-
vised technique used to reduce the dimensionality tors of the Sb and Sw that correspond to the C-1
of data sets by extracting a new set of features.22 largest generalized eigenvalues fli ji ¼ 1, 2, . . . , mg.
4 Proc IMechE Part C: J Mechanical Engineering Science 0(0)

Figure 1. Flow chart of the proposed methodology used for identifying gearbox wear.

Thus, the feature extraction resulting in V is per- analyzed with the aim to preserve and filter those dis-
formed through the projection of the original data set criminant features that better describe the gearbox
of features X into the low dimensional WLDA as working condition. The feature selection process is
follows proposed to be performed by means of computing
the Fisher score, which can be considered as brute-
V ¼ WTLDA X ð4Þ force based Fisher analysis, in which the ratio between
the within-class scatter estimation and the between-
classes scatter estimation is obtained. Specifically,
this computed value may be interpreted as a relative
Diagnosis methodology measurement which represents the distance between
The proposed methodology for identifying gearbox different classes and dispersion among data points
wear is composed of five stages as the flow chart of belonging to each class. Then, a set of features
Figure 1 depicts. First, in the data acquisition stage, which produce a small Fisher score implies poor dis-
the occurrence of vibrations in the perpendicular criminative capabilities, while a large Fisher score
plane of the gearbox rotating axis is acquired. implies better discriminative capabilities. The ranking
Second, in the feature estimation stage, a charac- of the features is proposed to be performed according
terization of each acquired vibration signal is per- to their relevancy; in this sense, those features that are
formed by estimating a set of 15 statistical time better ranked in terms of the Fisher score are con-
features. The proposed set of statistical time features sidered the features with high faults discriminative
comprises: mean, maximum value, RMS, square root capabilities, while those features that are worst
mean, standard deviation, variance, RMS shape ranked are considered as nondiscriminative features
factor, square root mean shape factor, crest factor, without poor information related to the fault condi-
latitude factor, impulse factor, skewness, kurtosis, tion. The feature selection is proposed to be carried
and normalized fifth and sixth moments. These pro- out under a combinatorial approach; therefore, Fisher
posed statistical time features and their corresponding scores are obtained by carrying out combinations
equations are listed in Table 1. Moreover, due to the between all the available statistical features. Thus,
potentiality to analyze trends of signals and the high- the discriminative capabilities are evaluated by con-
performance source of information, advantageous sidering different subsets composed of one, two, and
and accurate results have been successfully obtained three features. In order to discard the useless and
by including the proposed set of statistical time nondiscriminative features, in the feature selection
features in condition monitoring schemes for fault procedure the discriminative capabilities of the statis-
identification in electromechanical systems.7,21,24 tical time subsets of features considering all of the
The third stage is the feature selection; in this healthy-faulty pairs of classes is analyzed. Then,
reduction stage, the discriminative capabilities of the after evaluating the Fisher score of each statistical
estimated statistical features, which belong to all of feature, the first ranked subset of statistical features
the healthy-faulty pairs of classes, are separately is considered as the most relevant and discriminative
Saucedo-Dorantes et al. 5

Table 1. Set of proposed statistical time features. interpretation of the different considered conditions.
Furthermore, the resulting representation of the
1 Xn
extracted features into a two-dimensional space
Mean x ¼  jxk j ð1Þ
n k¼1 makes easier the classification task. Thus, the most
discriminative statistical features selected in the previ-
Maximum value x^ ¼ maxð xÞ ð2Þ
ous stage are projected into a new base with a reduced
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi dimensional space where their discriminative capabil-
1 X n
Root mean square RMS ¼  ðxk Þ2 ð3Þ ities between the considered conditions are
n k¼1
maintained.
! Finally, the classification and fault diagnosis are
n pffiffiffiffiffiffiffi 2
1 X done in the fifth stage, and the set of extracted fea-
Square root mean SRM ¼  jxk j ð4Þ
n k¼1 tures are evaluated in order to obtain the diagnose
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi and classification of the different considered condi-
1 X n
tions. In this sense, it must be highlighted that, in
Standard deviation ¼  ðxk  x Þ2 ð5Þ
n k¼1 the proposed diagnosis methodology, a successive
processing of features (feature selection and feature
1 Xn
reduction) is considered; and through its implementa-
Variance 2 ¼  ðxk  x Þ2 ð6Þ
n k¼1 tion, those features that are not significantly capable
to represent the characteristic patterns related to the
RMS gearbox working condition are removed. Thus, since
SFRMS ¼ 1 Pn ð7Þ
RMS shape factor  k¼1 jxk j the number of features is reduced to two, and the fault
n
patterns have been emphasized, the use of simple clas-
SRM
SFSRM ¼ 1 Pn ð8Þ sification structure is possible. In this case, a fuzzy-
SRM shape factor  k¼1 jxk j
n based classifier for carrying out the diagnosis and
x^ classification of the considered conditions is proposed.
Crest factor CF ¼ ð9Þ In fact, in a classic fuzzy-based inference system there
RMS
is an antecedent that is the input, such input is eval-
x^ uated by means of a membership functions with the
Latitude factor LF ¼ ð10Þ
SRM purpose to determine its degree of association to a
x^ specific fuzzy-event. In the proposed classifier, the
Impulse factor IF ¼ 1 Pn ð11Þ extracted set of features is the input to be evaluated
n  k¼1 jxk j
in the membership functions. Then, a conclusion is
 
E ðxk  x Þ3 performed by computing the consequent or output
Skewness Sk ¼ ð12Þ of the fuzzy-based inference system through a series
3
  of logical operations that are also known as fuzzy
E ðxk  x Þ4 rules. The proposed classifier has four outputs that
Kurtosis k¼ ð13Þ
4 represent every one of the considered conditions, the
  consequent of each membership function in the fuzzy-
E ðxk  x Þ5
Fifth moment 5thM ¼ ð14Þ based inference system is determined by a Sugeno-
5
style membership.
 
E ðxk  x Þ6
Sixth moment 6thM ¼ ð15Þ
6 Experimental setup
The experimental setup used for testing the different
considered conditions and the data acquisition system
subset of features that provide a better description to (DAS), used to acquire the experimental vibration sig-
distinguish between the corresponding pair of a fault nals are shown in Figure 2. The test bench is based on
condition and healthy condition of the gearbox. a kinematic chain, it has a variable frequency drive
In the fourth stage, feature extraction, all the sub- (VFD) (WEGCFW08) to feed and control the rota-
sets of features identified as most discriminative for tional speed of a 1492 W, three-phase induction motor
each healthy-faulty pair of classes are then subjected (WEG00236ET3E145T-W22). The induction motor is
to a base transformation and a compression process coupled to a 4:1 ratio gearbox (BALDOR
by means of the LDA technique. As a result of this GCF4X01AA) driving its input shaft, this gearbox
process, a new set of features is extracted composed is used to test the different levels of uniform wear
by a weight combination of the previously selected studied in this work. Besides, the gearbox in turn
statistical features. Accordingly, due to the accom- couples the induction motor to a DC generator
plished base transformation, it is possible to project (BALDOR CDP3604), such generator is used as a
the new set of extracted features into a two- noncontrolled mechanical producing around 20% of
dimensional space allowing obtaining a visual the nominal load in the induction motor under
6 Proc IMechE Part C: J Mechanical Engineering Science 0(0)

working conditions. The occurrence of vibrations in to automatize the test run, a relay controlled by the
the perpendicular plane of the gearbox rotating axis is DAS provides control over the induction motor
measured and acquired using a triaxial accelerometer startup. All the acquired vibration signals are stored
(LIS3L02AS4), this accelerometer is mounted on a in a personal computer (PC) for offline analysis.
board with its proper signal conditioning and its Ninety axial vibration measurements for each con-
anti-alias filtering. A 12-bit four-channel serial- sidered condition, where each one of the acquired
output sampling analog-to-digital converters measurement belongs to one second of the gearbox
(ADS7841) is used on board of the DAS to acquire working condition, are acquired. Due to each
the vibration signals, the sampling frequency is set to acquired vibration signal consists of 90 consecutive
3 kHz, and the DAS is configured to acquired 270 kS samples, for each considered condition it is estimated
during 90 s of continuous sampling in the induction a characteristic vibrational pattern with 15 numerical
motor from the startup to the steady-state. Such DAS statistical features and 90 consecutive samples.
is a proprietary, low-cost design, based on field pro- In this work, four different conditions are con-
grammable gate array technology (FPGA). In order sidered to be evaluated alternatively in the gearbox:
healthy (HLT), 25%, 50%, and 75% of uniform wear,
respectively. Regarding the considered faults, in order
to produce the wear fault condition in gears, it was
artificially made by a gear factory. Thus, considering
a set of gears in a healthy condition, these are sub-
jected to a machining process where all of its teeth are
worn by a tungsten cutter; then, such gears are also
subjected to a lapping process aiming to make as real
as possible the induced wear in gears. The gearbox
used in this work consists of only two gears, one of
them is the driver gear and the other one is the driven
gear, and each one of these gears has 5 and 72 teeth,
respectively. Therefore, in this work are addressed
three different levels of uniform wear and a healthy
condition in order to prove the effectiveness of the
proposed diagnosis methodology. From Figure 3(a)
to (d) the studied gears are shown: HLT, 25%,
50%, and 75% of uniform wear, respectively.
Figure 2. Experimental test bench used to experiment and to The experimentation for the considered conditions
identify gearbox wear. is carried out by replacing each one of the faulty gears

Figure 3. Set of gears evaluated in the gearbox: (a) Healthy, (b) 25%, (c) 50%, and (d) 75% of wear.
Saucedo-Dorantes et al. 7

(worn gears), with the healthy one, alternatively. multi-stage feature reduction approach, where pro-
Three different operating frequencies for driving the cedures for feature selection and feature extraction
induction motor have been considered, and the are included.
frequencies of 5, 15, and 50 Hz are set in the VFD Subsequently, in the feature selection stage, there
causing an averaged rotating speed of 294, 890, and are analyzed the discriminative capabilities of the esti-
2985 r/min in the induction motor, respectively. mated statistical time features by calculating their
Fisher score. Hence, in order to discard those features
with nondiscriminative capabilities, the feature selec-
Validation of the method tion is applied over all possible combinations in sets of
The proposed gearbox wear diagnosis methodology is features composed by one, two, and three features, for
implemented in Matlab, which is used to process the each healthy-faulty pair of classes. Also, it should be
acquired vibration signals and to provide the diagno- noted that by performing separately the analysis
sis of the condition. As aforementioned, in the pro- to the three healthy-faulty pairs of classes, the features
posed work, the acquired vibration signals belong to with better discriminative capabilities are filtered.
those vibrations in the perpendicular plane of the The three healthy-faulty pair of classes considered
gearbox rotating axis since some studies have reported are healthy-25% gear wear, healthy-50% gear wear,
that the occurrence of perpendicular vibrations on the and healthy-75% gear wear. As the proposed Fisher
rotating axis is related to the inappropriate working score analysis represents a combinatorial problem
conditions of rotational machines.5,13,24 Regarding among features, it could be unaffordable the required
the proposed methodology, the data acquisition is computational burden. In this sense, the Fisher ana-
performed by carrying out different experiments at lysis is limited to sets of features composed by a max-
different operating frequencies for driving the induc- imum of three features, since the computational
tion motor: 5, 15, and 50 Hz. Thereby, the stored burden responds as an unfeasible factorial function
vibration signals consist of 90 s which correspond to to the increase of analyzed features. Therefore, in
the continuous monitoring of the gearbox working order to carry out the selection process during the
under the three different considered conditions, then, experimental validation, only subsets composed of
each acquired vibration signal is segmented into equal combinations of one, two, and three statistical fea-
parts of one second in order to generate a consecutive tures are considered to be evaluated for computing
set of samples and to facilitate the processing of the their corresponding Fisher score. After the assessment
signals. of all the possible combinations of sets of features
Afterwards, the feature estimation is carried out composed of one, two, and three features, the sets
through the estimation of 15 statistical time features are ranked according to their relevancies in terms of
from each acquired vibration signal; that is, the stat- Fisher score; that is, sets of features with better dis-
istical features are estimated from each segmented criminative capabilities produce largest values of
part. As a result, a characteristic vibrational pattern Fisher score. Then, aiming to filter the best statistical
is estimated from each vibration signal; thus, each features, the first ranked subset is considered as the
considered condition is now represented by a set of best set of features to distinguish between the corres-
15 statistical time features with 90 consecutive sam- ponding pair of a fault condition and healthy
ples. Despite the high characterization provided by condition.
the statistical features, not all of them exhibit the Therefore, the statistical features that better
same representative information related to the gear- describe the gearbox working conditions are obtained
box condition. In this regard, aiming to retain the best through the proposed feature selection approach, and
discriminative statistical features, the estimated sets of the selected subsets are considered the best with cap-
statistical time features are then analyzed through a abilities of class separation. In Table 2, the details of

Table 2. Detail of the selected subsets composed by considering combinations of three statistical time features in the Fisher score
analysis.

Operating frequency

5 Hz 15 HZ 50 Hz

Gear Selected subset Computed Selected subset Computed Selected subset Computed
condition of features Fisher score of features Fisher score of features Fisher score

25% SRM–SD–RMS 1693.43 SD–RMS–SMR 34946.22 SD–RMS–Variance 366.91


50% Mean–Variance–SD 419.58 Mean–SD–Variance 877.73 Mean–S. factor RMS– 408.61
S. factor SMR
75% SD–Variance–RMS 74.92 SRM–RMS–SD 684.15 RMS–SD–Variance 1620.95
RMS: root mean square; SD: standard deviation; SRM: Square root mean.
8 Proc IMechE Part C: J Mechanical Engineering Science 0(0)

Figure 4. Resultant projection of the extracted set of features obtained by the application of the proposed multi-stage feature
reduction strategy to the original set of features. When the induction motor is driven at (a) 5 Hz, (b) 15 Hz, and (c) 50 Hz.

the selected subsets of statistical time features for the more than one time, it must be considered only one
three considered faulty conditions under the different time for the final set of features selected. For example,
operating frequencies are summarized. It should be considering the selected subsets of features obtained
clarified that the selected subsets of features summar- when the operating frequency is 5 Hz, the final subset
ized in Table 2 correspond to the first ranked subset of of selected statistical features is composed by the
features in terms of Fisher score; furthermore, these square root mean (SRM), the standard deviation,
selected subsets of features are computed by analyzing RMS, mean, and the variance.
combinations of three statistical features. Hence, the In regard with the use of combinations of one, two,
computed values of Fisher score correspond to the and three features, the results do not show significant
Fisher score generated by those statistical features differences and, when it was considered combinations
that compose the selected sets of features. It should of one and two features the resulting subsets were
be considered that values of Fisher score equal to 1 composed by, in general, the same statistical features
mean that the within-class scatter estimation is equal but with a different value of Fisher score. Even
to the between-classes scatter estimation. Therefore, though combinations of one, two, and three features
the statistical features with better discriminant cap- were used to perform the Fisher score analysis, the
abilities are those that tend to generate a value of selected subsets resulted to be composed of three fea-
Fisher score higher than 1. Consequently, according tures. The Fisher score analysis by considering one
to the obtained values of Fisher score listed in and two features was useful to ensure that some spe-
Table 2, advantageous discriminative capabilities are cific statistical features prevail in the characterization
obtained between classes of the healthy condition and of the gearbox working condition. Moreover, the
faulty condition. Also, it is noticed that a statistical Fisher score analysis allows to understand that, as
feature can appear repeated in different subsets of well as a feature may produce a higher value of
selected features for the same gearbox working con- Fisher score by itself, this feature may take part also
dition. Although a statistical feature appears repeated of a low value of Fisher in a set of features due to its
Saucedo-Dorantes et al. 9

combination of other features, and the same can occur


in the opposite way. In this sense, the importance of
such analysis is emphasized.
In the feature extraction, the selected subsets of
statistical features are subjected to a compression pro-
cedure and a base transformation through the LDA
technique. Thus, through this strategy of feature
extraction, it is obtained a new subset of features,
and such new extracted features are composed of a
weighted combination of the previously selected stat-
istical features. As a result of the base transformation,
it is possible to obtain a visual representation of the
considered conditions into a two-dimentional space.
In this proposal, three different operating frequencies
to drive the induction motor are considered, then, the
feature extraction process is individually applied to
Figure 5. Projection of the extracted set of features com-
the selected subsets of features for each of the different puted through the application of the PCA as a unique feature
operating frequencies. reduction stage over the original data set of features that
Consequently, three different projections into a characterize the gearbox conditions at the operating frequency
two-dimensional space are obtained by carrying out of 5 Hz.
the feature extraction process through LDA tech-
nique; shows the resulting projections of the extracted
sets of features by driving the induction motor at besides data points of the condition of HLT, 50%
three different frequencies, 5, 15, and 50 Hz. As it is and 75% of gear wear are overlapped between them-
expected, in the three resulting projections corres- selves. Although PCA has been included in classic
ponding to the different operating frequencies, a feature reduction approaches used as a unique feature
clear separation between the considered faulty condi- reduction stage, in this proposed diagnosis method-
tions and the healthy one is obtained. Yet, although in ology, PCA as a unique feature reduction stage shows
some data points of different classes tend to appear disadvantages in the separation of the considered con-
close to each other, the centers of the clustered data ditions, since PCA seeks for global distribution of
are separated. Moreover, in order to ensure the effect- data and makes no consideration for different classes
ivity and applicability of the diagnosis, it is important that are nonconnected in the feature space.
to notice that no overlapping between the HLT and Moreover, aiming to highlight the effectiveness that
any of the faulty conditions appear. the application of the proposed feature selection offers
Even though there are other techniques that can be before performing the feature extraction and in order
used in feature extraction stages, for this proposed to demonstrate that the selected sets of features con-
diagnosis methodology, LDA is the most appropriate tain the most discriminative fault-related information,
technique considering that LDA deals with multi- the feature extraction is also carried out by means of
class problems from a supervised approach. In this PCA using the selected set of features obtained when
sense, in order to highlight the effectiveness of the the induction motor is driven at 5 Hz. Thus, Figure 6
proposed multi-stage feature reduction approach shows the projection of the extracted features per-
with respect to the results computed by a classical formed by the PCA from the selected sets of features
one-stage feature reduction approach, the feature for the gearbox conditions when the induction motor
reduction procedure is also carried out by the other is driven to 5 Hz. It is possible to notice from Figure 6,
well-known technique, the PCA. Thus, the feature that in this work the proposed feature selection pro-
reduction by means of PCA over the original data cess leads to improve the performance of the PCA in
set of 15 statistical features that characterize the the feature extraction. Yet, although PCA is an
gearbox condition at, for example, the operating unsupervised approach, the overlapping problems dis-
frequency of 5 Hz, has been performed. As aforemen- appear by means of considering a previous feature
tioned, PCA preserves as much as possible the vari- selection process since the nondiscriminative features
ance of the data distribution along its principal are removed.
components. In Figure 5, the extracted features result- With regard to the classification, a fuzzy-based
ing by considering the PCA as a feature extraction classifier is used to carry out the fault diagnosis
technique are shown. These extracted features repre- and to generate the output classes. Indeed, a high-
sent into a two-dimensional space the considered con- performance characterization of the considered
ditions of the gearbox when the induction motor is conditions is performed by the successive application
driven at 5 Hz. From Figure 5, it is possible to of the proposed multi-stage feature reduction
notice that the data points of the different classes method (feature selection and feature reduction).
are grouped in elongated areas as scattered data, Consequently, the consideration of a simple structure
10 Proc IMechE Part C: J Mechanical Engineering Science 0(0)

Table 3. The performance of the achieved classification


during the training and test of the fuzzy inference system under
the considered conditions at different operating frequencies.

Operating frequency (Hz) Training (%) Test (%)

5 99.32 98.88
15 99.82 99.1
50 94.3 92.2

classification ratios obtained under the five-fold cross-


validation scheme exhibit a stable behavior, which con-
siders the different operating frequencies within the
range of 94.3% to 99.8% in the training stage, and
within 92.2% to 99.1% for the test stage. Moreover,
in order to prove the effectiveness of the proposed
Figure 6. Projection of the extracted set of features per-
formed by means of PCA to reduce the dimensionality of the
multi-stage feature reduction, the same structure of
selected sets of features that characterize the gearbox condi- the fuzzy-based classifier is also trained and tested
tions at the operating frequency of 5 Hz. using the extracted features obtained by considering
PCA as a unique feature reduction stage. Thus, the
resulting classification ratios achieved by applying
in the classifier allows to obtain accurate results with- PCA to the original data sets when the induction
out an excessive computational burden. Thereby, a motor is driven to 5 Hz are 88.9% during the training
classic fuzzy-based inference system is proposed to and 87.4% for the test. These classification ratios rep-
carry out the final diagnoses and the classification of resent a poor performance for a condition monitoring
the considered conditions. Thus, in the proposed methodology. In this regard, considering the results
fuzzy-based classifier, the extracted sets of features generated through the application of the proposed
are evaluated by means of membership functions in diagnosis methodology, the global ratio of classifica-
order to determine its degree of association to a tion is improved by around 11% in comparison with
specific fuzzy-event. Afterwards, the diagnosis is the results by using PCA as the unique feature reduc-
performed by computing the consequent of the tion stage. On the other hand, due to the proposed
fuzzy-based inference system through the evaluation feature selection stage is applied previously to the fea-
of the series of logical operations, and the consequent ture reduction, the performance of the of the PCA is
of each membership function in the fuzzy-based infer- improved when is performed the dimensionality reduc-
ence system is determined by a Sugeno-style member- tion of the selected sets of features that characterize the
ship. The training of the proposed fuzzy-based gearbox condition operating at 5 Hz. In this regard, the
classifier is done by considering 50 epochs. classification ratios achieved by applying PCA over the
Aiming to demonstrate the effectiveness of the pro- selected sets of features are 98.3% during the training
posed diagnosis methodology and to obtain statistic- and 97.1% during the test; in this work, this improve-
ally significant results, the training and test of the ment is generated by the fact of being carried out the
proposed fuzzy classifier has been carried out follow- proposed feature selection process. These results reflect
ing a five-fold cross-validation scheme. In this sense, the high performance of the proposed multi-stage fea-
the extracted sets of features composed of 360 samples ture reduction approach and its application in the
(90 samples per condition) are considered as the ori- development of diagnosis schemes for assessing gear-
ginal database. Then, this database is divided into two box transmission systems.
different data sets. The first data set composed of 288
samples (72 samples per condition), is used for train-
Conclusions
ing the classifier, and the second one that comprises 72
samples (18 samples per condition), is used for testing This work presents a new diagnosis methodology for
the classifier. assessing the condition of a gearbox under different
Thereby, with respect to the performance of classifi- incipient fault conditions of uniform wear. There are
cation, all the variance data available in the original three important characteristics that must be high-
databases are used, and through the consideration of lighted in this new proposed methodology. The first
the five-fold cross-validation scheme, five classification one is related to the use of vibration signals that
ratios are obtained. Then, the average of these five clas- remain as the most reliable for industrial applications,
sification ratios is computed, and Table 3 summarizes and the proposed set of statistical time features which
the average of the classification ratios achieved during allows obtaining a better characterization of the
the training and the test of the proposed fuzzy-classifier acquired vibration signals providing relevant informa-
of the extracted subsets of features. It is noticed that the tion that is correlated to the gearbox working
Saucedo-Dorantes et al. 11

condition. The second one lies in the application of 3. Yin S, Ding SX, Xie X, et al. Review on basic data-
the proposed multi-stage feature reduction approach driven approaches for industrial process monitoring.
for processing the estimated set of statistical features. IEEE Trans Ind Electron 2014; 62: 6418–6428.
The application of a consecutive feature reduction 4. Cheng G, Li H, Hu X, et al. Fault diagnosis of gearbox
based on local mean decomposition and discrete hidden
over the statistical time features allows obtaining the
Markov models. Proc IMechE, Part C: J Mechanical
approximation to an optimum set of features by dif-
Engineering Science. Epub ahead of print 10 March
ferent considerations, from the removal of the less 2016. DOI: 10.1177/0954406216638885.
discriminative features to the compression of the 5. Liu Z, Guo W, Tang Z, et al. Multi-sensor data fusion
most significant subsets of statistical features. The using a relevance vector machine based on ant colony for
third one is the use of a simple classifier based on a gearbox fault detection. Sensors 2015; 15: 21857–21875.
fuzzy inference system capable of performing the rec- 6. Lei Y, Lin J, Zuo MJ, et al. Condition monitoring and
ognition of the considered conditions. fault diagnosis of planetary gearboxes: a review.
Four different experimental conditions have been Measurement 2014; 48: 292–305.
evaluated in the gearbox, including the healthy and 7. Chen J, Zhang C, Zhang X, et al. Planetary gearbox
faulty conditions. Under the considered experimental condition monitoring of ship-based satellite communica-
tion antennas using ensemble multi wavelet analysis
conditions carried out at different operating frequen-
method. Mech Syst Signal Process 2015; 54–55: 277–292.
cies, the proposed diagnosis methodology shows reli-
8. Bartkowiak A and Zimroz R. Dimensionality reduction
ability in the obtained fault diagnosis results, and via variables selection – linear and nonlinear
92% of total classification ratio is achieved in the approaches with application to vibration-based condi-
worst-case classification. Besides, in this work, the tion monitoring of planetary gearbox. Appl Acoust
possibility to detect incipient faults through the evalu- 2014; 77: 169–177.
ation and the diagnosis of the three different levels of 9. Chaari R, Khabou MT, Barkallah M, et al. Dynamic
uniform wear is also shown. Note that this is the first analysis of gearbox behavior in milling process-
time that the appearance of uniform wear has been Non-Stationary operations. Proc IMechE, Part C: J
addressed in condition monitoring schemes applied to Mechanical Engineering Science 2016; 230: 3372–3388.
gearboxes transmission systems. The obtained results 10. Amarnathm M and Sujatha C. Surface contact fatigue
failure assessment in spur gears using lubricant film
make the proposed methodology suitable for the diag-
thickness and vibration signal analysis. Tribol Trans
nosis of gearbox transmission systems in industrial
2015; 58: 327–336.
applications. Future work will be focused on the 11. Kia SH, Henao H and Capolino GA. Gear tooth sur-
implementation of the proposed gearbox wear diag- face damage fault detection using induction machine
nosis methodology for online identification, besides stator current space vector analysis. IEEE Trans Ind
the analysis of the appearance of uniform wear in Electron 2015; 62: 1866–1878.
gearboxes combined with other faults. 12. Li Z, Yan X, Tian Z, et al. Blind vibration component
and nonlinear feature extraction applied to the nonsta-
Declaration of Conflicting Interests tionary vibration signals for the gearbox multi-fault
diagnosis. Meaurement 2013; 46: 259–271.
The author(s) declared no potential conflicts of interest with
13. Cerrada M, Sanchez RV, Cabrera D, et al. Multi-stage
respect to the research, authorship, and/or publication of
feature selection by using genetic algorithms for fault
this article.
diagnosis in gearboxes based on vibration signal.
Sensors 2015; 15: 23903–23926.
Funding
14. Lei Y, Li N, Lin J, et al. Two new features for condition
The author(s) disclosed receipt of the following financial monitoring and fault diagnosis of planetary gearboxes.
support for the research, authorship, and/or publication J Vib Control 2013; 21: 755–764.
of this article: This research work has been partially sup- 15. Qiao W and Lu D. A survery on wind turbine condition
ported by CONACYT, Mexico, under doctoral scholarship monitoring and fault diagnosis-Part I: components and
number 278033; FOPER-UAQ 20701213128; SEP- subsystems. IEEE Trans Ind Electron 2015; 62:
CONACYT 222453-2013 projects and by the Spanish 6536–6545.
Ministry of Economy and Competitiveness under the 16. Li C, Sanchez RV, Zurita G, et al. Gearbox fault diag-
TRA2016-80472-R. nosis base on deep random forest fusion of acoustic and
vibratory signals. Mech Syst Signal Process 2016;
References 66–67: 283–293.
1. Guoji S, McLaughlin S, Yongcheng X, et al. Theoretical 17. Waqar T and Demetgul M. Thermal analysis MLP net-
and experimental analysis of bispectrum of vibration sig- work based fault diagnosis on worm gears.
nals for fault diagnosis of gears. Mech Syst Signal Measurement 2016; 86: 56–66.
Process 2014; 43: 76–89. 18. Yang CY and Wu TY. Diagnostics of gear degradation
2. Kia SH, Henao H and Capolino GA. Trends in gear using EEMD approach and PCA process. Measurement
fault detection using electrical signature analysis in 2015; 61: 75–87.
induction machine-based systems. In: Proceedings of 19. Goyal D and Pabla BS. The vibration monitoring meth-
the 2015 IEEE workshop on electrical machine design, ods and signal processing techniques for structural
control and diagnosis (WEMDCD 2015), March 2015, health monitoring: a review. Arch Comput Methods
pp.297–303. Torino, Italy: IEEE. Eng 2016; 23: 585–594.
12 Proc IMechE Part C: J Mechanical Engineering Science 0(0)

20. Ruiz-Cárcel C, Jaramillo VH, Mba D, et al. 25. Van M and Kang HJ. Wavelet kernel local Fisher dis-
Combination of process and vibration data for criminant analysis with particle swarm optimization
improved condition monitoring of industrial systems algorithm for bearing defect classification. IEEE
working under variable operating conditions. Mech Trans Instrument Measure 2015; 64: 3358–3600.
Syst Signal Process 2016; 66–67: 699–714. 26. Abed W, Sharma S and Sutton R. Neural network fault
21. Igba J, Alemzadeh K, Durugbo C, et al. Analyzing diagnosis of a trolling motor bases on feature reduction
RMS and peak values of vibration signals for condition techniques for an unmanned surface vehicle. Proc
monitoring of wind turbine gearboxes. Renew Energy IMechE, Part I: J Systems and Control Engineering
2016; 91: 90–106. 2015; 1–13.
22. Wan X, Wang D, Tse PW, et al. A critical study of 27. Liu Z, Zuo MJ and Xu H. Fault diagnosis for planetary
different dimensionality reduction methods for gear gearboxes using multi-criterion fusion feature selection
crack degradation assessment under different operating framework. Proc IMechE, Part C: J Mechanical
conditions. Measurement 2016; 78: 138–150. Engineering Science 2015; 227: 2064–2076.
23. Guoji S, McLaughlin S, Youngcheng X, et al. 28. Saucedo-Dorantes JJ, Delgado-Prieto M, Osornio-Rios
Theoretical and experimental analysis of bispectrum RA, et al. Multi-fault diagnosis method applied to an
of vibration signals for fault diagnosis of gears. Mech electric machine based on high-dimensional feature
Syst Signal Process 2014; 43: 76–89. reduction. IEEE Trans Ind Appl 2017; 53: 3086–3097.
24. Li W, Zhu Z, Jiang F, et al. Fault diagnosis of rotating 29. Duda RO, Hart PE and Stork DG. Pattern classifica-
machinery with a novel statistical feature extraction and tion. 2nd ed.. NY: John Wiley & Sons, Wiley-
evaluation method. Mech Syst Signal Process 2015; Interscience, 2001.
50–51: 414–426.

Das könnte Ihnen auch gefallen