Analysis of Permanent Magnet Synchronous Motor

310 IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 68, NO.
2, FEBRUARY 2019
Analysis of Permanent Magnet Synchronous Motor

Fault Diagnosis Based on Learning
I-Hsi Kao, Wei-Jen Wang, Yi-Horng Lai, and Jau-Woei Perng , Member, IEEE
Abstract— This paper presents an effective diagnosis algorithm possible to diagnose ball-bearing faults based on the vibration
for permanent magnet synchronous motors running with an signal using an artificial neural network and support vector
array of faults of varying severity over a wide speed range. machine [9]. Sugumaran and Ramachandran [10] used a deci-
The fault diagnosis is based on a current signature analysis.
The complete fault motor diagnosis system requires the extraction sion tree as a fuzzy classifier in the fault diagnosis of a roller
of features based on the current method and a subsequent method bearing based on its vibration signal. There are several motor
for adding classifications. In this paper, we propose two feature fault diagnosis methods based on vibration signal analysis.
extraction methods: the first involves a classification method Li et al. [11] developed a neural-network-based rolling bearing
that utilizes a wavelet packet transform and the second is a fault diagnosis. In [12], a diagnosis method based on wavelet
deep 1-D convolution neural network that includes a softmax
layer. The experimental results obtained using real-time motor analysis was applied to the feature extraction for mechan-
stator current data demonstrate the effectiveness of the proposed ical vibration signals. Tamilselvan and Wang [13] presented
methods for real-time monitoring of motor conditions. The results a novel multisensor health diagnosis method using a deep
also demonstrate that the proposed methods can effectively belief network based on a restricted Boltzmann machine. This
diagnose five different motor states, including two different method trained the layers one by one as a deep network struc-
demagnetization fault states and two bearing fault states.
ture. This paper showed that the deep belief network classifier
Index Terms— Convolution neural network (CNN), deep model yielded good results in complex systems. Vibration
learning, fault diagnosis, motor current signature analysis, signal analysis often encounters problems when noise cannot
permanent magnet synchronous motor (PMSM), wavelet package
transform. be fully suppressed. If the noise is loud, the accuracy of the
diagnosis system reduces. Thus, to decrease noise and increase
the feature learning ability, the maximum correntropy was used
I. I NTRODUCTION
to design a new, stacked autoencoder (SAE) loss function [14].
M ECHANICAL fault diagnosis is an important issue

in modern industries. The advent of industrial automa-
tion has enabled fast production of large quantities of products;
Some studies have determined the fault signal and disturbance
depending on complex math and deep theory in model-based
fault diagnoses [15], [16].
however, equipment maintenance still requires human inter- In machine learning, a convolutional discriminative feature
vention. As technology advances, the need for human inter- learning method has been developed for the fault diagnosis
vention in the fault diagnosis process has gradually decreased. of induction motors [17]. This method is more effective and
In recent years, machine learning has become a popular robust than other state-of-the-art methods. Although vibra-
technique. Many studies have demonstrated that both unsu- tion signal analysis yields good results in fault diagnosis,
pervised and supervised machine learning can be effectively the removal of background noise is often a complicated
used in fault diagnosis [1]. Many unsupervised learning process. In addition, vibration measurement suffers from the
systems have been applied in industry, such as the K -means uncertainty of installation location and effects of adjacent
algorithm [2], principal component analysis [3], [4], and self- machines. The need to install the sensor on the motor presents
organizing maps [5]. Supervised learning systems, such as an additional cost.
artificial neural networks [6], support vector machines [7], Utilizing current signal analysis to conduct motor fault
and partial least squares [8], also exhibit strong performances diagnosis reduces the problem of background noise and effects
in engineering applications. These applications demonstrate of other mechanical interference. In addition, if a Hall sensor
that machine learning can effectively solve or improve a is incorporated into the inverter to detect the stator current
variety of engineering problems. of the motor, there is no need to install additional sensors.
Numerous mechanical fault diagnoses based on vibration One method, based on coupled circuits and winding function
signal analysis are currently available. However, it is also theory for squirrel cage induction motors, has been proposed
Manuscript received November 7, 2017; revised May 10, 2018; accepted to successfully simulate the steady state and transient response
May 22, 2018. Date of publication June 29, 2018; date of current version of a normal induction motor. It also simulates the motor with
December 24, 2018. The Associate Editor coordinating the review process stator and rotor faults without changing the structure of the
was Dr. Massimo Lazzaroni. (Corresponding author: Jau-Woei Perng.)
The authors are with the Department of Mechanical and Electro-Mechanical model [18]. Using both the measurement data of the stator
Engineering, National Sun Yat-sen University, Kaohsiung 804, Taiwan current and the mechanical vibration spectra as the source
(e-mail: jwperng@faculty.nsysu.edu.tw). signal to train the neural network, we can diagnose problems
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. in induction motors, such as rotor stator and rolling bearing
Digital Object Identifier 10.1109/TIM.2018.2847800 faults [19].
0018-9456 © 2018 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
KAO et al.: ANALYSIS OF PMSM FAULT DIAGNOSIS 311
Several studies have incorporated frequency domain The feature frequency and mechanical angular frequency can
analyses to diagnose faulty motors. According to [20], be expressed in the mathematical form [25].
the Daubechies 3 wavelet can be used to analyze the mother We focus on the bearing and demagnetization failures of
wavelet function by extracting the features through a wavelet PMSMs to develop an effective diagnostic method. Bearings
packet transform (WPT). The four nodes of the switching within rotating machinery directly affect the efficiency of
current were used for the energy characteristic index, which mechanical operations, and demagnetization problems result
serves as the basis for evaluating the fault. An artificial neural from the motor handling process or rotor magnet magnetiza-
network was then developed to identify the occurrence of the tion operation error.
fault in the converter. This system can detect a common three- In this paper, we present two feature extraction methods.
phase winding open circuit between the two phases that belong The first is an unsupervised learning technique that incor-
to an electrical-class fault detection. In [21], short- and open- porates a 1D-CNN and WPT. The second is a classification
circuit electrical faults were successfully detected, as were method that combines supervised learning and a softmax layer
simulated mechanical failures. The method for simulating the to classify PMSMs into five states. The experimental results
motor and load between the gearbox involved some degra- demonstrate that our methods can effectively diagnose five
dation of the gear. The feature extraction method involved different motor states running at speeds of 150–3000 r/min.
the application of the time–frequency analysis through the The results of the 1D-CNN method are better than those of the
stator current that incorporated a short-time Fourier transform, other methods. Therefore, we recommend using the 1D-CNN
undecimated WPT, Wigner distribution, and Choi–Williams for feature extraction and the softmax layer for classification
distribution of observing the magnetic field in the ac motor because the 1D-CNN has higher accuracy and requires less
guide current. A multidimensional discriminant analysis was human intervention for feature extraction.
then used to reduce the dimension of the features. The process The contributions of this paper are as follows:
was completed using a K -means algorithm. According to
the results of the classifier, the use of an undecimated 1) In [17], a 1D-CNN was used to diagnose motor faults by
WPT to deal with the current produced better classification extracting features from vibration signals. We used the
results. stator current as the analysis signal in our representation.
Several studies have used time-domain analysis to diagnose If the stator current is used as a feature extraction source
faulty motors. In [22], a fast and accurate motor condi- (instead of vibration signals), noise-related problems can
tion monitoring and early fault detection system was devel- be disregarded.
oped using 1-D convolutional neural networks (1D-CNNs). 2) In [22], an 1D-CNN was used to diagnose motor faults
The system has an inherent adaptive design that combines by extracting features from the current signal. However,
the feature extraction and classification phases of motor fault Ince et al. [22] only classified motors into two failure
detection into a single learning body. The experimental results modes, whereas our representation can classify motors
obtained using real motor data demonstrate the effectiveness of into five different failure modes while maintaining a high
the proposed method for real-time motor condition monitoring. degree of accuracy.
A fault diagnosis method based on an adaptive neurofuzzy 3) In addition, Ince et al. [22] only considered one
inference system, in combination with decision trees, has speed setting. Our study examines speeds in the range
also been presented [23]. This employed a classification and of 150–3000 r/min. The wider the coverage, the greater
regression tree (one of the decision tree methods) as a feature will be the diversity of features, but the additional
selection procedure to select pertinent features from the data feature dispersion does make classification more diffi-
set. Both frequency- and time-domain analyses’ results indi- cult. Although our conditions are more stringent, we can
cate that the model has potential applications in the fault still prove the effectiveness of our approach.
diagnosis of induction motors. 4) In summary, we propose a traditional feature extraction
Focusing on the demagnetization of a permanent magnet method or WPT and the 1D-CNN method and analyze
synchronous motor (PMSM) with bearing failure, and by the differences between them.
observing the current spectrum of the PMSM under demag-
netization failure, it is possible to develop a set of test The remainder of this paper is organized as follows.
flows that can effectively monitor its operation [24]. Using In Section II, we present the five motors used in the exper-
simulation software to establish a 2-D finite-element analysis, iments and describe the causes of the faults and reasons for
the PMSM can be used to compare the difference between designing the faulty motors. The harmonic distribution at each
simulation and experimental results. Furthermore, using analog speed is observed by measuring the time-domain amplitude
signals to promote the fault diagnosis process extends its signal and fast Fourier transform spectrum. In Section III,
applicability to a wider range of operating conditions. Bearing we describe the diagnosis system structure and explain
failure is usually accompanied by eccentricity problems, and the diagnosis steps. In Section IV, we describe the two
simple bearing problems have been comprehensively discussed feature extraction methods; 1D-CNN and WPT. In Section V,
in the literature. By analyzing the stator current spectrum of the experimental results obtained using real motor data are
an induction motor, we can detect and monitor the bearing presented to demonstrate the effectiveness of the proposed
failure status. Mechanical vibrations in the motor will be methods for real-time motor condition monitoring. The exper-
reflected in the air gap and mechanical angular frequency. imental results show that our methods can effectively classify
312 IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 68, NO. 2, FEBRUARY 2019
the five different states of the motors. Finally, Section VI

concludes this paper and suggests topics for future research.
II. PMSM FAULTS

In the event of motor failure, there are several important
parameters to be considered, such as the current magnetic
flux in the electrical parameter category and torque and speed
in the mechanical parameter category. The above signals are
widely used in fault diagnosis. Errors in stator frequency are
synchronized with the current frequency, and the operating
condition of the motor will respond to the current shock of the
driver. Hence, the current signal is an effective and feasible
focus or target of observation.
A PMSM is embedded in the rotor using permanent magnets
to produce a permanent magnetic field. By driving the inverter
ac, so that the stator winding produces a magnetic field,
the magnetic pole on the rotor will rotate synchronously with
the rotating magnetic field. The ac power for the synchronous
motor supplied by the driver will have specific multiple
relationships with the rotation of the motor and can be
expressed as
Ns = 120 × f s /2 p (1)
where f s (Hz) is the ac frequency provided by the convertor,
Ns (r/min) is the rotation speed of the motor, and p is the
number of pole pairs in the rotor of the motor. The formula for
Fig. 1. Stator current signal of the normal motor. (a) Time-domain current
converting electrical angular frequency to mechanical angular signal. (b) Frequency-domain current signal.
frequency requires the conversion of motor speed from r/min
to r/s. We then obtain the electric angle frequency/mechanical
angle frequency conversion equation distribution is not perfect. Connecting all of the peaks reveals
that the peak connection is a sinusoidal waveform and not a
fr = f s / p (2)
horizontal line. However, this is within an acceptable tolerance
where f s and fr are the electrical angular and mechanical for the amount of magnetization present.
angular frequencies, respectively. Once the electrical angular In the frequency field signal of the normal motor shown
frequency is determined, the motor speed is immediately in Fig. 1(b), the fast Fourier transform spectrum reveals a
known. According to the above descriptions, the concept of considerable amplitude in the 40-Hz range. Other ranges have
harmonic phenomenon can be described. a weak harmonic response but do not experience any obvious
In this paper, we diagnose five different PMSMs: a normal abnormalities. According to (1) and (2), the electrical angular
motor, motors with two different degrees of demagnetization frequency generated by the converter is 40 Hz. Our measure-
faults, and motors with two different types of bearing faults. ment results and formulas (1) and (2) are consistent. Although
By measuring and analyzing the motor under a variety of there will be some transmission delay in the measurement
operating conditions, we can describe the integrity of each experiment, this does not greatly affect the results.
fault situation as the basis for the development of a fault
diagnosis system. B. Demagnetization Fault in PMSM
The permanent magnets embedded in the rotor of the PMSM
A. Normal PMSM are subject to the high current generated by the heat from
To explore the differences in signal composition between magnetic attenuation, which is known as demagnetization.
normal and faulty motors, it is necessary to use the normal The diagnosis of a demagnetization failure has important
motor as the control in the experiment. This allows us to applications in several scenarios such as when designing a low-
directly compare objects with various faults and observe the volume, high-power motor. In particular, the electric car and
difference in the position of fault features. aviation industries would benefit from this aspect of diagnosis
We measured 600 r/min steady-state speed under the current technology, where the possibility of detecting even partial
time-domain signal of the normal motor, as shown in Fig. 1(a). demagnetization could have a significant impact.
The signal is composed of sine wave combinations, and the In our experiments, we utilized two types of demagnetiza-
period of the sine waves is the period in which the driver tion faults, each with a different degree of demagnetization.
operates the motor. Furthermore, by observing the change The first demagnetization fault was created by replacing one
in the size of each peak, we can see that the magnetic of the permanent magnets in the rotor with aluminum pieces
Fig. 2. Stator current signal of the demagnetization fault in the motor. Fig. 3. Stator current signal of the semi-demagnetization faults in the motor.
(a) Time-domain current signal. (b) Frequency-domain current signal. (a) Time-domain current signal. (b) Frequency-domain current signal.
of equal weight. (Aluminum has no magnetic properties). magnetic potential affects not only the main frequency but also
This method ensures that the balance of the motor dynamics other harmonics. Using high- and low-frequency harmonics
is not affected, preventing eccentricity failures. The second that are close to the main frequency, the frequency of the fault
demagnetization fault was created by reducing the magnetic feature can be determined by [27]:
flux of one of the permanent magnets. We refer to this as the
semi-demagnetization fault herein.
By observing the magnetic field, we can compare the spatial f dmg = f s (1 ± k/ p) , k = 1, 2, 3, . . . (3)
distribution of the magnetic flux densities of the normal
motor and motors with demagnetization faults. Decreasing the where f dmg is the fault feature frequency, f s is the electrical
magnetization of one of the permanent magnets caused the angular frequency, and p is the number of pole pairs in the
magnetic flux density distribution of the entire motor to lose rotor.
uniformity, resulting in significant changes to the current. According to (3), the electrical and mechanical angular
When the motor with the demagnetization fault is at rest, frequencies are calculated as 40 and 10 Hz, respectively,
it is possible to consider the rotor being pulled in a certain when the motor is operating at 600 r/min. By examining
direction by manually rotating the rotor. The differences the frequency domain of the stator current signal, the lowest
between the two levels of demagnetization failure lie in the harmonic is the mechanical angular frequency 10 Hz; this
difference in the strength of attraction. Therefore, the fault is called a baseband. The motor will have a different main
diagnosis system designed in this paper can determine the frequency and baseband while operating at different speeds.
degree of failure [26]. During the experiments, the feature frequency of the demag-
By measuring the demagnetization fault, we observe that netization fault occurs when k = 1, 9, 10, and 19, as shown
the demagnetization phenomenon leads to a nonsinusoidal in Fig. 2(b).
component in the current signal. Therefore, nearby waveforms By observing the motor with a semi-demagnetization fault,
are also affected. The current signal when the motor with we determined that the peaks of this motor are relatively small,
demagnetization fault is operating at 600 r/min is shown as shown in Fig. 3. In the frequency domain, the frequency
in Fig. 2(a), which is significantly different from the current features of the semi-demagnetization and demagnetization
signal generated by the normal motor. faults are the same, but their amplitudes are slightly smaller.
Furthermore, by observing the frequency domain of the In summary, our algorithm is capable of classifying the stator
stator current, we can see that the uneven distribution of current of motors, although these are very similar.
Fig. 4. Specification and size of the bearing.
C. Bearing Fault in PMSM

According to the literature, approximately 40% of motor
failures can be attributed to bearing faults. Even under
balanced loads and normal operating conditions, bearings may
become susceptible to fatigue damage from prolonged use.
The reason may be that pieces of the ball peel away because
of fatigue and fall into the raceway, resulting in increased rota-
tional friction. According to this description, the mechanical
failure of bearings can be caused by three types of defects:
outer ring, inner ring, and ball defects. These faults often
become serious when the motor has a complex fault problem.
During operation, the resistance of the entire system is
reflected as extra harmonics in the stator current signal because
the rotor is affected by the bearing state. The feature frequency Fig. 5. Stator current signal of the motor with damaged inner ring.
of the mechanical failure of the bearing is affected by vibra- (a) Time-domain current signal. (b) Frequency-domain current signal.
tions between the mechanism and element, but the stator
current detection method is not affected by these vibrations.
In other words, the stator current is not disturbed by external metal fatigue in the form of raceway cracks. A drill on
factors. Instead, it is disturbed because the relative movement the inner ring was used in the processing method to create
between the stator and rotor is affected by air gap changes, four small holes. The holes were symmetrical to maintain
resulting in current wave disturbances. rotational balance. The second bearing fault was designed to
By adopting the motor current signal analysis method, simulate a bearing raceway produced in a factory environment,
we can detect bearing faults based on the harmonics generated where it was subjected to a large amount of fine iron powder.
by the current. The frequency of the harmonic is obviously The processing method involved the removal of the bearing
affected by the electrical angular frequency. Because bearing and adding soft aluminum powder to the raceway. The motor
damage can introduce eccentricity to the rotor, the bearing was subjected to higher friction during rotation, but no sound
behavioral damage and eccentricity problems are described by or vibration could be observed directly.
the same formula. According to [28] and [29], the relationship Observations of the motor with the first type of bearing fault
between bearing eccentricity and electrical angular frequency (damaged by drilling the inner ring), with the ball striking
is defined by the borehole, showed that the disturbance current caused the

f bng = fs ± m f i,o (4) waveform to turn slightly at the crest. However, it was not
easy to observe the difference between these failure mode
where f i,o is related to the specification of the bearing as motors [Fig. 5(a)] and the normal motor. The observations
of the stator current signal of this failure mode motor in the
n db
f i,o = fr 1 ± cos β . (5) frequency domain are shown in [Fig. 5(b)]; there were some
2 pd harmonics in addition to the main frequency.
Here, m is a positive integer, n is the number of balls in the The observations of the other type of bearing fault (bearing
bearing, f i,o is the feature vibration frequency, fr is the raceway filled with aluminum powder) showed that the
mechanical angular frequency, db is the diameter of the ball, increase in rotor friction caused the amplitude of the stator
and pd is the diameter of the raceway. The relevant mechanical current to rise slightly [Fig. 6(a)]. Because there is less noise
parameters of the bearing are shown in Fig. 4. when this failure mode motor is running, it is difficult to
In the experiments, motors with two types of bearing faults diagnose faults using a microphone. In the stator current of
were examined. The faults were located in different parts this failure mode motor, the harmonic distribution differed
of the bearing. One of the bearing faults was designed to from that of the demagnetization fault. However, the amplitude
simulate a bearing inner ring that had been damaged by of the motor with a bearing fault was not as obvious as the
methods can be roughly divided into data collection and data

training, as shown in Fig. 8. For both 1D-CNN and WPT,
the data collection method is the same. However, the data
training methods yield significantly different results.
A. Data Collection
The sample rate must be increased to 400 Hz to comply with
the Nyquist sampling theorem because the maximum speed
of the motor is 3000 r/min and the maximum electric angular
frequency is 200 Hz. In fact, the sample rate must be 10–20
times that of the Nyquist standard. Because of the operating
principle of the oscilloscope and staggered real-time sampling
architecture, the sample times are not the same. Furthermore,
the fault signal contained in the harmonics may produce high-
frequency components. Therefore, it is important to set aside
sufficient information.
To handle the wide range of electrical angular frequencies
(10–200 Hz), it is estimated that the sample rate should be
5 times the Nyquist standard, which is 2 kHz. From the fast
Fourier transform spectrum, we can determine whether there is
an aliasing effect within the 1-kHz spectrum. While observing
the high-frequency components, the >1-kHz spectrum may
contain some information. To avoid generating a large amount
of redundant data, we selected a sampling rate of 2 kHz for the
system specifications. However, this sampling rate also covers
Fig. 6. Stator current signal of the motor with added aluminum powder. the electrical angular frequency of the lowest 10 Hz of the
(a) Time-domain current signal. (b) Frequency-domain current signal. time information. Thus, we selected 2000 samples per second
for the fault diagnosis system specifications.
For every failure mode of the motor, we collected 400
amplitude of the motor with demagnetization fault. Therefore,
measurements for a total of 2000 samples. After every collec-
we can infer that the two bearing faults had not yet strongly
tion, the samples were labeled according to the failure mode
affected the smooth operation of the motors. However, their
of stator current signal extracted from the motor. The label
operation clearly varied from the normal motor, as shown
will be used in the supervised learning step. A large amount
in Fig. 6(b).
of data was effectively used for data training.
D. Differences Between the Five Signals

B. Data Training
We plotted the signals using the Yule-Walker method and
folded them together, as shown in Fig. 7. In Fig. 7, the After data collection, the most critical algorithm in this
line series represent the stator currents of the motors in the paper was employed for data training. There are two different
frequency domain for the normal motor (red line), motor with data training methods proposed in this paper.
a demagnetization fault (yellow line), motor with a semi- First, the samples were divided into two parts: one part that
demagnetization fault (green line), motor with a damaged inner includes the training data and the other the test data. Second,
ring (blue line), and motor with added aluminum powder the features were extracted from the data. There are two feature
(purple). In Fig. 7, the frequency of the demagnetization extraction methods proposed in our representation within the
fault is easy to observe because it can directly affect the 1D-CNN and WPT methods. The details of the feature extrac-
disturbance of the stator current. However, the bearing fault tion will be described in Section IV. Third, we substituted
signal is similar to that of a normal motor. This is also one the features and labels into the classification framework for
of the advantages of our algorithm. Through the test method training. We only trained the artificial neural network with
proposed in this paper, we were able to completely diagnose training data, and then verified the artificial neural network
the failure mode, undetectable to the human eye. using the test data to check whether the network suffered from
overfitting. Finally, once the number of training cycles reached
III. FAULT D IAGNOSIS S YSTEM S TRUCTURE the preset number of epochs, we stopped the training and
tested the network with the test data to calculate the accuracy.
In this paper, we examine two types of feature extrac-
tion methods, 1D-CNN and WPT. The 1D-CNN extracts the
features from the stator current based on the time-domain IV. F EATURE E XTRACTION
signal, whereas WPT extracts features from the stator current Two feature extraction methods are proposed in this
based on both the frequency and time domains. The diagnosis paper. WPT represents a traditional feature extraction method,
Fig. 7. Stator current signal of the five failure modes in the frequency domain.
on the frequency domain and is very similar to the Fourier

transform method. The two methods are based on a generating
function to transfer signals. WPT is based on a limited-length
function, while the Fourier transform is based on a sinusoidal
function.
A wavelet is defined as a small area of waves. These are
waves that have a special length, and the average of the wave
values is 0. The mathematical representation of WPT is
W Tψ {x}(a, τ )
∞
1 t −τ
= x, ψa,τ = √ x(t)ψ dt, a > 0 (6)
a −∞ a
where x, ψa,τ is the wavelet packet coefficient, a is the scale

factor, and τ is the translation factor. When |a| < 1, the length
of the mother wavelet is compressed, providing a good degree
of support in the high-frequency range. Conversely, when
|a| > 1, the length of the mother wavelet is enlarged, offering
good support in the low-frequency range. This phenomenon
can be likened to scaling a lens to zoomed-in and zoomed-
out [30]. The translation factor τ is used to determine the
Fig. 8. Motor fault diagnosis algorithm location of the mother wavelet. This phenomenon can be
thought of as the tracking of a lens to determine the focal point.
Accordingly, WPT is known as a mathematical microscope.
whereas 1D-CNN is a more modern approach. Although the WPT uses a window function that is fixed but can change
accuracies of the two methods are significantly different, shape, and the time and frequency window functions utilize a
we selected 1D-CNN; the reasons for this are discussed local analysis method.
in Section VI. Thus, WPT has good time resolution at high frequencies and
good frequency resolution at low frequencies. Comparing this
A. Wavelet Packet Transform to the short-time Fourier transform, in which the resolution
In this paper, we proposed a frequency analysis method to cannot change with the frequency and is set by the size of the
diagnose faulty motors. WPT is a signal analysis method based window function, WPT is a powerful signal analysis tool that
TABLE I
F REQUENCY BANDS OF D ISCRETE WAVELET T RANSFORM
Fig. 9. Process of the DWT.
can adjust the appropriate window function in the frequency

domain, before obtaining the appropriate signal resolution. expressed as
Computationally, the discrete wavelet transform (DWT)
uses the original signal after the discrete sampling method.
N Ci2j
In the experiments, we used Daubechies 24 as the mother E i = 100 · a+1 N (9)
2
j =1 i=1 j =1 Ci j
wavelet function. While Daubechies 24 can amplify high-
frequency signals, fault harmonics often exist at high frequen- where j is the variable of sample point N. In our experiments,
cies. DWT is a discrete input and discrete output system. N = 2000, i is the hierarchical parameter of multilevel
At present, there is no sample or clear formula that can directly decomposition, and a is the maximum number of layers.
describe the relationship between the input and output; hence, Therefore, i = 1, . . . , 7, (7 + 1) and Ci j is the wavelet
this is generally represented as a hierarchical architecture. transform coefficient of the layer.
To make some necessary assumptions, we must clearly The following steps describe the designed procedure of the
define the signal and filters. Given an input signal S of WPT
length N, the structure will produce at most log2 N layers
1) Step I: Choose the practicable mother wavelet function.
of the discrete Fourier transform. After inputting S, we obtain
The Daubechies mother wavelet (db24) was chosen
two sets of coefficients, namely the approximation and detail
in this paper. Check the central frequency, as well as
coefficients, c A1 and cD1 , respectively. The two sets of
the lower and upper frequency bounds associated with
coefficients are produced by the convolution of the input signal
each scale according to the requirement of the motor
and a low-pass filter, and the convolution of the input signal
speed.
and a high-pass filter; a down-sampling filter then gives c A1
2) Step II: Evaluate and calculate the percentage of energy
and cD1 , as shown in Fig. 9 [31]. The next step is to divide
associated with each coefficient of each scale using (9).
c A1 into c A1+1 and cD1+1 by the same method described
The energy-related WPT features are then extracted.
above. By repeating this decomposition and if the original
Frequently, the demagnetized fault of motor stator current
signal S needs to be decomposed into j layers, we can obtain
will face the same or similar position in the frequency domain,
the coefficient vectors [c A j , cD j , . . . , cD 1 ].
but the overall signal strength will be different. The same
According to [32], cD j is the high-frequency signal of the
failure mode of a situation but different magnitudes causes
j th layer. The signal contains the original signal coverage in a
some confusion. To correct this deficiency, we added two
specific frequency band section and is described in (7). c An
features to the parameters of the total energy intensity. The first
is the low-frequency signal of the nth layer, where n is the
parameter is the amplitude of the frequency domain signal.
maximum value of j . The signal contains the original signal
While the same motors operate at different speeds, they
coverage in a specific frequency band section and is described
may produce different amplitudes in the frequency domain.
in (8)
The second parameter is the amplitude of the time domain.

1 1 The stator currents of the motor with a damaged inner ring and
freq cD j ∈ , Hz (7)
2 j +1 T 2i T the motor with aluminum powder exhibit different amplitudes
in the time domain. The stator current of the motor with
1
freq (c An ) ∈ 0, n+1 Hz (8) aluminum powder is greater than that of the damaged inner
2 T
ring motor.
T is the reciprocal of the sample rate and is 1/2000 s in the After DWT, and the extraction of the time- and frequency-
case considered here. Using (7) and (8), we calculated the domain amplitudes, we completed the feature extraction of
frequency bands of the DWT which are listed in Table I. the motor, as shown in Fig. 10. There are 10 features that can
To compare different failure mode motors and calculate the be used to describe the stator current of the motor. Finally,
absolute strength of the fault features, we used the state of the softmax layer classifies the signal features into five failure
energy distribution as a fault motor signal feature. Therefore, modes. Assuming we have an array V , where Vi represents
the size of the time-domain signal must be subtracted before the i th element in V , the softmax value of this element is
the DWT. After the DWT, c A j and cD j must be normalized. given in (10). A diagram and description of softmax is shown
The percentage of energy in the calculated coefficients can be in Fig. 11.
Here, x i:i+N−1 represents a window of length N for a sequen-

tial signal, starting from the point indexed as i to the point
indexed as i + N − 1, and operator ⊕ concatenates each data
sample into a longer embedding. In addition, a bias term b is
added to the convolution operation, making the final operation

z i = φ ω T x i:i+N−1 + b (12)
where φ is a nonlinear activation function, such as the sigmoid,
tangent, or rectified linear unit (ReLu). In this paper, ReLu
is chosen as the nonlinear activation function. Each specific
vector ω can be regarded as a filter. The signal value z i can be
regarded as the learning feature for the window of sequences
in the corresponding filter. In addition, the filter width N
Fig. 10. WPT feature extraction and softmax layer. can be varied. Because each raw sequence of length D has
multiple windows of sequences {x 1:N , x 2:N , . . . , x D−N+1:D },
a feature map is obtained. The raw sequence D and j th filter
are given by

z j = z 1j , z 2j , . . . , z D−N+1
j . (13)
A pooling layer is applied to the feature maps generated by
the convolutional layer. Each feature map contains multiple
activations of local patches. After pooling, one value is
extracted from each feature map. The purpose of pooling is
twofold. First, the pooling operation will extract the most
Fig. 11. Softmax diagram. important and invariant information by considering that a
single feature is irrelevant to the active positions in each
feature map. Second, the pooling operation reduces the feature
The main application of the softmax layer is multiclas- dimensionality and enables input signals with varying lengths
sification. The sigmoid function can only be divided into to be handled. In this paper, the max-pooling function was
two categories, whereas softmax can be divided into multiple applied. Max pooling with a pooling length of l is conducted
categories. The softmax function is essentially an expansion in the feature maps obtained by the convolutional layer.
of the sigmoid function The single feature corresponding to the filter is then obtained

as
Si = e Vi / e Vi . (10)
j p j = p 1j , p2j , . . . , plj (14)
(t −1)l+i
The Adam method [33] with a cross-entropy error func- ptj = max z j (15)
tion was used to train the parameters of the softmax layer.
The results are discussed in Section V. where p j is an s-dimensional vector output by the pooling
layer applied to the j th feature map. s = (D − N + 1)/l,
B. 1-D Convolutional Neural Networks where D, N, and l are the length of input, filter size, and
pooling size, respectively. Multiple filters are generally applied
CNNs are deformed networks derived from artificial neural with different weights and different window sizes to derive a
networks and are widely used in image processing and video feature vector. Assuming there are K filters, the output of the
recognition systems [34]. The most popular CNN model is the pooling layer is the concatenation of all the filter outputs
Alex model for large-scale data sets, such as ImageNet [35].
However, regular CNN architectures are not suitable in our R = [ p1, p2 , . . . , p K ]. (16)
case for the following reasons. CNNs usually have a very
It is clear that R can be the final representation of the
deep structure, which result in overfitting for most data sets.
input sequence because the outputs of different filters capture
Furthermore, most CNN structures are suitable for 2-D training
all meaningful structure patterns of the sequence. After the
images, but our data sets are in the 1-D time domain. For these
concatenation of filter outputs, the dimension of the output of
reasons, we must alter the structures into a deep 1D-CNN.
the pooling layer, as the learned representation, is Rs .
The input of the convolutional layer is a sequential signal
When designing a CNN, the following concepts must be
of length D. The filter, of window size N, is applied to
satisfied: the filter size goes from big to small and the filter
the entire input signal. Specifically, the convolution operation
quantity from more to less. A pooling layer must be connected
in the convolutional layer can be defined as a multiplication
after the convolutional layer, and batch normalization must be
operation between a filter vector ω ∈ R N and convolution
connected after the pooling layer. Therefore, we propose to
vector representation x i:i+N−1 , which is given by
adjust the parameters based on the above principles, relying
x i:i+N−1 = x i ⊕ x i+1 ⊕ · · · ⊕ x i+N−1 . (11) on experience and past experiments to tune the parameters
TABLE II
PARAMETERS FOR O UR CNN
Fig. 12. Flowchart of the CNN design process.
Fig. 14. Motor fault diagnosis system hardware architecture.
The Adam method [29] with the cross-entropy error func-

tion was used to train the parameters of the CNN model.
Based on extensive experiments investigating the recall and
Fig. 13. CNN architecture of the proposed method. accuracy, a fixed learning rate of 10−5 was assigned for our
application. To generalize the ability of the CNN and avoid
the overfitting issue, the dropout regularization [37], [38] and
batch normalization techniques were combined to obtain an
of the neural network. A simple design flowchart is shown effective training model with a better performance.
in Fig. 12. Finally, the softmax layer classified the signal features into
After tuning the experimental results, our CNN architecture five failure modes. Through the proposed layers, the archi-
generally had eight convolution layers, eight max-pooling tecture of our CNN was completely characterized. The CNN
layers, one fully connected layer, and nine batch normalization provided excellent results in the diagnosis of faulty motors.
layers, as shown in Fig. 13. Convolution layers 1, 2, 3, 4, 5, The experimental results and experimental analysis are
6, 7, and 8 had 64 filters with a size of 256, 128 filters with a presented and discussed in Section V.
size of 128, 192 filters with a size of 64, 256 filters with
a size of 32, 320 filters with a size of 16, 384 filters with a
size of 8, 448 filters with a size of 4, and 512 filters with V. E XPERIMENTS
a size of 2, respectively. These layers were configured to In this paper, we used the motor stator current signal
act as a hierarchical feature extractor. After each convolution analysis method for the fault diagnosis experiment. The exper-
layer, there was a batch normalization process [36], and then imental hardware architecture included an inverter, a motor,
a max-pooling layer with two kernels. This improved the a current sensor, an oscilloscope, a power supply, and a
generalization of the network, enhancing resistance to small PC, as shown in Fig. 14. Through the appropriate control
distortions and reducing the dimensionality. theory, we let the motor reach a steady speed and reused the
The CNN model turns a lower level map into a higher current sensor in series with the phase measurement output
level feature. For each convolution and pooling layer, ReLu current, before using the oscilloscope for digital sampling.
is adapted to further generate high-level nonlinear feature The power supply provided an accurate voltage to drive the
responses. Next, one fully connected layer with a softmax current sensors, ACS711EX. Finally, the stored oscilloscope
classifier is used to generate the diagnosis result for a specific stator current signal was sent to the computer for diag-
current signal. The CNN parameters are presented in Table II. nostic analysis. ACS711EX can tolerate bidirectional currents
from −31 to 31 A with a 100-kHz bandwidth. When the TABLE III

supply voltage Vcc was 3.3 V, the measured current was D IAGNOSIS A CCURACY OF THE M ETHODS
converted into voltage output using the following formula:
Vcc Vcc
Vout = +i × . (17)
2 73.3
Therefore, the linear slope of the input current to output
voltage was 45 mV/A. Using the following formula,
the measured output voltage can be converted to the original
current: 988 stator current data sets were successfully diagnosed,
Vout giving a diagnostic accuracy of 98.8%. The individual accu-
i = 73.3 × . (18) racy rates show that the 1D-CNN can efficiently extract
Vcc
features from the stator current of the motor, and the softmax
After extracting the stator current, the PC provided the layer can efficiently classify the extracted features. For the
diagnosis result based on our algorithms. The experimental WPT using 10 features (WPT-10), 981 stator current data
data set contained 2000 data points covering five failure modes sets were successfully diagnosed with an overall accuracy
and operating at 20 different speed settings. Each failure mode of 98.1%. For the WPT using eight features (WPT-8), only
and speed of stator current was collected 20 times to obtain 887 stator current data sets were successfully diagnosed with
the 2000 data points. To demonstrate that the signal processing an accuracy of 88.7%. We also compared the feature extraction
method could be used in feature extraction, the experiments performance of a SAE incorporating a softmax layer. Recently,
required numerous data sets to show the reliability of the the autoencoder concept has been widely used for learning
method. We divided the 2000 data sets into 1000 training generative models of data [39]–[42]. For the SAE, 964 stator
and 1000 testing data sets. Prior to the division, all data were current data sets were successfully diagnosed giving an overall
normalized to values in the range [−1, 1]. accuracy of 96.4%. Although the results were not as good
as those of the 1D-CNN, the SAE still achieved an accuracy
A. Results above 95%. This represents a good performance for a motor
diagnosis. Based on the above analysis, it is obvious that the
For this application, we evaluated the 1D-CNN architec-
1D-CNN is more effective than the other two methods.
ture through extensive experimentation. To train the 1D-CNN
architecture, 1000 training and 1000 validation set samples
with an equal number of the five sample failure modes were B. Analysis of Results
selected. The same data set was used in the proposed method The learning curves of the algorithms are shown in Fig. 15.
and WPT. To show the merits of each method, we used the The final accuracy of the CNN is superior to that of the
following formula to calculate the diagnostic accuracy rate: other methods, but this method also needed more epochs
thn +tdf +tsdf +tdir +tfip to converge. The main reason is that in training 1D-CNN
acc = networks, feature extraction and classification are combined
thn +tdf +tsdf +tdir +tfip + f hn + f df + f sdf + f dir + f fip
into a single architecture. In contrast, WPT extracts features
(19)
prior to classification. Thus, WPT only needs to train the
where thn , tdf , tsdf , tdir , and tfip represent true classifica- classification softmax layer. Although the SAE needs to train
tions of the normal, demagnetizing fault, semi-demagnetizing the feature extraction and classification networks, the two
fault, damaged inner ring, and aluminum powder motors, architectures are trained one by one, unlike the 1D-CNN.
respectively; and f hn , f df , f sdf , f dir , and f fip represent the Accordingly, in Fig. 15, we only provided the learning curve
corresponding false classifications. In addition, we calculated for the classification phase of the SAE.
the accuracy of the individual diagnoses using the following The 1D-CNN not only achieves a high level of general-
equation: ization but also avoids the need for manual parameter tuning
ttype in hand-crafted feature extraction. Its weight-shared network
acci = (20) structure makes this CNN similar to a biosphere network.
ttype + f type
The CNN reduces the complexity of the network model and
where ttype and f type are the true and false classifications, a number of weights. The proposed network structure creates
respectively. Thus, we are not only concerned with the total the signal directly as a network input to avoid the traditional
accuracy rate, but also the individual accuracy rate for each identification algorithm during the complex feature extraction
state. and data reconstruction process. The CNN has a high degree
The accuracies of different methods can be calculated of invariance in translation, scaling, and tilting. The weight-
using (19) and (20), and are listed in Table III. Health sharing strategy reduces the need to train the parameters.
normal motor, demagnetized fault motor, semi-demagnetized The same weights can create a filter from the signal position
fault motor, damaged inner ring motor, and filled in powder to detect the signal features and generalize the training of the
motor represent the true classifications of the normal, demag- model. The pooling operation reduces the spatial resolution of
netizing fault, semi-demagnetizing fault, damaged inner ring, the network, thereby eliminating the small offset and distortion
and aluminum powder motors, respectively. For the 1D-CNN, of the signal. The pooling operation produces a high degree
Fig. 15. Learning curves.
of translation invariance in the input data. However, when classification results. Therefore, we added two other features
training a deep model, it is prone to gradient dissipation, but that were not extracted by WPT: the amplitudes of the time
this negative phenomenon has little effect on the experimental and frequency domains. After adding these two features,
results. Our proposed 1D-CNN inherits all the advantages of the classification results showed a good accuracy rate that was
CNNs and can efficiently diagnose fault states in all five failure only slightly lower than that of the 1D-CNN.
modes. Consolidating the experimental results and discussion,
The SAE is very similar to the CNN. Both are unsupervised we recommend using the 1D-CNN for the fault diagnosis
learning methods that do not require manual parameter tuning of PMSMs. In addition to providing accurate diagnosis
for hand-crafted feature extraction. The SAE can use sufficient results, this technique is superior for the following reasons.
tag-free data for model pretraining and has a strong ability The 1D-CNN only needs to analyze the stator current in the
to characterize data. However, the diagnosis results are not time domain. The WPT cannot achieve good feature extraction
as accurate as those obtained by the 1D-CNN because the with only eight features, so it must add two features extracted
networks are all fully connected. Such networks have certain by the amplitudes of the frequency and time domains. In addi-
parameters that must be trained, which can cause overfitting. tion, we must determine the frequency band of the WPT to
As mentioned above, gradient dissipation can occur while complete feature extraction. This mechanism, requiring human
training a deep model. Another limitation of the SAE is that intervention to carry out feature extraction, appears to be
the input data must be translation invariant. For these reasons, slightly weaker than that of the 1D-CNN. Needless to say,
the diagnosis results of the 1D-CNN are superior to the results WPT requires other hand-crafted features to achieve good
of the SAE, which was expected. classification. Thus, the 1D-CNN appears to be more powerful
The biggest advantage of the WPT over Fourier transform than the WPT. Because the accuracy of the 1D-CNN is much
is that it is more suitable for dealing with transient signals. higher than that of the SAE, there is no doubt that it is a
The WPT has better feature extraction and recognition ability superior classification technique.
than does the traditional Fourier transform. Although the
WPT can efficiently extract features in the frequency domain,
the features extracted from a given failure mode are similar C. Contribution
but have different magnitudes. For this reason, using the In this paper, we proposed two fault diagnosis methods for
WPT alone to extract features can very easily yield poor PMSM: 1D-CNN and WPT. The two methods contained both
TABLE IV desktop PC would be capable of running the testing diagnosis

D IFFERENCE B ETWEEN P REVIOUS M ETHODS AND T HIS PAPER analysis algorithm in real time. Our analysis project could even
be ported to an embedded system because the testing does not
require the same computing power as the training.
VI. C ONCLUSION
A. Discussion of Results
unsupervised learning and supervised learning algorithms. The stator current signals of the motors with demag-
The unsupervised learning served as the feature extraction netization and semi-demagnetization faults, whether in the
method and supervised learning served as the classification time or frequency domains, were very similar. In addition,
method. The proposed system has the ability to learn how the waveforms from the motors with a damaged inner ring and
to extract optimal features with proper training and can, aluminum powder were very similar, regardless of domain.
therefore, be applied to any stator current of PMSMs. Although it is very difficult for the human eye to classify
The proposed system was tested using real motor current the five failure modes of motors, our proposed systems can
data, and the experimental results demonstrate its potential efficiently classify them with an accuracy of 98.8% (1D-CNN)
and effectiveness as a real-time PMSM condition monitoring and 98.1% (WPT). These accuracy rates of motor diagnosis
system. This system can be easily modified to include the are extremely high.
detection and classification of both mechanical and electrical Although our experiments used relatively small training data
faults with signatures in mechanical or electrical quantities, sets, this had little effect on the experimental results. In most
respectively. cases, when the training sample is too small, overfitting may
Ince et al. [22] proposed a diagnosis detection system to occur. However, in our experimental results, this phenomenon
detect motor faults using current signals based on a 1D-CNN. was not significant. Our diagnostic methods can determine the
However, the system could only classify the motors into two status of the motor at any speed.
groups; normal and faulty. Furthermore, the system could Although the training of the systems requires a high-
only diagnose motors that were operating at a specific speed. specification PC, the testing does not need such computational
In contrast, our proposed methods can efficiently diagnose five power. While the motor driver detects the stator current of
failure modes of PMSMs: a normal motor; and motors with PMSMs, we can diagnose PMSMs at any time while they are
a demagnetization fault, semi-demagnetization fault, damaged spinning.
inner ring and filled with aluminum powder. In addition, our
proposed system can diagnose motors operating at a wide B. Future Work
speed range.
In the future work, we intend to add the vibration signal
In [17], the proposed method also used CNN as a feature
to the feature extraction to increase the accuracy of the diag-
extraction method. The difference between our representation
nosis results. Using the stator current and vibration signals,
and [17] is that the source signal of our representation is the
a 2D-CNN can be applied in place of the 1D-CNN. Using
stator current signal, whereas [17] used the vibration signal.
more data for the CNN training will greatly improve accuracy.
When using the vibration signal as the source, noise distur-
In addition, we can collect more training samples to reduce the
bances must be considered. Conversely, when using the stator
overfitting of the networks. However, the degree of overfitting
current signal as the source, there is little need to consider
in our proposed 1D-CNN is already low.
the noise. In addition, the results from our proposed system
We expect the fault diagnosis system will be used in produc-
indicate a high accuracy rate. The differences between [17]
tion line status monitoring. By collecting the stator current data
and [22] and our representation are presented in Table IV.
of any fault and establishing a prediction model through the
diagnosis system, it will be possible to continuously monitor
D. Computer System the operational state of the motor in real time. In the future,
The experiments were conducted on a desktop PC with as long as motor drivers are equipped with Hall components,
Intel i7, 32-GB RAM, and Nvidia Tesla K80. Training the the inverter can be upgraded to a motor diagnosis machine.
1D-CNN required 2 s per epoch. Training the autoencoder Because the monitoring of motor states can improve the yield
required 0.2 s per epoch, with another training session of 0.2 s and extend the warranty of motors, while providing remote
per epoch to train the softmax layer. There was no need for diagnosis and analysis of the direction and quantity of prepa-
any additional training when using WPT to extract features, ration, it represents the next generation of motor design, likely
but the softmax layer required training for the classification, leading to policy improvements. Therefore, the proposed fault
taking 0.2 s per epoch. After the training of the artificial neural diagnosis system should play a pivotal role in Industry 4.0.
networks was finished, the diagnosis results were generated
once the stator current data were input. While conducting the R EFERENCES
training, a high-specification PC may be required for training
[1] X. Dai and Z. Gao, “From model, signal to knowledge: A data-driven
neural networks with a large number of neurons. However, perspective of fault detection and diagnosis,” IEEE Trans. Ind. Informat.,
a high-specification PC was not required during testing—any vol. 9, no. 4, pp. 2226–2238, Nov. 2013.
[2] A. Ahmad and L. Dey, “A k-mean clustering algorithm for mixed [25] R. R. Schoen, T. G. Habetler, F. Kamran, and R. G. Bartfield, “Motor
numeric and categorical data,” Data Knowl. Eng., vol. 63, no. 2, bearing damage detection using stator current monitoring,” IEEE Trans.
pp. 503–527, Nov. 2007. Ind. Appl., vol. 31, no. 6, pp. 1274–1279, Nov. 1995.
[3] W. Zhou, C.-M. Yeh, R.-J. Jin, Z.-H. Li, S.-L. Song, and J. Yang, [26] Y. Liang, “Diagnosis of inter-turn short-circuit stator winding fault
“ISAR imaging of targets with rotating parts based on robust prin- in PMSM based on stator current and noise,” in Proc. IEEE Int. Conf.
cipal component analysis,” IET Radar Sonar Navigat., vol. 11, no. 4, Ind. Technol., Busan, South Korea, Feb./Mar. 2014, pp. 138–142.
pp. 563–569, Apr. 2017. [27] S. Nandi, H. A. Toliyat, and X. Li, “Condition monitoring and fault
[4] A. Maligo and S. Lacroix, “Classification of outdoor 3D lidar data based diagnosis of electrical motors—A review,” IEEE Trans. Energy Convers.,
on unsupervised Gaussian mixture models,” IEEE Trans. Autom. Sci. vol. 20, no. 4, pp. 719–729, Dec. 2005.
Eng., vol. 14, no. 1, pp. 5–16, Jan. 2017. [28] M. El Hachemi Benbouzid, “A review of induction motors signature
[5] M. N. Bawane and K. M. Bhurchandi, “Classification of mental task analysis as a medium for faults detection,” IEEE Trans. Ind. Electron.,
based on EEG processing using self organising feature map,” in Proc. vol. 47, no. 5, pp. 984–993, Oct. 2000.
IEEE Int. Conf. Intell. Hum.-Mach. Syst. Cybern., Hangzhou, China, [29] J. R. Stack, R. G. Harley, and T. G. Habetler, “An amplitude modulation
Aug. 2014, pp. 240–244. detector for fault diagnosis in rolling element bearings,” IEEE Trans. Ind.
[6] Z. Szadkowski, D. Glas, K. Pytel, and M. Wiedeński, “Optimization of Electron., vol. 51, no. 5, pp. 1097–1102, Oct. 2004.
an FPGA trigger based on an artificial neural network for the detection [30] J. Rosero et al., “Simulation and fault detection of short circuit winding
of neutrino-induced air showers,” IEEE Trans. Nucl. Sci., vol. 64, no. 6, in a permanent magnet synchronous machine (PMSM) by means of
pp. 1271–1281, Jan. 2017. Fourier and wavelet transform,” in Proc. IEEE Instrum. Meas. Technol.
[7] S. U. Jan, Y. D. Lee, J. Shin, and I. Koo, “Sensor fault classification Conf., Victoria, BC, Canada, May 2008, pp. 411–416.
based on support vector machine and statistical time-domain features,” [31] S. G. Mallat, “A theory for multiresolution signal decomposition:
IEEE Access, vol. 5, pp. 8682–8690, May 2017. The wavelet representation,” IEEE Trans. Pattern Anal. Mach. Intell.,
[8] F. Y. Wu and H. H. Asada, “Implicit and intuitive grasp posture control vol. 11, no. 7, pp. 674–693, Jul. 1989.
for wearable robotic fingers: A data-driven method using partial least [32] J. A. Antonino-Daviu, M. Riera-Guasp, M. Pineda-Sanchez, and
squares,” IEEE Trans. Robot., vol. 32, no. 1, pp. 176–186, Feb. 2016. R. B. Perez, “A critical comparison between DWT and Hilbert–Huang-
[9] P. K. Kankar, S. C. Sharma, and S. P. Harsha, “Fault diagnosis of ball based methods for the diagnosis of rotor bar failures in induction
bearings using machine learning methods,” Expert Syst. Appl., vol. 38, machines,” IEEE Trans. Ind. Appl., vol. 45, no. 5, pp. 1794–1803,
no. 3, pp. 1876–1886, Mar. 2011. Sep./Oct. 2009.
[10] V. Sugumaran and K. I. Ramachandran, “Automatic rule learning using [33] D. P. Kingma and L. J. Ba, “Adam: A method for stochastic optimiza-
decision tree for fuzzy classifier in fault diagnosis of roller bearing,” tion,” in Proc. 3rd Int. Conf. Learn. Representations, San Diego, CA,
Mech. Syst. Signal Process., vol. 21, no. 5, pp. 2237–2247, Jul. 2007. USA, 2015, pp. 1–15.
[11] B. Li, M.-Y. Chow, Y. Tipsuwan, and J. C. Hung, “Neural-network- [34] G. Liu, Z. Yin, Y. Jia, and Y. Xie, “Passenger flow estimation based on
based motor rolling bearing fault diagnosis,” IEEE Trans. Ind. Electron., convolutional neural network in public transportation system,” Knowl.-
vol. 47, no. 5, pp. 1060–1069, Oct. 2000. Based Syst., vol. 123, pp. 102–115, May 2017.
[35] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “ImageNet:
[12] J. Lin and L. Qu, “Feature extraction based on morlet wavelet and its
A large-scale hierarchical image database,” in Proc. IEEE Conf. Comput.
application for mechanical fault diagnosis,” J. Sound Vibrat., vol. 234,
Vis. Pattern Recognit., Miami, FL, USA, Jun. 2009, pp. 248–255.
no. 1, pp. 135–148, Jun. 2000.
[36] S. Loffe and C. Szegedy, “Batch normalization: Accelerating deep
[13] P. Tamilselvan and P. F. Wang, “Failure diagnosis using deep belief
network training by reducing internal covariate shift,” in Proc. 32nd
learning based health state classification,” Rel. Eng., Syst. Saf., vol. 115,
Int. Conf. Mach. Learn., Lille, France, 2015, pp. 448–456.
pp. 124–135, Jul. 2013.
[37] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification
[14] H. Shao, H. Jiang, H. Zhao, and F. Wang, “A novel deep autoencoder
with deep convolutional neural networks,” in Proc. 25th Int. Conf. Neural
feature learning method for rotating machinery fault diagnosis,” Mech.
Inf. Process. Syst., Lake Tahoe, NV, USA, 2012, pp. 1097–1105.
Syst. Signal Process., vol. 95, pp. 187–204, Oct. 2017.
[38] G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and
[15] D. Zhang, W.-A. Zhang, L. Yu, and Q.-G. Wang, “Distributed fault R. R. Salakhutdinov. (2012). “Improving neural networks by
detection for a class of large-scale systems with multiple incomplete preventing co-adaptation of feature detectors.” [Online]. Available:
measurements,” J Franklin Inst., vol. 352, no. 9, pp. 3730–3749, https://arxiv.org/abs/1207.0580
Sep. 2015. [39] D. P. Kingma and M. Welling. (2013). “Auto-encoding variational
[16] D. Zhang, Q.-G. Wang, L. Yu, and H. Song, “Fuzzy-model-based fault Bayes.” [Online]. Available: https://arxiv.org/abs/1312.6114
detection for a class of nonlinear systems with networked measure- [40] M. B. Lazreg, M. Goodwin, and O.-C. Granmo, “Vector representation
ments,” IEEE Trans. Instrum. Meas., vol. 62, no. 12, pp. 3148–3159, of non-standard spellings using dynamic time warping and a denoising
Dec. 2013. autoencoder,” in Proc. IEEE Congr. Evol. Comput., San Sebastian,
[17] W. Sun, R. Zhao, R. Yan, S. Shao, and X. Chen, “Convolutional Spain, Jun. 2017, pp. 1444–1450.
discriminative feature learning for induction motor fault diagnosis,” [41] F. Lv, M. Han, and T. Qiu, “Remote sensing image classification based
IEEE Trans. Ind. Informat., vol. 13, no. 3, pp. 1350–1359, Jun. 2017. on ensemble extreme learning machine with stacked autoencoder,” IEEE
[18] M. S. R. Krishna and K. S. Ravi, “Fault diagnosis of induction motor Access, vol. 5, pp. 9021–9031, May 2017.
using motor current signature analysis,” in Proc. IEEE Int. Conf. Circuits [42] Z. Chen and W. Li, “Multisensor feature fusion for bearing fault
Power Comput. Tech., Nagercoil, India, Mar. 2013, pp. 180–186. diagnosis using sparse autoencoder and deep belief network,” IEEE
[19] C. T. Kowalski and T. Orlowska-Kowalska, “Neural networks application Trans. Instrum. Meas., vol. 66, no. 7, pp. 1693–1702, Jul. 2017.
for induction motor faults diagnosis,” Math. Comput. Simul., vol. 63,
nos. 3–5, pp. 435–448, Nov. 2003.
[20] M. A. S. K. Khan and M. A. Rahman, “Development and implementa-
tion of a novel fault diagnostic and protection technique for IPM motor
drives,” IEEE Trans. Ind. Electron., vol. 56, no. 1, pp. 85–92, Jan. 2009.
[21] E. G. Strangas, S. Aviyente, and S. S. H. Zaidi, “Time–frequency
analysis for efficient fault diagnosis and failure prognosis for interior
permanent-magnet AC motors,” IEEE Trans. Ind. Electron., vol. 55,
no. 12, pp. 4191–4199, Dec. 2008. I-Hsi Kao was born in Taipei, Taiwan, in 1994.
[22] T. Ince, S. Kiranyaz, L. Eren, M. Askar, and M. Gabbouj, “Real-time He received the B.S. degree in electrical engi-
motor fault detection by 1-D convolutional neural networks,” IEEE neering from National United University, Miaoli,
Trans. Ind. Electron., vol. 63, no. 11, pp. 7067–7075, Nov. 2016. Taiwan, in 2016. He is currently pursuing the
[23] V. T. Tran, B.-S. Yang, M.-S. Oh, and A. C. C. Tan, “Fault diagnosis M.S. degree from the Department of Mechanical
of induction motor based on decision trees and adaptive neuro-fuzzy and Electro-Mechanical Engineering, National Sun
inference,” Expert Syst. Appl., vol. 36, no. 2, pp. 1840–1849, 2009. Yat-sen University, Kaohsiung, Taiwan.
[24] J.-R. R. Ruiz, J. A. Rosero, A. G. Espinosa, and L. Romeral, “Detection His current research interests include machine fault
of demagnetization faults in permanent-magnet synchronous motors diagnosis, neural networks, deep learning, systems
under nonstationary conditions,” IEEE Trans. Magn., vol. 45, no. 7, engineering, and advanced driver assistance systems.
pp. 2961–2969, Jul. 2009.
Wei-Jen Wang was born in Miaoli, Taiwan, in 1993. Jau-Woei Perng (S’02–M’04) was born in Hsinchu,
He received the B.S. and M.S. degrees from the Taiwan, in 1973. He received the B.S. and
Department of Mechanical and Electro-Mechanical M.S. degrees in electrical engineering from Yuan
Engineering, National Sun Yat-sen University, Kaoh- Ze University, Chungli, Taiwan, in 1995 and 1997,
siung, Taiwan, in 2016. respectively, and the Ph.D. degree in electrical
He has been with the Automotive Research and and control engineering from National Chiao Tung
Testing Center, Changhua, Taiwan, since 2016. University (NCTU), Hsinchu, in 2003.
His current research interests include machine fault From 2004 to 2008, he was a Research Assistant
diagnosis, machine learning, systems engineering, Professor with the Department of Electrical and
and advanced driver assistance systems. Control Engineering, NCTU. He is currently an
Associate Professor with the Department of Mechan-
ical and Electromechanical Engineering, National Sun Yat-sen University,
Kaohsiung, Taiwan. His current research interests include robust control,
nonlinear control, fuzzy logic control, neural networks, systems engineering,
and intelligent vehicle control.
Yi-Horng Lai received the B.S. degree from
Naval Academy, Kaohsiung, Taiwan, in 1992,
the M.S. degree in weapon system engineering from
the Chung Cheng Institute of Technology, Taoyuan,
Taiwan, in 2001, and the Ph.D. degree in mechanical
and electro-mechanical engineering from National
Sun Yat-sen University, Kaohsiung.
He has been with XWIN Prognostic Technology,
Kaohsiung, since 2018. His current research inter-
ests include nonlinear control, robust control, and
biosignal processing.

Analysis of Permanent Magnet Synchronous Motor

Hochgeladen von

Dokumentinformationen

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Analysis of Permanent Magnet Synchronous Motor

Hochgeladen von

Copyright:

Verfügbare Formate

310 IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 68, NO.

Analysis of Permanent Magnet Synchronous Motor

M ECHANICAL fault diagnosis is an important issue

the five different states of the motors. Finally, Section VI

II. PMSM FAULTS

Fig. 4. Specification and size of the bearing.

C. Bearing Fault in PMSM

methods can be roughly divided into data collection and data

D. Differences Between the Five Signals

on the frequency domain and is very similar to the Fourier

where x, ψa,τ is the wavelet packet coefficient, a is the scale

Fig. 9. Process of the DWT.

can adjust the appropriate window function in the frequency

Here, x i:i+N−1 represents a window of length N for a sequen-

Fig. 12. Flowchart of the CNN design process.

Fig. 14. Motor fault diagnosis system hardware architecture.

The Adam method [29] with the cross-entropy error func-

from −31 to 31 A with a 100-kHz bandwidth. When the TABLE III

Fig. 15. Learning curves.

TABLE IV desktop PC would be capable of running the testing diagnosis

Das könnte Ihnen auch gefallen