Beruflich Dokumente
Kultur Dokumente
>
>
How does an to the hidden layers (for updating the hidden layer weights). In
ANN achieve essence, the numerical values of the weights (strengths of synapses;
‘what it see the box and Figure 2) of the network are updated using some rule
achieves’? In of the type:
general, an ANN
structure is new weights = old (previous) weights + learning rate times gradient
trained with of the cost function (with respect to the weights).
known samples
of data. For Once the ANN is trained, the ‘learned information’ is captured in
example, if a these weights in a condensed (and yet complex) way. In general,
particular pattern these weights have no direct physical significance or meaning
is to be related to the process or phenomenon which is described or mod-
recognised, then elled by the ANN. However, the overall function of the ANN is
the ANN is first relevant to a given task/application.
trained with the
known pattern/ Although the basic processing element is simple, when several of
information (in them are connected (and also needed in many applications), the size
the form of digital of an ANN could be very large in terms of the number of neurons
signals). Then the (hundreds) in a given layer. A maximum of one hidden layer is often
ANN is ready to sufficient for many tasks, to attain the required accuracy. The larger
recognise a the ANN, the more time it takes to train the network. Despite the
similar pattern simplicity of the basic units, the mathematical theory to study and
when it is analyse the various structures and schemes based on ANNs and their
presented to (non-linear) behaviour can be very involved.
the network.
Uses of ANNs
The ANNs are used for pattern recognition, image processing, signal
processing/prediction, adaptive control and other related prob-
lems. Due to the use of non-linear activation functions, the ANNs
are highly suitable for very accurate mapping (modelling) of non-
linear systems based on the input/output data. This non-linear
activation function is also useful in reducing the adverse effect of
outliers/spikes in data, and in improving the accuracy of estimates
obtained by RNN (see the box).
Some of the uses of ANNs (FFNNs and RNNs) are briefly men-
tioned here:
RNNs
l Information storage/recall - The recall is a process of decoding RNNs are specially suit-
the previously stored information by the network. able for estimation of pa-
l Pattern recognition/classification - to recognise a specific pat- rameters of dynamic sys-
tern from a cluster of data/ to classify sets of data/information. tems in an explicit way.
l Non-linear mapping between high-dimensional spaces (math- The RNN-based schemes
ematical modelling of non-linear behaviour of systems). for parameter estimation
l Time-series prediction (like weather forecasting), modelling of have been shown (by the
non-linear aerodynamic phenomena, detection of faults/fail- first author) to be the
ures in systems like power plants and aircraft sensors. generalisation of some of
the conventional param-
Some of the specific applications (based on open literature)/possi- eter estimation methods.
bilities are: These methods are useful
for estimation of param-
l Assisting IC-CIM (computer integrated manufacturing of inte- eters of dynamic systems
grated circuits). in explicit ways, e.g. math-
l Japanese-OCR (optical character recognition). ematical modelling of air-
l Financial forecasting (the Neuro-Forecasting Centre in craft dynamics. When
London). realised in hardware, the
l Process control (Fujitsu Ltd., Kawasaki, Japan). RNN architectures are
l Analysis of medical tests. such that they are natu-
l Target tracking and recognition (multi-sensor data fusion). rally adapted to obtain
fast solutions to param-
In the field of artificial intelligence, we have heard of expert systems eter estimation problems
(ES). In essence an ES is a software-based system that describes the (depending on the speed
behaviour of (human) experts in some field by capturing/collecting of the basic processing
the knowledge in the form of rules and symbols. If fuzzy or noisy data hardware elements).
is given to an ES, it might give wrong answers. Since the ANN-based RNNs are thus specially
system can be trained with some fuzzy or noisy data, the combina- suited for arriving at non-
tion of ES and ANN (hybrid) might be very useful to devise linear state space models
powerful systems called expert networks (ENs). of dynamic systems.
Concluding Remarks
The days of tive filters/circuits (MAPAFS). They are more like electrical/elec-
neural network- tronic (EE) circuits with some useful adaptive properties. In fact
based (parallel) ANNs can be realised using EE hardware components. With very
computers may large scale integration (VLSI) technology, it might be feasible to
not be too far off. fabricate microelectronic networks of high complexity for solving
Thus brain to many optimisation and control problems using artificial neural
bread networks. The days of neural network- based (parallel) computers
(industrial may not be too far off. Thus brain to bread (-industrial applications)
applications) neural computers will gradually become a reality. As of now the field
neural computers is fascinating, sometimes intriguing, and offers great challenges and
will gradually promises to scientists and engineers.
become
a reality.
Suggested Reading
The origin of ideas ... Ideas come when stepping onto a bus (Poincaré),
attending the theatre (Wiener), walking up a mountain (Littlewood), sitting at the
shore (Aleksandrov), or walking in the rain (Littlewood), but only after a long
struggle of intensive work. (from The Mathematical Intelligencer 17(2), 1995).