Sie sind auf Seite 1von 201

IFAC WORKSHOP SERIES

Editor-in-Chief
Pieter Eykhoff, University of Technology,
NL-5600 MB Eindhoven, The Netherlands

HALANG & RAMAMRITHAM: Real Time Programming (1992, No.l)


JAMSA JOUNELA & NIEMI: Expert Systems in Mineral and Metal Processing (7992, No2)
KOPETZ & RODD: Distributed Computer Control Systems (DCCS'91) (1992, No.3)
FLEMING & JONES: Algorithms and Architectures for Real-Time Control (1992, No A)
RODD & SUSKI: Artificial Intelligence in Real Time Control (7992, No.5)

HASHIMOTO & DAY: Mathematical and Control Applications in Agriculture and Horticulture
(1991, No. 1)

CHESTNUT et ah International Conflict Resolution Using System Engineering (7990, No.l)


SIGUERDIDJANE & BERNHARD: Control Applications of Nonlinear Programming and Optimization
(1990, No.2)
VILLA & MURARI: Decisional Structures in Automated Manufacturing (7990, No.3)
RODD et ah Artificial Intelligence in Real Time Control (7990, No.4)
MOTUS & NARITA : Distributed Computer Control Systems (DCCS'89) (7990, No.5)
KNUTH & RODD: Distributed Databases in Real Time Control (7990, No.6)
LOTOTSKY: Evaluation of Adaptive Control Strategies in Industrial Applications (7990, No.7)
O'SHIMA & VAN RUN: Production Control in the Process Industry (7990, No.8)
MOWLE & ELZER: Experience with the Management of Software Products (7990, No.9)

Other IFAC Publications

AUTOMATICA
the journal of IFAC, the International Federation of Automatic Control
Editor-in-Chief: George Axelby, 211 Coronet Drive, North Linthicum,
Maryland 21090, USA

IFAC SYMPOSIA SERIES


Editor-in-Chief: Janos Gertler, George Mason University, School of Information Technology and Engineering,
Fairfax, Virginia 22030-4444, USA

Full list of IFAC Publications appears at the end of this volume

NOTICE TO READERS

If your library is not already a standing/continuation order customer or subscriber to this series, may we recommend that you place a
standing/continuation or subscription order to receive immediately upon publication all new volumes. Should you find that these volumes
no longer serve your needs your order can be cancelled at any time without notice.

Copies of all previously published volumes are available. A fully descriptive catalogue will be gladly sent on request.
EXPERT SYSTEMS
IN MINERAL AND METAL
PROCESSING

Proceedings of the IF AC Workshop, Espoo, Finland,


26 - 28 August 1991

Edited by

S.-L. JAMSA-JOUNELA
Outomec Oy (Ltd), Finland

and

A J . NIEMI
Helsinki University of Technology, Finland

Published for the


INTERNATIONAL FEDERATION OF AUTOMATIC CONTROL

by

PERGAMON PRESS
OXFORD NEW YORK SEOUL TOKYO
UK Pergamon Press Ltd, Headington Hill Hall, Oxford 0X3 OBW, England

USA Pergamon Press, Inc., 660 White Plains Road, Tarrytown, New York 10591-5153, USA

KOREA Pergamon Press Korea, KPO Box 315, Seoul 110-603, Korea

JAPAN Pergamon Press Japan, Tsunashima Building Annex, 3-2-12 Yushima, Bunkyo-ku, Tokyo 113, Japan

Copyright 1992IFAC

All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system or
transmitted in any form or by any means: electronic, electrostatic, magnetic tape, mechanical,
photocopying, recording or otherwise, without permission in writing from the copyright holders.

First edition 1992

Library of Congress Cataloging in Publication Data

Expert systems in mineral and metal processing: proceedings of the


Workshop of the International Federation of Automatic Control,
Helsinki University of Technology, Espoo, Finland, 26-28 August 1991/
editors, Sirkka-Liisa Jms-Jounela, Antti J. Niemi. - 1st ed.
p. cm. (IFAC workshop series; 1992, no.2)
Includes index.
1. Mining engineeringAutomationCongresses. 2. MetallurgyAutomation
Congresses. 3. Expert systems (Computer science)Congresses.
I. Jms-Jounela, Sirkka-Iiisa. . Niemi, Antti J. . International
Federation of Automatic Control. IV. Series.
TN276.E861992 669dc20 92-9318

British Library Cataloguing in Publication Data

A catalogue record for this book is available from the British Library

ISBN 0-08-041704-3

These proceedings were reproduced by means of the photo-offset process using the manuscripts supplied
by the authors of the different papers. The manuscripts have been typed using different typewriters and
typefaces. The lay-out, figures and tables of some papers did not agree completely with the standard
requirements: consequently the reproduction does not display complete uniformity. To ensure rapid
publication this discrepancy could not be changed: nor could the English be checked completely.
Therefore, the readers are asked to excuse any deficiencies of this publication which may be due to the
above mentioned reasons.

The Editors

Printed in Great Britain by BPCC Wheatons Ltd, Exeter


IFAC WORKSHOP ON EXPERT SYSTEMS IN
MINERAL AND METAL PROCESSING

Sponsored by
International Federation of Automatic Control (IFAC) Committees on
- Applications (APCOM)
- Mining, Mineral and Metal Working Group

Organized by
The Finnish Society of Automation

International Programme Committee


A. Niemi (SF) (Chairman)
M. Aubrun (F)
Z. Chen (PRC)
H. Cohen (UK)
B. Cronhjort (S)
P. Duncan (SA)
M. Hadjinski (BG)
J. Heidepriem (D)
A. Herbst (USA)
H. Koivo (SF)
J. Meech (CDN)
J. Paiuk (RA)
G. Sommer (SA)
P. Uronen (SF)
Y. Yoshitani (J)

National Organizing Committee


E. Saarelainen (Chairman)
S.-L. Jms-Jounela (Editor)
L. Vahtera (Secretary)
S.Junno
M. Kongas
P. Lautala
L. Yliniemi
PREFACE

Use of experts systems for monitoring and control is increasing rapidly in the mineral and metal
industries. A number of related papers has been presented during recent years, separately in var-
ious meetings and journals. In order to obtain a comprehensive review of the newest results in
this field, the developers and users of such applications were invited to exchange information
and experiences at an IFAC Workshop. This event, the IFAC Workshop on Experts Systems in
Mineral and Metal Processing, took place at the Helsinki University of Technology, Espoo, Fin-
land, August 26 - 28, 1991.

This Proceedings Volume contains a selection of papers presented at the Workshop. Before this
publication, they have passed a repeated evaluation and screening process. The papers were first
chosen, on the basis of abstracts submitted, by the International Programme Committee for oral
and written presentation at the Workshop. At the final selection of the papers for Proceedings,
the Editor followed the recommendations of the same Committee which were based on reviews
of the full texts of papers by Committee members. A number of papers included here were re-
vised after the Workshop by their authors as required by the Committee.

The Workshop was one of a relatively narrow scope. According to a basic requirement, each
paper had to include contents on both expert systems, neural networks or fuzzy control, and on
mineral and metal processing. This double requirement was followed with almost no excep-
tions, and i.a. for this reason a number of good contributions had to be left out. Despite of this,
39 papers were accepted and duly presented which shows that the theme of the Workshop was
widely recognized as timely and interesting. This Volume contains a condensed subset of 29 pa-
pers.

During the three days of Workshop, the specialists from 23 countries communicated with each
other on their work and interests presenting papers and carrying out intensive discussions.
While the new ties of friendship which were formed and the earlier ones which were confirmed
cannot be reproduced by a written text, it is the aim of the present Volume to stand for the scien-
tific and technological content of the Workshop and to transmit information of newest develop-
ments of expert systems in mineral and metal processing to the professionals all over the world.
At the same time, it provides a permanent record of the level of achievement within a topical
field of the present day automatic control.

Sirkka-Liisa Jms-Jounela Antti J. Niemi


OPENING ADDRESS
Erkki Saarelainen
Chairman of the National Organising Committee
Chairman of the Finnish Society of Automation

Ladies and Gentlemen,


On behalf of the National Organising Committee and the Finnish Society of Automation I wish
you all welcome to the first IFAC Workshop on Expert Systems in Mineral and Metal Process-
ing. It is a great pleasure for us to host this event sponsored by the IFAC Applications Commit-
tee and its Working Group on Automation in Mining, Mineral and Metal Processing.

The President of IFAC has nominated Mr. Uolevi A. Luoto, who himself is Advisor and Past
President of IFAC, to represent the Federation and the sponsoring bodies here and we feel very
honoured by his presence. Mr. Luoto will address us at another occasion during the Work-
shop.

Many of you have made a long trip in order to be able to attend the technical sessions and possi-
bly one of the courses connected with the Workshop. I hope that we will be able to meet your
expectations by creating an open and friendly atmosphere for our technical sessions and by pro-
moting new unofficial contacts during our social events.

The interest raised by the Workshop has been an agreeable surprise for us. Although I am not a
specialist of expert systems but more one of automation for metallurgical industry in general, I
may state that its obvious reason is the advance taken place in industrial applications. In spite of
the declining trends we have succeeded in obtaining more than 90 registered delegates from 22
countries. The total amount of papers is about 40 and many of them report on practical work
and results. I leave a closer evaluation of advances in new methods and theory to the real ex-
perts in the auditorium.

In this context, I would like to express our warmest thanks to the authors and chairmen as well
as to the International Programme Committee chaired by Professor Antti Niemi. I also would
like to thank members of the National Organising Committee for their excellent contributions to
the Workshop. Finnish industry and universities have supported the event in several ways
which is highly appreciated.

Special thanks are directed to the personnel of the Finnish Automation Support Ltd for the prac-
tical work and excellent arrangements. We would also like to thank the Finnish Ministry of Edu-
cation for their support and the Helsinki University of Technology for a suitable Workshop site.

Xlll
OPENING ADDRESS
Paavo Uronen
Vice-rector of the Helsinki University of Technology
Ladies and Gentlemen,

On behalf of Helsinki University of Technology I have the pleasure of welcoming you to this
IFAC Workshop. In this connection, I would also like to introduce to you our University and its
connections with IFAC.

In 1849 the Helsinki Technical School was founded "to offer secondary school education for
youngsters who wish to enter industrial trades". The school was then developed into a Poly-
technical Institute in 1879 and received university status in 1908. The University started the ac-
tivities in downtown Helsinki and was moved in the sixties to this campus designed by the fa-
mous Finnish architect, Professor Alvar Aalto.

Helsinki University of Technology (HUT) is the oldest and biggest technical university in Fin-
land having today about 10,000 students, 150 professors and associate professors and about
1800 other staff members including project personnel.

The University is divided into six faculties according to the corresponding fields of study and
research. They are the Faculties of Information Science, Electrical Engineering, Mechanical En-
gineering, Process Engineering and Material Science, Civil Engineering and Surveying, and the
Faculty of Architecture. In addition to the faculties we have several separate units serving them
or specialising in certain research topics or special activity. These units include: Library, Com-
puting Centre, Language Centre, Centre for Continuing Education, Centre for Urban and Re-
gional Studies, Radio Research Station, Low Temperature Laboratory and Arctic Offshore Re-
search Centre.

In many fields of research, special institutes of several laboratories have been formed and these
institutes which we have now altogether, carry out interlaboratory and interdisciplinary research
in close co-operation with industrial and other outside parties getting funding mainly from other
than university sources. Also many international co-operative projects, for example, in connec-
tion with the European programs, are going on. The Institute of Industrial Automation in one of
the biggest and most active institutes concentrating so far mainly in manufacturing automation.

In the field of automation HUT has been active and creative since late 80 's. Today automatic
control and system engineering in their various forms are taught and studied in five laboratories,
in addition to the stated institute. Their national and international activities are widespread.

Also IFAC connections of our specialists are active and several IFAC events have been arranged
here earlier: The Conference on Digital Control Applications to Process Control 1971, MMM
Symposium 1983, and the Symposium on Man-Machine Systems 1988, as well as the Work-
shops on Modelling and Control of Biotechnical Processes 1982, Vision Control 1985, Differ-
ential Games 1990,. Also in organising the IFAC World Congress 1978 the role of HUT was
central.

The automation technology as applied to Mining, Metallurgy and Mineral Processing has been
one of the central areas of automation research at HUT, especially in the Laboratory of Control
Engineering led by Professor Niemi. In close co-operation with the Finnish industry remarkable
results and products in this field of automation have been reached.

XV
The development of research and education in technical universities includes some general de-
mands and trends: dynamism, intemationalisation, interdisciplinarity and more effective use of
information technology. The topic of this workshop includes ingredients from all of them, espe-
cially from effective use of information technology. The theme of expert systems or intelligent
control is very actual and popular among control scientists and users today and many interesting
results and further directions of development can be seen. Therefore I look forward to an inter-
esting workshop in Otaniemi and wish you all success.

XVI
CLOSING ADDRESS
Antti J. Niemi
Chairman of the International Programme Committee

Ladies and Gentlemen,

We are approaching the end of our work of three days. I hope you will remember this Work-
shop, not as just another technical event, but as one of high value to yourself, to your firm or to
your University. I also hope that you will remember Espoo, Helsinki and Finland for the new
associates met and the new friendships made.

Although most of you attend the first time an IFAC event in Finland, many of you have partici-
pated the 7th IFAC World Congress in 1978 or the IFAC Symposium on Automation in Mining,
Mineral and Metal Processing in 1983 held here. While the former one covered the whole scope
of control engineering and collected a high number of participants from all over the world, the
latter one was more closely connected with the content of this Workshop. Some of you may al-
so have attended one or more of the other IFAC events organized in Finland since the first one
in 1971.

Your Finnish colleagues joined the development of automation in mineral and metal industries at
an early stage. As the first steps were taken in the early 1960's, these industries were about ten
years late with regard to automation in e.g. chemical, pulp and paper and power generation in-
dustries, but the later advance has been fast and the new methods are today introduced parallelly
to the other branches of process industries. Due to the successful implementation of automation
in the Finnish industries, both the management and the workers and even the common people
take today the automation as a natural step of the technological development, and adverse opin-
ions are almost non-existent in our country. I wonder, if the acceptance of automation is equally
favourable elsewhere, and would consider an international comparison in this respect extremely
timely and interesting.

I have had since those early days the opportunity of participating the development of control of
mineral processes and the compilation of the required models, and to some amount the corre-
sponding work with metallurgical processes. Especially the dynamic models have become
much more detailed and sophisticated during the time passed. Their utilization has already for a
long time been limited by the absence of measuring instruments for many key variables, al-
though considerable developments have taken place in this respect also.
Therefore the control is, for a considerable part, still today based on observations and conclu-
sions of the human supervisor. E.g. the quality and colour of the froth in flotation cells and the
colour of the flame in metallurgical converters can be considered as quantities which are difficult
to measure with physical instruments. The expert systems imply potential to the application of
such variables in control by means of manual or automatic inputs. Another example could be
the flotation rate coefficient which cannot be observed by the human senses at all and which is
difficult to measure, not only by real time measurements but also by analysis of samples in labo-
ratory. By means of appropriate models, one is in a position of deriving rules through which to
base the control on a measurable subset of the quantities on which that factor depends. The ex-
pert systems therefore may add the possibilities of using both the human experience and theo-
retical knowledge for a better control.

XV11
The present Workshop is one of a relatively narrow scope. According to its basic requirement,
each paper had to include contents on both expert systems, neural networks or fuzzy control,
and on mineral or metal processing. This double requirement was followed with almost no ex-
ceptions, and therefore a number of good contributions had to be left out. Despite of this, 39
papers were accepted and duly presented which shows that the theme of the Workshop was
widely recognized as timely and interesting. The papers came from 16 countries which indicates
a relatively even, international distribution. While appreciating highly the work done in each of
these countries, I would especially state the extensive Japanese contribution which proves the
high level and penetration of expert systems in the iron and steel industry of that country.
The presented papers originate fairly evenly from industrial enterprises, universities and re-
search institutes. Especially the industries have provided the Workshop with a good and inter-
nationally representative number of participants, in addition to their speakers. It appears that the
Workshop has succeeded in its central task: the scientists and developers have presented their
new results, and potential exploiters have familiarized themselves with them, in order to subse-
quently apply them in the industry of their branch and their country. The International Pro-
gramme Committee and the National Organizing Committee have recorded this fact with great
satisfaction.
Since the present Workshop can be considered highly successful, the question on continuity of
international information exchange in the field of the Workshop has shown up and has been
brought to my knowledge. Certainly new information and experience will accumulate during
the next years as well. Therefore this question will be discussed by the International Pro-
gramme Committee of the present Workshop this afternoon, and if you feel that a new event of
the same or other type, at a suitable place and time, is needed or not needed, you are asked to
bring your opinion to the attention of a member of the IPC, preferably immediately after the pre-
sent Session; you will find their names on thefirstpages of your folder. The IPC will then de-
cide on a possible further recommendation to the appropriate body of the Federation.
The organizers tested two new features in connection with this Workshop. Thus the abstracts of
preliminarily accepted papers were mailed in advance to all registered participants, and this no
doubt inspired the discussions carried out during the event. As another feature, the IPC will
meet still today in order to preliminarily choose the papers to be included in the Proceedings.
The Committee may then require that individual speakers are asked to complete their papers dis-
tributed here as preprints or to make changes in them and I ask for their understanding and co-
operation.
While the principal aim of a Workshop is the exchange of scientific and technological informa-
tion, organization and much of voluntary effort is required, before it may materialize. I would
like to thank most cordially all authors who have presented papers including the substitute
speakers who have been needed in very few cases only to take up the task as charged by the au-
thor. I also thank all those who have participated in the sessions and discussions, all members
of the International Programme Committee and National Organizing Committee, the Plenary
Speakers, the Chairmen of Sessions, the Members of the Secretariat and, last but not least, the
International Federation of Automatic Control, its Applications Committee and its Working
Group on Automation in Mining, Mineral and Metal Processing, for their sponsorship. I trust
this has been a rewarding experience to all of us.
During these days, everyone of us has adopted new knowledge and new ideas which we may
put to work, after we have returned to our regular working posts. I thank you all for your par-
ticipation and contribution to the success of this Workshop, and wish you a safe and happy re-
turn home.
It remains to me to now declare the Workshop closed.

XVlll
Copyright BFAC Expert Systems in Mineral and GRINDING AND SEPARATION
Metal Processing, Espoo, Finland, 1991

AN EXPERT SYSTEM FOR CONTROL OF A SAG/BALL


MILL CIRCUIT
S.H. Bradford

Department of Mining Engineering, University of Nottingham, University Park, Nottingham, UK

Abstract. Within the remote mining areas of Australia the need to provide effective expert help has
increased. This is due to high labour turnover, increasing economic pressures and higher levels of process
technology used. As a counter to this a computer system to provide effective expert help to operators of
semi-autogenous (SAG) grinding mills has been developed. This support system is called HelpSAG. It was
initially developed for a SAG/ball mill grinding circuit on the gold plant at Paddy's Flat in Western
Australia.

The aim of the system is to provide a quick reference manual plus immediate on-line advisory help for the
shift operators. The on-line help is in the form of an interactive trouble-shooting procedure and a mill
power model for throughput optimisation.

This paper discusses some of the problems encountered with the initial system and its further development.
The solutions produced include the development of two separate systems tailored individually for the
separate requirements of shift operators and plant metallurgists. Data is now taken directly from the plant
data management computer setup into the Trouble-Shoot, instead of being manually entered as in the
original system. This direct access to plant data has allowed work to be carried out using the expert system
for closed loop control of the grinding circuit.

Keywords. Artificial intelligence; automatic control; computer control; control engineering applications of
computers; expert systems; hierarchically intelligent control; microcomputer based control; process control.

INTRODUCTION b) Being cheaper over an extended period than the


equivalent human expert.
Throughout industry the increasing economic pressures due to c) Producing consistent answers given the same or
reducing profit margins has brought about the need to similar information.
minimise both capital and operating costs and improve d) Being able to work in highly adverse conditions.
equipment utilisation. To carry this out, more and better e) Speeding up the passing of knowledge and
information is required. The developments in electronics has understanding to new personnel.
provided this, but in doing so has increased the complexity of f) Reducing the time input of professional staff when
problems. This then requires increasingly skilled knowledge training new staff and solving circuit problems.
and expertise on the part of the user for correct interpretation g) Giving the plant staff a better understanding of the
and application. However, the availability of this expertise is process and of the system as a whole, with better
limited not only in the new areas of technology but even in the morale and confidence and all that flows from this.
well established areas.
The mining industry has not escaped these increased economic
Expert systems (Forsyth, 1989) can provide a potential pressures and technological developments, but on top of this is
solution to these problems. Since, their introduction in the there are the problems associated with obtaining and retaining
1970's their application has been steady, but it was not until the required expertise on the increasing number of remote
the 1980's with the reduction in the cost of computing that they operations.
were more widely applied. They are described as computer
programs that try to simulate the reasoning component of Within mineral processing these problems and would-be
human expertise in a narrowly focused domain and can also solutions have often had a detrimental effect on plant revenue
provide an explanation of how a practical decision or and efficiency and thus decrease profit by:-
conclusion is reached.
a) Operation of the plant outside its optimum
The principles behind expert systems provide a method by efficiency envelope. This is usually caused by a
which practical expertise and knowledge can be permanently lack of process knowledge and understanding in
captured and utilised to analyse specific problems and to give operating staff. This is compounded by the high
detailed solutions. A carefully developed expert system can in turnover rates and the continual demands on the
principle give other benefits, such as:- professional staff for the training of new personnel.
b) Slow transfer of process knowledge from existing
a) The provision of expert help 24 hours a day, in to new personnel.
multiple locations simultaneously.

1
c) Loss of knowhow with the regular loss of trained for corrective action beyond their immediate
and experienced personnel. responsibilities.

The loss of experienced staff can have serious repercussions as Another criterion for the system was to provide a simple and
outlined above. On most plants training tends to be on the job friendly interface to the plant operators and metallurgists.
and misunderstandings and misinformation about the process
tend to persist from one generation of operators to another.
This is compounded because the detailed knowledge required DEVELOPMENT TOOLS
to run a plant is often buried in user-unfriendly and often
almost unreadable manuals. Better training and the systematic The system was developed using a deterministic expert system
collection of accumulated staff knowledge can overcome some (Menne, 1989) shell called electroMAGNETIC Book (eMB),
if not all of these problems. This, however, requires a which was authored and commissioned by David Menne of
substantial initial investment in both cost and the time of Mennetech in 1985 using Clipper, a Dbase compiler.
supervisory professional staff. For reasons well known to those
with operating experience these tend not to be available on Unlike conventional expert systems, which use inference
most plants. engines (Forsyth, 1989) and knowledge bases, an eMB puts the
control of the structure in the hands of the expert rather than in
So, even if only a few of the advantages that expert systems the program software. This done by allowing the expert to
can give accrue, the building of an expert system for a mineral explicitly define the logic links between pages, which hold his
processing plant operation should be seriously considered. experience and knowledge. These pages also provide an
interactive interface for the finished system by the use of input,
choice and output pages.
APPLICATION
The idea behind the system was to allow an expert to develop a
The rapid expansion of gold mining in Australia during the system without any real knowledge of programming and to
1980's brought an increased demand for technical personnel. keep the operation of the finished run-time program as simple
The increased economic pressure has brought about the as possible. The keys required to operate a finished system are
introduction of newer and more complex process units, for the numeric keys, 'Home', W , 'PgDn' and 'PgUp'.
example Autogenous Grinding (AG) and Semi-Autogenous
Grinding (SAG) mills, in an attempt to reduce capital and A management structure producing program was also used
operating costs (Siddall, 1989). The operation of these process particularly in the development of the Trouble-Shoot to show
units is far more subtle than with steel-on-steel (rod, ball mill) the structure and logic used.
comminution systems and they are only able to deliver reduced
operating costs if they are run at their optimum.
SYSTEM DEVELOPMENT
As a result of this Whim Creek Consolidated Ltd, now part of
Dominion Mining Ltd's Meekathara Gold Operations, decided The system was developed in three separate parts as defined in
to co-fund the development of an expert system for a SAG the application section. Figure 1 shows the final structure of
mill circuit with Orway Mineral Consultants and Mennetech. the system. It was initially implemented for a PC XT with
They also allowed on-site prototyping at their new Paddy's Flat 640k, a mono graphics monitor and a hard disk. This was
process plant. The plant processes gold ores from a wide found to be slow and the system was upgraded to a PC AT
variety of sources at rates between 85 and 125 tons/hour with 640k, an EGA colour monitor and a hard disk.
through a two stage SAG/ball mill grinding circuit. All the
plant information monitoring and control is carried out through
a Micromax Process Management Centre (PMC). The PMC MAIN MENU

allows computer data screens to be defined showing


graphically trends of process variables in the grinding or
carbon-in-leach (CIL) circuit or any other combination of
MICROMAX
AUTOMATION
f FEED MATERIAL
[CHARACTERISTICS]
c OPERATOR
NOTES
MILLING
CIRCUIT

available data. [GENERAL MICROMAX] OPERATING


TKEY DESCRIPTIONS I PHILOSOPHY J

The agreed requirements of the initial version of the expert OPERATION LOGSHEET GENERAL
THEORY
ACTION KEYS SPECIFICS
system were the following:-
DATA ENTRY MILL ] f CIRCUIT
KEYS LUBRICATION} I DESCRIPTION J
a) Quick Reference Training Manual - This
included an explanation of maintenance CURSOR
KEYS
GENERAL JOB]
. DESCRIPTION;
( PLANT "j
[ O P T I M I S A T I]O N
procedures, plant checks and the acceptable values
DISPLAY ACCESS
of operating variables, instructions for use of the KEYS
PMC, an explanation of the theory behind MICROMAX KEY
AG/SAG milling, processing definitions and plant DESCRIPTIONS

control strategies.
b) Plant Optimisation Procedure - Power modelling Fig. 1. HelpSAG System Structure.
program for optimising plant throughput based on
data such as feed material characteristics, mill
speed, ball charge and liner wear. The initial theory and knowledge base for the SAG mill
c) Trouble-Shoot - To give solutions to problems operation was provided by Bernard Siddall of Orway Mineral
based on data entered by the operator from the Consultants in Perth, WA. All the plant and operator
PMC and other operating data-gathering systems, information was provided by Ivan Hunter, Senior Metallurgist
to give the reason for the occurrence of a problem at the Dominion Mining Ltd's Meekatharra Gold Operations.
and either give instructions on how to solve it
and/or tell shift operators to contact shift foreman A first prototype version was installed at Paddy's Flat in
September 1989. The time taken to develop this was about

2
four months. Two full time computer personnel were involved, loading, slurry SG and liner wear.
the author and another graduate metallurgist with a computing
background. Calculations could be carried out on either of the mills
installed in the plant but are specific to the Paddy's Flat SAG
Although another month or so could easily have been spent in and ball mill combination. The values calculated can be used
further refining the system a decision was taken to commission as the set points for plant control and the Trouble-Shoot
the system as soon as it was operational. The reasons for this operation. In addition to being able to do specific power and
was the feeling that testing and development of the system weight calculations, the software can generate graphs of mill
would be best achieved 'on-line', rather than by submitting the speed, ball charge and mill loading vs power draw. These are
system to off-line testing. illustrated in Fig. 2, and can be used to help visualise the best
operating point.
This initial test period proved to be useful not only in
debugging and refining the system but also in establishing
operator acceptance of the system. On commissioning, along MILL POWER VERSUS MILL LOAD FOR 5% BALL CHARGE

with spending time on shift with the operators explaining the 750-f

system and listening to suggestions, a book was left beside the 700-

computer keyboard to log any faults found in the system, to


note areas not adequately covered and to list any other
I 650-

600-
z
suggestions. The feedback, where possible, was introduced o 550-
into the system on site. This significantly increased the g
%
EC 5 0 0 -
acceptance by the operators, because when they were able to 450-
find faults, they were subsequently corrected and their o
a, 4 0 0 -
suggestions were in general implemented.
350-

300-
About a week was needed to commission the system, iron out
250-
problems and add feedback from plant personnel into the
0 10 20 30 40 50 60
system, which was mainly on the Trouble-Shoot. MILL LOAD % <Press Esc or PgUp to exit>

To secure the expert knowledge provided, an electronic Fig. 2. SAG Mill Power Model
security system was incorporated in the site version of the
software. It was designed to restrict the use of the system to a
single machine at any one time. Trouble-Shoot

This section provides an interactive Trouble-Shoot procedure


Quick Reference Training Manual where current process data and trend inputs are asked for by
the system, before diagnosis and a recommended solution is
The Training Manual is broken down into five main sections generated.
and Fig. 1 shows their position and relationships in the system
as a whole. These five sections are:- Figure 3 shows the broad outline of the Trouble-Shoot
structure. The various Trouble-Shoot procedures are chosen by
a) Micromax Automation - General description of the user and the operation of the selected procedure is in the
the PMC, plus key definition and use. form of an interrogation. The idea behind this facility is not
b) Feed Material Characteristics - Mineralogy, only to give answers to problems, but to give the operators a
Bond Work Index (WI) figures, other grinding better understanding of the mechanics of the process and to
parameters and general information on processing confirm for them that the circuit is running correctly. This is
problems associated with the various ore types. carried out where possible by explaining what is happening in
c) Operator Notes - Operator information regarding the mill and how a problem might be avoided, along with the
the primary objectives of the mill operator in terms offered solution.
of mill operating philosophy and maintenance
requirements. The maintenance includes The first stage in using Trouble-Shoot is to input all relevant
description, position, method of measurement and plant information, which includes PMC data and current ore
likely values of hourly checks, plus details of characteristics details. Following this the variable set points are
lubrication routines and daily equipment checks. defined by the user (usually pre-defined by the metallurgists
d) Circuit Description - A brief description of the from the Plant Optimisation Procedure) eg feed rate, ball load,
grinding circuit, along with a graphical power etc. These define the acceptable limits for the system
representation of the circuit. operation.
e) AG/SAG Mill Theory - Text and graphics
Throughout the Trouble-Shoot data input sections emphasis is
explaining grinding definitions, the fundamentals
made on the importance of accurate data input and the
of AG and SAG mills, mill control strategies,
detrimental effect of significant inaccuracies on system
techniques for handling problems with over-
performance. Like a human expert, Trouble-Shoot can only
viscous pulps and effects of variation of mill
base its conclusions on the information it is given. These
speed, ball load and total mill charge on
problems are displayed and explained at all stages of data
throughput and output quality. input. Limited error checking in built into the system, which
rejects grossly erroneous data entries. However there is still
Plant Optimisation Procedure basically a fundamental dependence on the user to enter data
correctly.
This is mainly used by the shift and plant metallurgists to
optimise the plant throughput based on the feed material Having entered and verified all the required plant data, the
characteristics. It is based on defined mill operating and feed system asks whether the problems are with the SAG or the ball
parameters, eg percentage of critical speed, ball charge, mill mill. As shown in Fig. 3 there is system override if the power

3
electroMAGNETIC Books vl.5 WHIM CREEK Copyright 1985-8
(INTRODUCTION)

(WEIGHT HIGH, POWER DEC. SLOW, REF:-PG12:SHW


( DATAINPUp) VISCOUS)

SAG MILL POWER => 960KW A viscous slurry can cause the grates to block, which will
lead to a mill spill. To reduce the viscosity of the slurry
SAG MILL TRIP OUT SAG MILL POWER < 960KW
TROUBLE SHOOT the water addition rate to the feed is increased to the
value given below
Increasing the water will cause a temporary SAG mill power

) C
SAG MILL BALL MILL
TROUBLE SHOOTS TROUBLE SHOOTS surge and an increase in the BALL mill circulating load.

POWER DRAW ) POWER DRAW


INCREASE SAG MILL FEED WATER ADDITION RATE BY (1/sec)
MILL WEIGHT CIRCULATING LOAD )
PGDN:NEXTPAGE PGURPREVIOUS PAGE W:WINDOWSIZE HOME:FIRST PAGE END:EXTT
DISCHARGE SLURRY
DENSITY J
Fig. 5. Explanation And Solution Page.
MILL DISCHARGE MILL NOISE )

NONE OF THESE
DISCUSSION
BALL CHARGE )

NONE OF THESE
This report simplifies an essentially very complex procedure
which depended as much on interaction with site personnel as
with fleshing out a pre-defined expert system shell. It has
Fig. 3. Trouble-Shoot Structure demonstrated clearly the actual and potential problems faced in
the development of any expert system of this type and also the
benefits for remote operations. These benefits include
draw is higher than or equal to the mill trip warning level.
increased support available to site staff at all levels and a better
That is control automatically passes to the SAG overload understanding of process, which are essential precursors to
Trouble-Shoot which offers more drastic remedies than in sustainable improvements in plant operation and the resultant
normal operating situations. increase in gross revenue and reduction in operating costs.
Normally the operator is offered the choice of SAG or ball
Some of the actual and potential problem areas are
mill diagnosis (see Fig. 3). An example of one of these trouble
summarised below. These need to be carefully explored in the
shoots is given in Fig. 4, where the operator is finding that the
design and specification of any expert system of this type, and
SAG mill weight is high, the power draw is slowly decreasing
certainly well before site work starts:-
and the slurry is viscous. As the weight is high the operator
chooses the Mill Weight Trouble-Shoot. The program first
a) Detailed definition of project aims and means of
asks the trend of the SAG mill power. On choosing 'decreasing
implementation.
power', it then asks the speed at which the change is occurring,
and 'slowly' is chosen. The operator is then asked if the slurry b) User acceptance formula.
is viscous. On choosing 'yes', the solution is offered. Along c) Knowledge source and methods of data
with this an explanation of the problems of a viscous slurry are acquisition.
given plus the consequences on the circuit of the offered e) Clear description of system structure.
solution. A solution screen is shown in Fig. 5. A reference is f) Methods of system development and editing.
also given on each screen to show the user the path that has g) Deadlines.
been taken. On finishing the procedure, the operator can either
return to the very first page of the system or page back to the The amount of time and money available and the problem to
Trouble-Shoot choice page and select another option. The be solved between them define the scope of the final system.
Trouble-Shoot remembers the route that was last taken and the While clearly there is a basic minimum, the degree of
last set of data even if the system is rebooted. sophistication of the system is effectively unlimited, and can
be extended until it no longer becomes cost effective. An
attempt to define cost benefits, eg revenue implications per
0.1% increase in saleable product, will help to establish a
( CIRCUIT >
ITROUBLESHOOTJ
realistic budget, and give all concerned a practical yardstick
with which to measure performance.
I WHICH MILL HAS PROBLEMS?]
BALL MILL | | SAG MILL
j WHAT IS PROBLEM WITH SAG MILL?"]
kW
It may be possible to modify an existing system, provided the
l I I I I I I M TWEIGHT
I MILL TP. T . W ore treated and the process plant utilised are broadly similar,
1 ,
MILL DISCHARGE | WHAT IS SAG MILL POWER DOING?"] and this can result in substantial savings. If it is a prototype
MILL NOISE
BALL CHARGE
INCREASING | version, its development time and cost is going to be
NONE OF THESE DECREASING significantly and possibly substantially higher.

1 HOW IS SAG MILL POWER DECREASING? [ An example of the amount of extra time that was consumed in
SLOW FAST the production of the HelpSAG prototype was shown up by a
time audit carried out by David Menne. He found that only
| IS THE SAG MILL DISCHARGE SLURRY VISCOUST]
about 25% of the time was actually spent in direct
YES NO
development. The time was consumed by familiarisation with
IEXPILANATION AND]
1 so
SOLUTION PAGE expert shell and the problem to be solved, travelling, waiting
for data and software refinement. So, for a similar project
Fig. 4. SAG Mill/ High Mill Weight/ Power Slowly using basically the same system the implementation time could
Decreasing/ Viscous Slurry Trouble-Shoot. be significantly reduced, though allowances should still be
allowed for travel and obtaining of data.

4
From the audit carried out on HelpSAG and other eMB's, the development of Trouble-Shoot. Although this took extra
David Menne has found that significantly less time is required time, it facilitated both fault diagnosis and future system
to produce similar expert systems when using the eMB than development. The best presentation method was found to be in
with a conventional expert system shell. Although the time graphical form so that the logic used could be easily checked
saving is not obvious on smaller expert systems, less than 100 by the expert. Initially a very large sheet of paper was used and
rules, it does become more significant as the size of the expert the screen printouts pasted on, with logic routes between them
system increases. For example to increase the size of a 100 clearly shown. Unfortunately this soon proved to be
rule expert system to a 400 rule expert system like HelpSAG impractical due to the large number of screens and the amount
using eMB requires 80% less effort. Since unlike conventional of editing that was needed in the beginning. However, a
expert systems where the time needed to add equal-sized management organisation chart software package was
parallel branches is the same, the development time for re-used available and lent itself well to illustrating the logic structure,
simple pages/rules is halved after first application. as in Figs. 1 and 3. This provided an effective system and the
speed and ease of development significantly increased.
A problem often overlooked is the acceptance by users, which
is basically a human factor. A lack of sensitivity to this The importance of deadlines is well documented when
problem and the exclusion of the users in the systems developing any new system. System developers are notorious
development can only limit the system performance or even for generating improvements and it needs an armslength
result in its failure. With this particular project, operator manager to decide when the system is essentially workable.
acceptance of the system and its successful establishment were
mainly due to an above normal level of operator and There is a great potential use of these types of expert system,
management interest in its implementation. This in itself especially on remote operations in the devloping countries.
followed on from the constructive relationships that already Since the system can provide on site expertise, training, uses
existed on site between shift operators, maintenance staff and relatively low level hardware and a simple man-machine
management at all levels. At Paddy's Flat this is clearly interface. They may become even more applicable due to the
reflected in the low labour turnover. fact that a lot of these countries want to reduce expatriate
while increasing local personnel.
The limited availability of high quality knowledge can be a
major inhibiting factor, although knowledge may be drawn The system has been accepted well by the operators, though at
from a number of sources rather than a single source. times there is still some reluctance to follow the instructions
Considerable ingenuity may be necessary before a coherent given in the Trouble-Shoot, particularly if an operator believes
and consistent knowledge base is assembled. In the particular that the advice can be shortcut. This is not necessarily a system
case presented here, two separate sources were located and failure, since knowledge has been passed on, even in its
between them provided a very detailed model for the overall apparent contradiction.
system. One was a consultant who specialises in the
specification of SAG mill systems, and the other was the plant The introduction of the system has also significantly reduced
manager who dealt with day to day operations. the call on the professional staff for operator training and from
operators for help with routine circuit problems.
Normally there is a link person between the expert system
shell and the human expert. This individual is called a The Plant Optimisation Procedure has also proved effective in
knowledge engineer. His skill lies in drawing out the the prediction of successful milling strategies.
knowledge of the expert and converting it to a form that can be
used in the expert shell. This knowledge transfer is usually Improvements in performance leading to revenue increases and
carried out in a series of interviews. These interviews tend to unit cost savings were not positively identified on the Paddy's
be of limited durations due to other external pressures on the Flat project, mainly because the necessary data logging
expert. Because of the time limitations and the need to convert procedures were not set up beforehand to establish a bank of
the technology into a programmable form, the knowledge before and after data. There was however a subjective
engineer effectively becomes an instant expert in the field he is consensus on site that the system had achieved its targets and
currently working on. This increases the amount of time would be both used and developed on a longer term basis.
required for development and may even make very complex
problems prohibitive in cost (Aylett, 1990).
FURTHER DEVELOPMENTS
Although eMB allows an expert to enter his knowledge
directly, the problem of the availability of expert time Since, the commissioning of the initial version of HelpSAG,
mentioned above still arises. So knowledge engineers and the Paddy's Flat flowsheet changed with the inclusion of a
interview technique were used. pebble crusher due to the increasing ore hardness. The system
thus required further updating to allow for these changes.
One of the most important factors that affect the performance
of any expert system is the correctness of the data input. The commissioning time spent on the plant proved very useful
Expert systems tend to be what is called fault intolerant. That and provided a lot of information for further developments.
is, they assume that the data presented is correct. The
detection of faulty data can be achieved by further It was soon found for example that there was a significant
programming, but as the problem gets more complex the difference in the demands made on the system by the shift
amount of programming required increases and gets operators and the metallurgists. As a result two separate
prohibitive in cost. In reality it is effectively impossible to systems were developed.
think of all the possible data problems especially when dealing
with manual input. The only practical solution is manual HelpSAG Plus (Metallurgists) :- Contains the Plant
checking and then double checking of the initial input data, Optimisation Procedure, the Trouble-Shoot, and the
and the rigourous control of any later additions. Training Manual with an enhanced in-depth theory
section, but with optional Standard Instructions and
The importance of clearly showing the structure and logic Operator Duties. It is essentially an office based
behind the expert system became increasingly apparent with system.

5
HelpSAG Operator (Operators) :- Contains the Bernard Siddall, Orway Mineral Consultants, Suite 8,18
Trouble-Shoot and Training Manual with simplified Kearns Crescent, Applecross, Western Australia 6153.
theory concepts. The original Plant Optimisation David Menne, Menne tech Pty Ltd, 10 Mayfair Street, Mt
Procedure was removed. Claremont, Perth, Western Australia 6010.
Ivan Hunter, Senior Metallurgist, Dominion Mining Limited,
As described earlier the performance of the Trouble-Shoot 10 Richardson Street, (PO Box 465), West Perth, Western
depends heavily on the accuracy of input data. To reduce data Australia,6872.
transfer errors the logged process data available from the PMC Mark Rodgers, Greenbushes Ltd, P.O. Box 31, Greenbushes,
were automatically loaded straight from the PMC to the Western Australia 625.
Trouble-Shoot. This was carried out by a cable link between Dr Steve Hall, The Department of Mining Engineering, The
the PC and the PMC via a RS232 interface and some low-level University of Nottingham, University Park, Nottingham,
communications programming. England.
Bill Bradford, Bill Bradford and Associates, Penford, New
The ability to input plant data directly into the Trouble-Shoot Road, Tring, England, HP23 5EX.
has brought about the possibilities of closed loop control. This
would be achieved by connecting the expert system to the Science and Engineering Research Council (SERC) for the
highest level of control ie provide the set points for the provision of a standard research award to enable the author to
controllers (Elber, 1989). conduct further research on the application of artificial
intelligence in mineral processing.
Over the last year or so further Help packages have been
implemented. These are:-
REFERENCES
a) HelpSAG Designer - used by Morgardshammar, a
leading Swedish mill manufacturer, for optimizing Aylett, R. (1990). Knowledge acquisition tools. Expert
SAG/Ball Mill circuits. System Applications. June 1990 ,11-14.
b) HelpCIL - Provide technical help for the running Elber, Dr. L. (1989). SAG mill control. SAGSEM 89.
of CIL circuits. Murdoch University, Perth, Western Australia, Nov-Dec
c) HelpLEACH-Provide technical help on the 1989. pp 272-278.
problems of heap leaching. Forsyth, R. I. (1989). Expert Systems - Principles and Case
d) HelpSAG Plus - A copy has recently been Studies.2nd ed, 1989. Chapman and Hall Computing,
installed in Ghana on the gold plant of Canadian London.
Bogusu Resources Ltd. Menne, D. M. (1989). Deterministic Expert Systems: A
planning tool for project development. The AusIMM
Other possible developments are HelpFLOAT and Annual Conference. Perth-Kalgoorlie,Western Australia,
HelpASSAY. May 1989. pp 175-180.
Siddall, G. B. (1989). A review of SAG milling in
Australia.SAGSEM 89. Murdoch University, Perth,
CONCLUSIONS Western Australia, Nov-Dec 1989. pp 8-22.

This paper has covered briefly the problems involved in the


production of an expert system for a SAG/ball mill circuit on a
remote Australian site.

HelpSAG proved to be a useful tool and showed some of the


benefits that can be brought to mineral processing by the use
of expert systems. This and other developments have
highlighted most of the major potential application areas for
expert systems in mineral processing ie training, on-line
advisers, and process control.

Care must be taken in applying the methods and models


described here to other sites. Expert systems are fault
intolerant, and if not correctly set up will never perform well.
If not applied to the domain for which they were originally
intended they may even be positively damaging.

The main drawback in the implementation of expert systems,


especially the more complex problems, has been development
time and cost. However with improved development
techniques and tools the time and cost can be significantly
reduced. As these reduce the range of possible applications
will increase, thus extending the benefits in terms of on-site
expert knowledge and better process understanding.

ACKNOWLEDGMENTS

I would like to thank the following people and organisations


for giving me the opportunity to get involved in HelpSAG, for
their help during its development and for assistance and advice
in the production of this paper.

6
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

INTUITIVE PROCESS CONTROL SYSTEM


PROGRAMMING
R. Spring and M. Franklin

Noranda Technology Centre, 240 Hymus Blvd., Pointe Claire, QC, Canada

Abstract. Natural language and pictorial knowledge representations help make process control
system programming more intuitive. This was shown in the Brunswick Mining mineral
concentrator by the development of a flotation reagent control system built around a process
flowsheet. When an engineer uses a computer "mouse" to change the flowsheet, the control
system changes accordingly. This approach both reduces the programming effort and makes the
control strategy straightforward to understand. The control system required less than one
man-month to develop and paid for itself in under six months. The implementation uses Gensym's
G2(tm) software linked to a Fisher PROVOX(tm) distributed control system.

Keywords. Process control, mineral processing, programming environments, computer graphics

The pictorial approach makes the control system easy to


INTRODUCTION
maintain. If a piece of equipment must be taken out of
service, a process engineer simply uses the computer
The right tool makes any job easier. As control
mouse to reflect the changes on the flowsheet. The
engineers tackle ever more ambitious problems, their
computer makes all necessary adjustments to control
need for appropriate programming tools increases.
parameters based on the new flowsheet.
Software companies (Matthews, 1988; O'Reilly and
Nordlund, 1989; Shewchuk, 1987) have responded by
marketing a number of products . Many of these are
HARDWARE AND SOFTWARE
based on object-oriented programming because "a
highly self-documenting graphical language that uses
The development of the reagent control system used a
symbolic object-oriented constructs has been shown to
combination of Gensym's G2(tm) software running on a
aid the long-term maintenance of the source program."
V A X s t a t i o n 3500 (tm) c o m p u t e r and a Fisher
(Arnold, 1987) The Noranda Technology Centre and
PROVOX(tm) distributed control system (DCS). This is
Brunswick Mining Limited used one such product, called
shown in Fig. 1. This marriage takes advantage of the
G2(tm), to develop a control system for flotation reagents.
strengths of both components. G2 provides a
programming environment adapted to the control
As a programming tool, G2 made practical the synthesis
problem and the needs of process engineers. The DCS
of a novel control system based on process flowsheets.
provides efficient single-loop control and a robust
Elegantly called the "schematic capture of process interface for the flotation circuit operators. Moreover, by
knowledge" (Moore, 1986), this approach includes the integrating G2 with the existing DCS, the new control
process flowsheet of the Brunswick Mining concentrator system was implemented without the expense of new
in the knowledge base. The flowsheet defines what field wiring and without the need to re-train operators on
process equipment is used and how it is connected. new consoles.
This is an intuitive and efficient programming method that
makes control engineers more productive.
The data communications link between G2 and the DCS
is critical to the success of the overall system. The link
In addition to the plant flowsheet, the control system
uses additional hardware and software purchased from
includes models of the material transport dynamics of
Gensym and Fisher. Together, the various components
each type of process equipment. With this information,
provide the following important features:
the control system compensates for material transport
delays and properly stages the dosage of reagents as
1) Programs (or "knowledge bases" in G2
disturbances propagate through the concentrator.
parlance) have read/write access to the
During a plant shut-down, the system keeps track of the
operator's console and approximately 600
delays caused by the various tanks, pipes, etc. and
field devices such as flow meters, valves,
appropriately stages the shutdown of the reagent
etc. in the Brunswick Mining concentrator.
feeders.

7
2) Data transmission delays are typically less To illustrate consider the problem of adding reagents to
than a few seconds. the copper-lead rougher flotation circuit at Brunswick
Mining. The ore flowrate into the flotation circuit is not
3) G2 programs and DCS functions are not measured. The nearest upstream measured ore flowrate
synchronized: both continue to run while is the rod mill feedrate: it is measured by a weightometer
waiting for the other to supply data. on the mill feed conveyer belt. Under steady state
conditions the mill feed and flotation feed ore flowrates
4) The communications link is simple to use. are equal. However, as shown in figures 2 and 3, when
This is in keeping with the overall intention the ore to the rod mill is interrupted for 20 minutes, the
of simplifying the programming of the ore flow to the flotation circuit has a very different
process control system. dynamic response. It does not change for several
minutes, then drops down and only fully recovers after
for several hours. If the reagent control system only has
ANALOG SIGNALS knowledge about the ore flow rate to the rod mill, it must

m it necessarily underdose the reagents to the flotation circuit


for 20 minutes, then overdose the reagents for several
hours afterwards. This leads to poor mineral separation,
PROCESS alters the material balance and further disturbs the
I/O downstream processes.
UNIT

36
CO
\
35 L-^VlAT^
**
LU 34
H
<
DC
INTEGRAL 33
/ 01
OPERATOR
FUNCTION 5
CONTROLLER V CONSOLE o 3?
-1
UL
LU 3 1
cr
O 30
DATA HIGHWAY (CO-AXIAL CABLE) 6:00 8:00 10:00 12:00
TIME (a.m.)

Fig. 2: Rod mill feed ore flow rate.

DHI

VAXSTATION 3500
CHIP
ESDS
G2/GSI

Fig. 1: G2-PROVOX communications.

8:00 10:00 12:00


THE REAGENT CONTROL PROBLEM TIME (a.m.)

The objective of flotation reagent control is to meter the Fig. 3: Flotation feed ore flow rate.
reagents in proportion to the ore flow rate. The major
difficulty is the large and variable material transport delay
in the control loop. Engineering and economic CONVENTIONAL CONTROL
considerations severely limit the number of places where
the ore flow rate can be measured, whereas reagents are Automatic c o m p u t e r control has long been a
added at numerous locations. There can be any number preoccupation in mineral concentrators. (Kay, 1975;
of tanks, pipes, etc. between the point where an ore flow Konigsmann, 1976; Le Guen 1975; Lewis, 1971; Trusiak,
is measured and the point where reagent is added. This 1981) and conventional proportional-integral (PI) control
is of little consequence under steady state conditions. is successfully applied in many cases. However, PI
But, during transients the measured ore flow rates may control is ill-suited to applications with significant
be very different from the ore flow rates where reagents non-linearities or dead-time such as the foregoing
are added. flotation reagent example. Various other control

8
algorithms have been used, such as Dahlin's method, the d i s c h a r g e o r e f l o w of any rod mill = the average
minimum variance, adaptive, multi-variable, inferential, value of the f e e d o r e f low between 3.5 minutes ago
model-reference, dead-time c o m p e n s a t i o n , etc. and 1 minute ago.
(Franklin, 1988; M o r a r i , 1988; Siverns, 1984;
Stephanopoulos, 1984) This model captures the essential dynamic behaviour of
rod mills including the time delay and variable ore
Unfortunately, all these methods are highly mathematical inventory. Figure 5 compares the step response of the
and "the human understanding of a system and its model to a conventional first-order plus dead-time model
mathematical description are often alien, resulting in a and an actual measured response. (Flament, Hodouin
lack of comprehension and loss of confidence in control and Spring 1985; Spring, Larsen and Mular, 1985)
decisions." (Francis, 1984) Consequently, the conception
and implementation of advanced control strategies
becomes a slow, painstaking process. Once fielded, the
1.2
systems then require staff with extensive computing skills
to maintain.

THE INTUITIVE ALTERNATIVE LU


!< 0.8
As an alternative to the entirely mathematical formalisms 00
used in conventional control systems, the system 1 st
y\
developed at Brunswick Mining uses a flowsheet and Q0.6 ORDER '/
near-English models. For example, to estimate the MODEL ~ \ ' / MEASURED
instantaneous mass flow into the copper-lead rougher HI 0.4
flotation circuit, the control system uses the flowsheet
shown in Fig. 4.
o
5 0.2
LLI
O

z 1 2 3 4 5
TIME SINCE FEED CHANGE (minutes)

Fig. 5: Mill Step Responses.

As mediums of e x p r e s s i o n , the flowsheet and


English-language models have greater intuitive appeal
than a conventional computer program. With them, the
control engineer is more productive. He can spend more
effort solving the control problem and less effort solving
the computer programming problem.

COUPLING OF G2 AND THE DCS


WIQ-G101

T L1-CU-PB-RHGR The G2 program and the DCS work hand-in-hand. For


example, referring to Fig. 4, the object labeled WI-G101
represents the measured ore flow rate into the rod mill.
COND-1 The DCS sends this measurement to the G2 program at
unpredictable time intervals varying between a few
seconds to a minute. The G2 program records the time
series of these measurements. Using the time series and
Fig. 4: Portion of flowsheet for control system. material transport models such as the above rod mill
model, the G2 program maintains estimates of the
instantaneous ore flowrates at all downstream locations.
The flowsheet is supported by a collection of models for The object labeled WIQ-G101 represents an estimated
all the types of equipment used in mineral processing ore flow rate that the G2 program provides to the DCS.
circuits: rod mills, hydrocyclones, flotation cells, etc. The Conventional ratio and PI controllers in the DCS then use
models are defined by the user in near-English language the estimated flow to meter reagents to the copper-lead
statements. They can be as complex or simple as rougher flotation circuit.
desired. For example, the following is a model for the
material transport through a rod mill.

9
CASE STUDY ECONOMIC BENEFITS

The advantage of the flowsheet approach was proven The flotation reagent control system makes Brunswick
when Brunswick Mining was faced with a three-week Mining's metal recovery less sensitive to ore flow
shutdown of the Line No. 2 rod mill. Normally the rod mill interruptions due to plugged ore feeders, equipment
shutdown would have meant a loss of production or a maintenance, ore shortages, etc. For example, an
major reconfiguration of the DCS. However, with the G2 analysis of operating data for the eleven months
programming tool, a simple piping change allowed preceding the implementation of the reagent control
Brunswick to continue at almost full production. system showed a strong correlation between zinc
recovery and rod mill shutdown frequency: zinc recovery
As shown in Fig. 6, the Brunswick Mining concentrator is dropped 1.3 percent for every shutdown per day.
divided into three parallel lines, each with its own grinding However, after the new control system was implemented,
and flotation sections. When the Line No. 2 rod mill was zinc recovery dropped only 0.5 percent per shutdown
shutdown, engineers took advantage of excess grinding per day. The control system reduces zinc loses by
capacity in Line No. 3 to partially compensate for the loss compensating for material transport delays and
of the Line No. 2 rod mill. The Line No. 3 grinding section maintaining the correct reagent-to-ore ratios throughout
was operated at a higher than normal rate and its the circuit at all times. The value of the greater average
product was divided between the Lines No. 2 and No. 3 metal recovery is estimated at over $CAN 500,000
flotation sections. annually. The project costs were about $CAN 250,000,
so the payback period is less than six months.

Above and beyond the immediate monetary benefit, the


CASE STUDY use of flowsheets and English-language models makes
the control system more user-friendly. This encourages
LINE No. 3 LINE No. 2 LINE No. 1 plant metallurgists and control engineers to experiment

i. 1 1
with new control strategies and make better use of the
control system.

zjzi \
GRINDING CONCLUSIONS

The flotation control system at Brunswick Mining is easy


to understand and maintain because it uses flowsheets
and English-language models in its programming. The
models describe how ore is delayed in each piece of
process equipment; the flowsheet describes how the
FLOTATION equipment is connected. Together this information
provides the control computer with all the information it
needs to correctly stage reagent dosages during plant
startups and shutdowns. The results is a lesser
sensitivity to ore disruptions and greater metal recovery.
Fig. 6: Parallel lines at Brunswick Mining.

Before the flowsheet-based control system was REFERENCES


implemented, the reagents in a particular line were
metered according to the signal from the rod mill feed Arnold, J.A., Brandl, D.L, Jefferys, S.A., "The next
weightometer in the same line. In this case, the generation of process control programming",
temporary piping change would have necessitated a ISA 1987, Paper #87-1192.
major reconfiguration of the flotation reagent controllers. Flament, F., Hodouin, D., Spring, R., "The SPOC Manual
However, with the new control system, the piping change - Chapter 7.3: RTD and Mixers Computer
was quickly reflected in the control system's flowsheet Programs", Minister of Supply and Services
schematic. The system could then immediately correctly Canada, 1985.
estimate the flows into the flotation section of each line Francis, J.C., Leitch, R.R., "ARTIFACT: A real-time shell
and add the appropriate amounts of reagents. for intelligent feedback control", Proceeding of
the BCS Conference on Expert Systems,
1984, pp. 151-162.
Franklin, M.W., "Adaptive Control of Flotation Circuits",
Control Systems 88, Montreal 1988, pp.
133.. 136.
Kay, K.R., Paterson, J.G., "Computer control at the
Clarabelle mill", CIM Bulletin, Feb. 1975, pp.
113-.

10
Konigsmann, K.V., Hendriks, D.W., Daoust, C ,
"Computer control of flotation at Mattagami
Lake Mines", CIM Bulletin, March 1976, pp.
117-121.
Le Guen, F., "The control of a flotation process", CIM
Bulletin, April 1975, pp. 113-
Lewis, C.L., "Application of a computer to a flotation
process", CIM Bulletin, January 1971, pp.
47-50.
Matthews, B., Lindenfelzer, P., Hawkinson, L, Moore, R.,
"Process control with the G2 real-time expert
system, Proceeding of the First International
Conference on Industrial and Engineering
Applications of Artificial Intelligence and
Expert Systems, June 1988, Vol. I, pp.
492-497.
Moore, R.L, Kramer, M.A., "Expert systems in on-line
process control", Chemical Process Control -
CPCIII, Elsevier 1986. pp. 839-867.
Morari, M., "Advances in process control theory",
Chemical Engineering Progress, October
1988, pp. 60-67.
O'Reilly, J.J., Nordlund, K.C., "Introduction to SLAMII and
SLAMSYSTEM", Proc. of the 1989 Winter
Simulation Conference, MacNair, E.A.,
Musselman, K.J., Heidelberger, eds.
Shewchuk, C.F., "MASSBAL MKII: New process
simulation system", Pulp and Paper Canada,
88:5(1987), pp. T161-T167.
Siverns, S.L, "Modelling and control of industrial mineral
crushing and grinding circuits", Master of
Engineering Thesis (Chemical Engineering),
McMaster University, 1984.
Spring, R., Larsen, C , Mular, A., "The SPOC Manual -
Chapter 4.1: Industrial ball mill modelling",
Minister of Supply and Services Canada,
1985.
Stephanopoulos, G., Chemical Process Control, An
I n t r o d u c t i o n to t h e o r y and P r a c t i c e ,
Prentice-Hall, 1984.
Trusiak, A., "Computer control of Brunswick Mining's No.
12 Cu-Pb flotation circuit", Proceedings of the
13th annual meeting of Canadian Mineral
Processors of CAN MET, Ottawa, Jan. 1981,
pp. 96-113.

G2 is a trademark of Gensym Corporation.

PROVOX is a trademark of Fisher Controls International.

VAXstation 3500 is a trademark of Digital Equipment


Corporation.

11
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

THE CONTROL OF MINERAL PROCESSING PLANTS


USING NEURAL NETWORK TECHNIQUES
W. Stange

Department of Metallurgy and Materials Engineering, University ofWitwatersrand, Johannesburg, South Africa

Abstract. The use of neural networks in the control and A net is trained by presenting a sequence of input signals and
monitoring of two minerals processing operations is described. the associated outputs to the network. Weighting factor values
The application of nets, coupled with a novel control scheme, which minimise the difference between the required and
to the control of hydrocyclone classification proves most predicted network outputs are then found. A trained net
successful. The neural net is used to develop a model of consists of the numerical weighting factors on each connection
cyclone operation, allowing model based control to be and an appropriate structure (number of input, output and
performed. The monitoring of adsorbent activity in a CIP hidden layer neurons) which gives a good match between
adsorption plant is also described. A neural network can be required and predicted outputs.
trained to monitor carbon activity on-line, using only This may be seen as the estimation of adjustable model
measurements which are currently practical to perform. The parameters in order to fit a set of data, where the form of the
success of this approach depended on the use of physically model does not need to be specified explicitly. This provides,
meaningful grouping of variables as network inputs. in principle, the ability to adapt to changing circumstances if
Keywords. Neural nets; process control; mineral processing; on-line training data is used. As more data is supplied, for
adsorption; hydrocylones; CIP. example in an on-line control system, the predictive ability of
the net can be enhanced and extended significantly. Other
advantages of nets include using simulated data for initial
training, as well as the use of complex, nonlinear and "noisy"
INTRODUCTION data.
Circumstances are forcing mineral processing operations to This paper endeavours to illustrate typical applications and
become increasingly efficient in terms of economic benefits using nets, rather than a more formal and detailed
performance, safety, environmental impact and so on. examination of the properties of nets themselves. Where
Automatic control may assist in the attainment of these possible, the robustness and reliability of the net is
worthwhile objectives, although it is often difficult to investigated. The network used here was a Pascal
successfully implement sophisticated control systems due to implementation of the back propagation algorithm run on a
the complexity of such processes. This paper discusses the way 80386/80387 microcomputer.
in which neural networks may be used to control two common
processes in the minerals industry; hydrocyclone classification
and in-pulp adsorption. The results presented indicate that CONTROL OF A HYDROCYCLONE CLASSIFIER
neural net technology can play a very useful role in the control The hydrocyclone is one of the most widely used and flexible
of such systems. devices in minerals processing. This device, although based on
complex phenomena, is relatively simple to operate. Due to
Neural Nets this inherent complexity mathematical models of these devices
A neural net can be visualised as a means of correlating are normally empirical in nature, although some progress has
patterns or relationships between sets of input and output data. been made using a fluid dynamics approach. Empirical models
A net consists of a number of layers of nodes (or neurons) as (Plitt 1976) work well when calibrated for the particular
shown in Fig.l. Data is fed to the input layer, while outputs are circumstances.
obtained from the output layer. One or more intermediate or Good cyclone operation is only possible if fluctuations in the
hidden layer(s) of neurons are used to connect the input to the feed stream are minimised. It appears as though good control
output layer. Each neuron receives a number of input signals of cyclones is one of the most neglected areas in the industry.
which are summed and transformed using an appropriate Given that cyclones may have a significant impact on the
function. Before being transmitted to the next layer, these efficiency of one of the most expensive processes in the
signals are modified by a weighting factor. industry, i.e. grinding, this is certainly cause for concern. The
problem of poor cyclone performance probably arises due to
the operational simplicity of the cyclone, resulting in a lack of
Fig. 1 : Typical Neural Net
care and attention in a plant environment.
Output Signals
A typical single stage hydrocyclone installation is illustrated in
ft fl Fig.2. A sump or tank is fed with material, which is then fed to
a hydrocyclone (or several cyclones in parallel). The normal

o
control objective is to control the tank level as well as the
product size specification in one of the cyclone product
streams, despite fluctuations in the circuit feed.

Adjustable
Connection
Weight

Input Signals

13
In recent work (Stange, Moys and Hinde 1991) the control of a Fig 3 : Relationship Between Cyclone
hydrocyclone circuit for the preparation of backfill material is % Solids and Pressure
examined using dynamic simulation. In this application,
cyclones are used to de-slime tailings material. This must be
done properly or severe consequences may result. Good
cyclone control is thus crucial. It was found that the solid to
liquid ratio in the cyclone feed, and the pulp flowrate through
the cyclone both have a significant effect on cyclone
performance. This made it impossible to use conventional
SISO techniques which rely only on simple measurements such
as pulp density and pressure.
A single stage circuit can be controlled effectively under a 100 150 200
wide range of conditions using the following strategy (shown Cyclone Pressure, kPa
in Fig.2):
Feed Size Distrib.
Control of sump level using a variable speed pump - Normal Fine

Control of product size specification using dilution water Constan


t %-10u
m in U/F
addition rate
This strategy requires an on-line measurement of the product The neural net control system was tested by introducing a step
size distribution. Such devices are expensive and are often change in the feed stream pulpflowratefor different feed size
difficult to maintain. A more practical approach was therefore distributions, outside of the range of the distributions used to
developed. train the net. A typical dynamic response curve for the system
is shown in Fig.6, where a net containing 2 hidden 3nodes was
Fig 2: Single Stage Cyclone used. The feed flow was stepped from 110 to 80 m /hr at t=1.0
hr and then to 140 m3/hr at t=1.5 hr with a "fine" feed size
distribution. Good control of the cyclone product size
distribution results, considering that no direct measurement of
Dilution water
the size distribution is made. Observation of the dynamic
circuit responses using the nets with different numbers of
hidden nodes (2,3 and 6) showed that the number of hidden
nodes had no significant effect on the control system
behaviour.
For this relatively simple application, a neural net works at
least as well as other techniques. The advantages of the net
approach are that a satisfactory model of the system can be
developed without having to specify the form of the
correlation. The net can be trained using simulated data in very
reasonable times on a microcomputer and the calculations
required to find the net prediction in an on-line system are
simple. Obviously, instead of using simulated data, real data
Model Based Control of a Hydrocyclone Using a Neural Net collected by plant sampling or on-line measurement could have
Studying the dynamic responses to various feed perturbations been used. Such data would contain much more error than that
for the above control strategy, it was found that the control produced by simulation.
effect is to change the cyclone operating pressure and feed %
solids in such a way that the product size remains constant as Neural Net Control Using Noisy Data
shown in Fig.3. This illustrates the cyclone pressure required to The effect of noise in the training data was investigated by
maintain a given product size specification for various values adding random noise to the simulated data. Four different error
of the cyclone feed % solids, for 3 different feed size levels were tested; 5,10,15 and 25% respectively. Each set of
distributions (normal, coarse and fine). Figure 3 also shows the noisy data was used to train the network using 2 hidden
overflow % solids measurement which, at a given operating neurons. The average error between the net predictions and the
pressure and feed density, is a measure of the nature of the feed perturbed data varied from 18.3 to 21.9 kPa, which is
size distribution. Thus, a measure of the feed and overflow % significantly larger than the 6.2 to 7.9 kPa error obtained using
solids allows the cyclone operating pressure which will provide
the required product size distribution to be predicted. the simulated data. In addition, the net tended to learn more
slowly with the noisy data. The nets developed using noisy
This suggests the following model based cyclone control data were then used in the dynamic simulator to test the
strategy: behaviour of the control system. The feed flow was stepped
from 110 to 80 m3/hr at t=1.0 hr and then from 80 to 140 m3/hr
PID control of the sump level by manipulation of dilution at t=1.5 hr for all tests. Both "fine" and "coarse" feed size
water. distributions were tested. The following table summarises the
PID control of the cyclone operating pressure by results obtained.
manipulation of the pump speed.
The use of a model to estimate the cyclone operating Fig 4 : Model Based Control
pressure required for the measured feed and overflow
percent solids, which is used to update the pressure setpoint
in the PID loop. This is shown in Fig.4.
The data in Fig.3 was used to train a net, in which the inputs Residue

ap
were the feed and overflow pulp densities, and the output was
the pressure. The average error between measured and Variable
predicted cyclone pressures ranged between 6.2 and 7.9 kPa speed
pump
depending on the number of hidden nodes used. Figure 5 Y -
. -
shows the training curve obtained for 2,3 and 6 neurons in the
hidden layer, as well as for different learning rates. The &) (S) \ )
learning rate and number of hidden nodes do not affect the
ability of the net to learn much for this case.
v
L Yi i
1
1 Model

Backfill product

14
Fig 5 : Influence of Hidden Nodes and Fig 6 : Dynamic Response with Neural
Learning Rate on Convergence Control. Steps at t=1.0 & 1.5 hrs

Average Error in P, kPa


-
. - / . -
r
*l
n

v. v
*1

' 1
0 0.25 0.5 0.75 1 1.25 1.5 1.75 2 2.25 2.5 2.75 3
Time, hrs
Nodes
, Learnin
g Rate
a) 2 (E-0.9
) b)2(E=0.3
) c) 3 (E=0.9
)
d) 3 (E=0.3
) e)6(E=0.9
) )f 6 (E=0.3
) Pressure
, kPa Feed % Solid
s

TABLE 1 Control System Behaviour With "Noisy" Data


The output of the net would be the pressure required to operate
Ave. Feed Size Ave. % -10 the cyclone (at the given feed and overflow densities) at the
Error, % Distribution microns appropriate size setpoint as illustrated in Fig.7. The on-line
measures of density, pressure and size would then be used to
0 Fine 9.92 0.28 train the neural net on-line. This approach has the following
0 Coarse 9.91 0.30 advantages:
5 Fine 10.72 0.50
5 Coarse 9.18 0.25 The net is trained initially using simulated data, minimising
10 Fine 11.09 0.61 time consuming and possibly disruptive plant dynamic
10 Coarse 8.65 0.29 testing.
15 Fine 11.34 0.69 The net can be trained on-line to adapt to changing plant
15 Coarse 8.53 0.31 conditions. The net can "learn" how to deal with new
25 Fine 11.47 0.77 situations in an on-line manner.
25 Coarse 8.43 0.31
Inference of cyclone feed size distribution. A network could be
The average value, as well as the standard deviation, of the structured and trained such that given inputs such as the
product size distribution during each simulation is shown. As cyclone pressure, feed and overflow densities and a
the error in the training data increases, the offset due to the measurement of the product size, a prediction of the cyclone
cyclone feed size distribution becomes more prominent. The feed size distribution is output. This is obtained at no extra cost
standard deviation for the fine feed also increases as the error in terms of instrumentation, as the same instrumentation used
in the training data increases indicating larger variations in the for the primary control scheme is utilised. Another net is
product size distribution. However, control is still remarkably simply trained to use these measurements and provide some
good, with an error of 1.5% -10 microns (relative to the indication of the cyclone feed PSD.
desired setpoint of 10% -10 microns) for the 25% error case. Inference of a second size distribution point. Most PSMs
Observation of the dynamic responses showed that no measure only a single product size point. A measurement of
instabilities developed as the error in the training set was two size points provides significantly more information. A net
increased; the effect is merely that the ability of the net to could be trained, with similar inputs to those described above,
correct for the nature of the feed size distribution is not as to provide some inference of a second point. Again, there is no
good. extra instrumentation cost.
Application in a circuit with multiple modules. Many milling
Neural Net Control with On-line Measurement of PSD plants are constructed as several parallel modules. Using the
In many cases a particle size monitor (PSM) may be installed above approach, one module could be equipped with all the
in order to provide an on-line measurement of the product necessary instrumentation to train a neural network on-line.
particle size distribution if this has a critical influence on The other modules would be equipped only with the
downstream processes. PSM instrumentation is readily instrumentation required to measure the network inputs. The
available and works well if properly maintained. The network trained on the well instrumented circuit could then be
possibilities of using the control strategy described above when used to implement control which is just as effective on the
an on-line measure of the product size distribution is available other modules. The reduction in instrumentation cost and
is therefore explored. maintenance becomes significant as the number of modules
On-line training of the net. The most obvious option is to use increase.
the same strategy as discussed above : Back-up for instrumentation. An appropriately trained network
Control of the sump level by water addition, can be used as a replacement or back-up for the PSM when this
is taken off-line ensuring good control under all circumstances.
On-line calculation of the required cyclone feed flowrate
using a neural net, with appropriate pressure and density Future Directions
measurements, These methods are potentially an efficient general approach for
Use of a variable speed pump to achieve the required the control of hydrocyclones, irrespective of the application.
cyclone flowrate. Plant data is being collected to verify the approach used and to
train an appropriate network. Research, utilising dynamic
The on-line measurement of PSD (together with the other simulation as an experimental tool, is presently being
required measurements) would then be used to update and train conducted to determine how this strategy would function in a
the neural net on-line. The net is then a system model which recycle grinding circuit. Due to the inferences which nets may
allows compensation to be made for the variable interactions. allow regarding unmeasured variables, this would combine
The neural net would be structured slightly differently. Inputs well with an expert system control approach.
would consist of:
The desired product particle size measurement, as an
adjustable setpoint
The cyclone feed density
The cyclone overflow density

15
The rate expression R can be quantified using an appropriate
Fig 7 : Neural Network Control expression, which can vary dramatically in complexity. For the
with PSM Measurement purposes of this exercise, the complexity of the rate equation is
not an important issue. The modelling of the CIP adsorption
system is discussed in detail by Stange and colleagues (1990).
A simple expression for R was used in this work which has the
following form :

R=k\C-^
(2)
where k is a kinetic parameter and A is an equilibrium
parameter, corresponding to carbon activity.

Fig 8 : CIP Flowsheet

cyanide Leach
Feed Pulp

CONTROL OF AN ADSORPTION PROCESS


Adsorption systems play an important role; e.g. resins for the
recovery of gold and uranium, activated carbon to recover gold
etc. The carbon-in-pulp process (CIP) is widely used in many
countries for gold recovery. The heart of a CIP plant is the Elution

adsorption circuit. The modelling of this process on an t=i z>


industrial scale has been studied in detail (Stange and others Elect rowinning Regeneration
I X Xfr=
1990).
A typical CIP plant is illustrated in Fig.8. The adsorption plant Pulp

consists of a number of CSTRs in series. A batch of adsorbent


(carbon) is retained within each CSTR using a mechanical Carbon

screening system. The pulp flows through the CSTR and the Eluate
CIP Adsorption
gold is adsorbed from solution onto the carbon. The carbon is
moved countercurrent to the pulp flow in a periodic manner,
resulting in efficient scavenging. The highly loaded carbon is A Neural Network for Monitoring Carbon Activity On-Line
eluted to remove the gold and then regenerated to restore the Can a neural network be used, together with appropriate
adsorption capacity, or activity, of the carbon. Regenerated on-line data, to monitor carbon activity on-line ? An exact
carbon is returned to the circuit. measure of the activity is not required; a relative measure is
sufficient to serve as an early warning system for the plant
The rate of adsorption of gold onto carbon is dependent on the operators.
kinetic and equilibrium properties of the system. Factors such
as the size of the carbon particle, intensity of mixing, the It has been demonstrated on an industrial scale that on-line
carbon pore size distribution etc. will affect the kinetic measurements of the mass of carbon as well as the gold
processes. In any given plant, these kinetic factors should concentration in solution can be obtained. It is not practical to
remain approximately constant. Simulation work has shown obtain an on-line measure of gold on the carbon although this
that the system is often more sensitive to changes in the has a significant effect on the rate of adsorption. The model (1)
equilibrium parameters than in the kinetic parameters. It is and (2) above shows the variables which play a role in
possible, even probable, that the equilibrium parameters for a adsorption. Ignoring the problem of measuring the loading y
given system will change with time. This may happen due to on-line, an obvious first approach is to develop a network
the fouling of carbon by other adsorbed components, poor where the quantities Mc, Q, C, Cf and y are network inputs and
regeneration or unexpected contamination of the pulp. A (a measure of the carbon activity) is the output.
Normal operating practice is to take routine manual samples in A net trained on single stage data. An appropriate training set
order to evaluate the efficiency of process operation. If containing 208 points was developed using the simulator
problems are suspected, more intensive testwork is carried out. described by Stange and others (1990) and appropriate
This is often performed in a laboratory and subject to delays of parameters, for a single adsorption stage operating under a
days or weeks. Such delays may result in the carbon range of conditions. A network with inputs (Mc, Q, C, Cf and
undergoing significant fouling before conclusive results are y), 5 hidden nodes and one output, A, was trained. Values of A
obtained. Several weeks or months of careful operation may be ranged from 2 000 g/t to 20 000 g/t and the trained net was able
required to restore carbon activity. Significant loss of revenue to achieve average errors in the predicted values of A of about
may take place during this period. 820 g/t.
This appeared to be accurate enough for monitoring purposes.
A Basic Adsorption Model However, when the network was applied to simulated data
from a 5 stage plant in which a continual change in the
The behaviour of a single adsorption CSTR may be represented parameter A was simulated, it was found that the network
by the following differential mass balance: predictions did not correlate with the simulated value of A at
dCjt) all. It was concluded that the network was not able to
s Q{t){Cf{t)-C{t))-Mc{t)R{y,C) successfully abstract the essential features of the system in this
dt way.
(1)
where C is the gold solution tenor in the tank, Cf is the solution A network trained OJI multiple stage data. Data was then
tenor in the tank feed, y is the loading of gold on the carbon, Vs generated by simulation of a 5 stage CIP plant under various
is the volume of solution, Q is the solution flowrate, M<. is the operating conditions. The quantities M<., Q, C, Cf and y from all
mass of carbon and R is the rate of adsorption onto the carbon. 5 stages, together with the corresponding values of A for each
stage, were used as training data. This resulted in a large data
set of 2400 points. It was found that the network did not train
satisfactorily, achieving average errors in A of 1 400 g/t. This
is not surprising, given the enormous variation in magnitude in
some of the data items (C and y) over the 5 stages of the plant.
This could be up to 3 orders of magnitude.

16
This trained network was then tested, as shown in Fig. 10, using
Fig 9 : Carbon Fouling in Stages 1 to 3 simulated data. The network inputs were again subjected to
15% Error, 1 Network/stage random error and the simulated plant feed conditions were also
perturbed randomly. The network is able to track the parameter
A, g/t (Thousands)
A sufficiently accurately for our purposes even without an
on-line measure of the gold loading y. Thus, the original
^ifV tL objective has been achieved; using only quantities which can
be measured on-line (C, Q and Mc) a neural network has been
V* Oa:* ^ S3
trained which can given a reasonable on-line indication of the
carbon activity. The technique described above should be
applicable to other adsorption processes which operate on
similar principles.
Time, Hrs

Stage Numbe
r Fig 10 : Carbon Fouling in Stages 1 to 3
I.Actual 2,Actua
l 2,Net 15% Error, Rate Approach, 2 Inputs
3,Actua
l 4,Actua
l 4,Net
A, g/t (Thousands)
12
o a - B D - B b - - - - * ^ ; * - * *
A separate neural network was then trained on the data from 10 +
Q a *
a

each stage. Five data sets corresponding to each CIP stage were 8
** /
produced and used to train 5 different neural networks. Each 6
set consisted of 120 points and trained well, with average
errors in A ranging from 100 to 160 g/t. When the nets were
4

applied to the data from a simulated plant, they were able to 2

predict the simulated trend in the carbon activity reasonably


well. Figure 9 illustrates the correspondence between the 0 10 20
Time, Hrs
30 40 5

values of A used in the simulator and those predicted by the


network. When testing the prediction, a random error of 15% Stage Numbe
r

was introduced into the network inputs. In addition, the plant *


1,2,3-Actual
3
, Net
I.Net
4,Actua
l
+
0
2,Net
4,Net
feed conditions were also perturbed in a random fashion, in an
attempt to simulate reasonably realistic plant conditions.
CONCLUSIONS
Figure 9 illustrates a simulated case where fouling of the
carbon in the first three stages takes place. The correlation It has been shown that neural networks have the potential to
between the simulated conditions and the net predictions are play a very useful role in the area of mineral processing
sufficiently close to meet our objectives. The change in the control. For this purpose a network can be regarded as a model
activity in stage 1 affects the network prediction quite of general form which is able to abstract the relevant features
significantly for stage 2, and less significantly for stages 3 and from a training set. This training set can normally be generated
4. by computer simulation, but can also be supplemented or
replaced by plant or experimental data. This avoids the issue of
A rate based training approach. Although the above approach finding a correct form for the model, which is a significant
works reasonably well, it has several flaws. A large number of advantage given the complexity of many mineral processing
parameters are introduced into the system as a network must be operations.
trained for each stage. It has also been assumed that an on-line
measurement of the gold loading on the carbon y, is available. A network was used as part of a novel method for cyclone
This cannot be achieved at present and is unlikely to be control. Given a measure of the pulp density in the cyclone
achieved in the future. Is there a way to modify the approach to feed and overflow, the network is able to predict at what
reduce the number of network parameters which must be pressure the cyclone should be operated at in order to maintain
estimated, as well as to avoid the use of the carbon gold a given product size distribution specification. The predicted
loading y as a network input ? pressure becomes the setpoint of a conventional PID loop in
which pressure is controlled by pump speed. This procedure
Over a short time period, equation (1) can be simplified by was developed and tested by simulation which showed that
assuming negligible accumulation in the liquid phase: very good control results. The major advantage of the
technique is that it reduces the requirement for an on-line
^iCf(t)-C(t)) = R(y,C) particle size measurement. If an on-line measurement of
product size does exist, the technique can be extended in a
number of ways.
=Z
The use of neural networks for the monitoring of carbon
(3) activity in a CIP adsorption plant was also studied.
The term on the LHS of (3) is an approximation to the rate at Conventionally, the only way of monitoring this parameter is to
which gold is loading onto the carbon (e.g. grams of gold per take carbon samples and perform laboratory tests on them, with
ton of carbon per hour). All quantities in this group can be the associated delay. A number of different approaches were
measured on-line. The LHS of (3) is approximately equal to R, attempted for this exercise. In the first experiment, the network
which depends on y, C and equilibrium and kinetic parameters inputs were assumed to represent single plant variables, such as
A and k. This suggests that a network with inputs of Z and the mass of carbon in a tank, the gold tenor in solution and so
solution tenor C, and output A, can be trained. on. Networks were trained on data from a single as well as
multiple adsorption stages. For various reasons, this approach
The data obtained from the simulator was combined into a was not judged successful.
single training set of 600 points. The network had two inputs An approach in which the single plant variables were combined
(the rate quantity Z and the solution tenor C), 3 hidden nodes into physically meaningful groups and used as the network
and one output (the carbon activity parameter A). The network inputs was then developed. A single network could be trained
trained well, achieving an average error in A of 330 g/t. This which could predict the performance of a multiple stage plant
method has a significant advantage over the previous approach, with sufficient accuracy. The most important conclusion that
in that only one network for all adsorption stages is required, can be drawn from the neural network for carbon activity is
the size of the network is smaller and uses only inputs which that care should be taken when structuring the network inputs
can be measured on-line. and outputs. For this case much better results were obtained
when logical groupings of variables were used as network
inputs, rather than simply mapping all the significant variables
with network inputs on a one-to-one basis.

17
Overall, it is believed that neural network technology has a
significant role to play in the control of processing plants.
However, such techniques are not perfect and care must be
taken to structure the network and it's associated inputs and
outputs in a meaningful way. One of the uses of this
technology is to model data which is complex so that
phenomonological models describing the data would be
difficult to derive. However, it is felt that networks should not
be used to replace deterministic models of a process (if these
exist and are well proven) simply as a matter of convenience.
In addition, networks should in no way detract from research
efforts aimed at understanding and quantifying the many
complex phenomena in minerals processing.
An area in which there is significant potential for further
research is the development of techniques or principles for the
on-line training of networks.

REFERENCES
Plitt, L.R. (1976). A Mathematical Model for the Hydrocyclone
Classifier. CIM Bulletin, vol. 69. no. 776. pp. 114-123.
Stange, W.W, Moys, M.H. and Hinde, A. (1991) The Use of
Dynamic Simulation in the Development of Control Systems
for Backfill Plants. To be published in J. S. Afr. Inst. Min.
Metall.
Stange, W., Woollacott, L.C. and King, R.P. (1990)
Towards more effective simulation of CIP-CIL processes, Part
: A population balance based simulation approach. J. S. Afr.
Inst. Min. Metall. December 1990.

18
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

AUTOMATION EXPERT SYSTEM FOR AIR SEPARATION


PLANT
K. Tashiro*, T. Terasaki**, M. Watabane*** and N. Andot

^Nippon Steel Corporation, Kitakyusyu-City 804, Japan


**Fujifacom Corporation, Tokyo 191, Japan
***Fujifacom Corporation, Tokyo 151, Japan
^Fuji Electric Co. Ltd., Tokyo 191, Japan

Abstract. We i n t r o d u c e the e x p e r t system for an a u t o m a t i c c o n t r o l of a i r


s e p a r a t i o n plant in iron works. An a i r s e p a r a t i o n plant is used for s e p a r a t i n g
oxygen, nitrogen, argon and other inert gasas from liquefied a i r by u t i l i z i n g their
boiling point difference. Operators of a i r separation plants must always take care
of keeping the product p u r i t y s t a b l e , and a l s o must immediately i n c r e a s e or
d e c r e a s e product gases on demand. Generally t h i s p l a n t ' s d e t a i l s t r u c t u r e is
unknown, so operators are controlling by t h e i r experience and i n t u i t i o n . Therefore
it was necessary to reduce the operator's mental and physical load.
We transformed o p e r a t o r ' s knowledge ( i . e . experimental control method of s k i l l e d
operators ) into production r u l e s of AI Work S t a t i o n . In oder to simultaneously
control three a i r s p a r a t i o n p l a n t s by one AI Work Station, the system is divided
into four expert systems. We have r e c e n t l y developed the expert system for a i r
separation plants of Yawata Iron works. Due to a p r a c t i c a l use of t h i s system, the
plant operation has been s t a n d a r d i z e d , the manual i n t e r v e n t i o n of o p e r a t o r s is
minimized, and energy-saving operation is r e a l i z e d in the iron and s t e e l making
industry.

Keywords. A r t i f i c i a l intelligence; Automatic control; Computer control;


Knowledge engineering; Real time computer systems; Steel industry;

INTRODUCTION d i f f i c u l t owing to t h e development of t h e


knowledge engineering and computer technology.
The a i r separatin plant is used for separating
oxygen, nitrogen, argon, and other inert gases We have recently developed and commercialized an
by u t i l i z i n g their boiling point ( oxygen -183C automation expert system for the a i r separation
, nitrogen -188C , argon -185C ) differences plant of Yawata Iron works by using the r e a l -
after 1iquefying a i r . t i m e c o n t r o l AI Work S t a t i o n ' F E C S - A I '
(manufactured by Fuji E l e c t r i c Co.)to meet the
The g r e a t e r part of the oxygen demand in iron correspondence to an independent change of the
and s t e e l making i n d u s t r y i s a p p l i e d to the demands of oxygen and n i t r o g e n , the minimized
blowing in a converter. i n t e r v e n t i o n of o p e r a t o r s , and power-saving
The automation system for the a i r s e p a r a t i o n o p e r a t i o n in t h e i r o n and s t e e l making
plant has conventionally been g r e a t l y demanded industry. 'FECS- main object is to construct
for the purpose of m a i n t a i n i n g the product system, for automatic control process in many
p u r i t y s t a b l e , s a t i s f y i n g t h e demand of f i e l d , using h e u r i s t i c knowledge acquired from
i n c r e a s i n g or d e c r e a s i n g p r o d u c t g a s e s to s k i l l e d o p e r a t o r s and p r o b l e m - o r i e n t e d
immediately meet an independent change of the mathematical methods.
demand of subproducts (oxygen, n i t r o g e n ) , and
also reducing the o p e r a t o r ' s burden. This paper will report t h i s automation expert
system for the a i r separation plant. This system
In t h e s e y e a r s , t h e a p p l i c a t i o n e x e c u t i o n has been developed by the j o i n t work of Fuji
examples of the expert system have been reported E l e c t r i c Co., and Nippon S t e e l Corporation,
as a means of approaching the area where the Yawata.
automation has conventionally been regarded as

19
AN OUTLINE OF AIR SEPARATION PLANT temperature. The residual nitrogen gas at the
lower column top is condensed to be liquefied by
The air separation method will be described means of liquid oxygen which is pooled at the
referring to Fig.1. b o t t o m of the upper c o l u m n . This liquid
nitrogen is divided into two direction. One is
Air as a raw material is boosted to a pressure used as a reflux liquid in the lower column,
of about 6 atg. by a compressor after passing while the other is used as a reflux liquid in
an air filter, and then, it is cooled and the upper column.
washed in a water wash cooling tower. After
carbon dioxide gas and moisture have been In the upper column, the final rectification is
removed by the adsorption in a molecular sieve done, so that the liquid air supplied from the
adsorption bath, air is fed into a main heat lower column enriches oxygen by means of the
exchanger to execute the heat exchange with rectifying action as the liquefied air lowers
product gas and exhaust nitrogen. Next, it is through the rectifying trays. It finally becomes
cooled down to about -170C , and supplied to liquid oxygen having the purity of about 99.6%
the lower column of the rectifying column. This to be equivalent to the purity of the product
rectifying column is provided with plural oxygen, and it is collected at the bottom of the
rectifying trays (porous plate trays), and c o l u m n . T h e c o l l e c t e d liquid o x y g e n is
coarse rectification is done in the lower evaporated by the heat of lower column nitrogen
column. gas. A part of this liquid oxygen is drawn out
as a product oxygen gas, and sent to demand
As a result, a liquefied air containing about destinations with pressure after passing the
40% of pure oxygen is pooled at the bottom of main heat exchanger.
the column, while nitrogen gas of 99.999% is
separated to the column top. The liquefied air On the other hand, the evaporated gas ascending
is fed to the upper column of the rectifying through the rectifying trays enriches nitrogen
column. A part of nitrogen is fed to the main as it ascends. A part of this evaporated gas is
heat exchanger, and then, mixed with the drawn out as exhaust nitrogen in the vicinity
product nitrogen gas being generated at the of the upper part, and diffused into the open
upper column top by generating the cold required air after passing the main heat exchanger. The
for the plant by means of an expansion turbine. residual evaporated gas continues lifting. It is
Then, it passes the main heat exchanger again finally drawn out as a product nitrogen gas of
to be the product nitrogen gas at normal 99.999% at the column top, and it joins the

Exhaust Product
Oxygen nitroRen nftrofcen
compressor - +

Argon compressor

Hydrogen

Bag filter

Raw air
compressor

0 O O

Fig. 1. Tobata No.10 air separation plant process flow

20
product nitrogen gas being generated from the Assume that the separation efficiency is 100% in
lower column. The raw argon gas drawn out of the a p l a n t , and we o b t a i n , 0.21M A = Mo .
h i g h e s t e n r i c h e d argon p o r t i o n at an On t h e o t h e r hand, t h e e q u i l i b r i u m of t h e
intermediate portion of the upper column is thermal balance is represented by the following
introduced into the crude argon column, where it equation.
is coarsely rectified up to the purity of about
96% by the same action as in the rectifying MA IA + QL == ( MO + MN ) I ON + QI (3)
column, and then, fed to the argon purification MA (IA - ION ) + Q. = Q. (4)
unit as product argon of 99.999%.
Lo Liquid oxygen quantity
Q. Cold loss quantity
CHARACTERISTICS OF AIR SEPARATION PLANT (including heat exchange loss)
Q. Cold supply quantity
The air separation plant liquefies raw air, and IA Air enthalpy
separates oxygen, nitrogen, argon, and other I ON Enthalpy of separated gas
inert gases by means of the rectifying action
utilizing a boiling d i f f e r e n c e of Lo is kept constant under a thermally balanced
thermodynamic physical properties. Accordingly, state. Assuming that the air separation plant is
it is important for keeping the plants table to an air purifier, a disturbance of the thermal
stabilize the thermal balance and material balance appears as a change of L . .
balance.
Q, = Q, - ( MA ( IA " I ON ) + Q> ) (5)
In the air separation plant, it is possible that Lo = Q, / ( IA - ILO ) (6)
the disturbances of the thermal balance and
material balance cause the disturbances of the I Enthalpy of liquid oxygen
purity of separated gas. The relation of the
thermal balance - material balance - separated Assume t h a t t h e c o l d has become e x c e s s i v e
gas purity will be described, referring to Fig. 2 without changing a i r f l o w MA and s e p a r a t e d gas
and Fig.3. g u a n t i t y Mo , MH , an i n f l u e n c e to a f f e c t the
p u r i t y i s r e p r e s e n t e d by t h e f o l l o w i n g
The following equation holds true about the equation.
material balance-to-product purity relat ion.
0.21MA - Xo ( Mo + Lo ) (7)
MA = Mo + MN (1) Xo = 0.21MA / ( Mo + Lo (8)
0.21MA = XO MO + MN (2) Xo P u r i t y of oxygen

MA - Air flow A disturbance of the thermal balance affects the


Mo MN - Separated gas flow r e c t i f y i n g a c t i o n to change t h e l i q u i d gas
Xo Xn - Purity of oxygen r a t i o (reflux l i q u i d L/evaporated gas G) which
in separated gas i s t h e most i m p o r t a n t f o r t h e s e p a r a t i o n
efficiency. The r e l a t i o n between generally well-
known l i q u i d gas r a t i o and t h e s e p a r a t i o n
efficiency is shown below.

Cold
loss
Thermal introduction of
open air, sunshine, etc
Thermal loss in unit,

f
changeover, etc.
Heat exchanger loss

o
3
/
2: &./ / (x 3 ,y 3 )
/ U(x2,y2)
>, / V
/ / L i ( , y )

X ( 2 -mol
Cold ,{> L Reflux liquid
supplement
Q Evaporated gas
Generation o f coldness by an
expansion tu rbine L/G- Liquid gas ratio

Fig. 2. Air separation plant model Fig. 3. Rectifying action on multistage


rectifying trays

21
Excessive cold -* Liquid gas r a t i o : Large Expert system structure
-* High boiling point substance (oxygen)
Purity decreases. In order to simultaneously control three oxygen
Low boiling point substance (nitrogen) plants by one FECS-AI, the system is divided
Purity increases. into four expert systems in total to share
Undercold - Liquid gas r a t i o : Small individual functional burdens, and this multi
- High boiling point substance (oxygen) expert system is constructed to execute
Purity increases. parallel operation.(See Fig.4)
Low boiling point substance (nitrogen)
Purity decreases. Increase/decreas
e Purit
y adjustmen
t LO leve
l
Plant
Argon Nitroge
n Oxyge
n Argon Nitrqgen n adjustmen
l Oxyge t

A disturbance of thermal balance a f f e c t s both


material balance and s e p a r a t i o n e f f i c i e n c y to -~*Q ^ 1 OT-7 increase
J decreas
e ES
/1
] i 1 7 OT-8
OT-
adjustmen
t operatio
n
cause the s e p a r a t e d gas p u r i t y to change as A E
S -
described above.
fs 1
|
OT-
8 increase
decreas
e ES

1
/ 1
|j

- ^
n.
The air separation plant operation is controlled
for t h e purpose of p u t t i n g a l l c o n t r o l l e d
v a r i a b l e s within allowable ranges to c o l l e c t #Q J OT-JL
O increase/decreas
e adjustmen
t operatio
n ES

oxygen, nitrogen, argon, and other inert gases


1 1 1 1 1 1
from the plant as products with secured purity. Fig. 4. Multi expert system configration

These c o n t r o l l e d v a r i a b l e s a r e r e l a t e d t o
respective manipulated variables complicatedly, Performance and Scale of ES
and they i n t e r f e r e with each o t h e r . A p u r i t y
change occurs a f t e r passing p l u r a l r e c t i f y i n g The contorl of ES is executed with a few minute
trays, and there is a noticeable delay time. cycle. The computer receives the process data
and infers. The controller controlls the air
Accordingly, the plant s t a t u s is f i n a l l y judged plant according to the inference results ( i.e.
from the o p e r a t o r ' s o p e r a t i o n experiences to set values ).
s e c u r e the p r o d u c t gas p u r i t y most
p r e f e r e n t i a l l y , and t h e n , the p r o d u c t gas To keep real-time performance, we have developed
q u a n t i t i e s are manipulated to be suppressed the supervisory program. The program is
within allowable r a n g e s in c o n v e n t i o n a l watching the plant conditions. When the process
operation. Also, s l i g h t o v e r - a c t i o n caused by values exceed their allowable values, the
t h e d e c i s i o n and o p e r a t i o n of i n d i v i d u a l purity adjustment ES is executed in response to
operators occurs unavoidably. the supervisory program's instructions.

On the other hand the increase/decrease ES is


AUTOMATION MEANS usually suspending their action. When operators
must increase or decrease product gases on
Reasons for the application of AI demand, the increase/decrease ES is execute in
response to operator's instructions. Table 1
We decided the application of the expert system shows the performance and scale of ES.
for automating the plant by taking the following
four points into consideration.
TABLE 1 Performance and scale of ES
1. It is technically impossible to represent all
control systems of the plant by numerical Name of Expert system Cont rol frequency Throughput Number of rule
( minite ) ( second )
models.
OT-7.OT-8
adjustment operation 1 2 220
2. The expert system can be judged as effective OT-7
for systematizing experimental control methods increase/decrease 3 2 229

of ski lied operators. OT-8


increase/decrease 3 2 346

3. Since only a few air separation plants have OT-10


Increase/decrease 1 2 183
been automated until now, the control accuracy & adjustment

level is presumable to be improved by the TOTAL 978


addition and modifications of knowledg base in
Note: Increase/decrease ES is executed by operators instruction
the future. For this purpose, the expert system
will be more flexible. Addition, modification,
and other changes are comparatively easy. The
knowledge base is described in Japanese and Features of the functions of expert system
mathematical formula like Fortran.
1. Multi i n c r e a s e - d e c r e a s e o p e r a t i o n ES .An
4. The application of the expert system to the operation change system is necessary to flexibly
automation of air separation plants has not been meet a change of t h e unbalanced demand or
reported yet, and this expert system will be frequent demand of oxygen and n i t r o g e n as a
verifiable as effective in an actual plant. need to a i r separation plants in iron and steel

22
m a k i n g i n d u s t r y , and we h a v e r e c e n t l y SYSTEM CONFIGURATION
constructed a multi increase/decrease operation
expert system which can execute a l l operation Fig.6 shows the hardware configuration of the
change p a t t e r n s . a u t o m a t i c o p e r a t i o n f a c i l i t i e s in an a i r
separation plant.
2. Predictive control type adjustment ES . When
a process change is delayed and the lag time
changes according to manipulated v a r i a b l e s and s t a t lno
OT-7 OT-8 OT-10 AI Work S t a t i o n , r RP Y [ J : !!!""
o t h e r v a r i a b l e s l i n e in the a i r s e p a r a t i o n M
Ml I K tX OCS
: CRT operato r statio n
p l a n t , the c o n v e n t i o n a l PID c o n t r o l system CRT CRT CRT ~
CRT
m syste
DOS: Data bas statio
DPCS-FI: Optical datawa
n
y
I I FECS-
e Ai
cer
causes e x c e s s i v e o p e r a t i o n and r e s u l t a n t l y OCS G500/32F
DBS
causes the maintenance of the process s t a b i l i t y
to be d i f f i c u l t . Accordingly, whether abnormal
process condition values and the recovery after
operation can be predicted in the same way as
in conventional operators or not is important PCS

for maintaining the process s t a b i l i t y .


This expert system judges the process conditions plant plane plant
OT-7 OT- 1 0
as abnormal when t h e s e v a l u e s exceed t h e i r OT- 8
allowable values (PH inrush). It p r e d i c t s the
process conditions according to the arrival time Fig. 6. Air separation plant automation
to t h e a l l o w a b l e v a l u e s (PH) in c a s e of system configuration
abnormal v a l u e (PHH) or r e c o v e r y a f t e r t h e
l i n e a r approximation by the method of l e a s t This system c o n s i s t s of the d e c e n t r a l i z e d
square based on past data. (See Fig.5) d i g i t a l computer (PCS), CRT o p e r a t o r ' s s t a t i o n
Then, t h e s y s t e m d e t e r m i n e s m a n i p u l a t e d (OCS), process data base s t a t i o n (DBS), and AI
v a r i a b l e s or operation methods by s o r t i n g the Work S t a t i o n (FECS-AI), and connected by an
arrival time into ranks. optical dataway (DPCS-F). The present automation
By combining t h i s s t a t u s decision method with system i s d e s i g n e d as an o n l i n e c o m p u t e r -
the s e l e c t i v e control method described below, instrumentation system (MICREX).
t h i s system executes the process s t a t u s control
without any o v e r s h o o t phenomenon due to an FECS-AI is designed as an expert computer system
excessive operation and other f a i l u r e s . where AI t o o l (EIXAX) i s mounted in UFAS
consisting of F u j i ' s 32-bit super-microcomputer
3. S e l f - s e l e c t i o n c o n t r o l type a d j u s t i n g ES. FASMIC-G500/32F. 'EIXAX' is sort for ' e f f e c t i v e
Since the air separation plant is characterized industrial application aimed expert system shell
with strong mutual interference action between for advanced control system'. I t s main object
process s t a t u s , an o r d i n a r y c o n t r o l method, is to c o n s t r u c t system, for automatic control
which i n c r e a s e s or d e c r e a s e s t h e oxygen p r o c e s s in many f i e l d s , u s i n g h e u r i s t i c
generation q u a n t i t y for a d j u s t i n g the oxygen knowledge acquired from s k i l l e d o p e r a t o r s and
purity, affects the purity of other elements to problem-oriented mathematical methods such as
cause a s t a b l e running f a i l u r e depending upon l i n e a r programming and s t a t i s t i c a l a n a l y s i s .
the plant conditions. I t s g r e a t e s t f e a t u r e l i e s in the fact t h a t a
It i s , therefore, necessary for a i r separation system can be constructed simply by assembling
p l a n t s to always keep t r a c k of a l l p r o c e s s s p e c i f i c r u l e s and models of a i r s e p a r a t i o n
conditions and also control them by selecting or p l a n t s , s i n c e t h i s system employs a r u l e &
combining the adjusting operation methods, each model-based AI t o o l (EIXAX) as t h e l e a d i n g
time multiple faults occurred, for example. The component.
present adjustment work ES contains a control
method which is similar to the human concept as The FECS-AI system c o n s i s t s of two CPUs whose
the most noticeable merit of the expert system. functions are provided for online and o f f l i n e

(Fault detection) (Recovery detection)

PHH
PH inrush ..-'''

PH
^-^^' Program stops
running in ! Allowable range
dead band I
: (dead band)
PL
>i
Preparation of a PHH arrival time Prepara tion Predic- \
gradient by the is predicted from of a gr adient tion of ',
linear approxima- a gradient in the same PH ; PLL
tion of past data way as in arrival
by the method of fault detec- time
least square tion

Fig. 5. Process fault detection and recovery detection predictive method

23
uses separately. Accordingly, an offline system By shortening the increase/decrease operation
develops and tests EIXAX rules as its major time and minimizing the raw air operation, the
functions without affecting any online system. system decision and adjustment results will be
This function is absolutely necessary for an AI at least e q u i v a l e n t to those of skilled
system where the system level is presumed to be operators as an energy-saving effect.
sequentially graded up. Since this system is
constructed in a computer system, the offline
TABLE 3 Energy-saving effect by automated operation
system can execute tests using online real-time
data. Physical unit of Reduction of Annual reduction
Name of plant reduced air cost of cost
(Air Nm3/02Nra3)
This system is also provided with a dynamic test (yen/02Nra3) (1,000,000 yen)

environmental function (plant simulator) OT-7 0.18 0.063 6


consisting mainly of a transfer function type
plant model, which is useful for improving the OT-8 0.16 0.056 7

system testing efficiency and reliability. OT-10 0.10 0.013 3

Note: The annual reduction of cost is given, assuming that


the rated operation is continued for one year.
EVALUATION OF AUTOMATION SYSTEM

Table 2 and 3 show the OT-8 automated operation CONCLUSION


results by this system in and after February,
1989. We a p p l i e d t h e AI system t o a u t o m a t e a i r
separation plants operation as an f i r s t attempt
Results of the automated multi increase-decrease in iron and s t e e l making industry.
operation We a l s o planned to i n t r o d u c e an AI system,
because i t i s impossible for a i r s e p a r a t i o n
No manual intervention has been done due to an p l a n t s to accurately keep track of an internal
purity error or other failure, irrespective of s t a t u s change by p r e s e n t s e n s o r s and i t i s
considerable increase/decrease operation d i f f i c u l t t o c o n t r o l a l l a u t o m a t i o n by
results. mathematical models. The automated operation has
This proves that this system fully satisfies the been continued favorably with an energy-saving
automated functions of multi-increase/decrease effect, and i t s introduction effect is more than
operation required for air separation plants in expected.
iron and qtPAl making industrv.
This system will r e a l i z e an o n l i n e r e a l - t i m e
TABLE 2 Increase/decrease operation result automated o p e r a t i o n system over a range from
the demand and supply prediction of oxygen and
Change pattern Monthly average
nitrogen to the a i r separation plant operation.
Manual intervention
operation frequency frequency

Independent increase/ 62 0
This system has shown remarkable progress in
decrease of O2 establishing a final operation system.
Independent increase/ 38 0
decrease of N2 The level of the knowledge base constructed by
Simultaneous increase/ 21 0 p r e s e n t AI t o o l ' has been graded up
decrease of O2 and N^
s e q u e n t i a l l y in the online mode through t r i a l
Simultaneous operation A 0 and commercial r u n s . This system has been
constructed in a short term by using fEIXAX'. As
of O2 increase and
N2 decrease

Simultaneous operation 7 0
a r e s u l t , the probability of knowledge base and
of O2 decrease and t h e f l e x i b i l i t y in a d d i n g and m o d i f y i n g
N2 increase
knowledge base have been verified. On the other
Preferential operation
of Ar
h 0 h a n d o p e r a t o r s w e r e r e l e a s e d from t h e
observation and the operation. Therefore i t was
Automation results of adjusting operation a b l e t o r e d u s e t h e o p e r a t o r ' s mental and
physical load.
Manual intervention has partially been done due
to compound adjustment o p e r a t i o n or the I t w i l l be f u l l y expectable to apply t h i s AI
magnitude of disturbances. However, the accuracy s y s t e m t o p l a n t s which a r e d i f f i c u l t t o
is almost satisfactory in the present stage. By represent by using mathematical models in the
improving the control accuracy level through future.
the a n a l y s i s of result data, the manual
intervention ratio may be reduced to zero in the
same way as in increase/decrease operation. REFERENCES

Energy-saving effect by automation T a s h i r o , k . , N. Ando, and M. Uekusa (1990). An


e f f i c i e n t c o n t r o l e x p e r t system for a i r
Owing to a practical use of this system, the s e p a r a t i o n p l a n t in i r o n w o r k s .
plant operation has been standardized without I n s t r u m e n t a t i o n and Automation, Vol.18,
any dispersion of the action due to individual No.13, pp. 42-48 (in Japanese).
differences of operators and excessive operation J a p a n I r o n and S t e e l association,
of raw air by taking an allowance into account Instrumentation and Control Congress (1990),
for maintaining the purity. Inst. 101-4-1 (in Japanese).

24
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

MODELLING AND CONTROL OF MINERAL PROCESSING


PLANTS USING NEURAL NETWORKS
J. Thibault*, F. Flament** and D. Hodouin**

* Department of Chemical Engineering, Laval University, Sainte-Foy (Quebec), Canada G1K 7P4
**Department of Mines and Metallurgy, Laval University, Sainte-Foy (Quebec) Canada G1K 7P4

Abstract. This paper considers the identification of the dynamics and the inverse dynamics of a simulated
grinding circuit, using feedforward neural networks. Various control strategies based on the plant and plant
inverse neural models, with and without adaptation, are presented. Finally, the results of some control
strategies are evaluated and discussed.

Keywords. Mineral processing; grinding simulation; neural controller; feedforward neural network; plant
neural model; plant inverse neural model.

INTRODUCTION 1987; Hodouin et al., 1988). The simulator is written in APL,


which compromises greater flexibility for slower execution (as
Model based control has now gained sufficient notoriety to be it is an interpreted language). The choice lies in its interactive
accepted and implemented in traditional processes with relati- and modular structure, which both allow a rapid program
vely good success. Some processes offer greater difficulty development and an easy implementation of new features, by
for finding an acceptable plant model because they are highly including new modules.
nonlinear and time-varying. In addition, it is not always ob- The user accesses the various input modules through con-
vious to decide on a structure of a proper model. An in- venient menus and windows. These modules refer to the set-
dustrial grinding circuit, where ore grindability may up of model parameters and definition of the control strategies
significantly vary over time and recycle streams are present, and the various noises generated by sensors and actuators.
is indeed a difficult process to model and control. It is impor- The unit models contained in DYNAFRAG are: rod mill, ball
tant to react as quickly as possible to changes in the process mill, fixed-separation curve classifiers, hydrocyclones, mixers,
in order to alleviate upset in the downstream processes. sump boxes, pumps, pipes and conveyor belts.
A promising alternative for the modelling of complex systems The dynamic model of the process basically consists of a set
seems to be artificial neural networks which have the ability to of mass balance equations for the pulp, the water and the
capture the nonlinear dynamics of the process from the particles of different mineral hardness. This implies 50 to 100
input/output data set. The network architecture that appears simultaneous differential equations to be solved.
to offer more potential for the type of engineering problems
encountered in identification and process control is the Simulated Plant Flowsheet
feedforward neural network. This network normally con-
sists of several layers of processing units (or artificial For this study, the units listed in Table 1 were implemented in
neurons) where neuron connections occur only between the simulator in an arrangement depicted in Fig. 1. In addi-
adjacent layers as schematically shown in Fig. 3 for one of tion to these major units, five actuators (aj) and eight sensors
the network used in this investigation. The squares simply (rrii) are part of the simulated circuit as they introduce dyna-
indicate that each input is distributed to the neurons of the mic components to the system response. These items are
next layer. The neurons, represented with circles, perform a also listed in Table 1 and illustrated in Fig. 1. Finally, five
simple task which consists of performing a weighted sum of internal variables (pj can be manipulated within the simulator,
all the inputs, and then applying a transfer function to this in order to easily introduce realistic perturbations to the pro-
sum. In essence, a feedforward neural network can simply cess (see Table 1 and Fig. 1).
be viewed as a nonlinear model and the parameters of the Actuators (uj and sensors (rrij) are simulated using additive
model are the weight connections between neurons of adjac- noises (^ or n) to the desired actions (manipulated variables)
ent layers. For a detailed description of feedforward neural or to the measured values. Various types of noise are pro-
networks, the reader is referred to Lippman (1987), vided within DYNAFRAG and more specific models or time
Wasserman (1989), and Widrow and Lehr (1990). series can be programmed by a user with some experience in
APL language. The available noise patterns are: white noise,
An experimental program has been set up to investigate the
step, ramp, AR, MA, ARMA models, PRBS and PRTS series.
control of a simulated industrial grinding circuit using a num-
ber of control strategies based on feedforward neural net-
works. This paper briefly describes the main control strate-
gies that were evaluated, and the results obtained with some
of them are presented and discussed.

THE GRINDING DYNAMIC SIMULATOR

General Description
The dynamic simulator used is the PC version of the general
dynamic grinding circuit simulator DYNAFRAG (Dube et al., Fig. 1. DYNAFRAG simulator plant flowsheet.

25
Table 1: Main features of DYNAFRAG simulated circuit. pure delays (15 s for cyclone feed pipe). Their values can be
modified interactively, whereas the pump characteristics must
still be programmed.
unit models
Belt conveyor to rod mill Table 2. Definition of fresh ore composition
Rod mill + water addition point
Sump box + water addition point
Hardness Size distr'n
Variable speed pump
Cluster of hydrocyclones Hard Coarse
Ball mill + water addition point Component 1: Normal Normal
Pipes Soft Fine

actuators Hard Coarse


Component 2: Normal Normal
u1 Fresh ore feed rate controller
Soft Fine
u2 Water addition valve to the rod mill
u3 Pump speed controller
Hydrocyclone model. A steady-state model is perfectly jus-
u4 Water addition valve to the sump box
tified for the hydrocyclones as their response time is much
u5 Water addition valve to the ball mill shorter than that of the other main units (ball and rod mills).
sensors The model used by DYNAFRAG is Plitt's model, which implies
ml Ore flowrate to the rod mill the calculation of the cyclone pressure drop, the
underflow/overflow pulp distribution, the classification cut-size
m2 Pulp level in the sump box
(d50c)' t n e s o | i d s separation efficiency and the water short
m3 Volumetric pulp flowrate to the cyclones circuiting. A different model for the fine particle short-circuit-
m4 Volumetric percent solid in cyclone feed ing to the underflow is also implemented in the simulator
m5 Volumetric pulp flowrate of cyclone overflow (Finch et al., 1985; Hodouin et al., 1986). All calculations are
m6 Volumetric percent solid in cyclone overflow based on the cyclone dimensions and operating conditions.
m7 Percent passing 200 Mesh (75) in cyclone over- Five adjustable coefficients permit the calibration of the model
flow to the available grinding circuit data. The cyclone dimensions
and the model parameters can be modified interactively.
m8 (not assigned yet)
perturbations Ball mill model. This model is the most important feature of
p1 Composition of the ore fed to the circuit the simulator, as it determines (through both the rod mill and
p2 Size distribution of the ore fed to the circuit the ball mill) most of the dynamic behaviour of the circuit.
The dynamic model of the ball mill is based on the description
p3 Specific gravity of the ore fed to the circuit
of three particular phenomena (Dube et al., 1987): 1) the mill
p4 Sump/pump behaviour content mixing, 2) the mill pulp volume variation and 3) the
p5 Hydrocyclone behaviour grinding kinetics. The mixing properties are described by a
series of two perfect mixers in the case of the ball mill and
four in the case of the rod mill. The pulp volume variation
The Unit Models within the mill is modeled by a series of interactive tanks
(Hodouin et al., 1988) and the grinding kinetics by the break-
Ore description. The ore is assumed to be composed of two age and selection functions model.
components, each having its own characteristics in what con-
cerns flowrate, size distribution, hardness and specific gravity. The Control Strategy
Such a feed ore model permits the study of the influence of The control strategy used in this study consists of one local PI
changes in hardness, size distribution and specific gravity loop to control the sump box level by the pump RPM and of
(perturbations) on the quality of the final product and on the one neural network to control the cyclone overflow fineness
process behaviour. The ore hardness is implemented in the by the water addition to the sump box. In this study, the
grinding model through the selection function values whereas circulating load level is not controlled. Obviously, there is an
the circuit feed size distribution is directly used in the calculat- interaction between the controllers. A water addition
ions. Ore specific gravities are not at present directly used in increase, applied to improve the product fineness, will result
DYNAFRAG, but they will be implemented later in the cyclone in an increase of the pulp level in the sump box. The PI loop
model. Both ore hardness and feed size distributions are will respond by increasing the pump speed to avoid overflow-
categorized in three levels: hard, medium and soft (for ing. It is interesting to notice on Fig. 2 that the pair of vari-
hardness), and coarse, medium and fine (for size distribution), ables (water addition, product fineness) demonstrates a
as shown in the computer menu reproduced in the Table 2. strong non minimal phase behaviour. We have observe that
such a behaviour cannot be controlled efficiently with a PI
Rod mill feed conveyor model. The conveyor is modelled controller only.
using a pure delay of 35 seconds. At present, the simulator
does not permit to modify this value interactively.

73 =^~~^ " ' " ,

f V__
Rod mill model. The rod mill is modelled using two identical
ball mills in series. A kinetic model is used for each ball mill.
The model parameters used, were obtained by retrofitting
71 i
experimental size distribution data provided by a company. 69 m7
At present, these values cannot be entered or modified
interactively. They must be introduced in the source code. 67 ~
0 1 2
Pump sump box and pipe models. The pump model corres-
ponds to the pump characteristic curves. The sump box is Time (hours)
represented by a cubical open perfect mixer, with a maximum
capacity of 10 m 3 (2 m x 5 m 2 ). The connecting pipes Fig. 2. % passing 200 mesh following a step change of
(suction and discharge ends) are modelled using variable the water addition to the sump box.

26
NEURAL BASED CONTROL STRATEGIES

This section briefly describes the various control strategies


that were evaluated in this investigation. However, only a few
will be illustrated in the results and discussion section. All
neural based control strategies rely strongly on the iden-
tification of an accurate neural transfer function. Therefore,
before describing each of the control strategies, a brief ac-
count on the identification methodology is presented. m7 ( t +)1

Identification Methodology
This is undoubtedly the most important aspect since the
design of effective controllers depends greatly on how well
the dynamics of the controlled process is known. The identifi-
cation of neural models is akin to the identification of linear
systems using ARMA models. In fact, traditional convolution
and ARMA dynamic models are equivalent to a two-layered
linear feedforward network model (Bhat and Mc Avoy, 1990).
For the identification of the neural plant dynamics and its
inverse, it is respectively assumed that the output and the ( b ei s)
manipulated variable can be reconstructed from a finite num-
ber of past values of the input and output. A typical architec- Fig. 3. Feedforward Neural Network used as a DNC
ture of the neural transfer function of the grinding circuit is
presented in Fig. 3. The controlled output of the plant, at the Direct Neural Controller
next sampling instant, is predicted from the current and two Once a neural transfer function of the plant (Fig. 3) has been
past values of the output (m7 - the percent passing 200 mesh identified, it can be used as a one-step ahead predictive con-
in cyclone overflow), three values of the manipulated vari- troller where at each sampling instant, a control move is
ables (u4 - water addition to the sump box), sufficiently determined in order that the estimated output of the plant
delayed and three values of an internal variable (the circulat- equals to the desired setpoint (Ydstie, 1990). The direct
ing load). The pure delay between the manipulated and con- model can also be used recursively in model based predictive
trolled variables was found with experimentation and cross- control strategies where an optimization routine is used to
correlation function to be equal to one sampling period. determine the optimal sequence of future control moves to
When the structure of the neural network is selected, the minimize an objective cost function, subject to some inequality
identification process boils down in finding the values of the constraints. However, only the first control move of the se-
connection weights, and Wjk, of the feedforward neural quence is implemented. A new optimal sequence is then
network that minimize the sum of squares of the errors bet- computed at the next sampling instant and so on. This is the
ween the predicted and target output, using an optimizing basic configuration of a direct neural controller, denoted
procedure. In this investigation, backpropagation and quasi- DNC
Newton were used. A good neural transfer function contains the main features of
the dynamics of the plant since it has learned from an
Table 3 - Input/output information of neural models unsteady state data set. However, there is no guarantee that
there will be no offset under steady state conditions. To al-
Input DNC INC IDNC leviate this problem, a pure integrator can be added in parallel
Neurons to the DNC as presented in Fig. 4. The lower portion of the
diagram is the direct neural controller and the upper part is
1 m7(t) m7(t+1) Am7(t+1) the parallel integration loop. The control move is then the
2 m7(t-1) m7(t) Am7(t) additive contribution of the manipulated variable calculated
3 m7(t-2) m7(t-1) Am7(t-1) with the neural transfer function and the output of an
4 u4(t) m7(t-2) Au4(t-1) integrator as described by the following two equations:
5 u4(t-1) u4(t-1) Au4(t-2)
6 u4(t-2) u4(t-2) 1 (Bias)
7 CL(t) u4(t-3) -
8 CL(M) CL(t) - < W 0 " W " 1 ) + K[m;{t) - m7(t) (2)
9 CL(t-2) CL(t-1) -
10
11
1 (Bias) CL(t-2) -
I - 1 (Bias)
I ^>
Output m7(t+1) u4(t) Au4(t)
Neuron

Neurons
in hidden 6 6 6 <,
I layer l -O

Number of 56 61 36 OPTI Ml ZI NG
Parameters -<t> PLANT
ROUT
I NE

For the model of the inverse plant dynamics and the inverse FUTURE m
.
differential plant dynamics, the list of input and output varia- 4, no
t
bles, used with the network, is given in Table 3. All these
NEURAL
variables were scaled into the range of 0 to 1 or -1 to 1. The
number of neurons in the hidden layer was set to six. Table 3 MODEL
also provides the number of parameters to identify. The num-
ber of parameters is significantly larger than the number of Fig. 4. Diagram of a DNC-I strategy.
parameters normally found in ARMA models.

27
This controller is denoted as DNC-I (Direct Neural Controller 1989; Saerens and Soquet, 1989). The problem is in the
with Integration). Similar results would be obtained if only the evaluation of the error signal on the manipulated variable that
weight associated with the bias of the output neuron was corresponds to the error signal available at the output of the
modified by backpropagation using the measured offset. Both plant. Many schemes, to evaluate the error signal on the
correction schemes are excellent when it is believed that the manipulated variables, have appeared in the literature
dynamics of the system has not significantly changed from (Thibault and Granjean, 1991). The approach that was
the dynamics that prevailed at the time of identification. retained in this investigation relies on the evaluation of the
Another alternative is to use the prediction error of the con- Jacobian obtained analytically by differentiating the equations
troller (m*7 - m7) to directly adapt the neural transfer function of the feedforward neural network. In order to adapt the
of the plant. This has the advantage that both the dynamics weights of the inverse neural model, using the back-
and the offset are considered simultaneously in the adaptation propagation algorithm after each sampling, the estimated
of the model. However, if the process is nearly under steady error on the control move (u*4 - u4) at the previous sampling
state conditions, it is possible that the neural model will instant has been determined from the error on the output
unlearn the dynamics of the plant trying to correct for the (y*7 - y7) determined at the current sampling instant with the
offset. Care must be exercised to make sure that the learning following relation, using the nomenclature of both Fig. 3 and
data are sufficiently informative about the dynamics of the Table 3:
system. This control scheme is denoted as DNC-A (Direct
Neural Controller with Adaptation). [u^-uAit_,)--[m;it-m7}^ (3)
The online neural model adaptation can be performed in two
different manners. In the first adaptation scheme, at each
sampling instant, the most current prediction error is used to The partial derivative can be calculated from the inverse neu-
correct the connection weights of the neural model with a ral model which has learned to give explicitly the control move
single backpropagation iteration. In the second adaptation ut. If the position of the target output value m7t+1 in the input
scheme, data contained in a fixed length moving window are layer of the neural network is i=n, an estimate of the plant
used to adapt the network by performing a number of itera- inverse Jacobian can be obtained by the following equation:
tions with one of the learning algorithm. The weighting of the
data within the moving window can be uniform, triangular or
J 1 J
(, S,

exponential.
^,,, ^'' Wiy */ w \
An identical control strategy can be used with a neural model
LH J LH
l'=1 ; -J

that has learned with a data set containing changes of each where su and sm are the scaling factors applied respectively
of the input/output variables, between two consecutive sam- to u4 and m7 before being presented to the neural network.
pling instants that is a first order differentiation, instead of the Indices i and j refer to the position of the neurons in the input
values of the variables. Therefore, rather to evaluate the and hidden layers which contain respectively / and J
control move to give the target output, the optimizing routine neurons. X and H are respectively the outputs of the input
calculates the change in the control move that will return the layer (scaled variables of Table 3) and of the hidden layer, f
output of the process to its desired setpoint, provided no is the derivative of the nonlinear transfer function. It is
constraints are being violated. In other words, an incremental important to note that the correction of the network, performed
control move is calculated to predict a network output that is at time t, considers the error on the manipulated variable at
equal to the negative of the current error signal. This control time t-1 because it is necessary to wait the number of
scheme, denoted as DDNC (Direct Differential Neural sampling periods corresponding to the pure delay between
Controller), has the disadvantage to learn the average dyna- the input and output of the process, in this case equal to one
mics over the full operating range of the process since the for a sampling period of 2 minutes.
knowledge of the dynamics of the process, acquired by the Similar to a DDNC, an Inverse Differential Neural Controller
network, is independent on the level of operation. This may (IDNC) can also be used where input/output variables, fol-
result in a loss of performance during the transient behaviour lowing a first order differentiation, are used for teaching the
of the process but may show an improvement under steady network. The main advantage of this strategy id the natural
state conditions by reducing possible offsets. However, the incorporation of the constraint on the control action. The list
addition of the level of operation as an input to the network of variables used with this network are given in the last
would improve the dynamic performance. This control stra- column of Table 3.
tegy could also be used in conjunction with adaptation. Another interesting control strategy, is to use both the direct
and the inverse models of the plant dynamics in an internal
Inverse Neural Controller (INC) model control {IMC) structure (Bhat and Mc Avoy, 1990).
An important feature of neural networks is the possibility to
generate the inverse plant dynamic model from input/output
ILLUSTRATIONS AND DISCUSSION
data sets, and to calculate explicitly the manipulated variables
in order to bring the process output back to its setpoint value.
There are no iterations involved. For the grinding circuit, the Training Data Set
inverse plant dynamics was modelled with a feedforward The training data set was generated using DYNAFRAG. After
neural network with the input and output variables listed in a steady state has been computed, the water addition to the
Table 3. The INC is a feedforward controller that anticipates sump box (u4) was manipulated at a sampling rate of 2 min-
the control action that will produce the desired output of the utes according to a white noise. It was centred for a first
plant. experiment around 88 m3/h and for a second experiment
The INC can also lead to an offset under steady state con- around 98 m3/h. In both experiments, the noise standard
ditions and there is no way for the network to know if the deviation was 5 m3/h. The various sensors signals were
correct control move was determined. A pure integrator, recorded in data files during 10 hours. Then, cross correla-
similar to the DNC-I can be added to correct the offset. This tion studies have demonstrated high positive correlations at
controller is denoted as INC-I. The inverse plant model can lag 1 between u4 and m7 (the final product fineness) and
also be adapted based on the difference between the actual between u4 and the circulating load CL (computed from the
and the desired outputs of the plant (INC-A). However, with values of sensors m3, m4, m5, m6 and the ore specific
the inverse model, the plant is located between the neural gravity). These results were used to test different patterns of
model and the error signal of the plant output so that the neural networks for their applicability to control the final prod-
learning rules, such as backpropagation, cannot be used uct fineness.
directly to adapt the inverse neural model (Chen and Pao,

28
Direct Neural Controller Inverse Neural Controller
Here, the objective of the neural network is to mimic the The neural network purpose is here to inverse the process
grinding circuit behaviour and to predict the future product dynamics and to predict which action should be taken to
fineness. Several input layer patterns were tested and the obtain the desired process output. After having tested several
best compromise between a good prediction and a small network patterns, a good compromise was obtained with the
number of input neurons was obtained using the past 3 network described in Table 3. The past 3 values of the pro-
values of m7, the past 3 values of u4 and the past 3 values of cess output (m7), the past 3 values of the action (u4) and the
CL (see Table 3 and Fig. 3). Six intermediate neurons were past 3 values of the circulating load (CL) are again used. But
found to be sufficient. in addition the first neuron is fed, during training, with the
The network was then implemented in DYNAFRAG as a cont- value of m7 resulting from action u4, the value being learned
roller. On Fig. 5 is shown the response of the controller to a at the network output layer. During prediction, the same
change of set-point from 67.3% to 68% passing 200 mesh in neuron is fed with the desired set-point value.
the final product. As can be seen, the controller response is The network was implemented in DYNAFRAG as a controller.
originally fast, but the new steady state obtained is not at the On Fig. 8 is shown the response of the controller to a set-
requested set-point of 68%. point change of 67.3% to 68% passing 200 mesh in the final
product. As can be seen, the controller response is good but
again a slight bias exists at steady state.

1 2
Time (hours)
1 2 3
Fig. 5. Variations of the controlled (m7) and manipulated (u4) Time (hours)
variables for a setpoint change under DNC strategy.
Fig. 8. Variations of the controlled (m7) and manipulated (u4)
To remedy to the problem, an integrator was introduced. A variables for a setpoint change under INC strategy.
calibration of the integrator constant K (see Eq. 2) was then
performed. On Fig. 6, the sums, over 4 hours of processing, An integrator was implemented to remedy to the bias pro-
of the squared deviations to the set-point are plotted for vari- blem. For calibration, the same procedure as before was fol-
ous values of the integrator constant. A value of 0.5 was lowed and the results are presented in Fig. 9. The value of
retained as a good one. The corresponding controller 0.5 was retained and the corresponding process output is
response can be found in Fig. 7. By comparison to the given on Fig. 10. By comparing Fig. 10 to Fig. 8, one can
results of Fig. 5, the improvement is drastic on a long term see that the improvement is mainly obtained during the first
basis. two hours after the change of set-point.

. * : - _ _ ' ___I__L__A__:_^_____J____i_'_IJ

! DNC-I .
6
|
\im-i j
.2 6

! 1 !;
> .
j
!
8 2
a 2 ; !
;
...|* 1: *
0
a
:=>
f| i

0
0 0.5 I 1-5 2 25 rhvrri""
0 1 v;'i"V","r1":v;'.'t"
0.5 rviv; 1.5 2 2.5
Integrator constant value Integrator constant value
Fig. 6. Sum of squared deviations to setpoint as a function
of the integrator constant under a DNC-I strategy. Fig. 9. Sum of squared deviations to setpoint as a function
of the integrator constant under an INC-I strategy.

Inverse Neural Controller with Weight Adaptation


The following application is derived from the previous one.
When a neural controller is used without any integration, a
bias may be observed on the set-point. In the present case,
at least two reasons may explain this phenomenon. First, the
new steady state does not correspond to a state learned by
the network at training time. Second, due to the strong non
Time (hours) minimal phase behaviour and to the procedure followed to
train the network, only the high initial gain was in fact learned.
When the new steady state is almost reached, the network is
Fig. 7. Variations of the controlled (m7) and manipulated (u4) just not capable of providing the correct action which corre-
variables for a setpoint change under a DNC-I stra- sponds to a much smaller gain. There are two dynamical
tegy (K=0.5). behaviours, onlv one of which was learned.

29
CONCLUSION

All model based control strategies require the availability of a


good dynamic model. If the process is time-varying, it is
necessary to adapt online the model for proper performance
of the controller. Neural networks offer both ease of adap-
tability and plasticity in their structure to be used successfully
1 2 for modelling the dynamic behaviour of complex processes.
Time [hours) With neural networks, the model is able to self-organize to
match the underlying dynamics of the process. This paper
presents a series of control strategies, based on feedforward
Fig. 10. Variations of the controlled (m7) and manipulated (u4) neural models, for the dynamics and the inverse dynamics of
variables for a setpoint change under an INC-I a simulated industrial grinding circuit. Some of them are
strategy (K=0.5). illustrated and discussed.

It is shown that a neural network dynamic model of the plant


To overcome such a difficulty, as well as any difficulty and the inverse plant can serve as an efficient controller pro-
resulting from a change of state of the process, a possible vided that it is coupled with some offset correction scheme
solution consists in adapting on line the network weights. In such as a pure integrator in parallel to the neural controller or
Fig. 11 is shown the response of the controller to a change of a neural network adaptation routine. It is believed that neural
set-point from 67.3% to 68% passing 200 mesh. The network network will find wide use in process control and mineral
used here was the same as in the previous section except processing is certainly one area that can greatly benefit from
that weight adaptation was performed at each control step. this new technology.
The adaptation coefficients were 0.1 for the hidden layer
weights and 0.01 for the output layer weights. We have
observed that high values have to be avoided, especially at REFERENCES
the output layer, to prevent oscillatory responses. In the pre-
sent case, it can be seen that the response is very fast (less Bhat, N. and T.J. Mc Avoy (1990). Use of Neural Nets For
than 30 minutes) and stable, and that the set-point bias is Dynamic Modeling and Control of Chemical Process
corrected. Systems. Computers Chem. Eng., 14, 4/5, 573-582.
Chen, V.C. and Y.-H. Pao (1989). Learning Control With Neu-
ral Networks. Proc. 1989 IEEE Int. Conf. Robot. Autom.,

t
u4 m7 Vol.3, 1448-1453.
Dube, Y., R. Lanthier and D. Hodouin (1987). Computer
100 68.1 Aided Dynamic Analysis and Control Design for Grinding
Circuits. CIM Bulletin. 80, 905, 65-70.
m7 i
94 67.7 Finch J.A., A.R. Laplante and R. del Villar (1985). Modelling
^u4 cyclone performance curves with a size dependent cor-
88 67,3 rection factor. SME-AIME Annual Meeting, New York,
February 1985.
0 1 2 3 4 Hodouin D., S. Caron and J J . Grand (1986). Modelling and
Time (hours) Simulation of a Hydrocyclone Desliming Unit. First World
Congress on Particle Technology, Nrnberg, Fed. Rep. of
Germany, April 16-18.
Fig. 11. Variations of the controlled (m7) and manipulated (u4) Lippmann, R.P. (1987). An Introduction to Computing With
variables for a setpoint change under INC-A strategy. Neural Nets. IEEE ASSP Mag., April, 4-22.
Saerens, M. and A. Soquet (1989). A Neural Controller Based
On Back-Propagation Algorithm. Technical Report,
IRIDIA-NN 005, Universite Libre de Bruxelles, 23 June.
Thibault, J. and B.P.A. Grandjean (1991). Neural Networks in
Process - A Survey. IFAC Symp. ADCHEM, Toulouse,
France, October 14-16.
Wassermann, P.D. (1989). Neural Computing: Theory and
Practice. Van Nostrand Reinhold.
Widrow, B. and M.A. Lehr (1990). 30 Years of Adaptive Neu-
ral Networks: Perceptron, Madaline, and Backpropaga-
tion. Proc. of the IEEE, 78, 9, 1415-1442.
Ydstie, B.E. (1990). Forecasting and Control Using Adaptive
Connectionist Networks. Computers Chem. Eng., j 4 , 4/5,
583-599.

30
Copyright IFAC Expert Systems in Mineral and IRON- AND STEELMAKING
Metal Processing, Espoo, Finland, 1991

CONTROL OF ELECTRIC ENERGY CONSUMPTION IN


STEEL INDUSTRY USING KNOWLEDGE BASED
TECHNIQUES
P. Koponen*, R. Viherma**, T. Rm*** and P. Uronenf

^Technical Research Centre of Finland, Laboratory of Electrical and Automation Engineering, Espoo, Finland
**Outokumpu Oy, Tornio, Finland
***Mikkeli Technology Centre, Mikkeli, Finland
tHelsinki University of Technology, Department of Chemical Engineering, Espoo, Finland

Abstract. Highly varying electric power consumption is characteristic to steel production and it is
expensive to meet these peak loads. It seems possible to predict the short term power demand and to
use these predictions to control the plants so that expensive simultaneous peak loads can be avoided.
In order to test this idea we built a prototype system.

There are several reasons for applying knowledge engineering to this problem. Object oriented
programming is powerful in the development and maintenance of the system and its prediction and
structure models. The models of the steel making processes as well as the decision making knowledge
are often heuristic. Also prototypes are needed in order to get the design feedback from the end users.

In this paper the problem and the prototype are described. The prototype mostly consists of Lisp-
programming structured with frames. We have used rules only in one knowledge base. Good user
interface development tools turned out to be essential for the prototype building.

The purpose of the prototype is to provide a basis for the definition of the real system. The prototype
demonstrates a proposed solution but leaves some questions unanswered. Nevertheless the prototype
shows how power can be predicted and that these predictions are important in controlling the peak
loads.

Keywords, steel industry; metals production; power management; prediction ;knowledge engineering.

INTRODUCTION the criterium of the choice.

The electric power taken by arc furnaces and rolling mills The optimal control of the electric energy consumption of
highly varies with time. These variations in power demand a steel plant has strong interactions with the control of the
are due to the batch like operation cycle of these proces- steel production. The scheduling of the production itself is
ses. Simultaneous power consumption peaks are very the primary goal and the control of peak electricity loads
expensive to meet when there is a shortage of power is only one additional feature of it. This is the case
production capacity as the case often is in Finland during especially in highly integrated steel plants that produce a
a severe winter. The current practice is to impose tem- wide spectrum of steel grades.
porary power restrictions on the plants at a short notice in
a previously agreed order. Although this practice by large The short term forecasting of power is discussed in the
gives remarkable savings in power expenses it cannot take literature (David 1988; Gross and Galiana, 1987; Rahman
into account the production situation of the plants and and Baba, 1989) but in none of these are the predictions
often causes unnecessary or badly scheduled production based on the production plans of steelmaking. Nor is the
restrictions. peak load control considered in any paper about steel
plant scheduling systems, for example (Numao and Mori-
In principle, the coincidences of the peak loads can be shita, 1988; Sztrimbely, Weymouth and Ponzo 1989;
prevented or met cost effectively, if the loads are predicted Takahashi and others, 1989).
a few hours ahead. To put this principle into practice is far
from easy, however. The predictions are uncertain and The potential for savings in electricity costs increases, if
frequently changed by the operating decisions and distur- more plants are added into the system. These can be other
bances of the controlled plant. Suggested power restric- base metal plants or even power plants. The predicted
tions must then be computed based on these predictions. total load helps them adjust their operation according to
Because of the time dimension the actual number of pos- the probable marginal cost of the electricity. The predic-
sible power restriction alternatives becomes too large for tions also make it possible for them to choose the best
exhaustive search. Moreover, the relative cost of different method and time to change their own power level.
predicted alternatives must be approximated, because it is

31
Knowledge based techniques have some advantages in this figure 2. The power prediction can be based on the
kind of a problem where heuristic reasoning is used be- planned production and scheduled down times. If the
cause of incomplete knowledge of the process-, cost- and power restrictions are known in advance, it is possible to
decision-models. Object oriented modelling helps keep the avoid unnecessary preheating with its associated costs and
simulation and prediction programs modular and trans- quality effects. The power demand of cold rolling and so
parent by giving a chance to imitate the structure of the the potential for restrictions is usually smaller.
modelled reality in the program structure. The Lisp-based
tools give flexibility that is very important in prototyping As the value of the product increases and energy intensity
but on the other hand the programs must be rewritten decreases downstream the material flow, figure 3, the
using more conventional programming techniques in order average costs caused by power restrictions tend to rise.
to meet the requirements of the steel plant environment. However, the actual cost of a particular restriction is often
very far from the average because it follows the production
We built a prototype peak load control system with a Lisp- situation. Meeting an urgent order or avoiding problems in
based expert system development environment. We use later production phases can be much more important than
this prototype in the analysis and definition of a planned the market value of some intermediate product.
real system. With the prototype we collect and test the
ideas and views of different specialists and users of the Occasionally all the possibilities to restrict the power taken
planned system. by a steel plant can be expensive. That is why it is
tempting to include other processes like electrolysis and
power plants into the system and use the peak load predic-
ORGANIZATIONAL ASPECTS tions in order to optimize their operation and power level.

The system will be used for planning the short term elec-
tricity consumption of many organizationally and geo- MODELLING AND PREDICTION
graphically distributed plants. Some of these can even
belong to separate companies. This requires that the rules It has been demonstrated with a small experiment even
on how to share the restrictions and benefits are agreed before the prototype that object oriented programming
upon. Of course, when there are remarkable benefits to makes the modelling and simulation of this problem easy
share it will not be impossible to reach an agreement. to understand, develop and maintain. Autonomous objects
However, the system should be able to give a firm foun- communicate with each other by sending messages. That
dation to the estimation of these benefits. makes it possible to make changes in one object without
changing others. Classification and generalization by in-
This also helps users understand when the system is worth heritance save code. A small example of a class hierarchy
much attention and when its suggestions can be ignored is in figure 4. Application specific definitions of relations
without significant losses. Every plant wants to know the between objects make the program easier to understand.
predictions of power restrictions but it is usually con- For example the grouping of the units is easily changed if
sidered too difficult to manually input the system with the it is defined using an appropriate inherited parts relation.
short term information these predictions need. This is a A simplified example of the use of a parts relation is in
challenge for system development and integration. A figure 5. In it the power of a unit is defined to be the sum
reasonable balance between these conflicting requirements of the powers of its parts. Similarly the material flows
should also be found and accepted. between process units and plants can be effectively
modelled. These principles have then been used in the
prototype but maybe not enough.
PROCESSES
The methods chosen for modelling the individual plants
The arc furnaces need very much electric energy. Usually must be chosen according to the behavior of that parti-
they operate in repeating batch cycles which is also seen in cular plant. For example statistical methods alone do not
their power consumption, figure 1. The shape of the curve help much when predicting the power consumption of arc
depends on the size of the batch, the raw materials and furnaces or rolling where the shape of the curves is nor-
the quality produced. The addition times of scrap or other mally quite deterministic and the difficulty lies in predic-
raw materials can also vary. Wait and maintenance times ting the start and break times.
change the intervals between batches. Because of these
variations the prediction methods based only on past The power taken by arc furnaces and rolling mills can be
power measurements fail. It is also important to check predicted, because they repeat their characteristic power
whether the planned power restrictions would have ad- trend curves in their batch like operation cycle. Their
verse effects on the steel quality (through chemical com- power consumption can also be restricted by postponing
position and temperature) in the particular process. the start of the operation cycle or limiting the power taken
by arc furnaces. The power of these units is predicted by
The power consumption of converters and refinement modifying the typical power consumption curve of a batch
processes is much smaller than that of the arc furnaces. with heuristic rules derived from operation experience.
The regularity of their operation is also disturbed by the Short term operation plans of the units are used as input
reasons described above. The material flow to these ope- for these predictions.
rations and further to continuous casting can be disturbed
by power restrictions imposed on arc furnaces upstream. It The electric power taken by continuous casting is very
may also happen that problems in these units force the small compared to the arc furnaces. However, the power
steel furnace upstream to wait. So their indirect effects on restrictions to arc furnaces can prevent the melt from
the control of power consumption cannot be ignored. coming in time to the casting and this has to be taken into
account when deciding about the target of a power restric-
Hot rolling is also a potential target for power restrictions. tion. The modelling of the material flow from arc furnaces
Typical power consumption of a rolling mill is shown in trough converters and refinement processes to continuous

32
casting could help avoid this problem, figure 3. The the operation of the processes. However, the cost of these
material flow has not yet been modelled in the prototype. disturbances should be smaller than the cost of the respec-
tive peak loads. The problem is that the cost of these
Base loads usually consist of many different small loads restrictions strongly depends on their timing and on the
that are difficult to control as a whole. We plan to use operation of the plants. So the peak load control is an
statistical methods to forecast their hourly total power. additional constraint in the scheduling of the process. User
interfaces and maintenance are other reasons for close
The optimization and decision making in the prototype integration of the peak load control with the production
needs also approximate predictions of the benefits and control.
costs of different decision alternatives. These are often
much more difficult to get than the power predictions. The system will also need a significant amount of time
This is because they depend dynamically on the production variable input data in order to be able to predict the
situation and the cost or benefit of a decision that has power consumption. If the data is in some other system, it
additional delayed effects also depends on later decisions. should be taken from there rather than asking someone to
For example consider delaying a load now so that it will feed that data into the expert system. For example produc-
increase the total power in some future hour. In order to tion plans and equipment maintenance schedules signifi-
avoid excessive complexity we plan to neglect the effect of cantly contribute to the accuracy of forecasts and can
other decisions when we calculate a delayed cost or prevent unnecessary power consumption restrictions.
benefit of a decision. In the future these connections of
costs need to be modelled, implemented and tested in the In the prototype a suggestion for the interfaces between
prototype before it is feasible to put the optimization the peak load control system and the process control
principle into practice. systems is modelled. That is why we have built models of
the operation of the controlled plants from the point of
view of power consumption. We plan to develop these
DECISION MAKING IN THE models further.
PROTOTYPE

Different predictions are transferred from the power pro- User interfaces
duction and process plants to the centralized parts of the
system. These consist of the planned power level and the One major challenge in developing this system is that it
possibilities to change the power including the price es- will have many different user groups who will need dif-
timate of these changes. All these power consumption ferent access to it. Transparency, understandability and
needs are then combined as are also the power acquisition good explanations are necessary in order to convince the
possibilities. Then all those power consumption needs are process operator or supervisor to give the necessary input
recommended that give more benefit than the cost of its data and to follow the advice given. The prototype user
acquisition. The recommendations and the combined interfaces can also be considered as tools for knowledge
predictions are then distributed to the plants. acquisition because they help the domain experts check
that their ideas work or have been correctly understood.
The individual plants form their own plans on the basis of
their own production situation and the information given The results available to the users are predictions of power
by the centralized part. They then use these plans to com- and its associated marginal costs as well as planned power
pute their predictions concerning the power consumption consumption restrictions. Some summary reports are also
and possibilities to change it. planned. Some of the planned power restrictions will be
given to the process control system automatically but also
The prototype consists of four knowledge bases that are then the process operator will be notified and he always
implemented with frames and Lisp-code, figure 9. There is has the possibility to cancel the suggested power restric-
so far only one rule base that is used in determining the tion.
price of bought power.
The prototype user interfaces are used to clarify the ope-
ration of the prototype to the different users and domain
INTERFACES specialists. They are then asked how the system should
function, what information and how it should be presented
to the users and is it possible to feed the system with the
Modelling the interfaces in the prototype information it requires from them. Some examples of the
user interfaces of the prototype are in figures 7 and 8
One of the main goals of the prototype is to help in the (with texts translated from Finnish).
definition of interfaces between all the different systems
and their users. Here the scope of the prototype is of
course restricted only to the solution of the peak load RESULTS
control problem. A simplified example of the structure of
this model in the prototype is in figure 6. The first major version of the prototype was evaluated in
November 1990. A definition of a load prediction system
The prototype is not intended to be connected with the based on it will be made in 1991. However, we have not
existing systems. Rather some parts of the program could yet decided to further develop the centralized decision
later be rewritten and embedded there. making part. Simulation models and other refinements of
the prototype would be needed, before its load control
principle could be adequately tested, its appropriate com-
Integration with process control systems plexity determined and its possible unstability detected.
The major remaining problems seem to be predicting the
The peak loads cannot be controlled without disturbing costs and benefits of a decision and defining the interfaces

33
to other systems and operators. Effective division of the tasks among knowledge engineer
and domain expert is very important because of the
An additional purpose of the prototype is to teach the limited availability of competent domain experts. It is also
participants knowledge engineering. In this respect the important to analyze the knowledge immediately before
project has succeeded. The prototype has also clarified the more is gathered.
peak load control problem to different domain specialists
and contributed to improvements in the existing peak load
control system. Lacking versatility of tools

Unfortunately there is not yet any expert system shell that


EXPERIENCE AND FUTURE would meet all the requirements of this application. What
CHALLENGES we would like to have is a system development tool that is
easy to use, flexible, reliable and supports object oriented
modelling, scheduling, prototyping of user interfaces and
Prototyping comparison of decision alternatives. Significant portions of
the built system would rather be embedded into existing
The prototype approach is useful. However, the prototypes process control systems and maintained by the plants' own
should be kept inside the phases of the system develop- personnel. Experience is needed in order to avoid spen-
ment. They should also be properly defined before their ding too much effort on the details of the tools instead of
implementation is started. It is important to plan the the application itself.
restriction of the scope of a large prototype and its decom-
position into smaller systems as early as possible. The time
schedule tends to be weighed too much compared to the User interface design
contents and quality of intermediate phases of the proto-
type and the time and tools required for testing are often The operators and supervisors of the steel industry can
underestimated. The problems result from the fact that better understand the proposed plan of an interactive
prototypes are typically used when it is not possible to system from a prototype interface than from written docu-
define the system in advance. mentation. More than half of the implementing of the
prototype has been required by user interfaces although
there are high level tools for it in the expert system
Object oriented programming development system that we use.

Object oriented programming is very useful in the proto-


type because model based simulation, complex user inter- Distributed expert systems
faces and flexible maintenance are required. Different
structure models (like in figures 3,5 and 6) and inheritance This kind of a system should be well integrated in the
models (for example figure 4) are essential in the proto- process control system. An appealing way to meet this
type. Separate class hierarchies for visual appearance on requirement is to distribute parts of the system in different
the screen and for the connection of the user-interface process plants and let these parts communicate with each
module to the other parts of the program are needed in other. There are very few tools that support this approach.
order to achieve this modularity.
We have modelled a distributed system structure in our
prototype. However, it can only be used as a test bench for
Predicting the ideas that then must be rewritten with more conven-
tional tools.
The prototype and its background information has shown
us that the power consumption of a steel plant can be so
well predicted that it is feasible to control the loads by Reasoning under uncertainty
using these predictions. We are seeking further possi-
bilities to improve the accuracy of these predictions. Pre- In the control of the peak loads of steel production it is
dicting the costs or benefits of a decision is much more essential to take into account the uncertainties in pre-
difficult. However there even very rough estimates will dicted power and costs. In the predictions there is uncer-
help prevent the very expensive choices. tainty both in timing and level of the electric power. It is
caused by disturbances, uncertainty of the models and later
process control decisions. The estimates of power and
Knowledge acquisition costs resulting from a control action can seldom be made
very accurate. The uncertainty should not be neglected in
The traditional methods for knowledge acquisition are this problem where the costs often steeply depend on the
time consuming and prone to errors. Even expert system timing of actions.
development tools for problems of medium size or com-
plexity are too difficult to learn and use for a domain There are several representations of uncertainty that are
specialist and a knowledge engineer is needed to master used in expert system technology (eg. Bayesian statistics,
both the tool and the principles of expert system tech- belief functions and fuzzy logic). They all seem to lead to
nology. Usually knowledge engineer spends much time too inefficient, limited, unrealistic and complex modelling
learning the basic terminology and principles of the subject of uncertainty in this case. So we use very simple
matter. On the other hand it is not easy to tell the domain approaches in the prototype. Roughly speaking we predict
specialist what kind of knowledge is needed to make the the maximum possible power consumption because it is
expert system work. This knowledge acquisition bottleneck more important in our application than the most probable
makes the development and maintenance of expert systems one.
expensive and slow.

34
CONCLUSIONS Fbwer
HW/k
From the prototype and knowledge gathered we conclude

fflilfL tf\ r\
that it is possible to build a system that predicts the power

n rt\
Lildil
consumption of steel plant for controlling the peak loads. S+
However, there will be many difficulties in building a
power consumption optimization system that works in the
real industrial environment. Further research would be
needed in order to test and develop the load scheduling
approach suggested in the prototype. The feasible com-
plexity of a real system depends on the savings potential
and the difficulty of the problem. We think that building a
prototype using object oriented programming is a good
approach when solving this kind of complex problems.
* time/h
-, batch
ACKNOWLEDGEMENTS

This project is funded by the Finnish Ministry of Trade


(KTM) and Industry and the members of the Association Figure 1. Typical power consumption of an arc furnace.
of Finnish Steel and Metal Producers. Also Imatran
Voima Oy (IVO) has financially supported it.

REFERENCES

David A.K.(1988). Load forecasting under spot pricing.


IEE Proceedings, vol 135. Pt C,No 5, pp.
369-377.

Gross G.and F. D. Galiana (1987). Short term load fore


casting. Proceedings of the IEEE, vol 75 no
12, pp. 1558-1573.

Numao M., S. Morishita (1988) Scheplan-A Scheduling


expert for steel-making process. International 3 time/h
I i t
Workshop on Artificial Intelligence for In- maintenance maintenance
dustrial Applications 1988, pp. 467-472 (unscheduled)

Rahman S. and M. Baba. (1989). Software design and Figure 2. Typical power consumption of a hot rolling mill.
evaluation of a microcomputer-based auto-
mated load forecasting system. IEEE Transac-
tions on Power systems. Vol 4 No 2. pp. 782-
788.

Sztrimbely W. M., P. J. Weymouth and T. Ponzo (1989).


An Expert Systems Approach to Rule-Based
Dynamic Process Plant Scheduling. In M. Y.
Demeri (Ed), Expert System Applications in
Material Processing and Manufacturing. The
Minerals, Metals and Materials Society, pp.
211-217.

Takahashi T., M. Konishi, S. Tamura, H. Hanaoka and H.


Nakagawa (1989). Scheduling for steel making
process using mathematical programming
method. Preprints of the 6th IFAC Sym-
posium on Automation in Mining. Mineral
and Metal Processing 1989. pp. 133-138.

35
Mine - -CH Sintering and peletizing I


j FeCr-furnace
n L FeCr-furnace

I <l i>C storage

Scrap, additives -->{ Stcel-furna

i:
Laddie-furnace |
{ Continuous-casting &~

Legend: \
V
Hot-rolling
;
|
-^C storage :

slow inter action HI storage


remarkable interaction
_fc.
Cold-rolling ~"| [>~ storage

Figure 3. The material flow of a steel plant.

e-consumption-units

, e-production-units
e-management-units ;
~-simulation-units - Sendzimir-mill-I
cold-rolling-mill
" units-with-e-contracts Sendzimir-mill-II

. walking-beam-furnace
steel-plant hot-rolling-mi 11 roughing-mill
-FeCr-furnaces Steckel-mill
arc-furnaces d ^
- steel-furnaces ^^^. arc-furnace
, continuous-casters steel-melting-shop converter
^ ^ * " continuous-caster
unit-processes ' - power-plants ^ Iaddle-furnace
-refining-processes -A00-converters

\ cold-rolling-mi Us
rolling-mills -zzZ power = power +
hot-rolling-mi I Is k kO

Figure 4. Example class hierarchies. Figure 5. An example of a parts relation.

1 V
Electrolysis

Figure 6. The in the prototype modelled distribution of the system.

36
pfice demand supply

i i i i | i i i i | i i i i | i i i i | i i i i | I I I I | I I I I | I I I I |
50 10 150 200 250 300 350 400

hourly power MW

Figure 7. The marginal price balance display.

Figure 8. A display of power demand and supply stackbars.

Figure 9. Modules of the prototype (Knowledge bases).

37
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

DEVELOPMENT OF A SCHEDULING EXPERT SYSTEM


FOR A STEELPLANT
K. Stohl*, W. Snopek*, Th. Weigert** and Th. Moritz**

*VOEST-Alpine Industrieanlagenbau Gesellschaft m.b.H., Dept: ET AI, Turmstrasse 44, A-4031 Linz, Austria
**Research Institute for Symbolic Computation, Johannes Kepler University, A-4040 Linz, Austria

Abstract. This paper describes the development of the scheduling expert system VASE
(Voest Alpine Scheduling Expert). It deals with a LD-steelplant, comprising 3 LD-convert-
ers, 3 secondary metallurgy aggregates and 2 continuous slab casters. We selected this
domain to gain experience in applying expert system technology to large-scale manufac-
turing problems with stringent constraints. We describe our initial prototypes, and point
out problems encountered during their development. Our initial prototypes pointed out
that the scheduling problem could not be solved by conventional programming techniques
or by expert system technology alone. A combined approach of expert system technology,
conventional programming, and data base management proved to be the adequate solu-
tion. Conventional parts were solved in C, for the knowledge-based parts we used NEX-
PERT OBJECT and as the data base we used ORACLE. Comparisons between schedules
produced by h u m a n experts and schedules done by the computer showed t h a t the compu-
ter is much faster and does not make mistakes as h u m a n experts do. Although there is
still a potential for improvement the comparison showed that the quality of the expert
system's schedules is a t least as good as t h a t produced by the most experienced h u m a n
experts (neglecting their mistakes).

Keywords. Automation; Expert Systems; Knowledge Engineering; Metallurgical Indus-


tries; Steel manufacture; Scheduling;

THE SCHEDULING PROBLEM Typically a converter produces for a designated


IN STEELMAKING caster. One route r u n s via the conditioning stand,
the other includes the ladle furnace. Both routes
The Steelmaking Process overlap at the vacuum treatment stand (however,
not all h e a t s require vacuum treatment). Deviations
The steelplant of VOlEST-Alpine Stahl Linz has from the standard routes are sometimes necessary
three LD converters, whereof normally two produce to meet time constraints.
and one functions as stand-by. After tapping the
crude steel is treated at one or two of the three sec-
ondary metallurgy aggregates (ladle furnace, condi- The Scheduling Process
tioning stand, vacuum treatment stand) and finally
the steel is cast at one of the two continuous slab Daily a production schedule for the next 24 hours
casters, whereof one is singlestranded and the other h a s to be established. Currently this is done by the
is twostranded. All casters are equipped with width- h u m a n experts (dispatchers) and takes about three
adjustable moulds. The heats can be cast in to four hours.
sequence as long as they are delivered in time to the
caster and the thickness of the slabs does not Scheduling starts with a pool of several hundred
change. Casting might also be interrupted due to orders characterized by steel grade, weight and slab
the chemical composition of certain steelgrades. dimensions. The orders are prepared by the rolling
Interruptions of the sequence of less than about mill and should be produced within designated time
three minutes may be tolerated, otherwise casting intervals. An order constitutes only a partial heat.
has to be stopped and the casters have to be read- Two or sometimes three of them have to be combined
justed. This takes about one hour and should there- to form a single heat (= 140 metric tons of steel).
fore be avoided. The resultant strand is cut into Various metallurgical criteria determine how orders
slabs of predefined length. The processing of the may be combined to a heat.
slabs is not part of the scheduling described here.

39
Heats are then arranged into production sequences. high quality steel with narrow tolerances for the
Complex criteria determine whether two heats may allowed chemical composition.
be cast consecutively. Otherwise casting has to be
interrupted; prior to continuation steel tubes are Various constraints determine whether production
put into the mould to separate adjacent heats to pre- orders may be combined to heats, and which heats
vent intermixing of steel of different grades (this is may be cast consecutively in the same sequence.
referred to as "quality separation"). Interruptions of Typically, these constraints deal with the chemical
less than about three minutes can be tolerated and composition of the desired steel. They also prescribe
do not force new preparation of the casters. Ibo which corrective measures (quality separations,
large deviation of the chemical composition between tundish changes) are required to consecutively cast
adjacent heats may force a change to a new tundish. heats which violate these constraints.
If there is no tundish ready to use, which happens
when the utilisation time of the preceding tundish
was too short, the sequence ends. If
the overlap of the carbon tolerances
of several orders is lower than a limit
The dispatchers, through experience, have devel- Then
oped planning strategies which allow them to gener- they must not be combined to a heat
ate production sequences that are legal in the sense
of not violating the metallurgical constraints. Their
strategies are geared towards meeting the following If
goals. the median carbon concentrations of
two consecutive heats differs by more
than a limit
Then
Aims of Scheduling a quality separation is required

Scheduling has various, sometimes conflicting aims:


Fig. 1 Examples of metallurgical knowwledge
- Maximization of the length of casting sequences. If
possible, the casters should cast all the time.
In addition, experience with the plant has taught
acceptable tolerances in casting certain grades of
- Minimization of quality separations in a tundish.
steel (such as minimal and maximal casting veloci-
ties, or allowed variations in the width of a slab).
- Minimization of tundish changes in a sequence.
This means a maximization of the utilisation time of
The constraints, described so far deal with the needs
each tundish. The limit is the life time of a tundish.
of the casters and the tundishes respectively. In
addition, there are constraints dealing with the
- Minimization of variation in the width of the slabs
other aggregates of the plant. They typically depend
in a sequence.
on the chemical composition of the desired steel as
well.
- No violation of production rules, both when com-
bining steel orders to heats and when arranging
Treatment times and (if necessary) waiting times for
heats to production sequences.
various production steps are given as standards,
which have to be met.
- Minimization of waiting times of the converters. If
possible, the converters should produce all the time.
Planning knowledge. Dispatchers have developed
strategies which allow them to meet the throughput
These goals have a direct correspondence in produc-
requirements of the plant while meeting the quality
tion costs: Large variations in the width of slabs,
constraints for the steel to be cast. Since dispatchers
tundish changes, and quality separations lead to
do not have the time to try a large number of permu-
waste of material. Violation of metallurgical con-
tations of several hundred orders they have devel-
straints may result in products not meeting any
oped heuristics describing schedule schemata which
order. Maximization of throughput reduces the pro-
are likely to yield successful sequences. Scheduling
duction costs.
for the dispatchers is applying these schemata to
the set of heats.
Knowledge in Scheduling
Scheduling knowledge. Various constraints are typ-
As indicated above various criteria have to be con- ical for any scheduling problem. For example it is
sidered during scheduling. Correspondingly we can not possible to treat a heat at two different aggre-
identify different types of knowledge in the schedul- gates at the same time. There are due dates which
ing process. must not be missed, etc.

Metallurgical knowledge. Market demands require

40
Difficulties in Scheduling INITIAL PROTOTYPES

At a minimum, 150 orders have to be combined to First Prototype


heats to form a schedule. The resulting heats have
to be arranged to a legal schedule. Often several As a first development step we took the available
combinations are possible which might result in parts of the metallurgical knowledge, and put it into
radical differences in the' schedule later on. Nor- the knowledge base. With this knowledge it was pos-
mally there are about 20 different steelgrades in the sible to select all combinable orders relatively to a
pool for one day (the Linz-plant produces about 2000 specific order ("combining rules") and to select all
different steelgrades). sequenceable heats relatively to a specific heat
("sequencing rules"). Additional knowledge in the
The dispatchers cannot check the multitude of pro- knowledge base was determined how to select the
duction constraints for the scheduled sequence. As a best order from the set of feasible orders ("best next
result dispatchers often combine orders illegally to order rules").
heats and they often build sequences of heats t h a t
must not be cast in this order. Negative effects on Scheduling with this first prototype was done in the
the quality or expensive corrective actions are the following way. All orders of the pool were loaded into
consequence. the knowledge base. An initial order was selected by
the user. Then the systemrepeatedly performed the
Due to the differences in the time it takes to pro- following steps. It decided whether it should look for
duce, treat and cast a heat, stops of the sequence are an order to start a new heat (applying the "sequenc-
unavoidable. The dispatchers are not able to calcu- ing rules" and the "best next order rules") or for an
late the expected production times for each heat of a order to complete a h e a t (applying additionally the
given sequence. Therefore these - in principal fore- "combining rules"). The so found heats were added
seeable - stops and the required new preparation of to the actual schedule. If the schedule couldn't be
the caster cannot be predicted by the dispatchers. continued, the prototype removed the last found
This is a disadvantage because such (predictable) order from the sequence and looked again for a next
stops might otherwise be used to produce certain order. If this was not possible too, it asked the user
steelgrades which, due to their chemical composi- what to do.
tion, also force such stops.
The prototype proved t h a t the problem could in
principal be solved with expert system technology,
Why an Expert System allthough the dispatcher's schedules were better
than the system's. We had to add missing knowl-
The dispatchers work at their limits, but the market edge, like knowledge about tundish changes, deal-
demands (small tolerances for the chemical compo- ing with the lifetime and with the minimal
sition, j u s t in time delivery) are still increasing. utilization time of a tundish, as well as some other
Therefore a computer aid >r the dispatcher's sched- aggregate specific knowledge.
uling activities is necessary. Expert system technol-
ogy seemed to be the best way to meet the challenge, The improved version of the first prototype rendered
for several reasons. much better schedules, but still they were not satis-
factory, because they weren't as good as the dis-
- Conventional scheduling methods (such as job patcher's. In addition the speed of the system was
shop scheduling) were not possible due to the strin- quite low (about 50 minutes for one schedule). Plan-
gent constraints on consecutive heats. ning time for a whole production sequence (for one
day) with this system was shorter than the dis-
- Metallurgical knowledge was already available in patcher's planning time, but dispatchers have to
rule form. react quickly to disturbances in the steelmaking
process. For this task our system couldn't be of any
- Planning strategies could easily be captured in help.
rule form.
Supplementing the prototype we started to develop
- The planning strategies were imprecisely formu- a program, t h a t takes a sequence of heats as input
lated and sometimes conflicting. An iterative devel- and calculates the time required for the individual
opment style (prototyping) was required. steps in producing, treating and casting of the heats.
We needed such a program to be able to predict
- Scheduling constraints are sometimes changing. interruptions of the sequence because of time con-
Moreover at the beginning of development these straints. We decided to implement this complex task
were neither complete nor consistent. It had to be by conventional methods.
possible to change the knowledge base easily.
For each h e a t the minimal and maximal casting
time are calculated, taking into account limits for
the casting velocity and the changes in the widths of

41
consecutive slabs. Also quality separations and THE CURRENT SYSTEM - VASE
tundish changes have to be considered because they
influence the minimal and maximal allowed casting Basic Ideas of VASE
velocity.
Based on the experience gained with the initial pro-
Next, we calculate all treatment- and waiting-times. totypes described above, we decided not to try to
The goal is to produce the heats as fast as possible improve the existing prototype but to develop a new
but as slow as necessary in order for the sequence system. The new system was to be developed as a
not to stop. "hybrid" system, i.e., only the "intelligent" part of
the scheduling process (the planning strategies of
the dispatchers), should be kept in the knowledge
Second prototype base. The rest was to be put in external C-routines.
This should result in
Before dispatchers begin to schedule, they build
groups of similar orders. Many planning strategies - Clear separation of the different types of knowl-
deal with groups of orders as well. This alternative edge and therefore an easier maintenability.
approach seemed to be promising and was the start-
ing point for our second prototype. - Much better performance due to the use of C.

We developed a pattern matching algorithm to build


groups of similar orders. Similar means t h a t these Components of VASE
orders can be cast in arbitrary order without any
corrective actions at the caster. These groups Fig.2 gives an overview of the components of VASE.
became the new input for the expert system.

The method of scheduling changed slightly. The first


step was now to load information about the groups,
like "vacuum treatment is necessary for all orders in
[ Preprocessing |
the group" into the knowledge base. Then a loop was
| of orders I
started. Based on some simple criteria one or more
of these groups were selected to become the new
pool. The system then performed the same actions
as the first prototype. After finishing one tundish,
all remaining orders were put backinto the initial
pool, and the expert system selected one or more of I Scheduling Calculation \
of heats of production
groups to become the new pool.
times J
As the scheduling was based on preprocessed data it
was easier to implement planning strategies. The
addition of planning strategies and the use of groups
were the major improvements of the second proto-
type. Besides we made some further little enhance-
ments of the knowledge base.

The schedules generated by the second prototype


were much better t h a n those generated by the first Fig.2 Structure of VASE
prototype. They were always correct but if a dis-
patcher's schedule was correct it was preferable. Preprocessing of the set of orders. VASE begins with
Performance was improved, but the system still was a set of orders. A C program preprocesses the orders
too slow. and prepares the relevant knowledge for schedul-
ing. The following actions are preformed:

Results of the initial prototypes - Transformation of a set of orders to a set of heats

All though the expert system scheduled much faster - Grouping of heats of equal chemical composition to
than the dispatcher, it was too slow for use in prac- "cliques"
tice. The number of rules now was so big t h a t main-
tenance of the system turned out to be not as easy as - Generation of information about the cliques
expected. All types of knowledge were freely inter-
mingled in the system. - Generation of information about the relationships
between the cliques, e.g., clique x and clique y may
be cast in arbitrary order without quality separation

42
- Arrangement of similar cliques in metacliques, so and fulfill all production rules manually, therefore
that all heats of one metaclique might be cast in their schedules sometimes include illegally com-
arbitrary order bined orders and heats in an illegal order. On the
other h a n d dispatchers sometimes deliberately vio-
All metallurgical knowledge is included in this C- lated production rules because they know or hoped,
program. If other parts have to use such knowledge, t h a t these violations have no negative effect on the
they use condensed information that is generated quality of the slabs. This led to a reformulation of
here. Should the metallurgical knowledge change the production rules. This reformulation was
only the preprocessing routines have to be updated. repeated since the first tests of VASE.

Knowledge base. The knowledge base holds the The production rules had to be made relatively sim-
planning strategies, which describe what kind of ple for manual scheduling, because they cannot be
orders yield successful production sequences. The adhered to by men otherwise. Now, they can be as
planning strategies are used to rank the various complex as required, because there is almost no
cliques according to their suitability for adding limit in the number and in the complexity of the
them to the actual schedule. rules for the computer.

Allthough dispatchers frequently violate production


If rules, in relatively simple situations the dispatchers
Desired caster is twostranded are able to establish legal schedules. In such situa-
Then
begin sequence with heats that require tions comparisons between VASE and the dis-
vacuum treatment patcher showed that VASE's schedules are as good
as the dispatcher's, sometimes better. But the
strength of VASE lies of course in difficult schedul-
Fig. 3 Example for planning strategies ing situations, when the dispatcher is not able to
generate correct schedules.
Scheduling of heats. The ranking of the cliques is
taken by a C-program, which identifies the meta- Calculation of the times necessary for the various
clique containing the best ranked clique. The heats production steps could not be performed by the dis-
of this metaclique are then scheduled using con- patchers. VASE provides this information and can
densed information generated by the preprocessing adapt schedules by utilizing knowledge about
step. Only correct sequences of heats are established expected stops in a sequence.
and the differences in the widths of the slabs are
minimized. All tests showed t h a t VASE is now fast enough to
help also in rescheduling situations. It takes about
The program stops if either the tundish is fully uti- one minute to establish a schedule for one day.
lized or if the metaclique is empty. Then control is
handed back to the knowledge based component.
LESSONS LEARNED
Calculation of production times. Production times
are now calculated for the resulting schedule. In The following summarizes our experiences during
addition, further integrity checks are performed on the development of VASE. These lessons should
the schedule. The calculated time points are written equally apply to other expert system projects for
into the data base, the knowledge based component similar problems.
is informed whether the sequence can be cast and if
there are time restrictions for the next heats due to - First of all we saw, we could solve our problem
the calculated results. using expert system technology.

These results influence the ranking of the remain- - At the beginning of development knowledge about
ing cliques, which is done next. scheduling was ill-structured and incomplete. The
chosen prototyping approach made it possible to
Schedule optimizer. There are criteria for schedul- tackle the problem. During development old knowl-
ing which refer to a complete schedule and not to edge was re-examined and new knowledge was dis-
individual heats and they cannot be considered dur- covered, a process which is typical for many expert
ing scheduling. It is possible to improve schedules system projects.
by shifting tundish changes, by changing the order
of the tundishes or by inverting the sequence of - If the knowledge base grows very big, (i.e. it con-
heats of a tundish. tains many rules and objects) performance problems
might occur. A generous amount of main memory
and a fast machine are helpful.
Results of VASE
- Knowledge representation in rules only does not fit
On the one hand dispatchers are not able to check complex problems. Therefore the choice of a tool or a

43
programming language should also be based on to be necessary (e.g. because of performance rea-
whether alternative forms of knowledge representa- sons), especially if one uses backward chaining as
tions are available. inference process.

- A crucial point in building an expert system is


knowledge acquisition. It is very important to have FUTURE DEVELOPMENT
highly motivated experts. Prototyping helps in this
process, because one sees early, whether the prevail- Improvements of VASE
ing acquired knowledge is correct or not. Prototyp-
ing also helps to motivate the expert, because he can We plan to improve VASE in two ways. First we
see how his ideas are used to solve the problem. want to include missing knowledge. Secondly prac-
tice with VASE will help to develop or uncover new
- Filling the knowledge base is more than only trans- scheduling knowledge, which also will be includea-
lating the expert's knowledge in rule form. The ble.
knowledge engineer has to know exactly how the
inference engine is working. A very important topic will be the addition of real
time features. This means that VASE should be able
- Prototyping proved to be very helpful in building to react to events in the production process. Prob-
an expert system. It helps in knowledge acquisition lems in the production process may require heats to
and allows to test new ides quickly (without recod- be rescheduled; sometimes it will be sufficient to
ing as often in conventional programming). shift the production times. This is currently done by
the dispatchers.
- It is often said that maintenance in expert systems
can be done easily, l b indeed make maintenance
easy it is very important to structure the knowledge Scheduling Shell
base in different modules and to use different forms
of knowledge representation for different parts of We started to develop a scheduling shell, which
the problem solution. Also it is very important to facilitates the development of a scheduling system.
have a tool that offers powerful debugging methods. The shell will already contain domain independent
scheduling knowledge. Adaptations to an existing
- Stand alone expert systems often are not sufficient. facility by adding domain specific knowledge will be
They have to be embedded in standard software like simple. The shell will be coupled with a consistency-
databases and they should give the possibility to and completeness checker (developed by RISC-
integrate them with conventional software. Linz). It will reduce the development time for a
scheduling system for our personnel.
- In literature one can often read about the explana-
tion component of expert systems. We saw that this
component is often not sufficient for the user of the CONCLUSIONS
expert system.
The development of VASE showed that expert sys-
- Some expert system shells offer consistence checks tem technology is an adequate approach to solve
of the knowledge base. Typically these checks are complex scheduling problems (in a steel plant). The
done in too trivial a fashion. increasing market demands (narrower tolerances
for the chemical composition, just in time delivery)
- Expert systems should not be limited to one (espe- makes it more and more difficult for the human
cially the developer's) machine. Of course it easier to experts to adhere the rules of production. The use of
develop an expert system on a workstation, but VASE guarantees that all production prescriptions
often the target machine might be a PC. Moving are fulfilled and will therefore lead to a planable
between platforms has to be done easily. quality control in the steel plant. VASE has been
designed for easy adaptation to new scheduling con-
- We saw that using backward chaining as inference straints.
strategy makes it easier to develop and maintain a
system, allthough the risk to program the knowl-
edge base very algorithmicly is high. ACKNOWLEDGEMENTS

- NEXPERT OBJECT proved to be a good choice. It We want to thank Dipl.Ing. J. Neuwirth, VOEST-
offers different forms of knowledge representation Alpine Stahl, whose scheduling knowledge made
(rules, classes and objects), can be easily embedded the development of VASE possible. We also want to
in other software and has powerful debugging fea- thank the various dispatchers for their assistance
tures. during the development of the system.

- It is often quite straightforward to translate the


rules into a conventional program, if this turns out

44
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

AN EXPERT SYSTEM TO AID OPERATION OF BLAST


FURNACE
T.H. Choi*, S.H. Yi*, I.O. Lee*, H.G. Lee** and J.G. Jin***

^Department oflronmaking, RIST, Pohang, Korea


^Department oflronmaking, POSCO, Pohang, Korea
***Department of Instrumentation & Control, POSCO, Pohang, Korea

Abstract. An expert system has been developed to support the


blast furnace operation in the integrated iron and steel making
works. In the present study, the knowledge bases were built up,
which diagnosed the formation of a lower part inactive zone and
and the unstable condition of inner furnace gas flow. In order to
enhance the diagnostic resolution and the robustness to counter-
act various furnace abnormalities, the data acquired by several
sensors, which were installed in the blast furnace, they were
pre-treated before using them as input sources for the diagnosis.
The developed expert system diagnosed the real on-line process
of the blast furnace operation rapidly and precisely.

Keywords. Artificial intelligence; expert system; knowledge


engineering; blast furnace operation.

INTRODUCTION 2) difficulties in revising the existing models


by themselves.
The control for sound blast furnace operation
tends to depend on operator's experience during In order to compensate for the limitations of the
operation. However, the men's abilites to make a conventional approaches, the expert system which
quick judgement and to take appropriate actions solves the problems in a specific field based on
against the gradual changes of in-furnace condi- the knowledge and experience of the operators, has
tions are limited technically as well as psycholo- emerged with its most useful features.
gically. As a result, different measures could
made by each operator for similar in-furnace An on-line based expert system which applys the
conditions. Until now, many numerical and statis- knowledge engineering methods, was equipped to the
tical models attempted to simulate the in-furnace existing computer system of POSCO (Pohang Iron &
conditions of the blast furnace, but they could Steel CO.) No.3 B.F. since April of 1990.
not overcome the following limitations;
In this report, the procedures of system integra-
1) difficulties in expressing the knowledge of tion will be described including the pre-treatment
experienced operators quantitatively, processing of data and knowledge base, which were

BF P/C AI Processor

Machine(A60) Tool(ElXAX 11)


Temp.

YEWCOM
-8000 Data Data Inference Engine
Gathering Processing ">

1
Data Base
A
KB

Operational Actions
_(^1VC CRT

Fig. 1. Overall structure of the expert system.

45
mainly performed by the statistical analysis of
past operation data as well as with the know-how
of operation experts.

STRUCTURE OF THE EXPERT SYSTEM

Figure 1 shows the configuration of the expert


system for No. 3 B.F. at POSCO. The system
comprises of a process computer, AI(Artificial
Intelligence) processor, DDC(Direct Digital
Controller) and PLCCProgramable Logic Controller).

The process computer collects operational data


from the sensors and then, transfers them to the
AI processor. Those operational data are pre-
treated for final reasoning about the current
in-furnace condition, combining with knowledge
bases. Approximately 700 points of sensor data are
stored for every minute in the process and 200
points of these are selected as the pre-processing
data used for discriminating each in-furnace
condition. The results from the hourly reasoning
are displayed on the end user's graphic terminal.

The system has a unique advantage in separating


the operation system from the development system,
which facilitates the pretesting in advance and
rapid inference.

DATA PROCESSING

A blast furnace consists of very complicated


facilities which require the operator's attention.
Its main feature is a long time-lag of about 5 to
6 hours between charging raw material at the top
of the furnace and discharging hot metal at the
bottom. Also it is difficult to describe the in-
Fig. 2. Sensor data processing.
furnace conditions from the explict data reading
through the sensors. As a result, most operators
believed that each sensing data only stood for the
instantenous change of the in-furnace conditions, In order to understand the transition of indices
which has resulted in incomplete diagnosis. calculated in the primary and secondary processing,
the correlation between each index was analysed
As a means to solve the limitations on the sensing and the level, fluctuation, gradient and integral
and conceptualizing of the data trend recognition value were compared at this stage. The results
which was done through the operator's own experi- were stored in the memory and used as actual data
ence, a data pre-treatment process is required for reasoning.
prior to the system integration.

In the present study, various cases of markedly


changed in-furnace conditions were gathered and CONSTRUCTION AND FEATURES OF
analyzed statistically. The basic concept of data THE KNOWLEDGE BASE
pre-treatment processing is illustrated in Fig.2.
As seen in the figure, the sensor data were proce- Basic Conception
ssed in three steps.
The procedure on the abnormal behavior of in-
Primary Processing furnace conditions can be divided into three
stages as shown in Fig.3, which were deduced from
In this processing, four types of processing operational experience and performance data
methods were applied to determine the specific in accumulated over a long time. The profile of gas
-furnace patterns. These were as follows; flow within the furnace is usually changed by
external disturbances, related to raw materials
a) circumferencial balance quality, burden distribution, e t c as the first
b) vertical balance stage of abnormal conditions.
c) standard deviation
d) comparison of level & variation The furnace irregularities can be recognized by
the unusual behavior of sensing data. In more
Each index was calculated by comparing sensor data severe situations, the degree of deviation between
to definite standard value. current and standard gas flow profiles becomes
greater, accompanying the worse permeability and
Secondary Processing excessive heat load on the counter part of the
furnace. Finally, such abnormal phenomena inevi-
Characteristics of the index connected with a tably bring about frequent slips and drops, and a
change of in-furnace condition, such as fluctu- lessened wind rate due to the deterioration of gas
ation, duration and relationship were analyzed by permeability.
the use of a statistical method.
The aim of the system is to prevent unstable
Tertiary Processing furnace conditions by the early detection of the
variation in gas flow profiles (e.g. circumfer-

46
encial and vertical balances etc.) which are diagnosis. In the first stage, the variations on
regarded as the typical cause of advanced abnor- circumferential and vertical balance were investi-
malities in blast furnace operation. gated in order to diagnose the primary phenomena
of the unstable conditions and to prepare the fol-
The knowledge base was built up to diagnose the lowing action guidances. In the second stage, the
abnormal conditions of blast furnace operations blowing condition and heat level were investigated
with respect to the formation of inactive zone at and action guidances were proposed to prevent
the lower part of the furnace and the unstable unstable gas flow.
conditions of inner furnace gas flow.
Figure 5 shows the structure of the knowledge base
for the unstable gas flow diagnosis.
Variation of circumferential
First balance
st ace (First stage diagnosis)
Variation of vertical balance
Vertical balance KS First action guidance

V Circumferential
Burden distribution
control - I
Short term fluctuation of shaft balance KS
Second temperature & pressure

0
stage
Permeability change

0 (Second stage diagnosi s)

Blowing condition KS Second action guidance


Third SIip & drop
stage
Heat level variation Burden distribution
Heat level KS control - 11

Fig. 3. Procedure on the abnormal behavior of


in-furnace conditions.
Fig. 5. Configuration of the knowledge base
of unstable gas flow diagnosis.

DIAGNOSIS OF THE FORMATION OF INACTIVE


ZONE AT THE LOWER PART OF THE FURNACE RESULTS OF THE APPLICATION ON A REAL
BLAST FURNACE
The knowledge base consists of two stages of
diagnosis, as follows. In the first stage, the Consideration about Blast Furnace Application
initial phenomena of inactive zone formation were
diagnosed such as the decreasing of the lower This expert system was implemented to the on-line
shaft temperature, the deepening of the central process by using the EIXAX knowledge engineering
gas flow and the lowering of the root of cohesive environment, FORTRAN and C languages, etc. A FUJI-
zone. Due to the results of this diagnosis, action A 60 workstation was exclusively connected to the
guidances were proposed to prevent the fluctuation existing process computer. In order to make the
of the shaft temperature and the pressure profile. expert system user-friendly and to provide useful
In the second stage, the deepening status of the information, an user environment was developed,
inactive zone formation was diagnosed and action such as the setting of standard data for diagnosis
guidances were proposed to prevent long term and the display of the diagnosis results on a CRT
unstable conditions. (Cathode Ray Tube) monitor. The knowledge bases
have 310 rules and 13 frames.
Figure 4 shows the structure of the knowledge base
for the inactive zone formation diagnosis. Results of the Application

(First stage diagnosis) This expert system makes inferences, judgments and
action guidances every hour based on the data
Lower shaft temp. KS First action guidance acquired at 10 minutes intervals.
Burden distribution
Central gas flow KS control Figure 6 shows the diagnosis results of the inact-
ive zone formation and operation trend. There, the
operator could not recognize the initial phenomena
Cohesive zone KS of inactive zone formation which caused the decre-
ase in lower shaft temperature and the bed permea-
bility, etc., resulting in no operator's actions.
V By contrast, the expert system detected the
initial phenomena and suggested the action one day
(Second stage diagnosis) before.
Short term variation KS Second action guidance As seen in Fig. 7 illustrating the diagnosis
0/C, PCI, Vb control
results of an unstable gas flow, the expert system
Heat level KS also detected the unstable furnace condition very
well and suggested the action guidance in time.

The blast furnace operation results with and with-


Fig. 4. Configuration of the knowledge base out the application of expert system were compared
of inactive zone diagnosis. in Fig.8. There were a lot of slip, drop and unst-
ableness of heat level without the expert system.
DIAGNOSIS OF THE UNSTABLE CONDITIONS However, the number of slips and drops decreased
OF THE INNER FURNACE GAS FLOW remarkably after the application of the expert
system, and the furnace was more stabilized than
This knowledge base also consists of two stages of when it was operated without the expert system.

47
400
Cross sonde |
temp. No.l
( ' C) 3001

(lower zone)

Certainty 8|
factor of
diagnostic
results(%)

'90 12/19 20 22 23 24 26
Time(day)
Fig. 6. Operation trend and diagnostic results in case of
inactive zone formation.

Cross sonde
temp, no.l
( * C) 260

220

2.

*+*S
K(lower)

Throat temp.
( * C)
100

300

SP1 (kg/erf)
3.1

Certainty 80
factor of
diagnostic 40 Charging mode
results(%) (by operator)
change(by E/S)
0
<J0 9/9 10 11 12 13 14 15 16 17
Time(day)

Fig. 7. Operation trend and diagnostic results in case of unstable


gas flow.

Without theapplication With the application


of expert system of expert system

F.R.(Kg/t-p)

H.M.T..( ' C)

...
(-)

SI ip & drop
(-)
20

90.1 4 5
Time (month)
Fig. 8. Comparison of blast furnace operation results with
and without the application of expert system.

48
CONCLUSION

The expert system developed in this study showed


rapid and precise diagnosis of a real on-line
process for blast furnace operations. And the
system's prevention guide to the future furnace
irregularities turned out to be very useful for
the blast furnace operation. Also, it was possible
to improve the standardization of each operator's
knowledge.

REFERENCES

AMANO.S. (1990). Expert System for Blast Furnace


Operation at Kimitsu Works. ISIJ interna-
tional, 3a. 105-110.
CH0I,T.H. (1991). An Expert System to Aid Blast
Furnace Operation Using Artificial Neural
Network Techniques. HWAHAK KONGHAK, 29,
270-283.
LEVINE.R.I. (1986). Developing an Artificial
Intelligence System. A Comprehensive Guide
to AI and Expert Systems. McGRAW-HILL, N.Y.,
10pp.
NAKAJIMA,R. (1987). Development and Application of
Blast Furnace Operation Control Expert
System Making Use of Artificial Intelli-
gence. ISLL, 73, 2100-2107
SAT0,T. (1985). Operation Control System for
Blast Furnace at Ohgishima. NIPPON 0AN
TECHNICAL REPORT, 44, 1-12.

49
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

A HYBRID EXPERT SYSTEM COMBINED WITH A


MATHEMATICAL MODEL FOR BOF PROCESS CONTROL
T. Yoshida*, H. Tottori*, K. Sakane*, K. Arima*, H. Yamane** and M. Kanemoto**
^Electronics and Information Systems Division, Nippon Steel Corporation, Chikko Yawata-cho, Sakai,
Osaka-fu 590, Japan
**Technical Division, Sakai Works, Nippon Steel Corporation, Chikko Yawata-cho, Sakai, Osaka-fu 590, Japan

Abstract, Nippon Steel Corporation has developed the new blowing control system
which incorporates empirical knowledge of operators in its knowledge base. The
system applies a rule-based reasoning technique to improve the output of the
presently available mathematical models. It also utilizes fuzzy reasoning to
estimate the in-furnace reactions, for which a mathematical model can hardly be
utilized. As a result, it has become possible to perform blowing control more
accurately than does the operator.

Keywords. Knowledge engineering; Real-time expert system; Fuzzy system;


Blowing control; Refining process.

INTRODUCTION chemical composition of steel and the tempera-


ture. As the slag formation reactions are not
The application of computer control system and reflected in the conventional model, this sets
the development of sub-lance measuring tech- another limit on the accuracy of the predicted
nique has remarkably improved the accuracy of values of the blow-end carbon content of steel
blowing control in the basic oxygen furnace and the temperature. Moreover, it is very
(BOF). Recently, requirements for blowing difficult to predict and control the blow-end
control have become increasingly severe to phosphorus content and manganese content of
meet the increase in continuous casting ratio steel using this model. In view of these
and the widespread use of hot metal pretreat- problems, in the actual operation, an operator
ment. In order to satisfy such requirements, determines control values using his empirical
the blowing control accuracy must be further knowledge and know-how, in addition to the
improved. calculation results by blowing control models.
The conventional blowing control system con- However, the accuracy of control depends on
sists of two subsystems. The first subsystem the skills of the operator.
is static control model based on the informa- Nippon Steel Corporation has developed the new
tion obtained before the start of the blowing. blowing control system by applying knowledge
The second subsystem is dynamic control model engineering. The feature of the new system are
based on the process signals representing the as follows:
state of the refining during the blowing.
The static control model calculates the (l)In the static control model, rule-based
initial condition of the raw material charging reasoning has been applied to systematize
and oxygen blowing based on heat balance and the empirical operator's knowledge for
material balance using statistical processing correcting of error factors in control
method. This model performs calculations using values.
only quantitative data obtained from sensors,
and qualitative data is not reflected. There- (2)In the dynamic control model, fuzzy
fore, the output for the control contains some reasoning is applied to the new model
errors due to the inaccuracy of the model. with practical dimensional size, that
The dynamic control model predicts the blow- follows changes in in-furnace reaction.
end carbon content and temperature based on
the measured values of carbon content and This paper describes the newly developed hy-
temperature using sub-lance during blowing. It brid expert system for blowing control combin-
calculates the amounts of the coolant charging ed with mathematical model and the application
and the volume of the oxygen blowing based on results in the actual BOF operation.
the predicted values. Among the actual in-fur-
nace reactions, only the temperature rise of
steel and decarbonization reaction are consid- OUTLINE OF BOF BLOWING PROCESS
ered in the conventional dynamic model. It has AND BLOWING CONTROL SYSTEM
been well known that the slag formation reac-
tions are closely related to the blow-end The outline of BOF equipment is shown in Fig.l.

51
Exhaust gas analysis course of refining following the variations in
the reactions during the blowing. Therefore,
the static control model is usually utilized
to specify the initial condition of the blow-
ing as auxiliary to the dynamic control model.
Gas analyzer
(Mass spectrometer) The dynamic control model calculates the
volume of oxygen and the amount of coolant re-
quired to hit the blow-end carbon content and
the temperature on the basis of the measured
values of steel by sub-lance.

Actually, because of the continuity of blow-


Process
ing, the two models are mutually dependent
computer p^y3-Slag
strongly on each other. For example, when the
A ^ - ^ Molten
i steel *c*- static model has poor accuracy, the control
L
-0
action during blowing increases, and the con-
tinuity of the blowing is impaired by dis-
turbance in the in-furnace reactions. If such
-o
Bottom blowing
gas rate a condition continues, the accuracy of the
Exhaust gas flow rate
Gas flow meter
static model gets worse, and as a result, the
total accuracy of blowing control also gets
Fig. 1. Outline of BOF process worse.

The static control model is expressed by


Scrap and hot metal are charged from the Eq. (1).
vessel mouth. Then, pure oxygen is blown into
the molten metal through the main-lance. [Control value] f(Xi ,x2 , , x n ) (1).
Coolant and flux, such as iron ore, manganese-
ore, lime, and fluorspar, are charged in order In the Eq. (1), f is function based on materi-
to control the slag formation reaction and al and heat balance, ,2,, are quanti-
adjust the values of chemical compositions and tative data automatically obtained from sen-
temperature in molten steel. Bottom-blowing sors, and e is an error term. The error is
equipment which blows carbon dioxide gas into statistically processed and is mainly caused
the molten steel through the nozzle is provid- by qualitative factors which are not reflected
ed to promote the refining reaction by agitat- in function (f), such as, furnace heat retain-
ing the molten steel. In the final stage of ing condition, a condition of slag remaining
the blowing, the sub-lance is inserted into in furnace. In the actual operation, the
the furnace to measure the temperature and operator corrects this error term.
carbon content of molten steel.
The accuracy of the dynamic control model
The configuration of conventional blowing con- depends largely on the following three in-fur-
trol system is shown in Fig. 2. Before the nace reactions.
start of the blowing, the static control model The first is decarbonization reaction. In the
calculates the volume of the oxygen blowing, dynamic control model, the relationship
the amounts and timing of the flux and cool- between decarbonization efficiency (dC/d02 )
ant charging, the pattern of the lance height, and carbon content of steel is expressed by
and the pattern of the bottom-blowing gas Eq. (2):
volume based on the material balance and heat
balance. This model is unable to adjust the dC [1 - exp {-(C-a2)/a3} ] , (2)
dO,

Control
where ai ,a2 ,a3 are parameters.
Control period Target
of dynamic modell
period I J
of static
The second is temperature rise. In the dynamic
model model, the relationship between temperature
rise efficiency (dT/d02) and decarbonization
efficiency is expressed by Eq. (3):

dT
d02 a
* ^ d02 ; (3)

where a4 , as are parameters.


The dynamic model determines the control val-
ues by solving Eq.(2) and (3) using measured
values of temperature and carbon content as
the initial values.
The third is slag formation reaction. This
A reaction is related to the formation of slag
Start of blow
A
Sub-lance measurement End of blow which consists of lime and oxides of iron,
phosphorus, manganese, silicon, etc. It has
Fig. 2. Conventional blowing control. significant effect on decarbonization

52
and temperature rise reactions, and causes the THE NEWLY DEVELOPED BLOWING
parameters in Eq.(2) and (3) to fluctuate. CONTROL SYSTEM
However, an accurate theoretical model of this
reaction has not been developed. Therefore, In the new control system, knowledge of opera-
not only the accuracy of the control values of tor is divided into three functional groups.
temperature and carbon content gets worse, but First group compensates the static model, sec-
also the control of phosphorus and manganese ond estimates and controls the slag formation,
contents is difficult. In the actual opera- and third corrects the parameters of dynamic
tion, the operator estimates the condition of model. Figure 4. shows the diagram of the new
slag formation, corrects the parameters of blowing control system.
dynamic model, and controls the contents of
phosphorus and manganese based on his empiri-
cal knowledge. Reasoning Compensating the Static Control
In order to solve those problems in the con- Model
ventional blowing control and enable the
accuracy of temperature and carbon content The conventional static control model is ex-
control to be improved and the control of pressed by Eq. (1). In Eq. (1), e is error
phosphorus and manganese contents to be auto- term, and it is determined by statistically
mated, Nippon Steel Corporation has developed processing. This term depends on two groups of
a hybrid control system which incorporates the factors. The first group of factors are not
existing mathematical models and systematized reflected in the model, because they are
knowledge of skilled operators. qualitative data and difficult to be expressed
using mathematical model. For example, the
furnace heat retaining condition, the remain-
SYSTEM CONFIGURATION ing slag condition in the furnace, the opening
of the bottom-blowing nozzle, are very diffi-
The configuration of the newly developed hy- cult to be quantitatively measured, and so it
brid control system is shown in Fig. 3. This is very difficult to express them by mathemat-
system consists of the existing process con- ical model. The second is the group of factors
trol computer and a newly installed computer which depends on changing of reaction and er-
for the expert system. The process control rors of quantitative data between and during
computer collects and transmits process data blowing. In the new system, first group of er-
to the expert system computer. The expert sys- ror is systematized using production rule.
tem computer consists of three subsystems As a result in actual operation, the accuracy
process data receive processing, mathematical of static control is improved about 0.3% in
model, and reasoning. Data received from the equivalent scrap ratio.
process computer is stored in an intermediate
file, which is a data base for heats of the
immediate past. The reasoning subsystem con- TABLE 1 Control Result of Reasoning
sists mainly of production rules and frames. for Static Model
Fuzzy reasoning is performed by calling the
LISP function from the production rules. The
results of reasoning are displayed on the dis- New Conventio-
pay terminal for operation guidance. The math- Item system nal system
ematical model subsystem is called from the
reasoning subsystem. In the new system, the Equivalent X -0.06 -0.37
mathematical models are installed in the scrap ratio(%) 1.12 1.13
expert system computer for high speed calcula-
tion.

In this way, it has become possible to improve


-
Expert system the accuracy of model by systematizing quali-
process computer
-computer tative information using symbol processing
r\
-On-line system-
Data Date , . . method and combined with mathematical model.
collec- reception |_j\ ' [ Reasoning
tion^ subsystem

fftftfD Intermediate!
T~T
Mathematical! Operation Reasoning for Estimating and Controlling the
Process file model guidance
data subsystem Slag Formation
Copy
Conventionally, the slag formation is estimat-
i-Off-line system^
ed based on the oxygen balance in the furnace
Reasoning continuously. The amount of oxygen accumulated

U
subsystem
in the furnace is calculated based on the
JZL
Mathematical
difference between the amount of oxygen input-
L
Qualitative
data input
model
subsystem |
ted in the furnace (i.e., oxygen supplied
through the lance, and so on) and the amount
of output oxygen from the furnace (i.e., oxy-
gen contained in the exhaust gas). As the
amount of oxygen accumulated in the furnace is
Fig. 3. Configuration of the new system. related to the slag oxidation parameter, it is

53
-30 ~ 40 min.-
Blowing sequence

Scrap charging Hot metal Start of blow Sub-lance blow-end Composition


charging measurement- measurement
T

Static model Dynamic model

Heat balance Sub-lance


Estimation and measurement
Conventional Material balance control of slag results
formation by
control
Determination of
I operator
4
Determination
the amount of Compensation when to turndown,
the oxygen, flux by operator Compensation the amount of
and coolant by operator flux and coolant

---O-
Reasoning for slag Reasoning for
Reasoning for static model formation dynamic model Leaning
of fuzzy
ES fuzzy Setting of blowing control Estimation of ' Compensation member-
end-point slag formation for dynamic ship
model with
control Compensation for static model| Action for function
the result of
with qualitative information slag formation estimation of
control slag formation

Fig. 4. Diagram of the new blowing control system.

used in the estimation of the slag formation. know-how that contains such words as "large"
However, it is not accurately estimated using and "good" which can hardly be described quan-
only the amount of oxygen accumulated in the titatively, the new system applies fuzzy rea-
furnace, because the slag formation is affect- soning. The relationship between the process
ed by the chemical composition and temperature information (Xi ) and the slag formation is
of hot metal, the condition of slag in the expressed by the following fuzzy rule:
furnace, the amount of flux charging, and so
on. If Xi is A, then slag formation is B. (4)

In order to make the accurate model of slag The fuzzy rules used in this system are shown
formation, there are two problems. The first in Table 2.
is that this model is non-linear model and the
number of the dimension of state variables is
very large. Secondly, it is difficult to mea- TABLE 2 Fuzzy Rules for SI ag Formation
sure all of the data required to specify the Estimation
system state. Therefore, it is difficult to
make practically accuete model.
Conclusion
The new model has solved the problems by in- Condition part
part
Rule Weighted
stalling operator's empirical knowledge. No. Range Degree of mean
Factor Status - slag of rules
Min. Max. formation
The knowledge obtained by hearing from operat-
1-1 Large Nearly good
ors contains some degree of fuzziness as the [Si] in hot
2 Medium 0.15% 0.40% Normal 0.5
following example: metal
3 Small Nearly bad

When the silicon content of hot metal is 2-1


Hot metal
Large Nearly good

relatively large, it is very likely that 2 temperature Med ium 1450C 1150C Normal 0.5
3 Small Nearly bad
the condition of slag formation will
become good. 3-1
Oxygen
Large
Learning from
Good
2 accumulated Medium
preceding heats
Normal 1.0
in furnace
Moreover, operators estimate the slag forma- 3 Small Bad

tion using fluctuation of the pressure at the 4-1 Large Nearly good
Quantity of
food of BOF which is not theoretically 2 fluorspar Medium 1.5 t/ch 0 t/ch Normal 1.0
reflected to furnace reaction. However, it is 3 Small Normal
indicated that the slag formation is active 5-1 Large Good
Amplitude
when the hood pressure is changing widely, and 2 of hood Medium
Learning from
Normal 0.5
the slag formation is poor when the hood 3
pressure
Small
preceding heats
Bad
pressure is stable.

In order to effectively express the operator's

54
"Medium" "Nearly Bad" "Nearly Good"
Reasoning Compensating Dynamic Control Model
"Bad" "Normal" "Good"

The dynamic control model is expressed by


Eq. (2),(3). In Eq. (2),(3), ai ,a2 ,a3 ,a* ,a5
are parameters. These are affected by the slag
0 + Degree of formation. In the new model, these parameters
slag forma- are determined using the estimated value of
Membership functions Membership functions tion the slag formation.
of condition part (A) of condition part (B)

As a result in actual operation, accuracy of


Fig. 5. Membership functions for values of blow-end carbon content and tempera-
slag formation estimation. ture are improved in comparison with the con-
ventional model. It is assumed that the pract-
ical model for blowing control is realized by
| [RULE 1] if Xi is Big then Degree of slag formation is Good]
applying fuzzy reasoning.

;-\
. 1'
1.0
0.8
/
measured
.., l APPLICATION RESULTS
IN ACTUAL OPERATION
value ^

n-
|[RULE 2] if Xi is Medium then Degree of slag formation is Moderate]
The result of the blow-end temperature control
using the new system is shown in Fig. 7. in
0.2 comparison with the conventional system. The
0 i^L 0 center of variance of temperature controlled by the new
gravity
system decreased from 11.6 degrees to 8.2
[RULE 3] if Xi is Small then Degree of slag formation is Bad]
degrees.
The chemical cmposition control results is
shown in Fig. 8. and Fig. 9. The variances in
carbon, phosphorus, and manganese contents
Condition *-r+ Conclusio
part I part decreased, in comparison with the conventional
system.
Fig. 6. Mapping and synthesis of
membership functions.

V 1680
New system
As shown in Fig. 5, fuzzy set (A) in the con-
2 1660 a Conventional
dition part is given by three levels of mem- system
bership functions, and fuzzy set (B) in the CO

.jfK
SJ 1640
conclusion part is given by five levels of
membership functions.
The way deriving fuzzy set of a conclusion
I
1620
part from a fuzzy set of a condition part is m
o
as shown in Fig. 6. +J 1600
i -l
P
The weighted means of all of the fuzzy rules <S 1580
1580 1600 1620 1640 1660 1680
are summed to get a value of a slag formation.
Target of temperature at blow-end (C)
The membership function for oxygen accumulated
in the furnace quitely changes with past heat-
s. Therefore, the new system learns the mem- Fig. 7. Result of blow-end
bership functions for conclusion part of fuzzy temperature control.
rule 3-1 to 3-3 and 5-1 to 5-3 using a learn-
ing method. The method is parallel displacing
the membership functions to minimize differ- 60
ence between the actual results of slag forma- New System
tion in past heats and their calculated value- S ^ 50 h D Conventional
s. system
o o 40

The slag formation is controlled by the esti- 30


_
D
mated results obtained by the above-mentioned
20 D
"D" eQ , D ft D
method, in order to control blow-end phospho- Q G
rus and manganese contents, because they 10
are closely related to the slag formation re-
1 1 1 1
action. The control law is production rule 10 20 30
based on the operatorfs empirical knowledge.
[C] at the blow-end ( 10
-2 % )

As a result in actual operation, the accuracy


of blow-end phosphorus and manganese contents Fig. 8. Result of blow-end carbon
is improved compared with the conventional and phosphorus control.
method.

55
^80r produced satisfactory results in an actual
2 New System
D
operation.
g o 60}- Conventional
system

40 h D
u u
a
REFERENCES

20 - Iso, H., Y. Jyono, M. Kanemoto, Y. Ueda, T.


Yoshida, and K. Isogami (1987). Dynamic
1 1 1 refining control by analysis of exhaust
0 10 20 30
gas from LD converter. Trans. Iron
2
[C] at the blow-end (x 10" %) Steel Inst. Jpn., 27, 351-356.
Iso, H., U. Ueda, T. Yoshida, S. Osada, S.
Fig. 9. Result of blow-end carbon Etoh, and K. Arima (1988). Refining con-
and manganese control. trol of top and bottom blowing converter
by manipulating bottom-blown gas flow ra
te. Trans. Iron Steel Inst. Jpn., 28,
The results of control are summarized in 372-377.
Table 3. As shown in Table 3, the new control Kanemoto, M. , H. Yamene, T. Yoshida, and
system shows better results than the conven- H. Tottori (1990). An application of
tional system. expert system to LD converter process.
The new system applying symbolic and fuzzy Trans. Iron Steel Inst. Jpn., 30, 128-
method based on operator's empirical knowledge 135.
is proved to be a effective method for model- Shimada, M. , S. Oishi, N. Iwano, K. Naito and
ing an in-furnace reaction of BOF which a M. Maeda (1969). Computer control of
quantitative mathematical model is not BOF. Seitetsu Kenkyu, 266, 136-141.
perfectly obtained.

TABLE 3 Comparison of Results


a t Blow -end

New Conventio-
Item system n a l system

Number of h e a t s 64 53

Blow-end X 1623.8 1623.4


temperature(C ) 8.2 11.6

Blow-end X 17.2 16.6


[C] (X10" 2 %) 2.7 3.2

Blow-end X 31.6 28.4


[P] (X10_3%) 5.0 5.6

Blow-end X 57.1 54.3


[Mn] (X10" 2 %) 5.8 7.5

CONCLUSION

By applying knowledge engineering in BOF blow-


ing control, Nippon Steel Corporation has
developed a hybrid expert system combined with
the presently available mathematical model.
The feature of the system are as follows:

(l)The static control model is compensated


using rule-based system.

(2)The slag formation is estimated using fuzzy


reasoning.

(3)The dynamic control model is compensated


using fuzzy reasoning.

We demonstrated that the new system has

56
Copyright BFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

KNOWLEDGE BASED MODEL OF THERMAL STATE OF


METALLURGICAL LADLE
M. Hadjiski*, K. Spassov* and D. Filev**
*Sofia Technology University, Department of Automation of Production, Sofia, Bulgaria
**Central Laboratory of Bioinstrumentation and Automation, Bulgarian Academy of Sciences, Sofia, Bulgaria

Abstract The paper deals with an approach t o k n o w l e d g e -based models


const ruct ion usi ng b o t h a m a t h e m a t ica 1 model a n d addi t iona 1 emp i rica 1 a n d
e x p e r t data., A c o m p l e t e m a t h e m a t i c a l ma del o-F t h e t h e r m a l state of the
!-efractory 1 i ning o f t h e m e t a 11 u r g i c a 1 1 ad 1 e for t h e new t y p e of Direct
C u i-1" e n t L a d 1 e F u r n a c e ( D C L F ) h a s b e e n w o r k e d o u t T h e LI n c e r t a i n t i e s
:
connec:ted w i t h t h e e x t e n t o f t h e 1 i ni n g erosion,, t h e coeff i.c ients of
c o n v e c t :i. v e a n d r a d i a n t h e a t t r a n s f e r? t h e 1 i q u i d steel t e m p e r a t u re, and
11"ie p o s s i b 1 e v a r i a t i o n s in t h e t e c h n o l o g i c a l s t a g e s d u r a t i o n a r e u s e d to
obtai n m e m b e r s h i p f unct :i.ons ancl ru 1 e base of a k n o w l e d g e - b a s e d fuzzy
model of t h e ladle,; T h i s model is u s e d to estimate the mean thermal
c a p a c i t y a n d t h e i n i t i a 1 t h e r m a 1 s t a t e o f 1 a d 1 e:' s 1 i n i n g , w h :i. c: h is ve r y
useful for ladle m e t a l l u r g y control. Simulation experiments data are
displayed

K e y w o r d s F u z z y m o d e 1, K n o w 1 e d g e base Lad 1 e m e t a 11 u r a v , M a t h e m a t ica 1


m o d e 1 , R u 1 e b a s e , Thermal stte

INTRODUCTION metal bath w i t h the aid of DC e l e c t r i c arc


heating system to achieve the target
In the last few y e a r s ladle m e t a l l u r g y has temperature [ !
been w i d e 1 y u s e d in hi gh q u a 1 :i. t y st ee 1 2.. To calculate the current steel
production T h e a p p e a r a n c e of a new type t e m p e r a t. u r e du r i n g t r eat me nt and i ts
0 t: D i r e c t Cu r r e nt La d 1 e Fu r n ace (DC L F) p e r d i t i o n o n t h e b a s i s CD f a p r o c e s s m CD d e 1
CD f f e r s new p CD S S i b i 1 i t i e s fo r steel and r a r e di rect m e a s u r e m e n t s o f t h e 1 i.qui d
d e s u 1 p h u r i z a t i CD n u n til ve r y 1 CD W metal temperature
c o nc e nt r at i o ns .& o b t a i n e d T he CD t h e r The met a 11 ur g i ca I. ladle :i. n ever y cycle
advantage is the thermal efficiency of p a s s e s t h r o u g h four t e c h n o 1 o g ica1 stages
DCLF and reduced thermal erosion of the C F i g 2) w i t h d i f f e r e n t du r at ion.
1 i n i ng w h i c h i s due to t he c CD n s t r u c t i v e A s DCLF" w o r k s a s a buffer b e t w e e n t h e
p e c u l i a r i t y of the electric arc heating ni e 11 i n g f u r n a c: e (B O F , E A F) a n d c o n t i n u o u s
system (Fig 1) (Bavov P. and co w o r k e r s , c a s t i ng mac h i ne., t he st ag e du r at i CD ns j .
19 8 4) n 1" h e wo r k i ng CD U t, p at e nt i ng and
C i::1 4) c a n fo e q u i t e d :i. f f e r e n t f r om
int.)"oduct:i.CDn o-F DCLi::r is i n d i c a t i v e CDf t h e
nom i na 1 du r at i o ns dep e n d i ng on t he
need to develop modern control system
o p e r a t i on condi t ions a n d s:i. tuat ions.
ensur :i. ng ternperature and ehemica 1
composit:i.CDn of the 11 quid metal,, at the
There are different approaches to the
target v a l u e s in time
ladle's initial thermal state estimations!
1., The usi ng of stat ic regression
The analysis of Ladle Furnace (LF)
m o d e 1 s i n w h i c h t h e d u r a t i CD n s j . C i ~1., 4 )
aggregates investigations (Bakakin.,
M o r CD s h i 1 o v a n d G a 1 p e r in, J. 9 8 1 , B i rat and ar e i nc 1 u ded . as f ac t o r s - Th i s ap p r oac h
P e t e n i e -f: s 19 8 7, P f e i f e r a n d c o 11 e a g u e s,, g i ves i n s u f f i c i e n 1". ace u r a c y bec ause
19 8 7 ) a n d o u r CD W n e x p e r i m e n t a 1 e x p e r i e n c e considerable influence of other
< S a v o v;| 19 S 4? I-l a d .j i :i. s k i ., Bp assov and t e c h n o l o g i c a l f e a t u r e s of t h e melt are not
Savovj, 1989) have s h o w n that the thermal t a k e n i n t o c: o n s i d e r a t i o n.,
state of the metallurgical ladle has a 2 T he a p p 11 c a t :i. o n CD f a na 1 yt i c a 1
c o ns1 d e r a b l e inf1uence on t he st ee1 m o d e 1 s T h e t h e r m a 1 p r CD C e s s e s i n t h e m e t a 1,
ternerature in s e c o n d a r y s t e e 1 m a k ing. T h e t he s 1 ag and t he i nsu 1 at :i. on can be
est i ma t i CD n o f t he i n i t i a 1 t he r ma 1 s t t e o-f desc r i be d us i ng a na 1 yt i c a 1 m CD d e 1 s of
the ladle b e f o r e t a p p i n g CBOF or E A F ) . is dif f e r e n t c o m p l e x i t y CBakaki n, HCDroshii 1 CDV
of p a r t i c u l a r importance because the a n d G a 1 p e r i n , 19 81., B i r at and P e t e n i e f?
measur ed t emper atur e o f 1 i qu i c:i m e t a 1 i s 1984,, Hadjiiski, Spassov and Savov, 19891,
i n s u f f i c i e n t t o s o l v e t h e next important Pfeife!" ancl co 1 leagues, 1987) . Their
p r o b 1 ems sat i sfac t a r i 1 y s c CDmp a r a t. i v e a n a 1 y s i s i n d i c a t e s t hi a t with
1. T CD c a 1 c u 3. a t e t h e e 1 ec t r i c e ne r q y t h e s a m e c o m p u t i n g c o n d :i. t i o n s (initial a n d
a m CD u n t n e e c:l e d t CD b e i n t r o d u c: e d nto t h e boundary conditions, numerical v a l u e s of

57
t h e r m c \ p h y s i c a l tconstants,, test signals, m e t a 1 dur i ng t h e me 11,, it i s possi b3.e to
etc- ) the <a n e >c i i me ns io n a1 mode1s a re say
sufficiently iaccurate The pure
model b a s e d a|p p r CD a c h s LI f f e r s f r om (5) $(T,o> * i m
:
i n s u f f i c i e n t a c c iuracy b e c a u s e o f t h e lack
of a f f e c t i v e ada|p t i ve t u n i n g p r o c e cJ u r e s |t D u r i n g t h e t i m e that t h e ladle is e m p t y ,
w h i e h are due t o t he i nsuff i c i e nt num be r b o u n d a r y c o n d i t. i o n s a r e a c t i n g o n t h e
of mel a s u r e m e n t s under greatly altered i n s i d e o f t h e ladle w h i c h are determined
regime> condition* by i" a d i a t i o n h eat t r a n s -f e r wit h
i . T h e u< e io f l< n o w 1 e d g e b a s e d m o d e 1 environment
b a s e d o n t h e b a s is of fuzzy sets (Dubois
a n d Pr ade,{ 1980 , F i 1 e v j, 199 0,, X a f a r o v,, (6 ) )4>
# (T ,, 0) A ( (0 <T r. 0) + 2 7 3 ) 4- <#^ ,. ..-i-273
Dorohov and Ma rkov, 198,, Lee,, 1990,
Skala., 1978, T o m g, 1978) has been selected T h e g e n e r a l i z e d c o e f f i c i e n t A is o b t a i n e d
From va.r i o n s a p p roaches to know 1 adge-based by tak i ng i nto aecount am i s s i v i ty
m o d a l i . n g ( B r i s t o 1., 19 8 4;t l< a f a r o v ,, D o r a h o v c o e f (' i c i e n t s a n d c o n f i g u r a t i a n f a c t o r s
a n d Mi r k o v . 1986 >i i n a c y 1 i n d e r (K r e i t h a n d B l a c k , 19 8 0) .,

The approach, described below,, c a n be D u r i n g t h a heat i n g o f t h e 1 adle it is


regarded a s a step towards a better c o n s i d e r e d that t h e g a s b u r n e r i s o f a n
t.i n d e r s t a n d i n g o f h o w t CD C O m b i n e k n CD W 1 e d g e i nvar iab1a heat i ng power and t he exceedi ng
of d i ffere nt o r i g i n s, so as to ma k e CDf a g i v e n t a m p a r at ur e o f t h e i ns i c:le
moda1ing mora t r u s t w o r t h y . s u r f a c: e i s i m p CD S S i b 1 a

H e i- e t he t he r ma 1 stte of 1 a d 1 e:' s O^T.O) <#


(7) b
i nsu 1 at i CDn i s reqar ded a s a c o n t i n u o u s
dynamic system of which we have e<Ti0)= T 2 Tb
'b
cons i de r a b 1 e p re 1 i m i nar y i n f o r m a t i CD n
c onc e r n i ng t he p oss i b 1 e be hav i o r in where j,, is the time instant when $(T!>C*3
d i f f e r a n t c o m b i n a t i CD n s o f t h e i n p u t s? aehleva maxima1 possib1e value tb>
whosa c haractsr is i rregu1ar and
p a r i o d i c a 1 "!" h e u n c e " t a i n t y, w i t. h w h i c h r e1a t i v e heat f1ow,,
t he i n i t i a 1 a n d boundar y cond i t i o n s o f
e v e r y c y c l e c a n be d e t e r m i n e d a s well as The mat hemat ica1 mode1 <I) + < 7) can be
some parameters which are difficult to so Ived numer i c a 11 y by be i ng t r a nsf CD r ma d
m e a s u r e H such as for i n s t a n c e the extent into a system of algebraic equations In
of t he lad 1 e:" s erCDSi CDn m a k a f uzzy m a d e 1 :i. ng t he cCDnt r i but i CDn t in i s is dona accCDr d i ng to
a p r e f e r a b l e a l t e r n a t i v e in t h e a c c u r a c y t he met hod of cont ro1 v o1ume (Pat a nk a r
19 B 0) ,, w h i c h i s a f u 11 y i m p 1 i c i t i nt ag r a 1
race w i t h a p u r e model based approach,,
method with a guaranteed convergence.

0u r ow n axp er i a nc e and s i mu 1 at i o n
MATHEMATICAL MODEL OF A e x p a!- i m e n t s fo r a m u 11 i t u d e CD f
METALLURGICAL LADLE c om b i nat i o ns CD f . s how t hat ac t i ve
The thermal processes in t h e wall and dynamic: pre:)cesses of heat transfer are
bottom o f t h e m e t a l l u r g i c a l ladle (Fig,, 1) t a k i n g p 1 a c e i n a n i n s u 1 a t. i o n lay e r wit h
t h r oug h eq u i v a 1 e nt t r a n sfo r mat i o ns c oma t o t h i c k n e s s o f 15 cm ( F i g 3 ) . In t h e rest o f
one dim e n s i CD n a 1 t he r ma 1 c o n d u c t i v i t y,, t li e r ef r ac t o r y 1 i n i ng the r e is a
m u 11 i --1 a y e r ,, c y 1 i n d r i c: a 1 w a 11 w i t h CD U t p r ac t i cally sta t i ona r y t he r mal
i nt a r na 1 s o u r c e s,, das c r i ba d by the c o n d u e t i v i t y,, w i t h v a r i o u s t emp e r at u r e
fo H o w i ng d i ffe r a nt i a 1 aq uat i o n d i f f e r e n c e s T h i s r e s u 11 i s i n c CD n f o r m i t y
w i t h t he data o b t a i ned f r o m B i rat a n d
P e t a q n e i. f (1987) a n d P f e i f e r a n d a t h e r s
><T> r )
dB(-r>^ (1984)
(1)
T dr
rr 0r TI")e m e t a 11 u r g ica 1 lad 1 e h a s a s p e c i-f::ic
character istics:
w h e r e t h e initial c o n d i t i o n is
1 T ha r ef r ac t o r y 1 i n i ng bac omes
e r CD de d du r i ng t h e p r oc a s s i ng of t he 1 i qu i d
(2) $<0,(r> (r) metal (stage 1). This leads to a
n o n - - s t a t i CD n a r y c h a n g a in t h e the r m a 1 s tte
7"he bCDundar y condi t i on o n t he o u t s i de o-f of t h e ladle,, For this reason the
the ladle is date rm ina d by nat u ra1 i nsu1 a t i o n t h i c k n e s s r e g a r d s a s an i nput
convection i n t ha fu Z Z y mo da 1
2 Parameters o f t h e first stage
(3) ^(T!.R) C<<S<T-R> 3air> (1 i q u i d met a 1 t reat ma nt ? F1 g 2) -- dur at i o n
0 n t h e i n s i d e o f t h e 1 a d 1 e, t h a b CDdary
Un
7 .. a n d steal temperature & ., can
m'
be
condition depends on the stage the 1a d 1 e nag lee:te- d as an inputs in the model due to
is in (Fig 2) Dur i ng t he p r o c ess i nq al ways i , >40 mi n and t he t her rna 1 ba 1 ance
s t a g e t lie ladle i s f u 11 o f 1 i q u i. d nie tali' of i nsu1 at ion' s active layer
w i c h i s s t j. i- r e d i n t e n s i v e 3. y w i t ]-h g o an is to be
achieved and may be assumed constant
(Ar) F o r t h i s reason, a m o d e l c a n be
1hr e a t e n o f a n i dea 1 m i xtur e i ng CDf the because of the temperature control system,
1 i q u i d m e t a 1 "i" h a r e f CD r e i t c a n b e a si umed The thermal model of the ladle is
t h a t t h e t a m p a r a t LI r e CD f t h e i n s i d e of the solved w i t h a s u p p o s i t i o n o f a con s t a n t
ladle is equal to t h e t e m p e r a t u r e of the thermal p CD wer of the gas burner CW-ISOOO
liquid metal kW). If tl"ieI"e is a deviat ion f rom this
(4) $<*0) - $m<T>
value,, w i t hout i nt r o duc i ng a naw
independ ant input factor,, the result
Ta k i ng t he mea n t amp a r at u r a of t ha
effect c a n b a t a k e n i n t o a c c CD U n t by
iquid

58
d e s c: r i p t i o n CD f t h e 1 ci e a 1 a b j e c t c a si b e
c: o r r e o t i n g t h e i n t e r v a I 7,,,. d e f o r m e d by way of a c h a n g e in parameters
, af f , b. B y i n t r o d u c i n g s u i. t a b 1 e p r o c e d u r e s o f
conformity between the prototype and the
r e a 1 o b j a c t., t h e r a s p e c t i v e members h i p
where t h e .,,. is determi ned a s a f unct :i.on f unct i o n s u ( x) <:::an be determ i ned The
of t he de v i at i D n of t h e bu r ne r p owe r W f LA r t h e 1- d a v a 1 o p m a n t o f S k a 1 a? s met ho d
m a k e s it p o s s i b l e to t a k e into account
(9) 7.,, - p-riW) a dd i t i CD na 1 emp i r i c a 1 dat a and ex p e r t
k n o w I e d g e., " h e m e m b e r s h i p f u n c t i o n s a r e
Using t he m a d e 1 wo rke d o u t , so1LAt ions w e r e f o r m e d in such a way a s to have t h e
0 b t a :i. n e d f o r a 1 a r g e n u m b e r o f r easo na b I a f: o 11 o w i n g p r CD p a r t i e s s
c o m b i n a t i a n 3 b e t w e e n t h e :i. n t e r v a l s . a n d I,, "!" o m e e t c o n d i t :i. o n s sat do w n
1 be-fare hand. T h i s m e a n s that the m e m b e r s h i p
t hi e t h i. e k n e s s a f t h e a c: t i v e 1 a y e 1 o :: t h e f u n c t i o n s m u s t b e i n c o n f CD r m i t y wit hi t h e
1 adIe., T h e s & r a s u I t s a r a idealized data nCDm :i. na 1 par ametar s of t ha mCDda I of va 1 u e s
about t h e thermal s t a t e of t h e ladle and a
1 i D !- i i n f o 1" m a t i o n a b o u t t h a c: r e a t i o n o f 2 , T h e e ;< p e r i m e n t a 1 i n f CD r m a t :i. o n a n d
i t s f u z z y fnocie 1 T h a i r i m m a d i a t e u s e a s the a m p i r :i. c a 1 d a t a ara .j u s t :i. n i t i a 1
skeleton data base for situation i n -f a r m a t i a n a b o u t -f LI r t h a r p r o c e s s i n g of
c a n t !- a 1, u s e d in some c: a s e s is the m e m b e r s h i p functions,, H e r e belong the
LA n ace is p t a b 1 e, d u e to t h e i n s u f f i c :i. t p o s s i b l e d e v i a t i o n s of t h e t h e r m o - p h y s i c a l
a c c u r a c y o f t h e m o d e 1,, d u e t o c CD n s t a n t s i n t h e m o ci e l , t h e c h a n g e in t h a
s i m p 1 i f i c a t i D n s i. n t h e m o d a 1; t h i c k n e s s o -f: t h a r ef r ac t o r y 1 i n i ng , t h e
b o LI n d a r y c: o n d i t i o n s and mo r e c h a n g e i n t hi e b o u n d a r y c CD n d i t i o n s
p a r t i c u 1 a r 1 y t he c o e 1" f i c i a n t US O f hi e a t 3 A d d i t i o n a 1 c o n c:l i t i o n s o -f: t he
t r a n s f e r ^ a n d e s p e c i a l 1 y f o r 11"! e :i. n s i d e membership functions c a n be imposed on
:
o-f t h e l a d 1 a a r a g i v a n a p r a x i m a t a 1 y 5 t he i r s h a p e and 1 ocat i CD a s we 11 a s dur i nq
t he d i rect in a a s LI r a m e n t s of t ha p r o c e s s i n g p r o c e d u r e (Skala, 1 9 7 8 , To rig,
i n s u 1 a t i o n t e m p a r a t u r e t hi a u g h p a s s::. b I e a r a 1978) ,, I n t h i s way a m p i r i c a 1 c! a t a a n d
a c: D m p I a ;; e n g i n a a r i n q p r o b 1 e m:; e x p e r t k n o w l e d g e c a n be t a k e n account of
t he i nsulat ion a r a s i o n data ara c o nc e r n i ng t he p CDSS i b 1 a ma x i mum and
s t a t ist i c a 1 1 y aver age and cQnsi derab1e m i n i n u m d u r a t i o n o f: t a c h n o 1 o g i c: a 1 stage
d e v i a t i o n s a r a p o s s i b 1 a; 7 , ,, T h a in e t a 11 u r g :i. c a 1 la d 1 a i s r e g a r d e d a s
t hi a s a I u t i a n o -f: t h e i n v e r s e p r D b .1 a m
a m u 11 i f a c t o r o b j a c t w i t h 4 i n p LI t s a n d 4
b y w a y a f a q u a t i o n s a f t h e t hi a r m a . b a 1 a n o e
o u t p u t s (Fig 5 ) .
of t h e ladle of the b a s i s of temperature
T ha ci u r a t i o n CD -f:: t h e t e c h n o 1 o g :i. c a 3.
m a a s u r erne n t a. o f t hi a 1 :l. q u i. c::l m eta 1 is
s t a g e s . <i=2,3., 4) (F'i q u 2) a s well as
:i. nac:c:ur ate dua t o t ha sma 11 possi b 1 e
number" of m e a s u r e m e n t s ; t hi a t h i c: k n ess CD f t he :i. n s u 1 a t i o n ma r k a d
f a r t hi e s a m a r aas o n (t h e s rn a 1.1 p,...::::: d, a !" a c o n s i d e r a d t CD b e t hi a i np u t
n u m b a 1- a -f: m e a s u r a m e n t s ) ,, t h e a dapt i v e
f a c t o r 13. T h a o u t p u t f a c t o r s a r e s how n in
! o c a d u r a s,, t h r a u g h a c u r r a n t a s t i m a t i o n
F ig 5 a n ci a r e s
o f t hi a p a 1-" a m e t e r s , a r a i. n is u i'f :i. c: i e n 11 y
effect ivs. ^ m a a n i n t a q r a 1 t a m p e r a t. LA r a o f the
a c t i v a p a r t o -F t h e i n s LA 1 a t :i. o n |i
Bear i n g i n m i n d a 11 t hi i s, i t i s a pa d i a nt ^ . ( i = 1,2., 3) t amp e r at u r es in t hi e
for the thermal s t a t e of the ladle to be
t h r e e z o n e s of t h e a c t i v e part of the
estimated by m e a n s of a k n o w l e d g e based
t hi e r f n a 1 i n s u 1 a t i CD n,,
mo de 1 us i ng c o ns i de r a b 1 e a pr i or i
" hi e b a s i s o -f t he f o r ITS a t i o n o -f: t h e
i n f CD 1- m a t i a n T h i s a p p r o a c h na c e s s i tte s
m e m b e r s h i p f u n c t i o n s is a set of s o l u t i o n s
f u rt ha r de v a 1 o p m e n t o f t h a i deas, pu t
CD f t hi a m a t h e m a t i c a 1 m o d a 1, w i t h d i ffe r t
forward from Sei insky and Guez (1989),
c CD m b i n a t i o n s CD f t h e i n p u t v a r i a b 1 e s H 1" hi e
m a mtaa r s h i p f LA n c t i o n s a n d t h e r u 1 e b a s e a r e
CD t a i n e d by t a k :i. n g i nt o ae c ou nt t hi e
BASIC CONCEPTS e m p i )" i c a 1 c:lata,, a x p e r t k n o w 1 acJga a n d t h e
v a r i a t i o n o f t hi a paramete rs o -f the
T hi e r e 1 s c CD n s i d e r a b l e e x p e r i e nc a i n t he i d e a 1 :i. z e d t h e r m a l m o d e 1
c o n s t r u c t i o n of f u z z y m o d e l s on t h e basis
of e x p e r i m e n t a l data (Czogola and P e d r y c z ,
19 81 , F :i. 1 a v , 19 9 0, K a f a r o v, Do r o hov a n d
Markov,, 1986,, S k a l a , 1978, ' Tang, 1978) DECISION MAKING LOGIC
T hi a b a s 1 c s t r LI C t LA r a 1 a 1 a m a n t s o f s u c h
: D e c :i. s i o n m a ki ng logic is basa d on
mo de 1 s a r a s how n i n F i g 4 A ma j o r
dete r m i n i ng the s i m i. 1 a r i t y between t he
p r o b 1 em j st a n d i ng i n t hi a way o f t hi e
i nt e ns i ve ap p I i c at i o n of t ha t hi a o r y a f newly obtained data about the input
fuzzy s e t s is t h e fact that the adequacy v a r i a b l e s p. <i--:L,4) and the N rules r.
* 1 1
of t !e mam be r s h :l. p f unc t i o ns i. s d i f f" i c u 11 o b t a i n e d about eve r y c o m b i n a t i CD n o f i LA t
to check (Dubais and P r a d e , 1 9 8 0 , K a f a r o v ,
Dorahov and M a r k o v , 1986) v a r i a b l e s P. , (i- 1,N, j"" 1 , 4 ) , w h i c h take
the form CDubo is and P r a d e , 1980)
I n t hi i. s c o n t r i b u t i o n , t h e d a t e r m i n a t i o n o f
t h e m e m b e r s hi :i. p> f u n c t i o n s f o 11 o w s, as a
( 11) !-1 . ;: IF (P
11. , 3.,'ZP P ....14
13' k 1 fjl .
, P . ) THEN
v
g e ne r a 1 d i r ec t i o n t ha mat ho d p r op ose d by
S k a l a ( 1 9 7 8 ) , w h i c h is based on t h e idea (i-~l,N, k^0,3)
of t h e setting down of a p r o t o t y p e (ideal
object). w hi a r e u p p a r s c r i p t L. m e a n s t hi e nom i na 1
P. . are linquistic variables,, the vector
v a1 lJ u e of input and output variables As
w h e r e x <e X e l e m e n t s of t h e u n i v e r s a l
o
set o f t h e i n p LA t v a r i a b 1 e s p "-"= (p , p.~., p ~r, p )
X, b~ -" p a r a m e t e r s of t h e ideal object The

59
nominal periods are taken 7^=100 min,
must be qualified as a membership
f u n c t i o n , a s foilowss 7 o~ 5 0 min, 7-,.= 180 min, 7^=30 min.

f 1

J
j
J
p
A e c o r d i n g to e m p i r i c a l data,

mm per 100 min metal


the

treatment.
average
s p e e d of e r o s i o n of the lining t a k e n is 1
The
(12) . <x> = -
t h e r m a l s t a t e of t h e ladle is m o d e l e d for
a p e r i o d of 3 0 m e l t s , w h i c h is determined
The p r o x i m i t y of t h e p. data w i t h t h e P. .
by t h e a d m i s s i b l e extent of e r o s i o n of the
element is d e t e r m i n e d by the similarity r e f r a c t o r y lining. T h e output temperature
operator Idn (Dubois and Prade, 1980, #o is presented according to the
Skala, 1 9 7 8 ) 5
experimental data with 7 linguistic
variables &
(13) Idn (p./P. .) ~ U (u (x.) W n (x.)) 0
j 1 j w j J * ' frb j j The fuzz ification of the mean integral
t e m p e r a t u r e a r o u n d t h e nominal values of
w h e r e x. is the* s p h e r e of definition of
J the linguistic variables takes into
t h e p a r a m e t e r ^, () is membership
account both expert knowledge and the
ij accuracy of the measurements. The
f u n c t i o n of j input p a r a m e t e r in i rule. t e m p e r a t u r e of the 1iqui d metal is in
he s i m i1a r i t y of t he e t i r e i np ut vector r e?a 1 i t. y m e a s u red with a accuracy of no
with all data from the i rule is m o r e t h a n 5 * C On the other hand errors
obtained as a generalization of (13) as a 1-e a d m i s s i b l e i t h e estimati0 f the
foilows: t emp e rat u r e of t he 1i quid met a1 w i t h i n the
b o u n d a r y 2 0 " C . D u e to t h e heat transfer
I d
(14) Idn (/.) = n (p./P. .) w i t h t. he I i i n g t h e s e data g i ve the basis
of the choice of membership functions
with t h e following designation for the v
o
c o m p o n e n t s of t h e vector of t h e i-rule
(i=l,N) I t he c a s e of a 100 1 0 ladle concerning
in the paper, the correlation of the
(15) P . = (P. , ,.^,.,,,. .)
9 9 t h e r m a l c ap ac i t i e s is ap r 0 i mat e1y 2x1
11 12 13 * 14
his r esuIt allows f0r t he mem be r ship
The membership degree function ^ () of
f u n c t i o n s to be t a k e n a s trapeziums with
bases of 20 C and SO C, situated
the output temperature (k-0,1,2,3) is
s y m m e t r ica11y a r o u n d the nomi na1 v a r i a b 1 e s
obtained from t h e expressions .
( 1 >6 s u < In d (/ p 3 *o
< V p i * <V>
k i = l,N ^ki The membership functions of the input
E x p r e s s i o n (16) is t h e b a s i s on w h i c h the v a r i a b l e s 7 . (i=i,5) w e r e o b t a i n e d on the
participation of every i rule is
b a s i s of t h e ideal images received from
e s t a b l i s h e d in d e t e r m i n i n g the functions
the mathematical model, the accepted
of t h e membership extent ^() of the
ling LI i s t i c vari a b 1 e s d t he e pert
output temperatures
knowledge. The admissible intervals of
alteration of the input variables are
DEFUZZYFIER
from the expefU' The m e m b e r s h i p f u n c t i o n s
0 bt a i ne d a r e s how i F ig. 6..

T h e d e t e r m i n i n g of an estimation of the
Using all p o s s i b l e c o m b i n a t i o n s among the
output t e m p e r a t u r e from (16) can be
nom i na 1 a 1Lies of t he i np u t 1 i ng u i st i c
carried out according to different
variables 4J 240 nominal output
a l g o r i t h m s . In t h i s paper is calculated
L
by taking the weighted average of the t e m p e r a t u r e s v& . a r e c a l c u l a t e d with the
oi
t e m p e r a t u r e of all t h e r u l e s and c a n be aid of the mathematical model. The
derived a s 2
f u z z i f i c a t i o n of every is done on the
basis of empirical knowledge, computer
h simulation with varying termo-physical
(17) $ *
constants in the analytical model and
optimization procedures. Membership
functions ($.) a r e obtained to have

"oi
(k=0,1,2,3)
t r a p e z i u m form with b a s e s of 2 0 C and 160
C;, placed symmetrically around - This
FUZZY MODEL OF MEAN
is t h e way to form the mean integral
INTEGRAL TEMPERATURE
t e m p e r a t u r e ' s ru1e b a s e ,
The estimation of the mean integral
temperature ^ is necessary for the
FUZZY MODEL OF THE INITIAL
balance calculation of the necessary THERMAL STATE OF THE LADLE
a m o u n t of e l e c t r i c e n e r g y which must be
i n t r o d u c e d into t h e metal bath. The e s t i m a t i o n of three temperatures of
t h e i n s u l a t i o n layer makes it
In order to d e r i v e t h e ideal images from
p o s s i b l e t o set down trustworthy initial
the mathematical model, the following

60
c: o n d i t i o ns o f t h e t ha r ma 1 mo de 1 i. n t he m o d e 1 j::) r e c i s J. o n w j. t h m o r e t he n 30 "A i n
control s y s t e m of t h e ladle r e 1 a t i o n t o p u r e e x p a r t ap p r o a c h .

The a b o v e m e n t i o n e d mathematical model T h e t h e r m o p h y s i c a 1 c: o n s t a n t s i n t h e m o d e 1


p r o v i d e s f u 11 a p r i o r i i n f a r m a t i o n a b o u t h a v e b e a n v a r i e d up t o 3 0 + 5 0 "A a t t h e same
the ideal images of the vector i np ut at a vac t o r s -C p , 3.. An a na 1 og ous
: " ^B-\9 Bo $ -r* w**n e v e r y cmD*na^ *n -f: ^, e c r i ter ion to <19) have been used
i np u t va r i a b 1 es p: (j. ~~ 1 , 4 ) T he ap p r oac h
i n t he f a r m i ng of a f uzzy mo de 1 of t he (
i n i t i a l thermal state is analogical to <20) J.U . - V N <P-rt, > </?
ff y m i ^ ' k u m j K '
^
IM
t h a t o f t h e mean i n t e g r a l t e m p e r a t u r e The
following procedures have been carried i = i s 2 , 3 ,, .j == 1, 2,, 3 H k == 0 , 1, 2 ,, N 3.3 5
out
where N is t he nufnber of expar i ments.
1 T h e member s h i p f u nc t :i. o n s of all
F i g u !" e 8 i 11 u s t r t e t he c omp a r at i v a
i n p u t v a r i a b l e s are p r e s e n t e d ( F i g . , 6 )
analysis wo r ke d o u t-.. R a s u 11 s ar e
2 . Mew out p ut mem be r s h i p f u n c: t i o n s
s u m m a r i z e d i n 1" a b l e /
a r e o b t a i n e d f o r e a c h o f t hie ternper a t ur es
i he '.- oh i i n n i J , .'-1 -IM L'.-J the
_ v
"'u Ni W ! ulf, tn f ' c n':d ft" di h D p i upO<_,i -d flu * V mode : u f liif t O MM q [ r i J
1
icii'!; LS :ui f j .v.' M y : ^ " ; i'--' u -"tr
MIL' lu-mpei a L u i .-*>, LIui-_> t o i m.i nrj I h
> "*- * > n t c c j i =" _ ; i l in i* r >*
J ndrjp-'ntli-M^t l u J t ? ba<='."j 1
I TiU l i l t I f 11 <-, L-i !>f. i." ' i f i i ^ l ' . ' d W l i h ! / ' V r
- 1 he I U J P S a n d mr^mbrM S ! I unt L i ' M i . ,
16. 'C p i ' i LU o-i., '.A,hicl - : iji n : - pur i^-it to
a I o v e i . i t / i - H p: u c i ' i ' m q fron I h r ^ nee
6 ' \ + 0 '"'"' f i h-1 i u u J c c c ' i : i . The- fno. it I
c o r r e I iW Ljn
J J C j ' j i t o r-iDLlin." fflhii 7) (V C LUIS ' C-
1
q U L C I i. y t i i l T i > ' i f a , I J y t J ' v I KH K .

CONCLUSIONS
SIMULATION RESULTS
Knowledge based models of the mean
Simulation experiments have been worked
out t o obtai n qua1i ty and quantity i n teg r a 1 t e m p e r a t u r a a n d :i. n i t i a 1 t h a r m a 1
c o nc 1 us i o ns a bout t he i nf 1 ue nc a of s t a t e o f t h e l a d l e h a v e b e e n w o r k e d a Lit
the p a r a m e t e r s on t h e fLizzy models for a new t y p e o f D i r e c t Currant Ladle
F' u r n a c e (D C L.. F ) - "!" h a r e s u I t s o b t a j. n e d s h o w
1,, 7' h e n u m b s r o f i np ut 1 i ng u i st i c that t h e c o m b i n a t i o n of data of of a
v a r i a b l e s of each of t h e i n p u t s , have been m a t h a m a t i c:: a 1 m c^ d e l , e m p i r i c a 1 r e s a a r c h a n d
change d, w h :i. c h i s e q u i. vale n t t o v a r y t h s
number of rules N. a x p e r t n o w J. e d g e is
2 "I" h e 1 o c a t i o n s o : t h a nom i na 1 better r e s u l t s than t h e p u r e modal based
approach
v a 1 ueB of 1 :i. n g u i s t i c va r i a b 1 es 7 have
been varied REFERENCES
3- Parameters o f the membership
f i q u r e s o-f: i np u t 1 i ng u i st i c va r i a b 1 e s a n d B a k a k i n., A , B H o r o s h i 1 o v a n d G., G a 1 p e r i n
r u l e s have been analyzed., (19 8 1 ) ,, ivi a t h e m a t i c: a 1 m o d e 1 o f t he
t hi e r m a 1 p r oc asses in sac o n da r y
hean s q u a r e error have been used a s a is t e a 1 m a k i n g .. I z v e s t i a V U Z - Cher nai a
c r i t aria n of t he m o d e 1:' s a d e q u a e y,, m e t a l l u r g y a , 4 , 143 147 (in R u s s i a n ) -
c a l c u l a t e d o n t h e data s e t d i f f e r e n t from B i r a t , J.!:::' and J P e t e g n i a f:: ( i 9 8 7) .,
t he nom i na 1 va 1 ues C o m p o r t a m e n t t h a r m i q u a d e 1 "' a c i a r , d a
1 a o c h a et d u r a p a r t i t a u r d e c o u 16 e
I n c o n t i n u a = R e v u e d e M e t a l 1 u r g i e - C I T ,
/ " A o u t - S e p t a m b r e, 6 2 5 6 4 5
BristolHE. (1989)1 Objects, rules and
m "F p!" oc: ess c o nt r o 1. Proc of t h e A C C , USA.
w h e r e ft (p.) and $ (p.) are values of the
o v h
j o j CzogolajE. and W.Pedrycz (1981). On
mean integ r a 1 teinper a t u r e o-f the i d e n t i -f i c a t i o n i n -fuzzy s y s t e m s a n d
m a t In e m a t i c a 1 a n d r e s p e c t i v e 1 y o f t h e f u z z y i t s a p p 1 i c:at i o n s i n c o n t r o 1 p i"ob 1 e m s
model at t.hie s a m e i nput p a r a m e t e r s vector Fuzzy Sets and Systems,, 16,,
p . (j 1, 4) . The n u fn b e r o -F e x p e r i m a n t s n A c a d e m i c P r e s s , N e w Y o r k >,
Du bo i s j, D . a n d H - P r a de (1980) . F u z z y S e t s
v a r i e s f rom 135 t o 6 2 5 .
and S y s t e m s : T h e o r y a n d Apllications
A c: a d e m i c: P r a s s N e w Y o r k ,
I r r eg u 1 a r 1 oc at i o n of t he nom i na 1 va 1 ues
FilevHD (1990). Towards the consept of
J
p " a n d ~" mor e de n s a 1 y at t he beg i nn i ng q u a s :L 1 i n a a r f u z z y s y s t e m s . Pr oc o f
t h e Int Conf o n F u z z y Logic a n d
of wor k i ng a r e a , is pre-ferab 1 e due to N e u r al N e t w o r less. I i z u k a, J a p a n
n o n --1 i n e a r i nf 1 ue nc e o -f: t h& i np u t
H a d j i i s k i , !v! :i K Spa s s o v and P. S a v o v
p a r a m e t e r s (p.. a n d p^> ,
(1989) ,. M a t h e m a t i c a 1 mo de 1 o f t he
t h e r m a 1 p r o c e s s e s i n 1 ad 1 e m e t a 11 ur gy,.
Number o f 1i nguist i c variab1es i s of g r e a t Modern Steelmaking and Casting
i m p o r t a n c e for t h e model accuracy With Tec h n o l o g i es - I V Int Conf- J une
the number of linguistic variables 15 1 7 , A1 b e n a., B u 1 g a r i a. ( i n B u 1 g a r i a n)
(Z, 3? 3,5) t hi e.mi n i m u m v a l u e o f t h e e r r o r Kafarov ! ! y u ,, I.Dorohov a n d E. M a r k o v (1986)
(19) is 2 3 4 C . w h i l e w i t h (3,4,4,5) t h e S y s t e m A n a l y s i s of C e m i c a l Process
e r ror d ec r e a s e s t o 1 6 , 0 6 " C (Fig. 7 ) . C o n t r o l C A p p l i c a t i o n of F u z z y Sets}.
N a u k a., M o s c o w., ( i n R u s s i a n)
An a p p r o p r i a t e choice of membership K r e i t H F a n d W . Z - B 1 a c k (1980) Basic Heat
f u n c t i o n s p a r a m e t e r s c a n improve? t h e f u z z y T r a n s f e r H a r p e r a n d R o w, N e w Y o r k..

61
Lee,, C. C. < i 990) I nte 11 i q e n t c o n t ro 1 b a s e d S e 1 i n s k y,, J,. a n d A,, G n e z (19 8 9) ,, "!" h e r l e o f
o n f u z z y 1 o g i c a n d n e u r a I n e t t h e o r y. a p !* i D r i k n a f l e d g e a f p 1 a n t d y n a m i c s
Proc* of Int Conf on Fuzzy Logic and :i. n n e u r o c o n t r a 1 e r des i g n.. P r o c # o f
Keural Networks,. lizuka., Japan,, CDC Tampa,, Florida., USA,,
Pfeifer, Hu FFett,, H Schfer,, KJi,Heinen Ska 1 a,, A.. (197S) On many va 1 Lied 1 og ic ,
(1984). Modell Zur Thermischen f u. z z y s e t s,, -f:: LI z z y 1 o g i c: s a n d t h e i r
Simulation Von Stahlgie., Stahl und ap p 1 i c a t i o n.. F u z z y Set s a n d Syst e m s .
Eisen,, 24, 1279-1287. V o 1. 1,, A c a d e m i c: F:' i- e s s,, M e w V o r k.
Patankar,, S (1980) . Numerical Heat Transfer Tang ,, R (1978) ,, Sy nt h e s i s of f uzzy made 1 s
and F l u i d Flow., Hernisphere., N e w YDI-k f o i" i n d u s t r i a 1 p r oc esse s some
Savov,, P. (1984) Met. had o f t reat i ng a n d p r e s e n t r e s u l t s . Int J G e n e r a l Syst*
! e f i n :i. n g I :L q u i d m e t a I a 11 y e s by vol. 4.
d i r e c t c u. r r s n t e 1 e c t r i c a r c h e a t. i n g,.
US Patent 4 462 878
= T 80
Pi 2_
T
P2 3_
METALLURGICAL I
= T
P3 4 LADLE 2
P4 - < ^
*3 "

Fig. 5 Metallurgical ladle as a multi factor object


kOfiun)

M _.
f AR60N

Fig. 1 Sketch of DCLF


TOfmSnj

k= ==d
" f-
U~-
l- h-

EMPTY
\l I
EMPTY
J
l- - fl
LADLE LADLE 7 /<0 itO 200 250 fmln]

\
K
- - f
- T> -
-1 _^
/ ' "' \
/
Fig. 2 Technological stages through which
the ladle passes within one working cycle
\ 2QCcmJ
1
5 '
/k
Fig. 6 Membership functions of the inputs

rffcjf
20 i

0 ft HfH,.l\.IWh fl ft.
-
Mil "," y y
V I L)

'.'

35 dcm] Fig. 7 Mean quadratic error of the fuzzy model


Fig. 3 Temperature distribution in the refractory in comparison with analytical model
lining at the end of every stage of one cycle

RULE
BABE

MEMBERSHIP
FUNCTION DEFUZZIFIER

Fig. 8 Comparative analysis

'>0c
DECISION
,* c C *, a,v" ^ C
L.061 C
14 50 12.77 12. 12
Pi 16 06
! P2 29 50 22.48 21.05
Fig. 4 Structure of the fuzzy model
Table 1 Results of comparative analysis

62
Copyright IFAC Expert Systems in Mineral and CONTINUOUS CASTING AND
Metal Processing, Espoo, Finland, 1991 ROLLING

APPLICATION OF EXPERT SYSTEM TO REAL TIME


COLD COIL TRANSPORTATION CONTROL IN
FINISHING LINE
Y. Anabuki*, R. Owaki* and H. Sakiyama**

*Plant Control Technology Section, Mizushima Works, Kawasaki Steel Corporation, Kawasakidori 1-chome,
Mizushima, Kurashiki 712, Japan
^Electrical Steel Technology Section, Mizushima Works, Kawasaki Steel Corporation, Kawasakidori 1-chome,
Mizushima, Kurashiki 712, Japan

Abstract. Expert system (ES)-assisted control has fully automated the operation of
transportation facilities on the electrical steel finishing line, in the Mizushima Works of
Kawasaki Steel Corporation. The system forms the optimum schedules for transferring slit
coils and transfers them all on-line by 24 monorail-type coil carriers under its control. In
order to obtain the necessary response for real-time use, control rules are divided into groups
according to their functions, and only designated rule groups are made to run at one time,
securing a response of less than 3 seconds. The adoption of ES has enabled the system
successfully to handle the complex operations of the line, and reduced the man-months
required for software development to one half compared with using conventional
programming language. The system has been working smoothly, and contributing greatly to
the high-efficiency operation of the line, since its commission in March 1990.

Keywords. Expert system; computer control; real-time computer system; transportation


control; steel industry; electrical steel finishing line

INTRODUCTION Mizushima Works has also been automated by ES.


This application is described in the present paper.
The wide ranging and rapidly changing needs of
customers, with the consequent short-term life cycle BACKGROUND TO THE DEVELOPMENT
of products, have recently become a feature of the steel OF THE EXPERT SYSTEM
industry. In step with this, a flexible and efficient
programming method is required in information Outline of the Finishing Line
management systems. To meet this requirement, the
expert system (ES), which utilizes expert's knowledge The purpose of the electrical steel finishing line is to
stored in the knowledge base of a computer, has been slit the coils according to a customer's order, and to
applied to many tasks such as fault diagnosis in assure the quality by inspecting the surface of the
machines, and to the solution of designing and coils. With the aim toward a flexible manufacturing
planning problems (Koroku,1990). However, almost system (FMS) , this is the most up-to-date finishing
all of these applications are used off-line systems, with line, with the slit coils automatically being trans-
a few exceptional on-line use examples. ported by monorail carriers suspended from C-hooks
( see Fig. 2 ). Full automation has been achieved with
Kawasaki Steel Corporation Mizushima Works has automatic coil inspection and wrapping.
applied ES to real-time t r a n s p o r t a t i o n control
systems, for example, flatcars on the billet condition- The material flow down the finishing line is shown in
ing line and No.l EGL (electrolytic galvanizing line) Fig. 1. The coils are slit on two slitting lines (SL),
coil yard are controlled by ES. The advantages of with the coil grade being decided by surface
applying ES to t r a n s p o r t a t i o n control can be inspection, according to the information from the
summarized as follows (Tsushima,1885): automatic surface inspector on the annealing line.
According to the width, the flow of the slit coils is
(1) The complex transportation control can be divided in the following two ways:
systematically planned in stages by knowledge. (1) Narrow coils ( 40-500mm width )
(2) The productivity for creating software can be After separation by the coil separator (CS), the
improved. narrow coils are stored in the automatic warehouse
(3) The system is sufficiently flexible for easy (AW), and then transported to the packaging line
modification. (PL), according to the order lots. The maximum
transportation rate from SL to AW is 80 coils per
With these expectations, the coil transportation hour, and from AW to PL is 30 coils per hour.
system for the electrical steel finishing line at the (2) Wide coils ( 500-1270mm width)

63
After separation from the narrow coils, the wide coils Resident in a process computer (PC), the ES achieves
are transported to the coil yard by flatcar, and then real-time transportation control by sending definite
transported to PL. carrier movement orders to a direct digital controller
( DDC ), after deciding on the optimum transportation
Any defective slit coils are transported to the welding schedule for coil transportation orders from an on-line
and recoiling line (WR), where they are recoiled after computer (OC), which supervises all product infor-
removing the defects. mation. In accordance with the product order from
OC, PC controls all facilities on the line, including the
Fig. 2 shows the layout of the finishing line, which automatic coil inspector and coil wrapper.
features the following facilities.
(1) Automatic coil transportation by carriers OC Functions
The narrow coils are automatically suspended from C- (1) Transportation orders are sent from OC to PC, coil
hooks and transported by 24 monorail-type carriers to by coil, when the transportation demands occur at
prevent any defects due to transportation. each facility on the l i n e . These are macro-
(2) Tracking checks by barcode readers transportation orders, which indicate the coil
In order to assure precise coil tracking, barcode transportation terminals, for example, from SL to
readers are utilized to check the coils at the CS and AW, from AW to PL, and from AW to WR.
automatic coil inspection positions. (2) All coil production schedules and coil storage
(3) Automatic coil inspection and wrapping states on the line are supervised by OC. Also managed
Before the coil packaging, the coil products are by PC, the coil storage states in AW are easily
automatically inspected and wrapped, to assure recovered by down-loading data from OC to PC and
quality. vice versa, in case of coil tracking problems.
(4) Automatic warehousing (3) By utilizing the barcode readers, tracking checks
The narrow coils are automatically stored in AW, at each facility are made by OC, which compares the
which has 4 cranes and about 1500 shelves, and can actual coil number detected with the scheduled coil
supply coils to any facilities on the line. number. If these numbers don't coincide, the coil
information that is needed to operate the facilities
Objectives of Transportation Control will not be sent from OC, which prevent operation
with wrong information.
The most important objective of transportation
control was to operate the facilities on the line at PC Functions
maximum efficiency, utilizing 24 carriers. To achieve (1) According to a coil transportation order from OC,
smooth coil transportation, the following aspects, PC makes the optimum transportation schedule and
which are shown in Fig. 3, had to be considered. carries it out in real-time by utilizing the ES. The ES
(1) Carrier congestion control decides the transportation carrier and timing in real
Because the exclusive carrier waiting equipment was time, in order to fulfill the order in the shortest time,
not going to be used, the transportation timing and then sends the carrier movement order to DDC at
between carriers had to be strictly controlled, so that appointed points on the route to the terminal.
there would be no carrier traffic congestion. (2) The tracking of coils and carriers on the line is
(2) Carrier deadlock control supervised by PC. Each coil number is checked by
Carrier deadlock had to be avoided on the WR automatic barcode readers to control the flow of coils
monorail line, where carriers move in both directions. strictly so that the wrong coils are not transported.
(3) Maximum efficiency of AW control (3) The coil storage states in AW are also managed by
In order to attain maximum AW efficiency, the 4 PC so that a double-check of the tracking is made. As
automatic cranes would need be operated under the a result, any t r a c k i n g problems can be easily
most appropriate working ratio control. recovered from.
(4) Automatic transportation route changes (4) PC supervises the operation states of all facilities
A product coil, if it is rejected by coil inspection on the on the line, in order to detect any problems in the
way to PL, would need to be automatically returned to machines in an early stage. As a consequence, they
AW with the same order lot coil products. are only monitored by the display screen of PC, to
unify the man-machine interfaces.
The transportation control method that was required
would need to operate in real time and automatically DDC Functions
make the optimum coil transportation schedules. In (1) Under the control of PC, DDC operates the
addition, the system would have to provide complex carriers and cranes in real time.
transportation control of 24 carriers, which requires a (2) DDC achieves the perfect carrier control, so that
substantial acquisition of knowledge. there is no chance of a collision even after a wrong
ES was therefore chosen, because it can acquire the transportation order from PC.
deep knowledge efficiently, and be constructed step (3) By dividing the railway into 2m tracking sections,
by step. precise carrier tracking can be done by DDC. To
assure accurate tracking, the number of each carrier
SYSTEM CONFIGURATION is checked by carrier-number readers t h a t are
installed at regular points along the railway.
Assignment of Computer System Functions

64
System Configuration the carriers were actually moving (see Fig. 6). The
movement of the carriers would be locked, if the
According to the assignment of functions j u s t transportation knowledge was incomplete. The
mentioned, the system configuration for the finishing transportation capacity, e.g. the number of carried
line was determined as shown in Fig. 4. The features coils per hour, was also shown on the screen so that
of the computer system are as follows: the simulated results could easily be investigated.
(1) PC is a HIDIC V90/45 with back-up computer, and
the expert shell used is EUREKA- II (Electronic By acquiring knowledge through simulation, the
Understanding and R e a s o n i n g by Knowledge transportation control rules could be improved step
Activation, version 2), developed by H i t a c h i by step in order to construct the complete knowledge
(Funabashi, 1988). The number of rules for ES is 310 base. The debugger of EUREKA-II enabled the
in all. development of the ES on a work-station to be very
(2) For PC input and output signal processing, front easily carried out. The trace of any production rule at
end processor (FEP) to control the data transfer any inference step could be analyzed by the debugger,
between PC and DDC is installed in the system. with multi-window operation to assist the simulation
(3) Data communication between PC and trans- process.
portation DDC is made through a serial connection
with a modem, and to parallel with a process input Configuration of the Expert System
and output device (PI/O). The carrier position signals,
which need a fast response, are sent from DDC to PC Assurance of fast response. To secure fast response
through PI/O. On the other hand, the transportation for real - time control, the number of the rules, which
orders, which don't need a fast response, are are made to run at one time, should be reduced to as
transmitted from PC to DDC, through a modem. small a set as possible. Because the maximum
(4) The automatic machines, e.g. the automatic transportation rate is 110 coils per hour ( 80 coils per
inspector, are controlled via serial communication. hour from SL to AW, plus 30 coils per hour from AW to
Their operation orders are sent collectively to DDC as PL), this gives an interval of about 30 seconds
transportation patterns to reduce the number of between each coil on the average, so that the response
communication transactions. time needed to be less than 3 seconds. The methods to
secure such a rapid response are shown below.
OUTLINE OF THE EXPERT SYSTEM
Division of functions: According to its functions, the
Development Method transportation control system was divided into two
subsystems. One which is written in non-procedure-
ES was applied in this case to the new plant, so that no oriented language, called the ES support system,
operation expert existed. Therefore, the prototype makes simple transportation schedules, and the other,
approach was adopted to construct the ES, in order to written in the EUREKA-II procedure-oriented
teach the complex transportation knowledge most language, called ES, makes more complex trans-
effectively. The prototype transportation model was portation schedules. The simple schedules involve
made in a work-station, after the first teaching of transportation control with no high degree of
k n o w l e d g e h a d b e e n c o m p l e t e d by a d e s k judgment, which make carriers simply move ahead,
examination, and this knowledge was tested by for example. On the other hand, the more complex
simulation with the prototype model. In this way, the schedules imply high-level control, to judge the trans-
transportation knowledge was progressively modified portation timing and terminal of one carrier from the
to perfect the control rules. The work-station was a state of other carriers in real-time. The information
2050, and the expert shell used was EUREKA-II flow of ES is shown in Fig. 7. After deciding the
developed by Hitachi. arrival information by the tracking program, only the
intricate problems are transferred to ES to create the
By modeling each facility on the line, the simulator complex schedules. By using ES only for handling
shown in Fig. 5, could produce coil transportation complex work, the response time for real-time use was
requests from any facility at any time. The movement assured to maximize the advantage of ES .
of each carrier was simulated to match actual
movements so that event generation at the appointed Division of control rules: In addition to the division of
place and time could be achieved. The features of the control functions, the control rules of ES themselves
simulator are as follows: are divided into 37 groups according to their
(1) The operation of the facilities on the finishing line functions, as shown in Fig. 8. Only a designated rule
was modeled fairly well by the simulator so that the group with a maximum of 44 of the total 310 rules is
transportation requests from each facility on the line made to run at one time, so that rapid response of the
could be simulated at any time. transportation control system can be achieved.
(2) The time needed to move a carrier from one place
to another in the simulation was nearly equal to the Information flow. The information flow of ES is
actual time. according to rule-based forward reasoning control
(3) The simulated results were displayed on a screen, system.
in which the tracking picture of the line is shown as if
After receiving the transportation order from OC,

65
the transportation database is produced, and ES is all carriers on the WR monorail line.
activated.Then, after making transportation route
from the schedule production rules by ES, the first Maximum efficiency of automatic warehouse control:
carrier movement order is sent to DDC. When the first To achieve the most efficient operation of AW, the 4
movement order has been completed, DDC sends to automatic cranes should be operated under an equal
PC the arrival information for the appointed position, working ratio system, which uses the cranes in a
which is then used to judge the subsystem to create cyclic operation to store the coils. By waiting for the
the next movement order. If this involves a junction packaged coils in front of PL, a transportation time
position, which needs high-level judgment for smooth lag from AW to PL is avoided to maximize the
control, the information is sent to ES for making efficiency of taking out coils from AW. Furthermore,
complex schedules. If this is not the case, the in the case of simultaneous operation of storing and
information is transferred to the ES support system t a k i n g out coils, a crane should be operated
for making simple transportation schedules. After continuously to store and take out (a double-command
reiteration of the same information flow and operation), to give the most efficient control. In order
completing the transportation order, the transport- to minimize the movement time, the nearest shelves
ation result is sent to OC from PC. to a storage position are selected.

Any unoccupied carriers are controlled by ES without Automatic transportation route changing : A product
orders from OC. ES judges the availability of waiting coil that has been rejected by the coil inspector on the
positions (positions 212, 424 and 223 in Fig. 3, for way to PL is automatically returned to AW with the
example ) at certain time intervals, and moves the same order lot of coil products. After the inspection by
unoccupied carriers to these positions when available. the operator, the order cancellation is sent from OC
and the transportation route of this order is auto-
Transportation Control Methods matically changed.

The basic principle of the transportation control by ES RESULTS OF APPLYING


to achieve the smooth flow of coils is that no carriers THE EXPERT SYSTEM
should wait on a rail section where occupied carriers
could move. With no exclusive carrier waiting Productivity in creating the software. Making the
facilities, the following methods were applied to the most use of the expert shell stored in the work-station
control. enabled the investigation of knowledge to be done by
simulation with a prototype model, so that the ES
Congestion control: In order to prevent traffic knowledge base could be constructed step by step. As a
congestion, imaginary carrier waiting positions at the result, the complex transportation rules for the new
212, 424 and 223 positions shown in Fig. 3 are plant, for which no expertise in operation existed,
allocated. The loaded carriers don't go through these could be systematically built up in about half the
positions in principle, so that the unloaded carriers man-months t h a t would have been needed by
can wait there after their job without occurring any conventional programming language, as shown in
congestion. In addition to these positions, one of the Fig. 11.
pair of storage positions for each crane, which operate
as coil transfer positions between carriers and cranes Quality of the software. Transferring the prototype
in AW, are allocated as waiting positions, so that coils ES to PC, for which the control rules had been
can be taken out without loss of time. Furthermore, perfected by simulating on a work-station, assured the
the storage of coils is always possible in one of these quality of the software. As proof of this, no software
positions to avoid congestion. problems have occurred since the system was applied,
and ES had contributed greatly to the smooth
Deadlock control: The railway around WR is for two - operation of the line.
way traffic, along which the carriers can move back
and forth. Therefore, to prevent any dead-lock of Software maintainability. The control rules are
carriers, positions 501 and 502 in Fig. 3 are allocated written in Japanese in the knowledge base, and the
as temporary carrier waiting positions to achieve modification of operation methods has been easy,
smooth coil transportation to and from WR. Two of the resulting in high maintainability for the software.
examples of deadlock control are shown in Figs. 9 and
10. Fig. 9 shows the case of moving carrier A to Response time. To ensure a fast response, in addition
position 522 against the movement of carrier B out of to restricting the functions of ES, the control rules
WR. While carrier B is between positions 501 and 522, were divided into groups according to their functions.
carrier A waits at position 502, and then begins to This has resulted in a response time in 97% of cases of
move to position 522, after carrier B has moved off the less than 3 seconds, as shown in Fig. 12. With the 3%
WR railway. In the case of simultaneous movement exception for simultaneous event control, which
from positions 522 and 525, carrier A temporarily needed more t h a n 3 seconds for response, the
moves to 501 from 502, letting carrier C move off the transportation control in real time has been
WR railway, and then again moves to 502, so as not to successfully achieved.
interfere with carrier B. In this way, the deadlock of
carriers is avoided to achieve the smooth movement of

66
CONCLUSION

The ES method was applied to the r e a l - t i m e


transportation control on an electrical steel finishing
line, resulting in a systematic analysis of the complex
operation method for the carriers. Development of the
system took half the man-months that would have
been required by conventional p r o g r a m m i n g .
Simulation of the prototype ES on a work-station was
effective for acquiring the necessary knowledge to
establish the operation procedure for the new plant,
which was vital as no operation experts existed. The
ES method is expected to be a most effective means
for systematically analyzing complex transportation
control in factory automation.

REFERENCES

Koroku, M. ,T. Yamanaka, K. Yoshimura, and T.


Tamura ( 1990 ). Trend of artificial intelligence for
practical use and Hitachi's activities. The Hitachi
Hyoron, 11,1-8 (in Japanese )

Tsushima, I. ,T. Tashiro, N. Komada, K. Baba, and S.


Takakura (1985 ). Application of rule based control to
flow line control - Billet conditioning line control -.
Trans, of SICE, 10,107-114

Funabashi, M. and K. Mori (1988). Knowledge based


control system and software for building expert
systems"EUREKA-II". Hitachi Review, 4, 267-274

[Annealing! 'Vj Coil yard


line ./i

H -A--
.IWide!"
No. 2 SL No. 1 SL | coils L
No.lWR

Normal route
Slit coils ! - H CS
Exceptional route .?.
j Narrow j
! coils ! No.2WR No.3WR
Automatic
warehouse
Automatic
coil inspection
-*
SL: slitting line CS: coil separator and wrapping
WR: welding and recoiling line PL: packaging line
Coil products

Fig. 1 M a t e r i a l flow down the finishing line

C-hook carrier

Coil Specifications
Width: 40-500mm j Automatic
Diameter: 548-1150mm ! coil inspection j
Weight: 3000kg max. |and wrapping
SL: slitting line
WR: welding and
recoiling line
PL: packaging line
Fig. 2 Layout of the finishing line

67
Imaginary
carrier waiting
position
Transport
position
Two-way
traffic

Automatic
coil inspection
and wrapping

Fig. 3 Technical subjects of transportation control

OC: on-line
computer Coil transportation
PC: process i expert system
computer
FEP: front end PC (32Mb) :OCj PC (32Mb) i(310 rules)
processor Backup On-line
DDC: direct
digital
controller

. . , , <CTT)
Data bus
FEP (8Mb)| FEP (8Mb) Disk \ /
SL.WR Transportation
control control

i/o- I/O
Modem
(CRT) X4
j X4 ^ X4 PI/O: process
input and
"SkWRDDC"] ; Transportation DDC | output
device

Fig. 4 System configuration of the computers for the finishing line

i Display of Simulated Results


L _ __ J

Fig. 5 Configuration of the tracking simulator

HBC2 HBC1 KBC1 I|KBC


2

HAB2 HAB1 KAB1 KAB2

Tr'tTNo.
wmJBJ**? uu 5 ] 4
0 I-IO
1 1 -Q
21-3Q
2

rtf Stt
mL
A 72
h" ggswi
B 72
C 71
D 731
E 761
F 72
G 0

* 4o foteuo 4-0 4-tiHo--'

Fig. 6 Simulated results displayed on the screen

68
] Process computer f

Transportation
! On-line U-4 order- File control
; computer receiving program! program

Transportation Transportation Transportation


database p"| expert order-
program system sending program|
Complicated
problem
Transportation [ Tracking j
results- program Expert system
receiving I (judgement of | Qbln .le,support program
program I complications)l ?P L
i _ iproblem

Results Transportation [
! controller !

Fig. 7 Information flow of the expert system

Automatic transportation 37 rule groups


expert system 310 rules in total
Reasoning Order-receiving rules
control
rule groups Slit coil transfer
schedule production rules
Carrier transportation
schedule production rules

Transportation pos. 212-pos. 201


j u d g e m e n t rules t r a n s p o r t a t i o n rules
pos. 209-^pos. 512
transportation rules
Reasoning completion a n d
j u d g e m e n t r u l e groups
pos. 424-H>pos.212
transportation rules
(pos.: position)
Fig.8 Configuration of t h e expert system

Coil t r a n s p o r t i n g carrier

V O J V O J \ o J
(1) A is moving to 502 (2) A waits a t 502 (3) After B is out of WR line,
A begins to move to 522

Fig. 9 Carrier deadlock control for t h e case of


moving carrier A to position 522 a g a i n s t
the m o v e m e n t of carrier B Coil t r a n s p o r t i n g carrier

(1) A waits at 502 (2) A moves to 501 (3) A moves to 502


in order to let C go again in order not
out of WR line to interfere with B,
and then moves to 522
Fig. 10 Carrier deadlock control for the case of
moving carrier A to position 522 a g a i n s t
the simultaneous movement of carriers B and C

69
with conventional with expert system
programming language
Fig. 11 Comparison of productivity for system development

Fig. 12 Response of the expert system

70
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

EXPERT SYSTEM FOR MANUFACTURING ORDER


DETERMINATION IN HOT-ROLLING PROCESS
H. Fujimoto*, S. Arai*, K. Sanou* and N. Fukaya**
^Systems Laboratory, Systems Planning and Data Processing Department, Kawasaki Steel Corporation,
Tokyo, Japan
** Systems Engineering Division, Kawasaki Steel Systems Research and Development Corporation, Chita, Japan

Abstract. We have developed an expert system for determining manufacturing order in the
hot-rolling process of seamless steel pipe based on the knowledge engineering. The purpose
of this expert system is to decide optimal manufacturing order, satisfying various require-
ments. This kind of decision making is one of the synthesis type problems which are known as
the hardest one for a practical use. We paid close attention to the professionals for their ability
of optimized problem solving using various broadened and unstructured standards, and then
attended to systematize with the knowledge-based system technology. In this development,
we confirmed effectiveness of the expert system in relation to problems involved in sequence
composition, and developed its inference method.

Keywords. Artificial intelligence; expert systems; synthesis methods; inference processes;


production control; heuristic programming; steel industry

INTRODUCTION guage which attaches importance to universal applica-


bility and compatibility, and have prepared a develop-
We have constructed those full-scale systems oriented ment guide summarizing the approaches to systema-
for planning and administrating the management of a tize the ES, thereby contributing to the application de-
steelworks integrally and consistently as a whole, re- velopments. As an example of applying both the ES
sponding to the diversified users' requirements and structuring tool and the development guide, we have
facing the complicated production processes more and taken up a small-sized seamless pipe rolling sequence
more. Those operations in which a high level determi- composition ES. First of all, this paper will describe
nation or decision must be made, however, have re- the overview of the system for small-sized seamless
portedly turned out to be difficult to systematize with pipe rolling sequence composition, then a description
the conventionally available technologies only, and will be given concerning those sequence composition
they have been still left unsystematized. We have rec- problem dissolving methods.
ognized that the expert system (hereinafter referred to
as the ES for short) would work effectively on such
operations as referred to above. The ES has been so OUTLINE OF SEAMLESS PIPE
far applied to some operations in reality. Our tackling ROLLING SEQUENCE COMPOSITION
with the ES has been started since 1982, and espe-
cially in business computer applications we began to Rolling Process
make a substantial study on the applicability of the ES
five years ago. So far we have developed an in-house Our seamless pipes may be broken down into three
ES structuring tool (Midori, 1991), based on the C-lan- categories by material: carbon steel, alloy steel and

71
Fig.l Seamless Steel Pipe Manifacturing Process

stainless steel(Hiroaki, 1990). They have been used series of orders subject to the identical ratings of out-
for a variety of purposes, such as various pipelines, side diameter, plate thickness, length, and material.
line piping, boilers, heat exchangers, oil wells, general The term, rolling sequence composition, means that a
structural members, mechanical structural members, production sequence is determined in each rolling
material pipes and landslide preventives. These seam- chance. Fig. 2 shows a flow of the conventional opera-
less pipes are produced either in small diameter or in tions. An expert of the rolling sequence composition
medium diameter seamless steel pipe rolling mills ac- maps out a plan of rolling sequences enough to cover
cording to their outer diameter. This paper will pertain approximately three working days, which cover one
to small diameter seamless steel pipe rolling sequence rolling chance, to compose one sequence of rolling.
composition. Fig. 1 shows the production process of To determine a rolling sequence, a task in the main-
seamless steel pipes. The base material columnar bil- frame is first perform to establish a provisional rolling
lets are supplied by our Mizushima Works. These bil- sequence, which an expert corrects manually in ap-
lets are heated in a rotary heating furnace to be pierced proximately 11 hours (two working days). After satis-
with a piercer mill to produce a cored steel pipe, which fying every requirement specified in the product
is in turn rolled on both external and internal surfaces specifications and for production technology, the ex-
with a mandrel mill for bore formulation and stretch- pert determines such an optimum sequence to maxi-
ing. Then, the pipe is heated in a reheating furnace mize the productivity.
and rolled to a specified pipe diameter and thickness
by a hot stretch reducer (HSR). Finally, the pipe is cut
into a specified length after cooled down on the cool- Problems Involved in the Current Sequence
ing table, to be shipped as finished product after sub-
jected to non-destructive inspection.

Process section Technical section

Operations of Composing a Rolling Sequence

Three sizes of billets are used as materials to produce


a seamless steel pipe of different outside diameter and
of different plate thickness. Three rolling chances are
given to each type of billet every month (totaled at
nine rolling chances). One rolling chance rolls ap-
proximately 400 lots of materials. Each lot covers a
Fig.2 Current Operation Flow

72
Composition use, because a composition generally explodes,
thereby making it difficult to obtain an optimum an-
The conventional rolling sequence composition has swer within a practical time limit. Dissolving this
been supported by the "Rolling Scheduler," a subsys- problem, an expert has been using a wide range of
tem in the steel pipe production management system. unstructured operation standards properly to flexibly
This system has determined a rolling sequence, based practice the solution toward the problem. Upon sys-
on the limitations of equipment in the rolling process tematization, we have set our eyes to this point and
and on those general restrictions which determine a decided to apply the knowledge engineering ES tech-
rough frame of processing steps. It outputs a provi- nologies. In other words, the operations by an expert
sional rolling sequence table. Coupling the recently try to be put into an expert system as far as practicable,
diversified and the upgraded product specifications, based on the conventional rolling sequence composi-
the problems specified below have turned out con- tion steps. This has been aimed at not only reducing
spicuous. the sequence composition time but also enabling any
unskilled worker to compose a rolling sequence.
Defects in "Rolling Scheduler". Since those days
when the "Rolling Scheduler" was initially developed, Distributed processing in mainframe and worksta-
the environments have changed as described below. tions. We have thought that the existing "Rolling
As a result, the rolling scheduler has not been per- Scheduler" should be restructured. On the other hand,
forming its original target functions: it is assumed that the ES is to operate on a workstation
1. A configuration of ratings have greatly changed. (hereinafter referred to as WS for short). It is neces-
2. An accumulation of rolling technologies have re- sary, therefore, to clarify the way of processing with
quired rolling timing to be set delicately. the mainframe in which the conventional system can
3. More importance has come to be attached to quality operate.
than to working efficiency.
We have decided to adopt distributed processing in
The expert is overloaded with his operations. both the mainframe and a WS for the following rea-
As a derivative of the problems referred to above, sons:
many hours were required to manually correct the out-
puts of the "Rolling Scheduler." 1. Processing in the WS requires the data owned by the
existing system. The distributed processing form,
No knowledge built up for scheduling. Since the de- however, may be taken readily with the WS connected
fect of the "Rolling Scheduler" resorts to a human on line to the mainframe, if processing in the WS is
manual correction, the knowledge about scheduling positioned as a subsystem in the existing production
has turned out inherent to experts. On the other hand, control system.
engineering expertise, process-control-oriented ex-
pertise and productivity-oriented expertise have been 2. A CPU load on the mainframe is reduced, thereby
owned by different experts, respectively. allowing a suppression of cost increases for the main-
frame.

OVERVIEW OF THE SYSTEM 3. With WS data edited in the mainframe beforehand,


it is possible to process a large volume of data without
Principles of Systematization the necessity of bringing it into the WS.

Prior to the systematization, we decided to adopt a These have been aimed at effectively using computer
system development policy as described below, taking resources, and improving the development efficiency
into consideration the problematical characteristics in- and maintainability.
volved in a rolling sequence composition.

Modeling experts' map-out methods. Composing a System Configuration


rolling sequence is one of the typical composite type
challenges considered difficult to put into practical Fig. 3 shows a configuration of the system, which

73
Mainframe System UNIX Workstation
Production
Planning Selected Data
Data Base

hQH
Production Data
Planning Determining
Select Fixed Quantity
System Temporary Knowledge
3CT Rolling
A sequence
Maintenance

Listing
Rolling
Selected
Data
B Temporary
sequence
Q Fixed Quantity
Data
sequence

Determining
Rolling sequence

User Defined
Function Knowledge
Update
Rolling
sequence
Rolling Rolling sequence Data
sequence Data

Fig.3 System Image of Mainframe and Workstation

Table 1 System Size problems involved in the rolling sequence composi-


Classification Program Size Application tion development.
Language

HOST COBOL approx. existing system


1500 steps interface

WS COBOL approx. provisional Problems Definitions


3500 steps scheduler
Informix approx. external parameter
maintenance
(4GL) 4500 steps
approx.
The problems involved in this case are defined as fol-
user-defined functions
C-language 6500 steps
to be used in rules lows:
approx. description of
ES tool
800 rules knowledge
Object. The seamless steel pipe production equipment
reported herein has been processing three sizes of bil-
comprises a mainframe and WSs. A file transfer sys- let materials. When and how much these round billets
tem is employed to give and take data between main- are to be rolled have been determined subject to the
frame and WS. limitations of the equipment. This is called the "roll-
ing chance" which defines a rough frame of the quan-
An ES structuring tool made in-house is incorporated tity in which one rolling chance should compose a se-
in the WS, and also, COBOL and Informix have been quence. The object is a collection of those sequential
introduced to structure peripheral systems. orders whose billet is identical in outside diameter,
Table 1 shows sizes of the system. plate thickness, length and other ratings. Such a col-
lection of orders is called a "lot," and each lot has a
certain number of restrictive items.
The problem is how to sequence the lots in one rolling
chance. Approximately 200 to 500 lots are to be se-
quenced in one rolling chance.

METHOD OF DISSOLVING Restrictive requirements. Those requirements for a


PROBLEMS INVOLVED IN rolling sequence composition, which will turn out re-
SEQUENCE COMPOSITION strictive in determining the sequence, could be sum-
marized into two categories: product specification re-
In systematizing the system reported herein, a critical quirements and limitations of the equipment.
challenge was how to model experts' planning meth-
ods. A description will be given below concerning the These requirements can be classified into four catego-

74
ries by restrictions. Restrictions which comprise a to- technology requirements as shown below. It may be
tal of approximately 50 items are: structured, furthermore, as a collection of knowledge
1. Priority. in more detail. Fig. 4 shows a configuration of the
2. Quantity. knowledge about a sequence composition. The
3. Position. knowledge is expressed subject to approximately 800
4. Relations to preceding and following lots. rules of an "IF - THEN" form. One collection of the
knowledge about the sequence composition is called"
Evaluation items (objective functions). After satisfy- a rule set."
ing every requirement, evaluation items in sequence Each rule set is defined as a collection independent of
composition are as follows; any other rule sets, thereby securing both readability
1. Quality. and maintainability.
2. Delivery date.
3. Productivity. Sequence composition inference method. To dissolve
the problems involved in a sequence composition, it is
From all these definition, we found the frame of roll- difficult to find a solution within a practical range of
ing sequence composition problem, which is to ar- time limits since an explosion will take place when we
range objects satisfying every requirement and maxi- should try to verify every sequence composition pos-
mizing evaluation items. sible. A model, which imitates the steps of an expert,
has been structured to employ a trial-and-error type
sequencing method accordingly.
Problem Dissolving Model
The system reported herein has its sequence compos-
Difficulties of dissolving the rolling sequence compo- ing functions dividable into the following two, if
sition problem are to satisfy many conflicting require- roughly classified:
ments. It occurs often that satisfying some require-
ments fails to satisfy other requirements. The conven- Provisional schedule. An entire framework (number
tional system has settled this problem by setting the of roll-changing cycles, timings, etc.) is to be deter-
priorities. Now that, the environments have changed, mined. In accordance with the generally applicable
the conventional system has not been performing its rules in that framework, rolling lots are to be assigned
targets. to make a provisional rolling sequence. The functions
of generally applicable rules are:
Experts' heuristics comprises their knowledge and ef- 1. Determining a lot which has a starting point,
ficient steps. A problem-dissolving model encounters 2. Sequencing from thin to thick products,
with a problem: how to efficiently use the processing 3. Taking productivity into consideration, and so on.
time. These are described below.
In this stage, restrictive requirements have not been
Structuring a sequence composition. Experts' knowl- checked in detail yet; some problematical portions ex-
edge about the sequence composition could be sum- ist.
marized into product specifications and production

ES Knowledge Product specifications Priority restriction



Quantity restriction

Position restriction

'reduction technologies Priority restriction

Quantity restriction

Fig. 4 Expert System's Knowledge Configuration

75
Final schedule. Based on the findings in the provi- able to dissolve initially is moved to the most desir-
sional schedule, experts' know-how and quantitative able one of the positions to which this lot is movable.
knowledge are used to determine a sequence while This process is repeated until any lot satisfys the re-
satisfying the restrictive requirements definitely. The strictive requirements, so that a rolling sequence can
term, quantitative knowledge, means the parameters be determined.
to which the experts' knowledge has been converted,
and it has been set as the information which may be Rolling sequence composition inference method de-
changed frequently. scribed here is one of the effective methods dissolving
sequence composition problems.
The expert system is applied in making the "Final
Schedule." Inference principles are as follows: Speeding up Inference Processing.

In relation to the provisional rolling sequence, some The processing efficiency of a system is critical sub-
lots may fail to satisfy the restrictive requirements. ject upon which whether the system is effectively
These lots are to be taken out and inserted into appro- functioning or not. An improvement of efficiency, es-
priate positions. pecially in inference processing, is influential over a
model configuration of inference. Besides, it has an
The techniques of inserting lots into appropriate posi- impact on an internal structure of the ES structuring
tions are based on the principle of selecting the most tool. As a solution to this problem, we have improved
productive position out of those positions which sat- the efficiency as described below, so that an inference
isfy the restrictive requirements. If a position in a roll- may be processed in a reasonable period time.
ing sequence, which is using mandrel bar or HSR roll
of the same size as that of a lot desired to be inserted, Suppressing a redundant startup of rules. Some rules
should be selected to save the labor who change the need to process repeatedly in an inference cycle, but
tool. As a result, the sequence can be composed with- others need to process only once. For an example, the
out adversely affecting the productivity. rules to search for a position need to process many
times; those to set data in a frame of knowledge base,
Lots are to be moved from position to position in however, need to process only once. These rules have
turns, with priority given to the lot required the most. been arranged in an order and those unnecessary proc-
essing have been eliminated.
Fig. 5 shows the steps of processing the inference por-
tion on the above principles. In other words, a lot Suppressing an operation of the rule sets. Rule sets are
whose restrictive requirements are considered desir- composed to process by turns according to the se-
quence of processing. Some of them, however, do not
need to get to be executed, depending on current
exsisting in the data to be processed. For an example,
to process an order containing no medium alloy steel,
Listing the rolling lots whose pipe making the rule sets relating to medium alloy steel do not need
timing is changed
to process. This problem has been dissolved to process
Giving priority sequence to the rolling lots before proceeding to the inference by determining the
whose pipe making timing is changed rule sets.

Scheduling the rolling lot to which the first


Tuning a function. In relation to those functions used
priority has been given frequently and whose execution time is long, their in-
3Z
Dissolving the some of the rolling lots whose
ternal algorithm has been improved. This has allowed
pipe making time is changed the processing time to be reduced while cutting off the
working memory.
Carrying out a similar process once a
sequenced status have been altered

J EVALUATION
(Until restrictive requirements have been satisfied)

Fig. 5 Inference Process Row The system was put into operation in August, 1990.

76
Since then, the system was evaluated based on its op-
erating conditions. Results outputted by the system
have been at a level nearly equal to the sequence com- Developing the sequence composition inference
position prepared by experts. We have evaluated the method. A seamless steel pipe rolling sequence com-
system as capable of making an inference comparable position has typical factors considering sequence
with that of an expert. Evaluation results could be composition problems. So that, an inference method
summarized as follows: described here is able to apply similar problems and
cases. For instance, a problem replacing seamless
Evaluating results of the ES. The reasonableness of steel pipe with slab or coil will have same require-
restrictive requirements, production yield, and pro- ments and object functions.
ductivity have been taken into consideration as the
factors for evaluating the execution results. An expert CONCLUSION
has been preparing a rolling sequence composition
while simultaneously taking the above factors into A small-sized seamless steel pipe rolling sequence
consideration. Therefore, the results, equivalent to composition has been introduced to the reader as a
those of an expert, have been obtained. The confor- case of applying the expert system to the problems
mance rate of inference results has been compared involved in a sequence composition, in the field of
with the rolling sequence composition made by an ex- production control. In this development, we con-
pert and the system. As a result, we have evaluated it firmed effectiveness of an expert system. Now we are
at a level nearly identical to that of an expert. Thus, the convinced that any other similar sequence composi-
system has enabled us to perform an operation compa- tion problems are dissolved by the inference method
rable with an expert's. described here.

Evaluating the execution performance. The process- The result reffered to above has been successfully ob-
ing time has been taken up to evaluate the execution tained. Nevertheless, we have many problems remain
performance. Normally one to two hours has been unsolved in constructing an expert system which
taken to prepare a rolling sequence composition on the completely merged with the conventional software
system. Approximately three hours or more have resources within the limitations of both the hardware
been required in the case with a high content of spe- and the software. To dissolve these problems, we will
cial steel, such as stainless steel, then the initial target continue studying the applicability of the ES.
has been nearly achieved.

Evaluating operation. The system has allowed us to REFERENCES


execute a rolling sequence composing operation with-
out the experts' knowledge. The knowledge about Forgy, C.L. (1982). A First Algorithm for the Many
scheduling, could be summarized into the knowledge Pattern/Many Object Pattern Match Problem. Artificial
Intelligence, Vol. 19, pp 17-37
about production technologies, the process control,
and the productivity. It has enabled us to respond Hiroaki, K., and Shigeki, A. (1990). Seamless Stainless Steel
Pipe. Kawasaki Steel Giho, Vol. 22, pp 291-293
readily to a change in rolling sequence composition
Kazuhiro, F. (1988). Knowledge Programming. Kyouritu-
requirements. syuppan Co.
Midori, K. (1991). Problem Solving Tool based on Inference
In this development, we obtain the results as follows: Mechanism. Kawasaki Steel Giho, Vol. 23, pp 82-83
Satoshi, F., and Katuaki, S. (1988). An Application of Expert
Confirming effectiveness of an expert system. A roll- System to Shipment Planning. Operations Research, Vol. 33,
No. 1, pp 33-39
ing sequence composition has been deemed as one of
the problems whose systematization has been said to Shinya, A., and Satoshi, F. (1988). Applications of Expert
System at Kawasaki Steel. Kawasaki Steel Giho, Vol. 20, pp
be difficult with the system technologies heretofore 76-81
available. The expert system has been proven effec-
Yasuhiro, S. (1990). Recent Progress in Pipemaking
tive in dissolving the problems. There are many simi- Technology Developed by Kawasaki Steel Corporation.
lar problems and cases in bisiness application fields Kawasaki Steel Giho, Vol. 22, pp 219-227.
which will turn out to have a possibility of systematization

77
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

EXPERT SYSTEMS FOR THE AUTOMATIC SURFACE


INSPECTION OF STEEL STRIP
R. Haataja*, M. Kerttula, T. Piironen and T. Laitinen

Rautamukki Oy New Technology, P.O. Box 217, SF-90101 Oulu, Finland

Abstract. Surface inspection technology for flat steel products is


considered in this paper. Rautaruukki New Technology has developed its own
machine vision inspection technology which is based on line scan imaging,
very fast image processing facilities and sophisticated knowledge-based
classification algorithms for surface defects. Automated tools which
utilize expert systems and graphical user-interfaces, are added to the
system to assist the development of new machine vision applications.

Expert systems are applied in the knowledge extraction and in the


implementation of defect classifiers. The domain-specific classification
rules can be modified by an expert system which is capable of providing a
complete decision tree classifier with executable source code. Graphical
visualisation of the classification scheme is made possible. Real defect
images are used as the input data for this system. The resulting
classifier software can be transferred into the real surface inspection
system for application testing. Experience of these tools, as well as of
the first implementation of the inspection system in the pickling line of
Rautaruukki's Hmeenlinna thin sheet mills, is discussed. This system,
called Smartvis, is now being commercialized. At the end, an optional
classification method utilizing neural networks, is considered in the case
of detecting periodical defects like roll marks. The expert system (ES)
tools including neural nets have been tested in laboratory environment.

Keywords. Surface inspection; quality control; classification; acquisition


of knowledge-based information; rule-based systems; expert system
technology; programming languages; user interfaces; commercial expert
systems; neural networks.
INTRODUCTION based on real defect classes but rather on
more general symbolic groups. In addition,
Visual surface inspection is required in the adjusting of these systems used to be
several manufacturing stages of strip and very laborious which has also aroused
sheet metal products. Traditionally human suspicion towards this technology.
inspectors have done the job due to the
lack of automatic systems. Many problems Due to the lack of good commercial
are related to human inspection, e.g. solutions, Rautaruukki started the
inspection criteria differ by person, development of new generation surface
performance fluctuates in the course of inspection systems in co-operation with the
time, it may be difficult to find good VTT (Technical Research Centre of Finland)
personnel due to the boring work, and fast and the University of Oulu at the end of
production speeds (> 300 m/min) are 80's (Piironen,Haataja,Kernen, 1989;
impossible or risky to use. Piironen,Laitinen,Haataja, 1990; Piironen
and others 1990). Based on previous studies
The development of fully automatic it became evident that to be successful the
inspection started ca. 10 years ago. The surface inspection system must include many
first inspection systems are, however, adaptive and intelligent features; the
merely defect detection devices unable to illumination should be optimal to maximize
identify the type and cause of defects. the contrast of defect images; the digital
Later on some manufacturers have offered preprocessing must be very fast (> 1 giga
systems with some classification operation/sec) to allow real time
capability, but the classification is not operation; the customizing and tuning must

* occupation since August -91:


Vaasa Institute of Technology, Wolffintie 30, SF-65200 Vaasa, Finland

79
be flexible due to varying process 1. Informing the operator of defects that
conditions and inspection requirements. require immediate process control actions.

The surface inspection system developed by 2. Informing the following process stages
Rautaruukki is based on high resolution about defects that require changes in
line scan imaging, a special illumination process parameters.
system, advanced digital filtering methods,
rule-based decision tree classification and 3. Preparing coil- or sheet-specific defect
automated development support (Mkinen and reports for the quality control and
co-workers, 1991) . One key idea is to production personnel.
differentiate just the critical and most
significant defect types so that the
classification is still based on real Surface defect detection
defect classes. This considerably
simplifies the software of the system Surface defects on metal strips are divided
increasing, as we believe, also the into the three main groups:
reliability of the classification.
1. Three-dimensional defects like
The system, called Smartvis, is an lamination defects, cracks, roll marks and
inspection system which also includes a small dents (or pits) which are usually the
development system to assist the most critical, both for the product quality
customizing process. Automated tools and process control.
utilizing expert systems and graphical user
interfaces are added to the system to 2. Two-dimensional defects like discoloured
systemize and boost the development and spots, dirt stains, which are usually less
tuning of a new application, which critical, but must be detected in some
typically takes 1-2 years per system (in cases.
the mill!). Expert systems can be applied
both in the development phase - primarily 3. Large area defects, which can have e.g.
for knowledge extraction and rule texture-like appearance.
generation - and in the real time operation
for automatic defect classification and 4. Surface flatness deviations.
surface quality evaluation. Neural networks
offer a new choice for implementing one The system under consideration is most
part of the classifier. However, the basic suitable for inspection of the faults in
system utilizes a rule-based classifier. the first two categories. In the production
of some end products like galvanized and
colour coated steel it is most useful to
RULE-BASED CLASSIFICATION SYSTEM also detect some texture-like defects, but
the basic method does not support these
kind of defects. A parallel method based
The basic inspection system either on statistical algorithms like FFT
and texture analysis, or e.g. on neural
Basic components of the surface inspection nets is then needed, and is under
system are the illumination unit, one or development.
more semiconductor cameras, image
processing unit, operating panel, colour
monitor, image recorder and PC based Decision tree approach
reporting station (Fig.l). Image processing
functions are handled by a high speed VME Decision tree classification is a widely
computer unit with image processors used approach to classification in pattern
allocated for special processing. The other recognition, especially when the number of
functions are separated from the image features and classes is large. Pattern
processing and handled by one or more PC's. recognition approaches can be divided into
The system structure is modular allowing two main categories: decision theoretic
optimal configuration for each application (statistical) and syntactic (structural).
(Mkinen and co-workers, 1991) . Decision tree classification is viewed as
a subclass for the former case, and can be
The system uses special illumination to described by Fig.3 (Kerttula,1990). The
enhance the visibility of defects. A line main goal of a decision tree classifier is
scan camera is used to image the surface. to classify an unknown sample into a class
The video signal from the camera is using previously stated knowledge. This
digitized and processed in order to detect knowledge is presented in a form of
and classify defects. Information on the decision rules or functions
line speed is required to compensate for ( F u , R o s e n f e l d , 19 8 4;
speed variations and follow the position of W u , L a n d g r e b e , S w a i n , 19 7 5;
the strip. Additional process and Quinlan,1986,1990).
production data can be used to control the
inspection (Fig.2). A decision tree classifier is basically a
multistage-classifier which applies simple
The main functions of the system are: decision functions to different features at

80
each stage and moves to the next according illumination and image processing. Examples
to the result. This sequence continues of defect types are illustrated in Fig.6.
until a terminal node is achieved. This
strategy can be best described by a tree The generation of classification rules
diagram (Quinlan,1986,1990) . requires a large number of samples of
different defect types in order to define
A typical tree structure is described in the key features of each defect class
Fig.4, where decision nodes are shown as (Fig.7) . The expertise of the quality
ellipses and feature nodes as triangles. inspection personnel must be incorporated
Decision nodes hold classes to which in the analysis of the sample material.
feature nodes apply their decision
functions at each stage. A terminal node is
reached when a decision node holds only one KNOWLEDGE EXTRACTION METHODS
class (Kerttula,1990;Swain,Hauska, 1977) .
There are several techniques used for
The following principles apply to a knowledge extraction from domain experts in
decision tree structure: different fields. The methods can be
divided into different categories which are
1. There is just one root node in a tree. based primarily on individual or group
2. Every node which is not a root node has decisions. The emphasis of the method can
a single parent node. be either in interviewing or in observa-
3. There are no cycles in a tree. tion. In the first method we simply
4. A path from the root node to any given speaking directly ask how the expert makes
terminal node is unique. decisions, while in the latter method we
first let him make decisions and try to
There are two possible ways to design a observe the logical behaviour of the expert
tree classifier. When the classification (Forsythe,Buchanon,1989).
problem is relatively small i.e. the number
of features and classes is about ten, the In the observation method the given task
domain expert can pretty easily form the may just cover non-directly the problem of
required decision rules. In a larger application to get genuine results.
problem, formal tools are needed. Observation techniques can eliminate the
well known problem that an expert
When a decision rule is based only on a frequently acts differently to how he
single feature, simple distance thinks or says to act.
measurements between classes are used to
determine the class boundaries. Means and A method that suits well in this kind of
variances for all features in each class quality inspection application is based on
are calculated and the results are multiple experts. The defect rules are
represented in a so-called coincident seldom so absolute that all the experts
feature plot (Fig.5). Based on this plot, would classify the defects in the same way.
the final decision steps can be chosen. The Thus it is obvious that to eliminate biased
interactions between several features can results, the validation should be done
be used to advantage, and they are obtained correctly, preferably based on multiple
based on covariance matrices, which show experts and on all expert groups including
how different features respond to one end users, supervisors etc.
another in one class. The classifier can be
improved by adding another feature to a The main benefits of using multiple experts
decision rule. In general, wide trees that are:
use several features at each node, reflect - expertise covers a larger domain
the relative weight given to classifier - group judgements tend to be better than
accuracy, whereas deep trees, that use a individual
small number of features at each node, - group has better ability to recognise and
reflect the weight given to efficiency reject incorrect solutions and
(Kerttula,1990/Swain,Hauska,1977). suggestions
- contradictions and overlaps can be
clarified more easily
Rule-based classification - group work can save the time of
individuals
Defect classification is based on the
features of individual defect candidates One method designed for this purpose is
and their geometrical relationships, which called the Abaque de Regnier method, and is
are calculated in the image processing based on non-direct interview. A commercial
stage. The features include polarity, area, PC program supporting this method exists.
position, width, length, orientation,
mutual distances and repeatibility of
defects. In a typical cold rolling process
there are ca. 50 defect types which easily
leads to overlapping in the defect feature
space. Performance of the classification is
very much dependent on the quality of the

81
GENERATION OF RULE BASED rule generator visualizes the decision tree
CLASSIFIERS created by a graph and allows the user to
modify and prune it to find the most
efficient version.
Smartlearn: Decision tree design
environment 6. The complete decision tree can be tested
with different defect examples in the PC
An increased number of defect classes and before automatic code generation. Finally
varying application areas have arisen the classifier code is embedded in the Smartvis
need for new design tools faster and more software in the VME environment.
efficient than manual methods. A new
approach for the off-line design of a
decision tree classifier is presented here. Experience

When starting the development of a new Several classifiers were generated using
decision tree design environment, the SMARTLearn in our laboratory environment.
following goals were set: Both different simulated defect samples and
pure expert knowledge were used as teaching
1. Definition of the rules with graphical material.
user interface.
2. Visualization of defined classes and In small classification tasks (i.e. max 10
possibility to verify and test classes) tree generation succeeded even
generated decision trees off-line. with small example sets. In Fig.10 a
3. Possibility for supervised learning; decision tree for 5 defect classes defined
either from real defect samples or by a quality person is shown. In Table 1 32
knowledge defined by a quality person or examples of these classes are presented,
both mixed. and in Fig. 11 a decision tree generated by
4. Automated conversion of a decision tree SMARTLearn with previous samples is given.
into software. Figure 11 indicates that the generated tree
is compact and efficient. With larger
A software tool called SMARTLearn was problems (i.e > 10 classes) number of
developed utilizing a Windows environment, examples per class usually had to be from
because that offered excellent support for 20 to 100 to achieve reasonable results.
visualization and interfacing. A typical Sometimes statistical pruning was needed
sequence for the generation of a new for wide trees.
classifier with SMARTLearn is the
following: Classification from merely expert knowledge
was reasonable only if the limits defined
1. The user picks up a gray level defect by the expert were well covering. That
image to the main window. Images for the means exact limits at least for the half of
program can be produced in several the features should be given. Too many
different ways; it can be grabbed from live "don't cares" make the tree too large and
video tape or transferred from the Smartvis incomplete. However, mixing well defined
system. expert classes with real examples
succeeded.
2. Defect segmentation operations are
performed on the defect image to extract As a result the design phase of a tree
different defects from the background. classifier has considerably improved -
Operations simulate those of Smartvis. there is now the possibility to choose from
several tree structures and verify their
3. The user selects all the defects that he performance, in shorter development time
wants to be classified. Defect and defect than before. In addition the SMARTLearn
feature parameters are shown in a child tool is not only tied to the decision tree
window and the user can decide whether or classifiers: the features collected in
not to select this defect for an example example files are ready to be handled by
file. Before adding a defect to an example other classifiers, like neural networks,
file a class name for the defect has to be too.
given (Fig.8).
The pure decision tree classifier uses
4. If necessary, the user can also add his "hard rules" for defect classification. In
own expert knowledge to the example file. future work addition of fuzzy logic based
This is done by a Expert Knowledge child "soft rules" will be tested to evaluate how
window (Fig.9), where the user can define reliable the classification results are in
variation limits for each feature parameter the case of incomplete information.
for different classes.

5. When all examples are given, example


files can be fed to the rule generator.
Rule generation is done based on an
iterative method with information-driven
evaluation function (Quinlan, 1986). The

82
A NEURAL NETWORK CLASSIFIER on a heuristic algorithm, binary matching
and Fast Fourier Transform (FFT) were
tested in the same manner with good
Background accuracy, too. In the final comparision of
these methods the possibilities of real-
Neural networks (NN) offer an interesting time implementation has to be taken into
way to solve pattern recognition problems account. Naturally, the performance has to
without the need to develop exact be verified also in a real operation
algorithms or rule-bases. Nevertheless, environment without test simplifications.
there still remain the problems how to
choose the optimal network structure, input
vector (features, parameters) , how to teach CONCLUSIONS
the network, and in the end how to validate
the performance of the network. The same Design and implementation of surface
type of problems naturally exist when using inspection systems is a long and
conventional classifiers. complicated process. A major problem is the
collection of an adequate data base of
A case study was made to test neural defect images from the process for the
classification for detecting periodical teaching of classification rules. Another
surface defects in the laboratory problem area is to transfer inspection
environment. Commercial software (by Hecht instructions, as they are understood by a
Nielsen Computers Inc.) was utilized. human, to the quantities measurable by the
image processing system.

Periodical defects As the application area requires much


domain expertise, expert systems (ES) are
Repetitive i.e. periodical defects are a natural approach. In the Smartvis
typical in hot and cold rolled metal inspection system ES has been applied in
processing, and they are caused by roughly two ways: first off-line for
defective rolls or extra material particles generating rule-based classifiers, and
in machinery, or they can also be formed secondly in real-time for classification.
mechanically when coiling or storing the In future the optional way to use neural
strip. The periodical defects are met in classification would thus compensate for
machine direction, usually from the same the rule-based ES's by NN's, while the
longitudinal sector. The distance between design process would still be done using
the defects may vary from a few metres to off-line ES's.
tens of metres. An instant on-line
detection of strong periodical defects Early experiences indicated that the
would yield significant economic savings developed software tool for decision tree
due to reduction in material losses, design can considerably reduce time, cost
process damage and maintenance costs. and application-dependent work on the
process line. Based on our experience at
least partly automated design process is a
Methodological aspects necessity and a condition for professional
new application management.
The inspection window length in machine
direction can be assumed based on the
production line machinery and process REFERENCES
model. In the cross-machine direction the
strip is divided into a suitable number of
sectors to save computing time. In the Forsythe,D.E., and B.G. Buchanon (1989). Knowledge
teaching phase, periodical and non- Acquisition for Expert Systems: Some Pitfalls
periodical defect vectors are taught to the and Suggestions. IEEE Trans, on Systems, Man and
NN by examples. Verification of the Cybernetics, Vol. 19, No 3.
classifier can be done by any other defect
vectors other than by those used for
teaching. Fu,K.S., and A. Rosenfeld (1984). Pattern Recognition and
Computer Vision. IEEE Computer, October, 274-282.

Testing experiences Kerttula,M. (1990). Decision Tree Classification in Pattern


Recognition. In post-graduate seminar report (in press),
HNC Inc's NetSet II simulator was used for University of Oulu, Department of Electrical
tests, the goal being to classify a 64-bit Engineering, Oulu, Finland.
binary input vector. Test data was
synthesized based on real defect data
gathered by the actual Smartvis system in Mkinen,M.,Piironen,T.,Laitinen,T., and R. Haataja (1991).
the pickling line in Hmeenlinna. Off-line High-Speed Adaptive Surface Inspection System. AISE
laboratory tests gave remarkable Spring Conference 1991, April 22-24, Dearborn,
classification accuracy (ca. 100%). In the Michigan, USA.
same study also three other methods based

83
Piironen,T.,Haataja,R.,and E. Kernen (1989). Defect SPEED / POSITION PROCESS
CONTROL
detection in cold rolled steel strips. Proc. of the TRANSDUCER
SIGNALS
21st International Symposium on Automotive Technology
- strip start
and Automation (ISATA), Wiesbaden, FRG, Nov. 1989,
pp. 1749-1763.
PRODUCTION
Piironen, T.,Laitinen,T., and R. Haataja (1990). Machine DATA
strip length
vision based inspection of surface defects on strip and
strip width
sheet products. Automation Days, Helsinki, Finland, quality class
17-21.9.1990.

Piironen,T.,Silven,0.,Pietikainen,M.,Laitinen,T., and E. ALARMS TO


PROCESS
Strmmer (1990). Automated visual inspection of rolled PERSONNEL /
metal surfaces. Machine Vision and Applications, 3, CONTROL
247-254. process control
actions
QuinlanJ.R. (1986). Induction of Decision Trees. Machine
Learning, Kluwer Academic Publishers, Boston, USA.
INSTRUCTIONS
1, 81-106.
TO NEXT
PROCESS STAGE
QuinlanJ.R. (1990). Decision Trees and Decision Making. - machinery / strip
IEEE Trans, on Systems, Man and Cybernetics, Vol. 20, damages
No.2, 339-346. - e.g. lower speed

Swain,P.H., and H. Hauska (1977). The Decision Tree DEFECT REPORT


Classifier: Design and Potential. IEEE Trans, on FOR EACH COIL
Geoscience Electronics, Vol. GE-15, No.3, July, position of defect
142-147. (width, length,
STATUS OF coordinates)
THE PROCESS
type of the defect
- strip end
Wu,C.L.,Landgrebe,D.A.,andP.H. Swain (1975). The decision - strip stops repeatibility
tree approach to classification, NASA-CZ-141930, May.

Fig. 2. Data flow model of the inspection system.

Illumination unit

>jlMlliWill J

Operator Color Video Printer Reporting Modem


panel monitor hardcopy station

Fig.l. The basic components of the inspection system

84
Feature Classifi-
Pattern L Vector cation
Feature Classifica-
Extraction tion

Pattern
Samples
^h
Feature
+
Learning
Recognition

Analysis
{

Selection

Fig.3. A decision -theoretic pattern recognition system.

c^ct>
*X^~ D,E,F^)

A,B,...,H classes
4,7,8,9,10 features

Fig.4. A simple decision tree structure.

A = class 1 car
B = class 2 bus
A,B,C,D means of the classes
C = class 3 airplane
******* variances of the classes
D = class 4 house

Feature 1: * * * * * A* * * * *
AREA **B**
*****c*****
***D***
Feature 2: * * * * A* * * *
BRIGHTNESS **B**
*****c*****
**D**
Feature 3: * * * A* * *
SURFACE ****B****
ORIENTATION *C*
****D****
Feature 4: * * A* *
TEXTURE ****B****
******c******
*****D*****
0.00 -+--+--+--+--+--+--+--+--+--+--+--+--+--+--1.00

Fig.5. A coincident feature plot.

85
SEAM

: ABRASION
RIGHT HORIZONTAL SCRATCH

'i I I I
SPILL DARK 1
VERTICAL A i

SCRATCH \

| 1 1
( r) ;
1

i
SPILL I
1
ROLL MARK

\
\
no other
blobs near /
/
MM
Fig.6. Examples of defect classes.

INSPECTION CLASSIFICATION
SYSTEM PROGRAM
classification
defect features rules defect type
CLASSIFICATION
TEACHING SYSTEM CONTROL

7
SYSTEM

inspection criteria, process


defect types control alarms
displays
defect
QUALITY CONTROL reports
OFFLINE ONLINE

Fig. 7. Generation of classification rules.

86
Expert Knowledge
Give Your Limits for Different Features.
When giving Exact Value, fill both Sides with the Value !

Min. Max.
Width 20 Give Class Identification
forgiven Features.
Height 20
Class for Features:
40 Area *
large_spot
0.2 Aspect *
When ready, Add limits
Surr. Area * Example File.
Orientation *
Adtf is Example Fife
Left Dist. *
Right Dist. * Do You Want to Save
Limits for Future Use ?
Angle *
Members
* ;Sav& Limit 1

Fetch Limits from File


Polarity: Dark
Get Limits
O Bright

Ouit

Fig.9. Expert Knowledge child window.

diag__s cratch

YES / \ NO

large_spot small__spot

Fig.10. Manually designed decision tree.

87
Area >= 30.0
I
NO > small_spot
I
YES> Aspect >= 4.2875
I
YES> ver_scratch
I
NO > Height >= 34.0
I
YES> hor_scratch
I
NO > Orientation >= 1.7
I
YES> Width >= 7.5
I I
I YES> diag_scratch
I I
| NO --> large_spot
NO --> large_spot

Fig.11. Decision tree generated by SMARTLearn.

TABLE 1 Teaching Data for Decision Tree Generation

NO. Width Height Area Aspect Surr.area Orient. Left dist. Right dist. Angle Members Class
1 99 4 272 24.750 396 1.456 165 0 0 * hor_scratch
2 55 8 229 6.875 440 1.921 37 170 178 * hor_scratch
3 175 16 2029 10.938 2800 1.380 85 3 * * hor_scratch
4 169 10 1156 16.900 1690 1.462 87 6 * * hor_scratch
5 10 128 612 0.078 1280 2.092 19 234 * * ver_scratch
6 5 99 271 0.051 495 1.827 201 57 89 * ver_scratch
7 4 101 319 0.040 404 1.266 253 6 * * ver_scratch
8 4 6 12 0.667 24 2.000 151 107 124 * small_spot
9 6 105 412 0.057 630 1.529 6 249 * * ver_s cratch
10 4 54 152 0.074 216 1.421 60 198 88 * ver_s cratch
11 55 8 229 6.875 440 1.921 37 170 178 * hor_scratch
12 7 16 42 0.438 112 2.667 148 107 106 * large_spot
13 19 21 278 0.905 399 1.435 92 152 97 * large_spot
14 17 10 87 1.700 170 1.954 39 207 23 * diag_stracth
15 3 4 12 0.750 12 1.000 43 216 60 * small_spot
16 20 16 218 1.250 320 1.468 4 238 * * large_spot
17 13 16 111 0.813 208 1.874 22 226 52 * diag_stracth
18 25 29 239 0.862 725 3.033 230 7 * diag_stracth
19 21 2 45 10.500 42 0.933 160 83 179 * hor_s cratch
20 8 12 42 0.667 96 2.286 37 217 119 * diag_stracth
21 23 31 139 0.742 713 5.129 110 129 51 * diag_stracth
22 24 23 186 1.043 552 2.968 17 222 135 * diag_stracth
23 7 6 33 1.167 42 1.273 197 59 37 * diag_stracth
24 2 4 11 0.500 8 0.727 98 163 90 * small_spot
25 70 6 257 11.667 420 1.634 110 83 * * hor_scratch
26 35 4 78 8.750 140 1.795 132 96 179 * hor_scratch
27 11 37 233 0.297 407 1.747 88 163 83 * ver_scratch
28 8 2 18 4.000 16 0.889 200 54 179 * small_spot
29 209 14 2426 14.929 2 92 6 1.206 2 51 * * hor_scratch
30 4 4 12 1.000 16 1.333 222 37 26 * small_spot
31 5 4 18 1.250 20 1.111 228 30 179 * small_spot
32 8 10 62 0.800 80 1.290 182 74 64 * large_spot
33 10 8 73 1.250 80 1.096 222 31 6 * large_spot

88
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

COIL TRANSFER EXPERT SYSTEM FOR A HOT STRIP


MILL FINISHING LINE
M. Hosoda*, T. Kuribayashi*, F. Hirao** and K. Takenaka**
*Kashima System Planning Department, System Engineering Division, Sumitomo Metal Industries Ltd., Kashima-gun,
Ibaraki, Japan
**System Engineering Division, Sumitomo Metal Industries Ltd., Osaka, Japan

Abstract. Sumitomo Metal Industries has developed a coil transfer system that
utilizes knowledge engineering. The new system, installed on a hot coil finishing line
at Kashima Works, features an "expert system" to improve the efficiency and labor
productivity of material handling operations. Built around a knowledge base of approx-
imately 8000 steps, it provides real-time control of tasks previously left to humans.
The expert system is closely linked to the coil storage control system. Furthermore,
it automatically creates coil transfer plans for the finishing line as a whole a task
traditionally left to humans. The objective of transfer control is, in a general
sense, to increase the material handling efficiency and productivity at the finishing
line as a whole. The expert system has done much to realize these benefits early on.
One great advantage is flexibility-software modifications are easy and quick. Another
is that the automated system can be built around the experience of operations. With
its flexibility, its application of knowledge engineering to production planning, and
its real-time control, we believe that such coil transfer system will prove to be a
significant development in the field of material flow planning.

Keywords. Knowledge engineering; expert system; Knowledge base; Real-time control;


Transfer system; Hot coil finishing line; Artificial Intelligence.

finishing and processing lines, and materials


INTRODUCTION flow between them is complex; production moves
along at a swift pace and a great many coils
From quite some time back, Sumitomo Metals has must be handled; and the only system in place to
benefited considerably in terms of total cost keep track of these coils was designed primarily
minimization by the introduction of automated for record-keeping. We felt that, in order to
systems, generally to increase yield. However, solve these problems, the systemboth mechani-
the environment surrounding the steel industry cally and conceptuallyhad to be rebuilt from
continues to change; we are challenged with the basics.
demands for further cost reductions and
strengthened non-price competitiveness (ability Our basic objective in this development was the
to flexibly respond to varying customer require- provision of a system that: transfers, in a
ments) . timely manner, coils produced on the flow of
material between processes. To this end, our
On the finishing lines at the hot strip mill of developmental objectives were:
our Kashima Steel Works, Sumitomo Metals has (1) Enhancement of storage yard control and
adopted a "knowledge engineering" approach to automation of transfer equipment;
automate materials flow scheduling and opera- (2) Realization of materials flow based on oper-
tions previously performed by people. The result ating know-how and optimized in terms of the
is a coil transfer expert system that operates system as a whole;
in real-time utilizing a large-scale knowledge (3) Ability to respond flexibly to changes in
base. The system was put into operation in production.
September, 1988. Not only has the system met its
operating objectives outlined above, it has also Below we describe the software development cen-
provided a solid foundation for further system tering mainly on knowledge engineering.
developments to quickly and dependably accommo-
date recent changes in the field of materials
handling. BACKGROUND BEHIND THE INTRODUCTION OF
KNOWLEDGE ENGINEERING

DEVELOPMENTAL OBJECTIVES Coil tracking, coil yard management, issuing


commands to load coils onto lines and ship them
Various plans have been drawn up to strengthen from linesnearly all finishing line operations
operations at the hot strip mill, the core of were performed in accordance with operator
flat product activities at the Kashima Works. judgement. Operators situated in each zone would
One of these called for a fundamental review of check to see what had already been done to the
finishing line operations to improve material coils, how the equipment was operating, where
flow and reduce labor requirements. the transfer cranes were, and how the conveyors
were running; they would use their knowledge,
There were several kinds problems associated gained through experience, to judge what they
with materials flow at the finishing operations: saw; and they would issue commands in accord-
the finishing lines actually consist of several ance. However, the amount of information to be

89
processed is vast; the situation is always
changing; and decisions are made in terms of the fee
No.ISkinpass
situation at each zone. Clearly, a rational No.2 Pickling
system of scheduling was required, a system that No.1 SIitting
No.1 Pickling
incorporates the operating know-how and experi-
ence of each operator to comprehensively cover lD-
the entire finishing operation. Wrapp i ngc.
:Air-cooling

On a related development, it was determined that -B-Loop


coil cars were to be installed to speed the flow 110.2 SI itt iIng
of material between zones. However, we had no No.1 S h e a r i n g N k . 2 Skinpafc
accumulated know-how as to the operation of coil
cars. We thereby decided that the new system, as Water-cooling
No.2 Shearing o 3
developed, must be flexible; that is, it must be N S k i n p
s a s

easy to add or modify software to accommodate


Conveyer
new operating practices that may be established
as we gain experience.

If is for these reasonsspecifically, ease of Fig. 1. Layout of Finishing Line


incorporating operator knowledge and ease of
system modificationthat we chose to adopt an
expert system. The expert system we chose for
this project was MARKS II-RT, a system developed Rollins Sect ion
by our company that features high-speed infer-
Ma i n conveyer
ence under real-time environments.
Vater Coo 1 i n g Air C0 01i n

OUTLINE OF CONTROL SYSTEM Skinpass Line


Coil transfer in finishing line Pickling: Line
Slitting L ine
Fig. 1 shows the layout and rough dimension of
the finishing lines. Four working zones are in Wrapping. Line [Shearing Line
the finishing lines. The pickling (PC) zone
contains tow pickling lines, a skinpass line and
a slitting line. The cooling (CL) zone where Co Id Strip Mi 11 | | Shipping
coils are conveyed from the hot rolling mill has
large air-cooling coil yards, which are also
used for temporary storing of coils, and a
wrapping line. The shearing (SH) zone has a
shearing, skinpass, and slitting line. The Fig. 2. Coil Transfer Flow
skinpass (SK) zone has water-cooling coil yards
besides a skinpass and shearing line.
TABLE 1 Number of Transfer Equipment
Fig. 2 shows the flow of coils. A typical flow
of a hot coil is as follows. At first, a hot PC zone CL zone SH zone SK zone
rolled coil is cooled by air or water in the
cooling zone or the skinpass zone, and the Crane 3 6 3 4
surface of the coil is treated in the pickling Coil car A-Loop: 8 cars B-Loop: 4 cars
line. Then, the coil is processed through the
skinpass line and the slitting line. Finally the
product is wrapped and shipped.
station. A-loop has twenty-one stations and B-
The coil yards have a vast capacity for the loop has thirteen. The loading or unloading
accumulation of several thousands coils, and operation of a coil is performed when the coil
many coils are brought in the yards every day. car stops at a station.
Therefore, the search for the specified coils
and the transfer of the coils were very trouble- System Configuration of Control System
some work before the development of the comput-
Fig. 3 shows a newly developed microcomputer
er system in the finishing line.
system for the coil finishing lines. This con-
Transfer Equipment trol system is a very large-scale distributed

Coils in the finishing lines are transferred by


cranes, coil cars and conveyers. Table 1 shows
the number of transfer machines in the finishing
lines. The cranes share the transfer within one S-NET (Sumitomo-Network)

zone. Some conveyers can carry coils both within


one zone and across zones, and other conveyers Database |, Expert I,1 Commun Test
are responsible for the coil transfer at the Management System cation Control
(80286) (80286) (80286) (80286)
entrance and the exit of each finishing line.
The coil car system was introduced at the renew-
al of the finishing lines, because the capacity
of the coil transfer had to be increased. The Qh 30
dimensions and the number of cars in the coil Sk inpass Wrapping Coil Crane CoiI Car
car system were determined by the simulation of Cont rol Control Tracking Linkage Control
(80286) (80286) (80386) (80386)
the operation research technique. There are two (80286)

looped tracks, "A-loop" and "B-loop", and a


total of twelve coil cars travel in one direc-
tion on each looped track. Each coil car carries
one coil and is controlled to stop at the fixed Fig. 3. System Configuration

90
system, which consists of 12 80286-based micro- vated, it scans the variable area and reflects a
computers and 6 80386-based microcomputers, and change, if any, on data objects (polling-type).
is responsible for the coil flow management and After that, the inference engine applies the
the transfer equipments control of all the fin- knowledge units, in which the applying condition
ishing line. This system contains an expert is satisfied, in the listed order. The coils
system. which are being conveyed are referred to through
polling-type linkage. In the knowledge unit of
This expert system occupies the main part of "crane-scheduling", if the rule shown above is
this control system and is embedded in a 80386- satisfied with one of many transfer units and
based expert system machine. It is coupled fired, the conventional procedure is called to
closely with the conventional computer system give instructions to the crane (calling-type).
through communication facilities to get the
information or the knowledge data. The informa- Functions
tion about the finishing schedule are obtain
from the mainframe through the communication This expert system has three main functions. The
machine. The instructions about crane operations first is to select a coil to be transferred. The
are sent from the expert system machine to the coil must be transferred to an inlet conveyer
crane linkage machine and displayed in front of without delay so that each finishing line may
crane operators. The instructions about coil car work without discontinuity and must be removed
operation are sent to the coil car control immediately at the exit of each process to pre-
machine, and the cars are controlled fully vent the remaining coils from disturbing the
automatically. process operation. Furthermore, the process
condition in another zone should be considered
in the coil selection, because the coil is
EXPERT SYSTEM sometimes transferred across the zone by using a
coil car or a conveyer. Thus the expert system
System Configuration needs the information about the conditions of
each process, the crane information on the
The system configuration of the expert system position, operation and the kind of lifting
machine is shown in Fig. 4. This diagram is sim- tools, the conveyer information on available
plified to show only tasks directly relate to location and operation, and the coil car infor-
the expert system. MARKS-II has three types of mation on the location and loading state of each
linkage with conventional systems, which is car. The information is too much for an expert
described in procedural language such as PL/M, operator to grasp instantaneously, and the
Fortran and C. Both the expert system and the expert system is the most profitable for this
conventional system execute on the same comput- task.
er, and can interface with each other very well.
The second function is to determine a loading or
unloading station to/from a coil car in the
looped track. The efficiency of the coil car
operation depends upon the following three fac-
tors. The first is the number of in-service coil
cars. Vacant coil cars must be minimized. The
second is the delay time between the stopping of
a coil car and the operation of a crane. Each
crane should lift or put down a coil when a car
stops. The last is the interval between coil
cars. If coil cars travel close together, a car
should wait when the preceding car stops. There-
fore, the selection of the station must be per-
formed in consideration of these factors.

The last function is to keep transport equip-


ments operating as efficiently as possible.

To sum up these judgements must be done instan-


taneously in place of human experts.
Fig. 4. Expert System Machine Configuration
Knowledge Base Configuration
Transfer demand is given through the network
from the main-frame. In this computer, "Demand Since the system is to handle a vast amount of
Receive Task" is activated on receiving the information concerning cranes, coil cars, coil
demand, and saves the received demand to a RAM transfer commands, and the like, we chose to
file. When "Inference Driver Task" is activated unitize/modularize the rule base by function.
periodically, it retrieves the differential This helps simplify maintenance.
transfer demands from a RAM file and divides the
demands into more than one transfer unit. All of Knowledge Data Object
these demands, including transfer units, are
asserted into the knowledge data objects in the This expert system is a real-time control system
form of facts by the assert-fact procedure in the category of planning problem, so we take
(called-type). After that, the driver task production system an inference engine, and adopt
activates inference engine by activating-infer- frame and fact as a fundamental knowledge data
ence procedure provided as the called-type object.
linkage.
Transfer Equipments Object
"Data Gather Task" is activated independently by
interrupts as soon as the external situation Transfer equipments, cranes and coil cars, have
changes, and stores the gathered data to the much information. But the information is struc-
variable area. tured and a great portion of it is common to all
the cranes or all the coil cars. So we represent
When the inference engine in MARKS-II is acti- them by frame concisely as shown in Fig. 5. In

91
CRANE The rules use this parameter and make a transi-
status : idle tion between these conditions. The goal of the
work null expert system is to assign a transfer machine to
coiI-no: 0
a ready coil. In other words, the goal is to
change the condition parameter from "Transfer
PC-CRANE CL-CRANE fSH^RW"
ready" to "Transfer machine assigned", and to
is-aus :CRANE is-aus :CRANE is-aus CRANE replace some other parameters as an actual ma-
wk-zone: PC wk-zone: CL wk-zone SH chine name or a concrete position considering
tool : xx tool : yy tool XX the various environmental condition. Then the
result of inference is sent to the cranes or the
coil cars.
PC01 PC02
instance-of:PC-CRANE instance-of:PC-CRANE
sequence-no: 1 sequence-no: 2 Rules
location : 100 location : 230
The operational rules of transfer equipment and
expert's knowledge about scheduling are repre-
sented by rules. Many rules are defined through
interviewing expert operators to realize the
Fig. 5. Crane Represented by Frame functions mentioned above. Furthermore many
rules are added during the test period.
"crane" frame in the upper-most row, common
information concerned with a crane should be To increase the operational efficiency of
described, so that these information could be cranes, the movable area is specified to each
inherited to the lower frames. In the second crane. This reduces the out-of-service period of
row, the special information about the cranes in cranes.
each working zone is described. Cranes which are
not a concept but actual transfer equipments are To increase the operational efficiency of fin-
described in the third row. ishing processes and transfer machines, the
cranes should be assigned according to the
Transfer Demands following priority. The first is to lift a coil
from a coil car. The second is to place a coil
The coil orders of each manufacturing equipment on a coil car. The third is to remove a coil
are scheduled in a main-frame computer, and the from the process outlet, and the fourth is to
transfer demands are given by lots via communi- charge a coil at the process inlet. This priori-
cation machine. These demands are asserted and ty rank was determined, because the operation of
retracted frequently, so we represent them in coil cars is anticipated as the bottleneck of
the form of fact. the whole transfer system. If the finished coil
remains on the outlet of the line, the process
This expert system aims to schedule the opera- must be stopped. Therefore the remaining coil
tions of transfer equipments such as a crane or should be removed immediately to prevent it from
a coil car, so we divide the received demand disturbing the finishing process of the next
into more than one transfer unit corresponding coils.
to each transfer equipment. Each unit is made an
object of scheduling. The division can be proc- A rule was defined, about coil car assignments,
essed in a fixed procedural way, so we would that a coil car must be left vacant if it fol-
make this in conventional language. But in this lows three successive in-service cars. This rule
process, the concrete transfer equipments are results in the coil car assignment being easily
not determined, nor the loop-station. In deter- performed at any station without time delay. The
mining which crane or coil car is to be used, fraction of vacant coil cars can be adjusted in
the rules of the expert system are used. It is consideration with the finishing line's condi-
convenient to put all information about one tions. When a coil is scheduled to be trans-
demand together and save it. ferred from a conveyer to a coil car, the load-
ing station may be re-scheduled to the forward
So we represent all the information, such as position if it takes an unexpected amount of
time constraints and selection priority, by the time for the crane to make the previous opera-
fact named "transfer", and each transfer unit tion.
information, such as starting/destination posi-
tion and the kind of equipment, by the fact An example rule to assign a crane in CL zone is
named "trans_unit". The following are examples. shown below, where V() means the polling type
The first item in each fact is the Fact name or linkage with conventional system and
type and all other items are replaced by actual conveyer(X).coil(Y) indicates the coil number at
numeric or string values in the process of the Y-th saddle on the X-th conveyer.
inference.
RULE No2-conveyer-to-loop-3
?f <- FACT(trans_unit ?coil-no 2 No2conveyer
FACT(transfer coil-no work-code ?from-loc A-loop-station ?to-loc
from-time to-time crane not-scheduled)
work-order{if specified} priority) FACT(territory ?crane CL <?from-loc
FACT(trans_unit coil-no working-zone >?from-loc)
kind-of-starting-position location-of-it FRAME(?crane
kind-of-destination location-of-it @is-a crane
kind-of-trans-equipment step-of-scheduling) @wk-zone CL
status idle)
V(conveyer(2).coil(>=10&<=15&#MAX)
The last parameter of "trans_unit" indicates the <>0&?coil-no)
present transferring condition listed below. FRAME(?car
@is-a car
(1) Transfer forbidden @loop A-loop
(2) Transfer ready (not-scheduled) status ( i d l e or refuge
(3) Transfer machine assigned (scheduled) @zone ?zone)
(4) Being transferred TEST(( ?to - l o c - ?zone>=0) and

92
(?to-loc - ?zone<10)) dynamically reviewed. At the same time, the
system examines new transfer demands, cross
reference them with various restrictions, and
determines which demands can be honored. From
retract-FACT ?f these demands, the one with the most priority
assert-FACT(trans_unit ?coil-no 2 No2conveyer points is selected. The remaining demands are
?from-loc A-loop-station ?to-loc examined by inference to determine whether any
?crane scheduled) transfer demands exist that are optimal in terms
of the efficiency of materials transfer as a
whole. (This inference includes considerations
of actual and predicted crane position, actual
This rule means that when a unit transportation and predicted coil car position, distance coil
is demanded from No.2 conveyer onto an A-loop is to be moved, special operations, and other
coil car and the crane which is responsible for factors.) If so, these demands are allocated as
that territory is idle, the coil being trans- appropriate to cranes and coil cars. Finally,
ported exists between the 10th and the 15th the statuses of cranes and coil cars thus speci-
saddle on No.2 conveyer and there is a coil car fied are revised, after which those statuses are
which is idle and near the station, the coil held as they are until the next event. Statuses
which has the biggest number of saddle is se- are assigned on a one-to-one correspondence with
lected and the unit is scheduled. each operating crane and coil car.

Knowledge Unit System Characteristics


We divide our knowledge base into some units in The system is characterized by the use of var-
order to increase the maintainability. The divi- ious methods to speed up the inference processes
sion also helps to limit the rule firing within and create a real-time transfer control system
a knowledge unit and to suppress the unexpected capable of handling the diverse and flexible
firing. The knowledge units are as follows. demands of the work floor. These methods are
described below:
(1) Restriction in facility
(2) Restriction in charging order External input variables. The system uses exter-
(3) Coil car assignment nal input variables to take in crane tracking,
(4) Crane assignment coil car tracking, conveyer tracking, and other
event information. Consequently, the number of
The rules for the crane assignment include com- such variables is immense, as can be seen in
plicated logic, and the unit of the crane as- Table 2. To perform polling for all of this data
signment has the largest numbers of rules of 95 would create problems for real-time processing.
in this expert system. Knowledge base scale and Here, we used the following methods to speed up
knowledge unit configuration are shown in Table processing:
2. - Use a polling cycle of once every 10 seconds
(normally, polling would be performed every time
a knowledge unit is shifted);
TABLE 2 Outline of Knowledge Base - Omit polling of some data kept in tables and
defined as external input variables;
- Omit polling of areas with no data within
FDATA FACT 4 ?. (t v p e s ) external input variable tables;
BASE FRAME Z(tvpes) - Omit polling of meaningless "0" data.
PL/M data 7758
(Variables) Rule Description. The description parameter
RULE Event Reflection 45 Ru les section of many of the rules is quite long.
BASE Useless data Cut 4 3 Rules Consequently, to increase matching efficiency,
Work Limitation 4R Rules we placed facts with many inferential changes at
Tota 1 Schedule Limitation 68 Rules the end of the parameter section. Furthermore,
1 1 3 7 4 r u 1 e s Car Control 77 Rules we made the fact configuration such that facts
8000steps Decide Crane Work 95 Rules including wildly changing elements are separate
from those containing other elements.

Use of Many External Functions. Editing and


After all the relevant rules are applied in a processing of character strings in the rule
knowledge unit, the rules in the next knowledge execution section are done completely by exter-
unit are applied. The switching of knowledge nal functions, all of which can be called di-
units is controlled by the metarules. The coil rectly from within the knowledge base. In addi-
car assignment is estimated as more crucial than tion, built-in functions have been provided to
the crane assignment, so the knowledge unit of derive, from data that fits the parameters in
the coil car assignment is executed earlier than question, minimum and maximum values without
that of the crane assignment. having to resort to complex rule descriptions.

Inference Flow Discussion

Inference is performed in 10-second cycles. In It is true that the operating speed of expert
one inference cycle, 19 knowledge units are system is slower than that of conventional sys-
sequentially activated; inference is performed tems, but the development period of expert sys-
repetitively until there are no more rules to tems must be much shorter than that of conven-
execute. The average time for one inference tional systems. Expert systems enable the trial
cycle is about six seconds. test of the prototype at an early stage. And the
modifications and additional changes are easily
The overall inference flow is as follows. First, performed to develop the final system. Therefore
an external event changes the status of a crane, expert systems are suitable for applications
coil car, coil, conveyer, or the like. Then, where logic is updated frequently or changed by
based on event information, (predetermined) the environmental conditions. In fact we have
priority points for each transfer demand are developed this expert system in only ten months,

93
and we estimate that the period is less than a REFERENCES
half in comparison with the conventional system.
During the test or adjustment phase, various Forgy, C.L. (1982). Rete:a fast algorithm for
modifications were needed according to the in- the many pattern/many object pattern
crease of production, but the modifications have matching problem.
been achieved easily in a short period. This has
confirmed the expert system's feature of easy Fumiki, H. (1990). An expert system for effi-
maintenance. cient coil transfer in finishing line of hot
strip mill. ISIJ International, Vol.30,
On the other hand, it is difficult to test or 167-172.
verify the logic of the rules in the development
of the expert system. Therefore, the offline Kazuki, T. (1988). Expert system building tool
simulation tool, including the model of finish- for process control:MARKS-II. Workshop
ing lines and the expert system, has been de- on Artificial Intelligence for Industrial
veloped, and the logic of rules have been justi- Applications, 376-381.
fied in the offline test. In this manner, the
logic were verified and the system could be Tamao, Y. (1987). Expert system building tool
smoothly shifted to online environment. This for process controloverview--. IPSJ,
method of testing should be applied to the 1721-1722.
development of expert systems in general.

BENEFITS

Use of this system has allowed us to increase


the overall efficiency of materials flow at the
finishing lines as a whole; it has also contrib-
uted significantly to the rationalization of
finishing operations. More specifically, we
obtained a reduction of lead time (-2.8 days)
and in materials handling personnel (-86 people)
through; establishment of coil yard management
based on an automated system of record-keeping;
automation of coil transfer commands; introduc-
tion of coil cars; implementation of automatic
operation and control; and other such action.

The adoption of an expert system played an ex-


tremely large role in helping us to complete
this project quickly. Needless to say, flexibil-
ity and speed with regards to the addition of
modification of software were invaluable; we
also found the expert system to be beneficial in
allowing us to construct an automatic system
that incorporates existing operating know-how in
a form favored by the operators. Through this
project, we established the effectiveness of
expert system for scheduling-type systems.

CONCLUSION

Only five months after the creation of a proto-


type, a flexible materials flow system fully
matched to the needs of the workplace was com-
pleted. We believe that this coil transfer
expert system, characterized by the application
of knowledge engineering to scheduling and the
introduction of an expert system for real-time
control, suggests one approach for future mate-
rial flow scheduling systems.

To cope with the increasing harshness of the


environment surrounding the steel industry and
to meet ever stricter demands for product quali-
ty, we plan to devote even more energy to the
development of new systems that exploit the
advantages of expert systems.

94
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

KNOWLEDGE ENGINEERING AN EXPERT SYSTEM TO


TROUBLE-SHOOT QUALITY PROBLEMS IN THE
CONTINUOUS CASTING OF STEEL BILLETS
S. Kumar, J.A. Meech, I.V. Samarasekera, J.K. Brimacombe
The Centre for Metallurgical Process Engineering, The University of British Columbia,
Vancouver, B.C., Canada V6T174

ABSTRACT

The important features of knowledge engineering an expert system to diagnose quality problems
in the continuous casting of steel billets are described. The paper considers the process used to
extract knowledge from the Experts, the experiments employed to represent knowledge in different
ways, the strategies adopted to accumulate evidence and the methodologies to present coherent
recommendations to both experienced and inexperienced operating personnel.

Expertise was derived from diverse sources. On the one hand fundamental knowledge about heat
transfer, solidification and mechanical behaviour of steel is applied to identify the origin of a quality
problem in the process; on the other, heuristic knowledge associated with billet characteristics,
machine design and operating factors is required to focus on the problem causes. Integration of
these two different types of knowledge was essential in developing a useful system.

From the Experts' viewpoint, this exercise provided them with a formal representation of their
knowledge to solve quality problems and identified new areas for future research. Moreover, the
Experts actually generated new ideas about the domain which previously were unrecognized. For
the users, this system serves two important functions - as a diagnostic tool for analysing quality
problems and as a teaching tool for new operating personnel.

Keywords. Expert System; COMDALE/X; Knowledge Units; Knowledge Acquisition;


Knowledge Representation; Evaluation and Testing of Expert Systems; Continuous Casting of
Steel Billets; Quality Problems in Billet Casting; Cracks; Rhomboidity; Breakouts.

INTRODUCTION KNOWLEDGE DOMAIN


In the continuous casting of steel billets, emphasis Continuous casting is a heat-extraction process in
on quality control is a prerequisite in today's which molten steel is converted into a solid
competitive environment to meet customer semi-finished product. The steel is cooled in three
specifications and to reduce operating costs. The stages: a water-cooled copper mould, one or more
goal of high quality cannot be realized, however, water spray zone(s) and finally in air by radiation
without knowledgeable operating and heat transfer. Figure 1 shows a schematic diagram
maintenance personnel who understand their of a billet casting machine. During processing,
casting system. Similarly, the expertise these the solid steel is subjected to thermal and
individuals possess should be considered mechanical stresses which can give rise to quality
invaluable, and a method to retain and transfer problems in the billet such as cracks (Brimacombe
this expertise within an organization can be of and Sorimachi, 1977, Brimacombe, Hawboltand
great advantage. Weinberg, 1980 and Bommaraju, Brimacombe
and Samarasekera, 1984), rhomboidity
There has been significant research into the (Brimacombe and Samarasekera, 1988) and
control of billet casting which has correlated breakout of liquid steel through the solid shell
certain process operating parameters with (Samarasekera and Brimacombe, 1987). Figure
different quality problems. Although the resulting 2 shows a rhomboid billet and the types of cracks
literature is well-documented, transfer of this that might be observed.
fundamental knowledge to operating personnel is
difficult eventhough numerous short courses on
the subject have been given. There is a need then,
for an expert system to guide operators in
analysing quality-related problems and in
providing them with a ready source of
fundamental knowledge related to the operation
of the casting machine. Such a system has been
developed with this purpose in mind.

This paper presents the important aspects of


knowledge engineering involved in the
development of this system. The various stages
of knowledge acquisition are described together
with the approach taken to structure the problem
domain. The knowledge representation
techniques available in the development tool,
COMDALE/X, are highlighted. Testing and
evaluation of the system by the Experts and the Fig. 1. Schematic diagram of a billet casting
Users is summarized. machine.

95
Fundamental knowledge related to heat transfer, acquisition can be severe when the Knowledge
solidification and the mechanical behaviour of Engineer has far less knowledge about the domain
steel are important topics in analysing these than the Experts. In this work, steps were taken
problems. In addition, operator experience in the to ensure that this was not an issue.
plant provides a valuable source of heuristic
knowledge about the process, particularly site An outline of quality problems, comprising the
specific issues. In this expert system, integration origin of defects, their causes of formation and
of knowledge from these two diverse sources was suggested remedies for prevention, was prepared
attempted to provide useful advice for operators by the Knowledge Engineers. This preliminary
experiencing quality problems. categorization is summarized in the appendix.
Although these tables contain the major elements
of the knowledge domain, the methodology for
diagnosis is much more complex than a simple
TRANSVERS
E MIDFAC
E CRAC
K "look-up" table. With a desire for a system
LONGITUDINA
L MIDFAC
E CRAC
K possessing human-like thought and
communication processes, considerable effort
CRAZ
E CRACK
S TRANSVERS
E CORNE
R
was expended to structure the actual approach
CRAC
K used by the Experts in the analysis of quality
problems.

Expert System Development


CENTRELIN
E CRACK
MIDWAY CRACKS The problem domain consisting of cracks (internal
PINCH-ROL
L CRACKS
and surface), rhomboidity and breakouts was
OFF-CORNE
R INTERNA
L CRACK
S
broken down initially into individual modules -
Fig. 2. Schematic diagram of a rhomboid each of which could be worked on independently.
billet showing various types of cracks. Accordingly, development began with an
examination of midway cracking as this problem
was considered to be relatively simple and
KNOWLEDGE ACQUISITION well-understood. The Experts initiated the
knowledge acquisition process by identifying that
Knowledge acquisition has been defined as the midway cracks were caused by reheating of the
transfer and transformation of problem-solving billet surface.
expertise from some knowledge source to a
computer program (Hayes-Roth, Waterman & The focus then, was aimed at analysing
Lenat 1981). In our case, this process consisted spray-related issues that are less complex than are
of four principal stages: definition of the problem mould disorders. It quickly became apparent,
domain, acquisition of background knowledge on however, that poor mould operation could also
continuous casting, development of the expert contribute to midway cracks. In the end, midway
system and testing and validation in the field. cracks turned out to be the most complicated
During the first stage, the Experts outlined the problem from the viewpoint of diagnosis and
problem area and the need for an expert system to knowledge representation.
diagnose quality problems in billet casting.
During system development, the knowledge base The Experts outlined the steps followed in dealing
was structured, coded into rules, evaluated by the with midway cracks and explained the
Experts and modified. This was an iterative significance of operating parameters involved at
process where the knowledge base was subjected each stage. The initial scheme to analyse midway
to numerous revisions as well as expansion. The cracks originating in the radiation zone is
final stage involved testing of the Expert system summarized in Table 1. Factors such as spray
by the Knowledge Engineers, Experts and length, water flow rate, steel temperature and
End-Users. Feedback obtained from these composition were considered to be important, and
evaluations was incorporated into the expert based on this information, a small prototype was
system. developed.
TABLE 1 Initial Scheme proposed bv the
Definition of Problem Domain Experts to analyse Midway Cracks
originating in the Radiation Zone.
The problem identification stage was driven by
the Experts who wanted a system to help transfer STAGES DETAILS |
their knowledge to industry. The Experts defined 1 Input Distance of the outer tip of the crack from the billet
the problem domain and the need for an expert surface
system to diagnose quality problems in billet 2 Compare Measured distance with solid shell thickness profile
casting and to train less experienced operating 3 Establish Location of crack origin in the machine
personnel. It was envisaged that the system could I4 Input Model-predicted billet surface reheat for radiation zone
be used as a consultant by users and so provide Spray parameters - zone length and water flow rate
High temperature strength of steel - Mn/S ratio
the Experts with more time for other activities. Cast structure - Superheat level
Thus, the problem modality was well-defined and 5 Analyse Compare input parameters with values specified by the
the Experts were ready and willing to participate experts
in the development process. 6 Correlate Generation of midway crack with the operating
parameters

Background knowledge on continuous casting Conclude Conclusions and justification

The Experts provided the Knowledge Engineers


L
Almost immediately, questions were raised about
with literature on continuous casting of steel the solid-shell thickness profile and the surface
billets- operations, design and product quality. temperature distribution. In the prototype, it was
One of the Knowledge Engineers also attended a assumed that the users would have prior
short course provided by the Experts and their information about these predictions. Since the
colleagues in Vancouver. This established a system simply informed the users when operating
practical understanding of the problem domain parameters were "high", "low", or "okay", its
and helped considerably in subsequent usefulness was nothing more than a check-list
discussions with the Experts during development. with input based on intuition rather than
Buchanan and others (1981) have pointed out that measurements or analysis. Thus, the first stage
communication problems during knowledge of implementation was not very successful.

96
The knowledge base was reviewed and emphasis Until this stage, the focus was on the radiation
was re-focused on the consultation process zone. But midway cracks have also been observed
followed to deal with a client rather than on actual in the spray zone or in the region between the
details of the data. The system was modified to mould and the sprays (submould). For the
provide conclusions and suggest remedial actions Experts, it was difficult to visualize the origin of
in the same way as the Experts. The knowledge midway cracks in the spray zone because cooling
base was re-structured to facilitate examination of rates in this region are high and therefore, the
all parameters in the analysis. In addition, billet surface is unlikely to reheat. If midway
information for the users was incorporated into cracks do initiate in the spray zone, the Experts
the system in the form of explanations, surmised that only one factor could influence
screen-messages and customized rule billet surface reheating -poor spray maintenance
descriptions, using the user-interface facilities of (damaged or blocked spray nozzles). The system
COMDALE/X. was expanded to include this situation. This
completed the prototype for midway cracks and
At this stage, the expert system was capable of the next problem was selected for development.
handling only typical situations which cause
midway cracks to form - high reheat of the billet
surface, high superheat of steel and poor steel It was pointed out by the Knowledge Engineers
ductility at high temperature. The Experts were that more than one quality problem might be
consulted on other situations where midway observed at the same time. This meant that the
cracks might be seen viz. when both spray and plan to have a separate module for each problem
steel parameters were satisfactory, when only was inappropriate. There was need for a general
steel parameters were satisfactory and when only strategy to handle any combination of quality
spray parameters were satisfactory. The Experts problems.
felt that it was impossible for midway cracks to
form when both spray and steel parameters were
satisfactory. Combinations of Quality Problems

However, when a problem related to steel quality Based on the knowledge summarized in the
was evident and the spray parameters were appendix, the domain was divided into two broad
satisfactory, the Experts believed that reheating groups. The first group consisted of those
must be related to a mould disorder. Under these problems that require a detailed analysis of the
circumstances, the presence of dark and bright mould and/or the spray cooling system.
patches on the billet surface at the mould exit, Off-corner cracks, transverse cracks and
representing over-cooled and under-cooled depressions, midway cracks, rhomboidity,
regions respectively, would confirm a diagonal cracks and breakouts are included in this
mould-related problem. The overcooled regions category. The second group deals with very
can reheat excessively in the radiation zone below specific issues for which a detailed analysis of the
the sprays, leading to midway cracks. Therefore, mould and/or the sprays is not needed.
expansion of the system was necessary to include Longitudinal corner cracks, longitudinal midface
knowledge dealing with mould parameters cracks, pinch-roll cracks, centreline cracks and
(distortion, oscillation, taper and lubrication). craze cracks are in this group.

It was clear that the users would not be in aposition The knowledge base was organized into three
to predict either billet surface reheat levels or the modules as shown in Fig. 4 to handle
solid shell thickness profile needed for the crack combinations of quality problems and to
analysis. Therefore, a heat-transfer mathematical accommodate the memory limitations of the
model for billet solidification was developed to operating system, MS-DOS 4.01. This
predict these profiles. The model uses operating classification was based on three different
parameters as inputs and predicts solid shell approaches identified for the diagnosis process
thickness and maximum reheat temperatures at that require detailed mould and/or spray analysis.
key locations in the casting machine. The Module 1 is designed for situations where no
alternating direction implicit finite-difference midway cracks are present but any combination
method is used to solve the unsteady-state of off-corner cracks, transverse cracks and
heat-conduction equation in two-dimensions depressions, breakouts, rhomboidity and
(Patankar, 1980) over one quarter of the billet diagonal cracks are observed. In this situation,
section. The transverse section was discretized
into a 30 x 30 mesh with a time increment of 0.10
seconds. A typical output profile for a test QUALITY PROBLEMS REQUIRING
DETAILED ANALYSIS
situation is shown in Fig. 3.
OH-come
r Crac
k Midwa
y Crack
s
Transvers
e Defect
s
Rhomboidit
y
Breakout
s

[ Midwa
y Crack
s |

Distance from the meniscus ( m )

Fig. 3. A typical output of the mathematical


model showing predicted solid shell
thickness and billet surface Fig. 4. Scheme for diagnosing quality
temperature profiles problems requiring detailed mould
(Casting Speed - 25 mm per second; and/or spray analysis.
Section Size - 150 mm square)

97
only a detailed analysis of the mould is necessary,
KNOWLEDGE REPRESENTATION
although the system can provide warnings about
possible spray maintenance problems. Module 2
Knowledge representation involves the
examines cases that include midway cracks with
development of structures that assist in coding
any other type of quality problem. Mould issues knowledge into the system so that intelligent
are examined first followed by a detailed spray behaviour is exhibited. The main knowledge
analysis to address midway cracking problems. items used in this expert system can be grouped
into structural, procedural, external and
Module 3 deals with midway cracks as the sole meta-knowledge (Meech, 1990). Structural
quality problem. In this case, the spray system is knowledge consists of rules (statements and
examined first. If a spray problem is uncertain, procedures ), and facts (classes and keyword
then mould issues are examined. As well, the triplets). These units provide the various
mould may be examined if required by inference connections between facts in the expert system as
of the system or by request of the user. If such a it searches for final conclusions. Procedural
request is not desired but the system detects certain knowledge units are employed to direct the
mould disorders, a severe warning message is inferencing process for a number of
issued. In all three modules, there are rules knowledge-specific tasks. External knowledge
dealing with quality problems that do not require provides an interface for the system to interact
detailed analysis of the mould or the sprays. with external programs while Meta-knowledge or
Adverse mould-shell interaction is the root of "knowledge about knowledge", gives useful and
most mould-related problems. A thorough appropriate explanations and justifications to the
"breadth-search" of factors related to adverse user. The knowledge units available in
mould-shell interaction, both at the meniscus and COMDALE/X are shown in Fig. 6.
away from it, is conducted. Figure 5 shows the
factors examined. The important issues are KNOWLEDGE
mould-tube distortion, oscillation characteristics,
mould lubrication and taper.
Object
s Attribute
s Value
s Degre
e ot Belie
f
I- , I
ADVERSE MOULD-SHEL
L
INTERACTIO
N Keywor
d Triplet
s Exclusiv
u e sets t x c l
-Logical JMull
i.choice sets
Mulli

I Si
I n
I
Keyword Triplet
s
- C o n d i t ino S t a t e m esn t Logica
l Connective
s
-ii -Predicate
Function
s
s / Operator
s

Keyword Triplet
s
- Conclusio
n S t a t e m esn t Certaint
y Factor
s
MOULD TAPE
R MOUL
D DISTORTIO
N MOUL
DLUBRICATIO
N Assignments
- Else S t a t e m esn t Function
s

Inferenc
e Strategie
s
1: Cooling WaterVelocit
y Flowrate 1: Negative Strip Time
2: Cooling WaterQualit
y j 2: Oil Distributio
n 2: Mould Lead Input/Outpu
t Strategie
s
3: Mould Tube Alignment 3: Oil-Physical Propertie
s 3: Nature of oscillatio
n ere Communication Control
4: Type of Mould Constrain
t ' Oil System cleanliness A: MetalLevelVariatio n 1
S: Mould WallThickness M e t a - K n o w el e d g
6: Positio
n of MetalLevel Triplet Representation
s
6: Mould Coolin
g Water
7: Mould Coppe r Temperatur
e Customized Question
s
8: Mould Design Tolerances 1 Customized Rule Description
s
Customize
d Explanations

Fig. 5. A general outline of mould-related Fig. 6. Knowledge units available in


factors contributing to quality COMDALE/X Development Tool.
problems in billet casting.

Mould-tube distortion is related to cooling water Decisions to use specific facets such as exclusive
quality and velocity, the nature of the mould sets, fuzzy sets, restrictions, default values, etc.
constraint system, copper composition, wall to represent various facts, were made as the
thickness, metal level, tube alignment and design knowledge base expanded across the domain. The
tolerances (Brimacombe, Samarasekera and availability of procedures to control the inference
Bommaraju, 1986). The analysis of oscillation process - breadth search versus depth search,
characteristics involves evaluation of forward chaining versus backward chaining is a
negative-strip time, mould lead and the nature of major asset of the COMDALE/X Development
oscillation marks - depth and uniformity across Tool. Of particular importance is the flexibility
the billet surface. Important parameters dealing of the tool to allow knowledge accumulation
with lubrication in the mould are oil flow options that are domain-specific (Meech, 1989).
conditions (flowrate and distribution), physical The method developed to establish belief in a
properties of oil (boiling point, flash point and spray-related problem is described below.
viscosity), and maintenance of the oil-distribution
Belief in a spray-related problem
system. Finally, an analysis of the type of taper
and its magnitude is conducted. Taper can
The appearance of midway cracks in a billet
influence the formation of both off-corner cracks
suggests that solid steel close to the solid-liquid
and transverse defects.
interface has been subjected to tensile strains
during processing. The generation of the tensile
This development had a major impact on the strains is frequently caused by reheating of the
project as it presented to the Experts a structure strand surface due to a sudden reduction in heat
that was very similar to their own. For the first extraction. Reheating causes surface expansion
time, they were able to observe links between which is constrained by the interior of the
various quality problems that had not been solidifying shell such that a tensile strain is
considered previously in their analysis. This
generated close to the solidification front where
strategy was also beneficial as it significantly
reduced the amount of coding necessary as per the the steel has low ductility at temperatures within
earlier plan since only three modules were needed. 50 C of the solidus temperature (Thomas,
Brimacombe and Samarasekera, 1986).

98
The level of surface reheat determines the
magnitude of the tensile strain generated and
therefore the intensity of a cracking problem.
Thus, it was considered that belief in a high strain
level is equivalent to the degree of belief in a
"high" reheat. Belief in a "high" reheat level is
determined by defining a fuzzy set for the B
adjective "high":
A
Radiation Zone 0 50 75 100 150
Reheat
Temperature
(C)
Degree of 0 25 50 75 100 INITIA
L BELIE
F IN SPRAY-RELATE
D STRAI
N PROBLE
M ( %)
belief in a high
reheat (%)
Fig. 8. A schematic sketch illustrating the
calculation of final belief in a
A low Mn:S ratio and a high steel superheat are spray-related tensile strain problem
two operating factors that can aggravate the following the detection of a mould
cracking susceptibility of steel when the reheat disorder.
level is high (Brimacombe, 1976). Although the
OBC : Belief in a spray problem due to reheat, high
presence of a high strain level is mandatory for superheat and composition problem
crack formation, problems with composition and
superheat can lead to cracking at lower strain OAC : Modified belief in a spray problem due to the
levels i.e., at lower reheat levels. On their own, definite presence of one or more mould
superheat and composition problems are not of disorders
concern since cracks cannot initiate in the absence
of a tensile strain.
Thus, the contribution of belief in these effects
on increasing or decreasing belief in a
spray-related strain problem is considered to be
similar. The influence of a mould problem on
belief in a spray problem is taken into account only
after the effects of superheat and high-temperature
/ / strength of steel have been considered.
/ /| c

STRUCTURE OF THE EXPERT SYSTEM


Figure 9 outlines the general methodology
adopted in this system to analyse quality
problems. At the start of a consultation, the
system acquires operating parameters with the
help of an external program. The heat-transfer
F IN HIGH REHEA
BELIE T DUE TO THE SPRAY
S ( %) model then is executed to calculate the solid shell
thickness profile and the surface temperature
Fig. 7. A schematic sketch illustrating the distribution for the billet. An output file is created
calculation of degree of belief in a to interface with the expert system shell,
spray-related tensile strain problem COMDALE/X. The user is then prompted to
from belief in high reheat in the desciibe the quality problems and to select the
radiation zone, high superheat and most important one. Output from the
composition problems. mathematical model, as well as the quality
OCD : Initial belief in a tensile strain problem due to problems entered by the user, are "imported" into
belief in high reheat the "main" module of the expert system.
OBD : Modified belief due to definite high superheat
"OR" definite composition problem
OAD : Modified belief due to definite high superheat
"AND" definite composition problem
The effect of composition and superheat on belief OPERATIN
G PARAMETER
S
INPUT
in a spray-related strain problem is shown in Fig.
7. In the methodology adopted, belief in a 1
spray-related strain problem can be increased HEAT TRANSFE
MODE L
R QUALIT
Y PROBLEM
INPUT
S

above the belief in a "high" reheat when a


composition and/or superheat problem is present.
If a "high" reheat "might not be" present (i.e.
degree of belief < 50 percent), then belief in a MAIN MODUL
E
strain problem can only be increased if the
Analysis of specificit
y associated
certainty of a composition or superheat problem with the most important proble
m
is greater than 50 percent. Thus, when there is
uncertainty about the presence of "high" surface
reheat (degree of belief = 50 percent), the absolute
presence of one or both of these problems will
increase belief in a strain problem to 70 or 90 MODUL
E1 | | MODUL
E2 | | MODUL
E 3 ~j
percent respectively.

Belief in a spray-related strain problem can also


be decreased if the system detects a mould
disorder. Figure 8 shows this influence. When
uncertainty exists about a spray problem (degree
of belief = 50 percent), the absolute presence of
a mould problem will lower belief in the spray Fig. 9. A general flow-sheet representing the
problem to 30 percent. structure of the expert system.

99
REFINEMENT OF DOMAIN KNOWLEDGE
In the "main" module, the system verifies the
quality problem(s) entered by the user. In the case
This knowledge engineering exercise was
of midway cracks and pinch-roll cracks for
beneficial in many ways. It led to a formal
example, the user is asked questions on the
representation of the Experts' knowledge; a
distances of the outer tip of the cracks from the structure evolved that was easy to understand and
billet surface. This establishes initiation sites for explain to others. During knowledge acquisition
midway cracks and confirms or denies an and system development, some new ideas about
observation about pinch roll cracks. the domain were uncovered while several obscure
issues were clarified. Some examples of these are
For the most important quality problem, the as follows:
system addresses issues related to frequency of
the problem with regard to particular strands,
* A new classification scheme for quality
grade of steel, time in the heat, location and
problems in billet casting has evolved. The
orientation. One of the three analysis modes is
problems are divided into two groups; those
then selected depending on the combination of
that have complex origins and those where
quality problems observed. Prior to presenting a
very specific issues need to be examined.
summary of the analysis in the "report" module,
With this strategy, all combinations of
the system provides the user with an opportunity
quality problems can be handled.
to examine all final conclusions and to probe into
the knowledge base to obtain justification for each
piece of advice. * The presence of scale deposits on the cold
face of the mould and poor mould lubrication
were linked. When scale deposits are present
on the cold face of the mould, the mould
operates hotter than normal and therefore,
the lubricating oil vaporizes or burns before
EVALUATION OF THE EXPERT SYSTEM reaching the metal level. Thus, a problem
with mould lubrication can be related to poor
The expert system was evaluated informally by water quality.
the Experts and the Knowledge Engineers during
its development. Here, the computer program * A novel inferencing strategy was developed
was tested and weak areas of the knowledge base in the analysis of midway cracks to integrate
were identified and modified. At a later stage in the influence of important parameters such
the development, the expert system was installed as surface reheat temperature, superheat
in five Canadian steel companies for formal levels and steel composition.
evaluation. This proved that the system was
capable of successfully linking dominant SUMMARY
problems at several sites with expected causes.
An expert system has been developed to guide
operators in analysing quality-related problems
Company and in providing them with a ready source of
Comments on the analysis fundamental knowledge related to the operation
"A" The system related a problem with of the casting machine. Important outcomes of
rhomboidity to plugged spray this knowledge engineering exercise are:
nozzles. The operators solved this
problem by improving the spray (a) Fundamental and heuristic knowledge
cooling water quality. required for problem solving have been
integrated during system development.
"B" Off-corner cracks were found to
originate near the mould exit. The (b) The formal structuring of the knowledge
system suggested a mismatch base had a major impact on the domain as
between spray and mould heat it presented the Experts with a structure
extraction rates due to a large gap that was very similar to their own thought
between the mould exit and the first process. The methodology evolved can
spray nozzle. handle combinations of defects in billet
"C" The heat transfer model calculated a casting.
high reheat in the radiation zone.
Midway cracks were seen in the (c) The knowledge structure provided the
radiation zone. The cause of this high users, for the first time, with a consistent
reheat was a short spray zone. interpretation of the domain knowledge.

(d) This system provided an opportunity to


establish new links between quality
The users also identified a number of issues related problems and new ideas about the
to modification and future expansion: knowledge domain have evolved.
* Expansion of the present knowledge base Future efforts in expanding this system will
to include other quality problems - examine the application of this knowledge to a
centreline segregation, laps, bleeds, real-time on-line monitoring and control system.
blowholes and pinholes. By providing this assistance to the quality-control
process, significant improvements in productivity
* Errors in the code - omission, misprints, and product quality are likely to occur.
incorrect units, etc.

* Need for additional user-friendly features ACKNOWLEDGEMENTS


- graphical output profiles of billet surface The authors are greatly indebted to Stelco Steel,
temperature and solid shell thickness. Ivaco Rolling Mills, Slater Steel, Sidbec-Dosco
Inc., Courtice Steel, Western Steel, Hatch
* Need to recognize different levels of Associates and the Natural Sciences and
expertise among the possible users. Engineering Research Council of Canada for
support of this study. Gratitude also is expressed
to Comdale Technologies Inc. for their assistance
with the software.

100
REFERENCES

Bommaraju, R., Brimacombe, J.K. and Buchanan, B.G., Barstow, D., Bechtal, R.,
Samarasekera, I.V. (1984). Mould behaviour and Bennett, J., Clancey, W., Kulikowski, C ,
solidification in the continuous casting of steel Mitchell, T. and Waterman, D.A. (1983). In :
billets: . structure, solidification bands, crack F.Hayes-Roth, D.A.Waterman and D.B.Lenat
formation and off-squareness. ISS Transactions. (Eds.), Building Expert Systems. Reading,
5, pp.95-105. Massachusetts, USA: Addison-Wesley,
pp. 127-167.
Brimacombe, J.K. (1976). Design of continuous
casting machines based on a heat-flow analysis: Hayes-Roth, F., Waterman, D.A. and Lenat, D.B.
state-of-the-art-review. Canadian Metallurgical Eds (1983). Building Expert Systems. Reading,
Quarterly. 15, pp. 163-175. Massachusetts, USA: Addison-Wesley.
Brimacombe, J.K., Hawbolt, E.B. and Weinberg, Meech, J.A. (1989). Conflict resolution and
F. (1980). Formation of off-corner internal cracks knowledge accumulation in expert systems. In :
in continuously-cast steel billets. Canadian R. Rao and G. Dobby (Eds.), Processing Complex
Metallurgical Quarterly. 19, pp.215-227. Ores. Pergamon Press, London, pp.575-585.
Brimacombe, J.K. and Samarasekera, I.V. (1988). Meech, J.A. (1990). Expert Systems for teaching
Fundamental analysis of the continuously casting and training in the mineral industry. Minerals
process for quality improvements. Proceedings Engineering. 3, pp. 129-136.
of Indo-US Workshop on Principles of
Solifdification and Materials Processing. Patankar, S.V. (1980). Numerical Heat Transfer
Hyderabad, India, Oxford and IBH Publishing and Fluid Flow. New York, USA : Hemisphere
Company, New Delhi, India, pp. 179-222. Publishing Corporation.
Brimacombe, J.K., Samarasekera, I.V. and Samarasekera, I.V. and Brimacombe, J.K. (1987).
Bommaraju, R. (1986). Optimum design and Mould design to prevent breakouts in the
operation of moulds for the continuous casting of continuous casting of steel billets. Report to
steel billets. Fifth International Iron and Steel Stelco Edmonton Steel Works. Unpublished
Congress, Washington D.C., USA, 69th Work.
Steelmaking Conference Proceedings.
ISS-AIME. 69, pp.409-423. Thomas, B.G., Brimacombe, J.K. and
Samarasekera, I.V. (1986). The formation of
Brimacombe, J.K. and Sorimachi, K. (1977). panel cracks in steel ingots: a state-of-the-art
Crack formation in the continuous casting of steel. review. ISS Transactions. 7, pp.7-20.
Metallurgical Transactions B. 8B, pp.489-505.

APPENDIX : QUALITY PROBLEMS IN BILLET CASTING

A Internal Cracks. B Midway cracks.

1 QUALITY LOCATION CAUSES INFLUENCING 1 QUALITY LOCATIONS CAUSES INFLUENCING 1


PROBLEMS OF FACTORS PROBLEM FACTORS
CRACKING Mould exit or in Reheating of the Mismatch between the 1
Midway
Off-comer Lower part of Bulging of the Thermo-mechanical behaviour Crack the gap between billet surface mould and the sprays: due |
Crack the mould or solid shell and of the mould; Adverse the mould and the to design or maintenance |
very close to the hinging at the mould/shell interaction; Deep sprays problem; Poor design of |
mould exit off-comers and non-uniform oscillation cooling jacket near the
marks; Steel composition and mould exit; Steel H
superheat composition and superheat |

Diagonal Crack In the spray Non-uniform Thermo-mechanical behaviour Upper portion of Reheating of the Poor spray maintenance : |
zone shell of the mould; Adverse the sprays billet surface bent or plugged spray II
generated by mould/shell interaction; nozzles; Steel composition
the mould Asynchronous intermittent and superheat 1
boiling in the mould; Deep and Lower portion of Reheating of the Sprays : design and
non-uniform oscillation marks; the sprays or the billet surface due maintenance issues; Steel
steel composition and superheat radiation cooling to the sprays composition and superheat
Asymmetric Poor spray design and zone
spray cooling maintenence; Steel Reheating of Thermo-mechanical 1
composition and superheat dark overcooled behaviour of the mould;
Centreline Near the point Sudden Inadequate spray cooling near patches Adverse mould/shell
Crack of complete decrease in the point of complete generated by the interaction; Deep and
solidification the centreline solidification; Steel mould non-uniform oscillation
temperature at
the point of
composition and superheat
_ marks; Steel composition
and superheat
complete
solidification
Pinch-roll Close to the Squeezing on Excessive pinch roll pressure;
Crack pinch rolls a strand with Steel composition and
liquid core superheat
Unbending Close to the Unbending on Excessive bending strains;
Cracks point of a strand with Steel composition and
unbending liquid centre superheat; High casting speed

101
Q Surface Cracks. D Rhomboiditv and Breakouts.

QUALITY ORIGINS CAUSES INFLUENCING | QUALITY ORIGINS CAUSES INFLUENCING 1


PROBLEMS FACTORS PROBLEMS FACTORS

Transverse In the mould Pulling action on Thermo-mechanical behaviour | Rhomboidity In the mould Non-uniform Thermo-mechanical behaviour 1
Crack (and the strand as a of the mould; Adverse | and/or the shell generated of the mould; Adverse I
depression) result of binding mould/shell interaction; Deep | sprays in the mould mould/shell interaction;
or sticking in the and non-uniform oscillation | Asynchronous intermittent j
mould marks; Steel composition and Q boiling in the mould; Deep and I
superheat fl non-uniform oscillation marks; |
Mould-tube alignment; Steel i
Longitudinal In the mould Reheating of the Large comer radius; Presence |
superheat J
Comer Crack billet comer due of comer "key holes"; Mould |
to a large tube alignment; Asynchronous 8 In the sprays Asymmetric Poor spray design and 1
mould/shell gap intermittent boiling in the | spray cooling maintenence; Steel superheat J
mould; Deep and non-uniform H
Breakouts Close to Thin shells Thermo-mechanical behaviour
and / or oscillation marks; |
obtuse-angle generated at the of the mould; Adverse
Thermo-mechanical behaviour
comers of obtuse angle mould/shell interaction; Deep
Presence of thin of the mould; Adverse the strand comers due to and non-uniform oscillation
and weak shell at mould/shell interaction; Steel I asymmetrical marks; Mould-tube alignment;
the hotter comers composition I
cooling in the Steel superheat
| Longitudinal In the mould Excessive Presence of scratch or gouge mould
| Midface Crack reheating of a marks on the inner surface of 1
Close to Local reduction Adverse mould/shell interaction;
localized portion the mould wall transverse in the shell Deep and non-uniform
of the billet depressions oscillation marks;
thickness due to
surface. or deep Thermo-mechanical behaviour
the presence of
| Craze Crack In the sprays Grain boundary High level of Cu, Sn or low oscillation depressions or of the mould
embrittlement melting impurity in steel marks on the deep oscillation
due to the billet surface marks
presence of Cu,
Close to any Inadequate Insufficient dwell time in the
Sn or low
weak spot in amount of shell mould; Entrapment of slag or
melting
the shell thickness at the scum between mould and billet
impurities in
mould exit
steel.
Mould Operators' error Poor metal control
overflow

102
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

APPLYING KNOWLEDGE-BASED TECHNIQUES TO THE


SCHEDULING OF STEEL ROLLING
O. Lassila*, P. Mattila**, L. Pesonen**, M. Syrjnen* and S. Trm*

* Laboratory of Information Processing Science, Helsinki University of Technology, Otakaari 1,


SF-02150 Espoo, Finland
**Rautaruukki Oy, Raahe Steel Works, SF-92170 Raahe, Finland

Abstract: This paper describes a knowledge-based production scheduling system built for the
plate rolling mill of a large Finnish steel manufacturer. The chosen scheduling algorithm (the
activity-based scheduling method) had to be modified because the dominating production con-
straints in steel rolling differ from those of job-shop manufacturing, the original application
area of the method. The continuous nature of the production process as well as some resources
need special treatment w h e n constructing a timetable. The resulting scheduling algorithm is
outlined in the paper.

Keywords: Artificial Intelligence, Rolling Mills, Steel Manufacture, Production Control, Micro-
computer-based Control

1. INTRODUCTION ble solution is to invest in both vertical and lateral


communication systems (Kerr 91).
Steel manufacturing differs in many ways from job-
shop -like production. The end products of steel works This paper describes a general framework for produc-
lack internal structure, i.e. they are not composite ob- tion planning for Rautaruukki's Raahe Steel Works
jects, and the production process is far from assembly- and a knowledge based scheduling system for plate
oriented. Instead, it splits larger units into smaller ones, rolling mill that was built during 1989 - 1990. Helsinki
e.g. meltings into slabs and slabs into plates. University of Technology and Rautaruukki Oy have
carried out this project together using frame-based
In addition, the dominating constraints in steel manu- scheduling software developed at the Laboratory of In-
facturing are typically other than those in job-shop pro- formation Processing Science.
duction. There are, for example, strong connections be-
tween successive manufacturing operations in several
points of the process and important technological con- 2. STEEL MANUFACTURING PROCESS
straints that limit the possible operation sequences on
some resources. A commercial job-shop -oriented Rautaruukki's Raahe Steel Works is composed of sev-
scheduling system is not the best alternative for manag- eral different production units as illustrated in figure 1.
ing that kind of a manufacturing process. The end-products of the plant consist of hot rolled steel
plates and strips. The production process can be divided
The traditional w a y to
manage the s u b s y s t e m
"" " " "
- "
T
interactions in produc- " - ",
-- " "' "1 " -
" "

"
"' >
'"'"'' ""'"'
tion plants is to use in- Strip

\7
termediate buffers to re- Strip !
, ,$t#p$^ Loading products
duce the need for coordi-
: Raw
t mill
nation. H o w e v e r , the
global trend t o w a r d s
Ore Blast Steel f Jkbs

7\,f.
Post

shop
K
shorter delivery times furnaces processing 1
and increasingly c u s - i \
tomer oriented products Cote i i i i i i l i i t I. t a l e s - -. Plate
products
has caused more and Loading
more pressure to decrease
Coking
iiilliii
stocks and throughput Coal
times and has thus cre- plant \sPost
ated a need for more so- processing 2
phisticated methods for
coordination. One possi-
Figure 1: Rautaruukki's Raahe Steel Works

103
into two fairly independent parts: the process up to the and produces plates which are of the size specified in
blast furnaces and the process after them. The task of the customer order. In the beginning of the process the
the blast furnaces is to provide sufficient amounts of slabs are heated to the proper rolling temperature. In
even-quality raw iron to the steel shop. the rolling mill the hot slab is rolled to specified plate
width and thickness. After rolling, about half of the
The primary production management problems deal plates are normalized. Plates are levelled and after that
with the process starting from the steel shop where raw they arrive to a cooling bed where they spend a time
iron is processed to different types of molten steel and proportional to their thickness. The cool plates are then
cast to steel-slabs. The weights of the meltings are signif- moved either to the mechanical cutting line or the
icantly larger than the average weights of delivered flame cutting cell.
products and therefore the steel grades produced must
be selected so that they satisfy the requirements of a Plates can be processed through the plate mill using dif-
wide range of end-products. This is an optimization ferent routes. Some routing choices are determined by
problem the goal of which is to find a set of steel grades the properties of the plate, such as thickness or width,
and a product mix that match each other and satisfy a and some can be chosen during scheduling.
set of external constraints such as due dates, cost min-
imisation, even quality, and even material flow and re- Part of the operations, especially in the beginning of the
source load. line, must be done immediately after the previous op-

To further complicate the


task, the selection of the di- Hot
mensions of the slabs is sub- leveling
mill 2
ject to two different kinds of
pressures: first, the rolling
mills prefer slab dimensions
which match the dimensions
of the end-products to mini-
mize the waste, and second,
the steel shop wants to mini-
mize the changes of the di-
mensions of the slabs be-
tween different castings.

From steel shop the slabs are


moved to strip and plate
mills where they are rolled in
hot rolling mills into thin
steel plates and finally cut
into customer d i m e n s i o n s
and packed for dispatching.
The process of the strip mill
is fairly process-like as the
slabs move through the pro-
cess in fixed routes and in the
release order. The plate mill
Loading
is more complicated because t hall
the routes of plates can vary a
lot and different product
mixes may cause bottlenecks Figure 2: The plate rolling process
in different parts of the pro-
cess. There is thus a clear need for a detailed scheduling
system in the plate mill. eration has been completed. That makes planning more
difficult because decisions concerning individual opera-
tions of an unbreakable sequence cannot be made sepa-
The production planning at the plate mill initiates a rately but the focus of attention must, instead, be in
task called slab design, deciding which plates are cut complete operation sequences.
from selected slabs and how the cutting is done. This is
The plate rolling process contains some resources that
again an optimization problem where the goal is to
pose strong constraints on some properties of successive
minimize the amount of steel wasted. The heat plan-
reservations (so called adjacency constraints). There are
ning in the steel shop is also based on slab design. In
two classes of adjacency constraints the first of which in-
addition, the production of the plate mill needs careful
cludes constraints that limit the set of operation pairs
scheduling in order to satisfy complex technological
that can be performed within some time period in the
constraints and to effectively utilize the production fa-
same resource and the latter consists of constraints lim-
cilities.
iting the operations that can be performed with same
resource regardless of the time spent between them. In
2.1. Plate Rolling Mill plate mill the pusher resources (reheating furnaces,
normalization furnace and cooling beds) belong mostly
Plate rolling mill (figure 2) takes as its input steel-slabs
to the first category and the hot rolling mill to the latter.

104
The operation of pusher resources (figure 3) is as fol- pending on customer orders. Therefore a scheduling
lows: the slabs or plates move through the resource one system is a more justifiable alternative for solving the
after another and they get a special treatment (such as production management problems than investment in
heating or cooling) during the time they spend inside new resources which, in the case of steel industry, tend
the resource. Typically several slabs or plates can be in- to be expensive.
side the resource simultaneously. The throughput of
the resource can be maximized by minimizing the dif-
ferences between consecutively processed items.
3. PLATE-MILL SCHEDULER
In heating resources, for example, thick plates need a
Knowledge-based systems - a knowledge-based sched-
longer treatment and the plates following the thick
uler being one - can be thought of as consisting of two
ones get a treatment of a similar length. It is therefore
distinct parts: a problem solver (independent of the ac-
beneficial if the thick plates can be processed in batches.
tual problem) and knowledge about the problem to be
In some pusher resources there are an additional dan-
solved. In the case of factory scheduling, the problem
ger of overheating, i.e. also the maximum time a slab or
solver is a more or less general scheduler, and the
plate can spend inside the resource is limited. In prac-
knowledge is contained in the system as a detailed
tice this means that slabs or plates that differ much with
model of the target manufacturing process.
respect to the critical property (thickness) cannot be
processed close to each other. Object-oriented programming and m o d e l i n g lend
themselves well to the implementation of knowledge-
based schedulers. To represent the structure and proper-
ties of a manufacturing process is fairly straightforward
using object-oriented - or more specifically frame-based
- techniques. In addition to structural relationships and
<M11 liii attribute data, a model also consists of different kinds of
constraints that pose limits to how different things can
Figure 3: A pusher resource
be arranged in a schedule, and preferences that guide
The adjacency constraints of the rolling mill are caused the schedule formation.
by the wear of the work rolls (figure 4). These rolls have
In our approach, w e have used frame-based modeling
to be changed every now and then and the new work
to describe the whole scheduling problem: the manufac-
rolls have different capabilities than the old rolls. To
turing facility with all its products, resources, produc-
maximize the quality of the plates (e.g. constant thick-
tion procedures, constraints and preferences. This has
ness) and to minimize the probability of failures the
enabled us to add a large amount of detail information
order in which slabs are processed must be planned
to our model and, for example, to create special resource
very carefully.
classes that closely represent the performance of the ac-
tual machines. These details can be effectively utilized
In the beginning of a rolling cycle work rolls are slip-
in scheduling process.
pery and require slabs with low rolling pressure. When
the work rolls have warmed up the increasingly diffi-
The scheduler was implemented using the Common
cult slabs are processed and as the work rolls wear only
Lisp language (Steele 1990) and a frame system called
narrower and thicker plates can be rolled.
BEEF (Lassila 1990). The scheduler runs on high-end
Macintosh workstations. At the end of the project the
possible connection between Macintoshes and IBM
mainframes was planned but not implemented.

Scheduling <> User-


Kernel Interface
>k I\
1'

Domain
Figure 4: The rolls of the rolling mill Model
The current planning system focuses almost exclusively
on the sequence constraints of the hot rolling mill. The
result of the planning is a rolling sequence, the order in Figure 5: Structure of the DOM scheduler
which the slabs are processed in hot rolling mill. The
sequence is basically determined by the dimensions of
The scheduler consists of roughly three distinct parts
the slabs and plates but some heuristic rules are also ap-
(depicted in figure 5): the scheduling kernel takes care of
plied to balance the load in other parts of the produc-
scheduling, based on the information contained in the
tion process. In practice, however, unexpected bottle-
domain model, a rich object-oriented representation of
necks arise and special actions are required to manage
the manufacturing process; the system can be manipu-
those situations.
lated and monitored through an interactive, window-
based and mouse-driven user-interface. The basic de-
The bottlenecks of the process vary and almost any ma- sign objective was to clearly separate these three parts. It
jor resource in the line can turn into a bottleneck de- is also possible to use several scheduling kernels simul-

105
taneously in case the scheduling process needs to be dis- task is created for each manufacturing activity that be-
tributed. longs to the operation-graph of some order. These sub-
tasks are then processed one-by-one. This fine-grained
The connection between the user interface and the
decomposition allows the scheduler to change its atten-
other parts of the system is very loose. It is in principle
tion from one bottleneck to another in a very flexible
possible to run the scheduler without the user interface.
way.
The connection is based on so-called demon mecha-
nism of the underlying frame system: changes in the
The basic ABS algorithm proceeds as follows. At each
scheduler's data-structures (that is, frames) are automat- cycle
ically transmitted to the user interface objects, without
the scheduler or the model having to know about this. 1. the activity that is the most difficult to schedule is
selected and
The plate-mill scheduler was originally planned to 2. a position (that is, resource, start-time and duration)
work as a part of a distributed scheduling system and its for the activity is selected so that the other un-
top-level control structure is designed with this goal in scheduled activities would be as easy to schedule as
mind. The scheduler communicates with the external possible.
world - for instance, with the user or with the other
parts of a distributed system - by sending and receiving In the end of each cycle a commitment to the selected
messages. These messages are processed and turned into reservation is made and the influences of the commit-
a queue of appropriate tasks and the queue is then pro- ment are propagated. If a conflict is detected - i.e. the
cessed according to priorities of the tasks. This control time-window of some activity vanishes - the last deci-
structure gives the scheduler the flexibility that is re- sion is retracted and the best remaining position is tried.
quired in a distributed problem solving environment. If there are no remaing positions then decision before
the last one is retracted and so on. The basic algorithm
is thus based on backtracking search.
4. SCHEDULING METHOD
4.2. The analysis of the competition
Initially the plate-mill scheduler was based on the activ-
ity-based scheduling method (subsequently referred to There are two things that are done in ABS to avoid con-
as ABS) which is a general scheduling method based on flicts. First, the competition situation is repeatedly ana-
the analysis of constraint interactions (Keng, Yun and lyzed during the schedule generation process and sec-
Rossi, 1988; Sadeh and Fox, 1989). ond, a control strategy that minimizes the likelihood of
conflicts is used.
The background of ABS lies in constraint satisfaction
problems (Mackworth, 1977) that form a category of The analysis is carried out by computing the estimated
problems studied within the field of artificial intelli- load impacts of activities and estimated load distribu-
gence. The production scheduling problem is viewed as tions for resources. First the time-windows of activities
a set of variables (here the start-times, durations and re- - that is, the time intervals during which the activities
sources of operations) whose possible values are limited have to be performed - are computed by propagating
by different constraints (the due dates of customer the earliest possible start times and latest possible end
orders, limited capacities of resources and precedence times in the operation graphs. The load-densities for
constraints between operations in same operation- the activities are then computed with the aid of their
graph). The goal is to find a value assignment for vari- local start-time preferences. A load-density is a temporal
ables that satisfies all constraints. distribution that reflect the likelihood that an activity is
being performed at different time points.
The production scheduling problem belongs to a sub-
class of constraint satisfaction problems called con- The load-densities of those activities that use same re-
strained resource problems (Keng and Yun, 1989) where source are added up to yield the load-density of the re-
the domains of variables overlap and one value can source. These load-densities show the probable loca-
only be assigned to one variable. Consequently the vari- tions of bottlenecks.
ables compete of the same values. The competition can
lead to conflicts: two activities want to reserve the same
capacity and one of them has to give it up.
4.3. Control points and decisions
As described above there are two main control points in
Since the domains of the variables are finite - caused by the ABS-algorithm: the selection of the next activity
finiteness of available time and capacity - the conflids and the selection of the next reservation. Both of these
can lead to a dead-end situation where all possible val- selections can be done in different ways, see for example
ues of a variable have been assigned to other variables. (Sadeh and Fox, 1989).
If this happens there are basically two possibilities. The
first is to backtrack, that is, a previously made decision In activity selection a good heuristic should reflect the
is retracted and a new alternative is tried. The second al- probability of the future lack of any positions for the
ternative is to relax some of the constraints (for in- activity. A good estimate is, for instance, an activity
stance, the due dates) which increases the domain of the resource reliance heuristic which rates those operations
variable. most difficult that have the largest share in resource
bottlenecks.
4.1. Basic algorithm
In the above formulation the efficiency of the schedule
The name of ABS is derived from the particular way generation process is the only concern. In practice, how-
that the overall scheduling task - consisting of a set of ever, the quality of the resulting timetable is an impor-
customer orders - is decomposed into subtasks. A sub- tant factor which has to be taken into account in the se-

106
lection of the position for an o p e r a t i o n . A mixed One slab is scheduled as an unbreakable operation se-
heuristic should be applied w h e r e the difficulty of re- quence from the beginning to the end of the process.
maining operations and the quality of the decision are
combined. At each cycle

1. the loose start time of the next reservation in most


4.4. Modifications critical resource is determined
In the course of the project it turned out that the actual 2. the activity sequence is selected based on (a) the fit
constraints in steel rolling differ significantly from to the selected time-point, (b) the load densities and
those constraint classes taken into account in basic ABS- (c) the local preferences of the operations
algorithms a n d s o m e basic a s s u m p t i o n s behind the
In the end of the cycle a commitment is m a d e and in-
method were u n w a r r a n t e d in this case (for further dis-
fluences are propagated. In case of conflicts due-dates of
cussion see (Trm, Lassila and Syrjnen, 1991)). While
orders are relaxed which means that the schedule can be
it was possible - in principle - to create schedules with
completed without backtracking.
the algorithm it performed poorly what came both to
the quality of the schedules a n d the efficiency of the
scheduler.
5. DISCUSSION
The problems originated from two very basic features of
the ABS-method: (1) the type of decomposition used A result of the scheduler project was a prototype of a
and (2) the order in which the next activity and the next scheduler for the plate mill of Raahe Steel Works. The
reservation are determined. p r o t o t y p e has s h o w n that the p r o d u c t i o n p l a n n i n g
problem in the plate mill can be solved with the aid of a
The activities that belong to unbreakable operation se- knowledge-based system. The scheduler uses several
quences cannot be treated individually by the algorithm criteria w h e n m a k i n g s c h e d u l i n g decisions and the
as the decisions concerning one of the activities m a y timetables are therefore s o m e w h a t different from their
vastly reduce the freedom with the others. If the algo- h u m a n - m a d e counterparts where some particular
rithm uses local heuristics (that evaluate single activi- technological preferences are stressed exclusively.
ties) only the first activity of a sequence is considered in
making a decision that influences all the activities, re- The plate-mill scheduler is able to function as one part
sulting in s u b o p t i m a l reservations for the remaining of a hierarchical distributed scheduling system that has
activities. been sketched to Raahe Steel Works. The b o u n d a r i e s
given by the overall master schedule for the whole steel
This problem can be solved by using a different decom- works are posted as goals to different production units
position. Instead of single manufacturing operations, (steel s h o p , rolling mills, etc.) that make detailed sched-
the algorithm regards the whole unbreakable sequences ules of their internal activities. In case of conflicts (i.e. if
as complex activities. The estimates are c o m p u t e d for the goals c a n n o t be met) t h e s c h e d u l e r s of t h e
whole operation sequences and they are scheduled in production units can negotiate between each other to fix
one cycle. This modification had a positive effect both the p r o b l e m s or if the conflicts a r e too large the
on the quality of timetables and on the efficiency of the production unit can report its inability to meet the goals
scheduling algorithm. and master schedule is modified a n d n e w goals are
created.
The second p r o b l e m concerns the constraints dealing
with adjacent reservations in the s a m e resource. The The plate-mill scheduler project confirmed the well-
basic ABS-algorithm tends to make reservations so that k n o w n fact that it is extremely important to acquire the
they fill the available time horizon evenly. It is difficult users' acceptance to the system. A good user interface
to guide that kind of process so that good reservation and user controllability are i m p o r t a n t issues but even
sequences are constructed. It turned out that if the selec- more important is the users' participation to the pro-
tion phases in the algorithm are "reversed" (that is, the ject. Without it the system is likely to confront resis-
start-time of the next reservation at most critical re- tance simply by being u n k n o w n and a foreign inven-
source is selected first a n d the actual operation only tion.
after that) the adjacency preferences can be followed
more faithfully. The modifications that had to be m a d e to the original
scheduling algorithm suggest that it might not be wise
to try to build very general schedulers. Different con-
4.5. The final algorithm straints d o m i n a t e different p r o d u c t i o n processes a n d
the c o n s t r u c t i o n process of a s c h e d u l e r s h o u l d be
The final algorithm used in plate-mill scheduler em- driven by the actual constraints. Scheduling is a compu-
ployed the two modifications that were described above. tationally complex task and there is a trade-off between
A major concern was the efficiency of the scheduling the time spent a n d the quality of the result. A good
system. A lot of tuning was d o n e to reduce both the scheduler avoids unnecessary work by simplifying its
time spent in scheduling process and the m e m o r y re- task where possible.
quired.
There a r e , h o w e v e r , m a n y c o m m o n p a r t s (mecha-
The determination of alternative routes of the plates nisms, object-types, user interface widgets) that exist in
was left to the process p l a n n i n g p h a s e before the most schedulers. O u r current research is directed to-
scheduling phase. Only simple cases where the sched- wards building a scheduler toolkit, an extensible object-
uler chooses between alternative resource (such as dif- oriented class library, that includes both descriptions of
ferent cooling beds or flame cutting machines) were left d o m a i n objects (resources, orders, operations) and im-
to be determined at the scheduling time. plementations of several useful mechanisms (to per-

107
form time bound propagation, to conduct complex
searches, to maintain load distributions etc.). The main
goal of the research is to provide clear interfaces to dif-
ferent mechanisms so that the system can be extended
with a minimum need to rewrite existing program
code.

REFERENCES
Keng N.P., D.Y.Y. Yun and M. Rossi (1988). Interaction-
sensitive planning system for job-shop scheduling. In
M.F. Oliff (Ed)., Expert Systems and Intelligent Manufac-
turing, North-Holland, pp. 57-70.
Keng N.P., D.Y.Y. Yun (1989): A Planning/Scheduling
Methodology for the Constrained Resource Problem, in
proc. of Eleventh International Joint Conference on Ar-
tificial Intelligence (IJCAI-89), Morgan Kaufmann, pp.
998-1003.

Kerr R. (1991). Knowledge-based Manufacturing Man-


agement, Addison-Wesley.
Lassila O., M. Syrjnen, S. Trm (1991). Coordinating
mutually dependent decisions in a distributed sched-
uler. In: E. Eloranta (Ed.): Advances in Production Man-
agement Systems, North Holland, pp. 257-265.
Lassila O. (1990), The BEEF Reference Manual - A Pro-
grammer's Guide to the BEEF Frame System. Report
TKO-C43, Laboratory of Information Processing Science,
Helsinki University of Technology, Otaniemi (Finland).
Mackworth, A.K. (1977): Consistency in Networks of Re-
lations, Artificial Intelligence 8
Sadeh, N. and M. Fox (1989). Focus of attention in an ac-
tivity-based scheduler. Proceedings of the NASA Con-
ference of Space Telerobotics.

Steele G.L. Jr. (1990). Common LISP: The Language


(second edition). Digital Press.
Trm, S., O. Lassila, M. Syrjnen (1991): Adapting the
Activity-Based Scheduling Method to Steel Rolling. In:
G. Doumeingts, J. Browne, M. Tomljanovich (Eds.),
Computer Applications in Production and Engineering
(CAPEf91), North Holland, pp. 159-166.

108
Copyright IFAC Expert Systems in Mineral and GENERAL APPLICATIONS
Metal Processing, Espoo, Finland, 1991

EXPERT SYSTEM FOR COAL BLENDING


E. Nakata*, H. Fujimoto** and K. Terazono***

*Systems Engineering Division, Kawasaki Steel Systems Research and Development Corporation,
Kurashiki 712, Japan
*Systems Laboratory, Systems Planning and Data Processing Department, Kawasaki Steel Corporation,
Tokyo 100, Japan
*Coke Technology Section, Ironmaking Department, Kawasaki Steel Corporation, Kurashiki 712, Japan

Abstract: The application of an expert system to plan coal blending for coke production
is described. The object of this coal blending plan is to produce coke of stable quality at
low cost while maintaining the optimum use of the coal yard. The monthly coal blending
planning work previously depended on an experienced operator's expertise, and the
expert system approach has helped to reduce planning time and has standardized the
planning method. The effectiveness of the expert system in comparison with linear pro-
gramming was confirmed by the following advantages:
1. Every fact that was difficult to express by a fixed formula could be expressed as a
rule.
2. Calculation time was shortened.
3. Every solution could be derived by limiting its searching scope.

Keywords: artificial intelligence; expert systems; mathematical programming; produc-


tion control; blending; cokemaking

PREFACE DEFINITION OF THE SYSTEM AND ITS AIMS

The steel industry has had to adapt to many changes in Definition of the Coal Blending Plan
its surroundings, like the effect of the oil crisis in 1973
that spurred the saving of resources and energy. It is Japan imports most the coal needed for coke making
now moving into the age of greater product sophistica- from Australia, Canada, U.S.A., U.S.S.R., and China.
tion, a wider range of products, and a shorter lead time KSC determines its annual purchase plan by linear
to production. programming (LP), which yields the optimal coal
blend for minimum cost, that is needed to supply suffi-
With the aim for integrated and totally planned control cient coke quality for blast furnace use. This LP selects
of the entire steelworks operations, the Mizushima from about 50 coal brands, but the coal yards cannot
Works of Kawasaki Steel Corporation (KSC) has been usually stock all these brands. Therefore, the coal
developing a large scale system. By the end of 1990, blending plan needs to be formulated every month.
Mizushima had accumulated COBOL programs con-
sisting of some 30 million steps. The coal blending plan involves setting up proper
blending ratios for each bed for a 2-months period.
However, some processes involving human judgment About 25 beds are used, each bed including about 20
and decision-making have been beyond the reach of coal brands selected from approximately 40 brands. It
EDP using conventional systems approaches. While is needed to attain the coke quality and quantity that is
these tasks left out of EDP applications can be handled defined by the plan for blast furnace operation and the
by skilled employees (experts), they will be a future synchronized plan for coke oven production.
bottleneck to achieving further integration.

After recognizing that expert systems (ES) could solve Aims of the System Development
this problem, we applied ES to various tasks with suc-
cessful results. This paper focuses on the development Blending is performed according to inventory informa-
of COMPASS-88 (Coal Mixing Plan Artificial Intelli- tion for the coal yard (quality and quantity of each coal
gent System Developed in '88), a coal blending pro- brand) and the plan of coal shipment designed at the
gram that has recently been put into commercial opera- head office (arrival date, prospective quality and ton-
tion. nage by brand). Fig. 1 shows the environment of the
blending plan. The aim of the blending plan is to manu-
facture stable quality coke at minimum cost with the
highest possible yard operation efficiency.

KSC has developed models to forecast coke strength

109
since the constraint conditions change depending on
Tokyo Head Office
the requirements for cost minimization, quality
fluctuation minimization, etc.
Coal Optimum ship
2. Depending on the type of coke, some blending ratios
purchasing delivery
are treated as zero percent instead of some numbers
system system
due to the constraints of quality fluctuation and
workability, so that it becomes arduous to derive a
Blast furnace
formula. Even if a numerical formula can be de-
production
rived, its calculation takes a long time.
Berthing 3. Although there exists a model to estimate the coke


plan
plan quality from different grades of coal, the formula is
a multi-dimensional polynomial with grade parame-
Coke oven
ters, creating the non-linear programming task with-
production
out a guaranteed solution.
plan
For these three reasons, we decided to adopt the
method used by the human experts. With the use of ES,
we were able to solve the problems associated with the
Coal & coke LP method.
Bed stacking
quality control
program
system
OUTLINE OF THE SYSTEM AND ITS FEATURES
Coke Pmnfiss
Cornputer Planning Process
The inputs to COMPASS-88 are as follows:
Fig. 1 Environments of the coal blending design (1) Target quantity and quality index of coke to be
produced;
(2) Inventory information for the coal yard regarding
from the coal brands, blending condition, and coke the quality and quantity of each coal brand;
oven operation conditions. However, these models' (3) Coal shipment information regarding the arrival
forecasting accuracy is limited, because the blending date, prospective quality and tonnage of each coal
plan needs to be set up for the coals before its tonnage brand;
and quality has been confirmed. (4) Operational data for the coke ovens.
Accordingly, experts have manually formulated the COMPASS-88 formulates the coal blending plan in
blending plan to mitigate operational risk. These ex- three stages as follows (see Fig. 2). The initialization
perts have the required skill for brand selection and stage sets up the various parameters interactively as
blending ratio set up under many restricting conditions. preconditions for the plan. A crude plan is then formu-
Most of this expertise was difficult to standardize, and lated to grasp the trend in coal quality from the inven-
the planning work has been akin to a black art. Since 2 tory. The refined plan is finally created for a detailed
days were required to formulate a 40-day plan, it was blending program that is feasible and fits well with the
difficult to re-formulate the plan rapidly to meet coal quality in the inventory.
changes in the aimed coke quality and in the incoming
ship arrival schedule. COMPASS-88 adjusts the blend averages of coal qual-
ity indexes such as ash, sulfur, volatile matter, and re-
The aims in developing a blending system were as fol- flectance fluidity to the targets by using quality estima-
lows: tion models, and creates a set of the best blends by us-
(1) To shorten the planning time for various quality ing expertise that has been provided by the human
targets and coal arrival plans; experts to avoid coke quality fluctuation. Fig. 3 shows
(2) To relieve experts from routine planning work; some examples of this expertise.
(3) To standardize the planning procedure.
The expertise for designing the blending program con-
sists of obtaining resolutions by branch and bound
Reasons for Applying ES methods, based on recognition of the steps (meta
knowledge) for effective resolution shown in Fig. 2,
Generally, the linear programming (LP) method is and on the values and restrictions shown in Fig. 3.
considered suitable for a problem that involves distrib-
uting limited resources effectively, like a blending The outputs from COMPASS-88 are as follows:
plan. If a formulation is possible and the solution can (1) The determined coal blending ratio by each brand
be obtained within a practical time, LP is most likely to for every bed;
be used. The head office of KSC in fact uses the LP (2) Estimated quality indexes of coke that would be
method for total cost minimization in the annual coal produced from the determined coal blending ratio;
purchasing plan. (3) Various graphs for analyzing the blending plan that
has been drawn up by COMPASS-88.
In practice, we found it difficult to form a suitable
model for the system in our plant by using LP for the
following reasons:
1. It is difficult to express by a fixed numerical formula

110
Renewal of Setting up aimed quality index Setting up aimed quality index
coal brand quality
next bed next bed

Obtaining arriving Distributing Setting up the plan


coal delivery designated brand initialization
information
Breaking down blending ratio Adjusting blending ratio
Obtaining
coal inventory
Calculating Calculating
quality index values quality index values

Setting up aimed
coke quality Checking quality indexes

All beds OK
Bed scheduling All beds NG
completed completed
Adjusting
quality indexes
C Return j (^Completion)

Fig. 2 Planning process with COMPASS-88


Systems Configuration able workstation was been sought to run COMPASS-
88, there was an enormous difference in CPU power
In the case of an ES procedure that repeats by trial and between a main-frame and workstations that are af-
error, it is impossible to accurately forecast a resource fordable according to the balance between investment
consumption in the CPU and memory, such a proce- and assumed return. So much so that the creation of
dure sometimes being called a "MIPS eater". Any in- suitable solutions in a practical period of time was
crease of such systems in a main-frame is not desirable questionable when using a workstation.
because of the strong influence on existing systems.
Consequently, when determining the configuration of
The continuing development of low-cost, high-effi- COMPASS-88, we set up a policy to develop the pro-
ciency workstations should make it possible to run ES gram for a workstation, but in case the creation of a
on such hardware. However, at the time when a suit- suitable plan became impossible in a practical period

VALUES RESTRICTIONS
1. Best possible equalization of inter-bed quality 1. Blending ratio to be 3.5% minimum for each
differences. brand to be blended (expect for specific brands
such as oil coke).
2. Best possible equalizatioon of inter-bed vari-
ation in the blending ratio of coal brands. 2. No immediately high blending ratio for a recently
arrived brand (to be increased gradually)
3. Speed in consumption of small stockpiles of
coal. 3. Upper and lower limits to the blending ratio are
specified for some brands or their group.
4. Maintenance of stock levels of reliable quality
coals 4. Upper and lower limits are specified for variation
in the inter-bed blending ratio.
5. Use of the highest possible amount of semi-
soft coking coal.

5L
5al blending plan to assume a specified oka qi
2
Fig. 3 Values and restrictions for the coal blending plan

111
of time, we would be able to transport the program to PROCESS OF KNOWLEDGE ACQUISITION
the main-frame with the minimum manpower. We fi-
nally selected the Fujitsu G-150 workstation and Knowledge acquisition is the most important aspect of
M
ESHELL/XM expert shell. ES development, but there is no practical or effective
method of knowledge acquisition. Therefore, we ac-
quired the expertise for formulating a coal blending
Features of COMPASS-88 plan by the method that follows:
When systematizing a planning task such as coal Learning about coal and blending. An expert gave us
blending, correction according to environmental lectures on operational restrictions, supply and demand
changes is foreseeable. ES, when compared with con- problems, and control aspects regarding the physical
ventional systems, is easy regarding maintenance, al- and chemical properties of coal and blending.
though there can be no guarantee to handle a correction
rapidly. However, during the maintenance period for Observation of designing task. An expert spoke his
ES knowledge and meta knowledge, the ability to per- thought process into a tape recorder while formulating
form tasks is required. an actual blending plan, this know-how being analyzed
in the subsequent knowledge summarization stage.
To meet this requirement, COMPASS-88 has the func-
tion anywhere in the program that a designer can inter- Preparation and improvement of the knowledge speci-
vene in the planning process as follows: fications. All this know-how was summarized, docu-
(1) A blending ratio designation in which a designer mented and then refined by asking questions or point-
can give his intention prior to the crude designing by ing out any contradictions to the experts. The problems
COMPASS-88; of our own lack of knowledge and the limited explana-
(2) A function for modifying ship's arrival; tions of an expert's thought processes were gradually
(3) A function for modifying constraint values; eliminated, thus making the knowledge specifications
(4) A function for correcting an aimed quality index more detailed.
that has been set up by COMPASS-88.
Manual simulation. After assimilating this knowledge,
a manual simulation was done to confirm that a satis-
Problem-solving Mechanism factory blending plan could be formulated.
Coal quality adjustment involves something over 10 It normally takes 2 days for the experts to design a 40-
parameters including coal grade, coking properties, day blending plan, while our simulation took about a
fluidity, moisture, ash content, and volatile matter. week to design a 10-day blending plan. Thus, the ex-
perts were 56 times as productive as we were. At that
After blending the main brands, COMPASS-88 adjusts time, we suspected this was caused by the difference
the blending ratio to bring the quality index closer to its between a professional and an amateur, but we later
aim. COMPASS-88 tries to choose a pair of brands to found that insufficient knowledge had been acquired.
be increased or decreased which meets the level of a
given constraint. When it succeeds in finding such a
pair, COMPASS-88 changes the blending ratio, and Shortening the Processing Time
then recalculates the quality index. These steps are
performed until the index value falls within the aimed We aimed with COMPASS-88 to formulate a blending
range (see Fig. 4). plan in 4 hours so that at least 2 cases per day could be
handled. However, a practice run with test data using a
Above process is a depth search, using the values and prototype for the crude plan took 15 hours. Completing
restrictions shown in Fig. 2, when respectively deter- this plan would have taken more than 30 hours, which
mining the priority in expanding branches, and in cut- is too long for a practical use. Therefore, we practiced
ting off branches that have been developed. the various countermeasures shown in TABLE 1 to
reduce the processing time.

>v v > Current quality index 1) To revise the algorithm for quality index adjustment,
we needed to acquire more knowledge from the ex-
perts, who have the know-how to design a plan ef-
fectively in limited time, as well as to design a plan
GQ Brand to be > Aimed which satisfies the required quality.
X
decreased Equality
7D
C N. index 2-3) While a number of methods were conceivable for
tuning, we didn't have enough time to try all the
CO
methods. Therefore, we concentrated our efforts to
13
look for a high effect with low manpower by bury-
ing code in the user defined function (UDF) to iden-
8 Brand to be
tify areas with long processing time.
increased
The cumulative effect of the software improvements
shown in TABLE 1 would alone have cut down the
->* processing time to 1/17. However, we were also look-
Coal quality index A
ing for improvements to upgrade the quality of plan-
ning in parallel. The combined effect on the planning
Fig. 4 Concept of quality index adjustment

112
TABLE 1 Methods for reducing processing time proximate processing time needed, the knowl-
edge engineers can add additional knowledge
I I while considering the balance between proc-
No. Method Effect* Remarks essing time and the quality of the plan. Addi-
Change of quality tionally, the development schedule for ES
1 index adjusting 0.2 Avoiding combination explosivity should always include enough time for im-
algorithm proving the prototype.

2 UDF compilation 0.5


STUDY ON THE USE OF NUMERICAL
Useing RPLACD-type functions, and
DESIGN METHODS
black boards instead of objects;
3 Tuning 0.6 Reducing recursive calls, and
When observing the procedure used by the
message passing;
experts for planning, it seemed that the resolu-
post-improvement processing time tion was uncertain and unassuming. The plan-
'Effect = ning procedure also seemed to involve a process simi-
pre-improvement processing time
lar to the "simplex method" of LP. Therefore, we stud-
ied a system combining LP and ES, whereby ES evalu-
time after all these software improvements turned out ates the results of LP and alters constraint equations for
to be some 1/10 of the prototype. Consequently, we those tasks that are not expressible by LP models. LP is
met the time requirement, so that COMPASS-88 can then used to recalculate based on the new constraint
formulate a blending plan in 2 or 3 hours. equations.

Retrospective Considerations In model I of TABLE 2, resolution could not be ob-


tained in any practical time. Model II, although giving
The knowledge initially acquired for COMPASS-88 a large change in blending pattern, is free from the de-
was not enough, and we had to acquire additional ex- fects of Model I. An improvement to Model II ap-
pertise and substantially modify the planning proce- peared to offer the means to a practical system. How-
dure accordingly. Such a trial and error is unavoidable ever, for the following reasons, we abandoned the
for the following reasons: study of a system combining ES and LP:
(1) The prediction of resources consumption by ES is
difficult, and it is correspondingly difficult to de- (a) Mitigating the risk of ES was the motive for starting
sign an algorithm with a suitable balance between the study of LP models. However, the manual
processing time and planning quality; simulation by using the ES model only, which was
(2) To produce an ideal plan, the experts tend to present done along with the LP model application, gave
complicated knowledge that they would them- satisfying results, and LP turned out to be less at-
selves disregard because of the constraints on plan- tractive.
ning time;
(3) It is difficult to acquire knowledge which experts (b) The experts did not have the required know-how to
use subconsciously; in particular, expertise for re- evaluate the output of LP and to alter the constraint
ducing planning time tends to be missed, since such equation. Accordingly, a considerable degree of
knowledge cannot improve the quality of the plan. trial and error would have been necessary before
any practical system could be completed.
We found that it is preferable to start developing a
prototype as soon as the knowledge engineers had ac- (c) We could not identify a clear division between the
quired the essential expertise. After grasping the ap- LP and ES, which might cause a hindrance when

TABLE 2 Function sharing between ES and LP

^ - ^ _ Model I Model
Use ES to set up aimed quality, and then upper Use ES to set up aimed quality, upper & lower
& lower limits of designated brand blending ratios. limits of designated brand blending ratios, and
Outline
Then use LP to calculate blending ratio for all upper & lower limits of brands used in the
beds for a plan period. previous bed. Then use LP to calculate one bed .

Objective Function Minimum grade variation Minimum grade error


Number of
constraint equation about 3,200 about 200
Proper use for business demand is impracticable About 1 minute of CPU time assuming 27 beds
because 3 minutes and longer with a VP-50 for a plan period, and 2 repetitions per bed (0.01
super-computer is too long for CPU time. minutes of CPU time / bed).
Incapable of expressing upper & lower limits of Lower limit blending ratio almost resolved by
Evaluation blending ratio. proper joint use of ES and LP.
Smooth operation for each brand is difficult. The blending amount tends to veer toward either
Experts have no know-how for actions to be taken upper limit or lower limit in a variable range,
during infeasible period. making operation unsmooth.
Experts have no know-how for actions to be
taken during infeasible period.

113
Fig. 5 Effects of COMPASS-88

reviewing models against environmental changes,


even though such models are made by applying a COMPASS-88 has reduced coke quality fluctuation.
combination of LP and ES. Therefore, lowering the aimed coke strength can sat-
isfy the quality that a blast furnace requires. Thus
COMPASS-88 was thus structured by using the ES COMPASS-88 had contributed to reducing cost (see
model only. In the meantime, experts have critically Fig. 5).
evaluated the output of the LP model to draw out the
restriction conditions for the ES model, so that the
study on adopting LP was productive. CONCLUSIONS
An automatic method to formulate a coal blending plan
EVALUATION for cokemaking was developed using ES. It takes only
2 to 3 hours to output a 90-day blending plan that is as
COMPASS-88 has been in practical use since its first good as an experts' one, so that blend simulations can
blending design task in May 1989. The development of be carried out for various coke quality targets and coal
COMPASS-88 has documented and systematized the arrival schedules. This expert system has been manag-
expertise for formulating a coal blending plan that pre- ing coal blending and coke quality control at
viously existed only in the brains of experienced spe- Mizushima Works without an experts' assistance since
cialist engineers. Today, staff with absolutely no previ- May 1989.
ous experience can formulate the coal blending plan.
Each blending plan drafted by COMPASS-88 does not ACKNOWLEDGMENTS
always exceed the level of that by conventional ex-
perts. However, work that used to take 2 days has been The authors express their sincere gratitude to staff of
reduced to 2-3 hours, and the method is capable of rap- Fujitsu, and Fujitsu Okayama Engineering for their
idly responding to changes in the quality requirement extensive cooperation in the development of this Sys-
for a blast furnace and ship arrival schedules. Further- tem.
more, it has become possible to increase the span of the
planning, to shorten planning cycles, and to conduct
various case studies. The net result is that planning
accuracy has been improved in comparison with the
earlier method.

114
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

A DATA BASED EXPERT SYSTEM FOR ENGINEERING


APPLICATIONS
D.W. Ginsberg and W.J. Whiten

Julius Kruttschnitt Mineral Research Centre, Isles Road, Indooroopilly, Queensland, Australia 4068

Abstract

A novel kind of expert system has been developed for the mineral processing industry. It has
been designed to classify quantitative continuous data (such as process measurements) and
has applications in data classification, process monitoring and control. The knowledge base of
the system is based on the results of clustering a training set of data using a newly developed
clustering program, especially designed for typical engineering type data.

The system is easy to use and has a relatively fast knowledge acquisition period associated
with it. Systems with multiple process variables are easily handled in a completely
multidimensional environment. The properties and behaviour of the expert system are
illustrated using the potential to identify coloured density tracers that are being used to
determine the performance of spiral separators.

Keywords

categorisation, classification, clustering, knowledge acquisition, outlier detection, data trends,


local cluster shape, continuous data, weighted distance measure, expert systems.

INTRODUCTION

In the artificial intelligence domain, the concepts of This clustering procedure is quantitative in nature, as
classification and categorisation are distinct functions opposed to the highly qualitative nature of rule - based
which have complementary features. Classification expert systems. In addition, the clustering procedure,
methods assign objects to predefined categories. Such which corresponds to knowledge acquisition through
methods are common in artificial intelligence; one being interviews for rule - based expert systems, is significantly
rule - based expert systems. shorter in duration. Some of the features of the clustering
program include finding local trends, the categorisation of
Category formation involves clustering similar objects in data and importantly, identifying outliers.
a set of objects without a priori information on
classification (Anderberg 1973). Although having received The second part of the procedure classifies new data
less attention, Thompson and Thompson (1991) assert points by assigning them to one of the previously defined
that a theory of categorisation is central to our basic clusters. The system handles points found in regions not
picture of reality. previously defined and has the capability of dynamically
modifying its knowledge base.
A complementary two part procedure has been developed
to provide a novel expert system specifically designed for In order to appreciate the expert system, some of the
engineering applications. The system acquires knowledge features of the clustering procedure need to be described.
by categorising a training set of data, and then classifies Once the categorisation of a data set has been completed,
new data points. It thus synthesises the concepts of the classification of new data points is an extremely
categorisation and classification into one expert system. simple procedure.

A number of applications suggest that categorisation CATEGORISATION OF A TRAINING DATA SET


could play an important part in expert systems. Although
this paper uses the identification of coloured density The most difficult task in setting u p an expert system, is
tracers to illustrate the system's characteristics, other the knowledge acquisition component. The traditional
attractive applications include identifying different ore interviewing of the expert by the knowledge engineer is
suites in mineral processes, image analysis and advisory extremely time consuming. In addition, the knowledge
control. provided by the expert still has to be thoroughly checked
for consistency. The knowledge base is always limited by
The first part of the procedure categorises a training set of the extent of the expert's knowledge. A typical rule based
data points into a number of clusters. These clusters have expert system building procedure for mineral processing
a well defined shape and orientation. The clusters are is illustrated in Fig. 1 (Freeman et al 1990).
then given suitable qualitative or quantitative definitions.

115
A clustering technique, specifically designed for
EXPERT engineering applications, serves as the knowledge
acquisition component of the expert system. Some of its
k A properties are discussed in the following section.
T ^f
CHARACTERISTICS OF THE CLUSTERING
Development or Knowledge Engineer TECHNIQUE
Process Metallurgist
The basis of understanding the developed clustering
I T technique and the ensuing classification system, is the
initial assumption about cluster shape. This is based on an
; r Expert System
Shell
individual cluster's sample covariance matrix, which

i 1 defines a multidimensional ellipsoid with principal axes


in the direction of the particular matrix eigenvectors. The
relative spread of data along a particular axis is given by
the associated eigenvalue. Thus, it is possible to preserve
T local trends in the data. This geometric description can
correspond (but is not limited) to a multivariate normal
Knowledge Base distribution for each cluster.

In the case of clustering a data set of coloured density


tracers used in experiments on spirals, the specific colours
>f
are measured in terms of the three primary colour
intensities; red, green and blue. This is based on the RGB
Inference Engine
colour space (Sears and Zemansky 1957, Vaughn 1990).
The coloured tracers fall into a number of three
L dimensional ellipsoids (such ellipsoids include spheres,
pencils and plates), principally corresponding to various
ranges of intensity of a particular colour. A two
^f dimensional example of white and lime clusters is
Advice / Action / Diagnostics illustrated in Fig. 2. More complicated shapes may be
Fig. 1 Development of a rule based Expert System represented by a number of ellipsoids.
for mineral processing applications
36-

34"
Neural networks is one alternative to rule - based expert
systems (Sestito and Dillon 1989) and is an area of artificial 32-

intelligence which has received much attention in the 30-



1 a
literature. One reason for all the interest shown in this 28-
_
^^ 1
/ a
a a
White
Limel
technique, is it addresses the process of automated 26- / a a Lime 2
n D a /a a
learning and allows for data points not necessarily seen
/D
24- n Do D O n a
a a a a
previously (such as the kind that noisy signals produce). 22-
Layers of neurons are interconnected; each with a
20-
weighting function. The weighting functions are obtained
by running a training set of data through the network. 18-
16 18 20 22 24 26 28 30
Once these initial weighting functions are obtained, they Red Intensity
may vary dynamically in an attempt to accommodate the
changes in the process. This structure attempts to mimic
Fig. 2 White and lime clusters defining classification
the way information is processed by the human brain.
regions for the Expert System
In practice, it is often extremely difficult to obtain these
In order to facilitate appropriate points/clusters being
weighting factors and even the most simple of processes
agglomerated, the distance dissimilarity measure takes the
often require a large number of iterations before
shape and orientation of the cluster directly into account.
satisfactory weightings are obtained. In addition, there are
When measuring the distance from a cluster to a point,
no guarantees that a particular network can be trained at
the shape and orientation of the multidimensional
all. Cybenko (1990) is of the opinion that the design of
ellipsoid, as defined by its sample covariance matrix (a
neural networks for nontrivial engineering problems will
slightly modified sample covariance matrix is actually
not be any easier than other large scale engineering
problems. He also points to four successful applications of used), is taken into account by weighting the squared
neural networks found in the literature, where in each of Euclidean distance measure ie:
the four examples, the number of continuous parameters
required to parameterise the particular network was larger d2=(mn-x)TR-l(mn-x) (1)
than the number of data/training samples used in its
training. where m n is the centre of the cluster and x is the point
vector.
Clustering data has some obvious benefits in knowledge
acquisition. Firstly, most engineering processes have Equation 1 corresponds to scaling the squared Euclidean
some kind of data logging system and therefore have long distance in the directions of the eigenvectors of the
and detailed records of historical data; suitable to be used sample covariance matrix, with the corresponding
as a training set for the knowledge acquisition. Clustering eigenvalues. It corresponds to the Mahalanobis distance
directly addresses the problem of categorising data (Hartenberg 1975), with local cluster covariance
without a priori information. In addition, it can be a weightings being used instead of the covariance matrix for
highly quantitative procedure. the entire data set.

116
The corresponding multivariate normal distribution is When a new data point is sampled, either on line or from
given by (James 1985): a batch file, the weighted distance from each cluster to the
point is measured. This measure directly takes the shape
of each cluster into account, as described previously. The
{(2)-/2 IRI -V2)exp[-0.5(m n - x ^ R " 1 ^ - x)] (2)
smallest distance is then compared to some previously
defined threshold (typically 2 - 3 normalised units). If it is
where p is the number of point dimensions. smaller than the threshold, the point belongs to the
associated cluster and the appended advice is given or
A modified partitional leader algorithm (Hartenberg 1975) action taken.
for agglomerating points has been developed, since most
engineering applications do not require the complete
This classification function corresponds to Bayes'
dendrogram produced by hierarchical methods. An initial
classification rule for normally distributed points in a
distance threshold for agglomeration is set. As the clusters
particular cluster (James 1985). It is a quadratic
form, the points in the cluster progressively define the
discriminant function and works well under a wide range
region of the cluster. To form clusters, an initial
of conditions. Unlike the popular linear discriminant
unagglomerated point is chosen and then points within
function, clusters are not assumed to have equal
the specified distance threshold are added to form a
covariance matrices, which enhances the identification of
cluster. When a point is added, the sample covariance
local trends in the data.
matrix is updated and an improved cluster shape
definition is obtained. Since the shape of the cluster
changes as points are added, it is necessary to retest Note that in general, Bayes' rule is the optimum
unagglomerated points until no further points add to the classification rule. Even for clusters without a normal
cluster. distribution, the above form of the rule works well.

For example, the three density tracer clusters illustrated in


After one cluster is completed, another unagglomerated
Fig. 2, define three regions in the expert system.
point is chosen and the process is repeated. If no clusters
form at the current threshold distance, a new threshold is
A problem that many engineering systems have, is
calculated and the process is repeated. This is repeated
overlapping scenarios. These are very difficult to describe
until a maximum threshold is reached. Points that have
in conventional rule - based expert systems. In this expert
not formed into a cluster at the maximum threshold are
then reported as outliers. system, however, a quantitative measure to both the
corresponding clusters is available.
Unlike the popular K - Means partitional method
Situations never seen before (possibly through
(Anderberg 1973, Hartenberg 1975), no specification of the
instrumentation failure) are reported as outliers.
number of ensuing clusters is needed. Although the K -
Means method produces locally optimum clusters, initial
Processes that suffer from severe drift or changing
clustering is based on random seed points, or the manual
dynamics may be dealt with in one of the following ways:
selection of initial seed points; typically using two
dimensional projections of the data.
1. From the outset, cluster shape and orientation can
The modified leader algorithm avoids the danger of be updated continuously as points are agglomerated
having two or more seed points in the same cluster by into the specific clusters. This might lead to a
removing points available for clustering as the clusters continuously updated knowledge base,
form. It automatically determines an appropriate number corresponding to machine learning in other
of clusters, whose shape is determined by the properties of classification schemes such as neural networks.
the data, rather than the seed points.
2. A more robust method would be to simply acquire a
In the case of the coloured density tracers, 363 data points new knowledge base by clustering an updated
comprising of four colours, were categorised into 11 training set of data points. The clustering procedure
clusters (5 large clusters and 6 smaller clusters). Each has a short execution time. For instance, a set of
cluster has a defined mean and cluster threshold. Apart 1000 data points with under 10 process variables
from a single point, the data points in each cluster takes 1 - 2 minutes on a Macintosh SE/30 (16 MHz
corresponded to the same colour as the other data points and floating point coprocessor).
in the cluster. Thus, it is evident that good discrimination
between these colours is possible and the expert system Fig. 3 illustrates the structure of the proposed expert
can be installed with some degree of confidence. Nine system.
outlying points were identified.
CLUSTER VERIFICATION USING
CLASSIFICATION OF DATA CLASSIFICATION

The second part of the procedure, is to classify new data The complementary nature of the categorisation and
points into the categories formed by the clustering classification components of the system is highlighted in
program. The clusters are each given some kind of label the cluster verification procedure. In the categorisation of
or description (either quantitative or qualitative) and are a training set of data points, clusters form by
stored as the knowledge base of the expert system. agglomerating all near points. This means that some
points may be better suited to membership of another
Most engineering processes have a large number of close cluster. A simple, yet effective clustering verification
process variables and thus a large number of possibilities procedure is to run the training set of data points through
arise when trying to describe the process in a knowledge the classifier, which assigns points to the closest cluster. In
base. In addition, previously unseen situations may arise this manner, clusters may be upgraded. This closely
o w i n g to p r o c e s s drift, nonlinearities and corresponds to the mixture model clustering technique
instrumentation failure. (Mclachlan 1988) for locally optimising clusters.

117
HISTORICAL DATA
REFERENCES

L ., no 1. Anderberg, M. (1973). Cluster Analysis for Applications,


Academic Press, New York.
CLUSTERING PROGRAM
2. Bowerman, R.G. and Glover, D.E. (1988). Putting Expert
Systems Into Practice, Van Nostrand Reinhold
Company, U.S.A.

3. Cybenko, G. (1990). Complexity Theory of Neural


On/Off Line
Networks and Classification Problems, Lecture Notes
Data in Computer Science: Neural Networks EURASIP
Workshop 1990 Proceedings, Springer - Verlag, Berlin
Heidelberg, pp26 - 44.
KNOWLEDGE BASE
4. Freeman, N. , Kemp, T. and Legg, J. (1990). The
Development of an Operator Guidance System for Lead
INFERENCE ENGINE Blast Furnace Operations, Proceedings AI 90, Perth
Australia.

5. Hartigan, J. (1975). Clustering Algorithms, John Wiley


and Sons, U.S.A.
ADVICE/ACTION/DIAGNOSTICS
6. James, M. (1985). Classification Algorithms, Collins
Fig. 3 Structure of the proposed Expert System Professional and Technical Books, Great Britain.

7. Mclachlan, G.J. and Basford, K.E. (1988). Mixture


SYSTEM SUITABILITY FOR A PARTICULAR Models: Inference and Applications to Clustering,
APPLICATION Marcel Dekker, New York.

One major problem with automated systems in general, is 8. Sastito, S. and Dillon, T (1989). Using Neural Networks
choosing a suitable system for a particular application. For for the Extraction of High Level Knowledge
example, using neural networks, this might prove to be a Representations for Machine Learning, Proceedings of
time consuming, costly and frustrating endeavour. The Australian Conference on Artificial Intelligence, pp413
expert system described here has the significant advantage -428.
of being able to answer the suitability question in a
relatively short time period. If no suitable or distinct data 9. Sears, W and Zemansky, M. (1957). University Physics
trends emerge from clustering the training set's data Electricity and Magnetism, Optics and Atomic Physics,
points, it points out the inherent problems in the process Addison - Wesley, U.S.A.
(such as overlapping regions). In the categorisation of
coloured density tracers, no such overlap existed and as 10. Thompson, B and Thompson, B ( January 1991).
was mentioned previously, the expert system could be Overturning the Category Bucket, Byte, pp249 - 256.
implemented with some degree of confidence.
11. Vaughn, F ( December 1990). Color WYSIWYG Comes
CONCLUSIONS of Age, Byte, pp275 - 279.

The newly developed expert system appears to be a useful


tool for engineering applications. Its fast knowledge
acquisition time and quantitative nature gives it
significant advantages over conventional rule based
expert systems and neural networks for certain
applications.

System suitability may be assessed in a short period of


time, thus eliminating costly and time consuming initial
testing.

The system caters for previously unseen situations,


caused by, amongst other things, instrumentation failure,
nonlinearities and process drift.

The expert system has been illustrated using the


identification of coloured density tracers, where 363
tracers were categorised according to colour. Despite the
problems typically endured under less than ideal
measurement conditions, such as reflections and
shadowing, 362 tracers were verified to be correctly
categorised The expert system could thus be expected to
classify further tracers according to colour with a high
degree of confidence.

118
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

ADAPTIVE EXPERT SYSTEMS FOR METALLURGICAL


PROCESSES
E.K. Juuso and K. Leivisk

Department of Process Engineering, University ofOulu, Oulu, Finland

A b s t r a c t . Expert systems are developed for the multilayer simulation system in order to improve the appli-
cation facilities. T h e combined system contains procedures for developing simplified fuzzy models on the basis
of deterministic simulation experiments. Since these models, together with rule-based linguistic models, are
embedded in the expert systems, there are a total of five levels of simulation. The linguistic models developed
from the fuzzy models are used together with qualitative relations to define suitable meaning for each linguistic
variable.
The knowledge base of the expert system is represented by linguistic relations which can be changed into matrix
equations. The reasoning is based on these matrix equations or on the aggregated sets of linguistic relations
which are obtained by solving the equations. Only forward chaining is needed in approximate reasoning because
all the variables can be handled in the same way. The system is adaptive since the meaning of the linguistic
values depends on the working point of the process.

K e y w o r d s . Adaptive systems; expert systems; fuzzy systems; computer simulation; hierarchical systems; met-
allurgical industries.

INTRODUCTION User Interface


The simulation system provides a method for comparing the
production alternatives of metallurgical processes (Juuso and
Uronen, 1989). The deterministic system originally developed for
the Outokumpu ferroalloy smelter includes many ways to anal- Knowledge
Multilayer D a t a Base
yse chemical reactions as well as electrical and heat flows (Juuso, Base
Simulation Management
1988, 1989a, 1989b). Expert systems were developed for fer- Management
System System
roalloy processes in order to utilize linguistic information in the System
extensive, hierarchical simulation system (Juuso, 1989a, 1989c).

H
Adaptive properties were included to the expert system by re-
lating the meaning of linguistic variables to the working point of
the process (Juuso and Leivisk, 1990b). Knowledge
~ D a t a Base
Base
The multilayer structure based on a decomposition procedure
is very efficient in an iterative, gradually refining optimization
scheme. In addition to the deterministic simulation, the com- F i g u r e 1: T h e c o m b i n e d s i m u l a t i o n a n d e x p e r t s y s t e m .
bined simulation and expert system contains procedures for de-
veloping simplified fuzzy models on the basis of simulation exper-
iments. Since these models, together with linguistic models, are
embedded in the expert systems, there are a total of five levels
The development work was started on Sperry 1100 series com-
of simulation (Juuso and Leivisk, 1990a).
puters and continued on an IBM 3083 computer. Since device-
The combined simulation and expert system is a model oriented dependent p a r t s of the system are collected in certain subrou-
system which can be used as part of multilevel process control tines, the simulation system has been transferred to P D P 11
(Juuso, 1990). The whole system is in full operation during opti- series computers, VAX computers and IBM AT compatible mi-
mization. In other parts of the multilevel process control, the crocomputers with some standard changes. The present system
amount of details are decreased and the communication with is also running in several UNIX environments on H P and SUN
d a t a oriented systems, e.g. automation systems in monitoring computers. The distributed parameter models and the simulation
and control and decision support systems in management and experiments for the fuzzy linear regression require a mainframe
coordination applications becomes important. computer. The Cray X-MP supercomputer and the Alliant FX
minisupercomputer are used in more detailed calculations.
The computer modelling of the deterministic simulation system
is carried out using standard F O R T R A N subroutines and MAT- The prototype of the rule-based expert system was implemented
LAB functions in order to ensure the portability of the programs. in the GURU programming generator on microcomputers. In the
The multilayer simulation system is running in several computers present expert system based on P R O L O G language, the rules are
and there are some differences in the application fields of these replaced by linguistic relations. In the interactive work with ma-
subsystems. Usually, the simulation programs are executed sep- trix based models, the MATLAB system is used. The d a t a base
arately on each level of calculations. The common d a t a base management system based on SQL language is running on the
structure connects together the whole simulation and expert sys- IBM 3083 computer. For each subsystem, the environment is
tem (Fig. 1). The input d a t a and results are transferred between chosen on the basis of program packages supporting the applica-
the computers through the computer network. tion.

119
MULTILAYER SIMULATION Fuzzy Simulation
In the multilayer simulation system, deterministic calculation T h e deterministic simulation models are applicable in uncertain
methods ranging from known parameters and formulae to de- environments if discretized values are used instead of distributions
tailed simulation models are connected to fuzzy and linguistic (Juuso, 1989a). Each value has its own grade of membership
simulation models (Juuso, 1988, 1989a, 1989b). Actually, vari- generated on the basis of a membership function, i.e. triangular,
ables are described on four levels (Fig. 2): the fuzzy and linguis- trapezoidal or rectangular distribution (Fig. 2). In the triangular
tic methods are most useful for expert system applications. In case, the grade of membership is one only for one discretized
the linguistic subsystem, the development is continuing towards point, b u t in the trapezoidal case at least the boundary points of
diagnostical applications, therefore definitions for the linguistic the most possible area must be selected to the set of discretized
values are related to the working point of the process (Juuso and points. In the combined fuzzy and linguistic simulation, at least
Leivisk, 1990b). three discretized values must be chosen from the most possible
area in order to take into account the interactions described by
the linguistic models.
~ V Y Y \f~ Linguistic
In the method of the discretized fuzziness, the number of simu-
lation cases becomes very large when the number of uncertain
variables increases. In the fuzzy calculation level n, the num-
/ \ I I Fuzzy ber of simulation cases is (2n + j)m where j is the number of the
points selected from the most possible area, and m the number of
fuzzified parameters. In order to avoid long running times, sim-
Discretized fuzzy plified fuzzy models are used insted of the deterministic models.
In formula based models, several fuzzy parameters are combined
to a single fuzzy coefficient by discretized fuzzy calculations. For
more complicated models, fuzzy models are developed on the ba-
Deterministic sis of simulation experiments by fuzzy linear regression (Juuso,
1989a, 1989c).
C h e m i c a l r e a c t i o n s . In the simulation of chemical reactions, it
F i g u r e 2: T y p e s of t h e variables. is i m p o r t a n t to take into account the fact t h a t recoveries depend
on the working point of the process, especially on the basicity
The fuzzy approach makes the generation of the production alter- of the slag {{CaO + MgO)/Si02))- In the ferromanganese pro-
natives much easier. Discretized input values are used together duction, the manganese content of the slag, (Mn), becomes very
with nonlinear models of the electrical flow (Juuso, 1989a) and high if the basicity is low (Drrer, 1972). The recoveries for
models based on the mass and energy balances (Juuso, 1989c). manganese and silicon can be estimated by combining the model
Additional models based on uncertain process knowledge are also described above with Equation
developed on fuzzy level from qualitative relations by introducing
quantitative meanings for the variables. .((Mn)[Si\l\ CaO + MgO
ln (1)
The aggregation of the information is continued on the linguistic { [Mn] ) = -a^Ji2

level where several fuzzy models can be merged into a set of lin-
guistic relations. Again additional knowledge which in this case where [Mn] is the manganese content of the metal, (Mn) the
is based on qualitative information can be used in f he problem manganese content of the slag, and [Si] the silicon content of the
solving. In order to make this level consistent with the other lev- metal (Juuso and Leivisk, 1990b). Since there are also other
els a specific method for linguistification and delinguistification factors affecting to the recoveries, e.g. temperature and alumina
is developed. content of the slag, the parameters contain uncertainties.
Electrical flow. For the calculation of the electrical flow, there
D e t e r m i n i s t i c Simulation are several formula-based models applicable to fuzzy simulation
(Juuso, 1989a). The electrical resistance is represented by
On the feasibility level, emphasis is laid on testing ideas. A charge
calculation model generates production alternatives on the basis
R= (2)
of design constraints (Juuso, 1988). The process constraints are
taken into account in the energy balance model. The calculations i^\j) '
of heat flow are based on the requirements of the process: elec-
where s is the electrode separation, hQ the height of the tip of the
trical flow is combined with heat flow by known specific energy
electrode above the metal layer, he the electrode immersion and h
consumption and power density. Electrical resistance and aver-
the burden height ( = h0-\-he); the shape factor d2 depends on the
age electrical conductivity are approximated by formulae (Juuso,
furnace dimensions, and the connection factor 7 is assumed to be
1989a).
0.92 for a three phase furnace (Juuso, 1989a). In the uncertain
On the design level, dimensions of the furnaces are placed un- environment, the average conductivity , the height h0 and the
der consideration. A zone balance model divides the mass and shape factor d2 can have uncertain values. The average electrical
energy balance into several zone balances suitable for detailed conductivity is estimated on the basis of conductivities of the
simulation model application (Juuso, 1989a). The calculations of charge and the slag, ac and as respectively (Juuso, 1989a).
the electrical flow are based on the highest applicable resistance.
H e a t flow. A procedure based on one-dimensional cases (Juuso,
A descending burden element model corresponding to a batch
1989b) can be used in choosing the electrical parameters of the
reactor model (Juuso, 1989b) provides a link between lumped
submerged arc furnace. In the case of constant electrical conduc-
parameter and distributed parameter models.
tivity, the steady state solution for the temperature distribution
On the process study level, spatial dependence is taken into ac- of the molten zone is represented by equation
count in the detailed models of electrical and heat flow (Juuso,

=4? K) -()] <3>
1989a, 1989b). The steady state solution for the temperature
distribution is obtained in the calculations by a transient scheme
based on the odd-even hopscotch method using finite difference
approximations (Juuso, 1989b). For chemical reactions, a kinetic where J ^ e is the current density at the tip of the electrode and
model of simultaneous reactions is employed. Finally, the pro- Rs the radius of the tip sphere (Juuso, 1989b). Here the effective
cess development applications include a calculation method for thermal conductivity, AC, which includes the effects of conduction,
the chemical reaction equilibrium (Juuso, 1988). radiation and convection, is also assumed to be constant.

120
U n c e r t a i n t y c a l c u l i . The grades of membership for results Y* 1 1
are determined by the methods used to combine joint member- slag-cond(as , basicity i temperature)
ship functions corresponding to the fuzzy input d a t a Zj,j = I
1...M. This case is similar to the conditional statement conductivity ( 1
,as,ac,he,I)

*/ f ) Z] then Y* (4)
I
resistance(R, , he)
i=i

Since almost similar results can be obtained from several simula-


tion cases, the results are classified and the grades of membership voltage^^J.R)
for the classified values are combined by methods developed for \ I
conformative membership function. An important feature of the useful .power {P^ 1
,I,U)
method is t h a t the number of classes depends on the fuzzy cal-
culation level n (Juuso, 1989a).
In the case of non-interactive variables, the minimum method F i g u r e 3: Linguistic s i m u l a t i o n .
(intersection) is used for the joint computation and the maxi-
mum method (union) for the conformative calculations. A nega-
tive correlation between variables can be taken into account by a Electrode position
bounded sum (Lee, 1990b) which is also called Lukasiewicz O R
(Buckley and Siler, 1988): VH H N L VL

LOR(a,) = a = min(a + , 1), (5)


r VH |~N~] [T~| [VL] [VL] [VL] VL
where a and are the grades of membership for two cases under o R
consideration. For more complicated interactions, the fuzzy and n
linguistic simulation methods are combined.
d H [v^l [VLJ L S
u
c
t N [VH] [VL] N t
Linguistic Simulation i
The linguistic simulation is based on linguistic relations, i.e. in
V
i L [VH] [VH] H j
the conditional statement 4, the fuzzy vectors Z* and Y* are t
replaced by linguistic vectors. In the previous rule-based system,
y VL [VH] [VH] [VH] VH
there was clear distinction between input and o u t p u t variables
(Juuso and Leivisk, 1990b). In the present system, the linguistic VL L N H VH
rules used in previous systems are replaced by linguistic relations
and linguistic equations. Electrode immersion
The linguistic model part of the knowledge base consists of sev-
eral facts relating n linguistic variables. If n 1 variables are F i g u r e 4: T h e linguistic m o d e l of t h e electrical r e s i s t a n c e .
known, the remaining variable can be solved from the set of re-
lations which can be considered as a linguistic equation (Juuso,
1991). Usually, the number of variables in these sets ranges be- L i n g u i s t i c e q u a t i o n s . With careful selection of the variables,
tween 3 and 5. For expert system applications, a set of facts can the set of linguistic relations can be changed into a very sim-
be generated by solving linguistic equations. ple mathematical equation, e.g. the set of relations presented in
Figure 4 can be represented by
L i n g u i s t i c r e l a t i o n s . The linguistic process model is described
by groups of linguistic relations: each group can be based on a
single fuzzy model (Eq. 2), or several fuzzy equations can be ag-
gregated into a single group of linguistic relations. The variables
*> = (6)

of the relations are chosen in such a way t h a t the directions of


where Xj,j = l...ra, are the levels of the linguistic values for the
the changes are balanced, e.g. the first relation in Figure 3 is
variables which are resistance, electrode immersion and conduc-
based on the fact t h a t the increase of the resistivity of the slag,
tivity (m = 3). Here the values veryJow, low, normal, high,
as~l, with increasing basicity can be compensated by increasing
and veryJiigh are replaced by numbers -2, - 1 , 0, 1 and 2. This
temperature. This selection procedure is not necessary for the
model can also be generalized for finer fuzzy partitions.
application of the linguistic relations.
Several sets of linguistic relations can be combined in matrix
The electrical resistance decreases with increasing immersion
presentation, e.g. the linguistic models shown in Figure 3 are
depth, he, and increasing conductivity, . Equation 2 can be
represented by equation
represented by relations resistance(x,y,z) where x, y and z are
the linguistic values for the resistance, R, the conductivity, , and AX = 0, (7)
the electrode immersion, he, respectively. Each relation describes
which linguistic values belong together, e.g. where A is a matrix

resistance(normal, normal, normal) / -1 1 0 - 1 0 0 0 0 0 \


resistance(low, high, normal) 0 0 -1 1 1 1 0 1 0 0
0 0 1 0 0 1 1 0 0 0
This model consists of 25 linguistic relations (Figure 4) if each 0 0 0 0 0 0 1 1 -1 0
variable has five linguistic values: veryJow, low, normal, high, 0 0 0 0 0 0 0 1 0 -i )
and very .high.
if the vector X contains the linguistic variables basicity,
The linguistic model described in Figure 4 is refined for actual ap-
temperature, , $, ac, he, R, I, U, and P/v
plications by taking into account the effects of the burden height,
h, and the shape factor, d2. For design applications, a more de- In order to solve the problem, five of these variables should be
tailed model including effects of the electrode separation, s, is known or variated. Due t o nearly singular matrices, some of
employed. The relation sets are also expanded and updated by these combinations cannot be used. However, only the integer
simulation experiments based on the fuzzy simulation (Juuso and solutions are required, and exactly the same set of solutions is
Leivisk, 1990b). obtained by any combination (Juuso, 1991).

121
The result is an aggregated set of those linguistic relations which C o m b i n e d F u z z y and Linguistic S i m u l a t i o n
are relevant if the process constraints described by the complete
In the applications, fuzzy numbers are values of fuzzy variables,
set of linguistic equations are taken into account. As non-integer
and there are additional associations between these variables.
solutions correspond to the solutions in finer fuzzy partition, they
The models based on detailed calculations can be combined with
can be excluded. The matrix method is implemented in the
linguistic relations based on qualitative models, e.g. the increas-
MATLAB system, and the resulting set of linguistic relations
ing viscosity of the slag with increasing basicity can be compen-
can be used either as a list of facts for a P R O L O G program or
sated by using higher temperatures, but the high temperature
as a database of alternatives for a FORTRAN program.
causes additional vaporization of the manganese, and simultane-
In this methodology, the linguistic relations are developed grad- ously the increasing amount of the charge to be melted increases
ually: only for a small p a r t of the problem is taken into account specific energy consumption.
at a time. By the m a t r i x method (Eq. 7) it is very easy t o build
I n t e r a c t i o n s b e t w e e n s i m u l a t i o n m o d e l s . Interactions be-
large systems for different applications. The problem solving,
tween the simulation models of the chemical reactions, electri-
i.e. simulation or diagnostical reasoning, is based on integer ma-
cal flow and heat flow are very important, e.g. the conductivity
trix manipulations. The linguistification methods based on fuzzy
as depends on the temperature and the composition in the slag
simulation have a vital importance in connecting this methodol-
ogy to the real practice. layer. Therefore, it is necessary to include at least some features
from every subsystem to the calculations. In any subsystem,
U n c e r t a i n t y c a l c u l i . The handling of uncertainties is an es- this methodology makes it possible to avoid studying alterna-
sential part of the linguistic simulation. The membership func- tives which are irrelevant in another subsystem.
tions for results are calculated from the membership functions of
M e a n i n g of l i n g u i s t i c v a l u e s . The linguistic relations can be
the known variables by the joint computation method described
developed on the basis of the fuzzy models, e.g. curves with
above. The minimum method which weights the overlap as much
one independent variable can be represented on linguistic level
as possible (Buckley and Siler, 1988) suits very well into these
by dividing one of the coordinate axis into linguistic values while
applications where smoothly changing results are prefered. Var-
the corresponding linguistic division for the other coordinate axis
ious families of T-norms (Bonissone and Decker, 1986) are not
is denned by the fuzzy model (Juuso and Leivisk, 1990b). This
useful in this application since they all produce smaller grades of
procedure produces a set of linguistic relations for two variables:
membership t h a n the min operator which is the upper bound of
other variables of the formula-based model are assumed to be on
T-norms.
normal level.
The maximum method used for confirmative computation in the
For diagnostical purposes, several variables should be aggregated
discretized fuzzy calculations discounts the overlap between the
into a single set of linguistic relations. The procedure described
known variables (Buckley and Siler, 1988). This was a favourable
above could be extended for several variables by introducing com-
property in the discretized fuzzy simulation, but here it distroys
pensation effects into the relations. However, the number of simu-
the smoothness of the results. Therefore, it is better to apply
lation cases increases quite fast with increasing number of vari-
the bounded sum (Eq. 5) which weights the overlap as much as
ables if additional variables are included to the set of relations
possible. The consistency with the fuzzy simulation can still be
variable by variable. If there are five linguistic values for each
improved by using the simultaneous support of the neighbouring
variable, the meaning of five linguistic values must be solved on
values n 1 and n -f- 1 as an additional evidence for the value n
each level.
(Fig. 5 and 6).
It is much easier to develop the meanings for the other variables
by the combined fuzzy and linguistic simulation. As the most
possible ranges of different linguistic values are not allowed t o
overlap, the definition of the linguistic values of the main variable
must contain sufficient space between the most possible areas.
The consistency of the linguistic and deterministic models in the
multilayer simulation system are automatically checked by these
interactive fuzzy calculations (Juuso, 1991).

User Interface
In the ferromanganese model, parts of the model, i.e. d a t a modi-
fication, simulation calculations and printing out reports, can be
Electrode immersion chosen independently (Juuso and Leivisk, 1990a). In modifica-
tion of deterministic input data, also programs for calculation,
F i g u r e 5: M e m b e r s h i p values of n o r m a l electrical r e s i s t a n c e checking and printing out are used in addition to the commu-
in t h e t r a p e z o i d a l case. nication programs. The input d a t a is preprocessed in order to
make the actual simulation calculations more efficient.
In the fuzzy simulation, fuzzified values are generated on the ba-
sis of checked input d a t a which has already been accepted by
the simulation system. In the present system, the fuzzy calcu-
lation level n is the same for every fuzzy parameter and is re-
duced automatically if the number of simulation cases exceeds
the maximum value defined by the user. The same checking fa-
cilities which are used in the deterministic simulation are also
active in the case of fuzzy simulation (Juuso, 1989a, 1989c).
The linguistic d a t a is stored in relational databases and it is used
in knowledge-based systems (Juuso and Leivisk, 1990b). The
" -1 -0.5 0 0.5 1 user interface contains access to the lingustic facts through the
d a t a base management system. Since deterministic and fuzzy
Electrode immersion
input values are preprocessed, input d a t a can be given in any
form shown in Figure 2. Also probability based measurement
F i g u r e 6: M e m b e r s h i p values of n o r m a l electrical r e s i s t a n c e d a t a can be used if the mean value and standard deviation are
in t h e t r i a n g u l a r case. known.

122
ADAPTIVE EXPERT SYSTEMS A fuzzy partition determines how many terms should exist in the
In the expert system, domain-specific knowledge bases are cou- term set. For diagnostical purposes, coarse partition with five
pled with the simulation system, and the evaluation of alterna- terms is quite adequate. In order to use the same methodology
tives is based on fuzzy and linguistic simulation. Approximate in control applications, a finer fuzzy partition is employed. If
reasoning makes it possible to use linguistic quantifications with each set of linguistic relations is represented in the equation form
more precise d a t a . The formula based models are used as rela- (Eq. 6), the expansion of the system is very easy. Additional
tions between different parameters and variables. Depending on definitions for linguistic values can be obtained by interpolation.
the known facts, these relations can be utilized in several ways. Actually, a gradually refining set of discretized values is used in
The relations are tuned by adjusting the meanings of the linguis- the combined simulation and expert system. In this kind of mod-
tic values on the basis of the working point conditions. els, relative limits are very useful for applications. The meaning
of the linguistic values depends strongly on the working point.

Probabilistic Deterministic Fuzzy Linguistic E m b e d d e d Simulation


1 L
J 1 The linguistic models are essential parts of the knowledge base
Fuzzification and they are used on two levels: the fine partition is employed
in the area of low, medium and high values. In the case of very
t J low and very high values, the working point of the process is
Linguistification changed considerably and a recalculation with the fuzzy models
was required in previous system (Juuso and Leivisk, 1990b). In
j 1 1 addition to the recalculation, some rule sets were replaced if the
application area of the expert system was wide.
KB Decision Making Logic
In the present system, the linguistic models are mostly integrated
I 1 1 to the m a t r i x equation (Eq. 7), and only the meaning of linguistic
Delinguistification values depends on the working point. The parameters defining
the meaning can be stored into databases. The alternative work-
I J 1 i ing points is based on the fine fuzzy partition, but the meanings
of linguistic variables are changed only if the working point is
Defuzzification moved out from the normal area (coarse partition). Fuzzy mod-
els can also be embedded within the rules by linguistification and
1
Probabilistic
J
Deterministic
J
Fuzzy Linguistic
delinguistification.

Approximate Reasoning
F i g u r e 7: Configuration of t h e a d a p t i v e e x p e r t s y s t e m .
The approximate reasoning in both fuzzy and linguistic simula-
tion was based on fuzzy logic. In the previous system (Juuso
and Leivisk, 1990b), the certainty factor algebra of the GURU
Fuzzification Interface
programming generator was applied to both fuzzy and linguistic
Fuzzy and probabilistic d a t a are used together because uncer- simulation. The joint certainty computation can use the mini-
tainty appears under several forms: some observations measur- mum, product, average or balance method. The confirmative
able in a statistical sense are therefore captured by probabilities computation can use the maximum, probability sum, average
while others require both probabilistic and possibilistic meth- or balance method. For linguistic simulation, the bounded sum
ods for characterization. Probability theory and fuzzy set theory method seems to be very useful in confirmative computation.
are not rival approaches, but rather complementary ones (Dubois
In the present system, the fuzzy logic is also applied to diagnos-
and Prade, 1989). In the fuzzification stage, the probabilistic d a t a
tical and control purposes. Linguistic simulation models, diag-
based on measurements is converted into fuzzy numbers which are
nostical reasoning models, and control rules are represented by
easier to manipulate t h a n random numbers. In this way, com-
linguistic relations can be used in any direction. In the design
putational efficiency is increased (Lee, 1990a). The fuzzification
applications, the values of the basicity are used in estimating
function is an isosceles triangle defined by the mean value and
the recoveries. In the monitoring and control applications, the
the standard deviation of the d a t a set.
analysis d a t a is used in estimating the state variables. Only data-
Fuzzy numbers can be generated from deterministic values by driven forward chaining is needed since all the variables in the
defining upper and lower limits in addition to the most possi- set of linguistic relations can be handled in the same way.
ble value used in deterministic calculations (Juuso, 1989a). The
resulting membership function is triangular, but not necessarily
Delinguistification Interface
isosceles triangular. Also trapezoidal and rectangular member-
ship function can be used if the most possible value is on a certain Delinguistification is a mapping from linguistic variables to fuzzy
range (Fig. 2). In the construction of fuzzy values, interaction variables. The method is quite straigthforward: each linguistic
between variables must be taken into account. Therefore, fuzzi- value corresponds to a trapezoidal fuzzy number, (a/6, c\d). Usu-
fication is applied to d a t a groups: every d a t a value in the chosen ally, the meaning of each linguistic value is already required in
d a t a group is fuzzified so t h a t each case is acceptable (Juuso, the linguistification stage and can be found from the database.
1989a, 1989c). However, the reasoning can also be based linguistic input values
(Fig. 7). In this case, the results can be represented on fuzzy
level only if the linguistic to fuzzy mapping is available.
Linguistification Interface
The linguistification inferface has the central role in adapting the
expert system into the working point conditions. The linguistic Defuzzification Interface
values for the variables are obtained by comparing their deter- A defuzzification inferface is aimed at control purposes where a
ministic or fuzzy values to linguistic classes of the variable. Fuzzy non-fuzzy control action is required. At the present, the com-
variables can be represented either by trapezoidal fuzzy numbers, monly used strategies are the max criterion, the mean of maxi-
(a/6, c\d), or by discretized versions of them. A fuzzy variable m u m and, and the center of area (Lee, 1990b). All these methods
can have several linguistic values, each with its own certainty fac- are available in our system. Usually, the center of area method
tor. Therefore, trapezoidal (and rectangular) fuzzy numbers as which generates the center of gravity of the possibility distribu-
input variables may produce quite fuzzy results. tion of a control action.

123
Knowledge Base Management System CONCLUSIONS
In the prototype of the rule-based expert system implemented T h e adaptive expert system merges symbolic and non-symbolic
in the GURU programming generator, the consultation environ- concepts in a hybrid environment. Usually, the symbolic work is
ment was modified by adjusting the settings of the environment changed into numerical form which means t h a t the reasoning is
variables (Juuso, 1990b). In the present system, the rules are re- actually done by solving matrix equations. This method based
placed by linguistic relations presented by P R O L O G predicates on careful selection of variables has improved the system in many
and facts. The P R O L O G system contains expressions for in- ways. T h e relations are tuned by adjusting the meanings of the
put, constructing alternatives, certainty factor calculation, simu- linguistic variables. The calculation is based on the fuzzy and
lation, and result manipulation. deterministic models.
The d a t a base management system and the knowledge base man-
agement system are closely connected: expert systems can be REFERENCES
built on the basis of SQL commands. The linguistic rules are
specified on the basis of the shape of the relations. If the work-
ing point of the process is changed, the set of relations remains in Bonissone, P . P., a n d K. S. Decker (1986). A framework for
the same form as it was earlier, only the meaning of the linguistic comparing uncertain inference systems to probability. In L. N.
values is changed. For this kind of applications, the embedded Kanal and J. F. Lemmer (Eds.), Uncertainty in Artificial Intel-
SQL commands in the FORTRAN code is a very efficient so- ligence, North-Holland, Amsterdam, pp. 217-247.
lution. The matrix representation has made these applications Buckley, J . J., a n d W . Siler (1988). Fuzzy numbers for expert
very fast. systems. In M. M. Gupta and T. Yamakawa (Eds.), Fuzzy
Logic in Knowledge-Based Systems, Decision and Control, North-
Holland, Amsterdam, pp. 153-172.
MULTILEVEL CONTROL
D u b o i s , D . , a n d H. P r a d e (1989). Fuzzy sets, probability and
The combined simulation and expert system is in full operation measurement. European Journal of Operational Research, 40,
for optimization applications (Juuso, 1990). In other p a r t s of the 135-154.
multilevel process control, simplified fuzzy and linguistic mod-
Drrer, R. (1972). Metallurgie der Ferrolegierungen, G. Volkert
els generated on the optimization stage are combined with other
and K.-D. Frank (Eds.), Springer, Berlin.
models containing uncertainties. A number of reasoning systems
cooperating via common knowledge base are needed for the mul- J u u s o , E. K. (1988). Multilayer simulation of chemical reactions in
tilevel process control. metallurgical processes. In C. C. Barnett and W. M. Holmes
(Eds.), Proceedings of the Summer Computer Simulation Con-
ference 1988, SCS International, San Diego, USA, pp. 330-335.
Optimization
J u u s o , E. K. (1989a). Expert systems and the fuzzy simulation of
The simulation system is designed for off-line optimization of pro- electrical flow. In H. Jaakkola and S. Linnainmaa (Eds.), Scan-
duction alternatives by using deterministic steady state models dinavian Conference on Artificial Intelligence - 89, Proceedings
(Juuso and Uronen, 1989; Juuso 1990). During the optimization of the SCAI'89, IOS, Amsterdam, pp. 581-595.
process the amount of alternatives are reduced and more refined
J u u s o , E. K. (1989b). Multilayer simulation of heat flow in a sub-
methods based on the distributed parameter models can be used. merged arc furnace used in the production of ferroalloys. In M.
The results of simplified calculations are used as initial estimates Amouroux and A. El Jai (Eds.), Control of Distributed Parame-
for parameters used on detailed levels. On the other hand, the ter Systems 1989, Selected Papers from the 5th IF AC Symposium,
results of detailed calculations refine estimated input d a t a used IFAC Symposia Series, number 3, Pergamon, Oxford, 1990, pp.
other levels. The adjustment of the meanings of the linguistic 459-464.
values is an essential part of the optimization stage. J u u s o , E. K. (1989c). An expert system in fuzzy simulation of met-
allurgical processes. In G. X. Ritter (Ed.), Information Pro-
Monitoring and Control cessing 89, Proceedings of the 11th World Computer Congress,
North-Holland, Amsterdam, pp. 1051-1056.
The goal of the development work is a knowledge-based advisor
system which has close connections with the automation systems. J u u s o , E. K., a n d P . U r o n e n (1989). Hierarchical simulation of
ferroalloy processes. In V. Koppel (Ed.), Automation in Min-
On the coarse fuzzy partition level, the adaptive expert system
ing, Mineral and Metal Processing 1989, Selected Papers from
is applicable to diagnostical purposes. If some variables become the 6th IFAC Symposium, IFAC Symposia Series, number 10,
high or low, the fault diagnosis system activates the fuzzy control Pergamon, Oxford, 1990, pp. 227-232.
system. If some variables become very high or very low, the work-
J u u s o , E. K. (1990). Multilayer simulation and expert systems in
ing point has changed beyond the area where the fuzzy control
multilevel process control. In V. Utkin and U. Jaaksoo (Eds.),
system can take care of the process. The fuzzy control system is
Automatic Control in the Service of Mankind, Preprints of the
used on the finer fuzzy partition level. 11th IFAC World Congress, IFAC, pp. 65-70.
The multilayer simulation system based on steady state mod- J u u s o , E. K., a n d K. Leivisk (1990a). An intelligent user inter-
elling provides an aggregated overview which is very valuable in face for combined simulation and expert system. In Proceedings
control system design for the stabilizing control. The adaptive ex- of the 1990 European Simulation Multiconference, SCS Interna-
pert system is well fitted in the functional concepts presented by tional, Ghent, Belgium, pp. 300-305.
Lehman (Linkens, 1990): it helps users to construct and verify J u u s o , E. K., a n d K. Leivisk (1990b). Expert systems com-
dynamical models. Combining the fuzzy approach with statis- bined with a multilayer simulation system. In Proceedings of the
tical process control (spc) could be an appropriate way to use IMACS - IFAC International Symposium on Mathematical and
simulation methods in the quality control. Intelligent Models in System Simulation, IMACS, pp. V.B.4:1-6.
J u u s o , E. K. (1991). A linguistic equation framework for generat-
Management and Coordination ing adaptive expert system. To appear.
The aggregated overview of the process operation is also very use- Lee, C. C. (1990a). Fuzzy logic in control systems: fuzzy logic con-
ful on the management level where a whole system of processes troller - Part I. IEEE Transactions on Systems, Man, and Cy-
are organised for planning the market, the raw material alterna- bernetics, 20, 404-418.
tives and the personnel. On the coordinationlevel, fuzzy methods Lee, C. C. (1990b). Fuzzy logic in control systems: fuzzy logic con-
corresponding to the fuzzy control described above could be ap- troller - Part II. IEEE Transactions on Systems, Man, and Cy-
plied to the mutual adjustment of several coupled processes. The bernetics, 20, 419-435.
rapid development in both process control systems and in man- Linkens, D . A. (1990). AI in control systems engineering. The
agement type systems provides new opportunities for plant-wide Knowledge Engineering Review, 5, 181-214.
automation.

124
Copyright EFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

KNOWLEDGE BASED SIMULATION AND


IDENTIFICATION OF METALLURGICAL REACTORS
M.A. Reuter* and J.S.J. van Deventer**

^Institut fr Metallhttenwesen and Elektrometallurgie, RWTH Aachen, Intzestrae 3,5100 Aachen, Germany
**Department of Metallurgical Engineering, University of Stellenbosch, 7600 Stellenbosch, South Africa

Abstract. This paper illustrates the application of a knowledge based system to


perform fault-diagnosis and dynamic simulation of batch and continuous metallurgical
and mineral processing systems. The proposed approach is essentially useful to
simulate ill-defined dynamic processes where existing fundamental and empirical models
fail owing to their lack of generality. The basis of the model is a set {k[C(t)] n , C(t n ),
t n } which is derived directly from kinetic data or concentration time data of a system of
continuous stages in series and its associated pivot knowledge. The generalized first
order kinetic model is defined by use of the above set and linked to a knowledge base
via adjustment objects containing a (rate) and (final recovery) adjustment factors.
This approach permits the dynamic simulation of a variety of metallurgical and mineral
processing systems at any process conditions catered for by the knowledge base. The
same set may be used to establish deviations from the pivot conditions quantified as a
and values, which hence permits the establishment of process conditions within the
metallurgical reactor. In contrast with most existing models, no curve-fitting is
required, as the kinetic model utilises experimental data directly, so that no fitting
variances are associated with the data. Also, adjustment factors are defined in an
intelligent way in the knowledge-based system so as to relate the operating conditions
to the kinetics of the process under consideration.

Keywords. Kinetics; batch and continuous simulation; reactor identification; knowledge


based; neural networks.

INTRODUCTION conditions, making a generalized characterization of


the process via these inexact parameters even more
Schumann (1942) explained that the kinetic difficult. It is clear from the above points that a
behaviour of minerals during flotation will in many generalized simulation and diagnostic procedure
cases supply worthwhile information regarding the cannot be defined on the basis of these diverse
process and hence the effect of the various and inexact models.
process conditions. He also demonstrated that
changes of process conditions within flotation cells It is hence the objective of this paper to develop a
have a measurable effect on the flotation response. generalized kinetic model with the following
These observations based on the kinetics of the characteristics: (i) it must fit the data exactly (no
process form the basis of the simulation and model variances), irrespective of the process type, (ii)
based diagnostic procedure discussed in this the parameters of the model must be a function of
paper. all the possible process conditions (e.g. via
objects) catered for by the knowledge base, (i)
Numerous models exist to describe the kinetics of the proposed model and its accompanying
leaching, flotation, reduction and other processes knowledge base must be simple (<2 parameters)
(Sohn and Wadsworth, 1979; Dowling, Klimpel and and must have an analytic solution for batch as
Apian, 1986). Taking flotation as an example, well as continuous reactor systems, and (iv) it
Dowling, Klimpel and Apian (1986) could show that must facilitate diagnosis of the process conditions
the kinetic flotation model based on a first order within the respective reactors via its one or two
process with a rectangular distribution was parameter(s).
superior to thirteen other models. However, by
fitting this model to the given data, the fitting Various knowledge based systems have been
procedure produced a fitted final recovery of developed to perform simulation and diagnostic
>100%. Other flotation models performed badly due activities in chemical and metallurgical systems
to large confidence limits on the fitted parameters. (e.g. Stephanopolous and others, 1987; Fusillo and
Models for leaching, precipitation, reduction and Powers, 1988; Hoskins and Himmelblau, 1988;
adsorption would exhibit a similar behaviour to Tzounas and others, 1988; Tucker and Lewis,
that of the flotation models when fitted to kinetic 1988). Various architectures have been applied in
data, unless, however, the kinetic model is an the above-mentioned and various other examples
exact representation of the process being not mentioned here. These include object
considered. This is usually not the case owing to orientated (e.g. Venkatasubramanian and Rich,
some mixed kinetic mechanism underlying the 1988), frame orientated (e.g. Beltramani and
process. Motard, 1988), goal-tree-success-tree (e.g.
Birkey, McAvoy and Modares, 1988) and other
The large number of models that are available for architectures. The dynamic simulation of ill-
defining the kinetics of the above-mentioned defined problems by the use of these techniques
processes contain a varying number of parameters. has, however, found little application in the
When fitted to kinetic data, these parameters are literature. Dalle Molle, Kuipers and Edgar (1988),
determined only within a certain confidence limit and Kuipers (1984) proposed the use qualitative
by fitting procedures. Furthermore, each simulation for this purpose.
parameter usually lumps various process

125
Reuter and Van Deventer (1991a, 1991b) hence
demonstrated how a knowledge based system (KBS)
can be combined with a system of dynamic dC(t)/dt = -k[C(t)]n-C(t) (4)
equations to simulate leaching, pyrometallurgical
reduction, resin adsorption, carbon-in-pulp and where k[C(t)]n holds for the interval
c a r b o n - i n - l e a c h s y s t e m s . The methodology applied C(tn+1)<C(t)<C(tn). The p r o c e s s conditions for
is based on i n d e p e n d e n t adjustment objects in an which t h i s s e t ~{k[C(t)] n , C ( t n ) , t n } of r a t e s i s
hierarchical network u s i n g backward chaining for determined, a r e termed t h e pivot conditions and it
i t s solution a s for e . g . in EMYCIN (Shortliffe a n d i s postulated t h a t t h i s s e t remains i n d e p e n d e n t of
B u c h a n a n , 1975). No diagnostic system h a s really t h e p r o c e s s conditions, i . e . t h e kinetic mechanism
attempted to a d d r e s s diagnostics in metallurgical does not c h a n g e . This definition permits t h e
r e a c t o r s on a model b a s i s t h a t includes all possible simulation of p r o c e s s e s in which t h e kinetic data
process conditions within a particular unit a r e not a monotonic decreasing function in time
operation via i t s p a r a m e t e r s . Model based i . e . it could be S-shaped for example. An
diagnosis, a s it i s s u g g e s t e d h e r e , h a s been example is the induction period in the
r e p o r t e d for t h e diagnosis of electronic c i r c u i t s concentration r e t e n t i o n time data for t h e T e n n e s s e e
(Davis, 1984). copper r o u g h e r circuit (Dowling, Klimpel and
Apian, 1986), which cannot really be defined b y
In c o n t r a s t to t h e p r e v i o u s p a p e r s b y R e u t e r a n d t h e available theoretical models. It is exactly t h i s
Van Deventer (1991a, 1991b), t h i s p a p e r will induction period which could h a v e a decisive
demonstrate how a simple analytic model may be influence d u r i n g d i a g n o s i s .
applied to simulate a n d diagnose b a t c h and
continuous metallurgical r e a c t o r s b y applying a Equation 4 h a s an analytic solution for t h e given
suitable knowledge b a s e . T h e incorporation of a range:
n e u r a l network in t h i s a p p r o a c h is also d i s c u s s e d .
C(t)=C(tn)-exP{-k[C(t)]n-(t-tn)} (5)

To summarize: t h e s e t { k [ C ( t ) ] n , C ( t n ) , t n } is t h e
KNOWLEDGE BASED MODEL b a s i s of t h e model from which all o t h e r models a r e
derived.
The most simple kinetic e x p r e s s i o n is a first o r d e r
kinetic model. Due to i t s two possible p a r a m e t e r s
k and t h e final r e c o v e r y R it would not b e capable
of d e s c r i b i n g many p r o c e s s e s v e r y a c c u r a t e l y . Batch Model
However, if k is not a c o n s t a n t b u t a variable
e . g . k ( C ) , it h a s been shown ( R e u t e r and Van Any deviation from t h e pivot conditions i s reflected
D e v e n t e r , 1991a&b) t h a t t h i s kinetic e x p r e s s i o n b y a and v a l u e s , which a r e derived from
can d e s c r i b e a v a r i e t y of metallurgical p r o c e s s e s . a p p r o p r i a t e i n d e p e n d e n t adjustment objects or
adjustment object g r o u p s ( d e p e n d e n t o b j e c t s ) . In
dC(t)/dt=-k[C(t)]-C(t) (1) this instance eq. 4 is rewritten a s :
This definition implies t h a t if t h e p r o c e s s is f i r s t dC(t)
o r d e r in n a t u r e f(C) will be equal to k a n d for a -a-k[C(t)]n-[C(t)-C0-(l-)] (6)
second o r d e r p r o c e s s f(C)=k*C, a linear function dt
of c o n c e n t r a t i o n . If t h e p r o c e s s is c h a r a c t e r i z e d
b y mixed k i n e t i c s , f(C) i s a complex function of which holds for
C. This a p p r o a c h would s u g g e s t t h a t k=f(C) i s
reaction specific and a "finger-print" of a C0-3-[C0-C(tn+1)]<C(t)<C0--[C0-C(tn)]
p a r t i c u l a r reaction, a n d could hence be used a s a
b a s i s for simulation and diagnostic a c t i v i t i e s . Note This e x p r e s s i o n also h a s an analytic solution,
t h a t t h i s model is essentially a o n e - p a r a m e t e r which holds for t h e validity r a n g e .
model, as k[C(t)] also contains information
r e g a r d i n g t h e final r e c o v e r y i . e . where k [ C ( t ) J T h e d i s c u s s e d b a t c h approach may also be applied
becomes z e r o . I t i s h e n c e essential t h a t k [ C ( t ) ] to continuous r e a c t o r s if c o n c e n t r a t i o n - r e t e n t i o n
must cover t h e p r o c e s s o v e r t h e full e x t e n t of t h e time data for a continuous bank a r e applied to
reaction. simulate t h e s e r e a c t o r s , a s was done b y Dowling,
Klimpel and Apian (1986). This implies t h a t t h e
As t h e a p p r o a c h d i s c u s s e d h e r e a t t e m p t s to define set {k[C(t)]n, C(tn), t n } is calculated from
a methodology not based on s t a t i s t i c s , the c o n c e n t r a t i o n - r e t e n t i o n time data for t h e b a n k .
definition of k [ C ( t ) ] should not be b a s e d on a
function of v a r i o u s p a r a m e t e r s to be f i t t e d . In
view of t h i s , k [ C ( t ) ] was defined a s a d i s c r e t e
function of C ( t ) , which i s a d i s c r e t e s e t of Continuous Model
a v e r a g e values for k [ C ( t ) ] a n d t h e i r associated
validity r a n g e s , calculated directly from the T h i s may b e defined b y a p p r o p r i a t e differential
practical kinetic data via e q . 1 in a discretized e q u a t i o n s for a microscopic fluid ( R e u t e r and Van
form: D e v e n t e r , 1991a&b) or b y t h e following expression
for a macroscopic fluid, which defines t h e a v e r a g e
- <5C(t) exit concentration for continuous r e a c t o r s :
k[C(t)] = (2) l
n+l
fit'C(t)ai C=ECr E ( t ) - e x p { - k [ C ( t ) ] n - ( t - t n ) } - d t (7)
tn
C(t)a is t a k e n to be t h e arithmetic a v e r a g e
between two s u c c e s s i v e data points s e p a r a t e d b y This equation may be solved analytically for simple
t h e time i n t e r v a l 6 t . C ( t ) a v could also be t a k e n E ( t ) f u n c t i o n s , by a Gaussian q u a d r a t u r e ( P r e s s
a s t h e geometric mean, b u t most examples h a v e , a n d o t h e r s , 1989) or b y E u l e r ' s method:
however, been done b y applying e q . 3 . Hence,
for d i s c r e t e data p o i n t s , e q . 2 becomes for t h e (8)
arithmetic mean (no c u r v e f i t t i n g ) :

-2-[C(tn+1)-C(tn)] C ( t n ) is calculated via e q . 5 or 6. In t h e s e


(3) e q u a t i o n s E ( t ) r e p r e s e n t s an a r b i t r a r y retention
[C(tn+1)+C(tn)]-t time d i s t r i b u t i o n ( R T D ) , which may be a function

126
of dead volume, fraction plug a n d / o r mixed flow, plant data may be compared to t h e R p i v o t and
number of reactors, retention time etc. k 1 pivot v a l u e s o f t h e pivot d a t a , from which a and
(Levenspiel, 1972). values, and hence appropriate operating
conditions may be estimated d i r e c t l y .
The linear differential e q u a t i o n s for microscopic
fluids may also be i n t e g r a t e d to p r o d u c e simple
analytic solutions.
Knowledge Base

Details of t h e knowledge base a r e too l e n g t h y to


Basis for Fault Diagnosis discuss here. It suffices to s a y t h a t the
knowledge i s represented in an hierarchical
T h e basis of t h e proposed a p p r o a c h i s to determine network, the top end (super-class) r e p r e s e n t s the
b y what a and f a c t o r s t h e p r o c e s s to be system being c o n s i d e r e d and t h e bottom e n d t h e
diagnosed deviates from t h e pivot c o n d i t i o n s . By i n d e p e n d e n t adjustment factors mentioned a b o v e .
means of a backward chaining p r o c e s s adjustment T h e s e may be of a r e g r e s s i o n t y p e , i . e . permitting
objects which p r o d u c e t h e s e a and v a l u e s , a r e t h e definition of t h e A r r h e n i u s equation a n d
determined. T h e u s e of n e u r a l n e t w o r k s to fundamental correlations s u c h a s S h = f ( S c , R e ) or of
perform t h e adjustment of a and v a l u e s , or to a factor t y p e e . g . (1 of n p o s s i b l e ) :
determine p r o c e s s conditions from given a a n d PROPERTY [Na-jarosite]
v a l e s , is p r e s e n t l y being i n v e s t i g a t e d . SUB-PROPERTY 100kg/m3
n-1 INTERACTION
Batch r e a c t o r s . The -factor is determined easily, FACTORS 1
i . e . simply t h e ratio of t h e final r e c o v e r i e s , w h e r e SLOPE FACTOR a=0.39
t h e slopes a r e z e r o . The -factor is determined RECOVERY FACTOR (3 = 1
via a time weighed slope i n t e g r a t i o n . Consider in COST FACTOR 1
dimensionless form for a first o r d e r equation
(similar for a second o r d e r equation) This adjustment object indicates t h a t t h e r a t e
d e c r e a s e s b y a factor of 0.39 if t h e Na-jarosite
seed concentration h a s a value of 100kg/m 3 (see
l/k' = t'k'-C*dt= (9) Table 8 ) . For t h i s same example t h e adjustment
o rule, constructed from the pivot adjustment
o b j e c t s , may be written a s :
which h a s been generalized to
if Na-jarosite precipitation (CLASS)
1/k" = E t n - k [ C ( t ) ] n - C ( t n ) - 5 t (10) and Distinguishing property (SUB-CLASS)
and [Fe 3+ ]=30kg/m 3
and [Na+]=8kg/m3
By comparing t h e time weighed slope l / k p i v o t of and [Zn 2+ ]=100kg/m 3
t h e pivot s e t { k [ C ( t ) ] n , C ( t n ) , t } to t h a t of t h e and [H 2 S0 4 ]=30kg/m 3
p r o c e s s to be d i a g n o s e d , an estimate for t h e ex- and [Na-jarosite]=300kg/m3
factor may be established from t h e ratio of t h e and temperature is 95C
time weighted slopes i n t e g r a t i o n , i . e . a = k ! / k p i v o t . then a=l
For second order processes this ratio also =l
a p p r o a c h e s W k p i v o t for t=> and for t h i r d o r d e r it cost factor=l
2
a p p r o a c h e s ( k ' / k p i v o t ) . For r e a c t i o n s of o r d e r ^
t h e r e s u l t of t h e time weighted i n t e g r a l is also T h e s e a n d o t h e r objects a r e g r o u p e d into a l a r g e r
1/k'. The time weighted slope l / k p i v o t for t h e object, t h e pivot object, defined for each data s e t .
pivot data is also an I D - t a g for t h e d a t a , which A n e u r a l network based a r c h i t e c t u r e is c u r r e n t l y
can hence be applied to identify o r e t y p e s e t c . being considered in addition to t h e adjustment
from t h e kinetic data a n d i t s associated pivot d a t a . object a p p r o a c h .
It is clear t h a t t h i s a p p r o a c h can also be u s e d to
determine t h e reaction mechanism for unknown
reactions if an e x t e n s i v e data b a s e is available.
Summary
If t h e batch simulation a p p r o a c h i s u s e d to
simulate continuous r e a c t o r s , t h e above method T h e s t r e n g t h of t h e a p p r o a c h d i s c u s s e d in t h i s
may also be applied to diagnose c o n t i n u o u s r e a c t o r paper is that it applies the same slope-
banks. concentration data s e t { k [ C ( t ) ] n , C ( t n ) , C ( t n + 1 ) }
to perform simulation a n d diagnosis in both b a t c h
Continuous r e a c t o r s . If t h e p r o c e s s is c o n s i d e r e d a n d continuous p r o c e s s e s .
to h a v e macroscopic fluid p r o p e r t i e s a similar
integration a s t h a t demonstrated b y e q . 9 may be
performed for continuous r e a c t o r s .
VALIDATION OF THE KNOWLEDGE
BASED MODEL
(ID
T h e v a r i o u s models d i s c u s s e d in t h e p r e v i o u s
section will be validated b y comparing t h e r e s u l t s
where with theoretical v a l u e s . T h e kinetic data in
column 1(a) of all t a b l e s a n d i t s associated
E(t)=(l/T)*(V/V )-exp[-(V/V )(t/r-Vp/V)] knowledge r e f e r to t h e pivot object. In t h e
or any other suitable function theoretical examples knowledge is not i n c l u d e d .
Also consult R e u t e r and Van Deventer (1991a&b)
For N ideal continuous r e a c t o r s in s e r i e s ( V p = 0 ) , for a v a r i e t y of o t h e r practical a n d theoretical
each with a mean retention time and examples of b a t c h p r o c e s s e s .
accommodating a first o r d e r reaction with a final
r e c o v e r y R, t h e above i n t e g r a t i o n would p r o d u c e Batch r e a c t o r s

C=(l-R)+[R/(l+k')] N (12) Two theoretical examples will be given in t h i s


section v i z . : a first o r d e r flotation model with a
Hence, given a specific RTD for a continuous r e c t a n g u l a r d i s t r i b u t i o n of r a t e c o n s t a n t s (Tables
r e a c t o r s y s t e m , R and k' values calculated from 1 a n d 2) and a second o r d e r r a t e eguation (Tables

127
3 a n d 4 ) . C Q i s 1 in both examples. Table 1 a n d TABLE4 Columns ( a ) : Concentration-time data
3 give t h e r e s p e c t i v e slopes calculated for each from t h e given model and k v a l u e s .
example via e q . 3 (& geometric mean) and Tables 2 Columns (b) according to e g . 4 - 6 .
a n d 4 give t h e r e s u l t s p r o d u c e d for v a r i o u s Columns (c) u s i n g geometric mean.
adjustment factors via equations 4 to 6.

(a): C/C0=l/(l+C0-k,-t) (b k c): Eq. 4 -

TAB LEI Summary of t h e calculated k f C ( t ) l n k'/min 20.0 10.0 1.00


v a l u e s according to e g . 3 , u s i n g t h e a 1.0 0.5 0.05
data in column 1(a) in Table 2.
t(min) Ka) 1(b) 1(c) 2(a) 2(b) 2(c) 3(a) 3(b) 3(c)
1 1
Column 1(a) of Table 2 0.000 1 000 1 000 1 000 1 000 1 000 1 000 1 000 1 000 1.000
k 0.025 0 666 0 670 0 665 0 800 0 819 0 815 0 976 0 980 0.979
C/C0=l-R-[l-(l-e" tj/k't]
0.050 0 500 0 450 0 498 0 666 0 670 0 666 0 952 0 961 0.960
C Range k[C(t)]n t[min]
0.150 0 250 0 225 0 248 0 400 0 381 0 398 0 870 0 887 0.884
1.0000 to 0.9507 0.496 0.000-0.102 0.175 0 222 0 200 0 221 0 364 0 348 0 363 0 851 0 869 0.867
0.9507 to 0.9046 0.487 0.102-0.204
0.9046 to 0.8615 0.478 0.204-0.306 6.225 0 008 0 008 0 008 0 016 0 016 0 016 0 138 0 138 0.138
etc. etc. etc. 6.250 0 008 0 008 0 008 0 016 0 016 0 016 0 138 0 138 0.138
.... to 0.0000 0.000
200.00 0 000 0 000 0 000 0 001 0 001 0 001 0 005 0 005 0.005

TABLE2 Columns ( a ) : Concentration-time d a t a


from t h e given model and R a n d k I t i s clear from t h e above examples t h a t no
v a l u e s . Columns (b) according to e g . v a r i a n c e s e x i s t in a a n d v a l u e s , t h a t t h e
4-6. simulation i s almost exact and t h a t no fitting is
performed, which a r e t h e proposed criteria for an
a p p r o p r i a t e kinetic model for u s e in a model based
(a): C/C0=l-R'[l-(l-e"k,t)/k't] (b): Eq. 4 - 6 diagnostic a p p r o a c h .

R & k 1.0 k 1.0/min 0.5 k 0.1/min 0.1 k 4.0/min Continuous r e a c t o r s


I P & a 1.0 k 1.0 0.5 k 0.1 0.1 k 4.0
In t h i s section continuous reactors will be
t(min) Ka) 1(b) 2(a) 2(b) 3(a) 3(b) simulated by following a macroscopic fluid
approach. I n each case t h e theoretical e q . 11 i s
I 000 1.000 1.000 1.000 1.000 1.000 1 000 compared to the results produced by the
0 50 0.787 0.787 0.988 0.988 0.943 0 943 approximate e q . 7 . To calculate t h e slope data
1 00 0.632 0.633 0.976 0.976 0.925 0 925 s e t , e q . 3 was applied t h r o u g h o u t to p r o d u c e t h e
2 00 0.432 0.433 0.953 0.953 0.913 0 913 r e s u l t s summarized in Tables 5 a n d 6. I t is clear
4 00 0.245 0.246 0.912 0.912 0.906 0 906 from t h e p r e v i o u s examples t h a t a geometric mean
6 00 0.166 0.166 0.876 0.876 0.904 0 904 would h a v e p r o d u c e d more a c c u r a t e results.
However, in most c a s e s , t h e c o r r e s p o n d e n c e i s
a d e q u a t e . T h e i n t e g r a t i o n was performed b y a 15
point Gaussian Q u a d r a t u r e ( P r e s s and o t h e r s ,
TABLE3 Summary of t h e calculated k i " C ( t ) 1 n 1989). Note t h a t t h e top integration limit was not
values according to e g . 3 (& geometric t a k e n to be infinity. The given top limit was
m e a n ) , u s i n g t h e data in column 1(a) sufficient to calculate t h e a v e r a g e concentration
in Table 4 . from a continuous r e a c t o r for both t h e first a n d
t h e second o r d e r p r o c e s s e s . Note, however, t h e
small variations in t h e a n s w e r s for t h e second
C o l u m n 1(a) of Table 4 o r d e r p r o c e s s e s when t h e top limit c h a n g e s from 20
C/C0=l/(l+C0-k'"t) to 10 in Tables 5 a n d 6 r e s p e c t i v e l y . The given
theoretical solution for t h e second o r d e r p r o c e s s is
C Range k[C(t)] n k[C(t.)]n t[min] calculated via
( 1 / k r ) -&(-1/* ) -Ei(l/kx)
arithm. geometr> t[min]
(Levenspiel, 1972; E i s e r r o r i n t e g r a l ) .
1.00 to 0.66 16.000 16.330 0-0.025
TABLE5 Comparison of e g . 7 and 11 subject to
0.66 to 0.50 11.429 11.547 -0.050
different p a r a m e t e r s for t h e r e s p e c t i v e
0.50 to 0.40 8.888 8.944 -0.075
kinetic and R T D s . C ( t n ) i s calculated
0.40 to 0.33 7.273 7.303 -0.100
from t h e given kinetic models a n d
0.33 to 0.29 6.154 6.172 -0.125
etc. etc. etc. etc.
k T C ( t ) 1 n from t h e s e r e s p e c t i v e data
.... to 0.00 0.00 0.00
( I n t e g r a t i o n limits 0 and 2 0 ) .

Equation and parameters e-t l/(l+t) l/(l*5t) i/(i+iot) I


In each of t h e examples a good c o r r e s p o n d e n c e i s a=l;=l;C0=l and 6=0.1
achieved between theoretical and approximate
models. For the geometric mean the t=l (Theoretical) 0.3679 0.5000 0.1667 0.0909
c o r r e s p o n d e n c e a p p e a r s to be much b e t t e r which Eq.5 t=l (Approx.) 0.3679 0.5000 0.1667 0.0909
permits a l a r g e r 6t to be t a k e n . From t h e f i r s t
slope in Table 3 calculated via e q . 3 it i s clear Eq.ll T=2;Vm=1.0;V =0.0 0.3333 0.4614 0.2002 0.1261
that k[C(t)]n=16=19.2-C for this interval jEq.7 r=2;Vm=1.0;Vp=0.0 0.3333 0.4616 0.2025 0.1320
(0.66<C<1), and for the geometric mean Theoretical 0.3333 0.4617 0.2014 0.1297
k [ C ( t ) ] n = 2 0 ' C , which is an exact approximation,
since r=-20C^ is t h e second o r d e r equation for Eq.ll x=2;Vm=0.8;V =0.2 0.2578 0.4075 0.1357 0.0748
t h e s e kinetic pivot d a t a . Eq.7 =2 Vm=0.8;Vp=0.2 0.2578 0.4075 0.1358 0.0749

Eq.ll T=2;Vm=0.5;Vp=0.5 0.1839 0.3613 0.1052 0.0559


Eq.7 r=2 Vm=0.5;Vp=0.5 0.1839 0.3614 0.1052 0.0559

128
TABLE6 Comparison of e g . 7 and 11 subject to The kinetics of this reaction are affected by a
different parameters for the respective variety of factors which include the temperature,
kinetic and RTDs. By applying the the concentration of the Na-jarosite seed,
data from column 1(a) of Table 4 as [ H 2 S 0 4 ] , [ F e 3 + ] and the [ N a + ] . The reaction
pivot to calculate kTC(t)1 n , C ( t n ) may kinetics for this reaction have been represented by
be calculated by e g . 6 (Integration a variety of complex models, e . g . :
limits 0 and 10).
dFe3+/dt=-k(T)[Fe3+]a[Na+]b[Na-jarosite]c[H2S04]d
(Rastas and others, 1979)
Equation and parameters l/(l+20t) l / ( l + 5 t ) 1 / d + t )
C 0 =l; p=l; 6=0.01 a=l a=0.25 a=0.05 or

t=l (Theoretical) 0.0476 0.1667 0.5000 dFe3+/dt=-{k1[Fe3+]2[Na+]1/2-k2[H2S04]1/4}


Eq.5 t=l (Approx.) 0.0472 0.1658 0.4999 (Wang Qlan-kun and others, 1985)

Eq.ll i=2;V m =1.0;V p =0.0 0.0786 0.2012 0.4609 From the results in Table 8 it is clear that the
Eq.7 i=2;7 i = 1.0;Vj=0.0 0.0790 0.1986 0.4569 proposed simple kinetic model and its associated
Theoretical 0.0803 0.2014 0.4617 knowledge base can produce the same and better
results as the rather complex and semi-empirical
Eq.ll T=2;Vm=0.8;V =0.2 0.0395 0.1357 0.4073 multi-parameter kinetic models given above. To
Eq.7 T=2 Vm=0.8;Vp=0.2 0.0390 0.1349 0.4065 predict the data in column 3(b) of Table 8 during
simulation, appropriate adjustment objects would
lEq.ll i=2;Vm=0.5;V =0.5 0.0288 0.1052 0.3613 produce the following adjustment rule:
!Eq.7 i=2 V m =0.5;v;=0.5 0.0286 0.1046 0.3610
1 if N a - j a r o s i t e p r e c i p i t a t i o n (CLASS)
Application to practical data and D i s t i n g u i s h i n g property (SUB-CLASS)
and [Fe 3+ ]=30kg/m^
Both examples discussed in this section were taken and [Na + ]=8kg/m 3
from the zinc hydrometallurgy. and [Zn 2+ ]=100kg/m 3
and [H 2 S0 4 ]=30kg/m 3
Zinc-ferrite leaching. The data for this example and [Na-jarosite]=100kg/m 3
were taken from Rastas and others (1979), who and temperature i s 85C
could describe these data by reaction controlled then =11110.39-0.39=0.152
shrinking core model. The model applied by 3=*=
Rastas and others was a reaction controlled cost factor=l
shrinking core model. As may be seen from Table
7 the correspondence i s very good. Also note the TABLE8 Na-Jarosite precipitation-time data
use of adjustment factors and their associated (Rastas and others, 1979). Data in
objects, e . g . for column 2 the temperature object column 1(a) were used to determine
would be (assuming only two objects viz. k [ C ( t ) 1 n by e g . 3 and used to predict
temperature and [H2SO4]): values in columns (b) according to e g .
4-6.
PROPERTY Temperature
SUB-PROPERTY 85C
1 INTERACTION (a): Data from Rastus and others ( b ) : Eq. 4 - 6
FACTOR 1 1
SLOPE FACTOR a=0.75 PIVOT
RECOVERY FACTOR =l c 95C 95C 85C 75C 95C
COST FACTOR 1 [Na-jar] 300kg/m3 100kg/m3 100kg/m3 100kg/m3 20kg/m3
TABLE7 Con version-time data for No 4 Zinc- a & 1.0 k 1.0 0.39& 1.0 0.15t 1.0 0.054&1.0 0.085&1.0
ferrite (Rastas and others, 1979).
Data in column 1(a) was used to t(min) 1(a) 1(b) 2(a) 2(b) 3(a) 3(b) 4(a) 4(b) 5(a) 5(b)|
determine kTC(t)1 n by e g . 3 , and to
predict values in columns (b) 0.00 30.0 30.0 30.0 30.0 30.0 30.0 30.0 30.0 30.0 30.0
according to e g . 4-6. 0.50 24.9 24.9 28.1 27.9 29.2 29.2 29.7 29.7 29.6 29.5 !
1.00 21.7 21.7 26.4 25.9 28.4 28.3 29.4 29.4 29.2 29.2
2.00 17.4 17.4 23.6 22.9 27.0 26.8 28.8 28.8 28.5 28.1
(a): Data from Rastus and others ( b ) : Eq. 4 - 6 4.00 12.9 12.9 19.6 19.1 24.6 24.2 27.7 27.6 26.8 26.4
7.00 9.7 9.7 15.7 15.3 21.7 21.5 26.2 26.0 24.5 24.2
PIVOT 15.00 6.2 6.2 10.6 10.7 16.7 16.6 22.8 22.8 19.0 20.3
C 95C 85C 75C 95C 95C 25.00 4.4 4.4 7.8 8.0 13.1 13.3 19.8 20.0 14.4 16.9
[H2S04] 30kg/m3 30kg/m3 30kg/m3 50kg/m3 15kg/m3
a & 1.0 & 1.0 0.75& 1.0 0.43& 1.0 1.9 k 1.0 0.3 & 1.0
FAULT DIAGNOSIS
t(min) 1(a) Kb) 2(a) 2(b) 3(a) 3(b) 4(a) 4(b) 5(a) 5(b)
Diagnosis in this paper is taken to imply the
0.00 1.00 1 00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 detection of faults within the respective unit
0.40 0.84 0 84 0.88 0.88 0.93 0.93 0.72 0.72 0.95 0.95 operations, i . e . deviations from process conditions
1.00 0.65 0 65 0.73 0.73 0.82 0.83 0.43 0.45 0.88 0.88 within metallurgical reactors. Fault-diagnosis i s
3.00 0.30 0 30 0.39 0.39 0.56 0.58 0.13 0.13 0.68 0.68 discussed under three headings viz. the
5.50 0.15 0 15 0.22 0.20 0.37 0.38 0.04 0.05 0.49 0.49 identification of a class and s u b - c l a s s of a meta
7.00 0.10 0 10 class and fault-diagnosis in batch and continuous
16.00 0.016 0 016 reactors respectively.
25.00 0.004 0 004
54.00 0.000 0 000

Identification of a class and sub-class of a meta-


Jarosite precipitation. The example being class. This method is based on comparing the
discussed here is the precipitation of F e 3 + as Na- final recovery and time-weighted slope of kinetic
jarosite (Rastas and others, 1979): data of an as yet unknown class to those available
in the knowledge base. This permits the
3Fe 2 (S0 4 ) 3 +Na 2 S0 4 +12H 2 0=2NaFe 3 (S0 4 ) 2 (OH) 6 +6H 2 S0 4 identification of the unknown class. If the batch

129
flotation kinetic data are given, the possible ore DISCUSSION AND SIGNIFICANCE
body and possible flotation conditions could be
established for a meta-class, i.e. for example the A simple generalized approach was discussed that
flotation for a specific plant or ore body. If the permits both the dynamic simulation of batch and
knowledge base is constructed consisting only of continuous processes. The basis is process
pivot objects (no adjustment objects), the above independent and has been applied to simulate
process could also be applied to perform process adsorption, leaching, reduction, precipitation and
identification i.e. the pivot conditions, associated flotation processes. The same basis has been
final recovery and time weighted slope data define shown to permit diagnosis of batch as well as
both batch and continuous (C-retention time data) continuous processes.
processes.
The above aspects are possible since the proposed
Identification of the process conditions within a kinetic model gives an exact description of the
batch reactor. Consider the data in columns 1(a) above mentioned processes without forfeiting
and 2(a) of Table 4. The theoretical time accuracy due to curve fitting. The 'one
weighted slope for these respective columns are parameter' model is well suited for manipulation by
0.1281 and 0.2561 which implies a=0.5. Since the adjustment factors in an object architecture. At
final recovery is 100% in both cases, is equal to present a neural network architecture is being
1 (=100%/100%). The inference engine would investigated to relate the adjustment factors a and
hence attempt to establish which process conditions respectively to process conditions.
would produce the given adjustment factors. The
practical data summarized in Tables 7 and 8 could
not be applied here as the data do not cover the REFERENCES
process up to the final recovery and precipitation
respectively. Beltramini, L. and R.L. Motard (1988). KNOD-A knowledge-based approach
for process design, Comput. Chem. Enqnq., 12, pp. 939-958.
Identification of the process conditions within a Birkey, G.J., T.J. McAvoy and M. Modares (1988). An expert system for
continuous reactor. Two methods will be pointed distillation control design, Comput. Chem. Enqnq., 12, pp. 1045-
out in this section. A third possible method based 1063.
on a microscopic fluid approach is discussed by Dalle Molle, D.T., B.J. Kuipers and T.F. Edgar (1988). Qualitative
Reuter and Van Deventer (1991b). modeling and simulation of dynamic systems, Comput. Chem. Enqnq.,
12,. pp. 853-866.
Method 1 applies equation 9 and 10 as for batch Davis, R. (1984). Diagnostic reasoning based on structure and function,
processes, to describe the kinetics of the flotation Artificial Intelligence, 24, pp. 347-411.
bank as a whole. This approach implies that the Dowling, E.C., R.R. Kumpel and F.F. Apian (1986). Use of kinetic models
pivot objects are defined accordingly i.e. they do to analyze industrial flotation circuits, Proceedings The
not include batch kinetic data but contain Reinhardt Schumann International Symposium on Innovative
concentration-retention time (or stage number) Technology and Reactor Design in Extraction Metallurgy, Editors
data. All adjustment objects are defined D.R. Gaskell, J.P. Hager, J.E. Hoffmann, P.J. Mackey, TMS-AIME,
accordingly with reference to these data. One Colorado, November, pp. 533-552.
would obviously not be able to perform process Fusillo, R.H. and G.J. Powers (1988). Operating procedure synthesis
identification of batch cells with these data, hence using local models and distributed goals, Comput. Chem. Enqnq.,
forfeiting generality. This method is hence l ^ p p . 1023-1034.
identical to that discussed previously for batch Hoskins, J.C. and D.M. Himmelblau (1988). Artificial neural network
reactors. models of knowledge representation in chemical engineering,
Comput. Chem. Enqnq., 12^ pp. 881-890.
Whereas the batch fault-diagnosis procedure as Kuipers, B. (1984). Commonsense reasoning about causality: Deriving
applied in method 1 applies eq. 9 or 10 to behaviour from structure, Artificial Intelligence, 24, pp. 169-
determine an estimate of a, equations 7 or 8 and 203.
12 are applied as the basis of diagnostic procedure Levenspiel, 0. (1972). Chemical Reaction Engineering, John Wiley k Sons,
for method 2. This implies that batch kinetic data New York.
form the basis of this method. Press, W.H., B.P. Flannery, S.A. Teukolsky and W.T. Vetterling (1989).
Numerical Recipes in Pascal, The Art of Computing, Cambridge
It has already been stated that for a first order university Press, New York.
rate equation and an ideal retention time Qian-kun, W., M. Rong-jun and T. Zhi-zheng (1985). The jarosite process
distribution (V m =l) the solution of equation 11 is - kinetic study, Proceedings: Zinc'85, AIME, pp. 675-690.
l/(l+k*r). A meta-rule based on this equation has Rastas, J., S. Fugleberg, L-G. Bjrkqvist and R-L. Gisler (1979).
been given above (eq. 12) for estimating a and Kinetik der Ferritlaugung und Jarositfllung, Erzmetall, 32, pp.
values. If the solution for the kinetic data 117-125.
represented by e~^ are used to solve equation 7 or Reuter, M.A., J.S.J. Van Deventer and W.I. Van der Merwe (1991a). The
11, a result of C av =0.333 is produced for Vm=l application of knowledge-based systems to the simulation of gold
and =2. For the kinetic model e~ 2t this result extraction processes, Minerals Engineering, 4, pp. 103-119.
would be C a v =0.2. It is a trivial exercise to work Reuter, M.A. and J.S.J. Van Deventer (1991b). Knowledge-based simulation
backwards from the given C a v values, via equation and identification of various metallurgical reactors,
12, to obtain the respective rates of 1 and 2 for Metallurgical Transactions B, In press.
=2. This would imply that a is equal to 2 with Shortliffe, E. and B. Buchanan (1975). A model of inexact reasoning in
reference to the pivot value of k=l. It is obvious medicine, Math. Bioscience, 23, pp. 351-379.
that the meta-rule eq. 12 can only produce an Schuhmann, R. (1942). Flotation kinetics I. Methods for Steady-State
exact value for a if the process has first order Study of Flotation Problems, J. Phys. Chem., 64^ pp. 891-902.
kinetics. The meta-rule can, however, give an Sohn, H.Y. and M.E. Wadsworth (1979). Rate Processes of Extractive
estimate for an initial a value which can be refined Metallurgy, Plenum Press, New York.
by a subsequent iterative procedure. Since the Stephanopoulos, G., J. Johnston, T. Kriticos, R. Lakshmanan, M.,
solution of equation 7 is rapid, this iterative Mavrovouniotis and C. Siletti (1987). Design-Kit: An object-
procedure is fast. As an example consider the oriented environment for process engineering, Comput. Chem.
data for =2 and Vm=l in Table 5 for the three Engnq., 11^ pp. 655-674.
second order equations. If the data for the Tucker, P. and K.A. Lewis (1988). An expert system for shaking table
second order equation l/(l+t) are taken as the diagnostics, Minerals Engineering, 2, pp. 87-92.
pivot, the following three a-values could be Tzouanas, V.K., C. Georgakis, W.L. Luyben, and L.H. Ungar (1988). Expert
estimated, viz. 1, 3.4 and 5.9 for the given multivariable control, Comput. Chem. Engnq., 12, pp. 1065-1074.
integration limits from 0 to 20. From these data Venkatasubramanian, V. and S.H. Rich (1988). An object-oriented two-tier
the process conditions can subsequently be architecture for integrating compiled and deep-level knowledge for
estimated. The theoretical a values are 1, 5 and process diagnosis, Comput. Chem. Engnq., 12, pp. 903-921.
10 respectively for =1.

130
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

SELF ORGANIZING CONTROL OF pH IN A STIRRED


TANK REACTOR
I.M. Shah and R.K. Rajamani
Department of Metallurgical Engineering, University of Utah, Salt Lake City, UT84112, USA

M?is..tXiact. To-date, most of the useful work on the self organizing controller (SOC) has
been done on simulated processes. To test it in a real environment, its application to
controlling pH in a stirred tank reactor is tried here. Due to extreme nonlinearity of
the pH process, the basic design of SOC did not result in adequate controller response.
Therefore, an alternative approach utilizing preselected scaling factors but
appropriately choosing performance rules is examined. It is found that the decision
table resulting from a non-adaptive fuzzy controller serves as a good choice for the
performance table for the SOC. Under both servo and regulator type control situations,
the SOC provided impressive pH response. It is concluded that this simple self learning
controller methodology deserves attention. It can be suitable for many difficult to
model metallurgical processes.

Keywor<liS. Fuzzy set theory; computer control; pH control; control nonlinearities;


adaptive control; metallurgical industries.

A considerable amount of research on SOC has been


INTRODUCTION done at University of London, under the guidance
of Prof. E. H. Mamdani. To-date, most of the
Fuzzy set theory (FST) provides for emulating useful work is on simulated processes (Lembessis,
human abilities of extracting essential features 1984; Procyk, 1977; Yamazaki, 1982). A wide range
of a system, and using this granulated knowledge, of processes, varying in terms of order of the
not dependent on accurate measurements, in process and magnitude of features like dead-time
judgement and decision making process. Control is and non-linearity have been considered. A notable
thus a natural candidate and a major application limitation of these works has been the fact that
area for the FST. A unique feature of a fuzzy SOC has not been tried on a real system. Also, the
logic controller (FLC), which distinguishes it past studies have concentrated mainly on the servo
from other rule based systems, is representation type control problem, i.e., handling changes in
of variables in the rules by fuzzy sets. the set point. Tackling both servo and regulator
type problems, the latter needing adjustments when
process load or disturbance changes, have been
Mamdani and Assilian (1975) developed the first targeted in the present work. Another important
fuzzy controller. Applications to a variety of aspect considered is the effect of changing
processes have been studied since then (Lee, 1990; process dynamics. Changing time constant and
Tong, 1985). These research efforts have indicated process gain with time is an important
that the FST is effective in providing a framework characteristic of many real processes. In fact,
for adequate controller performance. The FLC has a these are the processes that are most difficult to
definite advantage in terms of ease of model and control through traditional control
implementation. Another important feature is techniques. Thus the present effort is aimed at
elimination of the need of a precise mathematical bridging some critical gaps in the SOC literature,
model of the process. by implementing SOC on a real process having
changing process dynamics.

One of the presumption in the FLCs is the


availability of a comprehensive set of control THEORETICAL BACKGROUND
rules (a protocol) to accomplish the control task.
In many process situations it is not possible to In the following, the essentials of the framework
pre-specify all necessary control rules, but only is briefly discussed. First the concepts pertinent
a few general rules can be stated a priori. Thus to the FLC are presented as it forms the lower
it is desirable that the controller learn from level of the two levels of SOC, the main topic of
past performance and generate required rules this research.
dynamically. This appealing extension to the FLC
has been referred to as the self organizing gemxal ^ r u c t o of ELC
controller (Procyk and Mamdani, 1979). It can be
likened to the adaptive controller of the control A fuzzy set (also called a fuzzy subset) A in a
literature. It provides a promising alternative universe X is characterized and represented by a
for control of complex processes, especially those membership function fA(x), e.g.,
involving changing process dynamics.
A = [ fA(x); x E X ] (1)
where fA(x) : X > [0,1]

131
An important concept in FST is that of implication base on-line, either by generating new rules or
function. It deals with interpretation of a rule modifying the existing rules.
(a conditional statement). Consider E, C, and as
three fuzzy variables. A rule, denoted as Sk, has
the form "If Ek and Ck then Uk". Note that this is For convenience, consider a case where the control
an abbreviation of "If E is Ek and C is Ck then U rules and the performance rules both have same two
is Uk". For control applications, Mamdani's fuzzy variables as antecedents. These are error,
definition of implication function (Mamdani, 1974) E, and change in error, C. They represent
is widely used (Lee, 1990; Yamazaki, 1982). It information on the state of the process output,
defines Sk as a fuzzy set, represented as a which is also considered to be the controller
cartesian product Ek x Ck x Uk. Its membership input. For the control rules, a fuzzy variable
function fgjje, c, u) is calculated as: change in the process input, U, is taken to be the
consequent and hence the controller output.
Min [ %,.(), fck( c )' f Uk< u ) 3 (2) Through the use of variables E and C as
antecedents, a set of performance rules assesses
Several statements SI, S2, ..., SN connected using the process performance at every sampling instant.
The performance rules give as a consequent a fuzzy
a connective "else" (or "also") form a set of variable, P, defined as the desired change in the
statements (rule base). The overall relation R is process output to improve process performance. The
expressed as a union (Max) of individual variable P is assigned a value that reflects the
relations. It is characterized by a membership degree of correction required by the process
function % ( e , c, u ) ; output. In all the applications in the past, P
fR(e, c, u) = Max [ fgi(e, c, u), f ^ e f c ' u ) ' directly represents reinforcement for adjusting
the consequent part of a responsible control rule.
..., f SN (e, c, u) ] (3)
For notational convenience, R is expressed as
R E C U = Max k (Ek x Ck x Uk) (4) The fuzzy variables E, C, U, and P take fuzzy
values, expressed as pre-defined fuzzy subsets,
This relation defines the overall mapping from the representing their linguistic meanings. A set of
two antecedents E and C to the consequent U. control rules gives a relation matrix R E Q J , as
given by Eq. (4). Similarly, the set of
performance rules gives relation matrix REcp:
An inference rule helps determine a fuzzy subset
Uj, induced by the relation R and fuzzy values Ej R E C P = Max k (Ek x Ck x Pk) (7)
and Cj. For inference, the Max-Min rule proposed Consider that E is Ej and C is Cj at a particular
by Zadeh (1973) has been used in most control sampling instant. Then the output of the
applications. This operation is represented as controller, Uj, is represented by Eq. (5), and the
Uj = Ej o Cj o R (5) performance index value, Pj, can be similarly
inferred as:
with the membership values of Uj characterized by
fuj(u) = Maxg M a ^ { Min [ f E j(e), f C j(c), Pj = Ej o Cj o R E C P (8)
fR(e, c, u) ] } (6) The subset Uj represents the controller output
which is used to obtain the process input (control
action) while the subset Pj is used to modify the
The forgoing discussions deal with a framework for controller rule base and hence the corresponding
reasoning in situations involving fuzzy (or controller relation matrix.
qualitative) inputs and outputs. In process
control, however, the information provided through
process instrumentation is deterministic (non- The rule modification procedure uses a parameter
fuzzy). Also, only deterministic quantities are called "delay in reward". For simplicity, this is
sought as corrective actions from the controller. specified as a number, m, representing the number
Without going into details, Fig. 1 presents the of samples in the past whose action relates to the
functional diagram of a FLC depicting the general current process situation. Let E 1 and C be the
structure that has evolved over the years. error and change in error and U ! the corresponding
controller output at that time (m samples in the
past). Thus the desirable controller output for
.ae......Seli.....Qrs.ani^.ing......Leyel future situation of E 1 and C , based on the
performance index, would be V = U' + Pj. In the
The FLC described above is characterized by a set controller rule base, thus, the rule E 1 x C x U'
of if-then type qualitative rules. The derivation should be replaced by a rule E' x C x V .
of an adequate set of rules and its representation
in a form suitable for use in FLC is difficult for
complicated processes. Application of fuzzy logic
in such practical situations, requiring dynamic One possible method suggested for effecting the
learning capabilities, is the basic premise of rule replacement is through a linguistic operation
SOC. In the present work, the framework of SOC as of the form
proposed by Procyk (Procyk, 1977; Procyk and RM = { C (REOJ) but n o t R,
l else
ER"] }
W
Mamdani, 1979) has been followed. It incorporates Here, R1 and R'' are the relation matrices of the
a mechanism for the evolution of fuzzy control rules to be deleted and added, respectively, and
rules based on process performance evaluations. RN denotes the modified controller relation
matrix. In terms of set theoretical operations,
this can be written as
Figure 1 also depicts the structure of the SOC.
Essentially, a supervisory level is added which RN = Max { [ Min ( R ^ , ') ], [R ,f ] } (10)
changes the functional relationship between the
controller's input and output whenever the ' denoting the complementation operation.
performance is found unsatisfactory. The operator
or the control system designer provides what are The interpretation in terms of relation matrices
called the performance rules. These rules specify has shortcomings in terms of memory usage and
the objectives the controller is to follow. The computation time. Another limitation is that the
algorithm is designed in such a fashion that these rules which form the controller are not directly
performance rules can also modify the control rule accessible. This is because they are collectively

132
represented by a relation matrix only. Thus a pH = -log10 [H+]
"simplifications" has been suggested to make the Control of pH is an important objective in many
procedure more practical. This involves replacing industrial processes. For applications like water
the operations on relation matrices with those on and wastewater treatment and many chemical and
the control rules themselves. biological processes, it is crucial to maintain
the pH within a specified range. In mineral and
extractive metallurgy industry also, it is a
The simplification is based on De Morgan's law common practice to control pH in process vessels.
which inter-relates the union (Max), intersection For example, the pH of reclaimed water is
(Min) and complementation operations. The law, maintained at a pH of about ten before being fed
which holds for both ordinary and fuzzy sets, can to a grinding circuit. Mine water treatment
be stated as "complementation of intersection plants, for heavy metal removal, also require pH
(union) is equivalent to union (intersection) of to be maintained in the range of 9.5-10.0 to
complements". precipitate the metals as hydroxides, and then
final pH adjusted to 6.0-9.0 before discharging
the treated water (Young and Bohling, 1981). Also,
Using De Morgan's theorem, the complement of R', many reagents in processes like flotation and
i.e., '( = E' x C x U M can be written as flocculation work best only within a specified
range of pH values. Solvent extraction units,
' = Max (', ,!!') (11) leaching vessels, ion exchangers also require pH
An identity is used at this stage to expand Eq. monitoring. Corrosion inhibition is another area
(11) into a more convenient form. It specifies where control of pH is important.
that the intersection (Min) of a fuzzy subset and
a unit subset is equivalent to the fuzzy subset.
Let IE, IC and IU be defined as unit subsets of With the availability of very sensitive and
respective universes, having all membership values accurate instruments, measurement of pH has become
as 1.0. Then R1 can be written as a routine in many industrial processes.
Maintaining pH is, however, by no means trivial.
' = Max [ (' x IC x IU), (IE, 'x IU),
The control is troublesome because of inherent non
(IE x IC x IT) 1 (12) linearity of the pH response to reagent additions,
characterized by a S-shaped titration curve.
Additionally, in many cases this titration curve
Considering Eq. (12), the r.h.s of Eq. (10) can be changes due to changes in the composition of the
written in terms of individual rules as process solutions.
Max { [ Min ( { Maxk (Ek x Ck x Uk) },

{Max [ (' x IC x IU), ( IE x .' x IU), The pH process was selected as an example problem
due to the challenge it presents. The other
( IE x IC x IT) 1 > ) L [ (E x C x V ) ] } advantage it offers is that it is not very
difficult to set up a test facility in a
which on manipulation gives,
laboratory. Additionally, it provides flexibility
in terms of adjustment of the level of difficulty
Max { [ Maxk {[( Ek x *) x Ck x Uk], for the control problem. For example, the time
[Ek x (Ck x ') x Uk], [Ek x Ck x (Uk x II")]} ], lag, dead time, non linearity, servo or regulator
type problem, etc. can be appropriately selected
[ (' x C* x V ) ] } (13) with ease. Another interesting feature of the pH
Eq. (13) suggests the following procedure. Every process is its characteristic non-sharp target
rule Ek x Ck x Uk be replaced by three new rules: specification making it a suitable candidate for a
fuzzy logic based approach. As Shinskey (1973) has
[ (Ek x E') x Ck x Uk ] noted "Desirable response for a pH loop may differ
[ Ek x (Ck x ) x Uk ] considerably from that expected from other
processes. ... any pH between 6 and 9 may meet
[ Ek x Ck x (Uk x U') ] local regulations and therefore be acceptable." It
This new set and the additional rule E' x C x V must be emphasized, however, that the main purpose
form the modified control rule base. of the investigation is the explorations of the
technique of SOC.

The difficulty however is the forming of new


subsets (e.g. (Ekx')) and tripling of the Considering the importance of pH regulation, it is
number of rules every time a rule change is not surprising that extensive literature exists
necessary. To overcome this, an approximation is directly addressing the pH control problem.
used. It is based on reasoning that a rule not MDreover, in testing of novel controller schemes,
sufficiently similar to the rule to be deleted many researchers use the pH process as an example
need not be changed. The degree of match is process because of the challenge and flexibility
checked using the property that the intersection it offers. Some of the recent experimental
of a subset and its compliment produce a subset of investigations are briefly discussed in the
membership values all less than or equal to 0.5. following. Of the numerous simulation based
Thus, every rule is checked by checking the studies, only a couple are mentioned (Clark and
membership values for subsets (Ek x j ' ) , (Ck x Gawthrop, 1981; Parrish and Brosilow, 1988).
'), (Uk x 12'). If all have membership values less Because pH control "per se" is not the main
than or equal to 0.5, that rule k can be deleted. interest, a detailed review of literature is
If not, the rule is left unaltered. When all the avoided.
existing rules are thus processed, the new rule E*
x C x V is included in the controller rule base.
Gustafsson (1985) studied two different controller
schemes. Laboratory scale equipment was used to
THE pH CONTROL PROBLEM test the controller implementations. Dynamic
modeling was done considering a set of reaction
For the discussions in this work, pH is defined in invariant variables and a reaction variant
terms of [H + ], the hydrogen ion concentration in variable. A system of decoupled linear
g-ions/1: differential equation specified the reaction
invariant part whereas a nonlinear static equation

133
related the reaction invariant variables to the Two centrifugal pumps were used to deliver the
pH. The process was first modeled off-line based acidic process fluid from two separate storage
on experimental input-output data. The first tanks. This permitted making predefined change in
controller scheme was designed to give linear process solution compositions. The process inflow
control loops, considering linear feedback of the rates were monitored using two block type flow
reaction invariant state vector. This reaction meters each equipped with a valve. A Cole-Parmer
invariant state vector, representing the digital pH transmitter, a pH electrode and an ATC
concentrations of hypothetical weak acids having probe were used for pH measurements. The
pre-specified dissociation constants, is not transmitter supplied a 4-20 mA analog signal
directly measurable. It was computed based on the proportional to the pH value. For the controlled
measurements of pH and other control signals. The reagent flow, an Ismatec peristaltic pump was
implementation effectively implies a nonlinear used. A 0-10 V signal was used to manipulate the
feedback of pH. The second scheme utilized a pumping rate. A Keithley measurement and control
linear feedback of pH, the feedback law being of system was used for signal processing. An IBM PS/2
self tuning type. Here, for each sampling time, Model 70 computer was interfaced to the data
the system parameter were estimated as they are acquisition system for SOC algorithm
used in the calculation of the new feedback implementations.
coefficients.

Fuzzy Representation
Jutila (1983) also used the idea of hypothetical
chemical species in modelling the unknown liquid In the present work, the error universe has 14
composition. In an off-line identification step, elements represented by integer values in the
the process solution titration curves were used in range -6 and 6, with two zero values (0 and 0~).
evaluating the dissociation constants of the The others have integer values between -6 and 6,
hypothetical species. An on-line Kalman-type but 13 elements (only one 0). The number of fuzzy
filter algorithm estimated the concentrations of subsets (linguistic values of the fuzzy variables)
the hypothetical species. The pH and flow defined on each of these universes are also the
measurements were used in this estimation step. same as the number of elements. We refer to Shah
The controller adaptation was performed based on (1991) for details.
this estimates. The modeling and control approach
was studied by simulations and the scheme was also
applied to a full-scale chemical waste-water In the fuzzy logic framework, the levels of
treatment plant. The possibility of adaptive discretization of the universes, number of fuzzy
feedforward control loop was mentioned as the subsets, grades of membership, etc., needs to be
distinguishing feature of this approach. defined subjectively. These variables affect the
computations and the memory requirements, the
fineness of controller, and so on. The use of
Investigating applications to the pH process of above mentioned definitions for the pH process,
self-tuning controller (STC), Proudfoot and co- which is very much different from the processes
workers (1983) carried out a series of extended considered by Procyk, thus disregards this
trials. Here also, the target of the control was a subjectivity argument. It must be emphasized,
full-scale industrial-effluent treatment plant. however, that SOC has additional adjustment
The STC was derived based on the assumption of a options like scaling factors, performance rules,
linear first-order process model, together with a delay in reward, etc. The use of subjective
cost function dependent on the integrated output definitions will add complexity to the designing
error. It was demonstrated that the self-tuning of SOC. Thus the above ad hoc choice was done for
algorithm was able to provide better parameter convenience and to demonstrate the applicability
values for the proportional integral controller of a general framework of SOC for widely varying
(PIC) than manual tuning. Also, the adaptive processes.
controller gave better results than achieved by
fixed parameter PIC.
RESULTS

It is sufficient to state that many researchers in In the work reported here, a sampling time of 0.1
the past have investigated advanced estimation min was used. Also, the delay-in-reward parameter
and/or adaptive control techniques to overcome the was set to a value 1 for convenience. Thus only
difficulties of non-linearity and changes in non- the scaling factors and the performance rules are
linearity. MDSt methods require the development of left for adjustments. Knowledge of titration
a complex mathematical model. The methods of fuzzy curves and process operating characteristics was
logic and SOC eliminates this need and thus can be used in designing a FLC (without adaptation) for
considered an attractive alternative. the pH process. The study, described in Shah and
Rajamani (1990), showed that a set of intuitively
developed control rules drove the controller in
EXPERIMENTAL SET-UP the right direction. Nevertheless, the response
exhibited initial oscillations and long settling
A 4 1 pyrex glass beaker or a 5 1 polyethylene time. Difficulties in developing an adequate set
vessel was used as a reactor. Both had baffles to of rules for the nonlinear pH process were
help in achieving proper mixing inside the outlined. The need for developing an SOC was thus
reactor. In the tests reported, an acidic process demonstrated.
solution was neutralized using an alkaline
reagent. Both the process solution and the alkali
reagent entered near the bottom of the reactor. For initial SOC tests, the set of parameters that
The pH and the ATC probes were positioned near the worked under the FLC environment were kept the
top to avoid the localized effects of the entering same and the performance rules developed by Procyk
solutions. The reactor outlet was an overflow tube were used. A typical result is shown in Fig. 2.
near the top, maintaining a constant liquid volume Note that the test was started with acid solution
in the reactor vessel. The effective hold up in the reactor. This, in effect, corresponds to a
volume for the two vessels were 3.95 1 and 4.65 1, set point change from the incoming pH value to a
respectively. set point value of 7 at time t = 0. Note that in
this manuscript, such a problem is denoted as
implicit set point change control problem.

134
The set of control rules used for the FLC test
Observe the very slow rise in pH during the were used directly as the performance rules. The
initial period. Two factors are responsible for scaling factor values chosen were different, as
this. (1) the nature of the pH process i.e., the different acid flow rate and concentration values
titration curve for the solutions considered were used. In fact, different tests in this set
indicated that a relatively large change in the were different amongst each other in this respect.
alkali amount is needed to cause any appreciable Another modification was the subdivision of the
change in pH when solution pH is near 3.0. (2) The error space into four regions rather then two as
neutralizing alkali needs to neutralize both the done previously.
incoming acid and also the acid residing in the
reactor. The alkali rate thus exceeded the amount
needed to neutralize the incoming acid. In fact, Figure 3 shows one of the results. Note that the
it reached the maximum value permitted by the solution in the tank at the start of the test was
pump. It also resulted in the overshoot as seen in dilute compared to the incoming acid and the pH
the pH response. value (about 3.5) at time zero is higher than the
pH of incoming solution (about 3). A marked
improvement in the response is clearly seen.
Afterwards, the controller made changes in the Unfortunately, not enough amount of reagent was
reagent addition rate to drive the pH value available and the run had to be stopped at about
towards the set point. In this respect, the 12.7 min.
response was very similar to that observed for the
FLC test. However, once the pH was in the set
point region, the controller could not maintain For the next test, the same process solution was
the steady reagent flow rate. It continued to used again but the flow rate was reduced to almost
change some rules and a gradually diverting half the value. To keep the nonlinearity the same,
response resulted. the reagent concentration was also reduced. Under
this conditions and keeping the same controller
parameters, it should be possible to get results
In the above tests, including the FLC test, the similar to those of the above test. Note however
following scaling factor strategy was used. For a that at the start of the run, a lower
particular test, constant GE and GC scaling factor concentration alkali reagent has to neutralize the
values were used to map the actual error and same volume of acid in the tank. Thus a more
change in error values to the elements of the sluggish start up should be expected. Put another
specified universes. However, variable GU values way, because of the change in the process flow
were considered for two separate regions of the rate, the dynamics of the process are not the same
error space. These regions correspond to; (1) as before (the process time constant will be
steep middle region and (2) flat regions at upper almost doubled) and a longer time should be
and lower ends of the titration curve. The expected before convergence.
variable GU values were achieved using a constant
GU value but separate GU multiplier values. Thus a
small GU value, but using a large multiplier value The result shown in Fig. 4 indeed confirm these
for the outer region, faster response can be assertions. It shows that the response is similar
achieved by making large changes in reagent flow to that of Fig. 3 except that the time scale is
rates. A value of 1 for GU multiplier for the almost doubled. Many other similar tests also gave
other region allowed finer reagent rate tuning, adequate controller performance, driving the
for pH relatively closer to the set point. This controller into set point region and reasonably
strategy thus corresponds to having two maintaining it there. These tests demonstrated
individually controlled reagent delivery lines that the SOC implementation was able to handle the
making fine and coarse adjustments in reagent flow implicit change in set point type control problem.
rates. Tests where adjustment to explicit set point
change was demanded of the controller also showed
adequate performance. It is to mentioned, however,
Additional tests using different GE, GC and GU that a set point value of 7 is most demanding
values did not show any improvement. The above because generally the sensitivity at this point is
tests represent the usual design method used in maximum. Adjusting to load change or process
the SOC literature, of using a particular disturbance is another common control problem.
performance index table and adjusting scaling This was dealt with next.
factors to tailor to the problem. The approach is
based on initial success (at least in terms of
process output convergence) of the particular In the test shown in Fig.4, a disturbance in terms
performance rule set for certain simulated of a sudden increase in alkali strength was
processes. It is likely that some combination of introduced at about 50 min. Note that for the pH
the scaling factors could have provided better process, this also corresponds to a change the
results for the pH process also. However their process nonlinearity. Notice that it made the
selection can be tedious. controller response oscillatory. Thus the SOC
designed above did not adequately handle the
regulator type control problem.
An alternative is to choose scaling factors and
partition the space based on process and
performance requirements. The performance rules A more common type of load variation in pH
can be tailored for the application. The problem processes is a change in incoming acid
here is how to develop the set of performance composition/concentration. This is considered in
rules. Towards this end one possible approach is the next test. Here again, a implicit type
to first develop a fuzzy controller (no SOC level) regulator problem is considered first. Note that
around process conditions where adequate if a test is started with distilled water in the
understanding exists. Once a set of rules provides reactor then the pH at time t = 0 will be around
reasonable controller performance, a set of 7. The situation is analogous to the controller
performance rules may be developed based on such being on and distilled water being the inflowing
rules. This was examined next for the pH process. process solution. The process thus effectively
experiences a disturbance in the flow as the acid
starts flowing into the reactor. This is the so
called implicit regulator type problem. For

135
explicit changes, a change in concentration of one In conclusion, use of FST as a tool to reduce the
of the components of a two component acidic complexity to a manageable level should receive
process solution was considered. more attention. It is necessary that research
continues in the application of fuzzy logic to
difficult to solve problems. Then only the
For the tests done next the incoming acid was greatest promise and ambitious goal of solving
normally 0.001 N HC1-0.001 N CH3COOH solution. problems not amenable to conventional techniques
Also note the following three modifications in the of precise mathematics can be fulfilled.
SOC scheme, which were introduced based on tests
with such solutions prior to the reported test.
One modification was based on the observation that REFERENCES
a particular change in error value, say 0.5, may
be considered small in the steep region but it Clark, D.W. and P. J. Gawthrop (1981).
should be classified as large a change in error Implementation and application of
for the flat region of the titration curve. Thus microprocessor-based self-tuners.
like variable GU, variable GC scaling was Automatica, vol. 17, no. 1, 233-244.
considered. Second, unlike previous scheme using E Gustafsson, T.K. (1985). An experimental study of
alone, both E and C values were considered in a class of algorithms for adaptive pH
assigning GU multiplier value. Third, in line with control. Ch. Eng. Sei., vol 40, no. 5, 827-
the adopted SOC design philosophy, a new 837.
performance table was developed and used. Jutila, P. (1983). An application of adaptive pH-
control algorithms based on physico-
chemical modelling in a chemical waste-
Figure 5 depicts result of one of the tests. As water treatment plant. Int. J. Control, vol
mentioned, this test was started with distilled 38, no. 3, 639-655.
water in the reactor. Note that another test was Lee, C.C. (1990). Fuzzy logic in control systems:
done (not shown), where the reactor initially had fuzzy logic controller-Part I and II. IEEE
acid solution. Thus the results of the two tests Trans. Syst. Man Cybern., vol. 20, no. 2,
provided a basis for comparing the controller's 404-435.
response to the two types of control problem Lembessis, E. (1984). Dynamic learning behaviour
mentioned above. It appeared that the controller of a rule-based self-organizing
learned more quickly in the load change problem. controller". Ph.D. Thesis, Queen Mary
The nonlinearity of the pH problem must however be College, University of London.
remembered, when the pH of the solution in the Mamdani, E.H. (1974). Applications of fuzzy
reactor is already at pH 7.0, any acid coming into algorithms for simple dynamic plant. Proc.
the reactor will rapidly drop the pH of the IEE, vol. 121, no. 12, 1585-1588.
reactor contents due to high sensitivity of the pH Mamdani, E. H. and S. Assilian (1975). An
to acid or alkali in this region of the titration experiment in linguistic synthesis with a
curve. Thus controller will immediately react to fuzzy logic controller. Int. J. Man-Machine
counter such large error and gradient values. Also Studies, 1, 1-13.
any reagent thus coming in will essentially Parrish, J.R. and C. B. Brosilow (1988). Nonlinear
neutralize only the incoming acid, as the reactor inferential control. AIChE J., vol 34, no.
content is already at set point pH value. Thus the 4, 633-644.
controller can over react in almost no time and Procyk, T.J. (1977). A self-organizing controller
consequently, good damping characteristics can be for dynamic processes. Ph.D. Thesis, Queen
rapidly acquired by the controller in regulator Mary College, University of London.
problems. Procyk, T.J. and E. H. Mamdani (1979). A
linguistic self-organizing process
controller. Automatica, 15, 15-30
To confirm the effectiveness of the controller in Proudfoot, CG., P. J. Gawthrop and 0. L. R.
dealing with load changes, an explicit load change Jacobs (1983). Self-tuning PI control of a
was introduced at about 25 min. This was in the pH neutralisation process. IEE Proc., vol.
form of a change in the process solution to 0.001 130 D, no. 5, 267-272.
N HCL-0.004 N CH3COOH solution. It must be Shah, I. M. (1991). Application studies of a fuzzy
emphasized that the titration curves for the two logic based self organizing controller. Ph.
solution types are markedly different, the high D. Thesis, University of Utah.
CH3COOH concentration solution demanding almost Shinskey, F.G. (1973). pH and pION Control in
two and half times reagent amount for Process and Waste Streams, John Wiley and
neutralization. Sons.
Tong, R.M. (1985). An annotated bibliography of
fuzzy control. In M. Sugeno (Ed.),
Next test was a repeat of the first part of the Industrial Applications of Fuzzy Control,
above test, now for a high acid-low buffer (0.004 Elsevier Science Publishers, North-Holland,
N HCL-0.001 N CH3COOH) type process solution. The pp.249-269.
result, shown in Fig. 6 is once again quite good, Yamazaki, T. (1982). An improved algorithm for a
thus showing the effectiveness of the controller self-organizing controller. Ph.D. Thesis,
schemes and the parameters chosen in handling the Queen Mary College, University of London.
three types of process solutions considered. Young, J.L. and R. A. Bohling (1981). Water
treatment at Homestake's Bulldog Mountain
operation. Mining Eng., vol 33, no. 12,
CONCLUSIONS 1689-1691.
Zadeh, L.A. (1973). Outline of a new approach to
The claim that the SOC scheme can provide adequate analysis of complex systems and decision
controller performance has been experimentally processes. IEEE Trans. Syst. Man Cybern.,
verified. Application to a highly nonlinear pH vol. 3, no. 1, 28-44.
process required some modifications of the basic
SOC algorithm. Both servo and regulator type
control problems were tested, and SOC indeed
performed remarkably well. The difficult control
problem of handling changing process dynamics was
also successfully tackled.

136
process

Kx>
outpu
t

Integratio
n | f(error
) ~~
|

Scalin
g Scalin
g
' I ' 1 I I I I I I I
Defuzzificatio
n Quantizatio
n 10 20 30 40 50 60 70
Time, min
SUPERVISOR
Y LEVE
L
Fig. 4. Controller same as in Fig. 3 but
process dynamic changed. Also shown is
Performanc
e Rule
s
effect of change in alkali strength
Contro
l Rule Modifi
e

Knowledg
e Base
fuzzy sets Huzz
y rules

Implicatio
n & Inferenc
e

Decisio
n Unit

Fig. 1. Functional diagram of FLC and SOC

11

t -
f\ t
\f\t\i\ iWillflil
,,-i
piW
X -,

5-

3-
J \J U- 1)i 1/ v1 1 1
10 20 30 40
Time, min Time, min
Fig. 2. pH response with controller Fig. 5. pH and alkali rate responses
designed using basic SOC scheme for an explicit load change

i i

n (
71 1 il /

3-4-
V \)\)w 1
10
\ '
20
Time, min Time, min
Fig. 3. pH response with controller Fig. 6. Controller same as in Fig. 5 but
designed using alternative SOC scheme acid composition changed

137
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

OSTECH (Ornamental Stone TExtural CHaracterization):


A Structure of Expert System to Evaluate and Describe
Numerically the Textural and Structural Features of
Ornamental Stone Slabs
G. Bonifazi and A. Costantini
Dipartimento di Ingegneria Chimica, dei Materiali, delle Materie Prime e Metallurgia,
Universita degli Studi di Roma "La Sapienza", Via Eudossiana, 18-00184 Roma, Italy

Abstract. The recognition of ornamental stone slabs presenting similar


characteristics in texture, color and other "aestetic aspects" is an
important problem in all those applicative fields, where the economic use
of such slabs is closely tied to the possibility of determining similar
characteristics, so that slabs may be grouped homogeneously. This problem
may be tackled through petrographic analysis, or through the reliance on
ad hoc or subjective evaluations of an expert. These subjective evaluation
may work very differently from sample to sample and it is difficult to
obtain from such classification procedures accurate quantification of the
criteria used. The developed and set procedure has been planned and
implemented in order to be easily interfaced with numerical simulation
programs, where the textural characteristic modelling represents one of
the fundamental "step" of the input parameters.

Keywords: Ornamental stone; aestetic characteristics; image analysis;


texture analysis; quality control.

INTRODUCTION To achieve the above mentioned goals it


need to create an integrated software (SU)
Procedures and methodologies based on and hardware (HD) architecture able to
picture processing techniques of sample conduct all the determinations and the
images have been greatly increased in analyses in order "to extract" from the
these last few years. This growing samples the informations for a further
development is mainly due to the wide classification, thus strictly related to
range of applications suitable for this the utilization of data that is:
technique and to the development of
computing systems more and more 1. the identification of the lythotypes
sophisticated at a lower and lower costs. and its characteristics;

Today's microcomputers are powerful enough 2. the classification from a marketable


to address digital image analysis problems point of view of different products
and are inexpensive enough to standard coming from the same lythological
equipmenet. Digital images recorded by a family, but having different
TV camera system provide an inexpensive aestethical characteristics;
and widely available source of data. An
"expert system" which uses reasoning 3. the definition of the numerical set of
techniques and incorporates comprhensive parameters representative of the
knowledge in its classification rules can textural and structural
extract information about rock-types from characteristics of the rock;
digital images.
4. the definition and formalization in
The possibility to operate a mineralogical numerical form of "quality standards"
and petrographical classification on the for the different products.
characteristics of a sample image,
deriving from a rock by numerical analyses Together with the effort to define and
can represent an important step in the quantify numerically "the quality" of a
development of procedures to define, to rock, an other aspect has to be
control and to verify, in automatic form, considered: the possibility to define a
the set of parameters conditioning the classification criteria able to recognize
results of beneficiation flowsheet further and select not only sample of rocks coming
applied to the rock itself. from different lythotypes but also samples
belonging to a same lythotypes but
The development of such a procedures has presenting similar or different
opened the way towards the possibility to aesthetical characteristics.
develop systems able to operate an
objective quantification of the The recognition of similarities in the
"ornamental stone characteristics" texture and in the characteristics of
starting from the exam of a certain number ornamental stone slabs is infact of
of sample images. It is simple to fundamental importance to determine their
understand as the "aesthetical proper utilization for ornamental
characteristics" of a marble or of a purposes. Further, given a sufficient
granite, for example, represent one of the stock of stone slabs, those that are
basic parameter to establish the "quality" obviously very similar can be selected to
of such a product. be assembled together, so that a

139
homogeneous surface is obtained, but often been developed following a sequence of
at great expense with an excessive waste operations and a logic of analysis similar
of material. to those utilized by the best expert
system today present on the market: the
For all these reasons, it is important to man.
study the classification procedures and
the determination of similarities in a
formal way, by defining appropriate DIGITAL IMAGES HANDLING,
analytical procedures of aestetical MANIPULATION AND ANALYSIS
characteristics and determining a suitable
hardware and software architecture able to Image Processing (IP) systems are good
example of a technology that facilitates
1. identify the characteristics to be used better and faster decisions (Aggarwal,
in the classification algorithm; 1977); they are used mainly for traditiona
Decision Support System (DSS) when a
2. classify the slabs in appropriate particular set of samples is analysed in
categories based on their properties order to extract informations not easily
and similarities; detectable by the human senses, as for
example shape or morphological
3. determine the distribution of characteristics of objects.
parameters of recognition with regard
to the class boundaries. Try to create an architecture of Expert
System (ES) able to extract and manage
informations from images on the bases of
ROCK SAMPLE AS SOURCE OF their "aestetical characteristics" is
INFORMATION certainly not a simple task.

Any analytical procedure is based on the Differently from a DSS an ES working in


exam of samples characterized by specific this field (IP) have to guarantee the
attributes. Very often aim of the possibility to face and solve the problem
analytical procedure is to evidence one of in an unique way. Each slab or set of
more these specific attributes. slabs have to be univocally characterized
and classified inside a market class.
For the determination discussed in this
paper the sample is represented by an A DSS is generally based on the following
image: related to a rock surface. All the items:
attributes necessary to perform the
aestetical characterization have to be 1. DSS usually addresses problems
extracted and transformed in numerical typically faced at the managerial
form starting from the image. level. Such problems are often not
well structured and have little or no
The surface of the sample is constituted existing solutions.
by a set of cristalline elements, each of
them having its own chemical, physical and 2. DSS attempt to incorporate quantitative
shape characteristics. Together with these management science methodologies into
parameters have to be considered the traditional data processsing
discontinuities (fractures, microfractures functions.
and cavities) characterized by a specific
typology, orientation and shape. 3. DSS generally facilitate easy,
convenient and interactive use by both
Defined an acquisition scale and fixed the computer and non-computer
related acquisition parameters, the professionals.
surface of the rock (sample image) can be
regarded as a whole, practically 4. DSS tend to diversify into various
continuous, of plain domains internally areas in terms of flexibility,
connected, each characterized by a applicability and adaptability for
frontier (profile) and by a chromatic decision making.
function able to describe the color of
each plain domain. It's thus possible to On the contrary, the major distinguishing
identify on the sample more than one characteristics of an ES may be summarized
domain defined by the same chromatic as follows:
function, but characterized by a different
profile and/or in different position 1. ES are typically domain-specific. That
inside the image. is, their individual areas of
application are well-defined,
The informations detectable from the structured, and narrowly bounded.
analyses are therefore based on an
evaluation of the type, the entity, the 2. ES tend to operate by utilizing
distribution and the mutual position of heuristic problem solving approaches.
the domains inside the sample itself. The That is, ES problems have no
data have to be successively transformed algorithmic solutions. Problem
in a set of values to characterize the solutions or conclusions are generated
image and to operate a comparison among by inferences often based on
different samples. incomplete or uncertain data.

The difficulty to create a numerical 3. ES are organized in three distinctive


structure able to operate an exam of the levels: knowledge base (problem
image, following the above mentioned solving rules, procedures, and data
criteria, is due mainly to the relevant to the problem domain),
bidimensionality of the problem and to the
synthesis of the informations resulting Fig. 1. Industrial engineer and expert
from the analyses. system interface [Adedeji, 1988]. An
expert system can be defined as a
All the research work and the first knowledge-based system with the knowledge
attempts to create and to implement the acquired from humane who are experts in
hardware and software architectures have their field. Knowledge base: This includes

140
superiority of one expert's judgementand
exprience over another. Infact, if a
problem can be solved in an algorithmic
fashion such that one expert's solution is
essentially as good as any other, then
conventional programming may be the
appropriate choice.

An ES for the recognition and


classification of ornamental stone
aesthetical characteristics have to
satisfy basically the following
conditions:

1. Ease of the data collection. This task


is one of the most difficult to
realize: the input data are
represented mainly by the images and
subordinately by other related
informations as origin, physical and
chemical characteristics and so on.
Machine vision is an area of
Artificial Intelligence (AI) that has
been more successful in research
laboratories than in actual practice.
The reason is that laboratories
operates under controlled conditions
on the contrary in a factory the
acquisition of image can be subjected
to all sorts of unanticipated
variations.
Fig. 1.
2. Frequency of problem occurrence. On the
the rules that represent the knowledge basis of different problems, related
which is known to the system and a list of with the aesthetic characterization, a
facts about the problem. There are many different and a more or less wide
ways to encode knowledge, each one having range of rules have to be formulated,
positive or negative points: developed and implemented.
- Semantic networks 3. Representation of data. The informative
- Object-Attribute-Value triplets content of each sample (image) is very
- Rules high, thus a set of procedure based on
- Frames different approach have to be
- Logical expression developed in order to operate a
synthesis of the informations and to
Inference engine: This the program that generate a data set (pattern vector)
applies the rules to the input data, draws subjected to be handled. Can the
conclusions and solves internal conflicts. pertinent knowledge (data set
Among other things, the inference engine associated to each sample image) be
must decide where to start and path to organized as a knolegde base? Are
follow if more than one alternative there clear-cut relationships or
exists. The two most popular methodologies heuristics that can be handled in
used in inference engines are backward and terras of IF-THEN statement? A correct
forward chaining. Furthermore, both can be answer to these two questions strongly
differentiated in either depth-first or contribute to a valid representation
breadth-first search. of the data.

working memory (task-specific data for 4. Value of problem domain. An analysis of


the problem under consideration), and the different aspects, related with
inference engine (generic control the classification, has to be
mechanism that applies the axiomatic conducted to establish the most
knowledge in the knowlegde base to the appropriate and convenient range of
task-specific data to arrive at some aspects to emphasize during the
usable solution or conclusion). analyses to properly identify the
parameters of interest.
The main goal in the definition of a 5. Costs. An aspect to be considered,
structure of ES working in the field of strongly conditioning the development
ornamental stone textural characterization of such an ES is the cost; either in
is to create a human-reasoning-emulation terms of HD and SU or in terms of man
architecture which can mimic human resource assesment (experts) during
behaviour in solving problem by employing the project. The point is particularly
symbolic knowledge (Fig. 1 ) . delicate considering the necessity to
The implmentation of such a procedure is a merge together different expertise
knowlwedge-based system which utilize a ranging from IP (Andrews, 1972), AI,
set of rules, stored in its rule base, petrography, geology, natural
able to solve complex problems as those resources, architecture, ornamental
related to the aestetical characterization stone marketing and so on.
of ornamental rocks, for which traditional
algorithms either do not exists or are 6. Availability of human experts. Is there
impractical (Fig. 1 ) . access to willing experts to provide
inputs that will form the knowlegde
A key factor in the implementation of such based? The answer to this question is
a kind of ES is that this application area not easy. A large number of
is heavily dependent on expert heuristics. classification criteria and
That is, problem-solving heuristics whose classification procedure exists
robustness can serve as measure of the strictly related to the lythological

141
characteristics of the sample (i.e. further eleborations and interactions with
marble and/or granite) and on the other informative structure is the main
"experience" and the "approach" of problem arising during the encoding stage.
each expert. Try to operate the The attempt to establish a hyerarchical
merging among all these different structure and a procedural sequence based
"phylosophies" is probably the hardest on these informations contrasts with the
goal to reach in developing a development of a system having great
procedure to create an ES working on flexibility. Thus two solutions based on
ornamental stones.
a) a priority criterion, or
7. Resource availability. Are the needed
resources (computers, programmers, b) an heuristic criterion modelled in term
software, etc.) available? Uhat will of IF-THEN-DO conditions can be
it cost to upgrade the existing aopted.
resources to the adequate level
required for the expert system Both present advantages and disadvantages.
project? Two aspects have to be
considered: The first (priority criterion) gives good
result when applied to subjects (stones)
i) the numerical computing of each characterized by a data set markedly
data set of the characteristics of different, this way the goal of
the image related to a stone identification and/or classification can
sample; be quickly reached. The second (heuristic
criterion) even if less fast enable to
ii) the development of SU architecture solve the problem in the best way.
able to handle and manage the
whole set of informations on the Digital informations
bases of rules.
These informations derive directly from
For the point i) ("numerical the analytical systems and are the results
computing") a large number of programs of DIP performed on rock samples. During
exists. Inthis case it needs to re- the analyses the ES perform a control of
arrange or to adjust the codes in the results successively obtained in order
order to perform the requested to suggest the further analytical
analyses inside a pre-defined procedure to adopt to fully characterize
environment eventually easy and classify the sample.
interfaceable with the SU architecture
mentioned in i i ) . On the contrary for Interactions between pre-assigned and
point ii) ("development of SU digital informations
architecture") the interaction among
experts of different areas is the base Great importance have to be attributed to
to establish relations between the the data base dimensions and structure
numerical data set resulting from thr containing the informations, moreover the
analyses on each sample image and its system (rules) analyze both forward and
aesthetical characteristics. It is backward, seeking rules or facts that
easy to understand as in this case a support an action.
great effort have to be realized in
terms of human, HD and SU resources. The integrations of the data deriving from
the experience (pre-assigned) with those
deriving from the DIP is not easy
ALGORITHMS ANALYSIS especially if interaction criteria have to
AND DEFINITION be established.

The ornamental rock are commercialized in


the form of slabs of different size. Each EXPERIMENTAL AND DISCUSSION
slab (surface) can be considered as a
whole of crystalline elements All the research work developed up to now
characterized by their physical, chemical has been mainly focused on the teraptative
and geometrical individuality. The to identify the best procedure and the
distribution of these elements (mineral best algorithms to characterize the slab
grain) on the surface, together with the
polishing degree, affect the aesthetic
characteristics of the sample. TABLE 1.1 - Sample (Granite): Sardinian
Pink. The rock is characterized by coarse
A criteria have to be formulated to define grains (minerals). The presence of pink
a basic procedure able: feldspar phenocrystals is noticeable. The
textural characteristics just described
a) to associate to each sample a certain are well evidenced by the values assumed
number of pre-assigned or pre-defined by the parameter R 3 e ( = 0*.45,90,135)
informations (commercial name, in comparison with the same assumed by the
exploitation site, lythological and same parameter for the Sardinian Grey (see
petrographical characteristics, etc.); Table 1.2).

b) to quantify numerically by Digital


Image Processing (DIP) the Textural
informations of rock surface parameter 45 90 135
characteristics (color analysis,
texture analysis, degree of pictorial 2
Rie-10- 0.5448 0.5586 0.5877 0.5547
omogeneity, morphological and Rae-10* 1 0.6708 0.6386 0.5632 0.6474
morphometrical analyses of the Rae'lO-* 0.7923 0.7926 0.7925 0.7925
constituents, etc.). R 4 e'10* 3 0.3995 0.3995 0.3995 0.3995
R5e-10+o 0.3750 0.3796 0.3939 0.3777
Pre-assigned informations R e'10- s 0.3815 0.3815 0.3816 0.3815
R 7 e-10+ 3 0.1352 0.1355 0.1358 0.1355
The definition of this set of data and Ree-10- 1 0.5516 0.5491 0.5436 0.5498
their arrangement in a form suitable of Rve-10* 1 0.3860 0.3862 0.3862 0.3862

142
TABLE 1.2 - Sample (Granite): Sardinian TABLE 1.6 - Sample (White Marble); Carrara
Grey. The sample is characterized by fine White Veine Brouille'.
grains. The presence of grey-whitish
feldspars is noticeable. Not frequent the
presence of pink feldspars. Textural
parameter 0 45 90 135
1
Textural ie'10- 0.4533 0.4845 0.4887 0.4783
1
parameter 0 45 90 135 R 2 0 " 10+ 0.6564 0.5713 0.5388 0.5862
Ra e "10+* 0.3755 0.3755 0.3756 0.3755
3
-,--* 0.5782 0.5716 0.6167 0.5764 R4 e "10+ 0.1401 0.1401 0.1401 0.1401
Re e 10-
'
1
0..6913 0..7153 0..5929 0..7042 R '10+
0.3804 0.3947 0.3994 0.3918
Ra e 10+* 0..1262 0..1261 0,.1262 0 .1262 Re '10+ 3 7345 0.7344 0.7346 0.7345
R4 e 10-3 0..4319 0,.4319 0,.4319 0,.4320 Rre ' 10+ 2017 0.2024 0.2012 0.2023
1
Ra e 10+
' 0.,3944 0..3874 0..4092 0..3897 Re '10+ 5680 0.5612 0.5599 0.5625
RA 0 10-* 0..3831 0..3831 0..3831 0..3831 R*'10+ 0.4041 0.4045 0.4042 0.4044
R7 e 10+3 0..2526 0..2523 0..2536 0..2524
1
Re e 10+ 0..5672 0..5688 0..5601 0,.5681
1
Rv 0
' 10- 0..4055 0..4956 0..4053 0..4055

Results related to the values assumed by


the textural parameters (see Appendix)
derived for the different lithotypes. All
TABLE 1.3 - Sample (White Marble): Carrara the exams has been conducted on 30x30 cm
White Bruille'. The rock is characterized sample images, acquired in standard
by the presence of grey patches and/or lifihtin conditions. The results well
veins. Operating a comparison with the evidence the high selectivity of these
values of the textural parameters derived parameters. The proposed approach can
from the analyses for this sample set and operate a synthetic description ("difiital
those derived from the other marbles (Tab. values") of ornamental stone preeentin
1.4, 1.5 ed 1.6) it is possible to see as different aesthetic characteristics.
a good correspondence exists between
aestehtical and numerical characteristics.
surface by a set of parameters directly
derived by IP.
Textural
The possibility to obtain a set of digital
parameter 0 45 90 135
informations able to well characterize the
R1e-10-S 0,.4889 0.5222 0.5253 0..5223 aesthetical features of each rock sample
Ree-lO* 1 0,.6455 0.5609 0.5285 0,.5605 is infact the most difficult goal to
Rae-lO* 7 0,.3066 0.3066 0.3067 0..3067 reach. IP techniques must mimic the
R 4 e'10** 0,.1239 0.1239 0.1239 0..1239 logical route usually followed by human
R 5e -10* 0..3834 0.3983 0.4001 0..3982 expert to judge a slab.
R A e -10+ 2 0,.6909 0.6909 0.6910 0..6808
R 7 *10- 3 0..1758 0.1766 0.1754 0..1766 Two facts have to be considered in the
Ree-10* 1 0.,5603 0.5535 0.5523 0..5534 choice of the number and of the type of
R9e-10*1 0..3972 0.3977 0.3874 0..3977 the digital informations to be derived:

1. the time needed for computation,


TABLE 1.4 - Sample (White Ilarble): Carrara
White JT. 2. the selectivity of the algorithms and
the derived numerical parameters;

Textural both influencing the possibility to


parameter 0 e conduct in a reasonable way all the
45 90 135 analyses.
R1 10" = 0 .4460 0 .4740 0 .4770 0 .4737
Re 0 1 0 +
1
0 .6515 0 .5735 0 .5374 0 .5733
The identification of the way in which the
R 3 0 . 1 0- 6 0 .6025 0 .6025 0 .6026 0 .6025 aesthetic surface characteristics of an
R4 e ' 10+3 0..2111 0..2111 0.,2111 0..2111 ornamental stone can be recognized and
R ' 10+ 0..3842 0..3961 0.,3996 0..3964 mesured by IP techniques, can be worked
Re e 10+
e
0,.9059 0..9059 0..9060 0..9059 out by fixing three different groups of
R7 1 0 +
3
0,.2306 0..2313 0..2297 0,.2313
parameters (Bonifazi, 1988):
1
Re ' 10+ 0. 5723 0. 5 6 6 0 0. 5647 0.,5659
R * ' 1 0 +
1
0..4095 0..4098 0..4095 0..4098
parameters depending on the surface
color levels and that cen be measured by
spectral analysis of the radiation
reflected by surface themselves;

- parameters depending on the polishing


TABLE 1.5 Sampl e 1CWhite M a r b l e ) : Carrara degree of surfaces, and derivable by the
White D. analysis of the noise presenting the
spectrum of the radiation diffused by
surfaces;
Textural
parameter 0 45 90* 135 - parameters depending on the textural and
structural characteristics of surfaces and
a
Rie-10- 0 .5812 0 .6199 0.6270 0 .6223 computable by the analsysis of texture and
Rse-lO^ 0 .6440 0 .5603 0.5264 0 .5587 structure of the surface themselves.
R3e-10+* 0,.2119 0 .2192 0.2192 0,.2192
R4'10+3 0,.1148 0 .1148 0.1148 0,.1148 The algorithms adopted to quantify the
R*e-10+ 0,.3830 0,.3975 0.4014 0,.3988 above mentioned parameters are those
R6e-10+= 0,.6673 0,.6673 0.6674 0,.6673 deriving by the classical IP texture
R7-10+3 0..1305 0..1313 0.1309 0..1313 analyses (see Appendix) (Resenfeld, 1970).
1
Re"10+ 0.,5444 0.,5378 0.5364 0.,5376
R,e-10+1 0..3813 0,.3819 0.3816 0..3819 The basic assumptions of such a set of

143
procedure is that an image, in this case a coefficients able to quantify the
slab surface, can be considered as a aesthetical characteristics of a stone.
continuos two-dimensional function f(x,y)
of two planar variables x and y, that for The considerations developed in this paper
each pair of coordinates can provide a and the results derived show as a correct
value which is representative of the color interactions of the different set of
level value (Schacter, 1978) (see informations ("pre-assigned and digital")
Appendix). can conduct to develop an integrated tool
able to extend the frontiers of computer
Texture generally means the distribution usage, usually well marked and limited in
of the spatial variations of color levels this field (ornamental stone
(or gray, if the image is in black and characterization), for characterization,
white) of an image related to a surface. classification and decision making.
An image can be considered as formed by
more elementary scenes.

An elementary scene can be reduced to a


uniform pictorial element. More generally
an elementary scene can be characterized
by three fundamental parameters: the
profile, the skeleton and the structure.
The structure can be characterized by a
certain number of factors in reference to
the elements that form an elementary
scene:

dimensions, levels of color, ori entation


of elements;

- number and frequencies of the different


elements;

- different elements' distribution.

The texture evaluation of the image


surface can be carried out when the
average size of the structure elements is
smaller than the analyzed area; the
statistical recurrences in the
distribution of elements in the elementary
scene can be noticed.

Both conditions are verified in the


present study. A statistical analysis of
the distribution of the various elements
that constitute the elementary scenes
(and/or the whole image) can represent the
most immediate approach for the texture
evaluation.

A certain number of parameters can be


derived this way. Each parameter can be
considered as representative of the mutual
spatial relations among the color levels
associated to each pixel constituting the
image. These parameters work out an
exemplification and a structuring of the
informations, in this case the aesthetical
characteristics present in the image.

The results of the experiments (see Table


1) conducted on a large number of samples
shows these procedure can be considered
and adopted as the best approach for the
determination of the "digital
informations" necessary for the
development of an ES able to carry out the
requested aesthetical characteristics of
stones.

The textural characterization of


ornamental stone slabs by means of image
processing, enables both to well
synthetize the set of aesthetic parameters
that characterize a stone and to carry out
an objective documentation on which the
successive exams or processing will be
based.

The results show how by this procedure it


is possible to carry out a large number of
analysis in a completely automatic way; to
set up data base formed by images and
their related textural values to have an
integrated analogic-digital information
system and, finally, to define

144
APPENDIX - Schematic representation of the algorithms and the procedures adopted to
quantify image aesthetical characteristics by the classical IP texture analyses.

For texture evaluation, the definition of suitable color-level, spatial-dependence


matrices of an image may be defined. Consider an image of size N M by N y where they
indicate respectively the number of pixels in each coordinate direction which are use to
represent the image. If the image is in black and white, each pixel can be characterized
by N g levels of grey denoted by (g(i,j) = 0,1,...Ng, -V i,j ) , (Bonifazi & Massacci, 1988).
Thus, the image can be defined by the following set:

P = { (i,j)|(i,j) ( U x Ly) } (1)

where LK and Ly are the two sets, respectively LH = { 1,2,...,NM ) and L y = { 1,2 Ny }
being g(i,j) the level of grey of the pixel having (i,j) coordinates. The formulation of
these relations enables the construction of a grey-tone spatial-dependence matrix M k , k
being the serial number of the directions along which the distance d is calculated.Thus,
if the main directions are 0, 45, 90 and 135 the corresponding grey-tone spatial-
dependence matrices are defined by the following relations:

Mi(l,ffl,d)= #{((p,q),(r,s)) (Ly x Lx)x(Ly x L x ) | p-r = 0,


|q-s| = d, g(p,q) = 1, g(r,s) = m) (2)
M*(l,m,d)= #{((p,q),(r,s)) (Ly x Lx)x(Ly x L x ) | [ p-r = d, q-s=-d ] ,
or [ p-r = -d, q-s = d ] , g(p,q)=l, g(r,s)=m) (3)
M 3 (l,m,d)= #{((p,q),(r,s)) (Ly x Lx)x(Ly x L x ) | | p-r| = d,
q-s = 0, g(p,q) = 1, g(r,s) = m) (4)
M 4 (l,m,d)= #<((p,q),(r,s)) (Ly x Lx)x(Ly x L x ) | [ p-r = d,
q-s = d ] , or [p -r = -d, q - s = - d ] , g(p,q)=l, g(r,s)=m) (5)
being # the number of couples of pixels that check the mentioned conditions.

Uith reference to the first of the four matrices, the element Ml(l,m,d) is equal to the
set of ordered couples of pixels: (PQ) (r,s) whose distance according to the
coordinate axis y is null and whose distance in the x coordinate direction is d, which
also have grey levels g(p,q)=l and g(r,s)=m respectively. The matrices Mk (k = 1,...,4)
are symmetrical, that is:

M k (l,m,d)=M k (m,l,d) (6)

and the size of the matrix M k is equal to NgxNg. The following correspondence between
directions and the matrices are defined: 0,k = 1; 45,k = 2; 90,k = 3; 135,k = 4.

From the M k (l,m,d) matrices the required information on the image characteristics:
homogeneity, contrast, type of boundaries, image complexity, etc. can be obtained. Image
processing is carried out by taking into account normalized grey-tone spatial-dependence
matrix, (Aggarwal et al., 1977), which is defined as a matrix whose generic element:

M' k (l,m,d) = M k (l,m,d)/T k (7)

which is defined, of course, only if 1, m, d are integers and where:

if k=l, then T-, = 2(N H -l)N y (8)


if k=2, then Ta = 2(N*-1)(N y -1) (9)
if k=3, then T3 = 2N H (N y -l) (10)
if k=4, then T4= 2(N H -1)(N y -1) (11)
From these normalized matrices M' k (l,m,d) a series of parameters can be derived:

Rii. = - E N * m ., M' k (l,m,d)) e (12)


R 2 k = *-,- zz E N " 1 . 1 [M' k (l,z-l,d) + M' k (l,z + l,d)] (13)
R3k = 11.1 ENc,.i [(lm)M' k (l,m,d)- ]/ (14)
where , , and are respectively the means and the standard deviations of the
marginal distributions connected with M' k (l,m,d)) .

R4k = - E N e m .i ( 1 - ) s M' k (l,m,d) 2 (15)

where _ is the mean of the values of the M' k (l,m,d) matrix.

R 5k = E^i-i EN*m-i M' k (l,m,d) /(l + (l - m ) e ) (16)


Rk = E 2 N z - 2 EN*i-i zM' k (l,z-l,d)) (17)
Ry* = 2 " , - E N 1 . 1 (z - R k )*M' k (l,z-l,d) (18)
Re k = ~ " , - 2 "- M' k (l,z-l,d)Log{ENgj=lM'(j,z-j,d)} (19)
R9k = - E N 1 - 1 ENflm-i M' k (l,m,d)LogM(l,m,d) (20)
These 9 parameters can be calculated with reference to each of the 4 matrices M k (l,m,d),
(k = 1,...,4), corresponding to each direction. Notice that of particular interest are the
first three parameters of the R l k (i=l,2,3..) which can be identified as the angular
second moment, the contrast and the correlation in the respective directions. These
definitions directly apply.

145
REFERENCES Bonifazi, G., P. Massacci and G. Patrizi
(1989). Pattern recognition for
Adedeji B. Badiru (1988). Expert systems texture classification of ornamental
and industrial engineers: a stone slabs . IEEE International
practical guide to a successful Conference on Image Processing.
partnership. Computers Ind. Engng. Singapore, pp. 851-860.
1, pp. 1-13. Rosenfeld, A. and B.S. Lipkin (1970).
Aggarval, J.K., Duda R.O., Rosenfeld A., Texture synthesis. In: Rosenfeld.
(1977), Computer Methods in Image A. and Lipkin. B.S. (Editors).
Analysis, IEEE Press. New York. Picture Processing and
Andrews, H.C., A.G. Tescher, and R.P. Psycopicftorics. Academic Press, New
Kruger (1972). Image processing by York, 309-345.
digital computers. IEEE Spectrum. 9_ Schacter, B., A. Rosenfeld and L. Davis
pp. 20-32. (1978). Random mosaic models for
Bonifazi, G. and P. Massacci (1988). textures. IEEE Transactions on
Ornamental stone characterization by Systems. Man and Cybernetics. 1_ 694-
image processing. International 702.
Conference on Operational Research.
Management Science and New
Technologies. EURO IX - TIMS XXVIII.
Paris.

146
Copyright IFAC Expert Systems in Mineral and NEW METHODS
Metal Processing, Espoo, Finland, 1991

NEURAL NETWORK MODEL FOR RECOGNITION OF


CHARACTERS STENCILED ON SLABS
K. Asano*, J. Tateno*, S. Maruyama*, K. Arai*, M Ibaragi** and M. Shibata**

^Technical Research Division, Kawasaki Steel Corporation, 1 Kawasaki-cho, Chiba-city, Chiba 260, Japan
**Chiba Works, Kawasaki Steel Corporation, 1 Kawasaki-cho, Chiba-city, Chiba 260, Japan

Abstract. T h e N o . 3 c o n t i n u o u s c a s t i n g p l a n t at C h i b a W o r k s of K a w a s a k i
S t e e l C o r p o r a t i o n i d e n t i f i e s e a c h s l a b by s t e n c i l i n g a c o d e on t h e s u r f a c e of
e a c h s l a b . T h e s t e n c i l e d c h a r a c t e r s a r e o c c a s i o n a l l y d e g r a d e d by b l u r r i n g o r
s p o t s of p a i n t a n d m a y b e c o m e u n r e a d a b l e f o r o p e r a t o r s w h o h a n d l e t h e s l a b s .
In o r d e r to d e t e c t s u c h d e f e c t s of c h a r a c t e r s a n d to a s s u r e t h e q u a l i t y of t h e
i d e n t i f i c a t i o n c o d e s , a c h a r a c t e r r e c o g n i t i o n s y s t e m h a s b e e n d e v e l o p e d by
u s i n g a t h r e e - l a y e r neural n e t w o r k m o d e l . The c h a r a c t e r s are e x t r a c t e d from
t h e i m a g e t a k e n by MOS c a m e r a s by m e a n s of a p a t t e r n m a t c h i n g m e t h o d .
A f t e r p r e p r o c e s s i n g , t h e i m a g e s of e a c h c h a r a c t e r a r e p r o v i d e d to t h e n e u r a l
n e t w o r k m o d e l w h i c h p r o d u c e s a c r i t e r i o n of s i m i l a r i t y b e t w e e n t h e c h a r a c t e r s
a n d t h e s t a n d a r d p a t t e r n s . An a l a r m is g i v e n to o p e r a t o r s w h e n t h e q u a l i t y of
c h a r a c t e r s h a s b e e n j u d g e d to be d e f e c t i v e by t h e n e u r a l n e t w o r k m o d e l . T h e
c o r r e c t c l a s s i f i c a t i o n r a t e of t h e c h a r a c t e r s is m o r e t h a n 9 9 % , a n d t h e r a t e o f
o v e r l o o k i n g d e f e c t i v e c h a r a c t e r s is 0 % . T h e s y s t e m h a s b e e n successfully
u s e d f o r t h e q u a l i t y a s s u r a n c e of s l a b m a r k i n g .

K eyw ords. Character recognition; Classification; Neural nets; Image


p r o c e s s i n g ; Steel industry; Casting; Alarm s y s t e m s ; Automation.

INTRODUCTION In r e c e n t y e a r s , t h e r e h a s b e e n a g r o w i n g
i n t e r e s t in a r t i f i c i a l n e u r a l n e t w o r k m o d e l s .
The c o n t i n u o u s c a s t i n g (CC) p r o c e s s a l l o w s In p a r t i c u l a r , m u l t i l a y e r f e e d f o r w a r d neural
m o l t e n s t e e l to be c o n t i n u o u s l y c a s t a s t r a n d , networks have been used for pattern
w h i c h is c u t by t o r c h e s i n t o s l a b s w i t h a s i z e classification, process control, robot control
of a b o u t 25 0 X 2 0 0 0 X 1 0 0 0 0 m m . a n d s o o n . It w a s a n t i c i p a t e d t h a t a s i m i l a r
The i d e n t i f i c a t i o n of each slab is very method could be used for character
i m p o r t a n t to e n s u r e its p r o m p t d e l i v e r y to a recognition.
h o t s t r i p m i l l . T h e N o . 3 C C P l a n t at C h i b a A system for recognition and quality
Works of Kawasaki Steel Corporation evaluation of c h a r a c t e r s stenciled on the
i d e n t i f i e s e a c h s l a b by s t e n c i l i n g a c o d e on s u r f a c e of s l a b s w a s t h e r e f o r e developed,
t h e s u r f a c e of e a c h s l a b j u s t a f t e r c u t t i n g by u s i n g a t h r e e - l a y e r n e u r a l n e t w o r k m o d e l , to
t h e t o r c h . As t h e s t e n c i l e d c h a r a c t e r s are e n s u r e t h e c o r r e c t t r a c k i n g of t h e s l a b s a n d
o c c a s i o n a l l y d e g r a d e d by b l u r s or by s p o t s of their prompt delivery. This paper describes
p a i n t a n d m a y b e c o m e u n r e a d a b l e for the the neural network model and the
o p e r a t o r s who h a n d l e the s l a b s , a quality c o n f i g u r a t i o n of t h e s y s t e m .
a s s u r a n c e s y s t e m for t h e s e c h a r a c t e r s has
b e e n in g r e a t n e e d .
Although a number of investigations on QUALITY EVALUATION OF
c h a r a c t e r r e c o g n i t i o n h a v e b e e n d e s c r i b e d in THE CHARACTERS
the literature and several character
recognition systems have been developed, T o r e c o g n i z e t h e c h a r a c t e r s p r i n t e d on t h e
m o s t of t h e m d e a l w i t h c h a r a c t e r s p r i n t e d on s u r f a c e of i n t e r m e d i a t e p r o d u c t s l i k e s l a b s ,
p a p e r , in w h i c h t h e q u a l i t y of c h a r a c t e r s is t h e f o l l o w i n g p o i n t s n e e d to be c o n s i d e r e d :
r e l a t i v e l y h i g h a n d t h e b i n a r y i m a g e s of t h e 1) T h e c o n t r a s t of c h a r a c t e r s is r e l a t i v e l y
characters can be easily obtained. The l o w , their e d g e s are not c l e a r l y d e f i n e d , and
m e t h o d o l o g y for r e c o g n i z i n g c h a r a c t e r s t h a t besides, t h e i r i m a g e is u s u a l l y identified
are b l u r r e d , deformed or a c c o m p a n i e d by from a d i s t a n c e under n o n - u n i f o r m l i g h t i n g ,
n o i s e s u c h as t h o s e on t h e s u r f a c e of a s l a b s o t h a t s i m p l e t h r e s h o l d i n g is n o t a p p l i c a b l e
has not p r e v i o u s l y been e s t a b l i s h e d . to o b t a i n t h e g o o d b i n a r y i m a g e .

147
2) The quality of the characters itself is not p r o c e d u r e repeatedly adjusts the weight of the
important as long as they are readable by connections in the network to minimize a
operators. measure of the difference between the actual
The following additional points also need to output vector of the network and the desired
be considered in c o n s t r u c t i n g a system: output vector.
1) The characters are marked by using This difference m e a s u r e , E, is defined as
stencils as shown in Fig. 1, so that they
c o n s i s t of a number of small areas rather than =^-)2> (3)
a series of continuous l i n e s .
2) Due to high temperature and r o u g h n e s s of where c is an index over cases ( i n p u t - o u t p u t
the surface of a slab, the image of a character p a i r s ) , j is an index over output units, and d-
is sometimes degraded by b l u r s , smears and is the desired output value for y .
other types of noise as shown in Fig. 2, Each weight is changed by the following rule:
black squares s h o w i n g g r a y - s c a l e l e v e l s .
3) The p u r p o s e of the system is not only to dE (4)
Aw = -
recognize characters but also to detect
unreadable characters and give an alarm. where is the learning rate.
However, since no information about the We used the following rule for accelating
stenciled characters is given to the system, convergence:
both recognition and quality evaluation need
to be done s i m u l t a n e o u s l y . Aw(n) = -n=-+aAw(n-\)' ^ '
C o n s e q u e n t l y , basic requirements which the aw (n)
system has to meet are the recognition and where s u b s c r i p t n indexes the presentation
quality evaluation of characters with a margin number, and a is a constant which determines
that permits degradation of the characters to a the c o n t r i b u t i o n of the past weight change to
certain degree. the current weight change.
Methods available for pattern recognition can The neural network is applicable to pattern
be classified as either statistical or recognition by making the input units and the
s t r u c t u r a l . With the former method, the output units c o r r e s p o n d to patterns and
similarity between two patterns is evaluated. categories respectively. To recognize a
If we take a certain function which r e p r e s e n t s character, the simplest way is to use
the similarity between a sample pattern and character patterns c o n s i s t i n g of pixels and to
the standard pattern of each category to make those pixels c o r r e s p o n d to the input
recognize the c h a r a c t e r s , then we can use the units. In this case, both feature extraction
value of the function to evaluate the quality and c l a s s i f i c a t i o n can be executed by the
of the characters to satisfy the r e q u i r e m e n t s . network.
We took a three-layer neural network model
to construct a function r e p r e s e n t i n g this APPLICATION OF THE NEURAL NETWORK
similarity by utilizing its ability for function MODEL TO THE SYSTEM
approximation.
Statistical pattern recognition is based on
Bayesian d e c i s i o n . From the standpoint of an
PATTERN RECOGNITION USING optimal c l a s s i f i c a t i o n , the neural network
A MULTILAYER NEURAL NETWORK MODEL should be trained so that it represents the
Bayes a posteriori probabilities of each
Let us consider the three-layer feedforward category (Asoh 1989). However, this is not
neural network model shown in Fig. 3. Units r e a l i s t i c because training in this way requires
in the input layer have only one input and a a vast amount of samples to represent the
linear activation function. The activation rule d i s t r i b u t i o n s of p a t t e r n s . With the stenciling
for a unit in the hidden layer and the output system in q u e s t i o n , it is difficult to obtain
layer is a non-linear monotonic function of enough samples of unreadable characters in
the weighted sum of its inputs JC;; every category because character deterioration
to such a degree rarely h a p p e n s .
n =f(Lwfixi ~eJ> (1) C o n s e q u e n t l y , we utilized the neural network
in another way. We determined the standard
where yj is the j-th output value, wjt are the patterns for each category artificially and
weights of c o n n e c t i o n s , and ] is a bias. trained the network so that its output values
We used a sigmoid function for the activation would indicate a kind of similarity between
function: the artificial standard patterns and the sample
p a t t e r n s . The network works not only as a
f (*)=W(1+e-*). (2) n o n - l i n e a r d i s c r i m i n a n t function but also as a
evaluation function for the quality of the
The neural network works as a multi- characters.
dimensional non-linear function as a whole, Examples of the standard patterns are shown
which can be trained to approximate the in F i g . 4 , l o w - p a s s filtered patterns being
desired i n p u t - o u t p u t mapping by learning used and consisting of 160 pixels
from a set of examples. c o r r e s p o n d i n g to the number of input units.
B a c k - p r o p a g a t i o n (Rumelhart, 1986a, 1986b), Each pixel has a value between 0 and 1, and
which is a kind of g r a d i e n t - d e s c e n t method, the hidden layer contains 20 units. There are
is widely used as a learning p r o c e d u r e . The two n e t w o r k s , one for the alphabetic and the

148
o t h e r for the n u m e r i c c h a r a c t e r s . The o u t p u t the case of t h e h i d d e n units which are
layer for the former contains 14 units a c t i v a t e d by G. T h e s i g n s a r e vice-versa
c o r r e s p o n d i n g to t h e c h a r a c t e r s f r o m A to N , r e g a r d i n g t h e c o n n e c t i o n s to t h e o u t p u t u n i t
t h e o t h e r c h a r a c t e r s n o t a p p e a r i n g in the corresponding to G, so that t h e s e units
i d e n t i f i c a t i o n c o d e s . The o u t p u t l a y e r for the c o n t r i b u t e to t h e c l a s s i f i c a t i o n of D a n d G.
l a t t e r c o n t a i n s 10 u n i t s c o r r e s p o n d i n g to t h e Figure 10 shows the weights of the
n u m e r a l s from 0 to 9. T h e n e t w o r k was c o n n e c t i o n s f r o m t h e i n p u t u n i t s to t h e s e
t r a i n e d by u s i n g b a c k - p r o p a g a t i o n , u s i n g t h e hidden units, black and white squares
a r t i f i c i a l s t a n d a r d p a t t e r n s for the t r a i n i n g meaning positive and negative values,
p a t t e r n s for e a c h c a t e g o r y . T h e d e s i r e d o u t p u t respectively. A comparison between Fig. 5
vectors are unit vectors which are o r t h o g o n a l and F i g . 10 s h o w s that the w e i g h t s of
to e a c h o t h e r a n d h a v e an e l e m e n t w i t h a c o n n e c t i o n s f r o m t h e f e a t u r e s for D to t h e
value of 1 at the position of the h i d d e n u n i t s a c t i v a t e d by D h a v e positive
corresponding categories. values, and those to the hidden units
a c t i v a t e d by G h a v e n e g a t i v e v a l u e s f o r t h e
c a s e of t h e n e t w o r k c o n t a i n i n g 2 0 h i d d e n
CAPABILITY OF THE NEURAL NETWORK u n i t s . A g a i n , t h i s is v i c e - v e r s a r e g a r d i n g t h e
w e i g h t s of c o n n e c t i o n s f r o m t h e f e a t u r e s for
G . C o n s e q u e n t l y , t h e w e i g h t s of c o n n e c t i o n s
D i s c r i m i n a t i o n of S i m i l a r Characters
f r o m t h e i n p u t u n i t s to t h e h i d d e n u n i t s w o r k
as a template for the classification of
S o m e of s t e n c i l p a t t e r n s s h o w n in F i g . 1
c h a r a c t e r p a t t e r n s . H o w e v e r , in t h e c a s e of
r e s e m b l e each o t h e r , like p a t t e r n s for D and
the n e t w o r k c o n t a i n i n g 8 hidden u n i t s , some
G t h a t a r e s h o w n in F i g . 5 . T h e r e a r e o n l y a
w e i g h t s have the o p p o s i t e s i g n , w h i c h may
s m a l l n u m b e r of p i x e l s t h a t a r e n o t c o m m o n
w e a k e n t h e p o w e r of r e c o g n i t i o n . S o w e u s e d
w h i c h w o r k as f e a t u r e s to d i s t i n g u i s h one
t h e n e t w o r k c o n t a i n i n g 2 0 h i d d e n u n i t s for
from the o t h e r . When the s a m p l e p a t t e r n s are
further study.
d e r i v e d f r o m t h e i m a g e s of c h a r a c t e r s , t h e
f e a t u r e s a r e w e a k e r d u e to d e t e r i o r a t i o n .
Figure 6 shows intermediate patterns between
D a n d G o b t a i n a b l e by a l i n e a r i n t e r p o l a t i o n SYSTEM CONFIGURATION
of t h e t w o p a t t e r n s . Figure 7 shows the
activation levels of the output units
Hardware Configuration
c o r r e s p o n d i n g to D a n d G w h e n t h e p a t t e r n s
of F i g . 6 w e r e p r e s e n t e d to t h e i n p u t l a y e r
a f t e r l o w - p a s s f i l t e r i n g l i k e t h a t in F i g . 4 . T h e s y s t e m c o n f i g u r a t i o n is s h o w n in F i g .
While patterns D and G have a large 1 1 , a n d t h e s p e c i f i c a t i o n s of t h e s y s t e m a r e
c o r r e l a t i o n c o e f f i c i e n t , the a c t i v a t i o n levels s h o w n in T a b l e 1.
of t h e t w o units show sigmoidal curves In N o . 3 CC Plant, identification code
r e g a r d i n g the i n t e r m e d i a t e p a t t e r n s for the m a r k i n g is d o n e s o o n a f t e r t o r c h cutting.
t w o o r i g i n a l p a t t e r n s , w h i c h is h e l p f u l for E a c h s l a b a f t e r b e i n g c u t by t h e t o r c h e s is
discrimination. c a r r i e d on t a b l e r o l l e r s to a c e r t a i n p o s i t i o n
w h e r e i t s c r o s s s e c t i o n is c o o l e d w i t h w a t e r
a n d its i d e n t i f i c a t i o n c o d e is s t e n c i l e d on t h e
A c t i v a t i o n L e v e l s of T h e H i d d e n Units s u r f a c e . It is t h e n c a r r i e d to a s l a b y a r d .
The stenciling system sends a timing signal
F i g u r e 8 s h o w s t h e a c t i v a t i o n l e v e l s of t h e to t h e c h a r a c t e r recognition system after
h i d d e n u n i t s w h e n t h e s t a n d a r d p a t t e r n s for m a r k i n g , b e f o r e an i m a g e of t h e stenciled
e a c h c a t e g o r y w e r e p r e s e n t e d . In t h e c a s e of c h a r a c t e r s is t a k e n by MOS c a m e r a s . Two
the n e t w o r k c o n t a i n i n g 20 h i d d e n u n i t s , t h e r e i d e n t i f i c a t i o n c o d e s a r e s t e n c i l e d on s o m e
are r e d u n d a n t h i d d e n u n i t s w h i c h can be s l a b s , s o t w o c a m e r a s c o r r e s p o n d i n g to t h e
a c t i v a t e d by a n y of t h e p a t t e r n s , l i k e t h e t w o c o d e s a r e i n s t a l l e d . H a l v e s of i m a g e s
u n i t s of N o s . 3 , 7 a n d 1 1 . T h e r e is no t a k e n by t h e t w o c a m e r a s a r e s y n t h e s i z e d a n d
r e d u n d a n t u n i t in t h e c a s e of t h e n e t w o r k s e n t to t h e s i g n a l - p r o c e s s i n g s y s t e m b a s e d on
containing 8 hidden units (networks a VME bus m i c r o c o m p u t e r , w h e r e the images
c o n t a i n i n g h i d d e n u n i t s less than 8 did not a r e d i g i t i z e d a n d p r o c e s s e d . All p r o c e s s i n g is
c o n v e r g e in o u r e x p e r i m e n t s ) . e x e c u t e d by s o f t w a r e algorithms, and the
F i g u r e 9 s h o w s t h e a c t i v a t i o n l e v e l s of t h e processed images are presented to the
h i d d e n units w h e n the i n t e r m e d i a t e p a t t e r n s o p e r a t o r s w i t h r e c o g n i t i o n r e s u l t s . An a l a r m
s h o w n in F i g . 6 a r e p r e s e n t e d a f t e r l o w - p a s s is g i v e n to t h e o p e r a t o r s in t h e t o r c h - c o n t r o l
filtering. In the case of the network r o o m by a b u z z e r a n d d i s p l a y on a TV m o n i t o r
c o n t a i n i n g 20 h i d d e n u n i t s , the h i d d e n u n i t s w h e n t h e q u a l i t y of t h e c h a r a c t e r s h a s b e e n
of N o s . 6, 8, 9 and 17 a r e activated j u d g e d to be d e f e c t i v e by t h e s y s t e m .
s e l e c t i v e l y by p a t t e r n s D a n d G. In t h e c a s e
of t h e n e t w o r k c o n t a i n i n g 8 h i d d e n u n i t s , t h e
Signal Processing
h i d d e n u n i t s of N o s . 1 a n d 7 a r e a c t i v a t e d
s e l e c t i v e l y . T h e w e i g h t s of c o n n e c t i o n s f r o m
T h e b a s i c a l g o r i t h m f o r p r o c e s s i n g is s h o w n
t h e s e u n i t s to t h e o u t p u t u n i t s c o r r e s p o n d i n g
in F i g . 1 2 .
to D h a v e p o s i t i v e v a l u e s ( e x c i t a t o r y ) in t h e
T h e p r o j e c t i o n of e a c h i m a g e to t h e v e r t i c a l
c a s e of t h e h i d d e n u n i t s w h i c h a r e a c t i v a t e d
axis is calculated to detect the region
by D, a n d h a v e n e g a t i v e v a l u e s ( i n h i b i t o r y ) in
c o r r e s p o n d i n g to t h e c h a r a c t e r s by m e a n s of a

149
pattern matching method. The projection of Alarm
the detected region to the horizontal axis is
then calculated to find the boundaries of the The recorded images of characters were also
characters and to segment the region into checked from the s t a n d p o i n t of proper
images of each character. After p r e p r o c e s s i n g j u d g e m e n t for an alarm. As the threshold for
such as correcting the inclination due to the this judgement is set high to avoid
camera p o s i t i o n s , noise rejection, trimming o v e r l o o k i n g defective c h a r a c t e r s , an alarm is
and contrast enhancement, the images of each given for about 3% of s l a b s . This allows a
character are converted to images c o n s i s t i n g 0% margin of error to be maintained so that
of a normalized number of pixels in no defective characters are overlooked.
accordance with the number of input units of Excess alarms are not a serious problem with
the neural network model, as shown in Fig. the system, because the operators can look at
2. The g r a y - s c a l e data of the images is then images of the characters on a TV monitor and
provided to the neural network model, which confirm the degrees of deteoriration of each.
produces a criterion of similarity between the Examples of characters for which the system
characters and the standard p a t t e r n s . gave alarms are shown in Fig. 15. In the case
An alarm is given in the following c a s e s : of (a), D is u n r e a d a b l e . While the category
1) the p r e p r o c e s s i n g is terminated abnormally assigned by the system was I, the value of
owing to a defective character; the output unit c o r r e s p o n d i n g to I was less
2) the value of the output unit which showed than 0 . 5 , so mark ' # ' was added to the
the maximum activation level, whether or not r e c o g n i t i o n r e s u l t , which gives an alarm. In
it c o r r e s p o n d s to the category the character the case of (b), all the characters of the
belongs to, is less than the t h r e s h o l d value, identification code were m i s s i n g , which was
which means that the character is not judged detected during p r e p r o c e s s i n g . Mark ' @ ' ,
to be similar to any of the standard p a t t e r n s . another mark giving an alarm, was added to
A value of 0.5 is used for the t h r e s h o l d , and the r e c o g n i t i o n r e s u l t .
the time required for total p r o c e s s i n g is about
3 s e c o n d s . Examples of the original image
and the recognition results are shown in Fig. CONCLUSION
13.
A character r e c o g n i t i o n and its quality
evaluation system was developed, using an
EXPERIMENTAL RESULTS artificial neural network model, to assure the
quality of identification codes stenciled on
This character r e c o g n i t i o n system has been c o n t i n u o u s l y cast s l a b s . The p r e p r o c e s s i n g
applied at No. 3 CC Plant and tested on more algorithm and adoption of the neural network
than 3 0 , 0 0 0 s labs since August 1989. The have made the system effective even when the
original images and the recognition results image of a character is degraded by blurs and
were recorded on video tapes, which have noise such as from spots of paint. The correct
been used to evaluate performance o f the classification rate of the characters is more
system. than 9 9 % , which is satisfactory for the
purpose of the system. The rate of
Recognition o v e r l o o k i n g defective characters on slabs to
be alarmed is 0%, while the excess alarm rate
The recorded images of the characters were is nearly 3 % , which is not a problem because
manually checked, and the correct o p e r a t o r s can back up the system visually.
classification rate of the characters was The system has been successfully used for the
calculated. Characters which were judged to quality a s s u r a n c e of slab marking.
be i n d i s t i n g u i s h a b l e by eye were eliminated
from the calculation in the following c a s e s :
1) characters were m i s s i n g ; REFERENCES
2) parts of characters were missing or
degraded, so they were unreadable or Rumelhart, D . E . , J . L . McClelland and the
p o s s i b l y could be misread; PDP Research Group ( 1 9 8 6 ) . Parallel
3) characters were too blurred or too faint to Distributed Processing, Vol. 1. The
read. MIT P r e s s , Cambridge, M a s s a c h u s e t t s ,
The correct classification rate of the pp. 3 1 8 - 3 6 2 .
characters has been more than 9 9 % , which is
satisfactory for the p u r p o s e of the system. Rumelhart, D . E . , G . E . Hinton, R . J . Williams
The p r e p r o c e s s i n g algorithm and adoption of ( 1 9 8 6 ) . Learning r e p r e s e n t a t i o n s by
the neural network have made the system b a c k - p r o p a g a t i n g e r r o r s . Nature, 323,
effective even when images of characters 533-536.
were degraded by blurs and s p o t s . Examples
of degraded characters that have been Asoh, H . , N. Otsu ( 1 9 8 9 ) . Nonlinear data
correctly recognized by the system are shown analysis and multilayer p e r c e p t r o n s .
in Fig. 14. Int. Joint Conf. on Neural N e t w o r k s , //,
411-415.

150
T A B L E 1 S p e c i f i c a t i o n of t h e S y s t e m

Camera MOS-type camera X 2 / strand of CC

Optical filter Cold filter


Synthesis of video signals of two ^
Synthesis of images
cameras
VMEbus microcomputer system
Processing system

Light source
CPU: 68020+ 68881
Frame buffer: 512 X 480 X 8bit X 8

500W Halogen lamp X 2 / strand


M
F i g . 1. Stencil patterns

Hidden layer
Input layer Output layer

Fig. 3. Three-layer neural network


model
(a) Original stencil (b) Characters extracted from images
patterns
Fig. 2. E x a m p l e s of b l u r r e d c h a r a c t e r s
(Each s q u a r e s h o w s the g r a y - s c a l e . )

(a) Pattern for D (b) Pattern for G

Pixels not in common


(a) Original stencil (b) Low-pass filtered patterns
patterns ill Pixels in common

Fig. 4. P a t t e r n s for t r a i n i n g the neural F i g . 5. C o m p a r i s o n b e t w e e n p a t t e r n s for


network model D and G

F i g . 6. Intermediate patterns between D and G


( T h e s e p a t t e r n s w e r e o b t a i n e d by a l i n e a r i n t e r p o l a t i o n of
p a t t e r n s D and G, c h a n g i n g w e i g h t t for D . )

151
A Correlation coef. between intermediate patterns and G
A Correlation coef. between intermediate patterns and D
O Activation level of the output unit corresponding to G
Activation level of the output unit corresponding to D Number of units Number of units

(a) Neural network (b) Neural network


containing containing
20 hidden units 8 hidden units

0.4 0.6 0.8 1.0


F i g . 8. Activation levels of the hidden units
Weight for G for the standard patterns

Fig. 7. Activation levels of the output units


and correlation coefficients for the
intermediate patterns shown in F i g . 6

1.0 1 B 4 B B

7 y>
I
0.8
>
<
c
#o 0.6
a
>
0.4
<
0.2

f l D D
0.4 0.6 0.8 .i
0.2 0.4 0.6 0.8 "
Weight for G Weight for G
(a) Neural network containing (b) Neural network containing
20 units in the hidden layer 8 units in the hidden layer

Fig. 9. Activation levels of the hidden units for the intermediate


patterns shown in F i g . 6
(Only those unit numbers are shown which are activated
selectively by patterns D and G . )

No.6 No.8 No.9 No.17 No.l No.7


DQ D. o . o |_|G D P .I
DMB- a o DQ D o a D(Z " Ho
a a not I
a in o a UM.U a a
D D - ")
::::-oR
- < )ammm i_| fj"i

CaO
DD 3D naq I o- - H M
a
a o = D D a D =DD
a o ^rtr
- o a a DD a D .-..::
n -=Dag
: . o o D DD a O D D * "I L . . . . a -
D * a I. DDD
: O * c Ol I. . . c a DD
a B DQ o . ' U U U U U BB
pa OQQD
D noi D =
a Dc [ D o p . aD
5m a
(a) Neural network containing (b) Neural network containing
20 hidden units 8 hidden units
Positive value D Negative value
(The area of each square is proportional to the absolute value of the weight. The position of each square
corresponds to the pixel of the characters.)

Fig. 10. Weights between the input layer and the hidden layer
(Weights are shown only for those units which are activated
selectively by patterns D and G . )

152
F
Camera
Image
synthesizer

Light VME bus


microcomputer

Frame
buffer
kfl
Alarm
Monitor H Marker Correction of inclination
of characters

Fig. 11. S c h e m a t i c d i a g r a m of t h e s y s t e m Segmentation

Conversion in accordance with unit number


of neural network model
(a) Original image
Calculation of neural network model

No

( END ) Alarm

F i g . 12. O u t l i n e of p r o c e s s i n g algorithm

(b) Result of recognition

Fig. 13. E x a m p l e of r e c o g n i t i o n
( E a c h i m a g e r e c o r d c o n s i s t s of t w o
i m a g e s t a k e n by t w o MOS c a m e r a s . )
(a) Alarm for an unreadable character
(2-8528BDI)
Output value: 0.437(#)

(a) Characters with low contrast

(b) Alarm for missing characters


Output value: - (@)

(b) Characters marked imperfectly and blurred Fig. 15. E x a m p l e s of d e f e c t i v e - c h a r a c t e r


a l a r m s g i v e n by t h e s y s t e m
F i g . 1 4 . E x a m p l e s of d i f f i c u l t c h a r a c t e r s t h a t
c a n be d i s c r i m i n a t e d by t h e s y s t e m

153
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

AN EXPERT SYSTEM FOR CONTINUOUS STEEL


CASTING USING NEURAL NETWORKS
A.B. Bulsari, M. Sillanp and H. Saxen
Kemisk-tekniskafakulteten, Abo Akademi, SF 20500 Turku/bo, Finland

Abstract. Developing an expert system is often time consuming even after knowledge acquisition.
Artificial neural networks offer an advantageous alternative to coding such knowledge in an
expert system shell or writing a program for it. This paper illustrates the feasibility of using a
feed-forward neural network for knowledge storage and inferencing, like an expert system, for
an industrial problem.
The inputs to the network were information about an incoming ladle of steel, and the output
was about its suitability for successful continuous casting, giving an indication on whether
problems would be encountered in the beginning and/or at the end of the casting. A trust-
region optimisation method was used for training the networks, where the input-output relation
of the nodes was given by a sigmoid function. This training method has been used successfully
for other neural network problems and was found to be quite reliable and robust.
By using a feed-forward neural network as an expert system for predicting operational problems
in the continuous steel casting process, some inconsistencies in the knowledge base were also
revealed.
Keywords. Neural nets, steel industry, expert systems

[6]. This method is very simple and its storage


1. Introduction requirements are small, which means that big
Artificial neural networks (ANNs) are claimed networks can be trained. However, because the
to have a good potential for knowledge acquisition, back-propagation by the generalised delta rule is
representation and storage. Therefore, it seems likely a modification of the steepest descent optimisation
that certain kinds of (fuzzy) expert systems in the method, its convergence is usually very slow. A trust-
field of chemical or metallurgical engineering rather region method [7,8] which uses higher order information
should be implemented in form of ANNs. Knowledge was therefore used in this work. This choice turned
out to be advantageous as long as the number of
engineering, expert system development effort as well unknowns is not very large, say not more than a few
as CPU time for inferencing could be saved when the hundreds. This restriction does not seem very severe,
neural net approach is feasible. Even though ANNs in since oversized networks should anyway be avoided.
the above mentioned fields have been utilised mainly
for diagnosis, pattern recognition and process control, Chemical and metallurgical engineers do not
there have been several attempts to implement expert yet routinely use artificial neural networks as problem
systems in neural networks [1,2,3]. solving and modelling tools. This is probably due
to the fact that the main research in the field has
ANNs consist of interconnected computational been carried out within computer science. In chemical
elements called nodes, or neurons. The internal nodes engineering, the major efforts have been on process
usually receive as inputs a weighted sum the net control and identification [9,10,11], fault diagnosis
input of the output signals from other nodes. The [12 - 16] and pattern recognition. In metallurgical
output, or activation, of a node is calculated as a engineering, there have been some efforts in signal
(usually nonlinear) function of the net input. In preprocessing [17], solving subproblems in expert
multilayer feedforward networks, which are considered systems and pattern recognition [18,19]. In this work,
in this paper, there are a few layers (the input, output a problem from the field of steelmaking is tackled by
and possibly some hidden layers) through which the using neural networks.
information passes. There is no feedback, and usually
each node in a layer receives signals only from the nodes 2. Continuous steel casting
in the layer below it; no connections exist within the
layer. Further, a bias, which may act as a threshold, is Steel casting is the process of giving shape
usually added as an input to each node in the hidden to molten steel. Ingot casting the traditional
and output layers. method has been under the recent decades
gradually replaced by continuous casting (see Fig. 1)
Various configurations and topologies of neural in steelmaking. The first large scale industrial
networks exist [4,5], but layered feed-forward networks continuous casting applications appeared during the
have been frequently used in technical applications. 1950's. Before that time, steel was exclusively cast
The network parameters are adapted by network through ingot casting. Casting steel in a continuous
training, where the weights which yield a network process leads to a considerable reduction in energy
that best fits the training instances presented are consumption and to a uniform quality over the full
determined. The most common network training length of the casting strand. Other major advantages
method has been the so called back-propagation are the increased casting capacity, reduced labour

155
and repair costs, and improvements in yield as the
number of forming steps are reduced and final product Schematic representation of the curved conti-
dimensions can be fit better. Possible drawbacks of nuous casting process and the two strand bloom
continuous casting are high investment costs, wear of caster at Ovako Steel Imatra.
refractories due to high casting temperatures, and the
fact that some steel brands are not well suited to be Jatkuvavaluprosessi kaoviollises esitettyn sek
cast continuously. Ovako Steel OyAb:n Imatran terstehtaan kaksilin-
jainen bloomivalukone.
Finland has been a pioneer in adopting the
continuous casting method in steel production. Since
the middle of the 1960's the new steelmaking capacity
has been based on continuous casting. Today almost
all the crude steel produced in Finland is cast by the
continuous casting method.
The objective of this work was to analyze if an
important and clearly defined problem in steelmaking
could be tackled by using artificial neural nets. The
problem in question is whether to accept or reject
a heat the liquid content of a steel ladle for
continuous casting: It would be of great value to be
able to foresee potential technical or quality related
problems that may be expected if a heat of liquid
steel prepared to be cast were actually taken to the
casting process. From the operational experience of
engineers and operators at Rautaruukki Oy, Raahe
Works, Finland, heuristic knowledge was acquired,
partly in form of a decision tree represented as Table 1. Fig. 1. Continuous steel casting
This table was used in the test of the neural network
approach.
3. T h e n e t w o r k a n d its t r a i n i n g
Table l. Training instances 126 to 170
The layered feedforward network used in this
work (see Fig. 1) has an input layer where the nodes
do not transform the signal, but only distributes it 4 33 1 11 22 22 22 00 00
towards the nodes in the layer above them. Each node 44 33 1 22 11 11 1
1 00 00
in the upper layers receives as input a weighted sum of 44 33 1 22 11 22 11 00 00
the outputs (activations) of the nodes in the layer below 4 33 1 22 22 11 11 00 00
it. A bias is further added to the input of each node;
44 33 1 22 22 22 11 00 00
this is equivalent to an additional weighted connection
to a node with a unit activation. The net input, *,, to 44 33 1 22 22 22 2 00 00
the ?::th node can thus be written 44 33 22 1 11 22 11 00 00
N
44 33 22 1 22 11 11 00 00
Xi = 22wijaj = wfa, 44 33 22 1 22 11 2 00 00
j=o 44 33 22 1 22 22 11 00 00
44 33 22 1 22 22 2 00 00
44 33 22 22 22 22 11 00 00
22 1 11 11 11 11 11 1 00
22 1 11 22 11 22 11 1 00
22 1 11 22 22 11 11 1 00
22 1 11 22 22 11 2 1 00
22 1 11 22 22 22 11 i 00
22 1 11 22 22 22 2 1 00
22 1 22 11 11 22 11 1 00
POSSIBLE 1 22 22 11 1 00
22 22 22
V HIDDEN
22 22 11 11 11 22 11 1 00
22 22 11 22 22 22 1i 1 00
22 22 22 11 22 11 11 1 00
22 22 22 11 22 22 11 1 00
33 11 11 22 11 11 11 1 00
33 11 11 22 11 11 2 1 00
33 11 22 11 11 11 11 1 00
33 11 22 11 11 11 2 1 00
33 22 11 11 11 11 11 1 00
33 22 11 ii 11 11 2 1 00
Fig. 2. The layered feed-forward neural network.
33 22 11 22 11 22 11 1 00
where N is the number of nodes in the layer 33 22 11 22 11 22 2 1 00
below the one where the node is located and , is the 33 22 11 22 22 11 11 1 00
node's bias. The output, or activation a,, of a node 33 22 11 22 22 11 2 1 00
is given by the logistic activation (sigmoid) function of 33 22 22 11 11 11 11 1 00
the net input 33 22 22 22 22 11 11 1 00
a = 33 33 11 11 11 22 11 1 00
J i + e -/J*i
33 33 11 11 22 11 11 1 00
which is a monotonically increasing function. The 33 33 11 22 22 22 11 1 00
parameter is a gain term which is introduced for 44 33 11 11 11 11 11 1 00
numerical reasons. In the sequel, the following notation 44 33 22 22 22 11 11 1 00
will be used for the feed-forward neural networks: Nin = 11 22 22 22 00 11
11 11 11
number of input layer nodes, Nout = number of output
layer nodes, and iVhid = number of nodes in each hidden 22 11 11 11 11 11 22 00 11
layer. Thus, the network configuration can be written 22 11 11 22 11 22 22 00 11
as as (N in , iVhid,., Wout). For a feed-forward layered
neural network of this type, the number of parameters,

156
i.e., weights in the weight matrix W = {>,_,}, is thus Table 3. Classification of the 7 casting variables
given by ^ o u t x ( ^ h i d + l)+7Vhidx(7Vhid + l) + ...+yVhidx(^in + l). and the variable codes used in Table 2.
A modification of the Newton-Raphson method
for solving nonlinear equations (or non-linear regres-
sion problems) was used to determine the optimum va-
lues for the elements of the network weight matrix W.
This trust-region method [7,8,20] differs from the back 1 2 3 4
propagation method [5,6] in that the search direction
is formed as an interpolation between the directions
given by Newton's and the steepest descent methods. 1 low fairly low fairly high high
The linearised model for the residuals is applied within
a trust region, where it, based on "experience" from 2 high normal low
earlier iterations during the search, has been found to
be valid. The strength of the method lies in its ability 3 little much
to shift towards the steepest descent direction whene-
ver the linearised model is inappropriate (e.q., when far 4 short long
from the minimum), still being able to show nice con-
vergence close to the minimum, where Newton's met- 5 short long
hod is superior.
6 first subsequent
4. The knowledge base
7 short long
The knowledge elicited from experts at the iron
and steel industry Rautaruukki Oy [21] was in terms of
qualitative data partly in the form of a decision tree
for the effect of some central variables on the outcome originally classified in four crisp classes: successful
of the casting process. However, no distinction was casting, initial problems, problems at the end of
made whether these situations were practically possible casting, and unrealisable casting. For training the
or not. Therefore the decision tree, part of which is neural networks, the output was, however, provided
represented in Table 1, also includes cases on which no in terms of two variables, the first indicating whether
actual information about the outcome of the casting or not there would be problems in the beginning, and
could have ever been obtained in practice. Table the second for the end of the casting (c/. lines 8 and 9
2 explains the meaning of the columns of Table 1. of Table 2):
Variables 1 through 7 refer to the first seven casting
(input) variables (columns 1 through 7 of Table 1),
while the eighth and ninth variables state whether
initial problems or problems at the end of the casting, Table 4. Classification of the outcome of the casting.
respectively, are expected.
The "input variable" information is expressed 0 1
in form of crisp classes represented as numerical codes
between 1 and 4. The physical interpretation of the
8 no yes
classes appear below in Table 3.
As the range of input variable 1 has been 9 no yes
discretised into four classes, variable 2 into three, and
the remaining variables into two classes, a total of
4 x 3 x 25 = 384 different combinations can be specified.
Naturally, the hypothetical casting data has been A single output to classify the problems using
subject to widespread simplifications as only a few and 0, | , , and 1 as the target for the output was also
crisp classes have been introduced for each variable. attempted, but the errors were somewhat more difficult
The results, i.e., the castability of a heat were to interpret. Alternatively, one could use 4 separate
output nodes for the 4 classes. This too was attempted,
but the increased number of unknowns (weights) in the
network led to convergence problems.
5. Results

Table 2. Explanation of the columns of Table 1.


This section describes some results of analysis
of the heuristics in the knowledge base by using feed-
forward neural networks. The networks were trained
1 liquid steel superheat after ladle treatment by minimising the error square sum, SSQ, of the 384
instances in the knowledge base. As the number of
2 heat content of the ladle outputs equals two, the average squared error for the
instances is obtained by dividing SSQ by 768.
3 mass of skull in ladle

4 holding time of ladle before the casting 5.1 A neural network without hidden layers
5 ladle stirring time
A network with no hidden layers does not
6 ladle position in the casting sequence usually perform well enough, but as shown in [22],
it reveals the cross correlations between inputs and
7 ladle casting time outputs. Therefore, a (7,2)-network was trained giving
the results of Table 5.
8 problems at the beginning of casting
9 problems at the end of casting

157
hidden layer, the error square sum was reduced to 4.000
(four misclassified instances), while the SSQ further
Table 5. Optimum weights for the (7,2) network. dropped to 3.000 (three main errors) as an eighth
hidden node was introduced.
One can fit the data set accurately by adding
Output 1 Output 2 | further nodes. What is too many and what is just
sufficient is a matter of investigation. Here (7,6,2)
From: seems to be a reasonable network, after which the
extra nodes simply seem to fit some additional training
Bias +1.22 -0.12 instances. The (7,6,2) network did not fit instances
55, 149, 152, 153, 169 and 352, while the (7,7,2)
Superheat -0.52 -0.65 could not fit instances 55, 127, 148 and 152. The
(7,8,2) network did not fit 143, 153 and 176. From
Heat content +0.37 +0.58 these observations it is clear that one should use some
method to eliminate nodes that do not contribute to
Mass of skull +0.17 +0.37 the generalisation capacity of the network, but merely
fit special (and possibly contradicting) instances of
Holding time +0.47 +0.35 the training set. Recently, some methods for weight
elimination have been proposed in the literature [24],
Stirring time -0.46 -0.39 which could be employed to tackle the problem of
network overparametrisation.
Ladle position -0.36 -0.19
Networks with two hidden layers proved to
Casting time -0.17 +0.37 be difficult to train. Most attempts resulted in
very large weights and large values of SSQ (> 30).
Apparently, numerical problems were caused by the
strongly nonlinear nature of the network model and the
existence of several local minima in the search space.
This solution corresponds to an error square
sum of SSQ=33.7 or an average square error of 0.29. 5.3 The refined data set
Even though the model is not good, it has at least
revealed the interrelations between the input and The (7,6,2) network indicated that the instances
output variables. It can be seen that a larger superheat 55, 149, 152, 153, 169 and 352 were difficult to fit
reduces (implied by negative weights) the possibilities into the existing framework in which the remaining
for problems at the beginning or the end of the casting. 378 instances fitted well. This seemed to be a possible
As for the second input variable, a low heat content sjanptom of inconsistency in the knowledge base. As
(indicated by a high input to the second input neuron) the instances were examined, the 55th instance was
favours higher activations for both outputs, i.e., casting found to be contradictory since e.g. instance 165 with
problems. It may be noted that the first 6 parameters higher superheat was classified as "initial problems"
have a similar effect on both the output variables. while the knowledge base classification of instance 55
The casting time (variable 7) is seen to mainly exert was "successful casting". Instance 149, in turn, had a
influence on the conditions at the end of the casting, low superheat as the only reason for possible problems,
a short casting time being favourable for avoiding and was classified as "successful casting" by the neural
problems (which should be expected). Skulls (variable network instead of "initial problems" as indicated in
3), in turn, are not very likely to cause problems in the the knowledge base. Instances 152, 153, 169 and
beginning, and the ladle position (variable 6) does not 352 could be explained similarly implying that the
significantly affect the temperatures during the final neural network's expectation was more reasonable than
phase of the casting. These two findings agree well what was in the training set. Modifying the data
with the physical knowledge of the problem that is set according to the revelations of the (7,6,2) network
coded in the knowledge base: i) Skulls usually appear resulted in the refined data set, which was completely
at a high position at the ladle wall, and therefore learned by the network (SSQ 0.76493 10" 16 ).
mainly affect the temperature of the uppermost liquid
"layers". As the steel is tapped into the tundish The results from the (7,7,2) or (7,8,2) networks
through a hole in the bottom of the steel ladle, the part could, naturally, have been utilized in a similar manner,
of the liquid which is cooled by the skulls will enter the which would have led to (other) sets of consistent
tundish at a late stage, ii) Initial problems caused data. There are, though, some objections to this
by low temperatures can be expected when a new approach. The (7,8,2) network learns the 55th instance
tundish is taken into operation. The introduction of a without any error. Obviously, it has also learnt the
new tundish is implicitly captured in the sixth input inconsistencies simply because of its higher storage
variable, the ladle position in the casting sequence. capacity; with a sufficient number of weights, one can
Thus, when casting the first steel ladle there may be fit even random numbers in a neural network without
initial problems encountered due to a cold tundish. errors. Similar findings for time-series modelling
by neural nets have been reported [24]. In these
The effect of all seven input variables can thus be applications, the oversized networks were shown to be
explained without any apparent contradictions against able to reproduce the noise of the training set, which
physical principles, which indicates that the knowledge resulted in inferior prediction performance on a testing
base is sound as a whole. For a more detailed discussion set of data. The drawbacks of using too many model
of the physical motivation and interpretation of the parameters are, indeed, well known to anyone with
heuristics, see Sillanp [23]. experience in the field of mathematical modelling or
5.2 Feed-forward neural networks with hidden layers curve-fitting.
The (7,6,2) network thus seems to be a network
With one hidden layer, network training was of appropriate size for the problem in question; it does
not generalise too much, and it does not seem to learn
carried out with 4 or more nodes in the hidden layer. inconsistent instances. The number of parameters of
For the (7,4,2) network, the error square sum dropped this network is 62, which still must be considered quite
to SSQ = 14.000, almost completely contributed by high. Naturally, there could be alternative methods
14 instances with errors of 1.000 or -1.000. The for capturing the information of the knowledge base.
(7,5,2) network yielded SSQ 7.834, but one weight The number of unknowns required to describe the
was very large which indicates less reliability of the knowledge should be much smaller than the number
result. The errors were here more evenly distributed, of training instances, because of the redundancy of
and no complete misclassifications were reported. With the branches in the decision tree. The neural net
the (7,6,2) network, only 6 instances contributed with approach must therefore merely be considered one of
significant errors (SSQ 6.000). For 7 nodes in the

158
many possible ways to tackle the problem. neural computation", IEEE Control Systems Magazine,
(April 1990) 24-30.
10. Bhat, N. and T. J. McAvoy, "Use of neural nets for
6. Conclusions dynamic modeling and control of chemical process
systems", Computers and Chemical Engineering 14 (1990)
573-582.
The work demonstrated that neural networks 11. Ydstie, B. E., "Forecasting and control using
can be used for knowledge storage and convenient adaptive connectionist networks", Computers and
retrieval for use as an expert system. A real industrial Chemical Engineering 14 (1990) 583-599.
problem decision making in continuous steel casting
the solution of which would be of practical value 12. Hoskins, J. C. and D. M. Himmelblau, "Artificial
for the metallurgical industry, was tackled by using neural network models of knowledge representation
neural networks. This approach may save a lot of in chemical engineering", Computers and Chemical
time in developing the system. Further, inferencing is Engineering 12 (1988) 881-890.
almost instantaneous and no expensive expert system
development tools (and thick manuals) are required,. 13. Venkatasubramanian, V. and K. Chan, "A neural
network methodology for process fault diagnosis",
Inconsistencies in training instances (or "outli- AIChE Journal 35 (1989) 1993-2002.
ers") can be identified with the help of the networks.
There seems to be something like the correct size of a 14. Venkatasubramanian, V., R. Vaidyanathan and Y.
neural network for the example studied: Too many hid- Yamamoto, "Process fault detection and diagnosis
den nodes can force the network to learn inconsistent using neural networks I: Steady state processes",
data. Computers and Chemical Engineering 14 (1990) 699-712.
Networks with one hidden layer performed
better than networks with two hidden layers, and 15. Watanabe, K., I. Matsuura, M. Abe and M.
also better than the one without hidden layers. Kubota, "Incipient Fault Diagnosis of Chemical
Networks with two hidden layers have, according to Processes via Artificial Neural Networks", AIChE
our experience, always been more difficult to train. Journal 35 (1989) 1803-1812.
Apparently, there are many local minima which make
the search for a global minimum more difficult, and 16. Ungar, L. H., B. A. Powell and S. N. Kamens, "Ad-
the increased number of unknowns also makes the aptive networks for fault diagnosis and process con-
multidimensional search problematic. trol" , Computers and Chemical Engineering 14 (1990) 5 6 1 -
572.
17. Noel, H. and J.-Y. Quemeneur, "NHF: Localisation
A ck nowledgement s de Pertes Thermiques sur Haut-fourneau", Proc. of
International Neural Network Conference, J u l y 1990, P a r i s ,
France, Vol. 1, p. 159.
Rautaruukki Oy, Raahe, Finland, is gratefully 18. Hirata, T., K. Yamamura, S. Morimoto and H.
acknowledged for its cooperation with the authors and Takada, "Blast Furnace Operation System using
for providing the continuous casting heuristics.
Neural Networks and Knowledge-base", Proc. of 6th
International Iron and Steel Congress, N a g o y a , J a p a n , ISIJ
References (1990) 2 3 - 2 7 .

19. Kanoshi H, M. Sanui, S. Morimoto and H.


Tanaka, "Development of advanced engineering
1. Bulsari, A. B. and H. Saxen, "Implementation system for operation control of Oita no 2 blast
of a chemical reactor selection expert system in a furnace", Proc. of 6th International Iron and Steel
feed-forward neural network", to be presented at Congress, Nagoya, Japan, ISIJ (1990) 387-394.
Australian Conference on Neural Networks, Sydney,
Australia, February 1991. 20. Henley, E. and E. Rosen, Material and Energy Balance
2. Gallant, S.I., "Connectionist Expert Systems", Computations, Wiley k Sons, New York, 1969.
Comm. ACM (February 1988) 152-169. 21. Mikkola, J. (Rautaruukki Oy, Raahe, Finland),
3. Chan, S-C. and F-H. Nah, "Fuzzy Neural Logic personal communication.
Network and its Learning Algorithms", Proc. of 22. Bulsari, A. and H. Saxen, "A feed-forward artificial
HICSS-24, Koloa, Hawaii, IEEE Computer Society neural network as a simulator for a chemical
Press, Los Alamitos, CA, USA, Vol. I (1991) 476- process", Proc. of HICSS-24, Koloa, Hawaii, IEEE
485. Computer Society Press, Los Alamitos, CA, USA,
4. Lippmann, R. P., "An introduction to computing Vol. I (1991) 500-506.
with neural nets", IEEE ASSP Magazine, (April 1987)
4-22. 23. Sillanp, M., "Model for predicting the evolution
of steel temperature in a continuous casting tundish
5. Widrow, B. and A. Lehr, "30 Years of Adaptive on the basis of expert knowledge", Technical Report
Neural Networks: Perceptrons, Madaline, and 91-1, Heat Eng. Lab., Abo Akademi, Abo, Finland
Backpropagation", Proc. of IEEE Neural Networks I: (1991).
Theory & Modeling" 78 (1990) 1 4 1 5 - 1 4 4 2 .
24. Weigend, A.S., B. A. Huberman and D. E. Rumel-
6. Jones, W. P. and J. Hoskins, "Back-Propagation: A hart, "Predicting the Future: A Connectionist Ap-
generalized delta learning rule", Byte, (October 1987) proach", IJNS1 No. 3 (1990) 193-209.
155-162.
7. Levenberg, K, "A Method for Solution of Certain
Non-linear Problems in Least-squares", Quart. Appl.
Math. 2 (1944) 164-168.
8. Marquardt, D. W., "An algorithm for least-squares
estimation of nonlinear parameters", J. SIAM 11
(1963) 431-441.
9. Bhat, N. V., P. A. Minderman, Jr., T. McAvoy and
N. S. Wang, "Modeling chemical process systems via

159
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

NEURAL NETWORKS FOR STEADY-STATE PROCESS


MODELLING AND FAULT DIAGNOSIS
J.J. Cilliers
Department of Chemical Engineering, University of Cape Town, Private Bag Rondebosch,
Cape Town 7700, South Africa

ABSTRACT

The use of Artificial Intelligence techniques to assist operators in process fault


diagnosis has recently generated considerable interest. This paper discusses the
ability of feedforward Artificial Neural Networks (ANN) to be used as process models
suitable for fault diagnosis using parameter estimation. A comparison of the
performance of various network architectures and training data sets generated from
rigorous experimental designs is given. Model performance is evaluated using
extrapolation and interpolation.

Results indicate that the modelling ability of ANN's is generally insensitive to


network architecture and that performance is not significantly better than that of
linear interpolatory models. Data generated through rigorous experimental designs
would therefore be better suited to conventional modelling techniques.

ANN's have also been applied as direct fault classifiers from process measurements.
Some comments on recent results reported in the literature is given.

KEYWORDS

Mineral processing, Process models, Artificial Intelligence, Neural Networks

INTRODUCTION

Process Fault Diagnosis Hvdrocvclones

The use of computer assisted techniques to assist Hydrocyclones are commonly used in the Minerals
operating staff on complex chemical and Processing industry as size classification
metallurgical process plants with fault diagnosis devices for particulate slurries. Their correct
has received widespread interest. Only in recent operation is critical and can have a large impact
years, however, have the use of Artificial on the profitable operation of the process as a
Intelligence (AI) techniques become popular. whole. Very accurate mathematical models of
Recently, the use of Artificial Neural Networks hydrocyclones have been developed (3). These
for fault diagnosis has been intensively models are relatively complex and are unsuitable
researched. for real-time parameter estimation for fault
diagnosis. They can, however, be used as a
ANN's can be used in two ways to enhance fault steady-state replica of an operating
diagnosis systems. In the first case, the network hydrocyclone, and data collected for the training
is used to develop a mathematical model of the of a Neural Network. The trained network can
process, i.e. to form a relationship between the subsequently be applied to the physical system.
process variables (network inputs) and responses This exercise is thus attempting to model the
(network outputs). This can then be used to response of a physical system using an ANN, using
determine whether a difference exists between the data collected from a mathematical model of the
process responses and the network outputs for a process. In cases where such a model is not
certain set of input process variables. Should a available, data collected from the process itself
significant difference exist, this can be related will be required to train the network.
to process faults. It should be evident that a
robust model is required that can be used under a Four critical parameters were identified as
wide range of operating conditions (1). system variables, namely the cyclone vortex
finder and spigot diameters (physical parameters)
In the second instance, the ANN is used to and the feed particles size distribution and
directly form a relationship between a set of density (operating parameters). Four responses
process measurements and a comprehensive fault are monitored; the recovery of solids to the
list. In this case, the network is used to underflow, the percentage solids in the
classify the measurement patterns into operating underflow, the underflow fines content and the
regimes (2). operating pressure. The process model required
will therefore have four inputs and four outputs,
In the discussion to follow, the use of ANN's to and fit a surface in multi-dimensional space.
model steady-state process operations is
analysed. Some comments of the use of ANN's as
classifiers is also given.

161
Data can be collected from the system by TABLE 1 : Average Percentage Response Errors for
performing simulations. It must be kept in mind, Various Model Architectures
however, that if data is to be collected on an
operating plant, or under experimental RESPONSE LINEAR 4 NODE 8 NODE 20 NODE
conditions, two complicating factors exist.
Firstly, data from physical tests is generally 1 0.62 1.67 1.22 1.12
subject to error and noise. The data collected
from the simulator therefore represents an 2 2.25 1.50 1.10 0.93
extremely optimistic case.
3 9.84 5.93 4.71 7.04
Secondly, data collection under these conditions
is extremely time consuming and therefore, in 4 4.40 1.27 1.08 0.95
general, a rigorous experimental design is done
before testwork commences. These designs often
take the form of n m experiments, where n is the
number of levels tested for the variable (e.g. It can be noted that an increase in the number of
low, medium and high for a 3-way factorial hidden nodes in the network does not
design), and m the number of variables tested. significantly improve the average error in the
For the system under consideration, therefore; 2, model predictions.
3, 4 and 5 way factorial designs yield 16, 81,
256 and 625 data sets respectively. Since a Figures 1 to 4 show the trend response of the
single experiment may take a full day to perform, system, and the predictions of the linear, 4, 8
it can be appreciated that the results shown here and 20 hidden node models. The trend in the
also represent an optimistic scenario. response of the system is relatively simple,
being positively linear, parabolic, exponential
(or parabolic) and negatively linear for each of
Evaluation Technique the four responses, when only the single variable
is changed. It must be noted that the data points
Two effects were investigated, namely the network corresponding to 5 and 40 feed % solids require
architecture used and the amount of training data model extrapolation, the remainder interpolation.
available. For the former, feed-forward networks
with different numbers of hidden nodes were used, Two results are immediately evident. Firstly, the
using data from a 3-way factorial design (81 data extrapolations tend to improve slightly as the
sets). In the latter case, a single architecture number of hidden nodes is increased, indicating
(8 hidden nodes) with 2 to 5 way factorial that additional nodes are required to capture the
designs were compared. underlying model form. Secondly, none of the
neural networks capture the essence of the data
Evaluation was done in two ways. Firstly, a test trends and do not perform significantly better
set of 200 random input values within the than the most simple linear model.
training range were evaluated and the error
statistics monitored. This gave an indication of
the general model accuracy. Secondly, one of the Experimental Design Effects
model input variables was varied across the
training range and slightly beyond, while the Believing that the above results were an
three other input variables were kept constant at indication that too little training data was
values midway between any of the training data. available, the effect of varying the training
These results gave an indication of the ability data set size was investigated. For this case a
of the trained network to both interpolate and network having 8 hidden nodes, which performed
extrapolate from the training data, and to mimic adequately in the previous tests, was used.
operating trends.
TABLE 2 : Average Percentage Response Errors for
Various Experimental Designs
EVALUATION
RESPONSE 2 WAY 3 WAY 4 WAY 5 WAY
Neural Network Models 1 2.14 1.22 2.81 1.66
All tests were performed using the NeuralWorks 2 3.38 1.10 1.90 1.91
Explorer v4.01 package (10). The networks were
trained using normalised cumulative delta 3 6.14 4.71 9.00 7.22
backpropogation and had hyperbolic tangent
transfer functions. Input was scaled between -1 4 1.88 1.08 2.06 1.20
and +1, and the output between -0.8 and +0.8, to
allow for modest extrapolation. Training data was
presented to the network using a randomised Table 2 shows the average percentage errors for
shuffle-and-deal technique. Training was the same 200 test cases used previously. Note
performed until the RMS error on the output nodes that the average errors do not necessarily
stabilised. decrease as the number of training data sets
increases. The linear model developed using a
three-way factorial design (cf Table 1) does not
A simple linear model without interaction terms perform significantly worse than either the four-
was calculated from a 3-way factorial design or five-way networks, trained with three and
experiment as a base comparison. seven times as much data, respectively.

In some cases, training converged at an error


Network Architecture Results level higher with more data than with less data,
indicating that the network is "learning" the
Table 1 shows the average percentage errors data itself, rather than the underlying
calculated for the 200 random test data sets functional form. Considering the experimental
presented to each of the trained ANN effort that is required to perform a 5-way
architectures and to the linear model. factorial design experiment, the lack of model
accuracy is disturbing.

162
Figures 5 to 8 show the trend results for each of
the model responses, in the same format as
before. Note again that an increase in the
training data does not significantly improve the
trend results. REFERENCES

1. Himmelblau,DM. (1978). Fault Detection and


DISCUSSION Diagnosis in Chemical and Petrochemical
Processes, Elsevier.
In general, when steady-state models of process
operations are developed, only a limited amount 2. Hoskins, J C , and Himmelblau, DM. (1988)
of test data is available. Results indicate that Artificial neural network models of knowledge
this lack of data means that neural network representation in chemical engineering. Comput
models of steady-state processes will not perform Chem Engng, 12, (9/10), 881.
significantly better than even the most simple
linear approximation to the system response. Well 3. Cilliers, JJ., and Hinde, AL. (1991). An
designed experiments allow full use to be made of improved hydrocyclone model for backfill
conventional mathematical and statistical model preparation. Minerals Engineering, 4, (7-11),
building techniques and is therefore a more 683-693.
suitable methodology than ANN's for this
application. 4. Lippmann, RP. (1987) An introduction to
computing with neural nets. IEEE ASSP Mag 4.
Through experimental design and heuristic
knowledge, a broad picture of the response 5. Bhat, N. and McAvoy, TJ. (1990). Use of neural
surface can be developed, which can guide the nets for dynamic modelling and control of
functional form required for the response model. chemical process systems. Computers Chem Engng,
Although it is known that Neural Networks can 14,(4/5), 573.
approximate any non-linear function (4), explicit
control of the model form being developed when 6. Rich, SH., and Venkatasubramanian, V. (1989).
using conventional modelling techniques is Causality-based failure driven learning in
valuable and should be exploited if possible. diagnostic expert systems. AIChE J, 35, (6), 943.

When extrapolation of operating trends is 7. Huang, WH., and Lippmann, RP. (1988). Neural
required, results indicate that the response of net and conventional classifiers. Amer. Inst.
an Artificial Neural Network can be extremely Physics Neural Network Conference, 387.
unreliable. Conventional mathematical models,
even when not extremely accurate (as in the case 8. Kramer, MA., and Leonard, JA. (1990).
of the linear model above), have yery predictable Diagnosis using backpropogation neural networks -
qualities when extrapolating and may, generally, analysis and criticism. Computers Chem Engng, 14,
be more suitable for process modelling. This is (12), 1323.
certainly the case when hard-limiting transfer
functions are used and the extrapolation response 9. Leonard, JA., and Kramer, MA. (1991). Radial
is limited by definition. basis function networks for classifying process
faults. IEEE Control Systems, April, 31.
These results do not mean that ANN's are not
suitable for use as process models. Considerable 10. NeuralWare Inc. 1991.
success has been had when using this methodology
for process control (5). The ability of a model
to continuously improve is appealing, and while
problems exist with on-line training of ANN's,
this is a promising field. In this case, the data
set is practically infinite. This necessity of a
randomly distributed data set spanning the
response surface to be modelled is essential.

The use of feedforward, sigmoidal ANN's as direct


fault classifiers from measured input data has
been investigated widely (6). Recent publications
tend, however, to indicate that ANN's will
perform no better than conventional nearest
neighbour classifiers (Z,8). Work is in progress
with modified network architectures and transfer
functions in an attempt to improve this
performance, although problems in its practical
implementation do still remain (9).

While it is expected that ANN's will, in future,


play an important role in both the fault
diagnosis and control of process operations,
their successful application will require
considerable further research.

163
UNDERFLOW % SOLIDS
RECOVERY TO UNDERFLOW

FEED % SOLIDS
FEED % SOLIDS

MODE
L TYP
E
MODE
L TYP
E
SYSTE
M -- LINEA
R * 4 N O DSE
SYSTE
M LINEA
R -*- 4 N O D SE
8 N O DSE 20 N O D SE
- - N O D SE -- 20 N O D SE

OPERATING PRESSURE
UNDERFLOW FINES CONTENT

FEED % SOLIDS
FEED % SOLIDS

MODE
L TYP
E
MODE
L TYP
E
SYSTE
M LINEA
R -*- 4 N O D SE
SYSTE
M -- LINEA
R -*- 4 N O D SE
"- 8 N O D SE 20 N O DSE
-" S N O D SE - * ~ 20 N O D SE

Fig 1 to 4:
Comparison of Neural Network Architectures
and Linear Model with System Responses.
(Three Way Factorial Design Experiment)

164
RECOVERY TO UNDERFLOW UNDERFLOW % SOLIDS

RECOVERY TO UNDERFLOW UNDERFLOW % SOLIDS

^
*. -^St^=^
^ - *
%^ s*r^

^ ^ ^--/w/^'
% ,
~^^-'
^x%= &>j
^O 6 ^.^-*

'
1
5 10 15 20 25 30 35 40 45 10 15 20 25 30 35 40 45
FEED % SOLIDS FEED % SOLIDS

FACTORIA
L EXPERIME
TN FACTORIA
L EXPERIME
TN

SYSTE M -*- 2 WAY


-*- 3 WA
Y
SYSTE M 1~ 2 WAY ^fc- 3 WAY

-a- 4 WAY - *- WAY


-0- 4 WAY - *- 5 WAY

UNDERFLOW FINES CONTENT OPERATING PRESSURE

UNDERFLOW FINES CONTENT OPERATING PRESSURE

Jt /
ifr^ ^ /

^* /' /
/frf /
/// / /
/// / / / s$?
/// y^f
/ <>/ ' s/^
0
/ v3 / j /
^
i ^ ^
,
10 15 20 25 30 35 40 45 15 20 25 30 35 40

FEED % SOLIDS FEED % SOLIDS

FACTORIA
L EXPERIME
TN FACTORIA
L EXPERIME
TN
SYSTE
M - <- 2 WA
Y -*- 3 WA
Y SYSTE
M - +- 2 WA
Y -*- 3 WA
Y
- e- 4 WA
Y - *- S WA
Y - B- 4 WAY - *- 5 WA
Y

Fig 5 to 8:

Effect of Training Data Set Size on Trend


Prediction for 8 Hidden Node Network.

165
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

MINERAL PROCESS CONTROL BY NEURAL NETWORK


K. Okaya and T. Inoue
Department of Mineral Development Engineering, Faculty of Engineering, University of Tokyo, Tokyo, Japan

Abstract. Learning capability represents the most outstanding feature of artificial neural network
(ANN). To avail of this particular property of the ANN the authors are trying to form a process
controller for mineral processing of which design can be established free from human expertise. At the
present stage of the development the ANN process controller comprises a standard ANN with three
layers for input, hidden and output activities. In order that the ANN can shape a process controller
the input layer is provided with extra cells to contain the history for several times the retention period,
so that the goodness of control can be evaluated pertaining to the current process dynamics.
The performance of the controller is observed by a process simulator operated under two types
of Gaussian noise. Thus, Gaussian pink noise is used to mimic the actual process disturbance, while
the particle size monitor is assumed to pick up an observation noise which is Gaussian and white. It
was proved that the ANN learns itself even though slowly only with the minimum teaching by human
expertise. Although the quality of control is not so high at the present stage of development, the
impact of the controller requiring less design skills will be of immense value, particularly when the
neuro-computing technology with higher performance is realized in future.

Keywords. process control; neural network; back propagation; learning method; mineral processing.

INTRODUCTION The study of the process control by ANN is still at the first
stage. Implementation of ANN requires a very large memory
space and a huge amount of simple arithmetic operations.
The authors has been investigating the control techniques for However, the integration density of semi-conductor memories
mineral processing using several types of computing meth- and the computer speed has been increased steadily for the
ods, such as expert system, fuzzy logic and artificial neural last few decades, and we forecast the trend will continue
network (ANN). These new methods which have been stud- further on.
ied are expected to have great possibility for automatic con-
trol in practice. Especially the authors consider that ANN In this situation it seems worthwhile to work on the appli-
will be able to realize the control system which can auto- cation of ANN for process control, in particular to explore
matically adapt itself to such a process for which designing how to obtain the teaching signal for ANN. In the problems
of the control system is difficult without learning capability. such as pattern recognition it is obvious that teaching signal
can be secured in a large range of applications. In auto-
We have tried to adapt ANN to a control system of mineral matic control, however, it is not so simple. In this paper we
processing from the following three reasons. will propose a configuration of the ANN controller system
and the learning method for optimal control. The result is
(1) Mineral processing is an ill-defined and complex process shown in an application of a grinding circuit simulator.
whose operating conditions are liable to frequent changes. So
it is difficult to design a control system of mineral processing
by the methods based on mathematical modeling. ANN can
be utilized to construct process models and controllers. NEURAL N E T W O R K
(2) The time constant for most of the mineral processing op- Most of the mineral processes involve time delays; hence they
erations are not so small. Considering the current calculation are quoted as dynamic processes. In order to learn the dy-
speed of small computers ANN controllers are supposed to namics of the process, the ANN must be provided with extra
be rapid enough to comply with them. neurons to represent the history for several times the reten-
tion period (Fig.l). Not only the ANN process model but
(3) In general mineral processing operations are known to as- also the ANN process controller requires these extra neurons.
sume self-regulating property. At the first stage when learn-
ing of ANN is not complete ANN may likely to control im- Learning of the process dynamics is substantially equivalent
properly. It will be possible to circumvent this disadvantage to making a simulator of the process. The learning of the
of ANN by temporarily switching over to manual control simulator establishes the relationship between the input and
or auxiliary (non-ANN) controller to maintain the process the output of the process. This simulator can be used in the
within a tolerable conditions. later time to tune the controller.

167
ANN has coincided fairly well with that of the actual pro-
cess. Ten neurons are used as the input layer of this ANN
to approximately cover the retention period of the process.
The hidden layer consists of 100 neurons, while the output
layer has only one neuron. About 70,000 sets of data have
been used until the status of Fig.3 was reached, where learn-
ing appeared to be nearly complete. Step responses of the
ANN and the process are shown in Fig.4. We can see that
the dynamics of the ANN after learning is compatible with
the process.

Fig.5 shows again the step response of the ANN and that of
the process with dead time plus first-order delay. The accu-
racy of the response from the ANN appears to be inferior to
that of Fig.4. This is probably because several neurons in
the input layer have been ineffectual according to the dead
time, which otherwise should have been served to represent
the process dynamics. To avoid this problem it may be ap-
propriate to use additional neurons.

Figure 1: Configuration of ANN.

uf1VVf\I
It is not so difficult to organize the learning mechanism of the
process dynamics, since a reliable teaching signal is available.
The weighting factors of the ANN system can be corrected
according to the error of the calculated output values against
those of the real process.

The learning process of-the simulator is shown in Fig.2 and \

y
3. Input and output data of the objective process are given
to the input layer, and the response of the process is used as
a teaching signal. \ !
\
The responses of the process and the ANN look like inter-
acting each other, reflecting that ANN is trying to mimic

error

Figure 3: Learning of the process simulator (final stage).

Figure 2: Learning of the process simulator (early stage).

the actual process dynamics in the early stage of learning


as shown in Fig.2. The difference between the two is more
distinct, when we observe the errors (lower diagram). Fig.3 Figure 4: Step response of simulator
shows the final stage of learning, where the response of the with a second-order time delay.

168
The learning of control actions by the ANN according to optimizing the dynamics of ANN. The procedure of teaching
the existing controller is similar to that of the simulator, the dynamics to the ANN controller is shown in Fig.7.
or the process model, as described above. Fig.6 shows the
result of the learning which was taught by a PID controller. On the first phase the ANN should acquire the outline of
It is indicated that the ANN acts as a controller, though the controller dynamics. This phase is useful for the inno-
unfavorable results are observed in some areas. This may be cent ANN to converge rapidly. In order for ANN to learn
attributed to the shortage of the number of neurons. the controller dynamics it is beneficial to have an existent
controller from which ANN can get the teaching signal. If
From these results it becomes clear that we can obtain the no controller is readily available, a virtual controller must be
process simulator or the controller which simulates the dy- prepared in the system for temporary use. This virtual con-
namics of the process or the existing controller using simple troller is made to tune the ANN controller using the process
ANN with three layers and additional neurons to deal with simulator which is mentioned above. The purpose of this
the dynamics. However, the capability of the controller can virtual controller is to determine the basic dynamics of the
not surpass that of the controller which was used for teach- controller against the process being considered. To this end
ing. In the next section We will discuss how to optimize the the simplest model of the controller is sufficient, but it is not
dynamics the controller by the ANN. necessary to optimize this controller completely.

On the second phase the ANN controller must be optimized


step by step. The optimization is carried out using the ANN
simulator and a kind of evaluating function. For example,
sum of squares of the process errors against the set point will
be an index value of the badness of control. The ANN con-
troller can be tuned so that the badness can be minimized.
In order that the ANN controller can be optimized in this
way it is essential to have a dependable process simulator.

Figure 5: Step response of simulator


with a dead time and a first-order time delay.
generate and tune
temporary controller

learn control dynamics learn control dynamics


from from
the temporary controller the existent controller

learn process dynamics

T
optimize control dynamics
using the simulator
and the evaluating function

T
C END J

Figure 7: Teaching process of the control dynamics.


Figure 6: Step response of neuro-controller
with a second-order time delay.
SYSTEM COMPOSITION
OPTIMIZATION
OF THE ANN CONTROLLER Our system contains two sets of ANN's. One of them is
used for modeling the objective process, and the other serves
as the adaptable controller. In order to enable calculating
We have introduced two phases of learning for the ANN as a the responses of the ANN's quickly, all of the data of the
controller. First phase is the learning of basic dynamics for ANN's should be located on the computer memory. If an
the ANN which has no bit of knowledge yet. Second phase is ANN consists of Z, M and TV neurons for input, hidden

169
(intermediate) and output layers respectively, {(L+M+N)+ Fig.9 shows one of results of the simulation. Line A indicates
( Z * M + M * N ) + (.M+iV)}*4 bytes of memories are required the trace of the proportion of the soft ore. Line B is the re-
to keep the data of all the neurons, connecting factors and sults without control. Line C is the results with the ANN
thresholds. In addition to this {(Z*Af+/* JV) + (Af+)}*4 controller. As the figure shows, the system remains relatively
bytes of memory are required to execute back propagation stable, when the magnitude of the observation noise against
by the momentum method. Furthermore, for the calculation the average product size is 5%. When the noise level is in-
of these matrices or vectors, the work area size will come up creased over 10%, the ANN controller could not control the
to about 3 times the largest matrix size. process any more. This means that the ANN model could
not learn the dynamics of process correctly at this high noise
Our system is written in FORTH language on a personal level.
computer. The processor of the computer is i80386 (20MHz
clock cycle) with a co-processor for crunching real numbers
and a 9M byte memory. This FORTH system itself has been DISCUSSION
developed by the authors to realize the ANN system with a
large number of neurons. Theoretically our system is capable
of the memory area up to 2G bytes. On the process of developing the control system by ANN's,
several problems have become apparent. A few considera-
tions about these problems will be discussed below.

SIMULATION Determination of the number of neurons


in the hidden layer
A grinding circuit simulator was used for testing the per- To take the case of the XOR problem, as the simplest ex-
formance of the ANN control system consisting of the ANN ample, two neurons are required theoretically for the hidden
model and the ANN controller (Fig.8). In this set up the layer to form the ANN. We should be able to calculate the
80% size of the product is controlled by changing the feed exclusive-or against a pair of input data with this ANN. Nev-
rate. The mill model consists of 5 completely mixed sections ertheless, the learning of this function is not always success-
with back mixing flow simulating the diffusive flow of mate- ful. According to the initial values of the ANN parameters,
rials through the mill axis. The feed ore is assumed to be a it can require an extremely large number of repetitions until
mixture of two types of ores having different hardness. The it gets converged. It was proved that the converging speed
proportions of the soft and hard ores were assumed to change of the ANN depends on the combination of the signs of the
around 30 and 70% respectively. The variation was assumed initial values, while the setting of the weighting parameters
to be represented by a pink noise whose auto-correlation (determined with random numbers) are influential as well.
function can be indicated by an exponential function of time In order to avoid the unfavorable effect of the initial values,
difference. It was also assumed that the particle size monitor it was found to be advantageous to increase the number of
picks up an observation noise which is Gaussian and white. neurons in the hidden layer. We used 100 neurons in the
hidden layer to obtain satisfactory results.

Choice of the correction method


At first appearance the collective correction method seems
to assure minimum cycles of corrections required, and hence
the maximum rate of learning in the back propagation of the
ANN. It turned out, however, that the collective correction
method presents only a smaller degree of correction per cy-
cle compared to the successive correction method, because
of the averaging effect specific to the former. On the other
hand, introduction of the momentum method proved to be
Figure 8: Grinding circuit model. beneficial for increasing the converging speed. Based on the
above results observed we decided to use the successive cor-
rection method in combination with the momentum method
[/urn] in our system.
55 C
50
Finding the optimal manipulated value
45 for the ANN controller
[] We need to know the optimal manipulated value using the
55
B process simulator for tuning the ANN controller. However,
the proper value of correction may only be obtained a later
50 time using some kind of an evaluating function. It has been
45 mentioned above that the process simulator is available to
search for the optimal value. Iteration is required to find out
the optimal value. It takes too much time for this calculation,
[%]
40
however. Instead of this, only three times of simulation was
performed in our system to acquire the approximate value for
30 correction. The solution can be expected to improve itself in
the repetition of learning.
20

Availability of the teaching signal


0 1 2 3 4 5 [k] by perturbation
Figure 9: Results of the simulation. It should be noted that the results shown in Figs.2 and 3

170
are obtained from the simulations on the computer with a
random and extensive change in the input level. On the con-
trary the state of real process will not change so drastically
as shown there. Hence it is anticipated that much longer
time is necessary for the learning of the simulator after the
actual process. In this context we will have two problems:

(1) In the actual process the extent of state change occurs


only in a limited degree in the normal operating conditions,
and ANN may lack the knowledge necessary to cope with
the extraordinary conditions where ANN controllers are ex-
pected to work properly, too.

(2) The signal obtained from the actual process are stained
by noise. The influence of noise on the learning efficiency
must be examined. If the effect of noise is fatal, we will have
to devise any possible means to cope with this problem.

These problems are not yet studied, but remain as the task
to be examined further in future.

On-line adaptation of ANN

In the simulation of the grinding circuit control shown above


are run after learning of the ANN's is over. Learning of the
controller system consisting of the ANN model and the ANN
controller required about 10 hours using a personal computer
stated above. No problem will exist for the controller to
continue learning on-line, while it is controlling the actual
process. For the combination of a small scale control system
studied here and a small computer available today, the 10
hour learning period will be considered as reasonable for real
applications.

CONCLUSION

Application of ANN for process control has been discussed.


The control system developed comprises two sets of ANN's
(the ANN process model and the ANN controller) to be in-
tegrated together to form the entity. Also, Two phases of
learning have been introduced to tune the ANN controller.
In order to deal with the mineral processes involving time
delays the input layers of these ANN's are provided with
extra neurons to hold the time series of data. Simple exam-
ple on mineral process simulation and feed back control with
process disturbance and observation noise indicates that the
ANN can be applicable to on-line process control.

ACKNOWLEDGEMENT

We wish to acknowledge Grant-in-Aid for General Scientific


Research (No. 02452226, 1990) for support of the research
work presented here.

References
[1] T.Inoue and K.Okaya : Modelling of Grinding Circuits for
Simulation Studies, Proceedings, 4th Asian Symposium in
Mineral Processing, Manila, April 1991.

[2] W.T.Miller, R.S.Sutton and P.J.Werbos : Neural Networks


for Control, The MIT Press, London, 1990.

171
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

PROGNOS: A PROTOTYPE EXPERT SYSTEM FOR FAULT


DIAGNOSIS OF THE TRANSMISSION SYSTEM OF
LOAD-HAUL-DUMP VEHICLES IN KIRUNA MINE,
LKAB, SWEDEN
N. Vagenas

Division of Mining Equipment Engineering, Lule University of Technology, Lulea, S-95187, Sweden

Abstract. As a consequence of the increasing complexity of


modern mining machinery and systems, more time is consumed
on repair following a breakdown. Equipment breakdowns cause
lengthy and costly delays in the operation of mines. PROGNOS
is a rule-based expert system for fault diagnosis of the
transmission system of Load-Haul-Dump (LHD) vehicles in
Kiruna mine, Sweden. This paper presents the problem
approach and discusses the formulation of the knowledge
base as well as the software implementation. The use of
expert systems for fault diagnosis of mining machinery can
be a cost and time effective approach of minimizing repairs
in the mines.

Keywords. Rule-based expert systems; mining vehicles;


transmission system; fault diagnosis; failure detection.

INTRODUCTION FAULT DIAGNOSIS OF MINING EQUIPMENT

An expert system is a computer program that uses The application of expert systems for fault
knowledge and inference procedures to solve diagnosis of mining machinery is considered as
problems that require human expertise for their appropriate, since human expertise is generally
solution (Ramani, 1989). Recently, several expert required in the solution process of these
system applications in mining have been reported problems. Human expertise may be considered as a
in the literature. Expert systems have been combination of intelligence, experience and
developed for selection of mining layouts, problem intuition. Human expertise can be easily lost
identification in mining machinery, mining through job transfer, retirement or job
equipment selection, etc., (Wilke and Seewald, separation. It is a waste of knowledge if human
1989; Atkins and others, 1990; Guo and Xin, expertise cannot be captured, documented and
1990; Hrtgen, 1990). maintained.

During the last few years, Swedish mining Expert systems offer the possibilities to capture
companies have shown an interest in the possible and maintain human expertise. Fault diagnosis of
use of expert systems for maintenance of mining mining equipment generally relies on the
machinery (e.g. fault diagnosis). Studies in mines experience and intuition of maintenance personnel.
have shown that the maintenance cost both indirect Furthermore, in some cases, manuals from the
and direct often constitutes 50-60% of the manufacturers contain insufficient or limited
equipment cost (Kumar, 1989). Direct costs usually information about faults and fault reasoning.
consist of costs for labor and material. Indirect
costs occur, for instance, due to loss of As it has been indicated by repairmen of
production. Load-Haul-Dump (LHD) vehicles (Fig. 1) in Swedish
mines, diagnosis of the true cause of a defect may
It has also been reported that at present, many
not be straightforward. For example, a defect
Swedish mines are operating with 80% emergency
occurring anywhere within the transmission system
repairs and 20% planned maintenance (Danielson,
of an LHD may have repercussions elsewhere on the
1987). Statistics in a Swedish mechanized mine
indicates that in average 500 to 750 operating vehicle and leads to secondary defects. Thus, at
hours per month are lost either due to machine first sight it may not be apparent which of the
breakdowns or due to non availability of machines defects actually initiated the problems.
(Kumar, 1990). In several Swedish mines, the costs Therefore, there must be a systematic approach of
for a machine to stay idle in a repair shop due to analyzing and formalizing failure information to
a breakdown can reach the level of 3000 to 5000 detect the true cause.
SEK/hour per machine (cost estimation in June
1990). Diagnostic techniques which enable identification
of the true cause of a machine breakdown quickly
Downtime of mining machinery is considered as a and effectively can reduce the repair time and the
major cause of reduced production in the Swedish associated cost in the mines.
Mining Industry. Fault diagnosis consumes often
more than 60% of the total repair time of a Application of expert systems for fault diagnosis
machine. Furthermore experienced repairmen in some of mining machinery has been reported in the
Swedish mines foresee that the use of expert literature, see for example, (Patton and others,
systems may reduce the time duration of a fault 1989; Mitchell, 1989; Price and Lewis, 1989;
diagnosis process by 20% to 50%. Bandopadyay and Venkatasubramanian, 1989).

173
The expert systems assist in finding faults for a - Identification of end-users specifications and
machine and in proposing repair-actions for computer facilities
correcting the faults. The main steps involved in
a diagnostic process and implemented in an expert - Selection of appropriate expert system
system are : development tools

- Fault detection (the expert system attempts to - Obedience to time and budget constraints imposed
detect the presence of a fault) on the project

- Fault localization (the expert system attempts Criteria that can be used to select expert system
to locate a fault to a sufficiently small projects for fault diagnosis of mining equipment
subregion of the system or component) are presented in Table 1, (Efstathiou, 1989).

- Fault identification (the expert system attempts Table 1. Criteria for selecting expert system
to identify the fault) projects for fault diagnosis of mining
equipment.
- Fault correction (the expert system suggests
corrective actions to restore the normal
operation of the component or system) TABLE 1

The quality and the performance of an expert Criteria for selecting expert system projects
system is mainly dependent upon the type, depth for fault diagnosis of mining equipment
and breadth of knowledge incorporated in the
knowledge base. Usually, for fault diagnosis of The task requires expertise
mining equipment, the following classes of The task is well defined and bounded
knowledge can be employed to develop expert The task is decomposable
systems : Algorithmic techniques are not satisfactory
Symbolic representation and reasoning are more
- Descriptive knowledge which deals with the appropriate
hierarchical decomposition of a system - it Heuristic knowledge is important
identifies and analyzes the structural and Experts are available and willing to help
functional modules of the system A payoff is expected by applying the expert
system
- Behavioral knowledge which indicates the normal The user interface is straightforward
and abnormal functionality of the components of Hardware and software are available
a system - this knowledge is based on
pre-defined failure modes or constraints that
characterize the behavior of a system In general, it is necessary for the developer of
an expert system to have sufficient information
- Operational knowledge which describes the about the equipment/component under study. This
principles of operation of a system or a demands awareness of the following issues :
component
- Equipment specifications
- Heuristic knowledge which allows a fast
association between symptoms and their most - Manufacturer's input concerning the equipment
probable causes, thus avoiding a time consuming design characteristics
complete reasoning analysis - heuristic
knowledge is often based on empirically - History of operation of the equipment in a mine
obtained information about fault-cause
relationships and fault priorities - Maintenance and repair history of the equipment
in a mine
- Diagnostic knowledge which provides information
about methods adapted to diagnose a system - Human factors related to the usage of the
equipment in a mine
- Maintenance knowledge which provides all the
information relevant to the history of - Working environment of the equipment
maintenance of a system, such as symptoms,
corrective actions, chronological order of - Economic factors related to the operation and
fault occurrences, etc. downtime of the equipment

In general, expert system technology for fault In general, the application of expert systems for
diagnosis of mining equipment is still in infancy. fault diagnosis of mining equipment can offer to
At present, most of the literature concentrates on mining companies a possibility to achieve the
the technical issues of designing expert systems, following three goals :
such as knowledge representation, inference
mechanism and user interface. Reports about the - Effective diagnosis of equipment
experience of using an expert system in a mine are
still limited. Initiating, planning, carrying out - Better education of service personnel in the
and completing an expert system project still is mine
not a straightforward task. Introducing a new
technology carries risks for all the participants - Better understanding of equipment behavior and
in the project. Thus, careful judgment is required performance
to select expert system projects.
For the equipment manufacturers, the development
Developing an expert system involves many tasks and use of diagnostic expert systems can result
beyond analyzing and solving the problem. These in :
tasks include :

- Identification of experts and other sources of - Better design of equipment


expertise in the problem domain
- Effective use of knowledge about the equipment
- Acquisition and validation of the required inside the company
knowledge
- Better documentation of the design
characteristics of equipment

174
- More descriptive service manuals of equipment A common method for representing knowledge in
for the customers diagnostic expert systems is by using rules.
PROGNOS has a knowledge base of "If..Then" rules
The Div. of Mining Equipment Engineering at Lule and typically uses a deductive approach to making
University, Sweden in close cooperation with the inferences, (Nau, 1983). This approach can be
maintenance division of the iron ore mining expressed simply by the syllogism : Given fact "A"
company LKAB in Kiruna, Sweden, has developed a and Rule "A > B" then infer "B".
prototype consultative expert system, (called
PROGNOS), for fault diagnosis of the transmission Using the rule-based deduction method, the
system of the TORO 500 LHD model manufactured by sequential hypothesis-and-test reasoning process
ARA Inc., Finland, see (Vikman, 1990). of the human diagnostician is adapted. Given an
initial set of symptoms, the expert system is
A consultative expert system is supposed to give supposed to construct a hypothesis about the
advice and guidance to the user for solving a causes of these symptoms and then search for
particular problem in a manner similar to further information in order to arrive at the
consulting a human expert. fault (Bandopadhyay and Venkatasubramanian, 1989).

PROBLEM APPROACH KNOWLEDGE ACQUISITION


LHD vehicles are used for both loading and Knowledge acquisition is a critical step in the
transportation of minerals in underground mines. design process of an expert system. The process of
The LHDs are economic for haulage on short knowledge acquisition often creates the biggest
distances and can also load trucks or rail cars. bottleneck in expert system development because of
The flexibility of these machines allows the the difficulty in accessing every piece of
extraction of irregular orebodies, thereby knowledge an expert uses in solving problems
improving the recovery of the total ore reserve. (Mitchell, 1989). The following methods were used
At present the Swedish Mining Industry has a large for acquiring and capturing the knowledge in
number of diesel powered vehicles and a few PROGNOS :
electric machines.
- Study of operations documents (e.g. TORO 500
The availability and reliability degree of these manual and fault reports written by repairmen in
machines have a significant importance to the the mine)
total productivity of a mine. To apply an
effective fault diagnosis, it is necessary to - Interviews with experienced repairmen in the
understand the failure characteristics of the mine
equipment. In general, partitioning the equipment
into object categories assists in identifying the - Studies of relevant literature (e.g. textbooks
parts of the equipment that require maintenance. A for vehicle transmission systems)
partitioning procedure can indicate critical items
and identify failure mechanisms (Kumar, 1989). Figure 4 displays the approach taken for knowledge
Thus, the LHD machine can be divided in several acquisition. The fault reports taken into
sub-systems (e.g. transmission system, electrical consideration deal with 9 electric powered and 17
system, etc.), (Fig. 2 ) . At present, PROGNOS deals diesel powered T0R0 500 LHD models in Kiruna mine.
only with the transmission system of a TORO 500 Approximately 2300 fault reports for LHDs have
LHD model. been investigated for the period Jan-Sept. 1989.
400 of these reports were found to be related to
Future upgrades are expected to include knowledge problems in the transmission system.
bases for other sub-systems of the machine (e.g.
hydraulics, electrical system, etc.). There are After a thorough analysis and classification, 96
several reasons for initiating the development of faults were identified and 102 repair actions were
the prototype expert system PROGNOS from knowledge distinguished. In the discussions with experienced
gathered for the transmission system : repairmen in the mine, questions were asked to
them for determination of important symptoms and
- The transmission system is not as complicated as faults. As knowledge and information were
other sub-systems of an LHD machine (e.g. gathered, it became possible to start formalizing
hydraulics) rules. These rules and the associated knowledge
were separated into modules which represent
- Documentation of faults in the transmission components or parts of the transmission system
system made by the mining company was considered (Fig. 5 ) .
as adequate and descriptive
The repairmen also helped in validating and
- Experienced repairmen in Kiruna mine felt verifying the contents of the knowledge base. The
confident to define and describe faults in the knowledge base for the transmission system of the
transmission system TORO 500 LHD model is constructed mostly from the
knowledge derived from the fault reports in the
The role of the transmission system is to transmit mine and the discussions with the repairmen. The
the power from the engine to the wheels. The power vehicle's manual provided limited information.
is transmitted to a converter through a fibre
pinion. From the converter, the power is
transmitted via an axle to the gear box. SOFTWARE IMPLEMENTATION

The gear box distributes the power to the front Once the knowledge base is formalized, a suitable
and back wheels through two cardan axles (Fig. 3 ) . Expert System Building Tool (ESBT) must be chosen.
The power reaches the front differential through a The choice of ESBT is guided by the nature of the
middle axle and a cardan axle. The power to the knowledge representation. In PROGNOS, the
back wheels is transmitted directly from the knowledge is captured and represented by
cardan axle to the back wheel differential. The "If..Then" rules.
differential distributes the power to the wheels
through the built in planetary gears in the hub
and through the axles, (Vikman, 1990).

175
Thus, an ESBT that is user-friendly in Guo, W. and Xin, J. (1990). Two Applications of
constructing "If..Then" type of knowledge bases is Expert Systems in Underground Coal Mining,
preferable. Other important considerations are the Proceedings of the 22nd APCOM Symposium.
hardware requirements, software portability, Berlin, Germany, September 1990,
maximum number of rules, high level language ISBN 3-7983-13741.
interface, graphics interface, interface with
databases and spreadsheets. Hapgood, W. (1987). lst-CLASS Expert System.
Instruction Manual, Programs in Motion Inc.,
PROGNOS was developed using lst-CLASS as ESBT 10 Sycamore Road, Wayland, MA 01778, USA.
(Hapgood, 1987). This ESBT offers a high degree of
intuitive presentation of the knowledge base. This Hrtgen, F. (1990). Determination and Presentation
results in a rapid development and testing of a of Escapeways by Combination of CAD and Expert
prototype knowledge base. lst-CLASS also allows systems, Proceedings of Uie 22nd APCOM
external text and graphics supporting programs to Symposium. Berlin, Germany, September 1990,
be activated by the rules when they are triggered. ISBN 3-7983-13741.
During consultation, for each piece of advice
given, the user of lst-CLASS can graphically trace Kumar, U. (1989). Maintenance strategy for
the rule logic and underlying data that leads to mechanized mining systems, Licentiate Thesis,
that specific advice, as well as, retrieve Div. of Mining Equipment Eng., Lule
detailed explanations if it is desirable. The use University of Technology, Lule, Sweden,
of this particular ESBT does not imply a personal 1989:OIL, ISSN 0280-8242.
endorsement by the author.
Kumar, U. (1990). Reliability Analysis of
PROGNOS has already been tested in Kiruna mine. So Load-Haul-Dump Machines, Doctoral Thesis,
far the tests have been encouraging. The advices Div. of Mining Equipment Eng., Lule
given by the system for fault diagnosis are University of Technology, Lule, Sweden,
considered by the repairmen in the mine as logical 1990:88D, ISSN 0348-8373.
and reasonable.
Mitchell, J. (1989). A Knowledge-based System for
Hydraulic Maintenance of a Continuous Miner,
CONCLUSION Proceedings of the 21st APCOM Symposium. Las
Vegas, USA, ISBN 0-87335-0790.
PROGNOS is a prototype rule-based expert system
for fault diagnosis of the transmission system of Nau, D. (1983). Expert computer systems,
TORO 500 LHD models in Kiruna mine, Sweden. Computers. February 1983, pp. 63-85.
PROGNOS can be considered as one of the first
attempts to explore the potential and limitations
of applying expert systems for fault diagnosis of Patton, S.B., Novak, T., Sanford, R.L. (1989).
mining machinery in the Swedish Mining Industry. Electrical Fault Diagnosis Using Expert
Systems, Mining Engineering. Sept. 1989.
Artificial intelligence is an emerging technology
in the field of computer applications. Expert Price, A.E. and Lewis, D.C. (1989). Reducing
systems as a branch of artificial intelligence can Machine Downtime with Consultative Expert
be used in applications where imitation of human Systems, Mineral Resources Engineering
intelligence and reasoning process is required. Journal. 1989, Vol 2, No 2, pp. 97-107.

Fault diagnosis of mining machinery is a research Ramani, R.V. (1989). Expert Systems in Mining
field where expert systems can be successfully Engineering - An introductory review,
applied, since human experience and intuition are Proceedings of the 21st APCOM Symposium. Las
important for diagnosing equipment faults. Vegas, USA, ISBN 0-87335-0790.

Vikman, K. (1990). Development of an Expert System


ACKNOWLEDGEMENTS for Fault Diagnosis of the Transmission System
of a TORO 500 LHD Model in LKAB's Underground
The author wishes to thank the staff of the Mine in Kiruna, Examensarbete 1990:01 E (in
maintenance division of Kiruna mine, LKAB, for Swedish), Div. of Mining Equipment Eng., Lule
their cooperation and encouragement. Thanks are University of Technology, Lule, Sweden.
also acknowledged to Gruvteknik 2000 for the
financial support. Wilke, F.L. and Seewald, M. (1989). The
Fundamentals for an Expert System, Designed to
Assist the Layout Planning of Face Operations
for an Underground Coal Mine, Proceedings of
REFERENCES the 21st APCOM Symposium. Las Vegas, USA,
ISBN 0-87335-0790.
Atkins, A. S. , Ren, T., Singh, R. N. (1990).
Applications of Expert Systems in Engineering
Risk Assessment in Decision Support,
Proceedings of the 22nd APCOM Symposium.
Berlin, Germany, September 1990,
ISBN 3-7983-13741.

Bandopadhyay, S. and Venkatasubramanian, P.


(1989). A Fault Diagnostic Expert System Model
for Longwall Shield Support, Proceedings of
the 21st APCOM Symposium. Las Vegas, USA,
ISBN 0-87335-0790.

Danielson, B. (1987). Study of Maintenance


Problems in Swedish Mines, Study Report, (in
Swedish), Idhammar Konsult AB, Sweden.

Efstathiou, J. (1989). Expert Systems in Process


Control. Longman, ISBN 0-582-04267-4.

176
An electric LHD. Tramming capacity 8000 kg.
3
Bucket size 3.2-5.4 m . Electric motor 132 kW.
Cable length 210 m. Length 9.5 m, width 2.4 m,
height 2.4 m.

Fig. 1. An LHD machine.

LHD Machine

Engine Hydraulics Brakes Transmission Tyres/Wheels Body

Static Dynamic
parts parts

Engine Cylinder Piston Crank shaft Bearings


block head

r 1 r 1

Bucket hydraulics Others such as steering, etc.

Tanks Pumps Tubes Filters

Piston Valves Seals

Fig. 2. An example for partitioning an LHD machine to identify


significant items, after (Kumar, 1989).

1. Front wheel axle 3. Middle axle 5. Gear


2. Cardan axle 4. Back wheel axle 6. Converter

Fig. 3. The transmission system of a TORO 500 LHD model manufactured


by ARA Inc., after (Vikman, 1990).

177
EXPERTS OPERATIONS DOCUMENTS RELEVANT LITERATURE

INTERVIEWED BY
MODELLER OR INFORMATION EXTRACTED AND
KNOWLEDGE ENGINEER CAPTURED BY MODELLER

MODELLER COLLATES DATA AND INFORMATION AND THEN FORMULATES


RULES AND RELATIONSHIPS AMONG PARAMETERS AND VARIABLES

PROTOTYPE KNOWLEDGE BASE

Fig. 4. An approach for knowledge acquisition.

TRANSMISSION SYSTEM

ENGINE FIBRE PINION CONVERTER CARDAN AXLE GEAR FRONT

Flywheel Throttle unit


Crankshaft Hydraulic regulator
Stator Gear box
Turbine wheel Loading pump
Pump wheel Converter
Turbine axle Oil radiator
Free wheel Filter
House Magnetic regulator group

CARDAN AXLE MIDDLE AXLE CARDAN AXLE DIFFERENTIAL

Sunwheel
Planetary wheel
Bearing
Axle tap

Fig. 5. The knowledge base is constructed by partitioning components


or parts of the transmission system of the vehicle, after
(Vikman, 1990).

178
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

THE SIMULATION OF ILL-DEFINED METALLURGICAL


PROCESSES USING A NEURAL NET TRAINING
PROGRAM BASED ON CONJUGATE-GRADIENT
OPTIMIZATION
T.J. van der Walt*, J.SJ. van Deventer* and E. Barnard**
^Department of Metallurgical Engineering, University of Stellenbosch, Stellenbosch 7600, South Africa
**Computer and Electronics Engineering Department, University of Pretoria, Pretoria 0001, South Africa

Abstract. Most metallurgical processes are ill-defined in some way owing to their complex natures.
This causes difficulty to construct adequate phenomenologicai models for such processes. It is
shown in this paper how neural nets (NN's) can be used to model ill-defined reaction systems by
using a typical metallurgical process, such as carbon-in-leach (CIL).

Multi-dimensionality (a high number of system variables), which causes problems for modelling, is
investigated. It is then illustrated how a NN trained with noisy data can be employed to reduce
dimensionality by following two approaches. Firstly, the influences of the system variables are
quantified with a perturbation analysis technique. Less significant variables are then eliminated from
a specific part of the operating domain. Secondly, the NN is used successfully to identify
mathematical relations between system variables and functions. If a relation is known, it can be
expressed mathematically. The ill-defined process domain thus shrinks further. Eventually the non-
defined variable space has a low dimensionality. The relations between functions and these variables
can more easily be learned with NN's. Such a simplified NN-model is then incorporated in a dynamic
simulator to predict the performance of the reactor system, and compares extremely well with a
phenomenologicai model simulator. A new approach to conduct process identification on the basis
of continuous data is also proposed.

Keywords. Adsorption, multi-dimensionality, neural nets, perturbation analysis.

INTRODUCTION to be a promising modelling tool to cover the


abovementioned gaps in the metallurgical modelling toolkit.
The problem with most unit operations and metallurgical
processes is that they are ill-defined in some way. This A further major problem for any modelling expert is the
places an obligation on the modelling engineer to question of multi-dimensionality. The amount of
construct adequate mathematical models to be fully information (such as system data points) required to
representative of the whole system. In weakly-defined model a process adequately, increases exponentially with
systems, such models are usually domain-specific and a rise in the number of process variables, which in turn
unreliable. The carbon-in-leach (CIL) process will be used has an influence on the process performance. In order to
here to illustrate the proposed theory of neural net (NN) explain this, consider a function / to be oepicied on an m-
modelling. dimensional variable space Rm. If i/j is the average
number of data points necessary within the j t h dimension
Recently KBS has been applied increasingly in treating ill- to represent the whole system sufficiently, the minimum
defined problems, especially in the chemical industry. number of data points needed will be
Reuter, Van Deventer and Van der Merwe (1991)
explained the application of KBS to the simulation of CIP- m
and CIL-circuits. From batch reactor simulations, they p - , W
constructed system variable profiles (such as gold /-1
concentration and gold-loading-on-carbon profiles) which
were coupled with the kinetics of the adsorption and It should now be clear that the addition of every single
leaching processes through a data base. dimension will cause an exponential decay in information
contained by a specific set of information for a system.
The search for a technique which has the ability to analyze This problem will also be addressed using the CIL-
a "black-box" system effectively is still continuing. Such a example. Also, the process of training a NN-model is
technique should be able to construct a mapping between much more complex for a system of higher dimensionality,
system variables and parameters so that this relational owing to an increase in complexity of the total mapping
mapping can be implemented as a "black-box" model. with every additional variable. It will be shown how
Most importantly, it should have the ability to learn automated perturbation analysis can be performed by
relations within those system domains which cannot means of a trained NN and its training data set in order to
readily be modelled by means of conventional techniques. quantify the degree of influence of the separate process
The connectionist network-approach studied here seems variables on the different function surfaces. Having done

179
so, variables which are identified as "low-key" variables
within a certain domain of the system, can be eliminated
and dimensionality is reduced accordingly.

DIRECT MODELLING OF A
CONTINUOUS REACTOR

In practice a model compiled on batch data needs, almost


without exception, to be adjusted considerably if it is to be
incorporated in a simulator for a continuous system.
Hence, it is proposed here that process identification
should be conducted preferably on the basis of continuous
data

The dynamics of a continuous reactor can be described Fig. 1: Architecture of a three-layer perceptron
by performing material, energy and momentum balances
over the reactor. A typical material balance equation for
substance X in a CSTR is:
calculated by their activation functions, which are linear in
-f = ,-Xou,* f P) this case. They simply take the values of the system
variable inputs. The inputs to the hidden and output
/ represents the reaction (s) which take place within the nodes are calculated by adding the products of the
reactor. This reaction term can be expressed in terms of activities of all nodes in the previous layer with the
the other three terms of the latter equation: corresponding weights of the connections which attach the
separate nodes in the previous layer to the specific node.
The activities of the hidden nodes are calculated with the
sigmoidal squashing function (act = 1/(1+exp(-input))),
while the output nodes are linear again. The outputs of
The terms on the righthand side of equation 3 should be the NN are equal to the activities of the output nodes.
measurable so that / can be calculated with this
expression. At the same instant when / is determined, Training a Neural Net
some other process variables which might probably have
an influence on / , are evaluated as well and combined The training process of NN's is an important part of NN-
with / to form a system data point. All such data points research. It involves optimizing a weight matrix of a NN so
are put together to make up an information data set, which that the NN will memorize the mapping between a number
can be used by the modelling engineer to develop a model of system variables and functions. The backpropagation
in some way or another. In the case of a NN modelling algorithm, as described by Rumelhart, Hinton and Williams
approach, this data set is employed as a "training set" by (1986), is used by the NN training program, which
a NN training program to learn the relations between the incorporates the conjugate gradient (CG-) optimization
different process variables and functions. algorithm with restart procedures of Powell (1977) as
optimization method. During each training iteration all
weights of the NN are adjusted in such a way as to
NEURAL NETS
decrease the value of an objective function. Most NN
Various NN-topologies have been proposed. However, training programs employ the LMS-error function as
recent success in connectionist network research is mainly objective function and the net used here is no exception.
attributed to the achievements in multilayer perceptrons
with backpropagation training algorithms. The architecture Mainly two steps can be identified during each iteration of
and training algorithm of the three-layer perceptron used training, viz.
here will now be discussed briefly. 1. determining the error-weight gradients of each
weight in the network by presenting all training
NN-architecture examples once to the net;
2. adjusting the weight matrix by means of an
Figure 1 illustrates the NN used for the purpose of this optimization method.
paper. The three-layer perceptron consists of three layers The first step in calculating the error-weight gradients (step
of nodes, viz. (1) an input layer, (2) a single hidden layer 1) is performed in the following substeps, for each training
and (3) an output layer. Nodes of adjacent layers are example respectively:
connected and these connections quantified with weights. (i) A training example is fed to the input layer. The
The weight matrix of a trained NN contains the information activities of all nodes in the net are calculated
about a system under investigation. Each node is during a feedforward step as described in previous
characterized by an activation function (linear or sigmoidal paragraphs.
in this case). During a feedforward calculation of the NN, (ii) The value of the error function at each output node
the values for the system variables as contained in a is calculated.
training data point, are fed as inputs to the input nodes. (iii) During a backward sweep the error-weight
Each system variable is associated with a different input gradients throughout the whole weight space are
node. An additional input node, the bias node with a determined via the procedure of backpropagation.
constant value of 1, provides extra degrees of freedom
which enables a fitted curve to be moved up or down in It is vitally important that an effective optimization method
the variable space. The activities of the input nodes are be used during step 2. The CG-method is usually able to

180
locate a minimum of a multivariate function much faster Gold balance on carbon for the / t h stage:
than the momentum algorithm which is customarily
employed with backpropagation. Furthermore, its memory
usage is in the order of N (number of weights) locations. W,- Q
SPC<!M
Also important to note is that the CG-technique eliminates dt (e, +e 0 )
the choice of critical parameters, such as the learning rate (5)
+4
and momentum parameters of the momentum algorithm.
On the other hand, like all gradient descent optimization (^>
techniques, the CG-algorithm can converge into local
minima, which is a major drawback.
Gold balance on ore for the / t h stage:

dGj
MODELLING A TYPICAL
METALLURGICAL PROCESS
M,
dt 7T^ G *
(e, +e_ )

<V-1
Qfs,.,PoGM (6)
A known phenomenological model for the CIL-process +e
was used to generate data points during a single
K, H>
simulation run of a continuous CIL simulator for a 5-stage ^ V O ^ f l , * *
cascade. These data points were determined at
consecutive time steps for all 5 stages in the cascade, and
the data points represented a very broad and practical
range of the CIL-variable space. It constituted a training The volumetric flow rate of the overflow slurry stream (Qfs)
set which could be used to train a NN-model for the CIL- can be calculated as follows:
system. It is shown here how a NN-model for a specific (7)
O, I Of
ore and carbon type can be developed and refined by
analyzing the training set, quantifying the influence of the
different variables and by reducing the dimensionality Q f is the voiurnetric flow rate of the siurry input to the first
accordingly. stage. During the intervals when no carbon transfer takes
place, Q s is non-existent and Q fs equais Qf for all stages.

-,, f2 and l3 are the kinetic reaction terms and are


described by the following equations according to the film
diffusion model for adsorption and an empirical expression
for the process of leaching:
6k,W,
{CrCs) (8)
Pc<*c

-k,M,(QrQrf (9)

1 , Qr
(10)
- (Wa)
Last stage]
N
El u ted A Freundlich isotherm describes equilibrium at the carbon
carbon
4
Q* surface, so that

Fig. 2: Schematic diagram of the CIL-cascade cs - <-jr (11)

where A and n are the equilibrium paran'raters, specified to


The mass balances of the gold in the different phases are be 6 and 0.2 respectively for the specific carbon. Further
described by equations 4 to 6 and were constructed for specifications of the carbon is a density (pc) of 900 kg.nrf3
the cascade illustrated in Fig. 2. With the exception of the and an average particle diameter of 1.4mm. The
first and last reactor stages, these balance equations are adsorptive and leaching coefficients (kf & k,) are assumed
applicable for each intermediate stage in the CIL-cascade, to remain constant at 10"5m.sec"1and 1.2 kg ore .g Au " 1 .sec" 1
as portrayed schematically in Fig. 2. C is the gold respectively. The grade of gold in the ore at infinity (G")
concentration in water, q represents the gold loading on was specified to be 8x10" 6 g Au .kg ore " 1 .
the carbon and G the degree of gold in the ore.
Volumetric fractions for water (/), carbon (c) and ore (o) During a typical simulation run with the phenomenological
are expressed by e. model simulator, fv f2 and f3 were calculated and the
values of C, q, G, M (mass of ore in the reactor) and W
Gold balance in liquid phase for the / th stage:
(mass of carbon in the reactor) registered together with
the three reaction terms. At random vaiuas for M (400-500
s CM tons) and W (between 10 and 25 tons) vi/ere generated at
each time step for every stage. This was done at fixed
reactor volumes of 750m 3 for all stages. The values of f-\,
(4)
f2 and f3 of these data points (variable-function pairs with
5 input variables and 3 output functions) were then
randomly corrupted with 20% Gaussie. noise. The first
training set thus contained noisy data which is typical of an

181
industrial system. After scaling the values of the data TABLE 1 Training Results of 6 NN's converged upon
points, this training set was employed to train a NN which by a Training Set with 20% Gaussian noise on f v f2 & f3
in turn was used to perform perturbation analyses on the - (all nets have 12 hidden nodes, but different random weight
training data set. matrices at the start of the training runs)

Number of training Average absolute error normalized as %


Training the Neural Net iterations of 4 |

Noisy data NN-predictions \


It is important largely for three reasons to scale the input
and output values of the variable and function values. 1567 3.34 1.57
1106 3.22 1.48
Firstly the input node activities of the NN should be
3455 3.56 1.20
bounded so that the weights between the inputs of the 2825 4.21 1.52
sigmoidal hidden nodes are sensible (their absolute values 3214 3.21 1.31
should not be too large). This will cause extremely slow 1918 3.67 1.47
training convergence. Secondly it is imperative that the
input values to the net are of the same absolute order. 2348a 3.54a 1.42a I
During training each weight is adjusted according to the
overall smallest error-weight gradient. The error-weight
gradient is directly proportional to the magnitude of the values. Note that these errors were calculated as the
input node according to backpropagation. A very small average absolute differences between the specific scaled
input node will thus prevent the weight that connects it to function values and the model-predicted scaled function
the hidden layer, to make a significant contribution to the values ("true" scaled values) for all training data points and
objective function and will slow down the training process. over all three output nodes. The number of training
Thirdly, whilst learning is complicated if highly curved iterations is also significantly lower than would have been
function surfaces should be found, NN's can easily learn the case for a training program with a momentum
smooth relations. optimization algorithm.

For these reasons, the values of the input-output pairs are One of these NN's was used to complete a perturbation
scaled as illustrated in Fig. 3. The variables C and Q analysis on the training set. This net will be referred to as
varied over 3 to 4 orders of magnitude (0.001 to 10) NN (A) below.
and were effectively scaled logarithmically. In order to do
comparable perturbation analyses the range of all scales Automated perturbation analyses
variables was specified to be between approximately 4 and
-4. NN's with activation functions such as this seem to It is very difficult and sometimes impractical to perform a
control such input ranges effectively. All variables were perturbation analysis on noisy data points gathered from
scaled accordingly. an industrial system. Some techniques such as factorial
design have been developed to analyse data points of an
Six NN's with 12 hidden nodes each and different random ideally defined system. Noisy data limit the applicability of
weight matrices at the start of the training runs, were such linear perturbation techniques.
trained on a 2000 noisy data point training set constructed
as described earlier. The training results are displayed in A NN automatically identifies smooth function curves which
Table 1. From these results it is clear that NN's handle enable its convenient use in perturbation analysis on the
noise well. The average absolute errors (normalized as a whole system. This is done in the following manner: For
% of 4), when compared to the "true" scaled a given variable space coordinate the value of a function
phenomenological model calculations for the input variable is calculated by the trained NN (note that the scaled NN
values in the training set, show that the trained neural nets' output value is scaled back to the function value). This
predictions of ^, f2 and f3 are much closer to the desired predicted value is the perturbation midpoint. The scaled
scaled values as is the case for the noisy scaled function value of a specific variable is now adjusted to both sides
of the midpoint with the same constant absolute value,
while the other variables are kept constant. The values of
the function at these two new coordinates are predicted by
the NN. The absolute differences between the midpoint
and the two discriminate perturbated function values are
divided by the absolute midpoint value, and the average of
these two relative values calculated. This average is
expressed as a percentage (see Table 2). This is done for
all variables (dimensions) for a specific data point. At a
specific location within a system's variable space the
influences of the different variables on the separate
functions are now quantified so that the magnitudes of
these average perturbated values can be compared
directly. Non-significant variables can now be eliminated
from the variable space within the relevant domains.

For this specific CIL-system where the input variables were


scaled between 4 and -4, a low constant value of 0.1 was
selected to adjust a certain variable value to both sides of
the midpoint. The normalization constant is chosen to be
0.1, which will prevent the NN from being used in its highly
Fig. 3: NN (A) trained on noisy data for the CIL-process inaccurate extrapolative domain. The importance of

182
TABLE 2 Perturbation results of five randomly A simplified NN-model
selected data points, using MM (A)
As was expected, the perturbation results showed that f-j,
f2 and f3 are functions of the following variables for specific
1 Data point coordinates \ ore ana carbon types:

data Cx10 3
qx103
Gx10 4
M W forq > 1 g.kg'1:
\\ point C [g.rrT3] [gkg1] [g.kg1] [tons] [tons] | U ^ (0, q, G, M, W) (12)
f2 = e(C, q, W) (13)
1 3340 7380 5.0 450 18
2 750 4610 3.1 forq < 1 g.kg'1:
3 125 1500 2.2
4 26 337 1.6 U = (0, G, M, W) (14)
5 8 91 1.3 f2 = e(C f W) (15)
Average relative perturbation values for U: \ for all values of q:
1 29.25 33.64 14.07 1.57 3.12 f3 = 0(G, M) (16)
2 11.75 3.65 1.62 0.52 2.00
3 29.02 1.14 3.01 0.14 2.90
4 32.82 0.65 11.45 0.48 2.97 Identifying mathematical relations. NN (A) was also used
5 61.91 2.10 35.93 1.84 8.15 | to confirm the linear relationships between f2 & W, as well
as f3 & M. In the case of f2 & W the first three data points
Average relative perturbation values for f9: \
of Table 2 are used to illustrate how this is done. For
1 26.92 22.84 6.16 0.74 1.63 each of these data points the value of W was varied within
2 10.05 1.47 1.89 0.43 1.62
3 22.68 3.49 0.62 0.33 2.41 its data range, while the values of the other 4 variables
4 23.58 0.57 2.01 0.09 1.96 remained constant. f2 was calculated with NN (A) at
5 24.96 .44 6.02 ! 0.24 2.02 || different l/lZ-values for each data point. These NN-
Average relative perturbation values for f^: \ predictions of f2 were plotted against W for each data point
1 0.32 0.61 1.24 0.15 0.05
in Fig. 4. This figure confirms the direct proportionality
2 0.64 0.55 8.01 0.71 0.05 between f2 and W. The linear curves can be forced
3 0.99 0.27 10.09 0.46 0.28 through the origin, so that f2 can be expressed as follows:
4 0.08 0.19 12.19 0.44 0.09
I 5 0.23 0.68 14.13 0.34 0.13 I
f2 = W.r2, (17)
where r2 = a(C, q) if q > 1.0
scaling all variables to approximately the same range and r2 = a(C) if q < 1.0.
should now be apparent. Perturbation analysis was done
on each data point of the training set for C, q, G, M and W The same procedure was performed for f3 and M, and
consecutively, and their influences on a!l three functions f1s equation 18 displays the simplified functions.
f2 and f3 determined. To illustrate the effectivity of this
proposed technique, 5 randomly chossn data points of the ^3 = M.r3, ( 18 )
training set are used to do perturbation analyses with NN where r3 = (G).
(A). The perturbation results are summarized in Table 2.
It was assumed that some a priori knowledge about the
CIL-system is available. It should be highlighted that the
Perturbation calculations tor tne goic concentration C NN approach that is proposed here does not tend to
show that functions fA and f2 are very sensitive to changes
of C, while C does not have any significant influence on f3. replace all existing modelling techniques and available a
The gold loading q seems to be strongly related to f^ and priori knowledge about the system. This modelling
f2 at high gold-concentrations and -carbon-loadings, but approach should contribute to the modelling toolkit. If it is
loses this effect rapidly towards lower values for these a known fact that the gold mass balance in the liquid
system variables. This corresponds with the model phase is dependent on an adsorption and leaching
predictions generated in terms of the exponential
Freundlich isotherm. Again q has a negligible effect on f3. process, equations 17 and 18 can be combined to
Although the perturbation results for the gold grade G are describe .
less apparent for these 5 data points, it could be shown
that G has higher perturbation values for functions fA and U = M.r3-W.r2 (19)
f3 than for f?. It should be noted that the range of a
specific variable plays a significant role in the order of
magnitude of the perturbation values. For example, M
varies from 0.4-0.5 tons, while C can take on any value
between approximately 0.001 and 10 gA U - m According
to the CIL-phenomenological model, any perturbative
change of a log-scaled C which is done with the same
perturbation constant as employed for a perturbation
analysis on M, the former will register a higher sensitivity
level than for a linearly-scaled M or W. This is the reason
for the lower order perturbation values of M and IV. The
same principle holds for M and W mutually. W spans over
a wider relative range than M. Hance, Table 2 shows that
W has a more significant effect on f2 than on f3, and vice
versa for M. The perturbation results thus conform with all
phenomenological model properties.

Consequently, these results can be used to simplify the


system. G and M are eliminated as variables for f2, while
the same is done with C, q and W in the case of f3. A cut-
off point in the (/-dimension for f2 is identified at q=1. This
is done after realizing the unimportance of q at low q-
values. The simplified NN-system will thus employ two
different NN's for the two separate q-intervals. This is Fig. 4: Relation between f2 and W
described later. as predicted by NN (A)

183
CONCLUSIONS
Gold concentration Ig.m

A new approach to conduct process modelling on the


basis of continuous data collected directly from an
industrial reactor, was proposed and applied to modelling
a CIL-process. Multi-dimensionality as a problem to any
modelling technique, was addressed with different
approaches which utilized a NN trained on noisy data.
The dimensionality was reduced and a very accurate NN-
model could be constructed for the CIL-process. This
I.O model was incorporated in a dynamic simulator which
0 20 40 60 predicted the performance of the CIL-process successfully.
time [hours]
phen. model * NN-model I - * - NN-model II Critics of NN's as modelling technique have valid doubts
if the dimensionality-problem is not solved adequately.
Fig. 5: Comparison between Concentration Profile Further work is conducted to refine and automate the
Simulation Results for stage 1, as predicted by the proposed techniques for reducing dimensionality. This
CIL-simulator incorporating different Models. also includes the design of a new NN-topology which
should be able to reduce dimensionality, even within the ill-
defined variable space.

According to these relationships, a simplified NN-model REFERENCES


(NN-model I) with three NN's (two nets for each q-interval
for r2 and a net for r 3 ) was constructed. These NN's were Powell, M.J.D. (1977). Restart procedures for the
trained on exactly the same training data with the conjugate gradient method. Mathematical
exception that the data set did not contain noise. The Programming, Vol.12, pp. 241-254.
NN's of r3 and r2 (lower q-range) had only one input node Reuter, M.A., Van Deventer, J.S.J., Van der Merwe, I.W.
(the bias node ignored), while the NN for r2 within the (1991). The application of knowledge-based
higher g-range was specified to have two input nodes. systems to the simulation of gold extraction
The dimensionality of the "ill-defined" parts of the system processes. Minerals Engineering, Vol.4, No.2,
has been reduced considerably from a 5-dimensional one pp. 103-119.
to an ill-defined dimensionality of only 2 (function r2 for q Rumelhart, D.E., Hinton, G.E., and Williams, R.J. (1986).
> 1.0) and 1 (functions r 2 for q < 1.0 and r3) by means of Learning internal representations by error
the two techniques described above. NN-model I was propagation, in Rumelhart, D.E. and McClelland,
used to replace the phenomenological model in a dynamic J.L (Eds.), Parallel Distributed Processing, Vol.1,
CIL-cascade simulator. At each time step, f1s f2 and f3 MIT Press, Cambridge.
were calculated by equations 17, 18 and 19.

LIST OF SYMBOLS
Comparing simulation results. Another NN-model (NN-
model If) consisting of three NN's (one net for each of f^, C Gold concentration [g.m" 3 ].
f2 and f3) was trained with the same data points used for of Average diameter of solid particle [m].
training NN-model I. Each one of these NN's had five f Reaction term in mass balance.
input nodes for the five variables as explained. The G Gold grade in the ore [g.kg" 1 ].
reason for splitting up the different output features of NN kf Adsorption constant [m.sec" 1 ].
(A) was to simplify the training processes to find suitable k| Leaching constant [kg.g~1.sec~1].
mappings between f^, f2 and f3 and the 5-dimensional M Mass of ore in a single reactor [kg].
variable space more easily. The training sets for these p Density [kg.nrf 3 ].
NN's were also noiseless. The convergence errors for Q Volumetric slurry flow rate [m 3 .sec" 1 ].
these trained NN's compared well to the convergence q Gold loading on the carbon [g.kg -1 ]
errors of the NN's in NN-model I. NN-model II was also t Time [sec].
incorporated in the simulator and the simulation results of V Reactor volume [m 3 ].
the two NN-model simulators compared to the results of W Mass of carbon in a single reactor [kg].
the phenomenological model-simulator. All NN's of NN-
model I and // were defined to have 7 hidden nodes. The Greek symbols
number of NN-weights (degrees of freedom) of the two
NN-models are thus the same, so that their simulation e Void fractions.
results can be compared as well. Some simulation results
of reactor stage 1 for the three models referred to, are T Residence time of reactor stream [sec].
shown in Fig. 5.
Subscripts
The simulation results with NN-model I are highly superior
to the results of simulations with NN-model II, if compared c Carbon.
to the simulation results of the phenomenological model- I Liquid phase.
simulator, which are viewed to be the standard. This is i i t h reactor or stage.
mainly due to a large reduction in dimensionality and an in Reactor entrance stream.
associated improvement of the model. o Ore.
out Reactor exit stream.

184
Copyright IFAC Expert Systems in Mineral and PLENARY PAPERS
Metal Processing, Espoo, Finland, 1991

REQUIREMENTS AND TECHNOLOGIES FOR


OPERATIONS MANAGEMENT DECISION SUPPORT
SYSTEMS
L.K. Nenonen
National Research Council, Canada

INTRODUCTION

Decision Support Systems (DSS) apply intellectual The objective of this paper is to discuss general DSS
and computer technologies to help improve decision- requirements, supporting technologies and their ap-
making (Keen 1986). The DSS concept was con- plication for decision support. It is based on a rather
ceived in the business sector in the mid-1970s as a wider than deep look at the literature.
means to better utilise Information Technologies
(IT) to support strategic, financial and investment 2.0 GENERAL DSS REQUIREMENTS
decisions. The concept has been adopted in other
sectors to support design and operations manage- To develop a specific DSS, explicit knowledge is
ment decisions and has served to focus attention on needed about the organisation, the types of deci-
the ultimate objective of all IT applications, namely sions/problems to be assisted, the types of decision-
to support decision-making. While the literature on makers to be served and their individual prefer-
the topic is extensive, work is needed to integrate the ences. The DSS should advise users of the strengths
different approaches and IT-based support tools. and limitations of the IT-based tools in the context of
the problems to which they are being applied.
Operations management (OM) encompasses those
decision-making tasks requiring human participation Decision-makers apply experience, intuition and
and excludes those decisions taken automatically by knowledge to analyse a situation and to decide on ac-
plant process control systems (PCS). However, since tions to be taken. A DSS should encourage the user
OM decisions can be implemented by the PCS (e.g. to be systematic and to increase the scope of factors
via changed set-points), they must be taken using normally considered. The DSS should embody and
knowledge of the PCS. As part of OM, managers set reinforce an organisational unity of purpose and of
production, quality and cost targets, planners/sched- spirit, a set of 'ethical guideposts' and 'intellectual
ulers create plans/schedules to meet those targets and quality controls' to improve the consistency and
operations supervisors deploy available resources to quality of the decisions (Keen 1986). The impor-
implement those plans/schedules. Thus, OM can in- tance of understanding the rationality of the organi-
volve a variety of decisions taken by different deci- sation a DSS is to support is also stressed by Guariso
sion-makers with effects over different time dura- andWerthner (1989).
tions. Unexpected events and uncertainties about
their durations (e.g. equipment breakdown and re- Sequences of related decisions are often made in the
pair) complicate these decision-making tasks, partic- context of a problem (e.g. A refiner has broken
ularly those of operations supervisors. Union and down. As production supervisor, what should I do
safety rules, individual and company rights and ex- and how and when should I do it?). In situations
pectations and internal and external environmental where all data and knowledge is available, the deci-
constraints must be respected. Thus, decision-mak- sions could be taken by decision-making processes
ing must proceed in a timely way and must consider coded in the DSS. If some data are unknown or un-
many factors. certain or the problem itself is not clearly identified,
the decision-maker will need to play a larger role.
DSSs for OM (DSS/OM) will need various types of
IT-based tools and user assistance to apply them ef- Feelings of powerlessness and of information over-
fectively for data management and decision-making. load can adversely affect the performance of deci-
Optimisation models can assist planning and schedul- sion-makers. Providing them with more information
ing tasks. Simulation models can assist operations su- may not improve their performance. Information
pervisors to evaluate outcomes of their decisions. overload contributes to the gap between what we
Expert System (ES) technology can be used to store know and what we understand (Wurman 1989). A
and make available knowledge about how to solve DSS should help the decision-makers consider more
problems, how to use the IT tools in the DSS to help facts when making a decisions. It should work in
solve them and to implement operating rules in the harmony with and enhance the user's process of 'se-
context of simulation models. While DSS/OMs for lective attention' by which he/she controls the vol-
specific plants (e.g. an underground mine, a pulp ume of information considered at any moment.
and paper mill, a manufacturing plant) must satisfy
specific local requirements, they should also satisfy
requirements considered vital to all DSSs.

185
Keen (1986) believed that DSSs should be more 'ac- 3.0 BRIEF REVIEW OF SUPPORTING TECH-
tive' (i.e. encourage new, better quality solutions) NOLOGIES AND APPLICATIONS
and less 'passive' (i.e. encourage acceptance of con-
servative, lower quality solutions). For example, an A DSS combines intellectual and computer technolo-
optimiser in an 'active' operations management DSS gies. Intellectual technologies include the knowledge
could suggest a solution which had not occurred to residing in the decision-maker and in the DSS soft-
an operations supervisor. ware, including knowledge about how to use the
knowledge for decision-making. Computer technolo-
To cope with our increasingly complex technological gies are the hardware/software technologies used to
society, people are tending to acquire more spe- store information and knowledge and to make these
cialised training and experience. Since some DSS available for decision-making.
users may be relatively specialised and narrow in
their scope of reasoning, DSSs should encourage, if 3.1 Intellectual technologies
not force, users to take a broader perspective of the
potential impacts of decisions being considered. Much effort has been devoted to determine how hu-
Changes in staffing, equipment and procedures are mans observe, think, learn, solve problems and
expected to occur with increasing frequency and the make decisions and to determine how to code and
experience available for decision-making is expected use this knowledge via computer-based systems.
to become less broad and deep. Therefore, a DSS
should be adaptable and its role is likely to increase McKenney and Keen (1974) categorised the thinking
in importance with time. styles of decision-makers (e.g. preceptive, receptive,
systematic) and developed a model of cognitive style
to describe them. Smith (1988) developed a theory
DSS users may not be knowledgeable about the IT of problem structuring, assessed a number of meth-
tools in the DSS. A DSS should be designed to min- ods and proposed a way to deal with managerial
imise the likelihood that the user may misinterpret problems. Lewandowski and Wierzbicki (1988) de-
results it generates. Where the context is well-de- scribed work at the International Institute for Ap-
fined, the DSS could indicate which tool would give plied Systems Analysis (IIASA) on aspiration-based
the most reliable results for the decision at hand. decision analysis and support. IIASA concluded that
a diversity of model-based tools are needed to sup-
The DSS user-interface influences the degree to port decision-making, that such tools cannot deal
which the intellectual capabilities of the DSS/deci- with all human concerns and that a DSS should pro-
sion-maker team can be effectively applied. Coombs vide an interactive decision analysis tool for learning
and Alty (1981) list the following independent vari- about decision problems rather than for selecting
ables which define the DSS requirements: (1) The optimal solutions.
users job (determines the relevance of the DSS to the
user), (2) The decision-making task (specifies the in- Dreyfus and Dreyfus (1986) categorised the stages
formation and processing needs), (3) Discretionary by which adults acquire new skills as follows:
powers of the user (degree to which the user can de- novice, advanced beginner, competent performer,
cide what he does and how he does it), (4) Role of proficient performer, expert performer. One DSS
the user (behaviour expected of the user), (5) Per- role is to make the knowledge of the expert available
sonal factors (personality, attitudes, motives, intelli- to other categories of decision-maker. The expert
gence). Based on these independent variables, user performer sees by intuition what needs to be done
requirements are developed under the following cat- and decides how to do it. A DSS should help deter-
egories: (1) Task needs (information, information mine what needs to be done and help decide how to
processing), (2) Support needs (Advise user what the do it.
DSS can do, how to ask it to do it), (3) Expectations
(what the user is prepared to do and not do), (4) Insights into development and use of DSSs can be
Psychological needs (job interests, autonomy over derived from research in man-machine systems on
decisions). Coombs and Alty also discuss other as- topics such as task analysis. Harker and Eason
pects such as structured/unstructured tasks, impact (1985) believe that the functions and roles of IT sys-
of DSS design decisions on pre-structuring tasks, tems such as DSSs be defined first, then the informa-
variabilities between users, and focus of control be- tion requirements. An emphasis on systems analysis
tween the user and the DSS. tends to lead to systems ill-suited to perform the
functions needed by their users. The functional
Results from a computer users survey (Rushinek and needs of a DSS can be defined by analysing the deci-
Rushinek 1986) indicated some important require- sion-making tasks of the potential users. Harker and
ments to be met by a DSS. It should meet user Eason describe an 'open-system' task analysis com-
expectations re speed and support, give the user con- bining functional analysis of the task and an analysis
trol and a feeling of independence. A DSS should of user characteristics to determine the system speci-
detect and advise the user when his application is ap- fications. Payne and Green (1986) describe a goal-
proaching the capability limits of the DSS and en- based 'task-action grammar' approach. It involves a
sure that the user never loses control of the system. planning step in which the current state is compared
with the goal state and a sequence of simple tasks is
determined that will lead towards the goal state.

186
Vicente and Rasmussen (1988) describe a cognitive DSS for steel mill production planning which uses
architecture for DSSs for industrial process control. intuitive knowledge to permit use of LP models. Di-
As use of automation increases, process operators rect use of such models was not feasible due to prob-
deal with higher-level supervisory control or opera- lem size and the need to deal with uncertainties. Wu
tions management tasks. Cognitive analysis of such et al (1990) developed a DSS for managing produc-
tasks helps match their characteristics to the appro- tion of diverse small-volume orders. The DSS in-
priate level of DSS and to the decision-making cludes a database (dBASEIII Plus), a model (UNDO
agents: designer, operator, computer. By listing the Integer Programming) and user dialogue systems.
designer as a decision-making agent, Vicente et al Inventory costs are minimised using an interactive
highlight the fact that the DSS design will influence heuristic procedure (Model dialogue system) to
the way decisions are made using it. Vicente and solve a large bicriterion (Minimum total earliness,
Rasmussen believe that ES technology could be used total flowtime) integer programming problem.
to make the DSS design knowledge available to the Menus and question/answer dialogues help the user
users. They describe six levels of decision support create feasible schedules by inserting or deleting idle
ranging from data validation and display to com- times from schedules proposed by the model. These
pletely automatic decision-making. They have devel- examples illustrate the need for interface modules to
oped an interface framework to match the DSS to optimisation models to help users formulate their
the process and to the decision-makers. problems in the required forms and to guide their
operation to improve their performance. Explana-
Gaines (1988) reports that empirically-based design tion, modules are also used to explain the results in-
recommendations for person-computer interfaces cluding the potential impact of the assumptions and
have grown from some 10 rules in the 1970s to over approximations required to cast real problems into
1000 rules in the 1980s. He described a conceptual the rather rigid forms associated with the algo-
framework as a basis for generalizing such rules and rithms. Users will probably need some understand-
as a guide for system design. This framework should ing of how the algorithms function and of the impli-
encompass both 'causal' and 'anticipatory' models of cations of the assumptions on which they are based.
human behaviour.
'Symbiotic Systems' is another general approach to
3.2 Information Technologies (IT) decision support. Manheim et al (1990) described a
'Symbiotic DSS' (S/DSS) for production planning
ITs to assist operations management decision-making and scheduling. A 'Computer-Directed Process Man-
include: Database management, modelling (proba- ager' based on a model of the user works indepen-
bilistic, analytical, event-based), mathematical pro- dently of user direction to assist in problem solving.
gramming (Linear, nonlinear, dynamic), Expert Sys- The S/DSS sits outside a commercial production
tems, neural networks, user interfaces, object-ori- planning and scheduling package (Schedule-X by
ented design and programming, spreadsheets. Numetrix Inc. of Toronto, Canada) and helps the us-
er to develop production schedules. Scriabin et al
In the context of any given decision-making task, (1988) described a Symbiotic System using visual
each type of IT (and its specific form of implemen- problem representations. In a nurse scheduling ap-
tation in the DSS of concern) has strengths and limi- plication, proposed schedules are presented visually
tations which should be made known to the decision- such that the scheduler can quickly spot rule viola-
maker. Accuracy and validity of results produced tions (Bell et al 1990). These examples of Symbiotic
and the time performance are two concerns. The systems illustrate two effective techniques for DSS,
DSS should guide the user to the best tool for each (1) use of user models to assist application of 'tech-
decision in the task at hand. nology codes' and, (2), use of visual representations
of problem solutions to facilitate their validation us-
Some tasks can require combined or sequential use ing the decision-makers visual pattern-matching ca-
of different types of tools, for example, expert sys- pabilities.
tems and simulation models. Problems can arise in
linking particular products together to service com- DSS generators are high-level end-user languages
bined applications. facilitating development of models primarily for
business decision-making. They are interactive, pro-
Mathematical programming tools are used to seek vide common financial functions and, in some cases,
optimal solutions to problems that can be cast in the Monte Carlo simulation and tools for developing and
rather rigid forms required to permit application of evaluating what-if scenarios. Sharda (1988) reported
the algorithms. Kribbe et al (1987) developed user- on an experimental study of the effectiveness and ef-
friendly software to help formulate problems in a ficiency of a DSS built using a DSS generator called
nonlinear programming form and an algorithm to 'Interactive Financial Planning System' (IFPS). Con-
transform the NLP into sub-problems solvable by clusions were that use of the DSS resulted in more
conjugate gradient methods. Shaw et al (1984) de- effective decisions ( measured in terms of net earn-
veloped a user interface for an optimisation model ings), investigation of more alternatives, increased
which included a model management system, a dia- confidence in the decisions and a reduction in vari-
logue module and an inferencing module to explain ance in profit performance. Benson used a DSS gen-
model results. Nambiar et al (1988) developed a erator called FCS-EPS developed by Thorn EMI to

187
develop a simulation model to evaluate economics of Rubin et al (1988) describe an architecture for an
installing a monitoring and control system (MCS) in ES that can function as an 'operator's associate'
a potash mine. The model included an operating (OA) in supervisory control of a complex system.
schedule for machine and crew assignments and An OA is a support system architecture with control
maintenance work. Impact of installing the MCS was and understanding properties to enhance human abil-
investigated by varying the breakdown durations. ities, overcome human limitations and complement
human preferences. Provision of understanding
It may be feasible to develop DSS generators for op- properties requires an 'operator's function model
erations management purposes (e.g. a DSS generator (OFM) to represent interrelations between dynamic
to facilitate development of DSSs for operations system states and operator functions. Operator
management of mineral processing plants). knowledge comprises static knowledge of the sys-
tem, dynamic knowledge of the current system state
Modelling and simulation techniques have been used and procedural knowledge about how to carry out
to support decision-making, sometimes in combina- operator functions. To be useful as an OA, the
tion with other ITs such as Expert Systems (ES). Fil- knowledge required by the OFM must be imple-
ip et al (1989) described a layered modelling ap- mented by an ES. The OFM is used to infer the in-
proach where external models are converted to an tent of observed operator actions (intent inferencer).
internal form. Moser (1989) used an ES to analyse A blackboard framework of problem solving was
model results, draw inferences and make recommen- chosen to represent the inferences and carry out the
dations. Ulgen and Thomasma (1989) developed a assessments of the OA.
'Simulation Program Generator' with a natural lan-
guage interface to help decision-makers build pa- Rosenbloom et al (1985) describe the application of
rameter, event or icon-based models of manufactur- Soar to develop a more efficient version of Rl, an
ing plants. 'Qualitative Physics', the art and science OPS5-based ES for configuring DEC computer sys-
of symbolic reasoning about physical systems can be tems. Soar, based on OPS5, is another problem-solv-
a source of ideas on development of simple models ing architecture where problem-solving activity in-
representing only features relevant to the problem volves attempts to satisfy goals via heuristic searches
being solved (Shrager 1988). Lee (1990) discussed in problem spaces. Soar uses a mechanism called
the combined use of the efficiency of a rule-based 'chunking' to create new production rules that allow
ES with the robustness of a model-based system. the system to perform actions directly rather than
Falster (1987) described use of data-driven models via problem-solving on subgoals. Tests on four
combined with heuristic rules for planning and unibus configuration tasks showed a reduction by
scheduling. These examples contain ideas on how factors of 20 to 200 in the number of decisions re-
various types of models (external/internal, parame- quired.
ter/event/icon-based) could be used, sometimes in
combination with ESs, to support decision-making.
Realtime ESs can communicate with sensors and ac-
tuators in the process environment, perform under
4.0 DSS ARCHITECTURES/LANGUAGES time constraints and do multiple tasks (Diaz 1990).
While such ESs perform process control functions,
A DSS architecture or language should facilitate in- they may also be able to implement operations man-
tegration of the IT tools needed to support the tar- agement functions to support operations supervisors.
geted decision-making tasks. Various options should Diaz reports that realtime ES products are available
be considered before making a selection. which can communicate with database management,
spreadsheet and other foreign software and can man-
Chalfan (1986) described use of a knowledge repre- age operation of multiple ESs. Realtime ESs are be-
sentation language called the 'Meta-level Representa- ing developed using blackboard architectures (Diaz
tion System' (MRS) to integrate existing procedural and Orchard 1990, Green 1985) which could facili-
codes for aerospace design applications. tate implementation of DSSs for operations manage-
ment.
Jagannathan et al (1989) describe a 'Blackboard' Reitman (1990) describes development of two Al-
model, a conceptual, high-level organisation of in- based LISP-based 'generic' executive DSSs called a
formation and knowledge and a method for its dy- Management Adviser and an Operations Adviser
namic control and use for problem-solving. It can (OA). OA helps managers develop manufacturing
encompass multiple knowledge sources, different ap- strategies accounting for capacity utilisation, inven-
proaches to reasoning and support step-by-step tories, lead times and unit costs. The Palladian Soft-
problem solving. Corkill (1991) describes a black- ware Language was used which combines object-ori-
board-based approach involving integration of ented programming, production rules, model-based
OPS5, a rule-based ES language, with the University reasoning, goal-directed control and constraint-
of Massachusetts Generic Blackboard System. based inferencing. A graphic 'roadmap' mechanism
Corkill describes the requirements of a problem- shows the user all the places to which he can go at
solving architecture that can be embedded with other any time during an analysis. Due to their experience
architectures and that can coexist with multiple in- in development and application of business-oriented
stances of itself and of other problem solvers. DSSs, suppliers of such systems have developed lan-
guages or tools that integrate all the needed func-

188
tions. Adaptability and compatibility of these lan- 6.0 REFERENCES
guages/tools to operations management applications
should be investigated. Even if they cannot be used Bell,B.,Place,S.,Scriabin,M.,Zilm,D. (1990). A
directly, perhaps some of their concepts and design Symbiotic Nurse Scheduling System. Canadian Op-
features can be adapted. erations Research Society Conference, May 1990,
Ottawa, Canada.
5.0 RECOMMENDATIONS
Benson,G. (1985). DSS Deployed for Mine Simula-
The following check list contains features to consid- tion Model. Software Canada, V.4, N.9, October
er when developing or selecting a DSS: 1985.
1. Support a systematic approach to decision-mak- Chalfan,K.M. (1986). A Knowledge System that In-
ing. tegrates Heterogeneous Software for a Design Appli-
2. Encourage decision-maker to enlarge cope of fac- cation. The AI Magazine. V2,N.2, Summer 1986.
tors considered.
3. Reinforce goals, values and ethics of the organisa- Coombs,M.J., AltyJ.L. (Eds.) (1981). Computing
tion. Incorporate explicitly to facilitate changes. Skills and the User Interface. Computers and People
4. Provide quality control of decisions. Series. Academic Press. 1981.
5. Support decision-making in the context of prob-
lem-solving. Corkill,D.D. (1991). Embedable Problem-Solving
6. Provide only the information needed/requested. Architectures: A Study of Integrating OPS5 with
7. Provide suggested decisions/solutions. UMass GBB> IEEE Transactions on Knowledge and
8. Facilitate adaptability to changing conditions and Data Engineering, V.3,N.l, March 1991.
to progress of users in acquiring new decision-mak-
ing skills. Diaz,A.C. (1990). An Overview of Realtime Expert
9. Provide advice on what tools to use and how. Systems. Division of Electrical Engineering, Nation-
10. Maximise comparability with the users jobs, al research Council Canada, report No. ERA-380,
tasks, discretionary powers, expectations, thinking NRCNo. 31759, April 1990.
styles, levels of experience.
11. Define functions and roles of the DSS first, then Diaz,A.C.,Orchard,R.A. (1990). A Prototype Black-
information needs. board Shell Using CLIPS. Applications of Artificial
12. Incorporate problem domain knowledge in the Intelligence in Engineering V. V.2 Manufacture and
form of task guidance systems and IT tool applica- Planning. Proceedings of the 5th International Con-
tion knowledge in the form of technology guidance ference, Boston, USA. July 1990. Editor G.Rzevski,
systems. Computational Mechanics Publications, Southampton
13. Ensure that the user is always in control. Boston.
14. Assist user to structure unstructured tasks. Facil- Co-publisher: Springer-Verlag.
itate their incorporation into task guidance systems.
Dreyfus,H.L.,Dreyfus,S.E. (1986). Competent Sys-
In seeking information on intellectual and computer tems: The Only Future for Inference-Making Com-
technologies, here are some topics and starting puters. Future Generation Computer Systems, V.2,
points: p. 233-243. Elsevier Science Publishers, North-Hol-
land, 1986.
(a) Thinking styles of decision-makers (Mckenney
and Keen 1974). Falster,P. (1987). Planning and Controlling Produc-
(b) Models of cognitive style (McKenney and Keen tion systems Combining Simulation and Expert Sys-
1974, Vicente et al 1988). tems. Computers in Industry, 8(1987) 161-172.
(c) Theories of problem structuring (Smith 1988).
(d) Aspiration-based decision analysis and support Filip,F.G.Donciulescu,D.A.,Majuru,A.,Socol,I.
(Lewandowski and Wierzbicki 1988). (1989). External and Conceptual Models in Decision
(e) Task, functional, user analysis (Harker and Eason Support Systems. Syst. Anal. Model. Simul. 6(1989)
1985, Payne and Green 1986). 7, 507-512, Akademie-Verlag Berlin.
(f) Distribution of task elements between the user
guidance system, the uger and the IT support tools Gaines,B.R. (1988). A Conceptual Framework for
in the DSS (Vicente and Rasmussen 1988 ) . Person-Computer Interaction in Complex Systems.
(f) Identification of levels of DSS needed and match- IEEE Transactions on Systems, Man, and Cybernet-
ing the DSS to plant and user needs (Vicente et al ics, Vol.18, No.4, July/August 1988.
1988).
Guariso,G.,Werthner,H. (1989). Environmental De-
(g) DSS architectures/languages (Corkill 1991, Diaz cision Support Systems. Halsted Press, John Wiley
1990, Rubin et al 1988, Rosenbloom et al 1985, and Song, 1989.
Chalfan 1986).
(h) Symbiotic systems, visual representations (Scn-
abin et al 1988). Green,P.E. (1988). AF: A Framework for Realtime
(i) Modelling and ES (Ulgen and Thomasma 1989). Distributed Cooperative Problem-Solving. Distribut-
(j) Conceptual framework for describing human- ed Artificial Intelligence, M.N.Huhns (ed.), Pitman,
computer interaction (Gaines 1988). London, Morgan Kaufmann, Pub. Inc., Los Altos,
CA., 1985.
189
Harker,S.,Eason,K. (1985). Task Analysis and the Rosenbloom,P.S.,Laird,J.E.,McDermott,J.,Newell,A.
Definition of User Needs. In Analysis, Design and ,Orciuch,E.(1985).Rl-soar:An Experiment in
Evaluation of Man-Machine Systems. Proceedings of Knowledge-Intensive Programming in a Problem-
the 2nd IFAC/IFIP/IFORS/IEA Conference, Varese, Solving Architecture. IEEE Transactions on Pattern
Italy, Sept. 1985, Published by Pergamon Press. Analysis and Machine Intelligence, Vol. PAMI-
p.63-66. 7,No.5, September 1985.

Jagannathan,V.,Dodhiawala,R.,Baum,L.S. (1989). Rubin,K.S.,Jones,P.M.,Mitchell,CM. (1988).


Blackboard Architectures and Applications. Perspec- OFMspert: Inference of Operator Intentions in Su-
tives in Artificial Intelligence, V.3, Academic Press pervisory Control Using a Blackboard Architecture.
Inc., 1989. IEEE Transactions on Systems, Man and Cybernet-
ics., Vol. 18, No. 4, July/August 1988.
KeenJP.G.W. (1986). DSS: The Next Decade, p. 221-
237. Decision Support Systems: A Decade in Per- Rushinek,A.,Rushinek,S.F. (1986). What Makes
spective. Editors: E.R. McLean, H.G.Sol. Elsevier Users Happy? Communications of the ACM, July
Publishers B.V. (North-Holland) 1986. 1986, V.29,N.7,p. 594-598.

Kribbe,W.,van der Hoek,G. (1987). Applicability of SchragerJ. (1988). Issues and Directions in Qualita-
Nonlinear Programming in Decision Support Sys- tive Physics: Special Edition from the 1987 Work-
tems: Realization of User Friendly Software. Faculty shop. Artificial Intelligence in Engineering, 1988,
of Mathematics and Informatics/Computer Science, V.3,N.3,p.l22-124.
Delft University of Technology. Available from Na-
tional Technical Information Service, Springfield, Scriabin,M.,Bisanz,S.,Lakeman,G.,Place,S. (1988).
VA. 22161. Report No. P88-14065. Symbiotic Systems for Complex Problems. Proceed-
ings of the 1st International Conference on Industri-
Lee,N.S. (1990). A Computational Paradigm that In- al and Engineering Applications of AI and ES, Uni-
tegrates Rule-based and Model-based Reasoning in versity of Tennessee Space Institute, June 1988.
Expert Systems. International Journal of Intelligent
Systems, V.5, 135-151, 1990. Shaw,J.J.,Deckert,J.C.,Alexandridis,M.G. (1984).
Development of an Interface for Optimization Algo-
Lewandowski,A.,Wierzbicki,A. (1988). Aspiration- rithms. Final Technical Report. National Science
based decision Analysis and Support. Part LTheoret- Foundation report NSF/ECS-84005, August 1984.
ical and Methodological Backgrounds. International
Institute for Applied Systems Analysis. Working pa- Sharda,R.,Barr,S.H.,McDonnell,J.C.(1988).Deci-
per WP-88-03, January 1988. sion-Support-System-Effectiveness: A Review and
an Empirical Test. Management Science, 34-
Manheim,M.L.,Srivastava,S.,Vlahos,N.,Hsu,J- ,2,February 1988, p.139159.
.,Jones,P. (1990). A Symbiotic DSS for Production
Planning and Scheduling: Issues and Approaches. Smith,G.F. (1988). Towards a Heuristic Theory of
Proceedings of the 23rd Annual Hawaii International Problem Structuring. Management Science.
Conference on System Sciences. IEEE Comput. Soc. V.34,N.12, Dec.1988.
Press., Los Alamitos,CA.,1990,V.3,p.383-390.
Ulgen,O.N.,Thomasma,T. (1989). Computer Simu-
McKenney,J.L.,Keen,P.G.W. (1974). How Managers' lation Modelling in the Hands of Decision Makers.
Minds Work. Harvard Business Review, May/June Simulation and AI, 1989. Proceedings of the SCS
1974, p.79-90. Multiconference, San Diego, CA.,1989, p.89-95,
VII.
MoserJ.G. (1986). Integration of Artificial Intelli-
gence and Simulation in a Comprehensive Decision Vicente,K.J.,Rasmussen,J. (1988). The Cognitive
Support System. SIMULATION, 47:6 223-229, Architecture of Decision Support Systems for Indus-
1986. trial Process Control. Riso National Laboratory,
DK-4000 Roskilde, Denmark, Harch 1988. Report
Nambiar,P.V.N.,Acharya,D. (1988). Decision Sup- No. Riso-M-2636. 42 pages.
port System for Production Planning: A Steel Plant
Experience. Operational research '87. G.K.Rand Wu,D.S.,Tabucanon,M.T. (1990). Multiple Criteria
(Editor), Elsevier Science Publishers B.V. (North- Decision Support System for Production Manage-
Holland), 1988. ment. Engineering Costs and Production Economics,
20 (1990) 203212.
Payne,S.J., Green,T.R.G. (1986). Task-Action
Grammars: A Model of the Mental Representation Wurman, R.S. Information Anxiety. Doubleday,
of Task Languages. Human-Computer Interaction, New York, 1989.
v.2, p.93-133, 1986.

Reitman,W. (1990). Generic Expert Systems for


Management Applications: The Operations Adviser
and the Management Adviser. Computer Science in
Economics and Management. 3:167-175, 1990.
190
Copyright IFAC Expert Systems in Mineral and
Metal Processing, Espoo, Finland, 1991

APPLICATION VIEWPOINTS OF EXPERT SYSTEMS IN


MINERAL AND METAL PROCESSING
K. Leivisk
Department of Process Engineering, Control Engineering Laboratory, University ofOulu,
SF-90570 Oulu, Finland

1. INTRODUCTION

The number of reported applications of Expert Table 1. Classification of Expert Systems applications and
Systems has increased rapidly in all engineering methods in mineral and metal processing.
branches and this workshop is also showing similar
type of behaviour in mineral and metal processing. Control Level Application Methods
Applications are not limited to some type of DSS Quality planning Knowledge-based
processes, only, but they are widely distributed over systems +
the whole field. Research work is very active and Conventional
the development rate is high. However, the programming &
technology seems to be maturing and therefore the Data-base methods
Production control Scheduling Knowledge-based
risks connected to industrial applications are Energy management systems
becoming lower. conventional
simulation and
optimization
The role of Expert Systems depends more or less on methods
the level of control where they are applied. On the Diagnostics Knowledge-based
lowermost control level the requirements of the systems
actual, time critical control functions must be taken Neural networks +
conventional
into account. Also the integration with existing methods
conventional control systems and with control Process control Supervisory control Rule-based systems
methods and algorithms is essential. Stabilization Fuzzy control
Direct control Neural networks
On the upper levels, in diagnosis and DSS,
man/machine interface and explanation mechanisms
play an important role. Integration of Expert or valuing dynamics and long time delays make also
Systems with existing mathematical models and the situation more complicated. These kind of
methods is important. The use of Expert Systems to processes need the inclusion of human knowledge
systematically store the key expert knowledge and and qualitative reasoning in control strategies. Also
know-how is also emphasized. On the upper levels, uncertainties in systems, environments and
the acquisition, validation and updating of measurements can require the application of Expert
knowledge is an essential and also problematic area. System tools.
The functions and methods of Expert Systems Specific features of expert control are reviewed for
applications in mineral and metal processing can be instance by rzen (1989): Non-monotonic
classified as shown in Table 1. reasoning is required because the validity of data is
changing with time and the state of the system and
this must be automatically taken into account.
2. EXPERT SYSTEMS IN PROCESS CONTROL Interrupting mechanisms must be available to
priorize between asynchronous events of different
Expert Systems applications exist for nearly all importance. A real-time system must also include
types of processes, simple and complex. The most features like temporal reasoning capabilities; ie. it
beneficial, however, seem to be complex processes must be possible to connect time and reasons and
where process performance and behaviour are their sequences together; and reasoning under
difficult to model with conventional methods. The limited time and based on erroneous or missing
complexity is often connected with big amount of sensor data.
measurements with uncertainties and high noise
levels, where the operators have difficulties in According to Saito (1990), in expert control the
system monitoring and following the process recognition-judgement-action cycle is repeatedly
performance and especially the changes in it. Slow carried out (see Fig. 1).

191
Figure 1.
Expert control as a recognition-judgement-action
cycle.
INTERPRETATION/ PLANNING/
DIAGNOSTICS DESIGN

START NORMAL STATE NORMAL STOP


OPERATION CHANGE OPERATION
Stabilizing control

In stabilizing controls both open-loop and closed- Figure 3. Different states of the process.
loop applications arc in use (Fig. 2).

Advisory systems are based on process Supervisory control


measurements and they are giving advice to process
operators. In this case, the control loop is closed As far as the supervisory control is concerned
only by the operator. Examples are found for Expert Systems type of applications have been done
instance in Partington, (1989) for BOS monitoring already before using process control computers and
and control and blast furnace control. Fortran and other languages. Customized systems
have been built for different processes.
Basically, two variants of closed-loop control exist.
First, Expert System can take care of the whole Expert System tools seem to offer several
control using for instance Fuzzy Control. Some advantages facilitating especially systems building: it
examples are found in Saito (1990). Secondly, is possible to use linguistic terms instead of
Expert System is used in tuning conventional programming approach, system building is faster
controllers, in changing control strategies and with minimum programming skills and computer
adopting to varying operational conditions. This is knowledge, and system modification is easier.
done by utilizing expert knowledge and rule-based Supervisory applications, however, suffer from the
systems. same limitations and lacks as other Expert Systems
as reviewed in the later chapters.
Another way of dividing Expert System applications
to process control would be to look at the state of Requirements are discussed in Herbst et al. (1990)
the process; ie. to look for systems for start/stop and they differ slightly from the stabilizing system
sequences, normal operation and state changes (Fig. requirements: Real-time features are same as before
3). It is, however, quite surprising to see that very (interrupt mechanisms, time-limited features,
few systems for start/stop sequences and process or priorities). For supervisory systems, data transfer
production line state changes have, this far, been with stabilizing level systems is of course a must.
reported. This means links with distributed control systems or

EXPERT EXPERT
SYSTEM
USER -* SYSTEM
/
CONTROL EXPERT
CONTROL
-j
SYSTEM SYSTEM S/STEM
^
r
PROCESS ^ ^ PROCESS PROCESS

ON-LINE OPEN-LOOP ON-LINE CLOSED-LOOP


ADVISORY SYSTEM SYSTEMS

Figure 2. Principles of open-loop and closed-loop expert controllers.

192
equal and communications to both directions are Diagnostics
needed. Also connections with algorithmic parts in
supervisory systems are needed. This means the Diagnostics and fault detection systems are classical
utilization of process models and optimization applications of Expert Systems. Compared with
systems. In many cases, connections with process algorithmic-based fault deduction the biggest
monitoring and analyzing systems must also be advantage of Expert Systems approach is that it
included. gives possibilities to follow human's way of fault
diagnosing.
The greatest challenge on the supervisory control
level would be building systems that utilize expert Applications vary from troubleshooting of
knowledge in the systems tuning and hydraulic and electronic systems to vibration
implementation stage. This is usually the most time analysis. The essential feature is combining of
consuming stage in supervisory system projects and Expert Systems with mathematical models and other
the success in it is very much dependent on the skills conventional tools and finally their integration with
of the system engineer. These kind of applications total condition monitoring systems.
have not been reported.

3. OTHER APPLICATIONS Expert Systems in scheduling

Needs for other applications are more or less same In scheduling application Expert Systems are
as before; Expert Systems are used when it is targeting to the higher throughput, the keeping of
necessary to handle a lot of complicated, varying delivery times and good end-product quality. The
and uncertain information, when the decision functions vary from the allocation of customer
making is based on experience and on qualitative orders to the selection of the routing in the mill
reasoning. One important aspect in upper level according to product requirements.
Expert Systems solutions is that they make it
possible to retain the key knowledge in the system Two main possibilities are available. In the first
and facilitate diminishing the negative effects of one, Expert System is used together with
changing personnel. They are also used in training conventional algorithmic scheduling in the problem
of new staff members and, wisely applied, give formulation stage and in combining qualitative or
their users a chance to keep up their own rule-based information in the system. In this way
knowledge. the strict mathematical problem formulation is
adopted to changing requirements.
Upper level Expert Systems are mostly consulting
systems and with the aid of these experts can reduce In the second approach, the whole scheduling
their working loads. problem is formulated as a rule based system and
the solution itself can also include some algorithmic
parts (Fig. 4).

ORDERS
PROBLEM
FORMULATION

SCHEDULES
| CONSTRAINTS
REQUIREMENTS

ALGORITHMIC
PARTS

ORDERS

SCHEDULES
CONSTRAINTS
REQUIREMENTS

Figure 4. Different ways of applying Expert Systems in scheduling.

193
The utilization of Expert System approaches in 4. ADVANTAGES
scheduling gives possibilities to include qualitative
information about limitations, bottlenecks, quality There are not too many detailed studies available on
variables, strategy changes etc. into the solution. the results gained with Expert Systems in mineral
This is not necessarily possible in algorithmic and metal processing. This is due to the fact that we
approaches. are dealing with the new technology in the trial
stage. Also the applications are on areas where the
As mentioned above Expert System approaches are results are difficult to estimate quantitatively. Some
sometimes used together with conventional examples are given by Born, 1989, Partington,
algorithmic approaches. The result is a suboptimal 1989, Atsumi, 1989, Saito, 1990 and Herbst et al.
or feasible schedule or several schedules are (1990) and following is a short conclusion of these.
presented for operator selection. One problem here
is to guarantee the optimality of the solution. The A regularly mentioned advantage is the improved
feasibility is assured for instance by simulations. and more consistent decision making by non-experts
and the saving of experts' time. It is also possible to
process uncertain knowledge like the expert is doing
DSS/ES applications it. By storing of expert knowledge the expertise is
kept within the company eventhough experts are
The higher in the control hierarchy we go, the leaving the company or moving to other positions
more important are connections with conventional inside it. The similar effect is also gained by using
data processing systems and data bases. DSS/ES Expert Systems in training of new staff members or
systems are by nature consulting systems and it in keeping staff members' skills on the existing
means that operator interface questions are having level.
still more importance compared with earlier
mentioned systems. Also the questions connected Expert Systems make fast system development
with maintenance and updating are emphasized. possible using prototyping with some tools even by
Moving from rule-based systems to object-based users without any intervention of system
makes modifications easier. programmers (mostly concerning the knowledge
base). First results are shown very rapidly.
Figure 5 gives some implication of the structure for Prototyping has also some disadvantages and
this kind of system. The applications vary a lot. especially with large scale systems it does not give
Product and quality control and different design and the clear picture of the job in the beginning and
planning activities seem to be the most popular. may even be misleading. The need for more
efficient tools for systems design is clear.

REQUIREMENTS LIMITATIONS

/ PROBLEM
FORMULATION

RESULTS

Figure 5. Rough scheme of DSS/ES system structure.

194
With Expert Systems, different tools are available inefficient approach. Small prototypes do not tell
for users with different skills: AI languages, enough about the requirements of the large scale
toolkits, shells and application packages. Figure 6 applications and full protos turn out very expensive.
shows the situation according to Partington (1989).
Some systematic structural design methodology and
corresponding supporting functions for system
APPLICATION design are needed.
SPECIFIC PACKAGES
Problems in integrating with existing systems.
Nearly all applications include interfacing and
integration needs; we need interfacing with
SCOPE
OF USE SHELLS program languages, conventional simulation and
optimization programs, outside hardware and
software. For instance in process control
applications the degree of integration must be
TOOLKITS considered case by case as long as distributed system
vendors do not have Expert System tools as
standard functions in their systems.

GENERAL AI LANGUAGES Need for training. Being a new technology, Expert


HIGH LOW Systems applications introduce considerable training
PROGRAMMING SKILLS REQUIRED especially for the staff that is responsible for
development, updating and maintenance of Expert
Figure 6. Systems. This concerns both hardware and
Different Expert Systems tools according to the software. This need is also increased in some cases
scope of application and the programming skills by the lack of maintenance support and knowledge
required (Partington, 1989). from some tools' vendors and by the need to keep
the company knowledge in own hands.
In the supervisory control the savings are coming
from the same sources as in the conventional
systems. Expert Systems help to stabilize the process 6. REFERENCES
and to adapt operations to changes in feed variations
and changing equipment conditions. Examples are Atsurni T., 1989, Artificial Intelligence in the
given in Herbst (1990) for SAG circuits, where the Japanese steel industry. IISI Technical Exchange
application of Expert System increased throughput Session "Use and application of Expert Systems and
by 6-15 % and decreased the energy consumption Artificial Intelligence". Chicago 1989, pp. 31-42.
by 2.5 %. Concerning the blast furnace systems
Atsurni (1989) reported that in 93 % of cases the Born A., 1989, The application of Expert Systems.
Expert System was judged to perform at least as IISI Technical Exchange Session "Use and
well as the human operator. application of Expert Systems and Artificial
Intelligence". Chicago 1989, pp. 1-6.
5. PROBLEMS
Herbst J.A. et al., 1990, Report on actual benefits
Expert Systems suffer also from some limitations arising from the application of expert control
and disadvantages. Some of these are commented system in industrial semi-autogenous control
upon in the following: circuits. Proceedings of 6th IFAC Symposium
Automation in mining, mineral and metal
Lack of systematic methods for knowledge processing, Buenos Aires, Argentina 4-8 September
acquisition and validity testing. This problem is met 1989. Pergamon Press, 1990, pp. 53-60.
in all applications and some automatic, or at least
systematic, methods are expected by the user. In Partington D., 1989, Expert Systems developments
control applications this is seen as a necessity to within British Steel. IISI Technical Exchange
check rule-based control with model based controls Session "Use and application of Expert Systems and
in order to assure for system optimality and Artificial Intelligence". Chicago 1989, pp. 15-29.
stability.
Saito T., 1990, Application of artificial intelligence
Lack of learning capabilities. This comes up in the Japanese iron and steel industry. Proceedings
especially with deeper knowledge and large scale of 6th IFAC Symposium Automation in mining,
systems. Also in this case some automatic methods mineral and metal processing, Buenos Aires,
that help the system to adapt its operations Argentina 4-8 September 1989. Pergamon Press,
according to what has happened in the past are 1990, pp. 31-38.
waited for.
rzen K-E., 1989, An architecture for expert
Lack of proper design methods. Once again, system based feedback control. Automatica
especially for large scale systems prototyping is an 25(1989)6, 813-827.

195
AUTHOR INDEX

Anabuki, Y. 63 Kanemoto, M. 51 Shibata,M. 147


Ando,N. 19 Kerttula,M. 79 Sillanp,M. 155
Arai,K. 147 Koponen, P. 31 Snopek, W. 39
Aral, S. 71 Kumar, S. 95 Spassov, K. 57
Arima, K. 51 Kuribayashi, T. 89 Spring, R. 7
Asano, K. 147 Stange, W. 13
Stohl, K. 39
Laitinen, T. 79 Syrjnen, M. 103
Barnard, E. 179 Lassila, 0 . 103
Bonifazi,G. 139 Lee, H.G. 45
Bradford, S.H. 1 Lee, I.O. 45 Takenaka, K. 89
Brimacombe, J.K. 95 Leivisk,K. 119,191 Tashiro,K. 19
Bulsari, A.B. 155 Tateno, J. 147
Teraski, T. 19
Maruyama, S. 147 Terazono, K. 109
Choi,T.H. 45 Mattila,P. 103 ThibaulU. 25
Cilliers,JJ. 161 Meech,J.A. 95 Trm, S. 103
Costantini, A. 139 Moritz, Th. 39 Tottori, H. 51

Filev, D. 57 Nakata, E. 109 Uronen, P. 31


Flament, F. 25 Nenonen, L.K. 185
Franklin, M. 7
Fujimoto, H. 71, 109 Vagenas, N. 173
Fukaya, N. 71 Okaya, K. 167 van der Walt, T.J. 179
Owaki, R. 63 van Deventer, J.S.J. 125, 179
Viherma,R. 31
Ginsberg, D.W. 115
Pesonen, L. 103
Piironen, T. 79 Watabane, M. 19
Haataja,R. 79 Weigert, Th. 39
Hadjiski,M. 57 Whiten, W.J. 115
Hirao,F. 89 Rajamani, R.K. 131
Hodouin, D. 25 Rm,T. 31
Hosoda,M. 89 Reuter, M.A. 125 Yamane, H. 51
Yi, S.H. 45
Yoshida,T. 51
Ibaragi, M. 147 Sakane, K. 51
Inoue, T. 167 Sakiyama, H. 63
Samarasekera, I.V. 95
Sanou, K. 71
Jin, J.G. 45 Saxen, H. 155
Juuso, E.K. 119 Shah,I.M. 131

197
KEYWORD INDEX

Acquisition of knowledge-based information, 79 Feedforward neural network, 25


Adaptive control, 131 Fuzzy model, 57
Adaptive systems, 119 Fuzzy set theory, 131
Adsorption, 13, 179 Fuzzy systems, 51,119
Aestetic characteristics, 139
Alarm systems, 147 Grinding simulation, 25
Artificial intelligence, 1,19,45, 71, 89,103,109,161
Automatic control, 1,19 Heuristic programming, 71
Automation, 39, 147 Hierarchical systems, 119
Hierarchically intelligent control, 1
Back propagation, 167 Hot coil finishing line, 89
Batch and continuous simulation, 125 Hydrocy clones, 13
Blast furnace operation, 45
Blending, 109 Image analysis, 139
Blowing control, 51 Image processing, 147
Breakouts, 95 Inference processes, 71

Casting, 147 Kinetics, 125


Categorisation, 115 Knowledge acquisition, 95,115
Character recognition, 147 Knowledge base, 57, 89
CIP, 13 Knowledge based, 125
Classification, 79, 115, 147 Knowledge engineering, 19, 31, 39, 45, 51, 89
Clustering, 115 Knowledge representation, 95
Cokemaking, 109 Knowledge units, 95
COMDALE/X, 95
Commercial expert systems, 79 Ladle metallurgy, 57
Computer control, 1, 19, 63, 131 Learning method, 167
Computer graphics, 7 Local cluster shape, 115
Computer simulation, 119
Continuous casting of steel billets, 95 Mathematical model, 57
Continuous data, 115 Mathematical programming, 109
Control engineering applications of computers, 1 Metallurgical industries, 39, 119, 131
Control nonlinearities, 131 Metals production, 31
Cracks, 95 Micro-computer-based control, 103
Microcomputer based control, 1
Data trends, 115 Mineral processing, 7, 13,25, 161, 167
Mining vehicles, 173
Electrical steel finishing line, 63 Multi-dimensionality, 179
Evaluation and testing of expert systems, 95
Expert system technology, 79 Neural controller, 25
Expert system, 45, 63, 89, 95 Neural nets, 13, 147, 155, 179
Expert systems, 1, 39, 71, 109, 115, 119, 155 Neural networks, 79, 125, 161, 167

Failure detection, 173 Ornamental stone, 139


Fault diagnosis, 173 Outlier detection, 115

199
Perturbation analysis, 179 Rhomboidity, 95
pH control, 131 Rolling mills, 103
Plant inverse neural model, 25 Rule base, 57
Plant neural model, 25 Rule-based expert systems, 173
Power management, 31 Rule-based systems, 79
Prediction, 31
Process control, 1, 7, 13, 167 Scheduling, 39
Process models, 161 Steel industry, 19, 31, 63, 71,147,155
Production control, 71, 103, 109 Steel manufacture, 39, 103
Programming environments, 7 Surface inspection, 79
Programming languages, 79 Synthesis methods, 71

Quality control, 79,139 Texture analysis, 139


Quality problems in billet casting, 95 Thermal state, 57
Transfer system, 89
Reactor identification, 125 Transmission system, 173
Real time computer systems, 19 Transportation control, 63
Real-time computer system, 63
Real-time control, 89 User interfaces, 79
Real-time expert system, 51
Refining process, 51 Weighted distance measure, 115

200
Published Symposia/Workshop volumes

SYMPOSIA VOLUMES

AD ALI & TUNALI: Microcomputer Application in Process Control ISERMANN: Identification and System Parameter Estimation (1979)
AKASHI: Control Science and Technology for the Progress ISERMANN & FREYERMUTH: Fault Detection, Supervision and
of Society, World Congress 1981, 7 volumes Safety for Technical Processes
ALBERTOS & DE LA PUENTE: Components, Instruments and ISERMANN & KALTENECKER: Digital Computer Applications to
Techniques for Low Cost Automation and Applications Process Control*
ALONSO CONCHEIRO: Real Time Digital Control Applications * ISIDORI: Nonlinear Control Systems Design
AMOUROUX & EL JAI: Control of Distributed Parameter Systems JAAKSOO & UTKIN: Automatic Control, World Congress 1990,
(1989) 6 volumes
ATHERTON: Multivariable Technological Systems JANSSEN, PAU & STRASZAK: Dynamic Modelling and Control of
BABARY & LE LETTY: Control of Distributed Parameter Systems National Economies (1980)
(1982) JELLALI: Systems Analysis Applied to Management of Water
BALCHEN: Automation and Data Processing in Aquaculture Resources
BANKS & PRITCHARD: Control of Distributed Parameter Systems JOHANNSEN & RIJNSDORP: Analysis, Design, and Evaluation of
(1977) Man-Machine Systems*
BANYASZ & KEVICZKY: Identification and System Parameter JOHNSON: Adaptive Systems in Control and Signal Processing
Estimation (1991) JOHNSON: Modelling and Control of Biotechnologjcal Processes
BAOSHENG HU: Analysis, Design and Evaluation of Man-Machine KAY A & WILLIAMS: Instrumentation and Automation in the Paper,
Systems (1989) Rubber, Plastics and Polymerization Industries (1986)
BARKER: Computer Aided Design in Control Systems KHEIR et al: Advances in Control Education
(1991) KLAMT & LAUBER: Control in Transportation Systems (1984)*
BARKER & YOUNG: Identification and System Parameter Estimation KOPACEK & GENSER: Skill Based Automated Production
(1985) KOPACEK, TROCH & DESOYER: Theory of Robots
BASANEZ, FERRATE & SARIDIS: Robot Control "SYROCO '85" KOPPEL: Automation in Mining, Mineral and Metal Processing
BASAR & PAU: Dynamic Modelling and Control of National (1989)
Economies (1983)* KMMEL: Adaptive Control of Chemical Processes (ADCHEM'88)
BAYUS: Safety of Computer Control Systems (1983)* LARSEN & HANSEN: Computer Aided Design in Control and
BEKEY & SARIDIS Identification and System Parameter Estimation Engineering Systems
(1982) LEININGER: Computer Aided Design of Multivariable
BINDER & PERRET: Components and Instruments for Distributed Technological Systems*
Computer Control Systems* LEONHARD: Control in Power Electronics and Electrical Drives
CALVAER: Power Systems, Modelling and Control Applications (1977)*
Van CAUWENBERGHE: Instrumentation and Automation in the LESKIEWICZ & ZAREMBA: Pneumatic and Hydraulic Components
Paper, Rubber, Plastics and Polymerisation Industries and Instruments in Automatic Control*
(1980)*(1983) LEVIS & STEPHANOU: Distributed Intelligence Systems (1991)
CHEN HAN-FU: Identification and System Parameter Estimation LINKENS & ATHERTON: Trends in Control and Measurement
(1988) Education
CHEN ZHEN-YU: Computer Aided Design in Control Systems MACLEOD & HEHER: Software for Computer Control
(1988) (SOCOCO'88)
CHRETIEN: Automatic Control in Space (1985) MAHALANABIS: Theory and Application of Digital Control
CHRISTODULAKIS: Dynamic Modelling and Control of National MANCINI, JOHANNSEN & MARTENSSON: Analysis, Design and
Economies (1989) Evaluation of Man-Machine Systems (1985)
COBELLI & MARIANI: Modelling and Control in Biomedical MARTOS, PAU & ZIERMANN: Dynamic Modelling and Control of
Systems National Economies (1986)
CUENOD: Computer Aided Design of Control Systems* McGREAVY: Dynamics and Control of Chemical Reactors and
DA CUNHA: Planning and Operation of Electric Energy Systems Distillation Columns
DE CARLI: Low Cost Automation MLADENOV: Distributed Intelligence Systems: Methods and
De GIORGIO & ROVEDA: Criteria for Selecting Appropriate Applications
Technologies under Different Cultural, Technical and Social MUND AY: Automatic Control in Space (1979)
Conditions* NAJIM & ABDEL FATTAH: System Approach for Development
DEVANATHAN: Intelligent Tuning and Adaptive Conrol (1980)*
DUBUISSON: Information and Systems NIEMI: A Link Between Science and Applications of Automatic
EHRENBERGER: Safety of Computer Control Systems Control, World Congress 1978, 4 volumes*
(SAFECOMF88) NISHIKAWA & KAYA: Energy Systems, Management and
ELLIS: Control Problems and Devices in Manufacturing Economics
Technology (1980)* NISHIMURA: Automatic Control in Aerospace
FERRATE & PUENTE: Software for Computer Control (1982) NORRIE & TURNER: Automation for Mineral Resource Development
FLEISSNER: Systems Approach to Appropriate Technology Transfer NOVAK: Software for Computer Control (1979)
FLORIAN & HAASE: Software for Computer Control (1986) O'SHEA & POLIS: Automation in Mining, Mineral and Metal
FRANKE & KRAUS: Design Methods of Control Systems Processing (1980)
GEERING & MANSOUR: Large Scale Systems: Theory and OSHIMA: Information Control Problems in Manufacturing
Applications (1986) Technology (1977)
GENSER et al: Control in Transportation Systems (1986) PAUL: Digital Computer Applications to Process Control (1985)
GERTLER & KEVICZKY: A Bridge Between Control Science and PERRIN: Control, Computers, Communications in Transportation
Technology, World Congress 1984, 6 volumes* PONOMARYOV: Artificial Intelligence
GHONAIMY: Systems Approach for Development (1977) PUENTE & NEMES: Information Control Problems in Manufacturing
HAIMES & KINDLER: Water and Related Land Resource Systems Technology (1989)
HARDT: Information Control Problems in Manufacturing Technology RAMAMOORTY: Automation and Instrumentation for Power Plants
(1982) RANTA: Analysis, Design and Evaluation of Man-Machine Systems
HERBST: Automatic Control in Power Generation Distribution and (1988)
Protection RAUCH: Applications of Nonlinear Programming to Optimization and
HRUZ & CICEL: Automatic Measurement and Control in Control*
Woodworking Industry-Lignoautomatica'86 RAUCH: Control of Distributed Parameter Systems (1986)*
HUSSON: Advanced Information Processing in Automatic Control REINISCH & THOMA: Large Scale Systems: Theory and
ISERMANN: Automatic Control, World Congress 1987, 10 volumes Applications (1989)
REMBOLD: Robot Control (SYROCO'88)
RIJNSDORP: Case Studies in Automation Related to Humanization of TITLI & SINGH: Large Scale Systems: Theory and Applications
Work (1980)*
RIJNSDORP et al: Dynamics and Control of Chemical Reactors TROCH, DESOYER & KOPACEK: Robot Control (1991)
(DYCORD'89) TROCH, KOPACEK & BREITENECKER: Simulation of Control
RIJNSDORP, PLOMP & MLLER: Training for Tomorrow- Systems
Educational Aspects of Computerized Automation UHI AHN: Power Systems and Power Plant Control (1989)
ROOS: Economics and Artificial Intelligence VALADARES TAVARES & EVARISTO DA SILVA: Systems
SANCHEZ: Fuzzy Information, Knowledge Representation and Analysis Applied to Water and Related Land Resources
Decision Analysis* Van WOERKOM: Automatic Control in Space (1982)
SAWARAGI & AKASHI: Environmental Systems Planning, Design WANG PINGYANG: Power Systems and Power Plant Control
and Control WESTERLUND: Automation in Mining, Mineral and Metal
SINHA & TELKSNYS: Stochastic Control Processing (1983)
SMEDEMA: Real Time Programming (1977)* YANG JIACHI: Control Science and Technology for Development
STRASZAK: Large Scale Systems: Theory and Applications (1983) YOSHITANI: Automation in Mining, Mineral and Metal Processing
SUBRAMANYAM: Computer Applications in Large Scale Power (1986)
Systems ZWICKY: Control in Power Electronics and Electrical Drives (1983)
TAL': Information Control Problems in Manufacturing Technology
(1986)

WORKSHOP VOLUMES

ASTROM & WITTENMARK: Adaptive Systems in Control and Signal LOTOTSKY: Evaluation of Adaptive Control Strategies in Industrial
Processing Applications
BOULLART et al: Industrial Process Control Systems MAFFEZZONI: Modelling and Control of Electric Power Plants
BRODNER: Skill Based Automated Manufacturing (1984)*
BULL: Real Time Programming (1983)* MARTIN: Design of Work in Automated Manufacturing Systems*
BULL & WILLIAMS: Real Time Programming (1985) McAVOY: Model Based Process Control
CAMPBELL: Control Aspects of Prosthetics and Orthotics* MEYER: Real Time Programming (1989)*
CHESTNUT: Contributions of Technology to International Conflict MILLER: Distributed Computer Control Systems (1981)
Resolution (SWIIS) MILOVANOVIC & ELZER: Experience with the Management of
CHESTNUT et al: International Conflict Resolution using Systems Software Projects (1988)
Engineering (SWIIS) MOWLE & ELZER: Experience with the Management of Software
CHESTNUT et al: Supplemental Ways for Improving International Projects (1989)
Stability* NARITA & MOTUS: Distributed Computer Control Systems (1989)
CICHOCKI & STRASZAK: Systems Analysis Applications to OLLUS: Digital Image Processing in Industrial Applications - Vision
Complex Programs Control
CRESPO & DE LA PUENTE: Real Time Programming (1988) QUIRK: Safety of Computer Control Systems (1985)(1986)
CRONHJORT: Real Time Programming (1978) * RAUCH: Control Applications of Nonlinear Programming*
DI PILLO: Control Applications of Nonlinear Programming and REMBOLD: Information Control Problems in Manufacturing
Optimization Technology (1979)
ELZER: Experience with the Management of Software Projects RODD: Artificial Intelligence in Real Time Control (1989)
FLEMING & JONES: Algorithms and Architectures for Real-Time RODD: Distributed Computer Control Systems (1983)
Control RODD: Distributed Databases in Real Time Control
GELLIE & TAVAST: Distributed Computer Control Systems (1982)* RODD & LALIVE : Distributed Computer Control
GENSER et al: Safety of Computer Control Systems (SAFECOMF89) Systems (1988)
GOODWIN: Robust Adaptive Control RODD & MLLER: Distributed Computer Control Systems (1986)
HAASE: Real Time Programming (1980) RODD & SUSKI: Artificial Intelligence in Real Time Control
HALANG & RAMAMRITHAM: Real Time Programming (1991) RODD & SUSKI: Artificial Intelligence in Real-Time Control (1991)
HALME: Modelling and Control of Biotechnical Processes* SIGUERDIDJANE & BERNHARD: Control Applications of
HARRISON: Distributed Computer Control Systems (1979) Nonlinear Programming and Optimization
HASEGAWA: Real Time Programming (1981)* SINGH & TITLI: Control and Management of Integrated Industrial
HASEGAWA & INOUE: Urban, Regional and National Planning - Complexes*
Environmental Aspects SKELTON & OWENS: Model Error Concepts and Compensation
JAMSA-JOUNELA & NIEMI: Expert Systems in Mineral and Metal SOMMER: Applied Measurements in Mineral and Metallurgical
Processing Processing
JANSEN & BOULLART: Reliability of Instrumentation Systems for SUSKI: Distributed Computer Control Systems (1985)
Safeguarding and Control SZLANKO: Real Time Programming (1986)
KOPETZ & RODD: Distributed Computer Control Systems (1991) TAKAMATSU & O'SHIMA: Production Control in Process Industry
KOTOB: Automatic Control in Petroleum, Petrochemical and UNBEHAUEN: Adaptive Control of Chemical Processes
Desalination Industries VILLA & MURARI: Decisional Structures in Automated
LANDAU, TOMIZUKA & AUSLANDER: Adaptive Systems in Manufacturing
Control and Signal Processing
LAUBER: Safety of Computer Control Systems (1979)

*Out of stock High quality repro print available. Details of prices sent on request from the IF AC Publisher.

IFAC related titles


BROADBENT & MASUBUCHI: Multilingual Glossary of Automatic Control Technology
EYKHOFF: Trends and Progress in System Identification
NALECZ: Control Aspects of Biomedical Engineering

Das könnte Ihnen auch gefallen