Beruflich Dokumente
Kultur Dokumente
for
Advanced SCADA engineering'
Redundant system
using HAC
Venue: YEF-NLS
Date: Mar. 1, 2010 (Mon) - Mar. 5, 2010
(Fri)
Conducted by: YHQ GEC ETC Support
Copyright Yokogawa Electric Corporation
Intro
Requirements
How it works
Configuration pitfalls
A HAC system:
Protects against single hardware failures
Medium protection against software failures
Some protection against database
corruption
Takeover time in order of seconds (after
error detection)
Master and backup system can be in
different buildings/cities
Upgrading windows or patching FAST/TOOLS
needs mostly only switchover
Intro
Requirements
How it works
Configuration pitfalls
Requirements
Intro
Requirements
How it works
Configuration pitfalls
How it works
System
1
System
2
How it works
During synchronizing:
First a .zip of all related files on backup system
is created
Then per file difference is checked between
systems:
Missing and different files are copied on bucket basis
Key file is re-created on backup system
Presence of processes
CPU load of processes
Queue size of processes
Responsiveness of some specific processes
Presence of processes
The list of running processes is compared to
the list of processes which should run at
standby or active state
Regularly checks if all processes in current
mode are running
When a process is missing, FAST/TOOLS
health is considered BAD
Responsiveness of processes
Some processes have a build in I am alive
mechanism
When the alive flag is missing during the
configured time period, FAST/TOOLS health
is BAD
The configuration of this mechanism for a
process which does not support it will be
ignored
ALERT
Print only alert message on UMH
PROCESS
Restart the process
TOOL
Restart the tool
REBOOT
Re-boot the system
SHUTDOWN Shut down the system
Monitor logging
Intro
Requirements
How it works
Configuration pitfalls
Configuration pitfalls
Configuration pitfalls
Configuration pitfalls
Responsiveness of processes
Some processes set this alive trigger only
once per minute (e.g. ALM)
Some processes can have unexpected long
processing periods (e.g. ITM on debugger
request save all items to disk)
Configuration pitfalls
Configuration aspects
Configuration pitfalls
Configuration pitfalls
Known problems
Do not use * definitions in process name
specifications
This can result in BAD health ghost messages
Exercise