Beruflich Dokumente
Kultur Dokumente
Dale E. Seborg
Present address: Johnson Controls, Inc., 507 E. Michigan St., Milwaukee, WI 53202. Email: Ashish.Singhal@jci.com
software (OSI
Software, www.osisoft.com). Because PI
is widely
used for data archiving, it is informative to compare
the commercially available software with the classical
techniques. In particular, the BatchFile Interface for
the PI
i=1
m
j=1
2
i, j
(5)
where m is the number of measurements in the original
dataset; n is the number of variables;
i, j
= (x
i, j
x
i, j
), x
i, j
rep-
resents the j
th
measurement of the i
th
variable in the original
data, and x
i, j
is the corresponding reconstructed value.
If the recording limit constant, c, is the same for all meth-
ods, then the resulting compression ratios will be dierent for
each method. These type of results would indicate how eec-
tive each method is for compressing data. However, in order
to compare the methods with respect to reconstruction ac-
curacy, it is easier to analyze the results if all methods have
the same compression ratio. A constant compression ratio
requires adjusting the recording limits individually for each
method. Because the accuracy of the data reconstruction is
a key concern, the recording limits for each method were
varied in order to achieve the the same compression ratio.
As mentioned in the previous section, the recording limits
for a for given method and each process variable are propor-
tional to their standard deviations. For example, the OSI PI
3.0 PI
14.83 0.33
accurate both in terms of reconstruction error and the simi-
larity of the reconstructed and original datasets.
Although the PI
,
but much lower than conventional compression methods such
as box-car, etc. Data compressed using wavelets also show a
high degree of similarity with the original data.
For pattern matching applications, it is benecial to com-
press the snapshot data prior to performing pattern matching.
4
Table 2. Eect of dierent data compression and reconstruction methods on pattern matching for the CSTR example.
Compression
method
Recording limit
constant (c)
Reconstruction
method
S
PCA
S
dist
SF
Box-Car 2.2295
Linear 0.88 0.67 0.81
Zero-order hold 0.87 0.83 0.86
Backward-slope 2.7744
Linear 0.84 0.63 0.77
Zero-order hold 0.83 0.39 0.68
Combination 2.20025
Linear 0.87 0.67 0.80
Zero-order hold 0.85 0.79 0.83
Averaging
NA
Linear 0.92 0.99 0.94
(over 1.25 min) Zero-order hold 0.93 0.97 0.94
Wavelet 2.2669 Wavelet 0.95 >0.99 0.97
PI
3.0 PI
S F = 0.67 S
PCA
+ 0.33 S
dist
Table 3. Eect of data compression on pattern matching for the CSTR example when both the snapshot and historical data are
compressed using the same method.
Compression
method
Similarity
factor
Opt. N
P
p (%) (%)
max
(%) (%)
S
PCA
only 34 43 90 99 66
Original data S
dist
only 25 41 68 97 54
S F
14 75 72 88 74
S
PCA
only 41 30 78 99 54
Combination S
dist
only 59 19 75 100 47
S F
15 65 67 91 66
S
PCA
only 21 49 65 95 57
Averaging S
dist
only 24 40 65 96 53
S F
17 64 73 92 68
S
PCA
only 34 38 82 99 60
Wavelet S
dist
only 52 25 83 100 54
S F
16 71 76 92 73
S F = 0.67 S
PCA
+ 0.33 S
dist
For the simulated case study, data compression had only a
minor eect on the eectiveness of a new pattern matching
strategy.
11
Acknowledgements
The authors thank OSI Software for providing nancial
support and the data archiving software PI
, and Gregg
LeBlanc at OSI for providing software support during the
research. Financial support from ChevronTexaco Research
and Technology Co. is also acknowledged.
References
(1) Singhal, A. and Seborg, D. E. Pattern Matching in Mul-
tivariate Time Series Databases Using a Moving Win-
dow Approach. Ind. Eng. Chem. Res., 2002. 41, 3822
3838.
(2) Hale, J. C. and Sellars, H. L. Historical Data Recording
For Process Computers. Chemical Eng. Prog., 1981.
77(11), 3843.
(3) Kennedy, J. P. Building an Industrial Desktop. Chemi-
cal Engr., 1996. 103(1), 8286.
(4) Bristol, E. H. Swinging Door Trending: Adaptive
Trend Recording? In Advances in Instrumentation and
Control, volume 45. Instrument Society of America,
Research Triangle Park, NC, 1990 749754.
(5) Mah, R. S. H.; Tamhane, A. C.; Tung, S. H. and Patel,
A. N. Process Trending With Piecewise Linear Smooth-
ing. Comput. Chem. Engr., 1995. 19, 129137.
5
Table 4. Eect of data compression on pattern matching when snapshot data are not compressed and historical data are com-
pressed.
Compression
method
Similarity
factor
Opt. N
P
p (%) (%)
max
(%) (%)
S
PCA
only 34 43 90 99 66
Original data S
dist
only 25 41 68 97 54
S F
14 75 72 88 74
S
PCA
only 48 26 76 100 51
Combination S
dist
only 40 25 67 99 46
S F
15 59 63 91 61
S
PCA
only 60 23 85 100 54
Averaging S
dist
only 16 52 57 92 54
S F
16 63 70 92 66
S
PCA
only 39 31 75 99 53
Wavelet S
dist
only 15 50 53 91 52
S F
14 68 67 88 68
S F = 0.67 S
PCA
+ 0.33 S
dist
(6) Bakshi, B. R. and Stephanopoulos, G. Compression of
Chemical Process Data Through Functional Approxi-
mation and Feature Extraction. AIChE J., 1996. 42,
477492.
(7) Misra, M.; Kumar, S.; Qin, S. J. and Seemann, D. Error
Based Criterion for On-Line Wavelet Data Compres-
sion. J. Process Control, 2001. 11, 717731.
(8) Watson, M. J.; Liakopoulos, A.; Brzakovic, D. and
Georgakis, C. A Practical Assessment of Process Data
Compression Techniques. Ind. Eng. Chem. Res., 1998.
37, 267274.
(9) Nelson, P. R. C.; Taylor, P. A. and MacGregor, J. F.
Missing Data Methods in PCA and PLS: Score Calcu-
lations with Incomplete Observations. Chemometrics
and Intel. Lab. Syst., 1996. 19, 4565.
(10) Roweis, S. EM Algorithms for PCA and SPCA. In
Neural Information Processing Systems 11 (NIPS98).
1997 626632.
(11) Singhal, A. Pattern Matching in Multivariate Time-
Series Data. Ph.D. Dissertation, University of Califor-
nia, Santa Barbara, CA, 2002.
(12) Krzanowski, W. J. Between-Groups Comparison of
Principal Components. J. Amer. Stat. Assoc., 1979.
74(367), 703707.
(13) Russo, L. P. and Bequette, B. W. Eect of Process De-
sign on the Open-Loop Behavior of a Jacketed Exother-
mic CSTR. Comput. Chem. Eng., 1996. 20, 417426.
(14) Johannesmeyer, M. C.; Singhal, A. and Seborg, D. E.
Pattern Matching in Historical Data. AIChE J., 2002.
48, 20222038.
6