Beruflich Dokumente
Kultur Dokumente
CORRELATION
METHODOLOGY
9
PETROLEUM REFINERY P ROCESS ECONOMICS
As we would expect (and shall see later), the actual operating results
from the literature will be scattered to some degree or other. This is due to
the complexity of the systems involved (a multitude of species of hydrocar-
bons) and, of course, uncertainties in observations (errors in measuring,
reading, recording), failure to attain true steady state, etc.
In correlating these data, we are attempting to find a relatively simple
expression (equation) to characterize the relationship between two or more
variables in a very complex system. From a consideration of the chemistry
involved, possible reaction mechanisms and kinetics, we may infer a possi-
ble relationship between a set of variables. A plot of the data on this basis
will indicate by the pattern of the points (trend and scatter) how well the
assumed relation fits the data.
Ordering the data (arranging by increasing or decreasing order) in
terms of one of the variables often helps to indicate a possible correlating
parameter. Plotting an independent variable against one of the dependent
variables may indicate the type of relationship (linear, quadratic, expo-
nential, etc.) between them—if any.
Once a possible relation is detected, the equation describing this
apparent relationship is usually determined by linear regression analysis—
or multiple linear regression analysis if more than one independent variable
is involved.
Before electronic computers (mainframe or personal computer [PC]),
this was a very tedious process—even with a calculator. Now however, with
ready access to PCs with very high speed and capacity, regression analysis
is quick and easy.
Spreadsheet programs such as Lotus, QuatroPro, Excel, etc., provide
great flexibility in the arrangement and manipulation of data (moving
columns, transforming data, etc.) and provide for automatic plotting of data
in addition to regression analysis capabilities.
In developing each correlation in this book, an attempt was made to dis-
cover a single independent variable as a basis for correlation. The degree to
which this was successful will be apparent from the graphs on which both
the raw data and the regression lines have been plotted. Frequently, it was
necessary to employ two or more independent variables to obtain a satisfac-
tory correlation. The results are summarized in the following tabulation:
10
CHAPTER 2 • CORRELATION METHODOLOGY
Independent Variables
Process
Yields Properties
11
PETROLEUM REFINERY P ROCESS ECONOMICS
where:
HPI represents HPI Consultants, Inc.
G & H represents Gary and Handwerk
DAO represents deasphalted oil
CCR represents continuous catalyst regeneration
LV represents liquid volume
12
CHAPTER 2 • CORRELATION METHODOLOGY
The regression output will give the value of the constant, the coeffi-
cient(s) of the variable(s), the coefficient of correlation, and the standard
error of the estimate of the dependent variable and of each of the coeffi-
cients, also the number of points and the degrees of freedom.
Having the equation of the regression line, one may calculate values
of the dependent variable for each of the sets. The difference between the
calculated value and the corresponding “observed” value may then be cal-
culated. The magnitude of the differences may point to certain data that do
not fit with the rest of the population. Reference to the source of these data
may suggest reasons for discarding these data. A plot of the data points
together with the regression line will give a visual indication of the appro-
priateness of the relation selected to represent the data.
It is not necessary to have a regression program to obtain the same
results. They can be calculated from the sums of the individual variables, of
their squares, and of their cross products. In the case of a first order or lin-
ear equation, this requires the sums of the following:
13
PETROLEUM REFINERY P ROCESS ECONOMICS
Significance of Results
The yield of full-boiling range gasoline in FCC has been chosen to illustrate
the significance of a regression analysis and the use to which it may be put.
Figure 2–1 is a plot of the data (382 points) for gasoline yield (Y) from FCC
vs. conversion (X) together with the regression line for the equation:
Y = a + bX + cX2
14
CHAPTER 2 • CORRELATION METHODOLOGY
Y = (a+d) + bX + cX2
16
CHAPTER 2 • CORRELATION METHODOLOGY
Notes
1. McElroy, E.E., Applied Business Statistics, Holden-Day, Inc., San
Francisco, Second Edition, 1979, p. 293
17