Beruflich Dokumente
Kultur Dokumente
A method was developed, in which the integral areas of resonance signals in 12 regions of the 1H nuclear
magnetic resonance (NMR) spectra of 5 widely different Colombian crudes and a crude mixture were correlated
with selected physicochemical properties and standard distillation cut yields. The crudes used ranged from
very light to heavy aromatics, thus covering a wide range of chemical compositions. The relationship between
the NMR spectra and the properties of interest was obtained with the partial least-squares (PLS) algorithm.
High correlation values were obtained for all of the PLS models used. The R2 values of the prediction models
for the physicochemical properties and SimDis results were >0.95, while the CV-q2 values fluctuated between
0.74 and 0.99. The properties of an unknown crude or crude mixtures can be obtained from their 1H NMR
spectra and the use of the PLS models developed in this work. The correlation method is very fast (1 h instead
of several weeks) and much less costly than the conventional ones.
was found that the use of five factors (C1-C5) in the first case while the yields of the different cuts were obtained from the
(physicochemical and product yield) provided the best prediction multivariate PLS method.14 Both methods are part of the PLS
accuracy, while in an second one (SimDis analysis), only three module of Statistica 6.0.17
factors and their products were found to give the best results. A
lower and higher number of factors generated a lower accuracy in Results and Discussion
all cases. Initially, in all of these models using the PLS method, R2
was >0.99, employing the five factors obtained in the training step Table 3 shows the yields of the different fractions selected
(see Table 7). The high value of R2 led us to think that some in this work for the evaluation of each crude, and Table 4 shows
overfitting may be present. To avoid this problem, the statistical F the yields of the most important refinery distillation products
method was employed.16 To select the optimal model in each case,
at standard cut points, obtained from the crude oil simulated
we computed the predicted residual sum of squares (PRESS) for
cross-validated models with various numbers of factors h. The distillation curve SimDis.12,13 These results have to be similar
model with the minimum PRESS is used as a benchmark, and the to those obtained from the sum of the different cut yields of
number of factors associated with it was designated by h*. All Table 3 included in each product. Figure 2 shows the SimDis
models with h < h* were compared against this benchmark. The curves of crude 2 and its fractions.
smallest model (fewest number of h factors), such that its PRESS As seen in Figure 2, the overlap between the different cuts
was not significantly greater than the PRESS for the model with of crude 2 is small; therefore, the yield and the properties
h* factors, was found this way. The selection started by computing measured correspond to the hydrocarbon families present and
F(h) ) PRESS (model with h factors)/PRESS (model with h* are representative of that distillation range. The distillation
factors) for h ) 1, 2, ..., h*. Then, in the next step, the optimal
curves obtained for all of the remaining crudes and its cuts were
number of factors (the smallest h) was chosen, such that F(h) <
FR;m,m, where FR;m,m is the (1 - R) percentile of Snedecors F very similar to that of crude 2 and are not shown.
distribution with m and m degrees of freedom (m ) number of Table 5 shows the physicochemical properties of the
calibration samples). If the prediction errors have a 0 mean and crudes studied in this work. The American Petroleum Institute
are mutually independent and normally distributed, Prob {F(h) > (API) gravity values,11-13 KUOP factor,11-13 wax content, and
Fa;m,m|h2 ) h*2} ) 2R. Here, h2 and h*2 are the prediction error ion chromatography (IC) values11,13 confirm that crude 1 is the
variance of the model with h and h* factors, respectively, and R is most paraffinic one, while crude 6 is the most aromatic one.
a probability value to be selected. In our case, we used R ) 0.25 This classification shows that S, N, V, and Ni content, insol-
to have a reasonable balance between over- and under-fitting.16 ubles in nC7 and nC5, and the micro carbon residue (MCR)13
To estimate prediction statistics, a leave-one-out cross-validation increase progressively when the paraffinic hydrocarbons de-
(CV) regression method was employed.14 The best prediction model
is the one with the larger CV-q2. The resulting models for the
crease and the aromatic resins and asphaltenes increase, which
correlations are shown in the next section. is corroborated with the increase in the density, mean aver-
The correlation between physicochemical properties and NMR age boiling point (MeABP) value,13 and viscosity. Crude 4
spectra of crudes was determined using the univariate PLS method, shows the lowest pour point and the largest acid number,
confirming that it is the more naphtenic of the studied
(16) Haaland, D.; Thomas, E. Anal. Chem. 1988, 60, 1193-1202. Colombian crudes.
1H NMR Spectra of Colombian Crude Oils Energy & Fuels, Vol. 21, No. 3, 2007 1677
Equations 1-5 show the resulting polynomials for the Tables 6 and 7 show the values of R2 and CV-q2 found for
prediction of the crude distillation products obtained with the correlation of the standard cut product yields. The prediction
PLS correlation from 1H NMR data. for the yield of the jet A and vacuum residue is excellent (R2 >
0.98, and CV-q2 > 0.96) and similarly for naphta, diesel, and
percent naphtas ) -55.34 - 3.5532C1 + vacuum gas oil (R2 > 0.98, and CV-q2 > 0.90). All of the cuts
6.349 17C2 - 22.2954C4 (1) showed overlap; therefore, they have a higher correlation. This
was particularly noticeable for the heavier crudes, where an ad
percent jet A ) -21.5783 + 2.397 82C1 - hoc interpolation was necessary to obtain all of the gas oil cuts
0.0289 511C1C1 (2)
at the same final temperature.
percent diesel ) -21.1064 + 3.520 07C1 - Equations 6-21 show the resulting polynomials for the
0.074 790 3C1C1 (3) prediction of the physicochemical properties and SimDis
percent gas oil ) -29.0943 + 6.5898C1 - 0.184 131C1C1 curves obtained with the PLS correlation17 from 1H NMR
(4) data.
percent residue ) 80.8482 - 1.355 31C1 -
0.073 263 6C1C1 (5) (17) Statistica 6.0. StatSoft, Inc.: Tulsa, OK.
1678 Energy & Fuels, Vol. 21, No. 3, 2007 Molina V. et al.
may be achieved with the PLS method. Figure 3 shows the experimental and predicted SimDis values for three of the six crudes
studied in this work.
The correlations obtained between the measured and predicted values by the PLS method using eqs 13-21 are seen in Figure 3.
The same type of high-quality fitting was obtained with the PLS method with crudes 2, 4, and 6 and is not shown for the sake of
brevity.
Once the basic correlation using these well-known crudes is set, the prediction of the product yields and physicochemical properties
requires the determination of the 1H NMR spectrum of the crude or mixture under investigation. The total time for such determination
will be, in general, around 60 min per sample. Both off- and online application of this new methodology can be envisioned for
refinery feedback control and optimization.
Conclusion
The use of 1H NMR spectra provides a very fast and relatively inexpensive method to predict the physicochemical properties and
the refining product yields for a wide variety of crudes and their mixtures using the PLS regression method.
Acknowledgment. The authors thank to the Unidad de Investigacion of Ecopetrol-Instituto Colombiano del Petroleo for gen-
erous financial support (Convenio number 004 2003) and allowing for the publication of this paper. The authors also thank Drs.
Ricardo Fierro from Universidad Nacional de Colombia and Winston Quinonez from Universidad de Antioquia for the NMR spectra
and Jairo Rene Martnez from Universidad Industrial de Santander for acting as the University Coordinator of the Convenio with
Ecopetrol-ICP.
Appendix
The procedure for a particular property (percent naphta) will be outlined here to show the different steps involved in the correlation
between the areas of the NMR signals and properties. The notation employed is such that the area NMR1 (H1) of crude 1 is 1H1,
while area 2 for the same crude is 1H2, etc. and for crude 2 and H1 area it will be 2H1, etc. These values formed matrix X, while
[ ] []
the different properties generated matrix q as shown in eq A1.
1H1 1H2 1H3 1H4 1H5 1H6 1H7 1H8 1H9 1H10 1H11 1H12 33.6
2H1 2H2 ... 2H12 30.0
3H1 9.8
X) q) (A1)
... 7.5
... 8.0
6H1... ... 6H12 4.5
The application of PLS produces the weights (wt) for the different areas. In the present case, the results are shown in
Table A1.
1680 Energy & Fuels, Vol. 21, No. 3, 2007 Molina V. et al.
With Table A1 and matrix X, the factors (Ci) were calculated as followed:
C1 of crude 1 ) (-0.323 153 1H1) + (-0.327 058 1H2) + (-0.231 773 1H3) + ...
(0.299 659 1H12*) ) 24.181 637 14
C3 of crude 4 ) (0.039 645 4H1) + (-0.075 135 4H2) + (-0.395 929 4H3) + ...
+ (0.249 223 4H12) ) -6.216 845 619...
The values for the Cis for each crude are shown in Table A2.
Table A2. Cis Factors
C1 C2 C3 C4 C5 percent naphtas
crude 1 24.181 637 14 -5.324 013 79 -13.303 423 45 13.955 121 76 -5.268 423 249 33.6
crude 2 23.823 380 33 -7.882 114 39 -11.475 055 96 15.662 478 95 -5.192 700 979 30.0
crude 3 20.291 228 92 -3.902 492 67 -11.250 672 53 14.354 718 37 -4.554 571 469 9.8
crude 4 17.234 296 56 -3.725 528 32 -6.216 845 619 15.025 922 67 0.289 819 827 7.5
mixture 16.768 961 67 -3.435 588 48 -6.772 319 875 14.010 889 33 -2.030 028 194 8.0
crude 6 13.718 716 74 -4.110 847 78 -5.546 934 884 13.589 306 92 -1.646 006 628 4.5
C1-C5 are taken as independent variables and the percentage of naphtas is dependent for a MLR regression. Several models with
different Cis were considered. To each of them, leave one out (LOO), CV-q2, and the F factor analysis were applied. The chosen
model had to comply with the F factor criteria and show the largest R2, and its CV-q2 value had to be the closest to 1. The resulting
equation for the percentage of naphtas is shown in eq 1. All of the other models (eqs 2-21) were obtained by a similar procedure.
EF060335N