Beruflich Dokumente
Kultur Dokumente
DOI 10.1002/cem.2808
RESEARCH ARTICLE
1
College of Information Engineering, Zhejiang
University of Technology, Hangzhou, ZJ 310023, Parameter estimation of reaction kinetics from spectroscopic data remains an impor-
China tant and challenging problem. This study describes a unified framework to address
2
Center for Advanced Process Decision‐making, this challenge. The presented framework is based on maximum likelihood princi-
Department of Chemical Engineering, Carnegie
ples, nonlinear optimization techniques, and the use of collocation methods to solve
Mellon University, Pittsburgh, PA 15213, USA
3
the differential equations involved. To solve the overall parameter estimation
Small Molecule Design and Development, Lilly
Research Labs, Indianapolis, IN 46285, USA problem, we first develop an iterative optimization‐based procedure to estimate
Correspondence the variances of the noise in system variables (eg, concentrations) and spectral
Lorenz T. Biegler, Center for Advanced Process measurements. Once these variances are estimated, we then determine the concen-
Decision‐making, Department of Chemical, tration profiles and kinetic parameters simultaneously. From the properties of the
Engineering, Carnegie Mellon University, 5000
Forbes Ave. Pittsburgh, PA 15213, USA.
nonlinear programming solver and solution sensitivity, we also obtain the covari-
Email: lb01@andrew.cmu.edu ance matrix and standard deviations for the estimated kinetic parameters. Our pro-
posed approach is demonstrated on 7 case studies that include simulated data as
well as actual experimental data. Moreover, our numerical results compare well with
the multivariate curve resolution alternating least squares approach.
KE YWO RD S
506 Copyright © 2016 John Wiley & Sons, Ltd. wileyonlinelibrary.com/journal/cem J. Chemometrics 2016; 30: 506–522
CHEN ET AL. 507
In addition, more sophisticated techniques have been and it is difficult to determine the weight factors correspond-
adopted for the determination of kinetic parameters from ing to the regularization items; these affect the accuracy of
spectroscopic data. Such techniques rely on the availability kinetic parameter estimation significantly. In addition, the
of accurate pure component spectra to identify the com- hybrid method breaks down when the columns of the concen-
pounds participating in the reaction. The approach based on tration matrix (C in Equation 2) are dependent, because this
Gauss‐Newton‐Levenberg‐Marquardt18,19 is widely used to method requires (CTC) to be nonsingular. For the same rea-
adjust the kinetic parameters for minimizing the difference son, both Gauss‐Newton‐Levenberg‐Marquardt and MCR‐
between the measured and modeled absorbance spectra and ALS also encounter calculation problems for the dependent
usually has good performance.20–24 This kind of approach case. Finally, while all of the previous methods consider mea-
fixes the kinetic parameters and integrates the system of ordi- surement errors in the spectra, none of these approaches con-
nary differential equations (ODEs) to get the concentration sider the system noise in the reaction kinetic model, which
profiles. Then, the pure component spectra and the sensitivity directly affects the concentrations. This noise should not be
of the system with respect to the kinetic parameter can be conflated with measurement errors and must be captured
estimated to calculate the perturbed step of kinetic parameter with an extended maximum likelihood form.
to reduce the fitting error of the spectroscopic data. However, In this work, we develop a simultaneous parameter esti-
this approach is sequential and poorly suited to unstable and mation algorithm based on spectroscopic data for reaction
ill‐conditioned dynamic systems; it can lead to unbounded systems with both system variables and spectral measurement
state profiles and convergence difficulties.25 Another widely noise. In Section 2 we develop a simultaneous collocation
applied approach is the multivariate curve resolution alternat- approach for kinetic parameter estimation, where we assume
ing least squares (MCR‐ALS) method, which obtains the that variances for spectral measurement error and system
pure concentration profiles and spectra of all absorbing variable noise are known. However, because only spectra
species present in the raw measurements by using soft model- are available and concentrations are not measured directly,
ing constraints and kinetic model constraints.26,27 The con- this assumption cannot be applied directly. Because both
centration and spectra matrices are updated alternately by measurement error and system noise are conflated, their var-
separate least squares methods. Once the concentration iances cannot be determined independently. To address this
matrix is obtained by least squares, it is modified by issue, Section 3 extends the approach of Section 2 and pre-
nonnegativity constraints, unimodality constraints, closure sents an iterative optimization‐based strategy that determines
constraints, and other soft constraints. Then the ODE is fitted estimates of spectral measurement and system variances, on
to concentration data to get kinetic parameters. The spectra the basis of maximum likelihood concepts. This approach
matrix is also modified by these constraints before calculat- then leads to an overall parameter estimation strategy.
ing the concentration matrix with least squares. These alter- Following the parameter estimation, Section 4 then derives
nating iterations continue until the residual sum of squares covariance matrices and confidence limits of the estimated
has little change. Without derivative information and explicit parameters. The overall approach described in Sections 3
search directions towards minimizing the fitting error, this and 4 is then demonstrated on 7 cases with simulated data
procedure converges slowly and the precision of estimated as well as real measurements in Section 5. Our numerical
kinetic parameters may not be high enough. Moreover, in results are also compared with the MCR‐ALS GUI 2.0 tool-
another study,26 it is stated that the “hard‐ and soft‐modelling box.27 Finally, Section 6 summarizes the results and con-
approach equals or overcomes the performance of pure hard‐ cludes the paper.
modelling method and pure soft‐modelling method in any
case.” For this reason, as well as the ready availability of
2 | SIMU LTAN EOU S AP PROAC H F O R
MCR‐ALS, we will compare our proposed method to
K I N E T I C PA R A M E T E R E S T I M AT I O N
MCR‐ALS in this study.
Another hybrid approach proposed by Sawall et al28 com-
The spectra measured at each sampling time during the reac-
bines the model‐free curve resolution technique with a
tion can be arranged as rows to form a 2‐dimensional matrix.
model‐based kinetic regularization. Here, a low‐rank approx-
If there is no measurement error, then Beer‐Lambert's law
imation of the original spectroscopic data is computed by a
predicts the absorbance of a solution d(t, λ) at a particular
singular value decomposition in the first step and only a rota-
wavelength λ and given time t as the sum over all contribu-
tion matrix is determined to recover the concentration and
tions of dissolved absorbing species, ie,
spectra matrix. Then the regularized error between the con-
centration matrix and the solution of the kinetic model can d ðt; λÞ ¼ c1 ðt Þs1 ðλÞ þ c2 ðt Þs2 ðλÞ þ ⋯ þ cnc ðt Þsnc ðλÞ (1)
be incorporated to recover the spectra.14 The NL2SOL29
optimization code was used to minimize the regularized where ck(t) is the concentration for the kth species at time t,
objective function to adjust the rotation matrix and kinetic sk(λ) is the absorbance for the kth species with 1 unit concen-
parameters simultaneously. However, no guidelines were tration at time t, and nc is the number of absorbing species.
given to properly choose the structure of the rotation matrix, Assuming ntp is the number of sampling points in time and
508 CHEN ET AL.
∂F
1. Wk(0) = 0 (with probability 1). ðc−zÞ≈η (9)
2. For 0 ≤ s < t ≤ T, the random variable given by incre- ∂c
ment Wk(t) − Wk(s) is normally distributed with mean
0 and variance t − s; equivalently, Wk(t) − Wk(s) From Equation 9, we can show for t ∈ [0, T] that we have
~ (t − s)(1/2) N(0, 1), where N(0, 1) denotes a normally
the probability density function p(ω = c − z) ≈ det ∂F
∂c p(η),
31
dzðt Þ
¼ f ðzðt Þ; θÞ and we interpolate z(t) at the sampling times ti using Equation
dt 15. Equation 14 becomes
ck ðt i Þ ¼ zk ðt i Þ þ ωk ðt i Þ
nc (11)
2
d i;l ¼ ∑ ck ðt i Þsk ðλl Þ þ ζi;l ntp nwp nc ntp nc
k¼1 min∑ ∑ d i;l − ∑ ck ðti Þsk ðλl Þ =δ2 þ ∑ ∑ ðck ðt i Þ−zk ðti ÞÞ2 =σ 2k
i ¼ 1::ntp; l ¼ 1::nwp; k ¼ 1::nc i¼1 l¼1 k¼1 i¼1k¼1
K
s:t: ∑ l ̇m ðτÞzjm −hj ⋅f zjm ; θ ¼ 0; j ¼ 1::ne; m ¼ 1::K
m¼0
K
From Equation 11, we have zK ðt i Þ ¼ ∑ lm ðτÞzjm ; τ ¼ ti −tpj−1 = tpj −tpj−1
m¼0
ntp
nc
2
min∑ d i;l − ∑ zk ðt i Þsk ðλl Þ
i¼1 k¼1
(22)
s:t: S≥0
l ¼ 1; :::; nwp; k ¼ 1; :::; nc
4 | COVAR IA NCE A ND I NTE RP RE TAT I ON Suppose the data d^ are varied slightly, this would cause
O F PA RA M ET ER UNC ERTAI NT Y the minimum to shift from Θ ^ to Θ
^ þ δΘ, where we have
¼0 (31)
∂Θ
^
Θ¼ΘþδΘ
noise.
Here, we regard D as a random matrix, and D^ is an obser-
vation matrix. Note that J Θ; d^ is regarded as the reduced Equation 31 is expanded to first order, and then we have
objective of Equation 17, ie,
T ∂2 J ðΘ; dÞ
∂2 J ðΘ; dÞ
min J Θ; d^ ; d^ ¼ D
^ ;Θ ¼ ½CT ; ½ST ; θ (26) H δΘ þ B δd ¼ 0; H ¼
;B ¼ ∂Θ ∂d
θ ∂Θ2
Θ¼ Θ^ ^
Θ¼ Θ
d¼ ^d d¼d^
≤ε
J Θ; d^ −J Θ; (27) Hessian and calculated directly by sIPOPT, a code developed
for sensitivity analysis of nonlinear programming prob-
lems.37 B is a partial derivative matrix with respect to d and
where ɛ is the largest difference between risks that we are
Θ, and Vd is the covariance matrix of d (see Appendix C
willing to consider insignificant, and we have no reason to
^ over any other value of Θ, which satisfies Equation for the derivation of Equation 33 and expressions for B and
prefer Θ
Vd).
27. Moreover, the smaller is the max∥Θ−Θ ^ ∥ that satisfies
Remark: In this work, covariance matrix of the parame-
Equation 27, the better estimate Θ^ we have. ter estimates is used to evaluate the reliability of the parame-
The objective function of Equation 26 can be approxi- ter estimates.
mated by a second‐order expansion around the estimated
h i
^ With ∂J Θ; d^ = ∂Θ
1 T ∂2 J
5.1 | Cases with simulated data approach have good confidence levels. The concentration
profiles for species A, B, and C with the estimated parameter
5.1.1 | Reaction (A → B and B → C), no system noise 27
values and the profile bands corresponding to the standard
We consider a 2‐step consecutive reaction (A → B and
deviations are shown in Figure 2.
B → C) with an initial concentration of species A of
1.0 × 10−3 mol L−1. The data matrices were obtained by sim-
ulating the reaction mechanism with rates k1 = 2.0 second−1 5.1.2 | Reaction (A → B and B → C) with system noise
and k2 = 0.2 second−1. Three NIR spectra with considerable
A 2‐step consecutive reaction mechanism (A → B and
overlaps were used as pure species spectra of the 3 assumed
B → C) with rates k1 = 0.30 second−1 and k2 = 0.05
components. Measurement error was introduced by adding
second−1 is considered. The initial concentrations of species
normally distributed random numbers. The dimension of
A of 1.0 M L−1 and others are 0. System noise is considered
the data matrix D is 300 × 100.
in this case, and the concentration profiles for A, B, and C
In the first step, using the proposed variance estimation
were obtained by simulating the stochastic differential equa-
procedure in Figure 1, the variances of system noise (σk2)
tions with the approach proposed by Higham.30 Three NIR
for the 3 species were obtained as 1.5519 × 10−11,
spectra were simulated for the 3 individual species using
8.5460 × 10−11, and 6.1185 × 10−11 for A, B, and C, respec-
Gaussian peaks. Normally distributed white noise was added
tively. Note that the true variances are 0 for these species, so
to the data matrix. The dimension of the simulated data
the estimated variances are reasonable. The estimated vari-
matrix D is 401 × 301.
ance for measurement error (δ2) is 1.8731 × 10−6. Our vari-
In the first step, the proposed variance estimation pro-
ance estimation procedure converges in 2 iterations, and the
cedure in Figure 1 leads to variances of system noise
CPU time cost is 2.0 seconds.
(σk2) given by 4.2962 × 10−6, 1.1130 × 10−5, and
With the above estimated variances and initialization, the
1.0791 × 10−5 for A, B, and C, respectively. The true var-
solution of Equation 17 requires 10.35 CPU seconds and 21
iances for system noise are 2.7493 × 10−6, 1.5907 × 10−5,
iterations with optimal parameters k1 = 1.9735 second−1
and 1.1523 × 10−5. The estimated variance (δ2) for mea-
and k2 = 0.20071 second−1. The relative errors are 1.33%
surement error is 7.2544 × 10−6, and the true variance
and 0.36%, respectively. With MCR‐ALS,27 we obtain
for measurement error is 7.4075 × 10−6. We can see that
k1 = 1.8939 second−1 and k2 = 0.19840 second−1, and the
the estimated and true variances have the same order of
relative errors are 5.30% and 0.80%, respectively. Hence,
magnitude. The variance estimation procedure requires 2
the proposed approach performs better than MCR‐ALS.
iterations to converge, and the CPU time cost is
The covariance matrix of the estimates with respect to k
10.0 seconds.
was also evaluated, and the diagonal elements of the covari-
With the above estimated variances and initialization, the
ance matrix are shown as follows:
solution of Equation 17 requires 52.59 CPU seconds and 17
T iterations with optimal parameters k1 = 0.29678 second−1
diagðV Θ Þ ¼ 7:9510×10−4 4:0600×10−5
and k2 = 0.049831 second−1. The relative errors are 1.07%
σ k1 ¼ sqrt 7:9510×10−4 ¼ 0:028
and 0.34%, respectively. With MCR‐ALS,27 we obtain
σ k2 ¼ sqrt 4:0600×10−5 ¼ 0:0064 k1 = 0.30236 second−1 and k2 = 0.074513 second−1. The rel-
ative errors are 0.79% and 49.03%, respectively, and the rela-
The standard deviations for both k1 and k2 are small; tive error for k2 is very large. We also evaluate the covariance
hence, the kinetic parameters estimated by the proposed matrix of the estimates for k1 and k2 as follows:
FIGURE 2 Profiles for A, B, and C and profile bands corresponding to the standard deviations
CHEN ET AL. 513
FIGURE 3 Profiles for A, B, and C and profile bands corresponding to the standard deviations
514 CHEN ET AL.
The reaction, dissolution, and crystallization rate laws are normally distributed random numbers. The dimension of
shown as follows: the D matrix is 471 × 111.
Using the proposed variance estimation procedure in
r 1 ¼ k 1 cSA ðt ÞcAA ðt Þ Figure 1, we estimate system noise variances for SA, ASA,
r 2 ¼ k 2 cASA ðt ÞcAA ðt Þ and ASAA species (σk2) as 5.4538 × 10−8, 5.0461 × 10−7,
r 3 ¼ k 3 cASAA ðt ÞcH 2 O ðt Þ and 3.3012 × 10−7, respectively. The true variances are
r 4 ¼ k 4 cAA ðt ÞcH 2 O ðt Þ 2.2440 × 10−8, 1.8459 × 10−7, and 2.1517 × 10−7. The esti-
(35)
( d mated variance for measurement error(δ2) is 9.6868 × 10−8,
SA ðT Þ−cSA ðt Þ ; if mSA ðt Þ≥0
k d csat and the true value is 1.0 × 10−7. It can be seen that they have
rd ¼
0; if mSA ðt Þ<0 the same order of magnitude. The proposed variance estima-
c tion procedure requires 8 iterations, and the CPU time cost is
r g ¼ k c max cASA ðt Þ−csat ASA ðT Þ; 0
83.0 seconds.
With the above estimated variances and initialization, the
The change of concentration and mass for each liquid and solution of Equation 17 requires 9.63 CPU seconds and 7
solid component of the reaction system can be described by iterations to obtain the optimal parameters listed in Table 1.
the following system of ODEs: We also obtained the following covariance matrix for the esti-
mated parameters:
V̇
ṁSA ¼ −M SA Vr d ; ċSA ¼ rd −r 1 − cSA
V
V̇ ½9:2821×10−11 3:0548×10−8 1:6041×10−2
ċAA ¼ −r 1 −r2 −r 4 − cAA diagðV Θ Þ ¼
V
1:3796×10−3 4:6649×10−6 2:0728×10−5 1:2780×10−7 T
V̇
ċHA ¼ r1 þ r 2 þ r3 þ 2r4 − cHA
V
V̇ with standard deviations shown in the Standard Deviation
ṁASA ¼ M ASA Vr g ; ċASA ¼ r1 −r 2 þ r3 −r g − cASA (36) column of Table 1.
V
V̇ From Table 1, we can see that the relative errors for all the
ċASAA ¼ r 2 −r3 − cASAA parameters are within their confidence intervals and are
V
f V̇ therefore acceptable.
ċH2 O ¼ −r3 −r4 þ cin H2 O − cH O We also can see that the standard deviations are compat-
V V 2 !
ns 4 ible with the absolute errors for all the estimated parameters.
f
V ̇ ¼ V ∑ υi ∑ γ i;j r j þ γ i;d rd þ γ i;c rc þ εi cin The profiles for SA, ASA, and ASAA with the estimated
i¼1 j¼1 V H2 O
The comparison between the exact and estimated parameters
where γ are stoichiometric coefficients and ν are partial vol- TABLE 1
umes. To validate the accuracy of our method, we generate Absolute Relative Standard
Exact Estimated Error Error (%) Deviation
simulated absorbance data for known parameters,
k1 0.036031 0.036011 2.0 × 10−5 0.056 9.6 × 10−6
k1 = 0.036031, k2 = 0.15961, k3 = 6.8032, k4 = 1.8029,
k2 0.15961 0.15967 6.8 × 10−5 0.043 1.7 × 10−4
kc = 0.75669, kd = 7.1109, and CSAsat = 2.0627. Here, we
k3 6.8032 7.0390 0.24 3.5 0.13
add system noise η to produce concentration profiles for
k4 1.8029 1.8560 0.053 2.9 0.037
SA, ASA, and ASAA by solving the corresponding stochastic
kc 0.75669 0.76021 0.0035 0.46 2.2 × 10−3
differential equations. Also, the pure component spectra for
kd 7.1109 7.1073 0.0035 0.049 4.6 × 10−3
SA, ASA, and ASAA are the same as in the work of Joiner et −4
CSAsat 2.0627 2.0629 2.4 × 10 0.012 3.6 × 10−4
al,22 and the measurement error for D was introduced with
CHEN ET AL. 515
parameter values and the profile bands corresponding to the The evaluated system noise variances for the A and C spe-
standard deviations are shown in Figure 5. cies (σk2) are 2.4589 × 10−6 and 3.1297 × 10−15, respectively.
The estimated variance for measurement error (δ2) is
1.9455 × 10−5. The variance estimation procedure in Figure 1
requires 2 iterations, and the CPU time cost is 5.0 seconds.
5.2 | Parameter estimation with real measurements
With the above estimated variances and initialization, the
5.2.1 | Reaction (A + B → C + D) solution of Equation 17 requires 6.89 CPU seconds and 15
Consider the reaction of 4‐(trifluoromethyl) aniline (A) with iterations with optimal parameters k1 = 0.0066555 second−1.
benzaldehyde (B). The solvent is 4‐methyl‐2‐pentanone The variance of the estimates (σ2) with respect to k1 is
(MIBK). The reaction products are imine (C) and water 5.3399 × 10−19, and the standard deviation (σ) is 7.3 × 10−10.
(D). The temperature at which the reaction was performed The profiles for A and C with the estimated parameter values
is 22°C, and the reaction scheme is A + B → C + D. The vol- and profile bands corresponding to the standard deviations are
ume stays constant, and the active species A and C were shown in Figure 6. Because the standard deviation is so small,
monitored. The infrared spectra were recorded using a time the widths of the profile bands are nearly 0 and cannot be rec-
interval of 1 minute. The data size is 193 × 431. ognized in the figure.
FIGURE 5 Concentration profiles for SA, ASA, and ASAA with the estimated parameter values and profile bands corresponding to the standard deviations
516 CHEN ET AL.
FIGURE 6 Concentration profiles for A and C with the estimated parameter values and profile bands corresponding to the standard deviation
5.2.2 | Reaction (A + B → C + D) with chloroform as solvent (B). The intermediate adduct (C) is formed during the
We consider the same reaction as in the previous case but with reaction. The reaction products are 3‐chlorophenyl‐
chloroform as a solvent. Again, the reaction scheme is hydrazonocyanoacetamide (D) and the byproduct ethylene
A + B → C + D, the volume stays constant, and the active spe- sulfide (E). The temperature at which the reaction was
cies A and C were monitored. The infrared spectra were recorded performed is 25°C. The reaction scheme is A + B → C
using a time interval of 3 minutes. The data size is 331 × 431. and C → D + E. The active species A, C, and D were
The evaluated system noise variances for A and C species monitored. The UV‐visible spectra for each run were
(σk2) are 1.8170 × 10−6 and 2.6250 × 10−13, respectively. The recorded using time intervals of 10 seconds. The wave-
estimated variance for measurement error (δ2) is 3.1032 × 10−5. length range is 200 to 600 nm (wavelength resolution is
The variance estimation procedure in Figure 1 requires 2 1 nm). The total runtime is 2710 CPU seconds.21 The
iterations to converge, and the CPU time cost is 9.0 seconds. comparison between the proposed approach and MCR‐
With the above estimated variances and initialization, the ALS is shown in Table 2. The “Run Index” in Table 2 rep-
solution of Equation 17 requires 33.55 CPU seconds and 21 resents different experimental data sets.
iterations with optimal parameters k1 = 0.0053905 second−1. Bijlsma et al21 estimated the kinetic parameters with the
The variance of the estimates (σ2) with respect to k1 is Levenberg–Marquardt/PARAFAC method. The mean
1.7181 × 10−8, and the standard deviation (σ) is 1.3 × 10−4. values for k1 and k2 were reported as 0.3146 and 0.0258,
The profiles for A and C profiles with the estimated parameter and the corresponding standard deviations are 0.0092 and
values and profile bands corresponding to the standard devia- 0.0011. From Table 2, we can see that the mean values
tions are shown in Figure 7. evaluated by the proposed approach are closer to the values
reported by Bijlsma et al21 than those from MCR‐ALS,
especially the estimated value for k2. The profiles for A,
5.2.3 | Reaction (A + B → C and C → D + E)21 C, and D with the estimated mean parameter values and
Consider the reaction of 3‐chlorophenyl‐ the profile bands corresponding to the standard deviations
hydrazonopropane dinitrile (A) with 2‐mercaptoethanol are shown in Figure 8.
FIGURE 7 Concentration profiles for A and C with the estimated parameter values and profile bands corresponding to the standard deviation
CHEN ET AL. 517
TABLE 2 The comparison among the proposed approach and multivariate curve resolution alternating least squares (MCR‐ALS) for parameter estimation
based on real experiment data set
MCR‐ALS
Proposed Approach
Run Index k1 Standard Deviation k2 Standard Deviation CPU (s) k1 k2
1 0.31125 0.015 0.025819 0.0046 43.54 0.30946 0.027807
2 0.31015 0.012 0.026766 0.0035 81.18 0.29592 0.030658
3 0.30935 0.018 0.027232 0.0051 134.01 0.29320 0.031308
4 0.31519 0.012 0.027201 0.0030 70.91 0.30068 0.030813
5 0.30786 0.012 0.025151 0.0011 83.98 0.32234 0.036597
6 0.29377 0.011 0.029307 0.0031 78.68 0.28094 0.036662
7 0.32149 0.013 0.025268 0.0035 84.26 0.31103 0.026977
8 0.31714 0.013 0.028323 0.0022 68.99 0.31349 0.029298
9 0.31942 0.0080 0.025035 0.0011 101.89 0.32905 0.024903
10 0.29952 0.0054 0.026399 0.0015 58.90 0.30034 0.027889
Mean 0.31051 0.012 0.026650 0.0029 80.63 0.30565 0.030291
FIGURE 8 Concentration profiles for A, C, and D with the estimated mean parameter values and profile bands corresponding to the standard deviations
C ntp by nc concentration matrix ɛ the largest difference between risks that we are
d(ti, λl) absorbance of a solution at a particular wavelength willing to consider insignificant
λl and given time ti ζi,l entry in the ith row and lth column of E
di,l the entry in the ith row and lth column of matrix D ηk(t) system noise for the kth species
d^ vector form of D ^
η vector of discretized system noise
D ntp‐by‐nwp spectra matrix θ kinetic parameter vector
^
D observation for D, when D is regarded as a Θ parameter vector for the reduced version of Equa-
random matrix tion 17
E ntp‐by‐nwp noise matrix ^
Θ estimated value for Θ
f(c(t),θ) process function vector К diagonal matrix and the diagonal element is κk
fk(c(t),θ) process function corresponding to the kth κk disturbance intensity for the kth state variable
state variable
λl lth sampling wavelength point
F vector of discretized dc(t)/dt = f(c(t), θ)
νl2 variance for the lth column of spectra data D
hj length of jth finite element
σk2 variance for ηk(t)
H reduced Hessian τ relative value of t in finite element
^
J Θ; d reduced objective of Equation 17
τj jth Radau collocation point
lm(τ) Lagrange basis polynomials
ω error vector between c and z
l˙m ðτÞ derivative of Lagrange basis polynomials
nc number of absorbing species
ACKNOWLEDGEMENT
ntp number of sampling points in time
nwp number of sampling points for wavelength dimension This research was supported by the Lilly Research Award
Program. SG thanks Kevin Girard (Pfizer Inc) for his
N(·, ·) normal distribution
feedback to improve the title of this paper. We thank Carla
O infinitesimal of the same order Luciani, Michael Frederick, and Zhenqi (Pete) Shi (Eli Lilly)
p(·) the probability density function for their assistance in collecting data. We also thank Prof Paul
sk(λl) absorbance for the kth species with 1 unit concen- Gemperline for sharing data on the aspirin case study.
tration at λl
S nwp‐by‐nc absorbance matrix
APPENDIX A
tpj−1 beginning point of the jth finite element
Consider the general scenario
ti ith sampling time point 0
F 1;1 ≡c1;1 − c1;0 þ f 1 c1;0 ; c2;0 :::; cm;0
Vd covariance matrix of d B
B F 1;2 ≡c1;2 − c1;1 þ f 1 c1;1 ; c2;1 :::; cm;1
VΘ covariance matrix of Θ B
B
B :::
Wk(t) standard Wiener process B
B
B F ≡c −c
W(t) vector composed of Wk(t) B 1;n 1;n 1;n−1 þ f 1 c1;n−1 ; c2;n−1 :::; cm;n−1
B
B
z vector of discretized z(t) B F 2;1 ≡c2;1 − c2;0 þ f 2 c1;0 ; c2;0 :::; cm;0
B
B
z(t) concentration vector without system noise B F 2;2 ≡c2;2 − c2;1 þ f 2 c1;1 ; c2;1 :::; cm;1
B
B
K
z (t) interpolation polynomial vector F¼B B :::
B
B F ≡c −c
zjm value of z(t) at the mth collocation point of the jth B 2;n 2;n 2;n−1 þ f 2 c1;n−1 ; c2;n−1 :::; cm;n−1
B
finite element B :::
B
value of the kth component of z(t) at ti B
zk (ti) B
B F m;1 ≡cm;1 − cm;0 þ f m c1;0 ; c2;0 :::; cm;0
Z B
concentration matrix without system noise B
B F m;2 ≡cm;2 − cm;1 þ f m c1;1 ; c2;1 :::; cm;1
(corresponding to C) B
B
B :::
δ2 variance of ζi,l @
δd infinitesimal for d F m;n ≡cm;n − cm;n−1 þ f m c1;n−1 ; c2;n−1 :::; cm;n−1
T
δΘ infinitesimal for Θ c ¼ c1;1 ; ; c1;2 :::; ; c1;n ; ; c2;1 ; ; c2;2 :::; ; c2;n :::; ; cm;1 ; ; cm;2 :::; ; cm;n
CHEN ET AL. 519
2 3
0 0 0 ⋯ 0
6 ∂f 1 7 Then we have J(1)(i × n + 1, i × n + 1) = 1, J(1)(i × n + 2,
6− 0 0 ⋯ 07 i × n + 2) = 1, and J(1)(i × n + 1, k) = 0, if k is not equal
6 ∂c2;1 7
6 7 to i × n + 1 and i × n + 2, i = 0, ..., m − 1.
∂F 1;• 6 7
6 ∂f 1 7
¼6 0 − 0 ⋯ 07 Next, J(1) can be transformed to J(2) with the same deter-
∂c2;• 6 ∂c2;2 7
6 7 minant as J(1) by adding some times of the i × n + 2th row to
6 ⋮ ⋮ ⋮ ⋱ ⋮7
6 7 the i × n + 3th row to J(2) to eliminate the nonzero elements;
4 ∂f 1
5
0 ⋯ 0 − 0 the index i is from 0 to m − 1. J(2) is composed of the follow-
∂c2;n−1
ing submatrices.
2 3 For i ≠ j,
0 0 0 ⋯ 0
6 ∂f 1 7 2 3
6− 0 0 ⋯ 07 0 0 0 ⋯ 0
6 ∂c3;1 7 60
6
6
7
7 6 0 0 ⋯ 077
∂F 1;• 6 ∂f 1 7 ∂F i;• ð 2Þ 6 7
¼6 0 − 0 ⋯ 0 7; :::; 6 ⋯ 07
∂c3;• 6 ∂c3;2 7 ¼ 60 0 0 7
6 7 ∂cj;• 6 7
6 ⋮ ⋮ ⋮ ⋱ ⋮7 6⋮ ⋮ ⋮ ⋱ ⋮7
6 7 4 ∂f i
5
4 ∂f 1
5 0 ⋯ 0 − 0
0 ⋯ 0 − 0 ∂cj;n−1
∂c3;n−1
2 3
0 0 0 ⋯ 0 For i = j,
6 ∂f 1 7 2 3
6− 0 0 ⋯ 07
6 ∂cm;1 7 1 0 0 ⋯ 0
6 7 60
∂F 1;• 6
6 ∂f 1
7
7 6 1 0 ⋯ 077
¼6 0 − 0 ⋯ 07 ∂F i;• ð 2Þ 6 7
∂cm;• 6 ∂cm;2 7 6 ⋯ 07
6 7 ¼ 60 0 1 7
6 ⋮ ⋮ ⋮ ⋱ ⋮7 ∂cj;• 6 7
6 7 6⋮ ⋮ ⋮ ⋱ ⋮7
4 5 4 ∂f i
5
∂f ⋯ −1−
0 ⋯ 0 − 1 0 0 0
∂cj;n−1
1
∂cm;n−1
520 CHEN ET AL.
2 3
1 0 0 0 0 0 0 0 0
6 −1− ∂f 1 −
∂f 1
−
∂f 1
07
6 ∂c1;1
1 0
∂c2;1
0 0
∂c3;1
0 7
6 7
6 ∂f 1 ∂f 1 ∂f 1 7
6 0 −1− 1 0 − 0 0 − 07
6 ∂c1;2 ∂c2;2 ∂c3;2 7
6 7
6 07
6 0 0 0 1 0 0 0 0 7
6 ∂f ∂f ∂f 7
6
∂F 6 − 2 0 0 −1− 2 1 0 − 2 0 07
¼ 6 ∂c1;1 ∂c2;1 ∂c3;1 7
∂c 6 7
∂f 2 ∂f 2 ∂f 2 7
6 0 − 0 0 −1− 1 0 − 07
6 ∂c1;2 ∂c2;2 ∂c3;2 7
6 7
6 0 0 0 0 0 0 1 0 07
6 7
6 ∂f ∂f ∂f 7
6 − 3 0 0 − 3 0 0 −1− 3 1 07
6 ∂c1;1 ∂c2;1 ∂c3;1 7
4 5
∂f 3 ∂f 3 ∂f 3
0 − 0 0 − 0 0 −1− 1
∂c1;2 ∂c2;2 ∂c3;2
!
pffiffiffiffiffi
nc ntp ðck ðt i Þ−zk ðt i ÞÞ2 In a similar manner as with Equations B1 and 19, we get
lnðpðcjθÞÞ ¼ ∑ ∑ − ln 2π − lnðσ k Þ−
2σ 2k
k¼1i¼1
2
nc ntp
pffiffiffiffiffi 1 1 ntp 1 ntp nwp nc
¼ ∑ ∑ − ln 2π − ln ∑ ðck ðt m Þ−zk ðt m ÞÞ2 (B3) δ2 ¼ ∑ ∑ d i;l − ∑ ck ðt i Þsk ðλl Þ (B9)
k¼1i¼1 2 ntpm¼1 ntp⋅nwp i¼1 l¼1 k¼1
ntp⋅nc
−
2
Substituting Equation B9 into the objective function of
Equation 24, we obtain Equation 25.
Because σk2 is independent of index i in Equation B3,
Equation 19 becomes Equation 20.
APPENDIX C
When the values of zk(ti) are fixed, di,l can be described by
For Equation 26, matrix B in Equation 33 is
nc nc
di;l ¼ ∑ ck ðt i Þsk ðλl Þ þ ζi;l ¼ ∑ ðzk ðt i Þ þ ωk ðt i ÞÞsk ðλl Þ þ ζi;l −2Sðl; kÞ
k¼1 k¼1 Bððk−1Þ⋅ntp þ i; i þ ðl−1Þ⋅ntpÞ ¼
nc nc δ2
¼ ∑ zk ðt i Þsk ðλl Þ þ ∑ ωk ðt i Þsk ðλl Þ þ ζ i;l −2Cði; kÞ
k¼1 k¼1 (B4) Bðnc⋅ntp þ ðk−1Þ⋅nwp þ l; i þ ðl−1Þ⋅ntpÞ ¼
nc δ2
nc
2 2 i ¼ 1:::ntp; l ¼ 1:::nwp; k ¼ 1:::nc
p di;l jzk ðti Þ; k ¼ 1…nc eN ∑ zk ðt i Þsk ðλl Þ; ∑ sk ðλl Þ σ k þ δ 2
k¼1 k¼1 (C1)
i ¼ 1; …; ntp; l ¼ 1; …; nwp
Cov d i;l ; d q;p 22. Joiner DE, Billeter J, McNally MEP, Hoffman RM, Gemperline PJ. Compre-
nc nc hensive kinetic model for the dissolution, reaction, and crystallization process
involved in the synthesis of aspirin. J Chemometr. 2014;28(5):420–428.
¼E ∑ ωk ðt i Þsk ðλl Þ þ ζ i;l ∑ ωr t q sr λp þ ζq;p
k¼1 r¼1 23. Bijlsma S, Boelens HFM, Smilde AK. Determination of rate constants in sec-
nc nc ond‐order kinetics using UV‐visible spectroscopy. Appl Spectrosc. 2001;55
¼ E ∑ ∑ ωk ðt i Þsk ðλl Þωr ðt i Þsr ðλl Þ þ ζi;l
2 (1):77–83.
k¼1r¼1 24. Puxty G, Maeder M, Hungerbuhler K. Tutorial on the fitting of kinetics
nc models to multivariate spectroscopic measurements with non‐linear least‐
¼ ∑ σ 2k s2l;k þδ 2
squares regression. Chemom Intell Lab Syst. 2006;81(2):149–164.
k¼1
25. Biegler LT. Nonlinear Programming: Concepts, Algorithms, and Applications
(C5)
to Chemical Processes. Philadelphia: Society for Industrial and Applied
Mathematics;2010.
REFERENCES
26. Juan A, Maeder M, Martinez M, Tauler R. Combining hard‐ and soft‐
1. Roggo Y, Chalus P, Maurer L, Lema‐Martinez C, Edmond A, Jent N. A modelling to solve kinetic problems. Chemom Intell Lab Syst. 2000;54
review of near infrared spectroscopy and chemometrics in pharmaceutical (2):123–141.
technologies. J Pharm Biomed Anal. 2007;44(3):683–700.
27. Jaumot J, Juan A, Tauler R. MCR‐ALS GUI 2.0: new features and applica-
2. Quinn AC, Gemperline PJ, Baker B, Zhu M, Walker DS. Fiber‐optic UV/vis- tions. Chemom Intell Lab Syst. 2015;140(15):1–12.
ible composition monitoring for process control of batch reactions. Chemom
28. Sawall M, Borner A, Kubis C, Selent D, Ludwig R, Neymeyr K. Model‐free
Intell Lab Syst. 1999;45(1‐2):199–214.
multivariate curve resolution combined with model‐based kinetics: algorithm
3. Ng DJW, Assirelli M. Mixing study in batch stirred vessels using a fibre‐ and applications. J Chemometr. 2012;26(10):538–548.
optic UV‐VIS monitoring technique: a novel method. Chem Eng Res Des.
29. Dennis JE, Gay DM, Welsch RE. NL2SOL—an adaptive nonlinear least
2007;85(10):1348–1354.
squares algorithm. ACM Trans Math Softw. 1981;7(3):369–383.
4. Gurden SP, Westerhuis JA, Smilde AK. Monitoring of batch processes using
30. Higham D. An algorithm introduction to numerical simulation of stochastic
spectroscopy. AIChE J. 2002;48(10):2283–2297.
differential equations. SIAM Rev. 2001;43(3):525–546.
5. Lawton WH, Sylvestre EA. Self modelling curve resolution. Technometrics.
31. Bard Y. Nonlinear Parameter Estimation. New York and London: Academic
1971;13(3):617–633.
Press;1974.
6. Lawton WH, Sylvestre EA, Maggio MS. Self modelling nonlinear regression.
32. Biegler LT, Cervantes AM, Wächter A. Advances in simultaneous strategies
Technometrics. 1972;14(3):513–532.
for dynamic process optimization. Chem Eng Sci. 2002;57(4):575–593.
7. Sylvestre EA, Lawton WH, Maggio MS. Curve resolution using a postulated
33. Kameswaram S, Biegler LT. Convergence rates for direct transcription of opti-
chemical reaction. Technometrics. 1974;16(3):353–368.
mal control problems using collocation at Radau points. Comput Optim Appl.
8. Hamilton JC, Gemperline PJ. Mixture analysis using factor analysis. II: self‐ 2008;41(1):81–126.
modeling curve resolution. J Chemometr. 1990;4(1):1–13.
34. Keller HR, Massart DL. Evolving factor analysis. Chemom Intell Lab Syst.
9. Tauler R, Smilde A, Kowalski B. Selectivity, local rank, three‐way data analysis 1991;12(3):209–224.
and ambiguity in multivariate curve resolution. J Chemometr. 1995;9(1):31–58.
35. Maeder M. Evolving factor analysis for the resolution of overlapping chro-
10. Tauler R. Multivariate curve resolution applied to second order data. Chemom matographic peaks. Anal Chem. 1987;59(3):527–530.
Intell Lab Syst. 1995;30(1):133–146.
36. Marsili‐Libelli S, Guerrizio S, Checchi N. Confidence regions of estimated
11. de Juan A, Tauler R. Multivariate curve resolution (MCR) from 2000: progress parameters for ecological systems. Ecol Model. 2003;165(2‐3):127–146.
in concepts and applications. Crit Rev Anal Chem. 2006;36(3‐4):163–176.
37. Pirnay H, Lopez‐Negrete R, Biegler LT. Optimal sensitivity based on IPOPT.
12. Ruckebusch C, Blanchet L. Multivariate curve resolution: a review of Math Program Comput. 2012;4(4):307–331.
advanced and tailored applications and challenges. Anal Chim Acta.
38. Fourer R, Gay DM, Kernighan BW. AMPL: a modeling language for mathe-
2013;765:28–36.
matical programming. 2nd ed. Cengage Learning: Boston; 2002.
13. Gemperline PJ, Cash E. Advantage of soft versus hard constraints in self‐
39. Wachter A, Biegler LT. Line search filter methods for nonlinear program-
modeling curve resolution problems. Alternating least squares with penalty
ming: motivation and global convergence. SIAM Journal on Optimization.
functions. Anal Chem. 2003;75(16):4236–4243.
2005;16(1):1–31.
14. Neymeyr K, Sawall M, Hess D. Pure component spectral recovery and
40. Waechter A, Biegler LT. Line search filter methods for nonlinear program-
constrained matrix factorizations: concepts and applications. J Chemometr.
ming: local convergence. SIAM Journal on Optimization. 2005;16(1):32–48.
2010;24(2):67–74.
41. Waechter A, Biegler LT. On the implementation of a primal‐dual interior point
15. Abdollahi H, Tauler R. Uniqueness and rotation ambiguities in multivariate
filter line search algorithm for large‐scale nonlinear programming. Mathemat-
curve resolution methods. Chemom Intell Lab Syst. 2011;108(2):100–111.
ical Programming. 2006;106(1):25–57.
16. Westerlund T, Karrila S, Makila PM, Brink A. Techniques in the maximum
likelihood estimation of the covariance matrix. Control Dyn Syst.
1996;76:323–339. SUPPORTING INFORMATI ON
17. Vaia A, Sahinidis NV. Simultaneous parameter estimation and model structure
determination in FTIR spectroscopy by global MINLP optimization. Comput Additional Supporting Information may be found online in
Chem Eng. 2003;27:763–779. the supporting information tab for this article.
18. Levenberg K. A method for the solution of certain problems in least squares.
Q J Appl Math. 1944;2(2):164–168.
19. Marquardt DW. An algorithm for least‐squares estimation of nonlinear param- How to cite this article: Chen, W., Biegler, L. T., and
eters. J Soc Ind Appl Math. 1963;11(2):431–441. Muñoz, S. G. (2016) An approach for simultaneous
20. Maeder M, Zuberbuhler AD. Nonlinear least‐square fitting of multivariate estimation of reaction kinetics and curve resolution
absorption data. Anal Chem. 1990;62(20):2220–2224.
from process and spectral data, J. Chemometrics,
21. Bijlsma S, Louwerse DJ, Smilde AK. Estimating rate constants and pure UV‐
30: 506–522. doi: 10.1002/cem.2808
vis spectra of a two‐step reaction using trilinear models. J Chemometr.
1999;13(3‐4):311–329.