Beruflich Dokumente
Kultur Dokumente
ROI BAER
Chaim Weizmann Institute of Chemistry and Lise Meitner Center for Quantum Chemistry,
the Hebrew University of Jerusalem, Jerusalem 91904 Israel.
DANIEL NEUHAUSER
Department of Chemistry and Biochemistry, University of California, Los Angeles CA 90095
USA.
Abstract
Shifted contour (SC) auxiliary field Monte Carlo (AFMC) for electronic structure is described
in considerable algorithmic detail. Several applications demonstrate the high accuracy achiev-
able in molecular electronic structure, including excited states, using the techniques of corre-
lated sampling and variational AFMC. SC-AFMC is also applicable to large Hubbard lattices
of strongly correlated electrons. Applications are shown, up to lattices of size 16×16 with 202
electrons. Finally, we present a method for exact coarse graining of the AFMC integrand.
1 Introduction
2 Basic Theory
= δ ss 'δ jj ' , cˆ js , cˆ j ' s ' = cˆ†js , cˆs†' j ' = 0 . Using the creation and destruction opera-
+ +
and:
Vijkl = ∫ φi ( r ) φ j ( r ) v ( r, r ' ) φk ( r ' ) φl ( r ') d 3 rd 3 r ' , (2.5)
where:
e2
v ( r, r ') = . (2.6)
r −r'
Our goal is to develop methods for computing various observables concerning the
electrons. There are many options to strive for in this context. One of the most im-
portant observable, for studying the chemical properties of molecules, is the ground-
state energy. Other important quantities are low-lying excited-state energies, charge
distributions and AC/DC linear-response properties. Another worthy goal, useful
particularly for large systems, is the electronic canonical partition function
{ } from which the thermodynamic properties of the electrons
Z c ( β , N e ) = trNe e − β H
ˆ
may be inferred.
In most of these objectives, the Boltzmann operator e − β Ĥ plays a central role.
With a large inverse temperature, it can be used to project the low energy manifold
of the molecule. For higher temperatures, it can be used to compute the partition
function or to trace thermodynamic expectation values. We therefore concentrate in
this review on the development of computational methods for accurately estimating
matrix elements of the Boltzmann operator. Our goal is towards the low tempera-
tures (large β values).
It is convenient to use the friendliest, Ne-electron wavefunctions - the Slater de-
terminants. These wavefunctions describe Ne electrons that are occupying a set of Ne
spin-orbitals f1 ( x ) … f Ne ( x ) where x ≡ ( r, ξ ) is a space-spin single electron co-
ordinate:
f1 ( x1 ) " ( )
f1 x Ne
(
Φ f x1 ,… , x Ne = ) 1
Ne !
$ % $ . (2.7)
f1 ( x1 ) " ( )
f Ne x Ne
In terms of our orthonormal basis orbitals, φi ( r ) , we may write:
N
fα ( x ) = ∑ φ (r )δ ξ F
j =1
j s js ,α α = 1… N e . (2.8)
s =↑↓
with which we define the 2nd quantization analog of the determinantal wave func-
tion:
Φ f1" f Ne ⇔ f1† " f N†e 0 . (2.10)
As described above, our general goal is then to develop a computational scheme
enabling the computation of the Boltzmann operator matrix elements:
Φ g e − β H Φ f = 0 g Ne " g1e − β H f1† " f N†e 0 .
ˆ ˆ
(2.11)
These quantities will be used in this chapter to mainly compute the groundstate en-
ergy of the molecule:
ˆ − β Hˆ Φ
Φ g He
Egs = lim
f
− β Hˆ
, (2.12)
β →∞
Φg e Φf
where Φ f and Φ g are any determinantal wave functions not orthogonal to the true
ground state.
ward to understand why this is so. Assume the orbitals are orthonormal (this is not
essential, just convenient). The wavefunction Φ ( β ) = e− β T Φ f solves the differen-
ˆ
tial equation:
dΦ ( β )
= −Tˆ Φ ( β ) Φ(β ) = Φ f . (2.13)
dβ
We show that Φ ( β ) = Φ f ( β ) leads to an acceptable solution, which because of
uniqueness is the only solution. For this, substitute Φ ( β ) = Φ f ( β ) on both sides of
cal situation that when the particles do not interact with each other, they respond to
one-body Hamiltonians in a completely independent way.
The reader should appreciate that this is quite a remarkable result, by no means
trivial. It has far-reaching consequences. The Boltzmann operator of a one-body
Hamiltonian T̂ is a many-body operator, i.e. it has 2-body, 3-body etc., terms built
into it – as can be seen by simply expanding it in a Taylor type series
∞
(
e − β T = ∑ − β Tˆ )
ˆ n
n ! . Yet, its operation on a determinant is remarkably simple: the
n=0
wavefunction stays a determinant!
All this simplicity is erased however, when a two-body interaction is present.
Now the particles are dependent. The picture is complicated. Starting from a deter-
minant at β = 0 , the wavefunction at any β > 0 is not simple at all it is a linear
combination of a large number of determinants. The number of possible determi-
nants grows very rapidly, factorially, with system size and with number of electrons
and exponentially with β .
Next, let us use a shorthand notation, defining the composite index I = ( ij ) (some-
times we will also write K = ( kl ) ) and write the Hamiltonian in the following ele-
gant way:
1
Hˆ = t T ρˆ + ρˆ T V ρ . (2.18)
2
Here, t and ρ̂ are considered column vectors of some dimension M = N × N in
the index I, respectively of the elements:
t I = tij ρˆ I = ρˆ ij . (2.19)
The elements of matrix V are defined as follows:
VIK = Vijkl . (2.20)
We use the mathematical notation:
AT B = ∑ AI BI . (2.21)
I
A particularly important fact concerning the matrix VIK is that it is “positive defi-
nite”, a fancy name for saying that all its eigenvalues are positive. Physically this
comes from the simple fact that electrons repel each other with an ever-positive
monotonically decreasing e 2 r12 potential. Mathematically, this can be proved by
showing that VIK is an overlap matrix VIK = GI GK for some set of functions GI .
Overlap matrices are always positive definite: they can be diagonalized by simply
orthogonalizing the functions GI . Their eigenvalues are then just the norms of the
orthogonal functions and norms are always positive!
By going to Fourier space, we can find exactly what the functions GI that form
the matrix VIK are. Define:
FK ( q ) = ( 2π ) φk ( r ) φl ( r ) d 3 r
−3
∫e
iq ⋅r
. (2.22)
φk ( r ) φl ( r ) = ∫ FK ( q ) e −iq⋅r d 3q
Next, use the fact that for q ≠ 0 :
eiq⋅r 3 4π
∫ r d r = q2 . (2.23)
And then:
VIK = 4π e 2 q −1 FI q −1 FK , (2.24)
showing that VIK is an overlap matrix composed of the functions
GI ( q ) = 4π e 2 q −1 FI ( q ) and therefore it must be positive definite.
where:
L
− 12 ∑ σ lT V σ l ∆β
l =1
e
W [σ ] = L
. (2.36)
− 12 ∑ σ lT V σ l ∆β
∫e l =1
Dσ
In a similar way, an energy element can be written as:
ˆ − β Hˆ Φ ≈
Φ g He f
. (2.37)
∫ W [σ ] Φ g
ˆ ˆ "Uˆ Φ Dσ
HU σL σ1 f
Let us now sketch the algorithm for computing the overlap and energy of a mole-
cule. In a single reference case the two determinants on the left and right are identi-
cal and we define:
S ( β ) = Φ f e − β Ĥ Φ f , (2.38)
H ( β ) = Φ f He
ˆ − β Hˆ
Φf . (2.39)
Note that in principle, H ( β ) = − dS d β , however we will describe a direct method
too. Once these quantities are known, the ground state energy can be approximated
as:
Algorithm (AFMC)
1) Basis application on the Hartree-Fock molecular orbitals:
fα ( r ) = ∑ φi ( r ) Fiα . Set iteration counter N iter = 0 .
i
IJ
J =1
where L is a lower triangular matrix (and therefore easily invertible) ob-
tained from the Cholesky[25] factorization V = LT L .
4) Create the Hamiltonian:
( hσ )ij = ( hσ )I = t I + i ∑ VI , K σ K . (2.41)
K
6) l ← l + 1. If l < L go to step 3.
7) Compute S ← S + det F † UF (the reader is referred to the appendix for de-
L −1
tails on matrix element evaluation). Notice that the factors ∏ ( Cd
l =0
M
σ l ) in
Eq. (2.33) are already contained implicitly since the random numbers are
drawn from normalized distributions.
S × Niter + det F † UF
8) Update average S← and iteration number
N iter + 1
0.2
0
Energy per site
-0.2
-0.4
-0.6
-0.8
-1
0 0.5 1 1.5 2
β
FIG. 1 AFMC (squares) and SCAFMC (circles) energies of the 2-site 2-electron Hubbard model as a
function of the inverse temperature, after 16000 iterations. δt = 0.02. U = 4, t = 1.
S (β ) =
−∆β ( t + iV σ ) ρˆ
T
Φg e Φf
W (σ )
= Φ g 1 − i ∆β σ T V ρˆ Φ f + O ( ∆β )
W
The middle term is zero because σ = 0 . However, it has fluctuations. The Gaus-
sian distribution ensures that σ T V σ ∆β ≈ 1 , so the fluctuations are of magnitude
proportional to ∆β . Apart from the constant term Ψ Φ , the dominant part of
the signal is proportional to ∆β . Thus, the noise to signal ratio is:
noise 1
∝ , (2.44)
signal ∆β
which is formally infinite.
Algorithm (SC-AFMC)
1) Basis application to the Hartree-Fock molecular orbitals:
fα ( r ) = ∑ φi ( r ) Fiα α = 1," , N e . Set iteration counter Niter = 0. Note,
i
The huge effect of the shift on the statistics of a 2-site Hubbard model is seen in
FIG. 1. We discuss more in detail the shifted contour and the Hubbard model in the
next section. For a molecular system, where there is no real discrete auxiliary field
analog, the shifted contour makes a huge difference; we discuss some examples in
section IV.
The formalism of a new method to handle the sign problem is presented in Ap-
pendix B. This method is relies on the concept of coarse graining. Applications us-
ing this method will be discussed in a future publication.
In the Hubbard model, the electron structure problem is studied in a simplified con-
text: electrons “live” on a discrete (say, a 2 dimensional) lattice. There is a coupling
t between nearest neighbor sites, so that electrons can hop from one site to its
neighbor. The sites on the lattice are all equivalent so that the model has discrete
translational symmetry and periodic boundary conditions are imposed. The electron-
electron interaction is also very simplified. It is short ranged, so electrons on differ-
ent sites do not interact. Electrons on the same site (and necessarily of opposite
spins) repel each other, increasing the energy by U units. The Hubbard Hamilto-
nian is then:
Hˆ = −t ∑ ∑ cˆσ cˆσ
†
j i + U ∑ nˆ↑ j nˆ↓ j . (3.1)
i, j σ=↓↑ j
In this expression, the first summation, in the so-called “one body term”, is per-
formed over all nearest-neighbor pairs of sites on the lattice. In the second Hamilto-
nian term, the electron number operators are nˆ js = cˆ†js cˆ js . This term contains explicit
particle-particle interactions and is thus called “a two-body term”. We impose peri-
odic boundary conditions on the system, making all sites equivalent with transla-
tional invariance. This means that sites on opposite boundaries of the lattice are con-
sidered “nearest neighbors”. For a given lattice (say, a N × N 2D square lattice), all
groundstate properties of the system depend on only three quantities: the ratio U t
(here, we agree to take t = 1 in all subsequent applications) and the densities of
electrons ns = N s M for s =↑, ↓ , where M is the number of sites on the lattice. In
this review, we treat only closed shell systems, for which all electrons are paired, so
that we need consider only two parameters: the parameter U and the electron
density, n M . In Hubbard model terminology, sometimes, people use the term
“filling” which is the ratio of the number of electrons to the maximal number of
electrons that can be accommodated by the lattice (2N2). Thus, the case of density 1
is termed, “half filling”. We should mention, en passé, that the 1D Hubbard problem
(i.e. N × M lattices where M is small, typically M = 1, 2 or 3) can be treated using
the density matrix renormalization group [26]. Lattice models of dimensionality
greater than 2 are usually considered less difficult, because mean-field theories,
such as the Hartree-Fock theory, perform well there.
Because for fermions nˆ 2js = nˆ js , the interaction term in the Hamiltonian can be writ-
ten as nˆi ↑ nˆi ↓ = 1
2 (ρ 2
ii − ρii ) . This shows, by comparing with Eqs. (2.16) and (2.2)
that the Hubbard model is an electronic structure type Hamiltonian with a particular
choice of the parameter matrices Tij and Vijkl :
Tij = −t ∆ ij Vijkl = U δ ij δ kl δ ki . (3.2)
-0.850
-1.080
-0.900
-1.090
Exact
-0.950 0.3 0.4 0.5 0.6 0.7
β
-1.000
-1.050
-1.100
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
β
FIG. 2: The SC-AFMC energy of a 4×4 lattice with t = 1 , U = 8 and 10 electrons ( N ↑ = N↓ = 5 ).
The insert shows the details of the Monte-Carlo fluctuations. 40 Pentium III computers , each performing
1×106 iterations, were used for the computation.
An example far beyond the capabilities of exact diagonalization is the 16×16 lattice
with 202 (101 spin-up and 101 spin-down) electrons. The results are shown in FIG.
3. A best fit exponential is fitted to the computation from which groundstate energy
can be derived: E N = −1.135 ± 0.01 . The error bar is only a statistical error. There
is also an extrapolation error, which is more difficult to estimate. The constrained
path Monte Carlo (CPMC) method[18] applied to the same problem yields E N = -
1.1193 ± 0.0003 . The CPMC has no sign error, so the statistical errors are very
small. There is however an unknown systematic error, usually small, typically up to
0.01 energy units per site.
-1.000
Energy per site
-1.050
-1.100
-1.150
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
β
FIG. 3: The SC-AFMC energy of a 16×16 Hubbard lattice with 202 electrons. 150,000 iterations. The
solid line is a best-fit exponential.
0
MP5
-0.01
-0.02
-0.03
-0.04
-0.05 MP2
-0.06
-0.07
0 0.5 1 1.5 2
β (au)
FIG. 4: The correlation energy contribution to the inversion barrier of water. It is seen that to within a
statistical error of 0.01 eV the correlation energy is 0.00 eV. Taken from ref. [34].
1.2
β = 0.7 au
1
β = 1.3 au
0.8 β = 2.5 au
0.6
0.4
0.2
0
1.8 1.9 2 2.1 2.2 2.3 2.4
R (au)
FIG. 5: Convergence of PES with increasing β . The solid line is a cubic spline through ab initio energy
differences of Ref. [36] (the minimum of energy is taken as zero). Statistical errors in correlated energy
differences of the largest β are ±0.03 eV (note that the statistical errors of absolute energy are ±0.18eV).
Taken from ref. [37].
Correlated sampling can be used also for computing energy differences be-
tween ground and excited electronic states. For this we describe a variational multi-
reference AFMC [23,38,39]. Consider a set of M determinantal wave functions
( )
Φ m ξ1 ,… , ξ Ne , m = 1,… , M , the choice of which is application dependent, as dis-
cussed in the next section. Here ξ n is the coordinates and spin of the nth electron.
Projecting with the Boltzmann operator, damping the high-energy components,
a new set of M functions is obtained:
Φ m ( β ) = e − β Ĥ Φ m . (3.6)
A linear combination of the resulting wave functions can efficiently approximate
the true groundstate wave function:
M
Ψ gs ≈ Ψ = ∑ Cm Φ m ( β ) . (3.7)
m =1
25
J (kcal/mole)
20
15
10
5
0 0.5 1 1.5 2 2.5 3 3.5
β (au)
FIG. 6: Correlated sampling SC-AFMC computation of the Triplet-Singlet splitting for the methylene
molecule. The full line is a best-fit exponential to the results for β > 0.1 au. The horizontal line is the
relevant experimental result. The extrapolated β → ∞ splitting is J = 10.5±0.8 kcal/mole. Cell length is L
= 12 Bohr; grid spacing ∆x = 0.4 Bohr and time step is ∆β = 0.1 au. Total of 8,000 Monte Carlo itera-
tions. Taken from ref. [39].
the Lagrange parameter ε multiplies the normalization constraint and the matrices
appearing in Eq. (3.10) are computed using SCAFMC with the shift determined by
5 Summary
Acknowledgements
This work was supported by the Israel Science Foundation founded by the Israel
Academy of Sciences and Humanities.
( N !) ( − ){
pα } + {qα }
{qα } ∏
−1
of the terms g qα f pα . If the functions are bi-
α
=1
orthogonal, the vast majority of such terms is zero and the determinantal overlap is
simplified to the following expression:
Ne
Φg Φ f = ∏σα . (A.2)
α =1
In general, the orbitals are not bi-orthonormal. However, there is a freedom to ex-
ploit. Consider a linear transformation that mixes the orbitals of a determinantal
wave function:
f'α = ∑ f β Vβα . (A.3)
Where, the matrix V is non-singular ( det V ≠ 0 ). Because of the general rule for
square matrices, according to which: det A × V = det A det V , the following relation
between the determinantal wave functions exists:
Φ f' = Φ f det V .
Thus, the wavefunction is only changed by a numeric factor. This freedom can now
be used to bring the orbitals into bi-orthogonality, by rotating the left and right orbi-
tals. The device of singular value decomposition (SVD)[25] proves especially use-
ful here. The overlap matrix ( g † f ) can be decomposed as follows:
( g f ) = U ΣV
† †
, (A.4)
where, U and V are unitary matrices, U U = UU = VV = V V = I and Σ is a
† † † †
diagonal matrix with non-negative real diagonal matrix elements, called the singu-
lar values, appearing in ascending order: σ 1 ≥ σ 2 ≥ " ≥ σ Ne . The decomposition is
unique (except for the obvious cases that occur when there are identical singular
values). The number of zero singular values is indicative of the way to treat deter-
minantal matrix elements.
The matrices U and V are now used for defining two new sets of rotated orbitals
of the left and right determinants:
Ne Ne
g'α = ∑ U αβ g β , f'α = ∑ Vαβ f β . (A.5)
α =1 α =1
The new orbitals are bi-orthogonal, as required by Eq. (A.1):
The determinantal wave functions of the rotated orbitals equal to the original deter-
minantal wave functions, multiplied at most by a phase factor:
Φ g' = Φ g eiθU , Φ f' = Φ f eiθV , (A.7)
iθU iθV
where θU and θV are real and e = det U and e = det V .We will denote the
following product of the singular values:
α
∆α = ∏ σ β . (A.8)
β =1
Once again we use the SVD of Eq. (A.4), yielding the unitary matrices U and V
and the singular values σ 1 ≥ σ 2 ≥ " ≥ σ Ne . Using the unitary matrices, we define
the rotated coefficient matrices:
G' = GU , F' = FV . (A.21)
Thus,
G' † F' = U †G † FV = Σ . (A.22)
The equations (A.11), (A.13), (A.15), (A.17) and (A.18) can be used with the
definitions:
tα = G (' † TF' , ) αα
(A.23)
where, ( T )ij = Tij and for the 2-body operator:
Ne
Vˆ = 1
2 ∑ vˆ ( r , r ) ⇔ ∑V
n , m =1
n m
1
2
ijss '
† †
cˆ cˆ c c js .
ijkl is ks ' ls ' (A.24)
n≠ m
we have:
vααββ = ∑ ( G' )α
ijkl , s , s ′
†
, is
( )
'†
F' js ,α G
β , ks ′
F' ls′, β Vijkl
. (A.25)
vαββα = ∑ ( )α
ijkl , s , s ′
'†
G
,is
( )
'†
F' js , β G
β , ks ′
F' ls′,α Vijkl
The quantities defined in Eqs. (A.23) and (A.25) should not be used in an actual
computer implementation because of the high cost of affecting the transformation.
Instead, the following economical expressions are given:
Next, we treat 2-body operator. With all singular values being positive:
Φ g Vˆ Φ f = e ( U V ) 12 ∑ Pis1 js1 Pks2 ls2 vijkl − Pis1ls2 Pks2 js1 vijkl .
i θ −θ
(A.29)
ijkl , s1 s2
( ) = ∑ ( S −1 ) e
N
( Uσ )ik = e − ihσ ∆β
ik ij
− iE j ∆β
( S ) jk . (A.30)
j =1
Where, the columns of the matrix S are the right eigenvectors of hσ. Consecutive
operation of several evolution operators is done by matrix multiplication,
U = Uσ L " Uσ1 . Each of the orbitals of Φ f is thus rotated by U , after which the
methods of SVD discussed above can be used.
For wavefunctions of spin-0, it is beneficial to compute determinantal matrix ele-
ments of closed-shell (CS) determinantal wavefunctions. The N e 2 orbitals are
represented by a matrix with respect to φ j ( r ) :
N
fα ( r ) = ∑ φ j ( r ) F j ,α . (A.31)
j =1
One can still define matrices as in (A.19)-(A.22). The overlap is now the square of
the determinant:
Φ CS Φ CS = ∆ Ne 2 .
2
g f (A.33)
The CS density matrix is defined similarly to Eq. (A.27). Note, that the relation be-
tween the CS and the regular density matrices is:
PijCS = Pi ↑ , j ↑ = Pi ↓, j ↓ . (A.34)
The one-body matrix element is:
−∞
−∞
a∆β 2π Γ [σ ] ∫ dy exp ( − 12 ay 2 ∆β ) ×
∞
−∞
(
exp − 1
2 (σ l0 − yζ ) V (σ
T
l0 − yζ ∆β ) )×
(
exp − σ l0 V σ l0 ∆β
1
2
T
)
( ) (
Φ g tl0 exp iy ρˆ I0 ∆β Φ f tl0 ) ( ).
We now use Eq. (B.12) with φ ( x ) = exp ( iyx∆β ) and obtain:
Ξ I0 l0 [σ ] Γ [σ ] = a∆β 2π
( )
∞
∫ −∞
exp − 12 b y 2 − 2 y σ I 0 ,l0 b ∆β × (B.17)
where we define b = a + ζ I0 . Assume first I 0 is diagonal index ( I 0 = ( i0 i0 ) ). Then,
from Eq. (B.13) φ'I0 = eiy∆β − 1 and the coarse graining function is:
Ξ I0 l0 [σ ] Γ [σ ] = K 0 + 2 ρ' I0 l0 ( K1 − K 0 ) + ρ' I20 l0 ( K 0 + 2K1 + K 2 ) , (B.18)
with
(
K k = a b exp σ I0 l0 + ik
2
2b . ) (B.19)
Index
2nd-quantized ..............................................................................................................3
AFMC 1, 2, 3, 10, 11, 12, 13, 15, 16, 17, 20, 21, 27. see auxiliary field Monte Carlo
Algorithm ...........................................................................................................10, 13
auxiliary field Monte Carlo ..................................................................................1, 22
References