1. Lecture 4: Mean-field theory and Hartree-Fock theory
In this lecture we’ll describe a general strategy for approximately solving the many-body problem introduced in the previous lecture. This approach falls broadly under the rubric of mean-field theory and is better known, in various contexts, as Hartree-Fock theory, the self-consistent field method, and the Gutzwiller ansatz.
The idea behind mean-field theory is simple: we take as a variational class one that neglects all quantum correlations between particles and apply the variational method. In applying the variational method one then sees that the influence of all the other particles on a given one is treated in an averaged way. Our treatment of the helium atom in lecture 2 could be seen as an application of mean-field theory in an embryonic form.
To explain mean-field theory in this lecture we’ll consider a sequence of simplified examples.
2. Example 1: a quantum spin system
Quantum spin systems are simplified models that arise as approximations of systems of electrons moving in the presence of a regular array of binding atoms (see, e.g., Auerbach (1994), chapter 3, for an example derivation).
The degrees of freedom of a quantum spin system are, as the name suggests, quantum spins, localised in a regular array. In this example we only consider an array of spin-$\frac{1}{2}$ degrees of freedom arranged in a regular one-dimensional lattice. A convenient basis for a single spin-$\frac{1}{2}$ degree of freedom is provided by the eigenstates of the $\sigma^z$ spin operator, written $|{\uparrow}\rangle$ and $|{\downarrow}\rangle$. The Hilbert space of a single spin-$\frac{1}{2}$ is isomorphic to $\mathbb{C}^2$.
The Hilbert space $\mathcal{H}$ for a (one-dimensional) collection of $n$ such spin-$\frac{1}{2}$ degrees of freedom is given by
$$\mathcal{H} = \bigotimes_{j=1}^{n} \mathbb{C}^2 \cong \mathbb{C}^{2^n}. \tag{1}$$
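A general hamiltonian for a quantum spin system is a sum of interaction terms between spins and single-spin terms. In this example we study the transverse Ising model, which has the form
$$H = -\sum_{j=1}^{n-1} \sigma^x_j \sigma^x_{j+1} - h \sum_{j=1}^{n} \sigma^z_j, \tag{3}$$
where
$$\sigma^x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma^y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma^z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$
are the Pauli sigma matrices. The first summation in (3) describes an interaction between neighbouring spins that encourages quantum spins to align along the spin $x$ axis. The second summation describes the effect of an external magnetic field of strength $h$ which encourages the spins to line up along the spin $z$ axis. There is an obvious competition between these two terms. Indeed, the interplay between the two terms is sufficiently complex that the model exhibits a great deal of interesting physics, including a quantum phase transition. The transverse Ising model is actually exactly solvable using a sophisticated map to a fermionic system, but we’ll pretend we don’t know this.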
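A general state $|\psi\rangle$ of $n$ quantum spins is written, in the basis of the $\sigma^z$ operators, as
$$|\psi\rangle = \sum_{j_1, \ldots, j_n \in \{\uparrow, \downarrow\}} c_{j_1 \cdots j_n}\, |j_1 j_2 \cdots j_n\rangle.$$
Note: there are $2^n$ terms in this expansion!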
Our objective is to understand the ground state of $H$. If we were to proceed by diagonalising $H$, which is in principle possible, it would take a prohibitive time as $n$ becomes large because $H$ is a $2^n \times 2^n$ matrix (even a few dozen spins is rather difficult on a laptop computer). Thus, if we want to understand such a model as $n$ becomes large we must use another method.
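To make the exponential growth concrete, here is a minimal sketch of brute-force diagonalisation for very small chains. It assumes the convention $H = -\sum_j \sigma^x_j\sigma^x_{j+1} - h\sum_j \sigma^z_j$ on an open chain and uses NumPy; the function names (`site_operator`, `transverse_ising`, `ground_energy_density`) are hypothetical helpers introduced only for this illustration.

```python
import numpy as np

# Pauli matrices needed for the model, and the 2x2 identity.
SX = np.array([[0.0, 1.0], [1.0, 0.0]])
SZ = np.array([[1.0, 0.0], [0.0, -1.0]])
I2 = np.eye(2)

def site_operator(op, j, n):
    """Embed the single-spin operator `op` at site j (0-based) of an n-spin chain."""
    out = np.array([[1.0]])
    for k in range(n):
        out = np.kron(out, op if k == j else I2)
    return out

def transverse_ising(n, h):
    """Dense 2^n x 2^n matrix of H = -sum sx_j sx_{j+1} - h sum sz_j (assumed convention)."""
    H = np.zeros((2**n, 2**n))
    for j in range(n - 1):
        H -= site_operator(SX, j, n) @ site_operator(SX, j + 1, n)
    for j in range(n):
        H -= h * site_operator(SZ, j, n)
    return H

def ground_energy_density(n, h):
    """Lowest eigenvalue of H divided by the number of spins."""
    return np.linalg.eigvalsh(transverse_ising(n, h))[0] / n

# The matrix dimension doubles with every added spin.
for n in (2, 4, 6, 8):
    print(n, 2**n, ground_energy_density(n, h=1.0))
```

Storing the dense matrix alone costs $4^n$ numbers, which is why this approach stops being practical after a modest number of spins.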
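In this example we apply the variational method to $H$ using as our variational class the set $\mathcal{V}$ of all states with the form
$$|\Psi\rangle = |\psi_1\rangle \otimes |\psi_2\rangle \otimes \cdots \otimes |\psi_n\rangle,$$
i.e., the set of all product states. The class $\mathcal{V}$ enjoys some important features: (i) it is easy to describe, meaning that it takes only $2n$ real numbers to specify a general member of the class (instead of the $2^n$ complex amplitudes of a general state); and (ii) it is easy to calculate with, meaning that the expectation value of any reasonable observable quantity in a member of $\mathcal{V}$ is easy to compute. However, the class $\mathcal{V}$ has the considerable downside that no member exhibits any spatial correlations, i.e., suppose $A_j$ is an observable of the spin at location $j$ and $B_k$ is an observable on the spin at location $k \neq j$ (for example, $A_j = \sigma^x_j$ and $B_k = \sigma^x_k$), then
$$\langle \Psi | A_j B_k | \Psi \rangle - \langle \Psi | A_j | \Psi \rangle \langle \Psi | B_k | \Psi \rangle = 0$$
for all $j \neq k$. Despite this drawback the class $\mathcal{V}$, when used in conjunction with the variational method, provides surprisingly good results.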
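The absence of correlations in product states is easy to check numerically. The sketch below, written with NumPy and hypothetical helper names, computes the connected correlation $\langle A \otimes B \rangle - \langle A \rangle \langle B \rangle$ for a random two-spin product state and, for contrast, for an entangled Bell state, which is not a member of the product class.

```python
import numpy as np

SX = np.array([[0, 1], [1, 0]], dtype=complex)
I2 = np.eye(2, dtype=complex)

def random_spin_state(rng):
    """A random normalised state of a single spin-1/2."""
    v = rng.normal(size=2) + 1j * rng.normal(size=2)
    return v / np.linalg.norm(v)

def expectation(op, state):
    """<state| op |state> for a Hermitian operator."""
    return np.vdot(state, op @ state).real

def connected_correlation(A, B, state):
    """<A (x) B> - <A (x) 1><1 (x) B> for a two-spin state."""
    joint = expectation(np.kron(A, B), state)
    return joint - expectation(np.kron(A, I2), state) * expectation(np.kron(I2, B), state)

rng = np.random.default_rng(0)
product = np.kron(random_spin_state(rng), random_spin_state(rng))
bell = np.array([1, 0, 0, 1], dtype=complex) / np.sqrt(2)

print(connected_correlation(SX, SX, product))  # vanishes up to rounding error
print(connected_correlation(SX, SX, bell))     # nonzero: the Bell state is correlated
```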
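We are going to consider the case where $n \to \infty$. In this limit the model is translation invariant. It is fairly reasonable, although not a priori correct (why not?), to assume that the state $|\Psi\rangle \in \mathcal{V}$ minimising
$$\langle \Psi | H | \Psi \rangle$$
is itself translation invariant. Assuming, regardless, that this is correct we can restrict our variational class to
$$\mathcal{V}_{\mathrm{TI}} = \{\, |\psi\rangle \otimes |\psi\rangle \otimes \cdots \otimes |\psi\rangle \,\},$$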
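so that a general member requires only $2$ real numbers to specify it. Let’s now apply the variational method to $H$ using the class $\mathcal{V}_{\mathrm{TI}}$ of states $|\Psi\rangle = |\psi\rangle^{\otimes n}$. The expectation value of the energy of the system is given by
$$\langle \Psi | H | \Psi \rangle = -(n-1)\, \langle \psi | \sigma^x | \psi \rangle^2 - n h\, \langle \psi | \sigma^z | \psi \rangle.$$
(We’ve exploited translation invariance of $|\Psi\rangle$ to drop the subscripts on the Pauli sigma matrices.) Since this expression generically diverges as $n \to \infty$ it is convenient to focus, rather, on the energy density $e = \langle \Psi | H | \Psi \rangle / n$. In the limit $n \to \infty$ the energy density becomes
$$e(\psi) = -\langle \psi | \sigma^x | \psi \rangle^2 - h\, \langle \psi | \sigma^z | \psi \rangle.$$
The variational method now amounts to minimising $e(\psi)$ over all normalised states $|\psi\rangle$ of a single spin-$\frac{1}{2}$ degree of freedom:
$$e_{\min} = \min_{\langle \psi | \psi \rangle = 1} e(\psi).$$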
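This minimisation can be done directly, but here we choose a slightly different route. At this point we exploit the convenient Bloch sphere representation for a general (mixed) state $\rho$ of a spin-$\frac{1}{2}$ degree of freedom:
$$\rho = \frac{1}{2} \left( I + r_x \sigma^x + r_y \sigma^y + r_z \sigma^z \right).$$
In order that $\rho$ is a quantum state (i.e., has both eigenvalues $\ge 0$) it is necessary and sufficient that $|\mathbf{r}| \le 1$. If $\rho$ is pure, i.e., $\rho = |\psi\rangle\langle\psi|$, then $|\mathbf{r}| = 1$ (see, e.g., Nielsen and Chuang (2000)). Thus, noting that
$$\langle \psi | \sigma^\alpha | \psi \rangle = \operatorname{tr}(\rho\, \sigma^\alpha) = r_\alpha, \qquad \alpha = x, y, z,$$
allows us to write our variational problem as the following geometric problem
$$e_{\min} = \min_{|\mathbf{r}| = 1} \left( -r_x^2 - h\, r_z \right).$$
Our variational parameters are the three numbers $r_x$, $r_y$ and $r_z$, subject to the constraint $r_x^2 + r_y^2 + r_z^2 = 1$. Since $r_y$ plays no role in this minimisation we can set it to $0$ so as to allow $r_x$ and $r_z$ to vary over the largest domain. Thus our problem becomes
$$e_{\min} = \min_{r_x^2 + r_z^2 = 1} \left( -r_x^2 - h\, r_z \right).$$
In polar coordinates $r_x = \sin\theta$ and $r_z = \cos\theta$ this becomes
$$e_{\min} = \min_{\theta \in [0, 2\pi)} \left( -\sin^2\theta - h \cos\theta \right).$$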
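Setting the derivative of $e(\theta) = -\sin^2\theta - h\cos\theta$ with respect to $\theta$ to zero gives $\sin\theta\, (h - 2\cos\theta) = 0$. In the region $|h| \le 2$ this equation admits extrema at $\theta = 0$, $\theta = \pi$, and
$$\cos\theta = \frac{h}{2}.$$
Substituting this into $e$ gives us the value
$$e_{\min} = -1 - \frac{h^2}{4}.$$
Outside this region there are only two extrema, at $\theta = 0$ and $\theta = \pi$, and the minimal energy density is
$$e_{\min} = -|h|.$$
The point $|h| = 2$ is special as the energy density behaves nonanalytically there (its second derivative with respect to $h$ jumps), which signifies the presence of a quantum phase transition.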
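As a numerical sanity check on the mean-field result, the following sketch minimises the energy density by brute force over the angle and compares it with the closed-form values. It assumes the convention $e(\theta) = -\sin^2\theta - h\cos\theta$ (unit ferromagnetic coupling along $x$, field $h$ along $z$); the function names are hypothetical.

```python
import math

def energy_density(theta, h):
    """Mean-field energy density e(theta) = -sin^2(theta) - h cos(theta)."""
    return -math.sin(theta) ** 2 - h * math.cos(theta)

def analytic_minimum(h):
    """Closed-form minimum: -1 - h^2/4 for |h| <= 2, and -|h| otherwise."""
    return -1.0 - h * h / 4.0 if abs(h) <= 2.0 else -abs(h)

def numeric_minimum(h, samples=20001):
    """Brute-force minimum over a uniform grid of angles in [0, 2*pi)."""
    return min(energy_density(2.0 * math.pi * k / samples, h) for k in range(samples))

# The two columns should agree to within the grid resolution.
for h in (0.0, 0.5, 1.0, 2.0, 3.0):
    print(h, numeric_minimum(h), analytic_minimum(h))
```

The two branches of `analytic_minimum` agree in value and first derivative at $|h| = 2$, so the nonanalyticity only shows up in the second derivative.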
Exercise 1. Calculate the corresponding magnetisation for the mean-field solution we’ve derived. What happens at $|h| = 2$?
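Assignment 1. Carry out a similar analysis as above for the antiferromagnetic Heisenberg model
$$H = \sum_{j} \left( \sigma^x_j \sigma^x_{j+1} + \sigma^y_j \sigma^y_{j+1} + \sigma^z_j \sigma^z_{j+1} \right).$$
First assume that the mean-field solution is translation invariant: what solution do you get in this case? Next try relaxing this assumption by positing that the solution is only $2$-periodic:
$$|\Psi\rangle = |\psi_1\rangle \otimes |\psi_2\rangle \otimes |\psi_1\rangle \otimes |\psi_2\rangle \otimes \cdots.$$
What value do you get for the energy density in this case?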
3. Example 2: spinless fermions on the lattice
In this section we describe the variational principle applied to a class of fermion states known as gaussian or quasi-free. In this case the variational principle is known as Hartree-Fock theory. We consider a second-quantised lattice setting, where the fermion creation and annihilation operators may be given by the finite set
$$\{\, a_j, a_j^\dagger : j = 1, 2, \ldots, n \,\}.$$
You can think of $a_j$ as annihilating a fermion from the single-particle state with wavefunction
$$\phi_j(x) = \frac{1}{\sqrt{\ell}}\, \chi_{[0,\ell)}(x - j\ell),$$
where $\ell$ is the lattice spacing and
$$\chi_{[0,\ell)}(x) = 1$$
if $x \in [0, \ell)$ and zero otherwise. Obviously this is a huge simplification: the operators which annihilate fermions from single-particle states orthogonal to these have been ignored. Showing that such a simplification preserves interesting physical properties of a system of interest is beyond the scope of this course, but a discussion can be found, e.g., in Auerbach (2003).
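The model we consider has the second-quantised form
$$H = -t \sum_{j=1}^{n} \left( a_j^\dagger a_{j+1} + a_{j+1}^\dagger a_j \right) + g \sum_{j=1}^{n} a_j^\dagger a_j\, a_{j+1}^\dagger a_{j+1},$$
with hopping amplitude $t$, interaction strength $g > 0$ and periodic boundary conditions $a_{n+1} \equiv a_1$, and describes fermions hopping on a ring with repulsive interactions between neighbouring sites.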
3.1. Majorana fermions
In this subsection we follow the paper quant-ph/0404180 closely. It is very much worthwhile reading this paper in full.
The gaussian or quasi-free fermion states are morally analogous to the product states we studied above, and may be defined via several routes (the analogy is that in both cases a system whose state is product/gaussian may be thought of as not interacting). Here we define them as all those states arising from a certain closed subset of quadratic physical operations generated by hamiltonians built from terms of the form
$$a_j^\dagger a_k + a_k^\dagger a_j,$$
which are single-particle, or tunneling, transformations, and
$$a_j^\dagger a_k^\dagger + a_k a_j,$$
which model squeezing operations, e.g., an interaction with a bulk -wave superconductor where a pair of electrons is swapped against a cooper pair. Both of these generators are quadratic in the fermion operators.
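Rather than expressing everything in terms of the non-hermitian operators $a_j$ and $a_j^\dagger$ it is convenient to introduce the hermitian Majorana fermion operators
$$c_{2j-1} = a_j + a_j^\dagger, \qquad c_{2j} = -i \left( a_j - a_j^\dagger \right),$$
analogous to the bosonic position and momentum operators. From the anticommutation relations it follows that
$$\{ c_j, c_k \} = c_j c_k + c_k c_j = 2 \delta_{jk} I$$
for all $j, k = 1, \ldots, 2n$. The set of all linear combinations of products of these elements is called the Clifford algebra $\mathcal{C}_{2n}$. An arbitrary element $X \in \mathcal{C}_{2n}$ can always be represented as
$$X = \alpha I + \sum_{p=1}^{2n}\ \sum_{1 \le j_1 < \cdots < j_p \le 2n} \alpha_{j_1 \cdots j_p}\, c_{j_1} \cdots c_{j_p}.$$
In terms of the Majorana operators a general quadratic hamiltonian takes the form
$$H = \frac{i}{4} \sum_{j,k=1}^{2n} H_{jk}\, c_j c_k,$$
where $H_{jk}$ may be an arbitrary antisymmetric real matrix. Define $V = e^{iH}$, then
$$V c_j V^\dagger = \sum_{k=1}^{2n} R_{jk}\, c_k$$
with $R = e^{H}$ (the exponential of the coefficient matrix $H_{jk}$), and $R \in SO(2n)$. Any rotation in $SO(2n)$ may be implemented with appropriate choice of $H$. (Exercise: prove these statements.)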
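These statements can be checked numerically for a small number of modes. The sketch below is one possible check, not a proof: it builds $2n$ Majorana matrices through a Jordan-Wigner representation (an assumed, standard choice; indices are 0-based in the code), verifies $\{c_j, c_k\} = 2\delta_{jk}$, and then confirms $V c_p V^\dagger = \sum_q R_{pq} c_q$ with $V = e^{iH}$ and $R = e^{H}$ for a random antisymmetric coefficient matrix. All function names are hypothetical.

```python
import numpy as np

def majorana_operators(n):
    """2n Majorana matrices on n modes via Jordan-Wigner strings (assumed representation)."""
    sx = np.array([[0, 1], [1, 0]], dtype=complex)
    sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
    sz = np.array([[1, 0], [0, -1]], dtype=complex)
    eye = np.eye(2, dtype=complex)
    ops = []
    for j in range(n):
        for last in (sx, sy):
            m = np.array([[1.0 + 0j]])
            for f in [sz] * j + [last] + [eye] * (n - j - 1):
                m = np.kron(m, f)
            ops.append(m)
    return ops

def exp_i(K):
    """exp(iK) for a Hermitian matrix K, computed from its eigendecomposition."""
    w, U = np.linalg.eigh(K)
    return (U * np.exp(1j * w)) @ U.conj().T

n = 2
c = majorana_operators(n)
dim = 2 ** n

# Canonical anticommutation relations {c_j, c_k} = 2 delta_jk.
for j in range(2 * n):
    for k in range(2 * n):
        assert np.allclose(c[j] @ c[k] + c[k] @ c[j], 2 * (j == k) * np.eye(dim))

rng = np.random.default_rng(1)
A = rng.normal(size=(2 * n, 2 * n))
Hmat = A - A.T                                   # real antisymmetric coefficients H_jk
Hop = sum(0.25j * Hmat[j, k] * c[j] @ c[k]       # operator H = (i/4) sum H_jk c_j c_k
          for j in range(2 * n) for k in range(2 * n))
V = exp_i(Hop)                                   # V = exp(iH)
R = exp_i(-1j * Hmat).real                       # R = exp(H_jk), a real rotation

# Conjugation by V rotates the Majorana operators by R.
for p in range(2 * n):
    rotated = sum(R[p, q] * c[q] for q in range(2 * n))
    assert np.allclose(V @ c[p] @ V.conj().T, rotated)
```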
3.2. Grassmann numbers
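Grassmann numbers are built using an $n$-dimensional complex vector space: consider a basis
$$\theta_1, \theta_2, \ldots, \theta_n$$
of $\mathbb{C}^n$. An example would be simply the column vectors with a $1$ in the $j$th place. At the moment all we know is how to add or subtract these elements, i.e., there is no product operation defined on the vector space. We supply a product by defining
$$\theta_j \theta_k = -\theta_k \theta_j, \qquad \theta_j^2 = 0,$$
and extend it by linearity to an arbitrary element of $\mathbb{C}^n$. No other product relations are imposed. Thus $\theta_j \theta_k$, with $j \neq k$, is not an element of $\mathbb{C}^n$ and the collection of such products provides an additional $\binom{n}{2}$ linearly independent elements. Indeed, it is possible to find $2^n$ linearly independent elements in total generated by the above relations. The set of all such elements is called the Grassmann numbers $\mathcal{G}_n$. An arbitrary element $f$ of $\mathcal{G}_n$ may be written as
$$f = \alpha + \sum_{p=1}^{n}\ \sum_{1 \le j_1 < \cdots < j_p \le n} \alpha_{j_1 \cdots j_p}\, \theta_{j_1} \cdots \theta_{j_p}.$$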
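A small pure-Python sketch may help make the product rule concrete. It stores a Grassmann number as a dictionary from sorted index tuples (monomials) to coefficients, uses the anticommutation rule $\theta_j\theta_k = -\theta_k\theta_j$ (so $\theta_j^2 = 0$) to bring products into sorted order, and counts the $2^n$ basis monomials. The data layout and function names are assumptions made for this illustration.

```python
from itertools import combinations

def multiply_monomials(a, b):
    """Product of two monomials given as strictly increasing index tuples.

    Returns (sign, tuple); the sign is 0 when a generator repeats (theta_j^2 = 0).
    """
    if set(a) & set(b):
        return 0, ()
    merged = list(a + b)
    sign = 1
    # Bubble sort, flipping the sign once per swap of adjacent generators.
    for i in range(len(merged)):
        for j in range(len(merged) - 1 - i):
            if merged[j] > merged[j + 1]:
                merged[j], merged[j + 1] = merged[j + 1], merged[j]
                sign = -sign
    return sign, tuple(merged)

def multiply(f, g):
    """Product of Grassmann numbers stored as {monomial tuple: coefficient}."""
    out = {}
    for a, x in f.items():
        for b, y in g.items():
            sign, m = multiply_monomials(a, b)
            if sign:
                out[m] = out.get(m, 0) + sign * x * y
    return {m: v for m, v in out.items() if v != 0}

def basis(n):
    """All monomials theta_{j1}...theta_{jp} with j1 < ... < jp, including the empty one."""
    return [m for p in range(n + 1) for m in combinations(range(1, n + 1), p)]

theta1, theta2 = {(1,): 1}, {(2,): 1}
print(multiply(theta1, theta2))  # {(1, 2): 1}
print(multiply(theta2, theta1))  # {(1, 2): -1}
print(multiply(theta1, theta1))  # {}
print(len(basis(4)))             # 16
```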
3.3. Gaussian states
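Suppose that $X$ is some linear combination of products of Majorana fermion operators. We can naturally associate a Grassmann number $\omega(X, \theta)$, built from $2n$ generators $\theta_1, \ldots, \theta_{2n}$, to such an operator by replacing $c_j$’s with $\theta_j$’s, i.e., by defining
$$\omega(c_{j_1} c_{j_2} \cdots c_{j_p}, \theta) = \theta_{j_1} \theta_{j_2} \cdots \theta_{j_p}, \qquad \omega(I, \theta) = 1,$$
for $j_1 < j_2 < \cdots < j_p$, and extending by linearity.
Warning: this is a map from $\mathcal{C}_{2n}$ onto $\mathcal{G}_{2n}$ only as linear spaces; the product operation is not preserved by this operation.
Suppose that $V$ is a transformation implementing the rotation $R$ (see above), and $X$ an arbitrary operator. Then
$$\omega(V X V^\dagger, \theta) = \omega(X, \eta), \qquad \eta_j = \sum_{k=1}^{2n} R_{jk}\, \theta_k.$$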
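We come now to the main definition.
Definition 1 A quantum state $\rho$ of $n$ fermionic modes is Gaussian if and only if its density operator has a Gaussian Grassmann representation, i.e.,
$$\omega(\rho, \theta) = \frac{1}{2^n} \exp\left( \frac{i}{2} \sum_{j,k=1}^{2n} M_{jk}\, \theta_j \theta_k \right)$$
for some $2n \times 2n$ real antisymmetric matrix $M$. The matrix $M$ is called the correlation matrix of $\rho$.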
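The correlation matrix for a Gaussian state $\rho$ can be found via
$$M_{jk} = \frac{i}{2} \operatorname{tr}\left( \rho\, [c_j, c_k] \right).$$
The correlation matrix completely characterises $\rho$ via Wick’s theorem because the expectation value of any higher-order monomial of fermion operators may be computed using the formula
$$\operatorname{tr}\left( \rho\, i^p\, c_{j_1} c_{j_2} \cdots c_{j_{2p}} \right) = \operatorname{Pf}\left( M|_{j_1 j_2 \cdots j_{2p}} \right),$$
with $1 \le j_1 < \cdots < j_{2p} \le 2n$, where $\operatorname{Pf}$ denotes the Pfaffian and $M|_{j_1 j_2 \cdots j_{2p}}$ denotes the $2p \times 2p$ submatrix of $M$ with the indicated rows and columns. The only case we’re really going to use is
$$-\operatorname{tr}\left( \rho\, c_j c_k c_l c_m \right) = M_{jk} M_{lm} - M_{jl} M_{km} + M_{jm} M_{kl}, \qquad j < k < l < m.$$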
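For readers who have not met the Pfaffian before, here is a minimal recursive implementation (expansion along the first row) together with a check that, for a $4 \times 4$ antisymmetric matrix, it reproduces the three-term pairing $M_{jk}M_{lm} - M_{jl}M_{km} + M_{jm}M_{kl}$. The test matrix is an arbitrary antisymmetric matrix chosen for illustration, not a physical correlation matrix, and the function names are hypothetical.

```python
def pfaffian(A):
    """Pfaffian of an antisymmetric matrix (list of lists), by first-row expansion."""
    m = len(A)
    if m == 0:
        return 1
    if m % 2:
        return 0
    total = 0
    for j in range(1, m):
        keep = [k for k in range(m) if k not in (0, j)]
        minor = [[A[r][c] for c in keep] for r in keep]
        total += (-1) ** (j + 1) * A[0][j] * pfaffian(minor)
    return total

def submatrix(M, idx):
    """Rows and columns of M selected by the index list idx."""
    return [[M[r][c] for c in idx] for r in idx]

def four_point(M, j, k, l, m):
    """Explicit three-term pairing for four distinct, increasing indices."""
    return M[j][k] * M[l][m] - M[j][l] * M[k][m] + M[j][m] * M[k][l]

# Arbitrary antisymmetric test matrix (illustration only).
M = [[0, 1, 2, 3],
     [-1, 0, 4, 5],
     [-2, -4, 0, 6],
     [-3, -5, -6, 0]]

print(pfaffian(M))                # 8
print(four_point(M, 0, 1, 2, 3))  # 8
```

The recursive expansion has factorial cost, which is fine for the four-index case used here; larger Pfaffians are normally computed by elimination-based methods instead.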
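Any real antisymmetric matrix $M$ can be converted into a block diagonal form by an appropriate choice of rotation $R \in SO(2n)$ via
$$M = R \left[ \bigoplus_{j=1}^{n} \begin{pmatrix} 0 & \lambda_j \\ -\lambda_j & 0 \end{pmatrix} \right] R^T.$$
The absolute values $|\lambda_j|$, $j = 1, \ldots, n$, are the Williamson eigenvalues of $M$. It follows that any Gaussian state $\rho$ may be transformed via a suitable $V$ into a product form
$$V \rho V^\dagger = \frac{1}{2^n} \prod_{j=1}^{n} \left( I + i \lambda_j\, c_{2j-1} c_{2j} \right).$$
In order to be a legal quantum state it is necessary that $\lambda_j \in [-1, 1]$, $j = 1, \ldots, n$, which is the same as saying that the eigenvalues of $M^T M$ must all lie in $[0, 1]$.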
3.4. Generalised Hartree-Fock theory
We finally come to the formulation of generalised Hartree-Fock theory. We follow, in part, the paper arXiv:1005.5284.
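By transforming our original fermion operators to the Majorana representation our original hamiltonian takes the form
$$H = i \sum_{j,k=1}^{2n} T_{jk}\, c_j c_k + \sum_{j,k,l,m=1}^{2n} U_{jklm}\, c_j c_k c_l c_m,$$
with $T$ and $U$ antisymmetric (under exchange of any pair of indices). Exercise: what is the exact form of $T$ and $U$ in our case?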
Let’s now apply the variational principle to $H$ using as our variational class the set of all Gaussian states, both mixed and pure. Thus we aim to solve the optimisation problem
$$E_{\min} = \min_{\rho\ \text{Gaussian}} \operatorname{tr}(\rho H).$$
This is greatly simplified by noticing that, for antisymmetric $T$ and $U$,
$$\operatorname{tr}(\rho H) = \sum_{j,k} T_{jk} M_{jk} - \sum_{j,k,l,m} U_{jklm} \left( M_{jk} M_{lm} - M_{jl} M_{km} + M_{jm} M_{kl} \right) \equiv E(M).$$
Notice what a huge simplification this is: to specify our state we need only specify the $n(2n-1)$ numbers defining the upper triangular portion of $M$, and the energy is a function purely of these numbers. Generalised Hartree-Fock theory is then to carry out the minimisation
$$E_{\mathrm{HF}} = \min_{M^T = -M,\ M^T M \le I} E(M).$$
This is far from trivial for arbitrary $T$ and $U$, and we must take recourse, in general, to numerical methods such as gradient descent. However, we have made a huge saving because this problem can at least be stored in a computer’s memory for large $n$, in contrast to the situation where non-Gaussian states are considered. Additionally, symmetries may allow us to compute the objective function efficiently.
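The size of the saving is easy to tabulate. The short sketch below compares the number of independent real entries of a $2n \times 2n$ real antisymmetric matrix with the number of complex amplitudes of a general pure state of $n$ fermionic modes; the function names are hypothetical.

```python
def gaussian_parameters(n):
    """Independent real entries of a 2n x 2n real antisymmetric matrix: n(2n - 1)."""
    return (2 * n) * (2 * n - 1) // 2

def general_state_amplitudes(n):
    """Complex amplitudes of a general pure state of n fermionic modes: 2^n."""
    return 2 ** n

# Polynomial versus exponential growth in the number of modes.
for n in (4, 16, 64, 256):
    print(n, gaussian_parameters(n), general_state_amplitudes(n))
```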