OK up to sec 4
This commit is contained in:
parent
9414799871
commit
87773c9208
@ -146,6 +146,7 @@ Laboratoire de Chimie et Physique Quantiques
|
||||
|
||||
%============================================================%
|
||||
\section{Introduction}
|
||||
\label{sec:intro}
|
||||
%============================================================%
|
||||
|
||||
\subsection{Background}
|
||||
@ -419,7 +420,7 @@ In fact, it is known that when the HF wave function is a bad approximation to th
|
||||
A convenient way to investigate the convergence properties of the MP series is to analytically continue the coupling parameter $\lambda$ into the complex variable.
|
||||
By doing so, the Hamiltonian and the energy become complex-valued functions of $\lambda$, and the energy becomes a multivalued function on $K$ Riemann sheets (where $K$ is the number of basis functions).
|
||||
As mentioned above, by searching the singularities of the function $E(\lambda)$, one can get information on the convergence properties of the MP series.
|
||||
These singularities of the energy function are exactly the exceptional points connecting the electronic states as mentioned in the introduction.
|
||||
These singularities of the energy function are exactly the EPs connecting the electronic states as mentioned in Sec.~\ref{sec:intro}.
|
||||
The direct computation of the terms of the series is quite manageable up to 4th order in perturbation, while the 5th and 6th order in perturbation can still be obtained but at a rather high cost \cite{JensenBook}.
|
||||
In order to better understand the behavior of the MP series and how it is connected to the singularity structure, we have to access high-order terms.
|
||||
For small systems, one can access the whole terms of the series using full configuration interaction (FCI).
|
||||
@ -428,12 +429,12 @@ If the Hamiltonian $H(\lambda)$ is diagonalized in the FCI space, one gets the e
|
||||
\subsection{Alternative partitioning}\label{sec:AlterPart}
|
||||
|
||||
Obviously, although practically convenient for electronic structure calculations, the MP partitioning is not the only possibility.
|
||||
Here, we will consider three alternative partitioning schemes
|
||||
Here, we will consider three alternative partitioning schemes:
|
||||
\begin{itemize}
|
||||
\item The Epstein-Nesbet (EN) partitioning which consists in taking the diagonal elements of $\hH$ as the zeroth-order Hamiltonian.
|
||||
Hence, the off-diagonal elements of $\hH$ are the perturbation operator. .
|
||||
Hence, the off-diagonal elements of $\hH$ are the perturbation operator.
|
||||
\item The weak correlation (WC) partitioning in which the one-electron part is consider as the unperturbed Hamiltonian $\hH^{(0)}$ and the two-electron part is the perturbation operator $\hV$.
|
||||
\item The strong coupling (SC) partitioning where the two operators are inverted as compared to the WC partitioning.
|
||||
\item The strong coupling (SC) partitioning where the two operators are inverted compared to the WC partitioning.
|
||||
\end{itemize}
|
||||
%An alternative partitioning scheme, maybe even more natural than the MP one,
|
||||
%This partitioning leads to Epstein-Nesbet (EN) perturbation theory.
|
||||
@ -463,22 +464,22 @@ Here, we will consider three alternative partitioning schemes
|
||||
When one relies on MP perturbation theory (and more generally on any perturbative partitioning), it would be reasonable to ask for a systematic improvement of the energy with respect to the perturbative order, i.e., one would expect that the more terms of the perturbative series one can compute, the closer the result from the exact energy.
|
||||
%In other words, each time a higher-order term is computed, one would like to obtained an overall result closer to the exact energy.
|
||||
In other words, one would like a monotonic convergence of the MP series. Assuming this, the only limiting process to get the exact correlation energy (in a finite basis set) would be our ability to compute the terms of this perturbation series.
|
||||
Unfortunately this is not as easy as one might think because i) the terms of the perturbative series become rapidly computationally cumbersome, and ii) erratic behavior of the perturbative coefficients are not uncommon. For example, in the late 80's, Gill and Radom reported deceptive and slow convergences in stretch systems \cite{Gill_1986, Gill_1988} (see also Refs.~\cite{Handy_1985, Lepetit_1988}).
|
||||
In Fig.~\ref{fig:RUMP_Gill}, which has been extracted from Ref.~\cite{Gill_1986}, one can see that the RMP series is convergent, yet oscillating which is far from being convenient if one is only able to compute the first few terms of the expansion (for example here RMP5 is worse than RMP4).
|
||||
Unfortunately this is not as easy as one might think because i) the terms of the perturbative series become rapidly computationally cumbersome, and ii) erratic behavior of the perturbative coefficients are not uncommon. For example, in the late 80's, Gill and Radom reported deceptive and slow convergences in stretched systems \cite{Gill_1986, Gill_1988} (see also Refs.~\cite{Handy_1985, Lepetit_1988}).
|
||||
In Fig.~\ref{fig:RUMP_Gill}, which has been extracted from Ref.~\cite{Gill_1986}, one can see that the RMP series is convergent, yet oscillatory which is far from being convenient if one is only able to compute the first few terms of the expansion (for example here RMP5 is worse than RMP4).
|
||||
On the other hand, the UMP series is monotonically convergent (except for the first few orders) but very slowly.
|
||||
Thus, one cannot practically use it for systems where only the first terms can be computed.
|
||||
|
||||
When a bond is stretched, in most cases the exact wave function becomes more and more multi-reference. Yet the HF wave function is restricted to be single Slater determinant.
|
||||
Thus, it is inappropriate to model (even qualitatively) stretched system. Nevertheless, the HF wave function can undergo a symmetry breaking to lower its energy by sacrificing one of the symmetry of the exact wave function during the process (see for example the case of \ce{H_2} in Ref.~\cite{SzaboBook}).
|
||||
When a bond is stretched, in most cases the exact wave function becomes more and more multi-reference. Yet the HF wave function is restricted to be a single Slater determinant.
|
||||
Thus, it is inappropriate to model (even qualitatively) stretched systems. Nevertheless, the HF wave function can undergo a symmetry breaking to lower its energy by sacrificing one of the symmetry of the exact wave function during the process (see for example the case of \ce{H_2} in Ref.~\cite{SzaboBook}).
|
||||
One could then potentially claim that the RMP series exhibits deceptive convergence properties as the RHF Slater determinant is a poor approximation of the exact wave function for stretched system. However, even in the unrestricted formalism which clearly represents a better description of a stretched system, the UMP series does not have the smooth and rapidly converging behavior that one would wish for.
|
||||
|
||||
\begin{table}[h!]
|
||||
\centering
|
||||
\caption{Percentage of electron correlation energy recovered and $\expval{S^2}$ for the \ce{H2} molecule as a function of bond length (r,\si{\angstrom}) in the STO-3G basis set (taken from \cite{Gill_1988}).}
|
||||
\caption{Percentage of electron correlation energy recovered and $\expval*{S^2}$ for the \ce{H2} molecule as a function of bond length (r,\si{\angstrom}) in the STO-3G basis set (taken from Ref.~\cite{Gill_1988}).}
|
||||
\begin{tabular}{ccccccc}
|
||||
\hline
|
||||
\hline
|
||||
$r$ & UHF & UMP2 & UMP3 & UMP4 & $\expval{S^2}$ \\
|
||||
$r$ & UHF & UMP2 & UMP3 & UMP4 & $\expval*{S^2}$ \\
|
||||
\hline
|
||||
0.75 & 0.0\% & 63.8\% & 87.4\% & 95.9\% & 0.00\\
|
||||
1.35 & 0.0\% & 15.2\% & 26.1\% & 34.9\% & 0.49\\
|
||||
@ -490,47 +491,50 @@ One could then potentially claim that the RMP series exhibits deceptive converge
|
||||
\label{tab:SpinContamination}
|
||||
\end{table}
|
||||
|
||||
In the unrestricted framework the singlet ground state wave function is allowed to mix with triplet wave function, leading to the so-called spin contamination issue. Gill \textit{et al.}~highlighted the link between slow convergence of the UMP series and spin contamination, as shown in Table \ref{tab:SpinContamination} for the example of \ce{H2} in a minimal basis \cite{Gill_1988}.
|
||||
Handy and coworkers reported the same behavior of the series (oscillating and \titou{slowly monotonically convergent}) in stretched \ce{H2O} and \ce{NH2} systems \cite{Handy_1985}. Lepetit et al.~analyzed the difference between the MP and EN partitioning for the UHF reference \cite{Lepetit_1988}. They concluded that the slow convergence is due to the coupling of the singly- and doubly-excited configurations.
|
||||
In the unrestricted framework the singlet ground state wave function is allowed to mix with triplet wave functions, leading to the so-called spin contamination issue. Gill \textit{et al.}~highlighted the link between slow convergence of the UMP series and spin contamination, as shown in Table \ref{tab:SpinContamination} for \ce{H2} in a minimal basis \cite{Gill_1988}.
|
||||
Handy and coworkers reported the same behavior of the series (oscillatory and \titou{slowly monotonically convergent}) in stretched \ce{H2O} and \ce{NH2} systems \cite{Handy_1985}. Lepetit \textit{et al.}~analyzed the difference between the MP and EN partitioning for the UHF reference \cite{Lepetit_1988}. They concluded that the slow convergence is due to the coupling of the singly- and doubly-excited configurations.
|
||||
%Moreover, the MP and EN numerators in Eqs.~\eqref{eq:EMP2} and \eqref{eq:EEN2} are the same and they vanish when the bond length $r$ goes to infinity. Yet the MP denominators tends towards a constant when $r \to \infty$ so the terms vanish, whereas the EN denominators tends to zero which improves the convergence but can also make the series diverge.
|
||||
Cremer and He analyzed 29 atomic and molecular systems at the FCI level \cite{Cremer_1996} and grouped them in two classes: i) the class A systems where one observes a monotonic convergence to the FCI energy, and ii) the class B for which convergence is erratic after initial oscillations. The sample of systems contains stretched molecules as well as molecules at their equilibrium geometry for various basis sets. They highlighted that \cite{Cremer_1996}
|
||||
Cremer and He analyzed 29 atomic and molecular systems at the FCI level \cite{Cremer_1996} and grouped them in two classes: i) the class A systems where one observes a monotonic convergence to the FCI energy, and ii) the class B for which convergence is erratic after initial oscillations. Their system set contains stretched molecules as well as molecules at their equilibrium geometry for various basis sets. They highlighted that \cite{Cremer_1996}
|
||||
\begin{quote}
|
||||
\textit{``Class A systems are characterized by electronic structures with well-separated electron pairs while class B systems are characterized by electronic structures with electron clustering in one or more regions.''}
|
||||
\end{quote}
|
||||
\titou{They proved that using different extrapolation formulas of the first terms of the MP series for class A and class B systems, this improves the precision of those formulas.} The mean deviation from FCI correlation energies is $0.3$ millihartree whereas with the formula that do not distinguish the system it is 12 millihartree. Even if there were still shaded areas and that this classification was incomplete, this work showed that understanding the origin of the different modes of convergence would lead to a more rationalized use of the MP perturbation theory and to more accurate correlation energies.
|
||||
\titou{They proved that using specific extrapolation formulas involving the first terms of the MP series for class A and class B systems improves the precision of those formulas.} The mean deviation from FCI correlation energies is $0.3$ millihartree whereas with the formula that do not distinguish the system it is 12 millihartree. Even if there were still shaded areas and that this classification was incomplete, this work showed that understanding the origin of the different convergence modes would lead to a more rationalized use of MP perturbation theory and to more accurate correlation energies.
|
||||
|
||||
\subsection{Cases of divergence}
|
||||
|
||||
In the late 90's, Olsen \textit{et al.}~have discovered an even more preoccupying behavior of the MP series \cite{}. They showed that the series could be divergent even in systems that they considered as well understood like \ce{Ne} and \ce{HF} (see Fig.~\ref{fig:NeHFDiv}) \cite{Olsen_1996, Christiansen_1996}. Cremer and He had already studied these two systems and classified them as ``class B'' systems. However, the analysis of Olsen and coworkers was performed in larger basis sets containing diffuse functions. In these basis sets, they found that the series become divergent at (very) high order.
|
||||
In the late 90's, Olsen \textit{et al.}~discovered an even more preoccupying behavior of the MP series \cite{Olsen_1996}. They showed that the series could be divergent even in systems that they considered as well understood like \ce{Ne} and \ce{HF} (see Fig.~\ref{fig:NeHFDiv}) \cite{Olsen_1996, Christiansen_1996}. Cremer and He had already studied these two systems and classified them as ``class B'' systems. However, the analysis of Olsen and coworkers was performed in larger basis sets containing diffuse functions. In these basis sets, they found that the series become divergent at (very) high order.
|
||||
|
||||
\begin{figure}[h!]
|
||||
\centering
|
||||
\includegraphics[width=0.45\textwidth]{Nedivergence.png}
|
||||
\includegraphics[width=0.45\textwidth]{HFdivergence.png}
|
||||
\caption{\centering Correlation contributions for \ce{Ne} and \ce{HF} in the cc-pVTZ-(f/d) $\circ$ and aug-cc-pVDZ $\bullet$ basis sets (taken from \cite{Olsen_1996}).}
|
||||
\caption{\centering Correlation contributions for \ce{Ne} and \ce{HF} in the cc-pVTZ-(f/d) $\circ$ and aug-cc-pVDZ $\bullet$ basis sets (taken from Ref.~\cite{Olsen_1996}).}
|
||||
\label{fig:NeHFDiv}
|
||||
\end{figure}
|
||||
|
||||
The discovery of this divergent behavior is worrying as in order to get meaningful and accurate energies, calculations must be performed in large basis sets (as close as possible from the complete basis set limit). Including diffuse functions is particularly important for the case of anions and/or Rydberg excited states where the wave function is much more diffuse than the ground-state one. As a consequence, they investigated further the causes of these divergences as well as the reasons of the different types of convergence. To do so, they analyzed the relation between the dominant singularity (i.e., the closest singularity to the origin) and the convergence behavior of the series \cite{Olsen_2000}. Their analysis is based on Darboux's theorem:
|
||||
The discovery of this divergent behavior is worrying as in order to get meaningful and accurate energies, calculations must be performed in large basis sets (as close as possible from the complete basis set limit). Including diffuse functions is particularly important in the case of anions and/or Rydberg excited states where the wave function is much more diffuse than the ground-state one. As a consequence, they investigated further the causes of these divergences as well as the reasons of the different types of convergence. To do so, they analyzed the relation between the dominant singularity (i.e., the closest singularity to the origin) and the convergence behavior of the series \cite{Olsen_2000}. Their analysis is based on Darboux's theorem:
|
||||
\begin{quote}
|
||||
\textit{``In the limit of large order, the series coefficients become equivalent to the Taylor series coefficients of the singularity closest to the origin. Following the result of this theorem, the convergence patterns of the MP series can be explained by looking at the dominant singularity.''}
|
||||
\end{quote}
|
||||
|
||||
A singularity in the unit circle is designated as an intruder state, more precisely as a front-door (respectively back-door) intruder state if the real part of the singularity is positive (respectively negative). Theie method consists in performing a scan of the real axis to detect the avoided crossing responsible for the pair of dominant singularity. Then by modeling this avoided crossing via a two-state Hamiltonian one can get an approximation of the dominant conjugate pair of singularity by finding the EPs of the $2\times2$ Hamiltonian. The diagonal matrix is the unperturbed Hamiltonian and the other matrix is the perturbative part of the Hamiltonian.
|
||||
|
||||
A singularity in the unit circle is designated as an intruder state, more precisely as a front-door (respectively back-door) intruder state if the real part of the singularity is positive (respectively negative). Their method consists in performing a scan of the real axis to detect the avoided crossing responsible for the pair of dominant singularities. Then by modeling this avoided crossing via a two-state Hamiltonian one can get an approximation of the dominant conjugate pair of singularities by finding the EPs of the following $2\times2$ Hamiltonian
|
||||
\begin{equation}
|
||||
\underbrace{\mqty(\alpha & \delta \\ \delta & \beta)}_{\hH} = \underbrace{\mqty(\alpha + \alpha_s & 0 \\ 0 & \beta + \beta_s )}_{\hH^{(0)}} + \underbrace{\mqty(- \alpha_s & \delta \\ \delta & - \beta_s)}_{\hV}
|
||||
\underbrace{\mqty(\alpha & \delta \\ \delta & \beta)}_{\hH} = \underbrace{\mqty(\alpha + \alpha_s & 0 \\ 0 & \beta + \beta_s )}_{\hH^{(0)}} + \underbrace{\mqty(- \alpha_s & \delta \\ \delta & - \beta_s)}_{\hV},
|
||||
\end{equation}
|
||||
|
||||
They first studied molecules with low-lying doubly excited states of the same spatial and spin symmetry because in those systems the HF wave function is a bad approximation. The exact wave function has a non-negligible contribution from the doubly excited states, so those low-lying excited states were good candidates for being intruder states. For \ce{CH_2} in a large basis set, the series is convergent up to the 50th order. They showed that the dominant singularity lies outside the unit circle but close to it causing the slow convergence.
|
||||
where the diagonal matrix is the unperturbed Hamiltonian and the other matrix is the perturbation.
|
||||
|
||||
Then they demonstrated that the divergence for the \ce{Ne} is due to a back-door intruder state. When the basis set is augmented with diffuse functions, the ground state undergo sharp avoided crossings with highly diffuse excited states leading to a back-door intruder state. They used their two-state model on this avoided crossings and the model was actually predicting the divergence of the series. They concluded that the divergence of the series was due to the interaction with a highly diffuse excited state.
|
||||
They first studied molecules with low-lying doubly-excited states of the same spatial and spin symmetry.
|
||||
% \titou{because in those systems the HF wave function is a bad approximation.}
|
||||
The exact wave function has a non-negligible contribution from the doubly-excited states, so these low-lying excited states were good candidates for being intruder states. For \ce{CH_2} in a large basis set, the series is convergent up to the 50th order. They showed that the dominant singularity lies outside the unit circle but close to it causing the slow convergence.
|
||||
|
||||
Moreover they proved that the extrapolation formulas of Cremer and He \cite{Cremer_1996} cannot be used for all systems. Even more, that those formula were not mathematically motivated when looking at the singularity causing the divergence. For example the hydrogen fluoride molecule contains both back-door intruder states and low-lying doubly excited states which results in alternated terms up to order ten and then the series is monotonically convergent. This is due to the fact that two pairs of singularity are approximately at the same distance from the origin.
|
||||
Then they demonstrated that the divergence for \ce{Ne} is due to a back-door intruder state. When the basis set is augmented with diffuse functions, the ground state undergo sharp avoided crossings with highly diffuse excited states leading to a back-door intruder state. They used their two-state model on this avoided crossings and the model was actually predicting the divergence of the series.
|
||||
%They concluded that the divergence of the series was due to the interaction with a highly diffuse excited state.
|
||||
|
||||
Moreover they proved that the extrapolation formulas of Cremer and He \cite{Cremer_1996} cannot be used for all systems, and that these formulas were not mathematically motivated when looking at the singularity causing the divergence. For example, the hydrogen fluoride molecule contains both back-door intruder states and low-lying doubly-excited states which results in alternated terms up to 10th order. For higher orders, the series is monotonically convergent. This surprising behavior is due to the fact that two pairs of singularities are approximately at the same distance from the origin.
|
||||
|
||||
\subsection{The singularity structure}
|
||||
|
||||
In the 2000's, Sergeev and Goodson \cite{Sergeev_2005, Sergeev_2006} analyzed this problem from a more mathematical point of view by looking at the whole singularity structure where Olsen and collaborators were trying to find the dominant singularity causing the divergence. They regrouped singularities in two classes: i) $\alpha$ singularities which have ``large'' imaginary parts, and ii) $\beta$ singularities which have very small imaginary parts. Singularities of type $\alpha$ are related to large avoided crossing between the ground and low-lying excited states, whereas $\beta$ singularities come from a sharp avoided crossing between the ground state and a highly diffuse state. They succeeded to explain the divergence of the series caused by $\beta$ singularities using a previous work of Stillinger \cite{Stillinger_2000}.
|
||||
In the 2000's, Sergeev and Goodson \cite{Sergeev_2005, Sergeev_2006} analyzed this problem from a more mathematical point of view by looking at the whole singularity structure where Olsen and collaborators were trying to find the dominant singularity causing the divergence. They regrouped singularities in two classes: i) $\alpha$ singularities which have ``large'' imaginary parts, and ii) $\beta$ singularities which have very small imaginary parts. Singularities of type $\alpha$ are related to large avoided crossing between the ground and low-lying excited states, whereas $\beta$ singularities come from a sharp avoided crossing between the ground state and a highly diffuse state. They succeeded to explain the divergence of the series caused by $\beta$ singularities following previous work of Stillinger \cite{Stillinger_2000}.
|
||||
|
||||
To understand the convergence properties of the perturbation series at $\lambda=1$, one must look at the whole complex plane, in particular, for negative (i.e., real) values of $\lambda$. If $\lambda$ is negative, the Coulomb interaction becomes attractive but the mean field (which has been computed at $\lambda = 1$) remains repulsive as it is proportional to $(1-\lambda)$:
|
||||
|
||||
@ -544,20 +548,19 @@ To understand the convergence properties of the perturbation series at $\lambda=
|
||||
].
|
||||
\end{equation}
|
||||
|
||||
|
||||
The major difference between those two terms is that the repulsive mean field is localized around the nuclei whereas the interelectronic interaction persist away from the nuclei. If $\lambda$ becomes more and more negative the mean field becomes more and more repulsive so there exists a critical (negative) value of $\lambda$, $\lambda_\text{c}$, for which the Coulombic field created by the nuclei cannot bind the electrons anymore because of the $\lambda$-independent nature of the the electron-nucleus attraction. For $\lambda = \lambda_c$, the electrons dissociate from the nuclei and form a bound cluster which is infinitely separated from the nuclei. According to Baker \cite{Baker_1971}, this value is a critical point of the system and, by analogy with thermodynamics, the energy $E(\lambda)$ exhibits a singularity at $\lambda_\text{c}$. At this point the system undergo a phase transition \titou{and a symmetry breaking}. \titou{Beyond $\lambda_c$ there is a continuum of eigenstates with electrons dissociated from the nucleus.}
|
||||
|
||||
This reasoning is done on the exact Hamiltonian and energy, i.e., the Hamiltonian in the complete Hilbert space, this is the exact energy which exhibits this singularity on the negative real axis. However, in a finite basis set which does not span the complete Hilbert space, one can prove that, for a Hermitian Hamiltonian, the singularities of $E(\lambda)$ occurs in complex conjugate pairs with non-zero imaginary parts. Sergeev and Goodson proved \cite{Sergeev_2005}, as predicted by Stillinger \cite{Stillinger_2000}, that in a finite basis set the critical point on the real axis is modeled by a cluster of sharp avoided crossings with diffuse functions, equivalently by a cluster of $\beta$ singularities in the negative half plane. They explain that Olsen et al., because they used a $2\times2$ model, only observed the first singularity of this cluster of singularities causing the divergence.
|
||||
This reasoning is done on the exact Hamiltonian and energy, i.e., the Hamiltonian in the complete Hilbert space, this is the exact energy which exhibits this singularity on the negative real axis. However, in a finite basis set which does not span the complete Hilbert space, one can prove that, for a Hermitian Hamiltonian, the singularities of $E(\lambda)$ occurs in complex conjugate pairs with non-zero imaginary parts. Sergeev and Goodson proved \cite{Sergeev_2005}, as predicted by Stillinger \cite{Stillinger_2000}, that in a finite basis set the critical point on the real axis is modeled by a cluster of sharp avoided crossings with diffuse functions, equivalently by a cluster of $\beta$ singularities in the negative half plane. This explains that Olsen \textit{et al.}, because they used a simple $2\times2$ model, only observed the first singularity of this cluster of singularities causing the divergence.
|
||||
|
||||
Finally, it was shown that $\beta$ singularities are very sensitive to changes of the basis set but not to the stretching of the system. On the contrary $\alpha$ singularities are relatively insensitive to the basis sets but very sensitive to bond stretching. According to Goodson, \cite{Goodson_2004} the singularity structure from molecules stretched from the equilibrium geometry is difficult, this is consistent with the observation of Olsen and co-workers \cite{Olsen_2000} on the \ce{HF} molecule at equilibrium geometry and stretched geometry. To our knowledge the effect of bond stretching on singularities, its link with spin contamination and symmetry breaking of the wave function has not been as well understood as the ionization effect and its link with diffuse function. In this work we shall try to improve our understanding of the effect of symmetry breaking on the singularities of $E(\lambda)$ and we hope that it will lead to a deeper understanding of perturbation theory.
|
||||
Finally, it was shown that $\beta$ singularities are very sensitive to changes in the basis set but not to the stretching of the system. On the contrary $\alpha$ singularities are relatively insensitive to the basis sets but very sensitive to bond stretching. \titou{According to Goodson \cite{Goodson_2004}, the singularity structure from molecules stretched from the equilibrium geometry is difficult, this is consistent with the observation of Olsen and coworkers \cite{Olsen_2000} on the \ce{HF} molecule at equilibrium geometry and stretched geometry.} To the best our knowledge, the effect of bond stretching on singularities, its link with spin contamination and symmetry breaking of the wave function has not been as well understood as the ionization phenomenon and its link with diffuse functions. In this work, we shall try to improve our understanding of the effect of symmetry breaking on the singularities of $E(\lambda)$ and we hope that it will lead to a deeper understanding of perturbation theory.
|
||||
|
||||
\subsection{The physics of quantum phase transition}
|
||||
|
||||
In the previous section, we saw that a careful analysis of the structure of the Hamiltonian allows us to predict the existence of a critical point. In a finite basis set this critical point is model by a cluster of $\beta$ singularities. It is now well known that this phenomenon is a special case of a more general phenomenon. Indeed, theoretical physicists proved that EPs close to the real axis are connected to \textit{quantum phase transitions} (QPTs) \cite{Heiss_1988, Heiss_2002, Cejnar_2005, Cejnar_2007, Cejnar_2009, Borisov_2015, Sindelka_2017}. In quantum mechanics, the Hamiltonian is almost always dependent of, at least, one parameter. In some cases the variation of a parameter can lead to abrupt changes at a critical point. These QPTs exist both for ground and excited states as shown by Cejnar and coworkers \cite{Cejnar_2009, Sachdev_2011, Cejnar_2015, Cejnar_2016, Caprio_2008, Macek_2019}. A ground-state QPT is characterized by the derivatives of the ground-state energy with respect to a non-thermal control parameter \cite{Cejnar_2009, Sachdev_2011}. The transition is called discontinuous and of first order if the first derivative is discontinuous at the critical parameter value. Otherwise, it is called continuous and of $n$th order if the $n$th derivative is discontinuous. A QPT can also be identify by the discontinuity of an appropriate order parameter (or one of its derivatives).
|
||||
In the previous section, we saw that a careful analysis of the structure of the Hamiltonian allows us to predict the existence of a critical point. In a finite basis set this critical point is model by a cluster of $\beta$ singularities. It is now well known that this phenomenon is a special case of a more general phenomenon. Indeed, theoretical physicists proved that EPs close to the real axis are connected to \textit{quantum phase transitions} (QPTs) \cite{Heiss_1988, Heiss_2002, Cejnar_2005, Cejnar_2007, Cejnar_2009, Borisov_2015, Sindelka_2017}. In quantum mechanics, the Hamiltonian is almost always dependent of, at least, one parameter. In some cases the variation of a parameter can lead to abrupt changes at a critical point. These QPTs exist both for ground and excited states as shown by Cejnar and coworkers \cite{Cejnar_2009, Sachdev_2011, Cejnar_2015, Cejnar_2016, Caprio_2008, Macek_2019}. A ground-state QPT is characterized by the derivatives of the ground-state energy with respect to a non-thermal control parameter \cite{Cejnar_2009, Sachdev_2011}. The transition is called discontinuous and of first order if the first derivative is discontinuous at the critical parameter value. Otherwise, it is called continuous and of $m$th order if the $m$th derivative is discontinuous. A QPT can also be identify by the discontinuity of an appropriate order parameter (or one of its derivatives).
|
||||
|
||||
The presence of an EP close to the real axis is characteristic of a sharp avoided crossing. Yet, at such an avoided crossing, eigenstates change abruptly. Although it is now well understood that EPs are closely related to QPTs, the link between the type of QPT (ground state or excited state, first or higher order) and EPs still need to be clarified. One of the major obstacles that one faces in order to achieve this resides in the ability to compute the distribution of EPs. The numerical assignment of an EP to two energies on the real axis is very difficult in large dimensions. Hence, the design of specific methods are required to get information on the location of EPs. Following this idea, Cejnar \textit{et al.}~developed a method based on a Coulomb analogy giving access to the density of EP close to the real axis \cite{Cejnar_2005, Cejnar_2007}. More recently Stransky and coworkers proved that the distribution of EPs is characteristic on the order of the QPT \cite{Stransky_2018}. In particular, they showed that when the dimensionality of the system increases, first- and second-order QTP behave differently, and converge towards the real axis at different rates (exponentially and algebraically for the first and second order, respectively).
|
||||
The presence of an EP close to the real axis is characteristic of a sharp avoided crossing. Yet, at such an avoided crossing, eigenstates change abruptly. Although it is now well understood that EPs are closely related to QPTs, the link between the type of QPT (ground state or excited state, first or higher order) and EPs still need to be clarified. One of the major obstacles that one faces in order to achieve this resides in the ability to compute the distribution of EPs. The numerical assignment of an EP to two energies on the real axis is very difficult in large dimensions. Hence, the design of specific methods are required to get information on the location of EPs. Following this idea, Cejnar \textit{et al.}~developed a method based on a Coulomb analogy giving access to the density of EP close to the real axis \cite{Cejnar_2005, Cejnar_2007}. More recently Stransky and coworkers proved that the distribution of EPs is characteristic on the order of the QPT \cite{Stransky_2018}. \titou{In particular, they showed that when the dimensionality of the system increases, first- and second-order QTP behave differently, and the position of these singularities converge towards the real axis at different rates (exponentially and algebraically for the first and second order, respectively).}
|
||||
|
||||
It seems like our understanding of the physics of spatial and/or spin symmetry breaking in the Hartree-Fock theory can be enlightened by QPT theory. Indeed, the second derivative of the energy is discontinuous at the Coulson-Fischer point which means that the system undergo a second-order QPT. Moreover, the $\beta$ singularities introduced by Sergeev and coworkers to describe the EPs modeling the formation of a bound cluster of electrons are actually a more general class of singularities. The EPs close to the real axis (the so-called $\beta$ singularities) are connected to QPT because they result from a sharp avoided crossings at which the eigenstates change quickly. However, the $\alpha$ singularities arise from large avoided crossings. Thus, they can not be connected to QPT. The avoided crossings generating $\alpha$ singularities generally involve the ground state and low-lying doubly-excited states. Those excited states have a non-negligible contribution to the exact FCI solution because they have the same spatial and spin symmetry as the ground state. We believe that $\alpha$ singularities are connected to states with non-negligible contribution in the configuration interaction expansion thus to the dynamical part of the correlation energy, while $\beta$ singularities are linked to symmetry breaking and phase transitions of the wave function, i.e., to the multi-reference nature of the wave function thus to the static part of the correlation energy.
|
||||
It seems like our understanding of the physics of spatial and/or spin symmetry breaking in HF theory can be enlightened by QPT theory. Indeed, the second derivative of the energy is discontinuous at the \titou{Coulson-Fischer point (not defined)} which means that the system undergo a second-order QPT. Moreover, the $\beta$ singularities introduced by Sergeev and coworkers to describe the EPs modeling the formation of a bound cluster of electrons are actually a more general class of singularities. The EPs close to the real axis (the so-called $\beta$ singularities) are connected to QPT because they result from a sharp avoided crossings at which the eigenstates change quickly. However, the $\alpha$ singularities arise from large avoided crossings. Thus, they cannot be connected to QPT. The avoided crossings generating $\alpha$ singularities generally involve the ground state and low-lying doubly-excited states. Those excited states have a non-negligible contribution to the exact FCI solution because they have the same spatial and spin symmetry as the ground state. We believe that $\alpha$ singularities are connected to states with non-negligible contribution in the CI expansion thus to the dynamical part of the correlation energy, while $\beta$ singularities are linked to symmetry breaking and phase transitions of the wave function, i.e., to the multi-reference nature of the wave function thus to the static part of the correlation energy.
|
||||
|
||||
|
||||
%============================================================%
|
||||
@ -570,7 +573,7 @@ Simple systems that are analytically solvable (or at least quasi-exactly solvabl
|
||||
\hH = -\frac{\grad_1^2 + \grad_2^2}{2} + \frac{1}{r_{12}}
|
||||
\end{equation}
|
||||
|
||||
The laplacian operators are the kinetic operators for each electrons and $r_{12}^{-1} = \abs{\vb{r}_1 - \vb{r}_2}$ is the Coulomb operator. The radius R of the sphere dictates the correlation regime, i.e., weak correlation regime at small $R$ where the kinetic energy dominates, or strong correlation regime where the electron repulsion term drives the physics. We will use this model to try to rationalize the effects of the parameters that may influence the physics of EPs:
|
||||
The Laplace operators are the kinetic operators for each electrons and $r_{12}^{-1} = \abs{\vb{r}_1 - \vb{r}_2}$ is the Coulomb operator. The radius R of the sphere dictates the correlation regime, i.e., weak correlation regime at small $R$ where the kinetic energy dominates, or strong correlation regime where the electron repulsion term drives the physics. We will use this model to try to rationalize the effects of the parameters that may influence the physics of EPs:
|
||||
\begin{itemize}
|
||||
\item Partitioning of the Hamiltonian and the actual zeroth-order reference: weak correlation reference [restricted Hartree-Fock (RHF) or unrestricted Hartree-Fock (UHF) references, MP or EN partitioning], or strongly correlated reference.
|
||||
\item Basis set: minimal basis or infinite (i.e., complete) basis made of localized or delocalized basis functions
|
||||
@ -798,7 +801,7 @@ In the RHF case there are only $\alpha$ singularities and large avoided crossing
|
||||
\label{fig:UHFEP}
|
||||
\end{figure}
|
||||
|
||||
As shown before, some matrix elements of the Hamiltonian become complex in the holomorphic domain. Therefore the Hamiltonian becomes non-Hermitian for those value of $R$. In \cite{Burton_2019a} Burton et al. proved that for the \ce{H_2} molecule the unrestricted Hamiltonian is not \pt -symmetric in the holomorphic domain. An analog reasoning can be done with the spherium model to prove the same result. The \pt -symmetry (invariance with respect to combined space reflection $\mathcal{P}$ and time reversal $\mathcal{T}$) is a property which ensures that a non-Hermitian Hamiltonian has a real energy spectrum. Thus \pt -symmetric Hamiltonian can be seen as an intermediate between Hermitian and non-Hermitian.
|
||||
As shown before, some matrix elements of the Hamiltonian become complex in the holomorphic domain. Therefore the Hamiltonian becomes non-Hermitian for those value of $R$. In \cite{Burton_2019a} Burton \textit{et al.}~proved that for the \ce{H_2} molecule the unrestricted Hamiltonian is not \pt -symmetric in the holomorphic domain. An analog reasoning can be done with the spherium model to prove the same result. The \pt -symmetry (invariance with respect to combined space reflection $\mathcal{P}$ and time reversal $\mathcal{T}$) is a property which ensures that a non-Hermitian Hamiltonian has a real energy spectrum. Thus \pt -symmetric Hamiltonian can be seen as an intermediate between Hermitian and non-Hermitian.
|
||||
|
||||
Figure \ref{fig:UHFPT} shows that for the spherium model a part of the energy spectrum becomes complex when R is in the holomorphic domain. The parameter domain of value where the energy becomes complex is called the broken \pt symmetry region. This is consistent with the fact that in the holomorphic domain the Hamiltonian break the \pt -symmetry.
|
||||
|
||||
@ -814,7 +817,7 @@ For a non-Hermitian Hamiltonian the exceptional points can lie on the real axis.
|
||||
|
||||
\section{Conclusion}
|
||||
|
||||
In order to model accurately chemical systems, one must choose, in a ever larger zoo of methods, which computational protocol is adapted to the system of interest.
|
||||
In order to model accurately chemical systems, one must choose, in a ever growing zoo of methods, which computational protocol is adapted to the system of interest.
|
||||
This choice can be, moreover, motivated by the type of properties that one is interested in.
|
||||
That means that one must understand the strengths and weaknesses of each method, i.e., why one method might fail in some cases and work beautifully in others.
|
||||
We have seen that for methods relying on perturbation theory the successes and failures are directly connected to the position of EPs in the complex plane.
|
||||
|
Loading…
Reference in New Issue
Block a user