By recasting the non-linear frequency-dependent $GW$ quasiparticle equation into a linear eigenvalue problem, we explain the appearance of multiple solutions and unphysical discontinuities in various physical quantities computed within the $GW$ approximation.
Considering the $GW$ self-energy as an effective Hamiltonian, these issues can be directly related to the intruder state problem.
The $GW$ approximation of many-body perturbation theory \cite{Hedin_1965,Martin_2016} allows to compute accurate charged excitation (\ie, ionization potentials and electron affinities) in solids and molecules. \cite{Aryasetiawan_1998,Onida_2002,Reining_2017,Golze_2019}
Its popularity in the molecular structure community is rapidly growing \cite{Ke_2011,Bruneval_2012,Bruneval_2013,Bruneval_2015,Blase_2016,Bruneval_2016, Bruneval_2016a,Koval_2014,Hung_2016,Blase_2018,Boulanger_2014,Li_2017,Hung_2016,Hung_2017,vanSetten_2015,vanSetten_2018,vanSetten_2015, Maggio_2017,vanSetten_2018,Richard_2016,Gallandi_2016,Knight_2016,Dolgounitcheva_2016,Bruneval_2015,Krause_2015,Govoni_2018,Caruso_2016} thanks to its relatively low cost and somehow surprising accuracy for weakly-correlated systems. \cite{Bruneval_2021}
The idea behind the $GW$ approximation is to recast the many-body problem into a set of non-linear one-body equations. The introduction of the self-energy $\Sigma$ links the non-interacting Green's function $G_0$ to its fully-interacting version $G$ via the following Dyson equation:
Electron correlation is then explicitly incorporated into one-body quantities via a sequence of self-consistent steps known as Hedin's equations. \cite{Hedin_1965}
%which connect $G$, the irreducible vertex function $\Gamma$, the irreducible polarizability $P$, the dynamically-screened Coulomb interaction $W$, and $\Sigma$ through a set of five equations.
% & \Sigma(12) = i \int G(13) W(14) \Gamma(324) d(34),
%\end{align}
%\end{subequations}
%where $v$ is the bare Coulomb interaction, $\delta(12)$ is Dirac's delta function and $(1)$ is a composite coordinate gathering spin, space and time variables $(\sigma_1,\boldsymbol{r}_1,t_1)$.
In recent studies, \cite{Loos_2018b,Veril_2018,Berger_2021,DiSabatino_2021} we discovered that one can observe (unphysical) irregularities and/or discontinuities in the energy surfaces of several key quantities (ionization potential, electron affinity, fundamental gap, total and correlation energies, as well as vertical excitation energies) even in the weakly correlated regime.
These issues were discovered in Ref.~\onlinecite{Loos_2018b} while studying a model two-electron system \cite{Seidl_2007,Loos_2009a,Loos_2009c} and they were further investigated in Ref.~\onlinecite{Veril_2018}, where we provided additional evidences and explanations of these undesirable features in real molecular systems.
In particular, we showed that each branch of the self-energy $\Sigma$ is associated with a distinct quasiparticle solution, and that each switch between solutions implies a significant discontinuity in the quasiparticle energy due to the transfer of weight between two solutions of the quasiparticle equation. \cite{Veril_2018}
Multiple solution issues in $GW$ appears frequently, especially for orbitals that are energetically far from the Fermi level, such as in core ionized states. \cite{Golze_2018,Golze_2020}
It was shown that these problems could be alleviated by using a static Coulomb-hole plus screened-exchange (COHSEX) \cite{Hedin_1965,Hybertsen_1986,Hedin_1999,Bruneval_2006} self-energy \cite{Berger_2021} or by considering a fully self-consistent scheme. \cite{DiSabatino_2021}
However, none of these solutions is completely satisfying as a static approximation of the self-energy can induce significant loss in accuracy and fully self-consistent calculations can be quite challenging in terms of implementation and cost.
In the present article, via an upfolding process of the non-linear $GW$ equation, \cite{Bintrim_2021a} we provide further physical insights into the origin of these discontinuities by highlighting, in particular, the role of intruder states.
Inspired by regularized electronic structure theories, \cite{Lee_2018a,Evangelista_2014b} these new insights allow us to propose a cheap and efficient regularization scheme in order to avoid these issues.
Here, we consider the one-shot {\GOWO}\cite{Strinati_1980,Hybertsen_1985a,Hybertsen_1986,Godby_1988,Linden_1988,Northrup_1991,Blase_1994,Rohlfing_1995,Shishkin_2007} for the sake of simplicity but the same analysis can be performed in the case of (partially) self-consistent schemes.\cite{Hybertsen_1986,Shishkin_2007,Blase_2011,Faber_2011,Rangel_2016,Gui_2018,Faleev_2004,vanSchilfgaarde_2006,Kotani_2007,Ke_2011,Kaplan_2016}
Moreover, we consider a restricted Hartree-Fock (HF) starting point but it can be straightforwardly extended to a Kohn-Sham (KS) starting point.
Throughout this article, $p$ and $q$ are general (spatial) orbitals, $i$, $j$, $k$, and $l$ denotes occupied orbitals, $a$, $b$, $c$, and $d$ are (unoccupied) virtual orbitals, while $m$ labels single excitations $i \to a$.
Within the {\GOWO} approximation, in order to obtain the quasiparticle energies and the corresponding satellites, one solve, for each spatial orbital $p$, the following (non-linear) quasiparticle equation
where $\Om{m}{\RPA}$ and $\bX{m}{\RPA}$ are respectively the $m$th eigenvalue and eigenvector of the random-phase approximation (RPA) problem in the Tamm-Dancoff approximation, \ie,
As a non-linear equation, Eq.~\eqref{eq:qp_eq} has many solutions $\eps{p,s}{\GW}$ and their corresponding weights are given by the value of the following renormalization factor
In a well-behaved case, one of the solution (the so-called quasiparticle) $\eps{p}{\GW}\equiv\eps{p,s=0}{\GW}$ has a large weight $Z_{p}\equiv Z_{p,s=0}$.
which physically shows that the mean-field solution of unit weight is ``scattered'' by the effect of correlation in many solutions with smaller weights.
The non-linear quasiparticle equation \eqref{eq:qp_eq} can be \textit{exactly} transformed into a larger linear problem via an upfolding process where the 2h1p and 2p1h sectors
The size of this eigenvalue problem is $1+ O^2 V + O V^2$ (where $O$ and $V$ are the number of occupied and virtual orbitals, respectively), and this eigenvalue problem has to be solved for each orbital that one wishes to correct.
Note, however, that the blocks $\bC{}{\text{2h1p}}$ and $\bC{}{\text{2p1h}}$ do not need to be recomputed for each orbital.
Because the renormalization factor \eqref{eq:Z} corresponds to the projection of the vector $\bc{}{(p,s)}$ onto the reference (or internal) space, the weight of a solution $(p,s)$ is given by the first coefficient of their corresponding eigenvector $\bc{}{(p,s)}$, \ie,
It is paramount to understand that diagonalizing $\bH^{(p)}$ [see Eq.~\eqref{eq:Hp}] is completely equivalent to solving the quasiparticle equation \eqref{eq:qp_eq}.
This can be further illustrated by expanding the secular equation associated with Eq.~\eqref{eq:Hp}
One can see this downfolding process as the construction of a frequency-dependent effective Hamiltonian where the internal space is composed by a single determinant of the 1h or 1p sector and the external (or outer) space by all the 2h1p and 2p1h configurations. \cite{Dvorak_2019a,Dvorak_2019b,Bintrim_2021a}
The main mathematical difference between the two approaches is that, by diagonalizing Eq.~\eqref{eq:Hp}, one has directly access to the internal and external components of the eigenvectors associated with each quasiparticle and satellite, and not only their projection in the reference space as shown by Eq.~\eqref{eq:Z_proj}.
The element $\eps{p}{\HF}$ of $\bH^{(p)}$ [see Eq.~\eqref{eq:Hp}] corresponds to the relative energy of the $(\Ne\pm1)$-electron reference determinant (compared to the $\Ne$-electron HF determinant) while the diagonal elements of the blocks $\bC{}{\text{2h1p}}$ and $\bC{}{\text{2p1h}}$ provide an estimate of the relative energy of the 2h1p and 2p1h determinants.
In some situations, one of these determinants from the external space may become of similar energy than the reference determinant.
Hence, these two diabatic electronic configurations may cross and form an avoided crossing, and this outer-space determinant may be labeled as an intruder state.
As we shall see below, discontinuities, which are ubiquitous in molecular systems, arise in such scenarios.
In order to illustrate the appearance and the origin of these multiple solutions, we consider the hydrogen molecule in the 6-31G basis set which corresponds to a two-electron system with four spatial orbitals (one occupied and three virtuals).
This example was already considered in our previous work \cite{Veril_2018} but here we provide further insights on the origin of the appearances of these discontinuities.
We denote as $\ket*{1\Bar{1}}$ the $\Ne$-electron ground-state determinant where the orbital 1 is occupied by one spin-up and spin-down electron.
Similarly notations will be employed for the $(\Ne\pm1)$-electron configurations.
Quasiparticle energies (left), correlation part of the self-energy (center) and renormalization factor (right) as functions of the internuclear distance $\RHH$ (in \si{\angstrom}) for various orbitals of \ce{H2} at the {\GOWO}@HF/6-31G level.
Figure \ref{fig:H2} shows the evolution of the quasiparticle energy, energetically close-by satellites and their corresponding weights at the {\GOWO}@HF level as a function on the internuclear distance $\RHH$.
One can easily diagnose two problematic regions showing obvious discontinuities around $\RHH=\SI{0.5}{\angstrom}$ for the HOMO$+3$ ($p =4$) and $\RHH=\SI{1.1}{\angstrom}$ for the HOMO$+2$ ($p =3$).
Let us first look more closely at the region around $\RHH=\SI{1.1}{\angstrom}$.
As one can see, an avoided crossing is formed between two $(\Ne+1)$-electron.
Inspection of their corresponding eigenvectors reveals that the determinants involved are the reference 1p determinant $\ket*{1\Bar{1}3}$ and an excited $(\Ne+1)$-electron of configuration $\ket*{12\Bar{2}}$.
The algorithm diabatically follows the reference determinant $\ket*{1\Bar{1}3}$, while it should be adiabatically switching to the $\ket*{12\Bar{2}}$ determinant which becomes lower in energy for $\RHH > \SI{1.1}{\angstrom}$.
This is not the $\Ne$-electron situation where one has to check if it is multireference, but for the $\Ne\pm1$-electron situations.
As similar scenario is at the play in the region around $\RHH=\SI{0.5}{\angstrom}$ but it now involves 3 electronic configurations: the $\ket*{1\Bar{1}4}$ reference determinant as well as two external determinants of configuration $\ket*{1\Bar{?}?}$ and $\ket*{1\Bar{?}?}$.
These forms two avoided crossings in rapid successions, which involves two discontinuties in the energy surface.
One direct consequences of such irregularities is the difficulty to converge (partially) self-consistent $GW$ methods as, depending on the iteration, the self-consistent procedure will jump erratically from one solution to the other (even if DIIS is employed).
Note that these issues do not only appear at the {\GOWO} level but also at the partially self-consistent levels such as ev$GW$\cite{Hybertsen_1986,Shishkin_2007,Blase_2011,Faber_2011,Rangel_2016,Gui_2018} and qs$GW$. \cite{Faleev_2004,vanSchilfgaarde_2006,Kotani_2007,Ke_2011,Kaplan_2016}
However, fully self-consistent $GW$ methods \cite{Stan_2006,Stan_2009,Rostgaard_2010,Caruso_2012,Caruso_2013,Caruso_2013a,Caruso_2013b,Koval_2014,Wilhelm_2018} where one considers not only the quasiparticle solutions but also the satellites at each iteration are free of these irregularities. \cite{DiSabatino_2021}
The $T$-matrix-based formalism as well as second-order Green's function (or second Born) scheme \cite{SzaboBook,Casida_1989,Casida_1991,Stefanucci_2013,Ortiz_2013, Phillips_2014,Phillips_2015,Rusakov_2014,Rusakov_2016,Hirata_2015,Hirata_2017} exhibit the same problems.
This project has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (Grant agreement No.~863481).}