Here, we build on the works of Scuseria \textit{et al.} [\href{http://dx.doi.org/10.1063/1.3043729}{J.~Chem.~Phys.~\textbf{129}, 231101 (2008)}] and Berkelbach [\href{https://doi.org/10.1063/1.5032314}{J.~Chem.~Phys.~\textbf{149}, 041103 (2018)}] to show connections between the Bethe-Salpeter equation (BSE) formalism combined with the $GW$ approximation from many-body perturbation theory and coupled-cluster (CC) theory at the ground- and excited-state levels.
In particular, it may provide a path for the computation of ground- and excited-state properties (such as nuclear gradients) within the $GW$ and BSE frameworks.
The random-phase approximation (RPA), introduced by Bohm and Pines \cite{Bohm_1951,Pines_1952,Bohm_1953} in the context of the uniform electron gas, \cite{Loos_2016} is a quasibosonic approximation where one treats fermion products as bosons.
In the particle-hole (ph) channel, which is quite popular in the electronic structure community, \cite{Ren_2012,Chen_2017} particle-hole fermionic excitations and deexcitations are assumed to be bosons.
Because ph-RPA takes into account dynamical screening by summing up to infinity the (time-independent) ring diagrams, it is adequate in the high-density (or weakly correlated) regime and captures effectively long-range correlation effects (such as dispersion). \cite{Gell-Mann_1957,Nozieres_1958}
Another important feature of ph-RPA compared to finite-order perturbation theory is that it does not exhibit divergences for small-gap or metallic systems. \cite{Gell-Mann_1957}
Roughly speaking, the Bethe-Salpeter equation (BSE) formalism \cite{Salpeter_1951,Strinati_1988,Blase_2018,Blase_2020} of many-body perturbation theory \cite{Martin_2016} can be seen as a cheap and efficient way of introducing correlation in order to go \textit{beyond} RPA physics.
In the ph channel, BSE is commonly performed on top of a $GW$ calculation \cite{Hedin_1965,Aryasetiawan_1998,Onida_2002,Reining_2017,Golze_2019,Bruneval_2021} from which one extracts the quasiparticle energies as well as the dynamically-screened Coulomb potential $W$.
Practically, $GW$ produces accurate \textit{``charged''} excitations providing a faithful description of the fundamental gap via the computation of the RPA polarizability obtained by a resummation of all time-dependent ring diagrams.
The remaining excitonic effect (\ie, the stabilization provided by the attraction of the excited electron and its hole left behind) is caught via BSE, hence producing overall accurate \textit{``neutral''} excitations.
BSE@$GW$ has been shown to be highly successful to compute low-lying excited states of various natures (charge transfer, Rydberg, valence, etc) in molecular systems with a very attractive accuracy/cost ratio.\cite{Rohlfing_1999a,Horst_1999,Puschnig_2002,Tiago_2003,Rocca_2010,Boulanger_2014,Jacquemin_2015a,Bruneval_2015,Jacquemin_2015b,Hirose_2015,Jacquemin_2017a,Jacquemin_2017b,Rangel_2017,Krause_2017,Gui_2018,Blase_2018,Liu_2020,Blase_2020,Holzer_2018a,Holzer_2018b,Loos_2020e,Loos_2021,McKeon_2022}
Interestingly, RPA has strong connections with coupled-cluster (CC) theory, \cite{Freeman_1977,Scuseria_2008,Jansen_2010,Scuseria_2013,Peng_2013,Berkelbach_2018,Rishi_2020} the workhorse of molecular electronic structure when one is looking for high accuracy. \cite{Cizek_1966,Paldus_1972,Crawford_2000,Piecuch_2002,Bartlett_2007,Shavitt_2009}
In a landmark paper, Scuseria \textit{et al.}\cite{Scuseria_2008} have proven that ring CC with doubles (rCCD) is equivalent to RPA with exchange (RPAx) for the computation of the correlation energy, solidifying in the process the numerical evidences provided by Freeman many years before. \cite{Freeman_1977}
Assuming the existence of $\bX{}{-1}$ (which can be proven as long as the RPAx problem is stable \cite{Scuseria_2008}), this proof can be quickly summarized starting from the RPAx linear eigensystem
where $\Delta_{ijab}=\e{a}{}+\e{b}{}-\e{i}{}-\e{j}{}$.
We assume real quantities throughout this paper, $\e{p}{}$ is the one-electron energy associated with the Hartree-Fock (HF) spinorbital $\SO{p}{}(\bx)$ and
The composite variable $\bx$ gathers spin and spatial ($\br$) variables.
The indices $i$, $j$, $k$, and $l$ are occupied (hole) orbitals; $a$, $b$, $c$, and $d$ are unoccupied (particle) orbitals; $p$, $q$, $r$, and $s$ indicate arbitrary orbitals; and $m$ labels single excitations or deexcitations.
There are various ways of computing the RPAx correlation energy, \cite{Furche_2008,Jansen_2010,Angyan_2011} but the usual plasmon (or trace) formula \cite{Sawada_1957a,Rowe_1968,Schuck_book} yields\footnote{The factor $1/4$ in Eq.~\eqref{eq:EcRPAx} is sometimes replaced by a factor $1/2$, which corresponds to a different choice for the interaction kernel. See Ref.~\onlinecite{Angyan_2011} for more details}
Note that, in the case of RPAx, the same expression as Eq.~\eqref{eq:EcRPAx} can be derived from the adiabatic connection fluctuation dissipation theorem \cite{Furche_2005} (ACFDT) when exchange is included in the interaction kernel. \cite{Angyan_2011}
This simple and elegant proof was subsequently extended to excitation energies by Berkelbach, \cite{Berkelbach_2018} who showed that similitudes between equation-of-motion (EOM) rCCD (EOM-rCCD) \cite{Stanton_1993} and RPAx exist when the EOM space is restricted to the 1h1p configurations and only the two-body terms are dressed by rCCD correlation (see also Ref.~\onlinecite{Rishi_2020}).
where $\bHN= e^{-\hT}\hH_{N} e^{\hT}-\ECC$ is the (shifted) similarity-transformed normal-ordered Hamiltonian, $\Psi_{i}^{a}$ are singly-excited determinants, the one-body terms are
Neglecting the effect of $\hT_2$ on the one-body terms [see Eqs.~\eqref{eq:cFab} and \eqref{eq:cFij}] and relying on the rCCD amplitudes in the two-body terms, Eq.~\eqref{eq:cWibaj}, yields
Although the excitation energies of this approximate EOM-rCCD scheme are equal to the RPAx ones, it has been shown that the transition amplitudes (or residues) are distinct and only agrees at the lowest order in the Coulomb interaction. \cite{Emrich_1981,Berkelbach_2018}
Equation \eqref{eq:EOM-rCCD} can be more systematically derived through the formulation of $\Lambda$ equations based on a rCCD effective Hamiltonian, as proposed by Rishi \textit{et al.}\cite{Rishi_2020}
As we shall see below, the connection between a ph eigensystem with the structure of Eq.~\eqref{eq:RPA} and a set of CC-like amplitude equations does not hold only for RPAx as it is actually quite general and can be applied to most ph problems, such as time-dependent density-functional theory (TD-DFT), \cite{Runge_1984,Casida_1995} BSE, and many others.
This analysis has also been extended to the pp and hh sectors independently by Peng \textit{et al.}\cite{Peng_2013} and Scuseria \textit{et al.}\cite{Scuseria_2013}
are the elements of the correlation part of the dynamically-screened Coulomb potential which is set to its static limit \ie, $W_{pq,rs}^\text{stat}= W_{pq,rs}^\text{c}(\omega=0)$.
and $\Om{m}{\dRPA}$ is the $m$th (positive) eigenvalue and $\bX{m}{\dRPA}+\bY{m}{\dRPA}$ is constructed from the corresponding eigenvectors of the direct (\ie, without exchange) RPA (dRPA) problem defined as
As readily seen in Eqs.~\eqref{eq:A_RPAx}, \eqref{eq:B_RPAx}, \eqref{eq:A_BSE} and \eqref{eq:B_BSE}, the only difference between RPAx and BSE lies in the definition of the matrix elements, where one includes, via the presence of the $GW$ quasiparticle energies in the one-body terms and the screening of the electron-electron interaction [see Eq.~\eqref{eq:W}] in the two-body terms, correlation effects at the BSE level.
Therefore, following the derivation detailed in Sec.~\ref{sec:RPAx}, one can show that the BSE correlation energy obtained using the trace formula
can be equivalently obtained via a set of rCCD-like amplitude equations, where one substitutes in Eq.~\eqref{eq:rCCD} the HF orbital energies by the $GW$ quasiparticle energies and all the antisymmetrized two-electron integrals $\dbERI{pq}{rs}$ by $\wERI{pq}{rs}=\dbERI{pq}{rs}- W_{ps,qr}^\text{stat}$, \ie,
Similarly to the diagonalization of the eigensystem \eqref{eq:BSE}, these approximate CCD amplitude equations can be solved with $\order*{N^6}$ cost via the definition of appropriate intermediates.
As in the case of RPAx (see Sec.~\ref{sec:RPAx}), several variants of the BSE correlation energy do exist, \footnote{\alert{To the best of our knowledge, the trace (or plasmon) formula has been introduced first by Sawada \cite{Sawada_1957a} to calculate the correlation energy of the uniform electron gas as an alternative to the Gell-Mann-Brueckner formulation \cite{Gell-Mann_1957} where one integrates along the adiabatic connection path.
More precisely, the trace formula can be justified via the introduction of a quadratic Hamiltonian made of boson transition operators (quasiboson approximation). See Ref.~\onlinecite{Li_2020} for more details}.} either based on the plasmon formula \cite{Li_2020,Li_2021,DiSabatino_2021} or the ACFDT. \cite{Maggio_2016,Holzer_2018b,Loos_2020e,Berger_2021,DiSabatino_2021}
Following Berkelbach's analysis, \cite{Berkelbach_2018} one can extend the connection to excited states.
Indeed, one can obtain an analog of the 1h1p block of the approximate EOM-rCCD Hamiltonian [see Eq.\eqref{eq:EOM-rCCD}] using the amplitudes resulting from Eq.~\eqref{eq:rCCD-BSE} as well as replacing $\bA{}{}$ and $\bB{}{}$ by their BSE counterparts, \ie,
This equation provides the same excitation energies as the conventional linear-response equations \eqref{eq:BSE}, and the corresponding $\Lambda$ equations based on the BSE effective Hamiltonian $\tHN$ can be derived following Ref.~\onlinecite{Rishi_2020}.
However, there is a significant difference with RPAx as the BSE involves $GW$ quasiparticle energies, where some of the correlation has been already dressed, while the RPAx equations only involves (undressed) one-electron orbital energies, as shown in Eq.~\eqref{eq:EOM-rCCD}.
In other words, in the spirit of the Brueckner version of CCD, \cite{Handy_1989}\alert{the $GW$ pre-treatment renormalizes the bare one-electron energies and, consequently, incorporates mosaic \cite{Scuseria_2008,Scuseria_2013} as well as additional diagrams, \cite{Lange_2018} a process named Brueckner-like dressing in Ref.~\onlinecite{Berkelbach_2018}.}
This observation evidences clear similitudes between BSE@$GW$ and the similarity-transformed EOM-CC (STEOM-CC) method introduced by Nooijen, \cite{Nooijen_1997c,Nooijen_1997b,Nooijen_1997a} where one performs a second similarity transformation to partially decouple the 1h determinants from the 2h1p ones in the ionization potential (IP) sector and the 1p determinants from the 1h2p ones in the electron affinity (EA) sector.
At the CCSD level, for example, this is achieved by performing IP-EOM-CCSD \cite{Stanton_1994,Musial_2003a} (up to 2h1p) and EA-EOM-CCSD \cite{Nooijen_1995,Musial_2003b} (up to 2p1h) calculations prior to the EOM-CC treatment, which can then be reduced to the 1h1p sector thanks to this partial decoupling.
Following the same philosophy, in BSE@$GW$, one performs first a $GW$ calculation (which corresponds to an approximate and simultaneous treatment of the IP and EA sectors up to 2h1p and 2p1h \cite{Lange_2018,Monino_2022}) in order to renormalize the one-electron energies (see Sec.~\ref{sec:GW} for more details).
The dynamical version of BSE [where the BSE kernel is explicitly treated as frequency-dependent in Eq.~\eqref{eq:BSE}] takes partially into account the 2h2p configurations. \cite{Strinati_1980,Strinati_1982,Strinati_1984,Strinati_1988,Rohlfing_2000,Romaniello_2009b,Loos_2020h,Authier_2020,Monino_2021,Bintrim_2022}
Because $GW$ is able to capture key correlation effects as illustrated above, it is therefore interesting to investigate if it is also possible to recast the $GW$ equations as a set of CC-like equations that can be solved iteratively using the CC machinery.
Connections between approximate IP/EA-EOM-CC schemes and the $GW$ approximation have been already studied in details by Lange and Berkelbach, \cite{Lange_2018} but we believe that the present work proposes a different perspective on this particular subject as we derive genuine CC equations that do not decouple the 2h1p and 2p1h sectors.
Note also that the procedure described below can be applied to other approximate self-energies such as second-order Green's function (or second Born) \cite{Stefanucci_2013,Ortiz_2013,Phillips_2014,Rusakov_2014,Hirata_2015,Hirata_2017} or $T$-matrix.\cite{Romaniello_2012,Zhang_2017,Li_2021b,Loos_2022}
Within the perturbative $GW$ scheme (commonly known as $G_0W_0$), the quasiparticle energies are obtained via a one-shot procedure (with or without linearization).
Partial self-consistency can be attained via the \textit{``eigenvalue''} self-consistent $GW$ (ev$GW$) \cite{Hybertsen_1986,Shishkin_2007,Blase_2011,Faber_2011,Rangel_2016,Gui_2018} or the quasiparticle self-consistent $GW$ (qs$GW$) \cite{Faleev_2004,vanSchilfgaarde_2006,Kotani_2007,Ke_2011,Kaplan_2016} schemes.
In the most general setting, the quasiparticle energies and their corresponding orbitals are obtained by diagonalizing the so-called non-linear and frequency-dependent quasiparticle equation
\alert{which gives also access to the satellite solutions}.
In Eq.~\eqref{eq:GW}, $\be{}{}$ is a diagonal matrix gathering the HF orbital energies and the elements of the correlation part of the dynamical (and non-hermitian) $GW$ self-energy are
Because both the left- and right-hand sides of Eq.~\eqref{eq:GW} depend on $\e{p}{\GW}$, this equation has to be solved iteratively via a self-consistent procedure.
As shown by Bintrim and Berkelbach, \cite{Bintrim_2021} the quasiparticle equation \eqref{eq:GW} can be recast as a larger set of linear and frequency-independent equations (that still needs to be solved self-consistently), which reads in the Tamm-Dancoff approximation
Note that, contrary to the IP/EA-EOM-CC equations, $GW$ does couple the IP and EA sectors due to the lack of exponential parametrization of the wave function. \cite{Nooijen_1995,Rishi_2020}
However, it allows to generate higher-order diagrams. \cite{Lange_2018,Schirmer_2018}
Assuming the existence of $\bX{}{-1}$ and introducing $\bT{}{\text{2h1p}}=\bY{}{\text{2h1p}}\cdot\bX{}{-1}$ and $\bT{}{\text{2p1h}}=\bY{}{\text{2p1h}}\cdot\bX{}{-1}$, we have
To determine the 2h1p and 2p1h amplitudes, $t_{ija,p}^{\text{2h1p}}$ and $t_{iab,p}^{\text{2p1h}}$, one can then rely on the usual quasi-Newton iterative procedure to solve these quadratic equations by updating the amplitudes via
\alert{Due to the non-linear nature of these equations, the iterative procedure proposed in Eqs.~\eqref{eq:t_2h1p_update} and \eqref{eq:t_2p1h_update} can potentially converge to satellite solutions.
This is also the case at the CC level when one relies on more elaborated algorithm to converge the amplitude equations to higher-energy solutions. \cite{Piecuch_2000,Mayhall_2010,Lee_2019,Kossoski_2021,Marie_2021b}}
Again, similarly to the dynamical equations defined in Eq.~\eqref{eq:GW} which requires the diagonalization of the dRPA eigenproblem [see Eq.~\eqref{eq:dRPA}], the CC equations reported in Eqs.~\eqref{eq:r_2h1p} and \eqref{eq:r_2p1h} can be solved with $\order*{N^6}$ cost by defining judicious intermediates.
Cholesky decomposition, density fitting, and other related techniques may be employed to further reduce this scaling as it is done in conventional $GW$ calculations. \cite{Bintrim_2021,Forster_2020,Forster_2021,Duchemin_2019,Duchemin_2020,Duchemin_2021}
The $G_0W_0$ quasiparticle energies can be easily obtained via the procedure described in Ref.~\onlinecite{Monino_2022} by solving the previous equations for each value of $p$ separately.
More specifically, we have shown how to recast $GW$ and BSE as non-linear CC-like equations that can be solved with the usual CC machinery at the same computational cost.
The conventional and CC-based versions of the BSE and $GW$ schemes that we have described in the present work have been implemented in the electronic structure package QuAcK \cite{QuAcK} (available at \url{https://github.com/pfloos/QuAcK}) with which we have numerically checked these exact equivalences.
Similitudes between BSE@$GW$ and STEOM-CC have been also highlighted, and may explain the reliability of BSE@$GW$ for the computation of optical excitations in molecular systems.
We hope that the present work may provide a consistent approach for the computation of ground- and excited-state properties (such as nuclear gradients) within the $GW$\cite{Lazzeri_2008,Faber_2011b,Yin_2013,Montserrat_2016,Zhenglu_2019} and BSE \cite{IsmailBeigi_2003,Caylak_2021,Knysh_2022} frameworks, hence broadening the applicability of these formalisms in computational photochemistry.
However, several challenges lie ahead as one must derive, for example, the $\Lambda$ equations associated with $GW$\cite{Bartlett_1986,Rishi_2020} and the response of the static screening with respect to the external perturbation at the BSE level.
The present connections between CC and $GW$ could also provide new directions for the development of multireference $GW$ methods \cite{Brouder_2009,Linner_2019} in order to treat strongly correlated systems. \cite{Lyakh_2012,Evangelista_2018}
This project has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (Grant agreement No.~863481).}