9
1
mirror of https://github.com/QuantumPackage/qp2.git synced 2024-06-02 10:45:18 +02:00
qp2/docs/source/appendix/benchmarks.rst
2019-01-25 11:39:31 +01:00

159 lines
7.0 KiB
ReStructuredText

Benchmarks
==========
The determinant selection, MR-PT2 and diagonalization are parallelized with
distributed parallelism. Benchmarks for the [NH2-CH-NH2]+ molecule in the
aug-cc-pVDZ basis set are presented with up to 50 nodes (1800 cores) on
CALMIP's `Olympe`_ supercomputer, and 200 nodes (9600 cores) on GENCI's
`Irene`_ supercomputer. This represents an active space of 18 electrons
in 111 MOs.
- Nodes of Olympe have two Skylake sockets, 2x18 cores @ 2.3GHz.
- Nodes of Irene have two Skylake sockets, 2x24 cores @ 2.7GHz.
Convergence of the energy
-------------------------
.. figure:: /_static/cn3_energy.png
:alt: Convergence of the energy.
Convergence of the variational energy, with and without the PT2 correction.
Both energies converge to the (frozen core) FCI energy.
The plot is displayed for the ground state and for the 1st excited state.
Variational energy
^^^^^^^^^^^^^^^^^^
================ ================ ================ ===============
Number of dets Ground state Excited state Excitation (eV)
================ ================ ================ ===============
7 -149.489 186 -149.207 354 7.67
123 -149.536 265 -149.261 860 7.47
3 083 -149.685 606 -149.404 450 7.65
29 409 -149.826 151 -149.547 275 7.59
168 595 -149.900 352 -149.626 058 7.46
1 322 537 -149.946 655 -149.675 032 7.39
8 495 334 -149.972 032 -149.704 145 7.29
9 356 952 -149.973 375 -149.706 822 7.25
42 779 636 -149.987 370 -149.721 470 7.24
186 978 487 -149.998 582 -149.733 039 7.23
================ ================ ================ ===============
Variational energy + PT2 correction
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
================ ================ ================ ===============
Number of dets Ground state Excited state Excitation (eV)
================ ================ ================ ===============
7 -150.161 107 -149.904 883 6.97
123 -150.116 958 -149.849 465 7.28
3 083 -150.043 5(2) -149.780 8(2) 7.15
29 409 -150.022 2(2) -149.758 3(2) 7.18
168 595 -150.019 9(1) -149.754 5(1) 7.22
1 322 537 -150.017 89(7) -149.752 55(7) 7.22
8 495 334 -150.015 97(4) -149.750 87(5) 7.21
9 356 952 -150.015 89(3) -149.750 66(3) 7.22
42 959 496 -150.016 75(2) -149.751 88(2) 7.21
186 978 487 -150.017 51(2) -149.752 90(2) 7.20
================ ================ ================ ===============
Davidson's diagonalization
--------------------------
We present the parallel speedup curve, and the wall-clock time in seconds
required to compute one iteration for two wave functions measured on Olympe
and Irene.
.. figure:: /_static/speedup_davidson.png
:alt: Parallel speedup of Davidson's diagonalization.
Parallel speedup of Davidson's diagonalization measured on Olympe and Irene.
Olympe
^^^^^^
======================= ====================== =======================
Number of 36-core Nodes 9 356 952 determinants 42 959 496 determinants
======================= ====================== =======================
1 775.55 11 198.70
5 169.88 2 288.58
10 93.22 1 213.95
20 56.86 626.41
30 43.76 445.65
40 36.18 350.25
50 33.67 295.25
======================= ====================== =======================
Irene
^^^^^
======================= ====================== =======================
Number of 48-core Nodes 9 356 952 determinants 42 959 496 determinants
======================= ====================== =======================
1 572.98 9 154.30 *
10 72.55 922.07
25 38.88 412.34
50 27.95 241.35
75 27.54 183.63
100 27.86 165.68
150 28.14 134.05
200 27.77 134.64
======================= ====================== =======================
PT2 correction
--------------
We present the parallel speedup curve, and the wall-clock time in seconds
required to compute the PT2 correction for two wave functions measured on
Olympe and Irene.
.. figure:: /_static/speedup_pt2.png
:alt: Parallel speedup of the PT2 computation of the ground state.
Parallel speedup of the PT2 computation of the ground state measured
on Olympe and Irene.
Olympe
^^^^^^
======================= ====================== =======================
Number of 36-core Nodes Ground state (9.3M) Excited state (9.3M)
======================= ====================== =======================
1 7 883.74 9 829.19
5 1 629.06 2 022.36
10 832.89 1 029.91
20 440.76 537.37
30 303.31 378.69
40 246.12 296.31
50 201.84 241.55
======================= ====================== =======================
Irene
^^^^^
======================= ====================== ======================= ====================== =======================
Number of 48-core Nodes Ground state (9.3M) Excited state (9.3M) Ground state (42.9M) Excited state (42.9M)
======================= ====================== ======================= ====================== =======================
1 4 935.81 6 152.29 24 586.62 37 440.59
10 525.95 652.23 2 458.66 3 086.19
25 237.47 286.06 1 041.69 1 295.43
50 144.39 174.12 588.35 724.25
75 109.13 129.17 446.74 537.59
100 100.75 103.43 367.21 450.32
150 82.04 91.77 298.63 358.25
200 75.62 85.25 268.96 312.23
======================= ====================== ======================= ====================== =======================