From 55676a900258afc68f66bbbc5dcc34c8994d1f19 Mon Sep 17 00:00:00 2001 From: Anthony Scemama Date: Fri, 16 Oct 2020 18:23:20 +0200 Subject: [PATCH] Moved Wiki documentation into the project --- README.md | 20 ++++++- src/README.org | 145 +++++++++++++++++++++++++++++++++++++++++++++++-- 2 files changed, 156 insertions(+), 9 deletions(-) diff --git a/README.md b/README.md index 7d64383..438414c 100644 --- a/README.md +++ b/README.md @@ -1,7 +1,21 @@ -# qmckl +# Quantum Monte Carlo Kernel Library. ![Build Status](https://github.com/TREX-CoE/qmckl/workflows/test-build/badge.svg?branch=main) -Quantum Monte Carlo Kernel Library. +The domain of quantum chemistry needs a library in which the main +kernels of Quantum Monte Carlo (QMC) methods are implemented. In the +library proposed in this project, we expose the main algorithms in a +language and provide a standard API and tests to enable the +development of high-performance QMCkl implementations taking +advantage of modern hardware. -See the [Wiki](https://github.com/TREX-CoE/qmckl/wiki) for more information. +See the [source code](https://github.com/TREX-CoE/qmckl/src/README.org) +to read the documentation. + + + +------------------------------ + +[[https://trex-coe.eu/sites/default/files/inline-images/euflag.jpg]] +[TREX: Targeting Real Chemical Accuracy at the Exascale](https://trex-coe.eu) project has received funding from the European Union’s Horizon 2020 - Research and Innovation program - under grant agreement no. 952165. The content of this document does not represent the opinion of the European Union, and the European Union is not responsible for any use that might be made of such content. + \ No newline at end of file diff --git a/src/README.org b/src/README.org index c5744fc..fac2ecf 100644 --- a/src/README.org +++ b/src/README.org @@ -2,9 +2,15 @@ ** Introduction - The main objective of present library is documentation. Therefore, - literate programming is particularly adapted in this context. - Source files are written in org-mode format, to provide useful + The ultimate goal of QMCkl is to provide a high-performance + implementation of the main kernels of QMC. In this particular + repository, we focus on the definition of the API and the tests, + and on a /pedagogical/ presentation of the algorithms. We expect the + HPC experts to use this repository as a reference for re-writing + optimized libraries. + + Literate programming is particularly adapted in this context. + Source files are written in [[ottps://karl-voit.at/2017/09/23/orgmode-as-markup-only/][org-mode]] format, to provide useful comments and LaTex formulas close to the code. There exists multiple possibilities to convert org-mode files into different formats such as HTML or pdf. @@ -14,9 +20,21 @@ The code is extracted from the org files using Emacs as a command-line tool in the =Makefile=, and then the produced files are compiled. - If the name of the file is =xxx.org=, the name of the produced C - files should be =xxx.c= and =xxx.h= and the name of the produced - Fortran files should be =xxx.f90= +*** Language used + + Fortran is one of the most common languages used by the community, + and is simple enough to make the algorithms readable. Hence we + propose in this pedagogical implementation of QMCkl to use Fortran + to express the algorithms. For specific internal functions where + the C language is more natural, C is used. + + As Fortran modules generate compiler-dependent files, the use of + modules is restricted to the internal use of the library, otherwise + the compliance with C is violated. + + The external dependencies should be kept as small as possible, so + external libraries should be used /only/ if their used is strongly + justified. *** Source code editing @@ -50,8 +68,123 @@ rm ${nb}.md iso-c-binding. The name of the Fortran source files should end with =_f.f90= to be properly handled by the Makefile. +*** Coding style + # TODO: decide on a coding style + + To improve readability, we maintain a consistent coding style in the library. + + - For C source files, we will use __(decide on a coding style)__ + - For Fortran source files, we will use __(decide on a coding style)__ + + Coding style can be automatically checked with [[https://clang.llvm.org/docs/ClangFormat.html][clang-format]]. + +** Design of the library + + The proposed API should allow the library to: + - deal with memory transfers between CPU and accelerators + - use different levels of floating-point precision + + We chose a multi-layered design with low-level and high-level + functions (see below). + +*** Naming conventions + + Use =qmckl_= as a prefix for all exported functions and variables. + All exported header files should have a filename with the prefix + =qmckl_=. + + If the name of the org-mode file is =xxx.org=, the name of the + produced C files should be =xxx.c= and =xxx.h= and the name of the + produced Fortran files should be =xxx.f90= + +*** Application programming interface + + The application programming interface (API) is designed to be + compatible with the C programming language (not C++), to ensure + that the library will be easily usable in any language. + This implies that only the following data types are allowed in the API: + + - 32-bit and 64-bit floats and arrays + - 32-bit and 64-bit integers and arrays + - Pointers should be represented as 64-bit integers (even on + 32-bit architectures) + - ASCII strings are represented as a pointers to a character arrays + and terminated by a zero character (C convention). + + To facilitate the use in other languages than C, we provide some + bindings in other languages in other repositories. + + # TODO : Link to repositories for bindings + +*** Global state + + Global variables should be avoided in the library, because it is + possible that one single program needs to use multiple instances of + the library. To solve this problem we propose to use a pointer to a + =context= variable, built by the library with the + =qmckl_context_create= function. The =context= contains the global + state of the library, and is used as the first argument of many + QMCkl functions. + + Modifying the state is done by setters and getters, prefixed + by =qmckl_context_set_= an =qmckl_context_get_=. + When a context variable is modified by a setter, a copy of the old + data structure is made and updated, and the pointer to the new data + structure is returned, such that the old contexts can still be + accessed. + It is also possible to modify the state in an impure fashion, using + the =qmckl_context_update_= functions. + The context and its old versions can be destroyed with + =qmckl_context_destroy=. + +*** Low-level functions + + Low-level functions are very simple functions which are leaves of the + function call tree (they don't call any other QMCkl function). + + This functions are /pure/, and unaware of the QMCkl =context=. They are + not allowed to allocate/deallocate memory, and if they need + temporary memory it should be provided in input. + +*** High-level functions + + High-level functions are at the top of the function call tree. + They are able to choose which lower-level function to call + depending on the required precision, and do the corresponding type + conversions. + These functions are also responsible for allocating temporary + storage, to simplify the use of accelerators. + + The high-level functions should be pure, unless the introduction of + non-purity is justified. All the side effects should be made in the + =context= variable. + + # TODO : We need an identifier for impure functions + +*** Numerical precision + + The number of bits of precision required for a function should be + given as an input of low-level computational functions. This input will + be used to define the values of the different thresholds that might + be used to avoid computing unnecessary noise. + High-level functions will use the precision specified in the + =context= variable. + +** Algorithms + + Reducing the scaling of an algorithm usually implies also reducing + its arithmetic complexity (number of flops per byte). Therefore, + for small sizes \(\mathcal{O}(N^3)\) and \(\mathcal{O}(N^2)\) algorithms + are better adapted than linear scaling algorithms. + As QMCkl is a general purpose library, multiple algorithms should + be implemented adapted to different problem sizes. + ** Documentation +- [[qmckl.org][Main QMCkl header file]] - [[qmckl_context.org][Context]] +** Acknowledgments +[[https://trex-coe.eu/sites/default/files/inline-images/euflag.jpg]] +[[https://trex-coe.eu][TREX: Targeting Real Chemical Accuracy at the Exascale]] project has received funding from the European Union’s Horizon 2020 - Research and Innovation program - under grant agreement no. 952165. The content of this document does not represent the opinion of the European Union, and the European Union is not responsible for any use that might be made of such content.