pposv

The pposv solver supports the following input types.

data type: DSZC
index type: std::int32_t, std::int64_t

The pposv solves a square symmetric positive definite matrix using Cholesky decomposition. The matrix of size \(N\) is stored in a memory block of size \(N^2\).

Constructor

There are four template arguments.

data type, e.g., double, float, std::complex<double>, std::complex<float>.
index type, e.g., std::int32_t, std::int64_t.
symmetry flag, e.g., U, L.
proccess grid order, R or C.

This solver uses a 2D process grid, so one can choose from Row major or Column major ordering.

The symmetry flag can be either Upper or Lower. If only half of the matrix is populated, this flag needs to be set properly. If the whole symmetric matrix is populated, either one will work.

With np_row and np_col denoting the numbers of rows and columns of the process grid, a solver can be constructed using the following.

C++
auto solver = pposv<double, int, 'L', 'R'>(np_row, np_col);

It is possible to let the library automatically determine the numbers of rows and columns, then the solver can be constructed as

C++
auto solver = pposv<double, int, 'L', 'R'>();

Solving

Prepare matrices \(A\) and \(B\) on the root process in whatever form of choice. Those two matrices need to be wrapped into template<data_t DT, index_t IT> struct full_mat objects, which require

the number of rows of the matrix, n_rows,
the number of columns of the matrix, n_cols,
a pointer pointing to the data, data.

Then one can call the solver via

C++
constexpr auto N = 6, NRHS = 2;
// storage for the matrices A and B
std::vector<double> A, B;
// ...
// ... storage is populated by some utility on the root process
// ...
const auto info = solver.solve({N, N, A.data()}, {N, NRHS, B.data()});

The factorization will be stored internally as long as the solver object stays alive. Thus to reuse the factorization, a second solve can call the following.

C++
const auto info = solver.solve({N, NRHS, B.data()});

A second call to solve(full_mat<DT, IT>&& A, full_mat<DT, IT>&& B) will clear previous factorizations automatically.

On return, B will be replaced by the solution. In the above example, std::vector<double> B will contain the solution.