Accelerating Nuclear Configuration Interaction Calculations through a Preconditioned Block Iterative Eigensolver

09/06/2016
by Meiyue Shao, et al.

We describe a number of recently developed techniques for improving the performance of large-scale nuclear configuration interaction calculations on high-performance parallel computers. We show the benefit of replacing the Lanczos algorithm, which has traditionally been used for this type of computation, with a preconditioned block iterative method. The rapid convergence of the block iterative method is achieved through a proper choice of starting guesses for the eigenvectors and the construction of an effective preconditioner. These acceleration techniques take advantage of the special structure of the nuclear configuration interaction problem, which we discuss in detail. The use of a block method also allows us to improve the concurrency of the computation and to take advantage of the memory hierarchy of modern microprocessors to increase the arithmetic intensity of the computation relative to data movement. We also discuss implementation details that are critical to achieving high performance on massively parallel multi-core supercomputers, and we demonstrate that the new block iterative solver is two to three times faster than the Lanczos-based algorithm for problems of moderate size on a Cray XC30 system.
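To make the idea of a preconditioned block iterative eigensolver concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not name the specific algorithm, preconditioner, or starting-guess strategy, so everything here is an assumption for illustration: the iteration is a simple block preconditioned steepest descent with a Rayleigh-Ritz step, the preconditioner is a plain diagonal (Jacobi) scaling, the starting guesses are unit vectors at the smallest diagonal entries, and the test matrix from the hypothetical helper `demo_matrix` is a small dense stand-in for a nuclear Hamiltonian. It assumes NumPy is available.

```python
import numpy as np


def block_preconditioned_eigensolver(A, X0, precond, tol=1e-8, max_iter=500):
    """Approximate the k lowest eigenpairs of a symmetric matrix A.

    Illustrative block preconditioned steepest descent (an assumption,
    not the method of the paper): each step performs Rayleigh-Ritz on
    the subspace spanned by the current block X and the preconditioned
    residual block W = precond(R).
    """
    k = X0.shape[1]
    S = X0
    for it in range(max_iter):
        # Orthonormalize the search subspace, then Rayleigh-Ritz.
        Q, _ = np.linalg.qr(S)
        ritz_vals, C = np.linalg.eigh(Q.T @ (A @ Q))
        theta = ritz_vals[:k]          # k lowest Ritz values
        X = Q @ C[:, :k]               # corresponding Ritz vectors
        # Block residual R = A X - X diag(theta).
        R = A @ X - X * theta
        if np.linalg.norm(R, axis=0).max() < tol:
            return theta, X, it
        # Apply the preconditioner to all residual columns at once.
        W = precond(R)
        S = np.hstack([X, W])
    return theta, X, max_iter


def demo_matrix(n, seed=0):
    """Hypothetical test problem: diagonal 1..n plus a small symmetric perturbation."""
    rng = np.random.default_rng(seed)
    E = rng.standard_normal((n, n))
    return np.diag(np.arange(1.0, n + 1.0)) + 0.01 * (E + E.T)


if __name__ == "__main__":
    n, k = 200, 4
    A = demo_matrix(n)
    d = np.diag(A)
    # Starting guesses: unit vectors at the k smallest diagonal entries.
    X0 = np.eye(n)[:, np.argsort(d)[:k]]
    # Jacobi preconditioner: scale each residual row by 1 / A[i, i].
    theta, X, iters = block_preconditioned_eigensolver(
        A, X0, lambda R: R / d[:, None]
    )
    print("iterations:", iters)
    print("lowest eigenvalues:", theta)
```

Note that the products `A @ Q` and `A @ X` apply the matrix to several vectors at once. In a sparse, distributed setting this is what lets a block method reuse each loaded matrix entry across multiple vectors, which is the arithmetic-intensity advantage over one-vector-at-a-time Lanczos that the abstract refers to.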


research
05/24/2023

Hybrid Eigensolvers for Nuclear Configuration Interaction Calculations

We examine and compare several iterative methods for solving large-scale...
research
10/20/2021

Accelerating quantum many-body configuration interaction with directives

Many-Fermion Dynamics-nuclear, or MFDn, is a configuration interaction (...
research
09/01/2021

Accelerating an Iterative Eigensolver for Nuclear Structure Configuration Interaction Calculations on GPUs using OpenACC

To accelerate the solution of large eigenvalue problems arising from man...
research
10/27/2022

Bootstrapped Block Lanczos for large-dimension eigenvalue problems

The Lanczos algorithm has proven itself to be a valuable matrix eigensol...
research
09/25/2020

Compressed Basis GMRES on High Performance GPUs

Krylov methods provide a fast and highly parallel numerical tool for the...
research
05/21/2018

CUDACLAW: A high-performance programmable GPU framework for the solution of hyperbolic PDEs

We present cudaclaw, a CUDA-based high performance data-parallel framewo...
research
12/12/2021

An Advanced Parallel PageRank Algorithm

Initially used to rank web pages, PageRank has now been applied in many ...
