Efficient Exascale Discretizations: High-Order Finite Element Methods

by   Tzanio Kolev, et al.

Efficient exploitation of exascale architectures requires rethinking of the numerical algorithms used in many large-scale applications. These architectures favor algorithms that expose ultra fine-grain parallelism and maximize the ratio of floating point operations to energy intensive data movement. One of the few viable approaches to achieve high efficiency in the area of PDE discretizations on unstructured grids is to use matrix-free/partially-assembled high-order finite element methods, since these methods can increase the accuracy and/or lower the computational time due to reduced data motion. In this paper we provide an overview of the research and development activities in the Center for Efficient Exascale Discretizations (CEED), a co-design center in the Exascale Computing Project that is focused on the development of next-generation discretization software and algorithms to enable a wide range of finite element applications to run efficiently on future hardware. CEED is a research partnership involving more than 30 computational scientists from two US national labs and five universities, including members of the Nek5000, MFEM, MAGMA and PETSc projects. We discuss the CEED co-design activities based on targeted benchmarks, miniapps and discretization libraries and our work on performance optimizations for large-scale GPU architectures. We also provide a broad overview of research and development activities in areas such as unstructured adaptive mesh refinement algorithms, matrix-free linear solvers, high-order data visualization, and list examples of collaborations with several ECP and external applications.


page 1

page 4

page 9

page 16

page 18


GPU Algorithms for Efficient Exascale Discretizations

In this paper we describe the research and development activities in the...

Scalability of High-Performance PDE Solvers

Performance tests and analyses are critical to effective HPC software de...

Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA

With the introduction of advanced heterogeneous computing architectures ...

End-to-end GPU acceleration of low-order-refined preconditioning for high-order finite element discretizations

In this paper, we present algorithms and implementations for the end-to-...

Portable high-order finite element kernels I: Streaming Operations

This paper is devoted to the development of highly efficient kernels per...

Locality Optimized Unstructured Mesh Algorithms on GPUs

Unstructured-mesh based numerical algorithms such as finite volume and f...

Acceleration of tensor-product operations for high-order finite element methods

This paper is devoted to GPU kernel optimization and performance analysi...

Please sign up or login with your details

Forgot password? Click here to reset