Towards Green Computing: A Survey of Performance and Energy Efficiency of Different Platforms using OpenCL

03/08/2020
by   Philip Heinisch, et al.
0

When considering different hardware platforms, not just the time-to-solution can be of importance but also the energy necessary to reach it. This is not only the case with battery powered and mobile devices but also with high-performance parallel cluster systems due to financial and practical limits on power consumption and cooling. Recent developments in hard- and software have given programmers the ability to run the same code on a range of different devices giving rise to the concept of heterogeneous computing. Many of these devices are optimized for certain types of applications. To showcase the differences and give a basic outlook on the applicability of different architectures for specific problems, the cross-platform OpenCL framework was used to compare both time- and energy-to-solution. A large set of devices ranging from ARM processors to server CPUs and consumer and enterprise level GPUs has been used with different benchmarking testcases taken from applied research applications. While the results show the overall advantages of GPUs in terms of both runtime and energy efficiency compared to CPUs, ARM devices show potential for certain applications in massively parallel systems. This study also highlights how OpenCL enables the use of the same codebase on many different systems and hardware platforms without specific code adaptations.

READ FULL TEXT

page 9

page 11

page 12

page 13

page 14

page 16

page 17

page 18

research
03/06/2020

Performance and energy footprint assessment of FPGAs and GPUs on HPC systems using Astrophysics application

New challenges in Astronomy and Astrophysics (AA) are urging the need fo...
research
05/15/2021

Comparison of HPC Architectures for Computing All-Pairs Shortest Paths. Intel Xeon Phi KNL vs NVIDIA Pascal

Today, one of the main challenges for high-performance computing systems...
research
08/10/2019

Work-in-Progress: A Simulation Framework for Domain-Specific System-on-Chips

Heterogeneous system-on-chips (SoCs) have become the standard embedded c...
research
10/31/2019

Direct N-body application on low-power and energy-efficient parallel architectures

The aim of this work is to quantitatively evaluate the impact of computa...
research
05/26/2020

Using PHAST to port Caffe library: First experiences and lessons learned

Performance has always been a hot topic in computing. However, the viabl...
research
06/01/2020

Exceeding Conservative Limits: A Consolidated Analysis on Modern Hardware Margins

Modern large-scale computing systems (data centers, supercomputers, clou...
research
04/21/2019

Intermittent Learning: On-Device Machine Learning on Intermittently Powered System

In this paper, we introduce the concept of intermittent learning, which ...

Please sign up or login with your details

Forgot password? Click here to reset