Accelerating Discrete Wavelet Transforms on Parallel Architectures

04/27/2017
by   David Barina, et al.
0

The 2-D discrete wavelet transform (DWT) can be found in the heart of many image-processing algorithms. Until recently, several studies have compared the performance of such transform on various shared-memory parallel architectures, especially on graphics processing units (GPUs). All these studies, however, considered only separable calculation schemes. We show that corresponding separable parts can be merged into non-separable units, which halves the number of steps. In addition, we introduce an optional optimization approach leading to a reduction in the number of arithmetic operations. The discussed schemes were adapted on the OpenCL framework and pixel shaders, and then evaluated using GPUs of two biggest vendors. We demonstrate the performance of the proposed non-separable methods by comparison with existing separable schemes. The non-separable schemes outperform their separable counterparts on numerous setups, especially considering the pixel shaders.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2017

Accelerating Discrete Wavelet Transforms on GPUs

The two-dimensional discrete wavelet transform has a huge number of appl...
research
08/25/2017

The Parallel Algorithm for the 2-D Discrete Wavelet Transform

The discrete wavelet transform can be found at the heart of many image-p...
research
11/10/2015

The Radon cumulative distribution transform and its application to image classification

Invertible image representation methods (transforms) are routinely emplo...
research
06/20/2019

Performance Comparison Between OpenCV Built in CPU and GPU Functions on Image Processing Operations

Image Processing is a specialized area of Digital Signal Processing whic...
research
03/11/2022

aiWave: Volumetric Image Compression with 3-D Trained Affine Wavelet-like Transform

Volumetric image compression has become an urgent task to effectively tr...
research
01/20/2020

Discrete Variational Methods and Symplectic Generalized Additive Runge–Kutta Methods

We consider a Lagrangian system L(q,q̇) = ∑_l=1^NL^{l}(q,q̇), where th...
research
10/26/2022

Anisotropic multiresolution analyses for deep fake detection

Generative Adversarial Networks (GANs) have paved the path towards entir...

Please sign up or login with your details

Forgot password? Click here to reset