Somoclu: An Efficient Parallel Library for Self-Organizing Maps

05/07/2013
by   Peter Wittek, et al.
Somoclu is a massively parallel tool, written in C++, for training self-organizing maps on large data sets. It builds on OpenMP for multicore execution and on MPI for distributing the workload across the nodes of a cluster. It can also accelerate training with CUDA if graphics processing units are available. A sparse kernel is included, which is useful for high-dimensional but sparse data, such as the vector spaces common in text mining workflows. Python, R, and MATLAB interfaces facilitate interactive use. Apart from fast execution, memory use is highly optimized, enabling the training of large emergent maps even on a single computer.
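To make concrete what "training a self-organizing map" involves, here is a minimal single-threaded sketch of one common formulation, batch training with a Gaussian neighborhood on a rectangular grid. It is an illustration only and not Somoclu's implementation or API: the function names `best_matching_unit` and `train_batch_som`, the linear radius schedule, and the random initialization are all assumptions made for this example.

```python
import math
import random


def best_matching_unit(codebook, x):
    """Return the index of the codebook vector closest to x (squared Euclidean)."""
    best_index, best_dist = 0, float("inf")
    for j, w in enumerate(codebook):
        dist = sum((wk - xk) ** 2 for wk, xk in zip(w, x))
        if dist < best_dist:
            best_index, best_dist = j, dist
    return best_index


def train_batch_som(data, n_rows, n_cols, epochs=10, seed=0):
    """Hypothetical helper: batch SOM training on a rectangular grid."""
    rng = random.Random(seed)
    dim = len(data[0])
    # Grid coordinates of every map node, in row-major order.
    nodes = [(r, c) for r in range(n_rows) for c in range(n_cols)]
    # Assumed initialization: uniform random weights in [0, 1).
    codebook = [[rng.random() for _ in range(dim)] for _ in nodes]
    radius_start = max(n_rows, n_cols) / 2.0
    for epoch in range(epochs):
        # Assumed schedule: move the radius linearly from radius_start to 1.
        radius = radius_start + (1.0 - radius_start) * epoch / max(epochs - 1, 1)
        # Step 1: find the best matching unit for every data point.
        bmus = [best_matching_unit(codebook, x) for x in data]
        # Step 2: each node becomes a neighborhood-weighted mean of the data.
        new_codebook = []
        for row_j, col_j in nodes:
            numerator = [0.0] * dim
            denominator = 0.0
            for x, b in zip(data, bmus):
                row_b, col_b = nodes[b]
                grid_dist2 = (row_j - row_b) ** 2 + (col_j - col_b) ** 2
                h = math.exp(-grid_dist2 / (2.0 * radius * radius))
                denominator += h
                for k in range(dim):
                    numerator[k] += h * x[k]
            new_codebook.append([v / denominator for v in numerator])
        codebook = new_codebook
    return codebook


if __name__ == "__main__":
    points = [[0.0, 0.1], [0.1, 0.0], [0.9, 1.0], [1.0, 0.9]]
    for weights in train_batch_som(points, n_rows=2, n_cols=2, epochs=5):
        print([round(v, 3) for v in weights])
```

In this formulation the best-matching-unit search for each data point is independent of the others within an epoch, and the per-node sums can be accumulated separately and then combined. That structure is what makes this style of training a natural fit for the multicore, cluster, and GPU execution the abstract describes.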

research
09/15/2017

Fast OLAP Query Execution in Main Memory on Large Data in a Cluster

Main memory column-stores have proven to be efficient for processing ana...
research
08/12/2010

Viewpoints: A high-performance high-dimensional exploratory data analysis tool

Scientific data sets continue to increase in both size and complexity. I...
research
11/09/2019

Performance Comparison of MPICH and MPI4py on Raspberry Pi-3B Beowulf Cluster

Moore's Law is running out. Instead of making powerful computer by incre...
research
12/08/2016

Sorting Data on Ultra-Large Scale with RADULS. New Incarnation of Radix Sort

The paper introduces RADULS, a new parallel sorter based on radix sort a...
research
11/06/2017

Enabling rootless Linux Containers in multi-user environments: the udocker tool

Containers are increasingly used as means to distribute and run Linux se...
research
12/13/2013

Transparent Checkpoint-Restart over InfiniBand

InfiniBand is widely used for low-latency, high-throughput cluster compu...
research
01/20/2023

Asynchronously Trained Distributed Topographic Maps

Topographic feature maps are low dimensional representations of data, th...
