a cluster-oriented implementation of self-organizing maps
Project description
Somoclu is a massively parallel implementation of self-organizing maps. It relies on OpenMP for multicore execution, MPI for distributing the workload, and it can be accelerated by CUDA on a GPU cluster. A sparse kernel is also included, which is useful for training maps on vector spaces generated in text mining processes.
Currently a subset of the C++ version is supported with this package.
Homepage: https://github.com/peterwittek/somoclu
Example, in which the data file rgbs.txt can be found at https://github.com/peterwittek/somoclu/tree/master/data
#!/usr/bin/env python2
# -*- coding: utf-8 -*-
import somoclu
import numpy as np
data = np.loadtxt('rgbs.txt')
print(data)
data = np.float32(data)
nSomX = 50
nSomY = 50
nVectors = data.shape[0]
nDimensions = data.shape[1]
data1D = np.ndarray.flatten(data)
nEpoch = 10
radius0 = 0
radiusN = 0
radiusCooling = "linear"
scale0 = 0
scaleN = 0.01
scaleCooling = "linear"
kernelType = 0
mapType = "planar"
snapshots = 0
initialCodebookFilename = ''
codebook_size = nSomY * nSomX * nDimensions
codebook = np.zeros(codebook_size, dtype=np.float32)
globalBmus_size = int(nVectors * int(np.ceil(nVectors/nVectors))*2)
globalBmus = np.zeros(globalBmus_size, dtype=np.intc)
uMatrix_size = nSomX * nSomY
uMatrix = np.zeros(uMatrix_size, dtype=np.float32)
somoclu.trainWrapper(data1D, nEpoch, nSomX, nSomY,
nDimensions, nVectors,
radius0, radiusN,
radiusCooling, scale0, scaleN,
scaleCooling, snapshots,
kernelType, mapType,
initialCodebookFilename,
codebook, globalBmus, uMatrix)
print codebook
print globalBmus
print uMatrix
Get it now
$ sudo pip install somoclu
Build on Mac OS X:
Before installing using pip, gcc should be installed first. As of OS X 10.9, gcc is just symlink to clang. To build somoclu and this extension correctly, it is recommended to install gcc using something like:
$ brew install gcc48
and set environment using:
export CC=/usr/local/bin/gcc export CXX=/usr/local/bin/g++ export CPP=/usr/local/bin/cpp export LD=/usr/local/bin/gcc alias c++=/usr/local/bin/c++ alias g++=/usr/local/bin/g++ alias gcc=/usr/local/bin/gcc alias cpp=/usr/local/bin/cpp alias ld=/usr/local/bin/gcc alias cc=/usr/local/bin/gcc
Then you can
$ sudo pip install somoclu
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file somoclu-1.3.tar.gz.
File metadata
- Download URL: somoclu-1.3.tar.gz
- Upload date:
- Size: 54.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7163ac2fc13c045a292efbe1ef07c76e60fdf9830760d2b729637e97faa82993
|
|
| MD5 |
73f0efeff175cde20266193652392c07
|
|
| BLAKE2b-256 |
d06325d18003fb5ee10519f45b30304b833812a6fd459357210d7561a70e05ae
|