MDP · PyPI

MDP is a Python library for building complex data processing software by combining widely used machine learning algorithms into pipelines and networks.

These details have not been verified by PyPI

Project links

Project description

The Modular toolkit for Data Processing (MDP) package is a library of widely used data processing algorithms that can be combined into pipelines to build more complex data processing software.

MDP has been designed to be used both as-is and as a framework for developing scientific data processing software.

From the user’s perspective, MDP consists of a collection of units that process data. These include, for example, algorithms for supervised and unsupervised learning, principal and independent component analysis, and classification.

These units can be chained into data processing flows, to create pipelines as well as more complex feed-forward network architectures. Given a set of input data, MDP takes care of training and executing all nodes in the network in the correct order and passing intermediate data between the nodes. This allows the user to specify complex algorithms as a series of simpler data processing steps.
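
The training order described above can be illustrated with a minimal sketch in plain Python. It does not use MDP's own classes: `CenterNode`, `ScaleNode`, and `TinyFlow` are hypothetical names chosen for illustration, and the flow simply trains each node on the output of the already-trained nodes before it.

```python
class CenterNode:
    """Hypothetical node: learns the mean and subtracts it."""
    def __init__(self):
        self.mean = None

    def train(self, data):
        self.mean = sum(data) / len(data)

    def execute(self, data):
        return [x - self.mean for x in data]


class ScaleNode:
    """Hypothetical node: learns the largest absolute value and divides by it."""
    def __init__(self):
        self.scale = None

    def train(self, data):
        # Fall back to 1.0 so an all-zero input does not cause a division by zero.
        self.scale = max(abs(x) for x in data) or 1.0

    def execute(self, data):
        return [x / self.scale for x in data]


class TinyFlow:
    """Hypothetical flow: trains and executes nodes in sequence."""
    def __init__(self, nodes):
        self.nodes = list(nodes)

    def train(self, data):
        for node in self.nodes:
            node.train(data)            # train on the output of earlier nodes
            data = node.execute(data)   # pass intermediate data to the next node

    def execute(self, data):
        for node in self.nodes:
            data = node.execute(data)
        return data


flow = TinyFlow([CenterNode(), ScaleNode()])
flow.train([2.0, 4.0, 6.0, 8.0])
result = flow.execute([2.0, 5.0, 8.0])
print(result)
```

The second node never sees the raw input: it is trained on centered data, which is the point of letting the flow manage the order of training and execution.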

The set of available algorithms is steadily growing and includes signal processing methods (Principal Component Analysis, Independent Component Analysis, Slow Feature Analysis), manifold learning methods ([Hessian] Locally Linear Embedding), several classifiers, probabilistic methods (Factor Analysis, Restricted Boltzmann Machines), data pre-processing methods, and many others.

Particular care has been taken to make computations efficient in terms of speed and memory. To reduce the memory footprint, learning can be performed on batches of data. For large data sets, MDP can also be told to use single-precision floating point numbers rather than double-precision ones. Finally, calculations can be parallelised using the parallel subpackage, which offers a parallel implementation of the basic nodes and flows.
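
As a rough illustration of why batch learning keeps the memory footprint small, the sketch below accumulates running sums so that only one batch is held in memory at a time. It is plain Python, not MDP code; `RunningStats` and `batches` are hypothetical names, and the variance computed is the population variance.

```python
class RunningStats:
    """Hypothetical accumulator: mean and variance learned from batches."""
    def __init__(self):
        self.count = 0
        self.total = 0.0
        self.total_sq = 0.0

    def update(self, batch):
        # Only the running sums are kept; the batch itself can be discarded.
        for x in batch:
            self.count += 1
            self.total += x
            self.total_sq += x * x

    def mean(self):
        return self.total / self.count

    def variance(self):
        m = self.mean()
        return self.total_sq / self.count - m * m


def batches():
    """Yield the data in small pieces, as if read from disk one at a time."""
    yield [1.0, 2.0]
    yield [3.0, 4.0]
    yield [5.0]


stats = RunningStats()
for batch in batches():
    stats.update(batch)

print(stats.mean(), stats.variance())
```

The result is the same as if all five values had been processed at once, but the full data set never has to exist in memory.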

From the developer’s perspective, MDP is a framework that makes the implementation of new supervised and unsupervised learning algorithms easy and straightforward. The basic class, Node, takes care of tedious tasks like numerical type and dimensionality checking, leaving the developer free to concentrate on the implementation of the learning and execution phases. Because of the common interface, the node then automatically integrates with the rest of the library and can be used in a network together with other nodes.
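
The division of labour between a base class and its subclasses might look roughly like the following sketch. It is written in plain Python and is not MDP's actual `Node` class: `BaseNode`, `MeanRemovalNode`, and the `_check`, `_train`, and `_execute` hooks are assumptions made for illustration.

```python
class BaseNode:
    """Hypothetical base class: validates input, then delegates to hooks."""
    def __init__(self, input_dim):
        self.input_dim = input_dim

    def _check(self, rows):
        # Dimensionality check shared by every subclass.
        for row in rows:
            if len(row) != self.input_dim:
                raise ValueError(
                    "expected %d values per row, got %d"
                    % (self.input_dim, len(row)))
        # Convert to float so subclasses always see one numeric type.
        return [[float(v) for v in row] for row in rows]

    def train(self, rows):
        self._train(self._check(rows))

    def execute(self, rows):
        return self._execute(self._check(rows))

    def _train(self, rows):
        raise NotImplementedError

    def _execute(self, rows):
        raise NotImplementedError


class MeanRemovalNode(BaseNode):
    """The subclass implements only the learning and execution phases."""
    def _train(self, rows):
        n = len(rows)
        self.mean = [sum(col) / n for col in zip(*rows)]

    def _execute(self, rows):
        return [[v - m for v, m in zip(row, self.mean)] for row in rows]


node = MeanRemovalNode(input_dim=2)
node.train([[1, 10], [3, 30]])
output = node.execute([[2, 20]])

# Input with the wrong number of columns is rejected by the base class.
try:
    node.execute([[1, 2, 3]])
    rejected = False
except ValueError:
    rejected = True
```

Because every node exposes the same `train` and `execute` methods, a new subclass can be dropped into an existing pipeline without changes to the code that drives it.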

A node can have multiple training phases and even an undetermined number of phases. Multiple training phases mean that the training data is presented multiple times to the same node. This allows the implementation of algorithms that need to collect some statistics on the whole input before proceeding with the actual training, and others that need to iterate over a training phase until a convergence criterion is satisfied. It is possible to train each phase using chunks of input data if the chunks are given as an iterable. Moreover, crash recovery can be optionally enabled, which will save the state of the flow in case of a failure for later inspection.
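
A two-phase training scheme of this kind can be sketched in plain Python as follows. The example is not MDP code: `HistogramNode`, `train_phases`, and the `train` helper are hypothetical names. The first phase collects a statistic over the whole input (the data range), and the second phase uses it to count values per bin, so the chunked data has to be presented twice.

```python
class HistogramNode:
    """Hypothetical two-phase node: find the data range, then count bins."""
    def __init__(self, n_bins):
        self.n_bins = n_bins
        self.low = None
        self.high = None
        self.counts = [0] * n_bins

    def _train_range(self, chunk):
        # Phase 1: track the minimum and maximum over all chunks.
        lo, hi = min(chunk), max(chunk)
        self.low = lo if self.low is None else min(self.low, lo)
        self.high = hi if self.high is None else max(self.high, hi)

    def _train_counts(self, chunk):
        # Phase 2: needs the final range, so it cannot run during phase 1.
        width = ((self.high - self.low) / self.n_bins) or 1.0
        for x in chunk:
            index = min(int((x - self.low) / width), self.n_bins - 1)
            self.counts[index] += 1

    def train_phases(self):
        return [self._train_range, self._train_counts]


def train(node, chunks):
    """Present the full chunked data set once per training phase."""
    for phase in node.train_phases():
        for chunk in chunks:  # chunks must be re-iterable, e.g. a list
            phase(chunk)


node = HistogramNode(n_bins=2)
train(node, [[0.0, 1.0], [3.0, 4.0], [2.0]])
print(node.low, node.high, node.counts)
```

Note that a one-shot generator would be exhausted after the first phase, which is why the sketch passes a list of chunks.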

MDP is distributed under the open source BSD license. It has been written in the context of theoretical research in neuroscience, but it has been designed to be helpful in any context where trainable data processing algorithms are used. Its simplicity on the user’s side, the variety of readily available algorithms, and the reusability of the implemented nodes also make it a useful educational tool.

http://mdp-toolkit.sourceforge.net

Project details

Release history

Version	Release date
3.6 (this version)	Apr 24, 2020
3.5	Mar 8, 2016
3.4	Mar 4, 2016
3.3	Oct 4, 2012
3.2	Oct 24, 2011
3.1	Mar 30, 2011
3.0	Jan 17, 2011
2.6	May 14, 2010
2.5	Jun 30, 2009
2.4	Oct 22, 2008
2.3	May 15, 2008
2.1	Mar 23, 2007
2.0RC (pre-release)	Jun 30, 2006
1.1.0	Jun 13, 2005

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

MDP-3.6.tar.gz (412.0 kB)

Uploaded Apr 24, 2020 Source

Built Distribution

MDP-3.6-py2.py3-none-any.whl (454.9 kB)

Uploaded Apr 24, 2020 Python 2, Python 3

File details

Details for the file MDP-3.6.tar.gz.

File metadata

Download URL: MDP-3.6.tar.gz
Upload date: Apr 24, 2020
Size: 412.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.9.1 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.5.2

File hashes

Hashes for MDP-3.6.tar.gz
Algorithm	Hash digest
SHA256	`ac52a652ccbaed1857ff1209862f03bf9b06d093b12606fb410787da3aa65a0e`
MD5	`a88493bd569d9237c7642222058248eb`
BLAKE2b-256	`3b477496bdb9a056f6f9d65220c53a21ba7e8333fe42fe9562259461ad91d5ed`
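
A downloaded file can be checked against the published SHA256 digest with Python's standard `hashlib` module. The following is a minimal sketch: `sha256_of` and `matches_published` are hypothetical helper names, and the path passed to them is assumed to point at a local copy of MDP-3.6.tar.gz.

```python
import hashlib
import os
import tempfile


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# SHA256 digest listed in the table above for MDP-3.6.tar.gz.
EXPECTED = "ac52a652ccbaed1857ff1209862f03bf9b06d093b12606fb410787da3aa65a0e"


def matches_published(path):
    """True if the file at `path` has the published SHA256 digest."""
    return sha256_of(path) == EXPECTED


# Self-check of the helper on a small temporary file.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"abc")
demo_digest = sha256_of(tmp.name)
os.remove(tmp.name)
print(demo_digest)
```

For a real download, `matches_published("MDP-3.6.tar.gz")` should return `True`; a `False` result suggests a corrupted or altered file.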

See more details on using hashes here.

File details

Details for the file MDP-3.6-py2.py3-none-any.whl.

File metadata

Download URL: MDP-3.6-py2.py3-none-any.whl
Upload date: Apr 24, 2020
Size: 454.9 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.9.1 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.5.2

File hashes

Hashes for MDP-3.6-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`5191c7f32e61b7b99a2776333363c4200ab684042cec129364b9658252c8e5e5`
MD5	`cb8341eb1b54b21f2ecd2cdebba85706`
BLAKE2b-256	`abba7b6f47d42e697803b1dd6c1ec8a63864d227769c01cac614c70595ef2c8d`
