Skip to main content

Powerful processment of temporal data.

Project description

https://pypip.in/v/bigtempo/badge.png https://pypip.in/d/bigtempo/badge.png https://travis-ci.org/rhlobo/bigtempo.png?branch=master https://coveralls.io/repos/rhlobo/bigtempo/badge.png http://rhlobo.github.io/bigtempo/bigtempo_small.png

BigTempo is a powerful and scalable programming model, originally crafted for temporal data processment / analysis. It’s production ready and can handle large ammounts of data.

Implementation:

Python 2.7

Download:

http://pypi.python.org/pypi/bigtempo/

Source:

http://github.com/rhlobo/bigtempo/

Keywords:

bigdata, time series, temporal processment, temporal analysis, data processment, data analysis, scalable, distributed, exploration, production ready, python

This is a Python package created to help you build complex hierarchies of processments, each refered as a datasource. It handles dependency resolution, provides a tagging system that enables querying operations over datasource sets, and much more.

There are other software packages that focus on lower level aspects of data processing, like pandas, numpy, sympy, theano. This is not a framework to replace these. Instead, it aims to support many of these tools, helping you to stitch many processments together. It provides a decoupled programming model that was built with scalability support in its heart and it takes care of a lot of the workflow management so that you can focus on the data itself.

It is here to address the plumbing associated with complex chained data evaluation processes, and because each datasource can be used as input for new datasources, it is ideal for data exploration and analysis. Using it, you are able - for instance - to easily spawn multiple variations of processments over sets of other datasources.

The package was originally conceived to handle temporal data, but it is flexible and can easily be extended to support other data models. It is a great tool for distributed processment when you have ‘quadrillion’ processments for interdependent data sets.

Getting started

Coming out soon!

Need more?

If you need more examples, or just feel like checking out how bigtempo can be used in a project, refer to stockExperiments.

Installation

To install, simply:

$ pip install bigtempo

Or, if you absolutely must:

$ easy_install bigtempo

Dependencies

Both the installation methods above should take care of dependencies on its own, automatically.

The pandas library is the only direct dependency the package has in order to be executed. You should visit its page to find out what it depends on. For best results, we recommend installing optional packages as well.

If you want to run the package tests, or enjoy its testing facilities, you’ll need:

In order to run the tests using the command contained in the bin directory, also install:

  • nose >= 1.3.0

  • coverage >= 3.6

  • pep8 >= 1.4.5

Installing from source

To install bigtempo from source you need:

Clone the git repository:

$ git clone https://github.com/rhlobo/bigtempo.git

Get into the project directory:

$ cd bigtempo

Install dependencies (if you are not using virtualenv, it may need super user privileges):

$ pip install -r requirements.txt

Install it:

$ python setup.py install

Alternatively, you can use pip if you want all the dependencies pulled in automatically (the optional -e option is for installing it in development mode):

$ pip install -e .

Next versions?

  • Integration with celery

  • Build in thread / process pools

  • Smart temporal data caching

  • Python 2.7+

Bug tracker

If you have any suggestions, bug reports or annoyances please report them to our issue_tracker.

Contribute

  1. On the tracker, check for open issues or open a new one to start a discussion around an idea or bug.

  2. Fork the repository on GitHub to start making your changes.

  3. Write a test which shows that the bug was fixed or that the feature works as expected.

  4. Send a pull request and wait until it gets merged and published. Make sure to add yourself to AUTHORS.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigtempo-0.37.9.tar.gz (12.5 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page