ferry

Big data development environments using Docker

Project description

Ferry: big data development engine
====================================

Ferry lets you define, run, and deploy big data stacks on your local machine using [Docker](https://www.docker.io).

Ferry currently supports Hadoop/Yarn, GlusterFS/OpenMPI, and Cassandra (with more in the future).
By using Ferry developers can get started creating their big data applications right away without
the pain of installing and configuring all the complex backend software.

Big Data in small places
========================

Big data technologies are designed to operate and scale over many machines and usually consist
of multiple functional parts. Developers interested in creating a
Hadoop application, for example, must first download the appropriate packages, configure these
systems to operate in a single-machine environment (or multiple machines for operational environments),
and configure other required services (e.g., PostGresql).

Fortunately for us, Ferry and Docker vastly simplifies the entire process by capturing the entire process
in a set of lightweight Linux containers. This enables developers to quickly stand up a big data stack and
attach connectors/clients with zero manual configuration. Because Docker is so lightweight, you can even test
multiple big data stacks with minimal overhead.

Getting started
===============

Ferry is a Python application and runs on your local machine. All you have to do to get started is have
`docker` installed and type the following `pip install -U ferry`. Afterwards you can start creating
your big data application. Here's an example stack:

```javascript
{
"backend":[
{
"storage":
{
"personality":"gluster",
"instances":2
},
"compute":[
{
"personality":"mpi",
"instances":2
}]
}],
"connectors":[
{"personality":"mpi-client"}
]
}
```

This stack consists of two GlusterFS data nodes, and two OpenMPI compute nodes. There's also a Linux
client that automatically connect to those backend components. To create this stack, just type
`ferry start openmpi`. Once you create the stack, you can log in by typing `ferry ssh sa-0`.

More detailed installation instructions and examples can be found [here](http://ferry.opencore.io).

Under the hood
==============

Ferry leverages some awesome open source projects:

* [Docker](https://www.docker.io) simplifies the management of Linux containers
* [Python](http://www.python.org) programming language
* [Hadoop](http://hadoop.apache.org) is a general-purpose big data storage and processing framework
* [GlusterFS](http://www.gluster.org) is a parallel filesystem actively developed by Redhat
* [OpenMPI](http://www.open-mpi.org) is a scalable MPI implementation focused on modeling & simulation
* [Cassandra](http://cassandra.apache.org) is a highly scalable column store
* [PostGresql](http://postgresql.org) is a popular relational database

Project details

Release history Release notifications | RSS feed

0.3.3.4

Oct 19, 2014

0.3.3.3

Oct 7, 2014

0.3.3.2

Oct 7, 2014

0.3.3.1

Oct 7, 2014

0.3.3

Oct 4, 2014

0.3.2

Sep 29, 2014

0.3.1.3

Sep 11, 2014

0.3.1.2

Sep 9, 2014

0.3.1.1

Sep 8, 2014

0.3.1

Sep 1, 2014

0.3.0

Aug 30, 2014

0.2.4.1

Aug 16, 2014

0.2.4

Aug 16, 2014

0.2.3.1

Jul 11, 2014

0.2.3

Jul 7, 2014

0.2.2

Jun 11, 2014

0.2.1

Jun 4, 2014

0.2.0

May 16, 2014

0.1.28

Apr 23, 2014

0.1.27

Apr 18, 2014

0.1.26

Apr 6, 2014

0.1.25

Mar 28, 2014

0.1.24

Mar 28, 2014

0.1.23

Mar 24, 2014

0.1.22

Mar 21, 2014

0.1.21

Mar 17, 2014

0.1.20

Mar 14, 2014

0.1.19

Mar 14, 2014

0.1.18

Mar 14, 2014

0.1.17

Mar 14, 2014

0.1.16

Mar 14, 2014

0.1.15

Mar 13, 2014

0.1.14

Mar 13, 2014

0.1.13

Mar 13, 2014

0.1.12

Mar 13, 2014

This version

0.1.11

Mar 13, 2014

0.1.10

Mar 12, 2014

0.1.9

Mar 12, 2014

0.1.8

Mar 11, 2014

0.1.7

Mar 11, 2014

0.1.6

Mar 8, 2014

0.1.5

Mar 8, 2014

0.1.4

Mar 8, 2014

0.1.3

Mar 8, 2014

0.1.2

Mar 8, 2014

0.1.1

Mar 7, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ferry-0.1.11.tar.gz (4.0 MB view details)

Uploaded Mar 13, 2014 Source

File details

Details for the file ferry-0.1.11.tar.gz.

File metadata

Download URL: ferry-0.1.11.tar.gz
Upload date: Mar 13, 2014
Size: 4.0 MB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for ferry-0.1.11.tar.gz
Algorithm	Hash digest
SHA256	`95ea11658ae276def754022dabcad8403be77fe46c473cbad38384751dd64b99`
MD5	`33887621508dd5db6ae29a6455dbe184`
BLAKE2b-256	`2adadb875342130718dfca3a66a234af457f2db38f5e401f1dac74c78461734e`