SAGA to launch a Hadoop cluster as a normal batch job on Torque clusters
# SAGA Hadoop
# Overview:
Use [SAGA](http://saga-project.github.io/saga-python/) to spawn a Hadoop cluster within an HPC batch job.

Currently supported SAGA adaptors:

* Fork
* Torque

By default SAGA-Hadoop deploys a Hadoop 2.2.0 YARN cluster. Configuration setting
# Usage
Try to run a local Hadoop (e.g. for development and testing):

    easy_install saga-hadoop
    saga-hadoop --resource fork://localhost

Try to run a Hadoop cluster inside a PBS/Torque job:
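
As a rough sketch of how such an invocation might be assembled, the Python snippet below builds the `saga-hadoop` command line for a given resource URL and rejects URL schemes that do not match the adaptors listed above. It assumes the Torque case uses the same `--resource` flag as the local case and a `torque://` style URL. The helper `build_command` and the host in the example are hypothetical and not part of SAGA-Hadoop.

```python
import shlex
from urllib.parse import urlparse

# Adaptors this README lists as supported.
SUPPORTED_ADAPTORS = {"fork", "torque"}


def build_command(resource_url):
    """Return the argv list for a saga-hadoop launch (hypothetical helper)."""
    scheme = urlparse(resource_url).scheme
    # A scheme such as "torque+ssh" names the adaptor before the "+".
    adaptor = scheme.split("+")[0]
    if adaptor not in SUPPORTED_ADAPTORS:
        raise ValueError("unsupported adaptor in resource URL: %r" % resource_url)
    return ["saga-hadoop", "--resource", resource_url]


if __name__ == "__main__":
    # Assumed URL form for a Torque resource; replace the host with your cluster.
    print(shlex.join(build_command("torque://localhost")))
```

To actually launch the cluster, the returned list could be passed to `subprocess.run(build_command(url), check=True)` on a machine where `saga-hadoop` is installed.
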
Some Blog Posts about SAGA-Hadoop:
# Packages:
* see hadoop1 for setting up a Hadoop 1.x.x cluster
* see hadoop2 for setting up a Hadoop 2.2.x cluster