Skip to main content

Python SDK and CLI for the Renku platform.

Project description

https://img.shields.io/travis/SwissDataScienceCenter/renku-python.svg https://img.shields.io/coveralls/SwissDataScienceCenter/renku-python.svg https://img.shields.io/github/tag/SwissDataScienceCenter/renku-python.svg https://img.shields.io/pypi/dm/renku.svg Documentation Status https://img.shields.io/github/license/SwissDataScienceCenter/renku-python.svg

A Python library for the Renku collaborative data science platform. It allows the user to create projects, manage datasets, and capture data provenance while performing analysis tasks.

NOTE:

renku-python is the python library for Renku that provides an SDK and a command-line interface (CLI). It does not start the Renku platform itself - for that, refer to the Renku docs on running the platform.

Installation

The latest release is available on PyPI and can be installed using pip:

$ pip install renku

The latest development versions are available on PyPI or from the Git repository:

$ pip install --dev renku
# - OR -
$ pip install -e git+https://github.com/SwissDataScienceCenter/renku-python.git#egg=renku

Use following installation steps based on your operating system and preferences if you would like to work with the command line interface and you do not need the Python library to be importable.

MacOS

The recommended way of installing Renku on MacOS is via Homebrew.

$ brew tap swissdatasciencecenter/renku
$ brew install renku

Pip Script Installer (pipsi)

You can use pipsi to isolate dependencies and to guarantee that there are no version conflicts. Make sure you have the pipsi command correctly installed and ~/.local/bin is in your $PATH.

$ pipsi install renku
$ which renku
~/.local/bin/renku

Docker

The containerized version of the CLI can be launched using Docker command.

$ docker run -it -v "$PWD":"$PWD" -w="$PWD" renku/renku-python renku

It makes sure your current directory is mounted to the same place in the container.

Usage

Initialize a renku project:

$ mkdir -p ~/temp/my-renku-project
$ cd ~/temp/my-renku-project
$ renku init

Create a dataset and add data to it:

$ renku dataset create my-dataset
$ renku dataset add my-dataset https://raw.githubusercontent.com/SwissDataScienceCenter/renku-python/master/README.rst

Run an analysis:

$ renku run wc < data/my-dataset/README.rst > wc_readme

Trace the data provenance:

$ renku log wc_readme

These are the basics, but there is much more that Renku allows you to do with your data analysis workflows. The full documentation will soon be available at: https://renku-python.readthedocs.io/

Changes

v0.3.3

(released 2018-12-07)

  • Fixes generated Homebrew formula.

  • Renames renku pull path to renku storage pull with deprecation warning.

v0.3.2

(released 2018-11-29)

  • Fixes display of workflows in renku log.

v0.3.1

(released 2018-11-29)

  • Fixes issues with parsing remote Git URLs.

v0.3.0

(released 2018-11-26)

  • Adds JSON-LD context to objects extracted from the Git repository (see renku show context --list).

  • Uses PROV-O and WFPROV as provenance vocabularies and generates “stable” object identifiers (@id) for RDF and JSON-LD output formats.

  • Refactors the log output to allow linking files and directories.

  • Adds support for aliasing tools and workflows.

  • Adds option to install shell completion (renku --install-completion).

  • Fixes initialization of Git submodules.

  • Uses relative submodule paths when appropriate.

  • Simplifies external storage configuration.

v0.2.0

(released 2018-09-25)

  • Refactored version using Git and Common Workflow Language.

v0.1.0

(released 2017-09-06)

  • Initial public release as Renga.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

renku-0.4.0a3.tar.gz (155.0 kB view details)

Uploaded Source

Built Distribution

renku-0.4.0a3-py2.py3-none-any.whl (146.8 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file renku-0.4.0a3.tar.gz.

File metadata

  • Download URL: renku-0.4.0a3.tar.gz
  • Upload date:
  • Size: 155.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.20.1 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.3

File hashes

Hashes for renku-0.4.0a3.tar.gz
Algorithm Hash digest
SHA256 d7e1d4b2dbb63958dd5382b8096396a40c649141a6ae5c4964b187fc5ee97eb0
MD5 5a9728874923a584a89b5e8834ab7406
BLAKE2b-256 2d1d6216bb72458e14ece15825f58b249fbd0425fee707eb4e910f13d809ffb0

See more details on using hashes here.

File details

Details for the file renku-0.4.0a3-py2.py3-none-any.whl.

File metadata

  • Download URL: renku-0.4.0a3-py2.py3-none-any.whl
  • Upload date:
  • Size: 146.8 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.20.1 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.3

File hashes

Hashes for renku-0.4.0a3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 861205e03dce6a63a3e69c3e8437b7c64ce5802be7d6838026c47b13931b8884
MD5 903b90aef362f8d0f224bdb9bb313127
BLAKE2b-256 7878b1c702b3aeadb6fcabe69b1af8883f5ffc40c720a8d8f4f3e5970827a33f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page