Skip to main content

DataDriver is a unified set of tools for accelerating the industrialization of Data Science projects.

Project description

# DataDriver DataDriver is a unified set of tools for accelerating the industrialization of Data Science projects.

## Our vision We noticed that the most impactful parameter on the success of a data science project was the ability to go to the production phase. We focused on making a tool that facilitate the transition from exploration to production rather than a tool to help the data scientist to explore data or make good models.

We think that the sooner we can put a model in a production environment, the better our project chances are at being succesful. The true difficulty of a data science project lies not in solving the problem using a machine learning model but in facing the complexity of the production setting.

## What it is It packages together [ddapi](https://github.com/octo-technology/ddapi) and [ddui](https://github.com/octo-technology/ddui).

We made an API (pyddapi) that lets data scientists use the technologies they use in exploration be compatible with the industrialization phase. We rely on Airflow to schedule, execute reliably and monitor jobs. We also developped an Airflow plugin for Data Science (ddui) that, combined with ddapi, allows you to visualize the output of any task with metrics and charts.

## Installation `bash pip install datadriver `

## Usage Go to the root of your project directory and launch DataDriver with a single command. `bash datadriver `

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for datadriver, version 0.1.3
Filename, size File type Python version Upload date Hashes
Filename, size datadriver-0.1.3-py2.py3-none-any.whl (2.4 kB) File type Wheel Python version py2.py3 Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page