Skip to main content

A development environment management tool for data scientists.

Project description

envd

Development environment for data science and AI/ML

continuous integration envd package donwloads discord invitation link trackgit-views all-contributors

โš ๏ธ envd is still under heavy development, and subject to change. it is not feature-complete or production-ready. Please contact us in discord if there is any problem.

envd is a machine learning development environment for data science and AI/ML engineering teams.

๐Ÿ No Docker, only Python - Focus on writing Python code, we will take care of Docker and development environment setup.

๐Ÿ–จ๏ธ Built-in Jupyter/VSCode - First-class support for Jupyter and VSCode remote extension.

โฑ๏ธ Save time - Better cache management to save your time, keep the focus on the model, instead of dependencies.

โ˜๏ธ Local & cloud - envd integrates seamlessly with Docker so that you can easily share, version, and publish envd environments with Docker Hub or any other OCI image registries.

๐Ÿ” Repeatable builds & reproducible results - You can reproduce the same dev environment on your laptop, public cloud VMs, or Docker containers, without any change in setup.

Why use envd?

It is still too difficult to configure development environments and reproduce results in AI/ML applications.

envd is a machine learning development environment for data science and AI/ML engineering teams. Environments built with envd provide the following features out-of-the-box:

๐Ÿ Life is short, use Python[^1]

Development environments are full of Dockerfiles, bash scripts, Kubernetes YAML manifests, and many other clunky files that are always breaking. envd builds are isolated and clean. You can write simple instructions in Python, instead of Bash / Makefile / Dockerfile / ...

envd

[^1]: The build language is starlark, which is a dialect of Python.

โฑ๏ธ Save you plenty of time

envd adopts a multi-level cache mechanism to accelerate the building process. For example, the PyPI cache is shared across builds and thus the package will be cached if it has been downloaded before. It saves plenty of time, especially when you update the environment by trial and error.

envd

Docker[^2]

$ envd build
=> pip install tensorflow       5s
+ => Using cached tensorflow-...-.whl (511.7 MB)
$ docker build
=> pip install tensorflow      278s
- => Downloading tensorflow-...-.whl (511.7 MB)

[^2]: Docker without buildkit

โ˜๏ธ Local & cloud native

envd integrates seamlessly with Docker, you can share, version, and publish envd environments with Docker Hub or any other OCI image registries. The envd environments can be run on Docker or Kubernetes.

๐Ÿ” Repeatable builds & reproducible results

You can reproduce the same dev environment, on your laptop, public cloud VMs, or Docker containers, without any change in setup. You can also collaborate with your colleagues without "let me configure the environment in your machine".

๐Ÿ–จ๏ธ Seamless experience of Jupyter/VSCode

envd provides first-class support for Jupyter and VSCode remote extension. You benefit without sacrificing any developer experience.

Who should use envd?

Weโ€™re focused on helping data scientists and teams that develop AI/ML models. And they may suffer from:

  • building the development environments with Python, CUDA, Docker, SSH, and so on. Do you have a complicated Dockerfile or build script that sets up all your dev environments, but is always breaking?
  • Updating the environment. Do you always need to ask infrastructure engineers how to add a new python package in the Dockerfile?
  • Managing environments and machines. Do you always forget which machines are used for the specific project, because you handle multiple projects concurrently?
Before envd After envd

Documentation

See envd documentation.

Getting Started

Requirements

  • Docker (20.10.0 or above)

Install and bootstrap envd

envd can be installed with pip. After the installation, please run envd bootstrap to bootstrap.

pip install --pre envd
envd bootstrap

You can add --dockerhub-mirror or -m flag when running envd bootstrap, to configure the mirror for docker.io registry:

envd bootstrap --dockerhub-mirror https://docker.mirrors.sjtug.sjtu.edu.cn

Create an envd environment

Please clone the envd-quick-start:

git clone https://github.com/tensorchord/envd-quick-start.git

The build manifest build.envd looks like:

def build():
    base(os="ubuntu20.04", language="python3")
    install.python_packages(name = [
        "numpy",
    ])
    shell("zsh")

Then please run the command below to set up a new environment:

cd envd-quick-start && envd up
$ cd envd-quick-start && envd up
[+] โŒš parse build.envd and download/cache dependencies 2.8s โœ… (finished)     
 => download oh-my-zsh                                                    2.8s 
[+] ๐Ÿ‹ build envd environment 18.3s (25/25) โœ… (finished)                      
 => create apt source dir                                                 0.0s 
 => local://cache-dir                                                     0.1s 
 => => transferring cache-dir: 5.12MB                                     0.1s 
...
 => pip install numpy                                                    13.0s 
 => copy /oh-my-zsh /home/envd/.oh-my-zsh                                 0.1s 
 => mkfile /home/envd/install.sh                                          0.0s 
 => install oh-my-zsh                                                     0.1s 
 => mkfile /home/envd/.zshrc                                              0.0s 
 => install shell                                                         0.0s
 => install PyPI packages                                                 0.0s
 => merging all components into one                                       0.3s
 => => merging                                                            0.3s
 => mkfile /home/envd/.gitconfig                                          0.0s 
 => exporting to oci image format                                         2.4s 
 => => exporting layers                                                   2.0s 
 => => exporting manifest sha256:7dbe9494d2a7a39af16d514b997a5a8f08b637f  0.0s
 => => exporting config sha256:1da06b907d53cf8a7312c138c3221e590dedc2717  0.0s
 => => sending tarball                                                    0.4s
(envd) โžœ  demo git:(master) โœ— # You are in the container-based environment!

Play with the environment

You can run ssh envd-quick-start.envd to reconnect if you exit from the environment. Or you can execute git or python commands inside.

$ python demo.py
[2 3 4]
$ git fetch
$

Set up Jupyter notebook

Please edit the build.envd to enable jupyter notebook:

def build():
    base(os="ubuntu20.04", language="python3")
    install.python_packages(name = [
        "numpy",
    ])
    shell("zsh")
    config.jupyter(password="", port=8888)

You can get the endpoint of the running Jupyter notebook via envd get envs.

$ envd up --detach
$ envd get env
NAME                    JUPYTER                 SSH TARGET              CONTEXT                                 IMAGE                   GPU     CUDA    CUDNN   STATUS          CONTAINER ID 
envd-quick-start        http://localhost:8888   envd-quick-start.envd   /home/gaocegege/code/envd-quick-start   envd-quick-start:dev    false   <none>  <none>  Up 54 seconds   bd3f6a729e94

Contribute

We welcome all kinds of contributions from the open-source community, individuals, and partners.

Contributors โœจ

Thanks goes to these wonderful people (emoji key):


Aaron Sun

๐Ÿ““ ๐Ÿ’ป

Aka.Fido

๐Ÿ“ฆ ๐Ÿ“–

Ce Gao

๐Ÿ’ป ๐Ÿ“– ๐ŸŽจ ๐Ÿ“†

Jian Zeng

๐ŸŽจ ๐Ÿค” ๐Ÿ”ฌ

Jinjing Zhou

๐Ÿ› ๐Ÿ’ป ๐ŸŽจ ๐Ÿ“–

Jun

๐Ÿ“ฆ ๐Ÿ’ป

Keming

๐Ÿ’ป ๐Ÿ“– ๐Ÿค” ๐Ÿš‡

Kevin Su

๐Ÿ’ป

Ling Jin

๐Ÿ› ๐Ÿš‡

Manjusaka

๐Ÿ’ป

Siyuan Wang

๐Ÿ’ป ๐Ÿš‡ ๐Ÿšง

Xu Jin

๐Ÿ’ป

Xuanwo

๐Ÿ’ฌ ๐ŸŽจ ๐Ÿค” ๐Ÿ‘€

Yuan Tang

๐Ÿ’ป ๐ŸŽจ ๐Ÿ“– ๐Ÿค”

Yuchen Cheng

๐Ÿ› ๐Ÿš‡ ๐Ÿšง ๐Ÿ”ง

Zhenzhen Zhao

๐Ÿš‡ ๐Ÿ““ ๐Ÿ’ป

Zhizhen He

๐Ÿ’ป

jimoosciuc

๐Ÿ““

kenwoodjw

๐Ÿ’ป

xing0821

๐Ÿค” ๐Ÿ““

This project follows the all-contributors specification. Contributions of any kind welcome!

License

Apache 2.0

trackgit-views

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

envd-0.1.0.tar.gz (148.4 kB view hashes)

Uploaded Source

Built Distributions

envd-0.1.0-pp39-pypy39_pp73-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded PyPy manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-pp39-pypy39_pp73-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded PyPy macOS 10.9+ x86-64

envd-0.1.0-pp38-pypy38_pp73-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded PyPy manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-pp38-pypy38_pp73-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded PyPy macOS 10.9+ x86-64

envd-0.1.0-pp37-pypy37_pp73-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded PyPy manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-pp37-pypy37_pp73-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded PyPy macOS 10.9+ x86-64

envd-0.1.0-cp310-cp310-musllinux_1_1_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ x86-64

envd-0.1.0-cp310-cp310-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-cp310-cp310-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded CPython 3.10 macOS 10.9+ x86-64

envd-0.1.0-cp39-cp39-musllinux_1_1_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ x86-64

envd-0.1.0-cp39-cp39-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-cp39-cp39-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded CPython 3.9 macOS 10.9+ x86-64

envd-0.1.0-cp38-cp38-musllinux_1_1_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ x86-64

envd-0.1.0-cp38-cp38-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-cp38-cp38-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded CPython 3.8 macOS 10.9+ x86-64

envd-0.1.0-cp37-cp37m-musllinux_1_1_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.7m musllinux: musl 1.1+ x86-64

envd-0.1.0-cp37-cp37m-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-cp37-cp37m-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded CPython 3.7m macOS 10.9+ x86-64

envd-0.1.0-cp36-cp36m-musllinux_1_1_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.6m musllinux: musl 1.1+ x86-64

envd-0.1.0-cp36-cp36m-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (8.3 MB view hashes)

Uploaded CPython 3.6m manylinux: glibc 2.17+ x86-64 manylinux: glibc 2.5+ x86-64

envd-0.1.0-cp36-cp36m-macosx_10_9_x86_64.whl (8.8 MB view hashes)

Uploaded CPython 3.6m macOS 10.9+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page