Dockerfile generator for AGI -- nothing more, nothing less.

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

agi-pack

A Dockerfile builder for AGI — nothing more, nothing less.

agi-pack is simply a weekend project I hacked together, that started with a conversation with ChatGPT / GPT-4. See the inspiration section below for more details on the ChatGPT prompts used.

🚨 Disclaimer: More than 90% of this codebase was generated by GPT-4 and Github Co-Pilot.

Rationale 🤔

Docker has become the standard for building and managing isolated environments for ML. However, any one who has gone down this rabbit-hole knows how broken ML development is, especially when you need to experiment and re-configure your environments constantly. Production is another nightmare -- large docker images (10GB+), bloated docker images with model weights that are ~5-10GB in size, 10+ minute long docker build times, sloppy package management to name just a few.

What makes Dockerfiles painful? If you've ever tried to roll your own Dockerfiles with all the best-practices while fully understanding their internals, you'll still find yourself building, and re-building, and re-building these images across a whole host of use-cases. Having to build Dockerfile(s) for dev, prod, and test all turn out to be a nightmare when you add the complexity of hardware targets (CPUs, GPUs, TPUs etc), drivers, python, virtual environments, build and runtime dependencies.

agi-pack aims to simplify this by allowing developers to define Dockerfiles in a concise YAML format and then generate them based on your environment needs (i.e. python version, system packages, conda/pip dependencies, GPU drivers etc).

For example, you should be able to easily configure your dev environment for local development, and have a separate prod environment where you'll only need the runtime dependencies avoiding any bloat.

agi-pack hopes to also standardize the base images, so that we can really build on top of giants.

Features ✨

Simple Configuration: Define your Docker images using a straightforward YAML format.
Dynamic Generation: Use the power of Jinja2 templating to create Dockerfiles on-the-fly.
Sequential and Multi-stage Builds: Define re-usable and production-ready base images and build dependent images for dev, prod, test.
Extensible: Easily extend and adapt to more complex scenarios.

Goals 🎯

Simplicity: Make it easy to define and build docker images for ML.
Modular, Re-usable, Composable: Ability to define good base, dev and prod images for ML, and re-use them wherever possible.
Best Practices: Support best practices for building docker images for ML -- good base images, multi-stage builds, minimal image sizes, etc.
Ecosystem-driven: Make the YAML / DSL extensible to support the ML ecosystem, as more libraries, drivers, HW vendors, come into the market.
Vendor-agnostic: agi-pack is not intended to be built for any specific vendor (including us/where I work). There was clearly a need for this tool internally, so I decided to build it in the open and keep it simple.

Why the name? 🤷‍♂️

agi-pack is very much intended to be tongue-in-cheek -- we are soon going to be living in a world full of quasi-AGI agents orchestrated via ML containers. At the very least, agi-pack should provide the building blocks for us to build a more modular, re-usable, and distribution-friendly container format for "AGI".

Installation 📦

pip install git+hhttps://github.com/spillai/agi-pack.git

For shell completion, you can install them via:

agi-pack --install-completion <bash|zsh|fish|powershell|pwsh>

Usage 🛠

Create a simple YAML configuration file called agibuild.yaml via agi-pack init:
```
agi-pack init
```

Edit agibuild.yaml to define your custom system and python packages

images:
  base-sklearn:
    image: <repo>/agi:latest-base-sklearn
    base: python:3.8.10-slim
    system:
    - wget
    - build-essential
    python: 3.8.10
    pip:
    - loguru
    - typer
    - scikit-learn

Generate the Dockerfile using agi-pack generate

agi-pack generate -c agibuild.yaml

You should see the following output:

📦 base-sklearn
└── 🎉 Successfully generated Dockerfile (target=base-sklearn, filename=Dockerfile).
    └── `docker build -f Dockerfile --target base-sklearn .`

That's it! You can now build the generated Dockerfile using docker build to build the image directly.

Inspiration and Attribution 🌟

Prompt: I'm building a Dockerfile generator and builder to simplify machine learning infrastructure. I'd like for the Dockerfile to be dynamically generated (using Jinja templates) with the following parametrizations:

```

# Sample YAML file
images:
base-gpu:
    image: autonomi/agi:latest-base-gpu
    base: "nvidia/cuda:11.8.0-base-ubuntu22.04"
    system:
    - "gnupg2"
    - "build-essential"
    - "git"
    python: "3.8.10"
    pip:
    - "torch==2.0.1"

I'd like for this yaml file to generate a Dockerfile via `agi-pack generate -c <name>.yaml`.

You are an expert in Docker and Python programming, how would I implement this builder in Python. Use Jinja2 templating and miniconda python environments wherever possible. I'd like an elegant and concise implementation that I can share on PyPI.
```

TL;DR agi-pack was inspired by a combination of Replicate's cog, Baseten's truss, skaffold, and Docker Compose Services. I wanted a standalone project without any added cruft/dependencies of vendors and services.

Contributing 🤝

Contributions are welcome! Please read the CONTRIBUTING guide for more information.

License 📄

This project is licensed under the MIT License. See the LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.3.0

May 3, 2024

0.2.0

Jan 23, 2024

0.1.19

Oct 25, 2023

0.1.18

Oct 23, 2023

0.1.17

Oct 23, 2023

0.1.16

Oct 23, 2023

0.1.15

Oct 22, 2023

0.1.14

Oct 22, 2023

0.1.13

Oct 21, 2023

0.1.12

Oct 21, 2023

0.1.11

Oct 18, 2023

0.1.10

Oct 18, 2023

0.1.9

Oct 17, 2023

0.1.8

Oct 17, 2023

0.1.7

Oct 10, 2023

0.1.6

Oct 10, 2023

0.1.5

Oct 9, 2023

0.1.4

Oct 9, 2023

0.1.3

Oct 9, 2023

0.1.2

Oct 9, 2023

0.1.1

Oct 9, 2023

0.1.0

Oct 9, 2023

This version

0.0.0

Oct 9, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

agi_pack-0.0.0-py3-none-any.whl (13.5 kB view hashes)

Uploaded Oct 9, 2023 Python 3

Hashes for agi_pack-0.0.0-py3-none-any.whl

Hashes for agi_pack-0.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b3ddfffb62e0365a8d5e66c235c50e7def8fdfb0e85a31a8ea5d3dd30c631443`
MD5	`e586741ac3a0e0bc569a027ee29cfc8f`
BLAKE2b-256	`83c1b3d1bf8cfcf4aa826f59474217cf3c698fff428827d0f81c91b73313d35b`