magemaker

A CLI tool to fine-tuning and deploying open-source models

Project description

Magemaker v0.1, by SlashML

Deploy open source AI models to AWS in minutes.

Table of Contents

About Magemaker
Getting Started
- Prerequisites
- Installation
Using Magemaker
What we're working on next
Known issues
Contributing
License
Contact

About Magemaker

Magemaker is a Python tool that simplifies the process of deploying an open source AI model to your own cloud. Instead of spending hours digging through documentation to figure out how to get AWS working, Magemaker lets you deploy open source AI models directly from the command line.

Choose a model from Hugging Face or SageMaker, and Magemaker will spin up a SageMaker instance with a ready-to-query endpoint in minutes.

Getting Started

Magemaker works with AWS. Azure and GCP support are coming soon!

To get a local copy up and running follow these simple steps.

Prerequisites

Python
An AWS account
Quota for AWS SageMaker instances (by default, you get 2 instances of ml.m5.xlarge for free)
Certain Hugging Face models (e.g. Llama2) require an access token (hf docs)

Configuration

Step 1: Set up AWS and SageMaker

To get started, you’ll need an AWS account which you can create at https://aws.amazon.com/. Then you’ll need to create access keys for SageMaker.

We wrote up the steps in Google Doc as well.

Installing the package

Step 1

pip install magemaker

Step 2: Running magemaker

Run it by simply doing the following:

magemaker

If this is your first time running this command. It will configure the AWS client so you’re ready to start deploying models. You’ll be prompted to enter your Access Key and Secret here. You can also specify your AWS region. The default is us-east-1. You only need to change this if your SageMaker instance quota is in a different region.

Once configured, it will create a .env file and save the credentials there. You can also add your Hugging Face Hub Token to this file if you have one.

HUGGING_FACE_HUB_KEY="KeyValueHere"

(back to top)

Using Magemaker

Deploying models from dropdown

When you run magemaker comamnd it will give you an interactive menu to deploy models. You can choose from a dropdown of models to deploy.

Deploying Hugging Face models

If you're deploying with Hugging Face, copy/paste the full model name from Hugging Face. For example, google-bert/bert-base-uncased. Note that you’ll need larger, more expensive instance types in order to run bigger models. It takes anywhere from 2 minutes (for smaller models) to 10+ minutes (for large models) to spin up the instance with your model.

Deploying Sagemaker models

If you are deploying a Sagemaker model, select a framework and search from a model. If you a deploying a custom model, provide either a valid S3 path or a local path (and the tool will automatically upload it for you). Once deployed, we will generate a YAML file with the deployment and model in the CONFIG_DIR=.magemaker_config folder. You can modify the path to this folder by setting the CONFIG_DIR environment variable.

Deploy using a yaml file

We recommend deploying through a yaml file for reproducability and IAC. From the cli, you can deploy a model without going through all the menus. You can even integrate us with your Github Actions to deploy on PR merge. Deploy via YAML files simply by passing the --deploy option with local path like so:

magemaker --deploy .magemaker_config/bert-base-uncased.yaml

Following is a sample yaml file for deploying a model the same google bert model mentioned above:

deployment: !Deployment
  destination: aws
  # Endpoint name matches model_id for querying atm.
  endpoint_name: test-bert-uncased
  instance_count: 1
  instance_type: ml.m5.xlarge

models:
- !Model
  id: google-bert/bert-base-uncased
  source: huggingface

Following is a yaml file for deploying a llama model from HF:

deployment: !Deployment
  destination: aws
  endpoint_name: test-llama2-7b
  instance_count: 1
  instance_type: ml.g5.12xlarge
  num_gpus: 4
  # quantization: bitsandbytes

models:
- !Model
  id: meta-llama/Meta-Llama-3-8B-Instruct
  source: huggingface
  predict:
    temperature: 0.9
    top_p: 0.9
    top_k: 20
    max_new_tokens: 250

Fine-tuning a model using a yaml file

You can also fine-tune a model using a yaml file, by using the train option in the command and passing path to the yaml file

magemaker --train .magemaker_config/train-bert.yaml

Here is an example yaml file for fine-tuning a hugging-face model:

training: !Training
  destination: aws
  instance_type: ml.p3.2xlarge
  instance_count: 1
  training_input_path: s3://jumpstart-cache-prod-us-east-1/training-datasets/tc/data.csv
  hyperparameters: !Hyperparameters
    epochs: 1
    per_device_train_batch_size: 32
    learning_rate: 0.01

models:
- !Model
  id: meta-textgeneration-llama-3-8b-instruct
  source: huggingface

If you’re using the ml.m5.xlarge instance type, here are some small Hugging Face models that work great:

Model: google-bert/bert-base-uncased

Type: Fill Mask: tries to complete your sentence like Madlibs
Query format: text string with [MASK] somewhere in it that you wish for the transformer to fill

Model: sentence-transformers/all-MiniLM-L6-v2

Type: Feature extraction: turns text into a 384d vector embedding for semantic search / clustering
Query format: "type out a sentence like this one."

Deactivating models

Any model endpoints you spin up will run continuously unless you deactivate them! Make sure to delete endpoints you’re no longer using so you don’t keep getting charged for your SageMaker instance.

(back to top)

What we're working on next

More robust error handling for various edge cases
Verbose logging
Enabling / disabling autoscaling
Deployment to Azure and GCP

(back to top)

Known issues

Querying within Magemaker currently only works with text-based model - doesn’t work with multimodal, image generation, etc.
Deleting a model is not instant, it may show up briefly after it was queued for deletion
Deploying the same model within the same minute will break

(back to top)

License

Distributed under the Apache 2.0 License. See LICENSE for more information.

Contact

You can reach us, faizan & jneid, at support@slashml.com.

We’d love to hear from you! We’re excited to learn how we can make this more valuable for the community and welcome any and all feedback and suggestions.

Project details

Release history Release notifications | RSS feed

0.1.15

Jan 9, 2025

0.1.14

Jan 9, 2025

0.1.13

Jan 9, 2025

0.1.12

Jan 8, 2025

0.1.11

Jan 8, 2025

0.1.10

Jan 8, 2025

0.1.9

Jan 8, 2025

0.1.8

Jan 8, 2025

0.1.7

Jan 8, 2025

0.1.6

Dec 31, 2024

0.1.5

Dec 31, 2024

This version

0.1.3

Dec 26, 2024

0.1.2

Dec 25, 2024

0.1.1

Nov 3, 2024

0.1.0

Nov 3, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

magemaker-0.1.3.tar.gz (26.7 kB view details)

Uploaded Dec 26, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

magemaker-0.1.3-py3-none-any.whl (31.4 kB view details)

Uploaded Dec 26, 2024 Python 3

File details

Details for the file magemaker-0.1.3.tar.gz.

File metadata

Download URL: magemaker-0.1.3.tar.gz
Upload date: Dec 26, 2024
Size: 26.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.11.6

File hashes

Hashes for magemaker-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`bb7ff9762f7ebf1da7898321502478fdb767d270400f3ede82540207b8c44eef`
MD5	`ac56785a9889604d7048df0cdaf23ea2`
BLAKE2b-256	`578f17db0159bc50867de414fad41abb47cc0cb4cb65404ff986ed133dd4a81b`

See more details on using hashes here.

File details

Details for the file magemaker-0.1.3-py3-none-any.whl.

File metadata

Download URL: magemaker-0.1.3-py3-none-any.whl
Upload date: Dec 26, 2024
Size: 31.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.11.6

File hashes

Hashes for magemaker-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c9d17e6506496b497c96d3dfcce49ace17bb974feb138b5aadc7784f22e15d0c`
MD5	`017bc1906b43cab22772195d8b997753`
BLAKE2b-256	`6032b58bcc2fea5275d9600908acf0eaee76f31b4986ff996f611732ef4fb897`

See more details on using hashes here.

magemaker 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Magemaker v0.1, by SlashML

About Magemaker

Getting Started

Prerequisites

Configuration

Installing the package

Using Magemaker

Deploying models from dropdown

Deploying Hugging Face models

Deploying Sagemaker models

Deploy using a yaml file

Fine-tuning a model using a yaml file

Deactivating models

What we're working on next

Known issues

License

Contact

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes