Deploy Model Anywhere on AWS
Project description
Easy Model Deployer - Simple, Efficient, and Easy-to-Integrate
Latest News 🔥
- [2025/03] We officially released EMD!
About
EMD (Easy Model Deployer) is a lightweight tool designed to simplify model deployment. Built for developers who need reliable and scalable model serving without complex setup.
Key Features
- One-click deployment of models to the cloud (Amazon SageMaker, Amazon ECS, Amazon EC2)
- Diverse model types (LLMs, VLMs, Embeddings, Vision, etc.)
- Rich inference engine (vLLM, TGI, Lmdeploy, etc.)
- Different instance types (CPU/GPU/AWS Inferentia)
- Convenient integration (OpenAI Compatible API, LangChain client, etc.)
Notes
- OpenAI Compatible API is supported only for Amazon ECS and Amazon EC2 deployment.
Table of Contents
Architecture
Deploy models to the cloud with EMD will use the following components in Amazon Web Services:
Getting Started
Installation
Install EMD with pip, currently only support for Python 3.9 and above:
pip install https://github.com/aws-samples/easy-model-deployer/releases/download/main/emd-0.6.0-py3-none-any.whl
Visit our documentation to learn more.
Usage
Choose your default aws profile.
emd config set-default-profile-name
Notes: If you don't set aws profile, it will use the default profile in your env (suitable for Temporary Credentials). Whenever you want to switch deployment accounts, run emd config set-default-profile-name
Bootstrap emd stack
emd bootstrap
Notes: This is going to set up the necessary resources for model deployment. Whenever you change EMD version, run this command again.
Quickly see what models are supported by emd list-supported-models. This command will output all information related to deployment. The following command is recommended to just check the model type. (Plese check Supported Models for complete information.)
emd list-supported-models | jq -r '.[] | "\(.model_id)\t\(.model_type)"' | column -t -s $'\t' | sort
Choose deployment parameters interactively by emd deploy or deploy with one command
emd deploy --model-id DeepSeek-R1-Distill-Qwen-1.5B --instance-type g5.8xlarge --engine-type vllm --framework-type fastapi --service-type sagemaker --extra-params {} --skip-confirm
Notes: Get complete parameters by emd deploy --help and find the values of the required parameters here
When you see "Waiting for model: ...", it means the deployment task has started, you can quit the current task by ctrl+c.
Check deployment status.
emd status
Notes: EMD allows to launch multiple deployment tasks at the same time.
Quick functional verfication or check our documentation for integration examples.
emd invoke DeepSeek-R1-Distill-Qwen-1.5B
Notes: Find ModelId in the output of emd status.
Delete the deployed model
emd destroy DeepSeek-R1-Distill-Qwen-1.5B
Notes: Find ModelId in the output of emd status.
Documentation
For advanced configurations and detailed guides, visit our documentation site.
Contributing
We welcome contributions! Please see CONTRIBUTING.md for guidelines.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file aws_emd-0.7.0.tar.gz.
File metadata
- Download URL: aws_emd-0.7.0.tar.gz
- Upload date:
- Size: 8.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.9.11 Darwin/23.6.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
54e19d8b9c11edec9195558ffb8b0584ef777ab647167cc2b07106dd4e3a4c67
|
|
| MD5 |
37cb7891e31fc7c7a723d0bb4c3a91e0
|
|
| BLAKE2b-256 |
794eb2c345c4033497285e720d4de9b0a4f490801a8fef4d07a085b7aaa5e6ac
|
File details
Details for the file aws_emd-0.7.0-py3-none-any.whl.
File metadata
- Download URL: aws_emd-0.7.0-py3-none-any.whl
- Upload date:
- Size: 9.0 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.9.11 Darwin/23.6.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9c538c855fb35176adc2201c6ed32ab2936b167b19d336752c31b50be295d1fa
|
|
| MD5 |
ed0a5f51517b8fc5f30fd7e097d9279a
|
|
| BLAKE2b-256 |
d1edd7ece92c479ac8d7511759d17957830762fa60a535042de943800e0cd678
|