Skip to main content

Easy Model Deployer

Project description

Easy Model Deployer: Easy Foundation Model Hosting on AWS

English | 简体中文

Documentation · Changelog

MIT License PyPI Downloads Build Status

header

Introduction

Easy Model Deployer is a lightweight tool designed for simplify deploy Open-Source LLMs (Supported Models) and Custom Models on AWS. It provides OpenAI's Completions API and LangChain Interface. Built for developers who need reliable and scalable model serving without complex environment setup.

deploy

Key Features

  • One-click deployment of models to AWS (Amazon SageMaker, Amazon ECS, Amazon EC2)
  • Diverse model types (LLMs, VLMs, Embeddings, Vision, etc.)
  • Rich inference engine (vLLM, TGI, Lmdeploy, etc.)
  • Different instance types (CPU/GPU/AWS Inferentia)
  • Convenient integration (OpenAI Compatible API, LangChain client, etc.)

Supported Models

Easy Model Deployer supports a wide range of models including:

  • LLMs: Qwen, Llama, DeepSeek, GLM, InternLM, Baichuan, and more
  • Vision-Language Models: Qwen-VL, InternVL, Gemma3-Vision, and more
  • Embedding Models: BGE, Jina, BERT-based models
  • Reranking Models: BGE-Reranker, Jina-Reranker
  • ASR Models: Whisper variants
  • Custom Models: Support for custom Docker images

For the complete list of supported models and deployment configurations, see Supported Models.

🔧 Get Started

Installation

Install Easy Model Deployer with PyPI, currently support for Python 3.9 or above:

pip install easy-model-deployer

emd

Bootstrap

Prepare the essential resources required for model deployment.

For more information, please refer to Architecture.

emd bootstrap

💡 Tip Once you upgrade the EMD by pip, you need to run this command again to update the environment.

Deploy Models

Deploy models with an interactive CLI or one command line.

emd deploy

💡 Tip To view all available parameters, run emd deploy --help. When you see the message "Waiting for model: ...", it means the deployment task has started and you can stop the terminal output by pressing Ctrl+C.

Show Status

Check the status of the model deployment task.

emd status

💡 Tip The EMD allows launch multiple deployment tasks simultaneously.

Invocation

Invoke the deployed model for testing by CLI.

emd invoke <ModelId>

💡 Tip You can find the ModelId in the output by emd status. For example: emd invoke DeepSeek-R1-Distill-Qwen-1.5B

💡 Tip OpenAI Compatible API is supported only for Amazon ECS and Amazon EC2 deployment types.

List Supported Models

Quickly see what models are supported, this command will output all information related to deployment. (Please browse Supported Models for more information.)

emd list-supported-models

Delete Model

Delete the deployed model.

emd destroy <ModelId>

💡 Tip You can find the ModelId in the output by emd status. For example: emd destroy DeepSeek-R1-Distill-Qwen-1.5B

📖 Documentation

For advanced configurations and detailed guides, visit our documentation site.

🤝 Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

easy_model_deployer-0.9.47.tar.gz (229.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

easy_model_deployer-0.9.47-py3-none-any.whl (314.8 kB view details)

Uploaded Python 3

File details

Details for the file easy_model_deployer-0.9.47.tar.gz.

File metadata

  • Download URL: easy_model_deployer-0.9.47.tar.gz
  • Upload date:
  • Size: 229.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.14.0 Linux/6.11.0-1018-azure

File hashes

Hashes for easy_model_deployer-0.9.47.tar.gz
Algorithm Hash digest
SHA256 62cee12bf41a2089e8712f954760b2e3536829710aa3300773864b1d99a38458
MD5 3486198cd059eb0e5cc0581837194b6e
BLAKE2b-256 3b534e940dfc493ad12a1857d33091ec87033c8c1f6f585828808d5713d54cbe

See more details on using hashes here.

File details

Details for the file easy_model_deployer-0.9.47-py3-none-any.whl.

File metadata

  • Download URL: easy_model_deployer-0.9.47-py3-none-any.whl
  • Upload date:
  • Size: 314.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.14.0 Linux/6.11.0-1018-azure

File hashes

Hashes for easy_model_deployer-0.9.47-py3-none-any.whl
Algorithm Hash digest
SHA256 48dded1ec75df4640e9672ceca007ac2b209e20470578a167fd16ae355bc87fd
MD5 9b549f5bf0129337b7c42e079addb38e
BLAKE2b-256 cdc4ae9ac1993d24ed57937fbaa4df547d580ccd6c814db53b08b3e3609a2bd2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page