fastdeploy

Deploy DL/ ML inference pipelines with minimal extra code.

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

bedapudi6788

These details have not been verified by PyPI

Project description

fastDeploy

easy and performant micro-services for Python Deep Learning inference pipelines

Deploy any python inference pipeline with minimal extra code
Auto batching of concurrent inputs is enabled out of the box
no changes to inference code (unlike tf-serving etc), entire pipeline is run as is
Promethues metrics (open metrics) are exposed for monitoring
Auto generates clean dockerfiles and kubernetes health check, scaling friendly APIs
sequentially chained inference pipelines are supported out of the box
can be queried from any language via easy to use rest apis
easy to understand (simple consumer producer arch) and simple code base

Installation:

pip install --upgrade fastdeploy fdclient
# fdclient is optional, only needed if you want to use python client

CLI explained

Start fastDeploy server on a recipe:

# Invoke fastdeploy 
python -m fastdeploy --help
# or
fastdeploy --help

# Start prediction "loop" for recipe "echo"
fastdeploy --loop --recipe recipes/echo

# Start rest apis for recipe "echo"
fastdeploy --rest --recipe recipes/echo

Send a request and get predictions:

auto generate dockerfile and build docker image:

# Write the dockerfile for recipe "echo"
# and builds the docker image if docker is installed
# base defaults to python:3.8-slim
fastdeploy --build --recipe recipes/echo

# Run docker image
docker run -it -p8080:8080 fastdeploy_echo

Serving your model (recipe):

Writing your model/pipeline's recipe

Where to use fastDeploy?

to deploy any non ultra light weight models i.e: most DL models, >50ms inference time per example
if the model/pipeline benefits from batch inference, fastDeploy is perfect for your use-case
if you are going to have individual inputs (example, user's search input which needs to be vectorized or image to be classified)
in the case of individual inputs, requests coming in at close intervals will be batched together and sent to the model as a batch
perfect for creating internal micro services separating your model, pre and post processing from business logic
since prediction loop and inference endpoints are separated and are connected via sqlite backed queue, can be scaled independently

Where not to use fastDeploy?

non cpu/gpu heavy models that are better of running parallely rather than in batch
if your predictor calls some external API or uploads to s3 etc in a blocking way
io heavy non batching use cases (eg: query ES or db for each input)
for these cases better to directly do from rest api code (instead of consumer producer mechanism) so that high concurrency can be achieved

Project details

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

bedapudi6788

These details have not been verified by PyPI

Release history Release notifications | RSS feed

3.1.1

Nov 10, 2024

3.1.0

Nov 8, 2024

3.0.32

Nov 7, 2024

3.0.31

Nov 7, 2024

3.0.30

Nov 7, 2024

3.0.28

Nov 4, 2024

3.0.27

Nov 1, 2024

3.0.26

Oct 30, 2024

This version

3.0.25

Oct 30, 2024

3.0.24

Oct 30, 2024

3.0.23

Oct 30, 2024

3.0.21

Oct 29, 2024

3.0.20

Oct 29, 2024

3.0.19

Oct 28, 2024

3.0.18

Oct 24, 2024

3.0.17

Oct 3, 2024

3.0.16

Oct 3, 2024

3.0.15

Sep 30, 2024

3.0.12

Apr 16, 2024

3.0.11

Mar 19, 2024

3.0.10

Mar 14, 2024

3.0.9

Mar 12, 2024

3.0.8

Mar 12, 2024

3.0.4

Dec 12, 2023

3.0.3

Dec 12, 2023

3.0.2

Dec 11, 2023

3.0.1

Dec 11, 2023

3.0.0rc5 pre-release

Nov 30, 2023

3.0.0rc4 pre-release

Nov 30, 2023

3.0.0rc3 pre-release

Nov 30, 2023

3.0.0rc2 pre-release

Nov 29, 2023

3.0.0rc1 pre-release

Nov 29, 2023

2.2.16

Jul 11, 2023

2.2.15

Jun 20, 2023

2.2.12

Apr 20, 2023

2.2.11

Apr 20, 2023

2.2.10

Apr 20, 2023

2.2.8

Apr 20, 2023

2.2.7rc4 pre-release

Mar 21, 2023

2.2.7rc3 pre-release

Dec 1, 2022

2.2.7rc2 pre-release

Nov 22, 2022

2.2.6

Nov 10, 2022

2.2.5

Nov 1, 2022

2.2.4

Oct 19, 2022

2.2.3

Oct 11, 2022

2.2.2

Sep 28, 2022

2.2.1

Jul 19, 2022

2.2

Jul 13, 2022

2.1

Jul 6, 2022

2.0

Apr 30, 2022

2.0rc4 pre-release

Apr 30, 2022

2.0rc3 pre-release

Apr 30, 2022

2.0rc2 pre-release

Apr 28, 2022

2.0rc1 pre-release

Apr 26, 2022

1.0rc44 pre-release

Mar 17, 2022

1.0rc43 pre-release

Mar 10, 2022

1.0rc42 pre-release

Mar 9, 2022

1.0rc41 pre-release

Mar 9, 2022

1.0rc40 pre-release

Mar 9, 2022

1.0rc39 pre-release

Mar 9, 2022

1.0rc38 pre-release

Mar 9, 2022

1.0rc37 pre-release

Mar 9, 2022

1.0rc36 pre-release

Mar 8, 2022

1.0rc35 pre-release

Mar 4, 2022

1.0rc34 pre-release

Jan 24, 2022

1.0rc33 pre-release

Jan 23, 2022

1.0rc32 pre-release

Jan 22, 2022

1.0rc30 pre-release

Jan 13, 2022

1.0rc29 pre-release

Dec 8, 2021

1.0rc28 pre-release

Nov 26, 2021

1.0rc27 pre-release

Nov 25, 2021

1.0rc26 pre-release

Nov 24, 2021

1.0rc25 pre-release

Nov 17, 2021

1.0rc24 pre-release

Nov 17, 2021

1.0rc23 pre-release

Nov 17, 2021

1.0rc22 pre-release

Oct 11, 2021

1.0rc20 pre-release

Oct 7, 2021

1.0rc19 pre-release

Oct 7, 2021

1.0rc16 pre-release

Sep 28, 2021

1.0rc9 pre-release

Sep 27, 2021

1.0rc6 pre-release

Sep 27, 2021

1.0rc5 pre-release

Sep 26, 2021

1.0rc4 pre-release

Sep 26, 2021

1.0rc3 pre-release

Sep 26, 2021

1.0rc2 pre-release

Sep 26, 2021

1.0rc1 pre-release

Sep 26, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastdeploy-3.0.25.tar.gz (16.9 kB view details)

Uploaded Oct 30, 2024 Source

Built Distribution

fastdeploy-3.0.25-py3-none-any.whl (16.9 kB view details)

Uploaded Oct 30, 2024 Python 3

File details

Details for the file fastdeploy-3.0.25.tar.gz.

File metadata

Download URL: fastdeploy-3.0.25.tar.gz
Upload date: Oct 30, 2024
Size: 16.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for fastdeploy-3.0.25.tar.gz
Algorithm	Hash digest
SHA256	`6d8f1287e4278e102bb8e6046e4a8e9e384d98543eaf8340df5b5ce7425d6659`
MD5	`d6db2ed657d76f808313df6984d195a1`
BLAKE2b-256	`8ac72d5ed2fa80e974a3c36c980f3f7ea847eb8f7fda63087d6666e1dae8619c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fastdeploy-3.0.25.tar.gz:

Publisher: main.yml on notAI-tech/fastDeploy

Attestations:

Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fastdeploy-3.0.25.tar.gz
- Subject digest: 6d8f1287e4278e102bb8e6046e4a8e9e384d98543eaf8340df5b5ce7425d6659
- Sigstore transparency entry: 145114353
- Sigstore integration time: Oct 30, 2024

File details

Details for the file fastdeploy-3.0.25-py3-none-any.whl.

File metadata

Download URL: fastdeploy-3.0.25-py3-none-any.whl
Upload date: Oct 30, 2024
Size: 16.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for fastdeploy-3.0.25-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2512986691b907689dc1de75a01c2189de4f6121c21d21539680b32762c8cca7`
MD5	`a8db40864e8a89efefe292c8a1e5cd7e`
BLAKE2b-256	`98df58c3565732ebeda7b0d882867f48fda5d3f44df04d0db4a53400e75c36b6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fastdeploy-3.0.25-py3-none-any.whl:

Publisher: main.yml on notAI-tech/fastDeploy

Attestations:

Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fastdeploy-3.0.25-py3-none-any.whl
- Subject digest: 2512986691b907689dc1de75a01c2189de4f6121c21d21539680b32762c8cca7
- Sigstore transparency entry: 145114355
- Sigstore integration time: Oct 30, 2024