
A package with helper tools to build an API Inference docker app for Hugging Face API inference using huggingface_hub

Project description

This repository enables third-party libraries integrated with huggingface_hub to create their own docker images, so that the widgets on the hub can work as the transformers ones do.

The hardware to run the API will be provided by Hugging Face for now.

The docker_images/common folder is intended to be a starting point for all new libraries that want to be integrated.

Adding a new container for a new library

  1. Copy the docker_images/common folder to a new folder named after your library, e.g. docker_images/example.

  2. Edit:

    • docker_images/example/requirements.txt
    • docker_images/example/app/main.py
    • docker_images/example/app/pipelines/{task_name}.py

    to implement the desired functionality. All required code is marked with IMPLEMENT_THIS markup.

  3. Remove:

    • Any pipeline files in docker_images/example/app/pipelines/ that are not used.
    • Any tests associated with deleted pipelines in docker_images/example/tests.
    • Any imports of the pipelines you deleted from docker_images/example/app/pipelines/__init__.py
  4. Feel free to customize anything your library requires, anywhere you want. The only real requirement is to honor the HTTP endpoints, in the same fashion as the common folder, for all your supported tasks.

  5. Edit docker_images/example/tests/test_api.py to fill in TESTABLE_MODELS.

  6. Pass the test suite: pytest -sv --rootdir docker_images/example/ docker_images/example/

  7. Submit your PR and enjoy!
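To make step 2 concrete, a task pipeline in docker_images/example/app/pipelines/{task_name}.py is, roughly, a class whose __call__ maps the task input to the task output. The sketch below is a minimal, hypothetical text-classification pipeline; the exact base class, loading logic, and return schema come from the common folder and the task you implement, so treat every name here as illustrative:

```python
from typing import Dict, List


class TextClassificationPipeline:
    """Hypothetical pipeline sketch, modeled on the common folder layout.

    In a real integration you would subclass the Pipeline base class shipped
    in app/pipelines/ and load your library's model from the given model_id.
    """

    def __init__(self, model_id: str):
        # IMPLEMENT_THIS: load your model from the Hub here.
        self.model_id = model_id

    def __call__(self, inputs: str) -> List[Dict[str, float]]:
        # IMPLEMENT_THIS: run inference on `inputs`. For this task the
        # widgets expect a list of {"label": ..., "score": ...} dicts.
        return [{"label": "POSITIVE", "score": 0.99}]
```

app/main.py is responsible for instantiating the right pipeline class for the requested task, which is why the per-task files only need to expose this call contract.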

Going the full way

Doing the first 7 steps is good enough to get started; however, the steps below help you anticipate and correct problems early on. Maintainers will help you along the way if you don't feel confident following those steps yourself.

  1. Test your creation within a docker:

./manage.py docker MY_MODEL

should work and respond on port 8000. For instance, curl -X POST -d "test" http://localhost:8000 if the pipeline deals with simple text.

If it doesn't work out of the box and/or docker is slow for some reason, you can test locally (using your local python environment) with:

./manage.py start MY_MODEL

  2. Test that your docker uses the cache properly.

When launching the docker again with the same model_id, it should start up very fast and not redownload the whole model. If you see the model/repo being downloaded over and over, the cache is not being used correctly. You can edit docker_images/{framework}/Dockerfile and add an environment variable (by default it assumes HUGGINGFACE_HUB_CACHE), or change your code directly to put the model files in the /data folder.
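For example, a single Dockerfile line can point the Hub cache at the persisted /data volume (the variable name comes from the paragraph above; pointing it straight at /data is an assumption, your image may use a subdirectory):

```dockerfile
# Hypothetical: store downloaded models on the mounted /data volume
# so subsequent launches with the same model_id reuse them.
ENV HUGGINGFACE_HUB_CACHE=/data
```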

  3. Add a docker test.

Edit the tests/test_dockers.py file to add a new test for your framework (def test_{framework}(self): for instance). As a baseline, you should have one line per task in this test function, each with a real working model on the hub. These tests are relatively slow but will automatically check that your API returns the correct errors and that the cache works properly. To run those tests you can simply do:

RUN_DOCKER_TESTS=1 pytest -sv tests/test_dockers.py::DockerImageTests::test_{framework}

Modifying files within api-inference-community/{routes,validation,..}.py.

If you ever come across a bug within the api-inference-community package, or want to update it, the development process is slightly more involved.

  • First, make sure you actually need to change this package; each framework is very autonomous, so if your code can get away with being standalone, go that way first, as it's much simpler.
  • If you can make the change within api-inference-community alone, without depending on your framework changes, that's also a great option. Make sure to add the proper tests to your PR.
  • Finally, the best way to go is to develop locally using the manage.py command:
  • Do the necessary modifications within api-inference-community first.
  • Install it locally in your environment with pip install -e .
  • Install your package dependencies locally.
  • Run your webserver locally: ./manage.py start --framework example --task audio-source-separation --model-id MY_MODEL
  • When everything is working, you will need to split your PR in two: one for the api-inference-community part, and a second one for your package-specific modifications, which will only land once the api-inference-community tag has landed.
  • This workflow is still work in progress, don't hesitate to ask questions to maintainers.

Another similar command, ./manage.py docker --framework example --task audio-source-separation --model-id MY_MODEL, will launch the server, but this time in a protected, controlled docker environment, making sure the behavior will be exactly the one seen in the API.

Available tasks

  • Automatic speech recognition: Input is a file, output is a dict of the words understood within the file
  • Text generation: Input is a text, output is a dict of generated text
  • Image recognition: Input is an image, output is a dict of generated text
  • Question answering: Input is a question + some context, output is a dict containing the necessary information to locate the answer to the question within the context.
  • Audio source separation: Input is some audio, and the output is n audio files that sum up to the original audio but contain the individual sources of sound (either speakers or instruments, for instance).
  • Token classification: Input is some text, and the output is a list of entities mentioned in the text. Entities can be anything remarkable like locations, organisations, persons, times, etc.
  • Text to speech: Input is some text, and the output is an audio file saying the text.
  • Sentence Similarity: Input is a sentence and a list of reference sentences, and the output is a list of similarity scores.
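To make the input/output shapes concrete, here is a hedged sketch of what a question-answering exchange might look like. The field names ("inputs", "question", "context", "answer", "start", "end") are assumptions inferred from the task description above, not a normative schema:

```python
# Hypothetical question-answering payload and response, illustrating the
# "question + context in, dict locating the answer out" contract above.
payload = {
    "inputs": {
        "question": "Where is Hugging Face based?",
        "context": "Hugging Face is based in New York and Paris.",
    }
}

# A plausible response: the answer text plus where it sits in the context.
response = {"answer": "New York and Paris", "start": 25, "end": 43}

# The start/end offsets let a widget highlight the answer in the context.
located = payload["inputs"]["context"][response["start"]:response["end"]]
```

The point of returning offsets rather than only the answer string is that the hub widgets can highlight the answer span inside the original context.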

Built Distribution

api_inference_community-0.0.32-py3-none-any.whl (14.6 kB), uploaded for Python 3.
