Skip to main content

A client library for accessing ♾️ Infinity - Embedding Inference Server

Project description

infinity_client

A client library for accessing ♾️ Infinity - Embedding Inference Server

Installation

pip install infinity_client

Usage

First, create a client:

from infinity_client import Client
i_client = Client(base_url="https://infinity.modal.michaelfeil.eu")

If the endpoints you're going to hit require authentication, use AuthenticatedClient instead:

from infinity_client import AuthenticatedClient

i_client = AuthenticatedClient(base_url="https://infinity.modal.michaelfeil.eu", token="SuperSecretToken")

Now call your endpoint and use your models:

from infinity_client.models import OpenAIModelInfo
from infinity_client.api.default import models, health
from infinity_client.types import Response

with i_client as client:
    model_info: OpenAIModelInfo = models.sync(client=client)
    # or if you need more info (e.g. status_code)
    response: Response[OpenAIModelInfo] = models.sync_detailed(client=client)

Or use the POST methods:

from infinity_client.models import OpenAIEmbeddingInput, OpenAIEmbeddingResult
from infinity_client.api.default import classify, embeddings, embeddings_image, rerank
from infinity_client.types import Response

with i_client as client:
    embeds: OpenAIEmbeddingResult = embeddings.sync(client=client, body=OpenAIEmbeddingInput.from_dict({
        "input": ["Hello, this is a sentence-to-embed", "Hello, my cat is cute"],
        "model": "michaelfeil/bge-small-en-v1.5",
    }))

Or do the same thing with an async version:

from infinity_client.models import OpenAIModelInfo
from infinity_client.api.default import models
from infinity_client.types import Response

async with i_client as client:
    model_info: OpenAIModelInfo = await models.asyncio(client=client)
    response: Response[OpenAIModelInfo] = await models.asyncio_detailed(client=client)

By default, when you're calling an HTTPS API it will attempt to verify that SSL is working correctly. Using certificate verification is highly recommended most of the time, but sometimes you may need to authenticate to a server (especially an internal server) using a custom certificate bundle.

i_client = AuthenticatedClient(
    base_url="https://internal_api.example.com",
    token="SuperSecretToken",
    verify_ssl="/path/to/certificate_bundle.pem",
)

You can also disable certificate validation altogether, but beware that this is a security risk.

i_client = AuthenticatedClient(
    base_url="https://internal_api.example.com", 
    token="SuperSecretToken", 
    verify_ssl=False
)

Things to know:

  1. Every path/method combo becomes a Python module with four functions:

    1. sync: Blocking request that returns parsed data (if successful) or None
    2. sync_detailed: Blocking request that always returns a Request, optionally with parsed set if the request was successful.
    3. asyncio: Like sync but async instead of blocking
    4. asyncio_detailed: Like sync_detailed but async instead of blocking
  2. All path/query params, and bodies become method arguments.

  3. If your endpoint had any tags on it, the first tag will be used as a module name for the function (my_tag above)

  4. Any endpoint which did not have a tag will be in infinity_client.api.default

Advanced customizations

There are more settings on the generated Client class which let you control more runtime behavior, check out the docstring on that class for more info. You can also customize the underlying httpx.Client or httpx.AsyncClient (depending on your use-case):

from infinity_client import Client

def log_request(request):
    print(f"Request event hook: {request.method} {request.url} - Waiting for response")

def log_response(response):
    request = response.request
    print(f"Response event hook: {request.method} {request.url} - Status {response.status_code}")

i_client = Client(
    base_url="https://infinity.modal.michaelfeil.eu",
    httpx_args={"event_hooks": {"request": [log_request], "response": [log_response]}},
)

# Or get the underlying httpx client to modify directly with client.get_httpx_client() or client.get_async_httpx_client()

You can even set the httpx client directly, but beware that this will override any existing settings (e.g., base_url):

import httpx
from infinity_client import Client

i_client = Client(
    base_url="https://infinity.modal.michaelfeil.eu",
)
# Note that base_url needs to be re-set, as would any shared cookies, headers, etc.
i_client.set_httpx_client(httpx.Client(base_url="https://infinity.modal.michaelfeil.eu", proxies="http://localhost:8030"))

Building / publishing this package

This project uses Poetry to manage dependencies and packaging. Here are the basics:

  1. Update the metadata in pyproject.toml (e.g. authors, version)
  2. If you're using a private repository, configure it with Poetry
    1. poetry config repositories.<your-repository-name> <url-to-your-repository>
    2. poetry config http-basic.<your-repository-name> <username> <password>
  3. Publish the client with poetry publish --build -r <your-repository-name> or, if for public PyPI, just poetry publish --build

If you want to install this client into another project without publishing it (e.g. for development) then:

  1. If that project is using Poetry, you can simply do poetry add <path-to-this-client> from that project
  2. If that project is not using Poetry:
    1. Build a wheel with poetry build -f wheel
    2. Install that wheel from the other project pip install <path-to-wheel>

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

infinity_client-0.0.68.tar.gz (18.4 kB view details)

Uploaded Source

Built Distribution

infinity_client-0.0.68-py3-none-any.whl (43.9 kB view details)

Uploaded Python 3

File details

Details for the file infinity_client-0.0.68.tar.gz.

File metadata

  • Download URL: infinity_client-0.0.68.tar.gz
  • Upload date:
  • Size: 18.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.0rc1 Linux/5.15.153.1-microsoft-standard-WSL2

File hashes

Hashes for infinity_client-0.0.68.tar.gz
Algorithm Hash digest
SHA256 807166f3c5cc7a0417ea7879c3500b0c59f268b72f9842a7c1f80cc2072aa1b9
MD5 656197c909e3ec67853211413e49e562
BLAKE2b-256 62b8da961c26e03d8e4a241e22984ec625ba7487f655a0633bf0d1450b4d9c51

See more details on using hashes here.

File details

Details for the file infinity_client-0.0.68-py3-none-any.whl.

File metadata

  • Download URL: infinity_client-0.0.68-py3-none-any.whl
  • Upload date:
  • Size: 43.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.0rc1 Linux/5.15.153.1-microsoft-standard-WSL2

File hashes

Hashes for infinity_client-0.0.68-py3-none-any.whl
Algorithm Hash digest
SHA256 dc830895d497116a743362e995fe1788ba72199a7a302d04a3f4425f0e861cae
MD5 29be60bba5b761b1b7390da3d084640f
BLAKE2b-256 499cc21c5137038edd44b62540a135e9ad8fe66bc54b97f2b72123fca0e66a7e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page