TextEmbed provides a robust and scalable REST API for generating vector embeddings from text. Built for performance and flexibility, it supports various sentence-transformer models, allowing users to easily integrate state-of-the-art NLP techniques into their applications. Whether you need embeddings for search, recommendation, or other NLP tasks, TextEmbed delivers with high efficiency.

These details have not been verified by PyPI

Project links

Homepage

Project description

TextEmbed - Embedding Inference Server

TextEmbed is a high-throughput, low-latency REST API designed for serving vector embeddings. It supports a wide range of sentence-transformer models and frameworks, making it suitable for various applications in natural language processing.

Features

High Throughput & Low Latency: Designed to handle a large number of requests efficiently.
Flexible Model Support: Works with various sentence-transformer models.
Scalable: Easily integrates into larger systems and scales with demand.
Batch Processing: Supports batch processing for better and faster inference.
OpenAI Compatible REST API Endpoint: Provides an OpenAI compatible REST API endpoint.
Single Line Command Deployment: Deploy multiple models via a single command for efficient deployment.
Support for Embedding Formats: Supports binary, float16, and float32 embeddings formats for faster retrieval.

Getting Started

Prerequisites

Ensure you have Python 3.10 or higher installed. You will also need to install the required dependencies.

Installation

Install the required dependencies:
```
pip install -U textembed
```
Start the TextEmbed server with your desired models:
```
python3 -m textembed.server --models <Model1>, <Model2> --port <Port>
```
Replace <Model1> and <Model2> with the names of the models you want to use, separated by commas. Replace <Port> with the port number on which you want to run the server.

For more information about the Docker deployment and configuration, please refer to the documentation setup.md.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.0.8

Jun 13, 2024

0.0.7

Jun 10, 2024

This version

0.0.6

Jun 5, 2024

0.0.5

Jun 2, 2024

0.0.3

May 29, 2024

0.0.2

May 25, 2024

0.0.1

May 20, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textembed-0.0.6.tar.gz (21.5 kB view details)

Uploaded Jun 5, 2024 Source

Built Distribution

textembed-0.0.6-py3-none-any.whl (25.1 kB view details)

Uploaded Jun 5, 2024 Python 3

File details

Details for the file textembed-0.0.6.tar.gz.

File metadata

Download URL: textembed-0.0.6.tar.gz
Upload date: Jun 5, 2024
Size: 21.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.9.19

File hashes

Hashes for textembed-0.0.6.tar.gz
Algorithm	Hash digest
SHA256	`199aab241647fbac12076153fb770791804e8c1503d245fb498dc11810d1ddd1`
MD5	`82b9e385c32db676ed154a995b2c80a1`
BLAKE2b-256	`cb333e973c05e05daaed0ce36c27006ed43d15fdaa3abc0fbef76498ec3ea959`

See more details on using hashes here.

File details

Details for the file textembed-0.0.6-py3-none-any.whl.

File metadata

Download URL: textembed-0.0.6-py3-none-any.whl
Upload date: Jun 5, 2024
Size: 25.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.9.19

File hashes

Hashes for textembed-0.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`71d3f609992143f6c907a3115ec120e9937fac51754c07cf3240bd09fbbf2f12`
MD5	`9f621f7b06f314813fdc0e5972adae23`
BLAKE2b-256	`5150ad80c69ae5a10c26552e9e797971a7a6da9c67d901c816243a210cb1aea1`