Brevia

No project description provided

These details have not been verified by PyPI

Project links

Project description

Python Versions

The repository contains a minimal LLM API project in Python based on LangChain for interaction with LLM and FastAPI for the API interface.

Requirements

A version of Python 3.10 or higher and Poetry is required.

It is recommended to use virtualenv in the project. Check the settings with

poetry config --list

Check that the virtualenvs.in-project configuration is true otherwise launch:

poetry config virtualenvs.in-project true

Setup

install the dependencies by running poetry install, a virtualenv will automatically be created in the .venv folder.
then activate the virtualenv by running the poetry shell command.
copy the file .env.sample to .env and value the environment variables, especially OPENAI_API_KEY with the secret key of OpenAI and PGVECTOR_* see the Database section

Update packages

You use poetry update which will update the poetry.lock lock file. To change versions of dependencies you can also directly edit pyproject.toml in the [tool.poetry.dependencies] section.

Database

Run docker compose --profile admin up to run postgres+pgvector and pgadmin docker images.

With your browser, open pgadmin at http://localhost:4000

The 4000 port is configurable with the PGADMIN_PORT environment var in the .env file.

login to pgadmin with PGADMIN_DEFAULT_* credentials from the .env file.
create a connection with Add New Server by setting.
- in General brevia or other name of your choice (PGVECTOR_DATABASE).
- in Connection as host name pgdatabase and choose a Username to Password (PGVECTOR_USER, PGVECTOR_PASSWORD)

Launch migrations to create the initial schema with Alembic

alembic upgrade head

Test setup

To verify that the setup is correct, run

python brevia/scripts/csv_import.py data/test_min_dataset.csv test

To create indexing from a test CSV with one line. If at the end in output you find. Index collection {name} updated with {n} documents and {n} texts then everything is ok.

Import/export of collections.

To export use the export_collection.py script from the virtual env

python export_collection.py /path/to/folder collection

Where

/path/to/folder is the path where the 2 CSV files will be created, one for collection records and other with embeddings
collection is the name of the collection

To export use the import_collection.py script from the virtual env

python import_collection.py /path/to/folder biology

Where

/path/to/folder is the path where the 2 CSV files to be loaded are searched, one for collection records and other with embeddings
collection is the name of the collection

NB: postgres psql client is required for these scripts, connection parameters will be read from environment var (.env file)

server API

To start the server, type the following command from virtualenv:

uvicorn main:app --reload

The server will start executing on port 8000.

Docker

To launch the docker image of the API along with the Postgres service use

docker compose --profile api up

To launch the PgAdmin docker image along with Postgres use

docker compose --profile admin up

To launch the docker image of the APP and API along with Postgres

docker compose --profile app up

The version of the docker image used is defined in the .env file in the environment variables API_VERSION for the API and APP_VERSION for the app

Tracing log

To enable the call tracing feature, built in langchain add/uncomment the system variable on app.py:

environ["LANGCHAIN_HANDLER"] = langchain

ensuring that it is executed before any operation on the langchain libraries.

Start the server via docker images from the console:

langchain-server

Navigate to http://localhost:4173/ to display the trace control panel and use the default session.

To change the session name set:

.environ ["LANGCHAIN_SESSION"] = "my_session" # Making sure that this session actually exists. You can create a new session in the UI.

While to dynamically change session in the code DO NOT set the environment variable LANGCHAIN_SESSION, use instead:

langchain.set_tracing_callback_manager(session_name = "my_session")

Access Tokens

There is a built-in basic support for access tokens for API security

Access tokens are actively checked via Authoritazion: Bearer <token> header if a TOKENS_SECRET env variable has been set. You may then generate a new access token using:

poetry run create_token --user {user} --duration {minutes}

If the env TOKENS_SECRET variable is set token verification is automatically performed on every endpoint using brevia.dependencies.get_dependencies in its dependencies.

The recommended way yo generate TOKENS_SECRET is by using openssl via cli like

openssl rand -hex 32

You can also define a list of valid users as a comma separated string in the TOKENS_USERS env variable.

Setting it like TOKENS_USERS="brevia,gustavo" means that only brevia and gustavo are considered valid users names. Remember to use double quotes in a .env file.

Unit tests

To launch unit tests make sure to have dev dependencies installed. This is done with:

poetry install --with dev

To launch unit tests, type from virtualenv:

pytest tests/

To create coverage in HTML format:

pytest --cov-report html --cov=brevia tests/

Covreage report is created using pytest-cov

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.44

Oct 15, 2024

0.0.43

Oct 8, 2024

0.0.42

Oct 7, 2024

0.0.41

Sep 23, 2024

0.0.40

Sep 4, 2024

0.0.39

Sep 2, 2024

0.0.38

Aug 30, 2024

0.0.37

Aug 28, 2024

0.0.36

Aug 28, 2024

0.0.35

Aug 27, 2024

0.0.34

Jul 24, 2024

0.0.33

Jun 25, 2024

0.0.32

Jun 21, 2024

0.0.28

May 15, 2024

0.0.27

May 10, 2024

0.0.26

Apr 11, 2024

0.0.25

Mar 28, 2024

0.0.24

Mar 26, 2024

0.0.23

Mar 5, 2024

0.0.22

Feb 29, 2024

0.0.21

Feb 21, 2024

0.0.20

Feb 5, 2024

0.0.19

Jan 25, 2024

0.0.18

Jan 24, 2024

0.0.17

Jan 18, 2024

0.0.16

Jan 15, 2024

0.0.15

Jan 10, 2024

0.0.14

Jan 9, 2024

0.0.13

Jan 8, 2024

0.0.12

Dec 11, 2023

0.0.11

Dec 11, 2023

0.0.10

Dec 11, 2023

0.0.9

Nov 30, 2023

0.0.8

Nov 28, 2023

0.0.7

Nov 27, 2023

0.0.6

Nov 14, 2023

0.0.5

Nov 7, 2023

0.0.4

Oct 27, 2023

This version

0.0.3

Oct 17, 2023

0.0.2

Oct 11, 2023

0.0.1

Oct 11, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

brevia-0.0.3.tar.gz (34.1 kB view hashes)

Uploaded Oct 17, 2023 Source

Built Distribution

brevia-0.0.3-py3-none-any.whl (47.8 kB view hashes)

Uploaded Oct 17, 2023 Python 3

Hashes for brevia-0.0.3.tar.gz

Hashes for brevia-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`36f490ee5dd5cc6c9140c2dfaf6a6ed3a3e5ea4604ed22d37378c7758717199b`
MD5	`031e6d4bae7ce8b6b449e370b021767f`
BLAKE2b-256	`c8c550a08871fa8c2c41ee120abf0cbb2672e14de65cdf96c1a846e1a7b87087`

Hashes for brevia-0.0.3-py3-none-any.whl

Hashes for brevia-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d180b528b074c319679cc48856fd23cfa360a93f0612ad39b20229d8769b7e2d`
MD5	`751931d9f2a7768d85718a8bec239d88`
BLAKE2b-256	`b1ed30a09428e134b0e65fb8f12c8beab116b22a32d69b1b95930c4de9e98132`