Skip to main content

The official Python SDK for the ColiVara API

Project description

colivara-py

PyPI Changelog License Tests codecov

The official Python SDK for the ColiVara API. ColiVara is a document search and retrieval API that uses advanced machine learning techniques to index and search documents. This SDK allows you to interact with the API to create collections, upload documents, search for documents, and generate embeddings.

Installation

Install this library using pip:

pip install colivara-py

Usage

Please see the ColiVara API documentation for more information on how to use this library.

You will need to either self-host the API (see the ColiVara API repo) or use the hosted version at https://colivara.com. You will also need an API key, which you can obtain by signing up at ColiVara or from your self-hosted API.

import os
from colivara_py import ColiVara


rag_client = ColiVara(
     # This is the default and can be omitted
    api_key=os.environ.get("COLIVARA_API_KEY"),
    # This is the default and can be omitted
    base_url="https://api.colivara.com"
)
# Create a new collection (optional)
new_collection = rag_client.create_collection(name="my_collection", metadata={"description": "A sample collection"})
print(f"Created collection: {new_collection.name}")

# Upload a document to the collection
document = rag_client.upsert_document(
    name="sample_document",
    # optional, defaults to "default_collection"
    collection_name="my_collection", 
    url="https://example.com/sample.pdf",
    metadata={"author": "John Doe"}
)
print(f"Uploaded document: {document.name}")

# Search for documents
search_results = rag_client.search(
    query="machine learning",
    collection_name="my_collection",
    top_k=3
)
for result in search_results.results:
    print(f"Page {result.page_number} of {result.document_name}: Score {result.normalized_score}")

# List documents in a collection
documents = client.list_documents(collection_name="my_collection")
for doc in documents:
    print(f"Document: {doc.name}, Pages: {doc.num_pages}")

# Generate embeddings
embeddings = rag_client.create_embedding(
    input_data=["This is a sample text for embedding"],
    task="query"
)
print(f"Generated {len(embeddings.data)} embeddings")

# Delete a document
rag_client.delete_document("sample_document", collection_name="my_collection")
print("Document deleted")

Development

To contribute to this library, first checkout the code. Then create a new virtual environment:

We use uv, but you can use the pip interface if you prefer:

cd colivara-py
uv venv
source .venv/bin/activate

Now install the dependencies and test dependencies:

uv sync --extra dev-dependencies

To run the tests:

pytest

To build the documenation locally:

pdocs server colivara_py #to see the documentation locally.  
pdocs as_html colivara_py --overwrite #to generate HTML.
pdocs as_markdown colivara_py #to generate markdown.

License

This SDK is distributed under the Apache License, Version 2.0. The API is licensed under Functional Source License, Version 1.1, Apache 2.0 Future License. See the LICENSE.md file for details.

For commercial licensing, please contact us at tjmlabs.com. We are happy to work with you to provide a license that meets your needs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

colivara_py-1.2.5.tar.gz (19.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

colivara_py-1.2.5-py3-none-any.whl (13.9 kB view details)

Uploaded Python 3

File details

Details for the file colivara_py-1.2.5.tar.gz.

File metadata

  • Download URL: colivara_py-1.2.5.tar.gz
  • Upload date:
  • Size: 19.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for colivara_py-1.2.5.tar.gz
Algorithm Hash digest
SHA256 fd274fcf07a0d3f95ed9913f8cad9f3a89ad691f865d375d1d6278ec858bf2a8
MD5 ab269bddf792ccb704e0300020f6a642
BLAKE2b-256 6a0e81c4762ff1e6124311528b0ea575f35b5cf937c3cdceba0b7f40f40a72a5

See more details on using hashes here.

Provenance

The following attestation bundles were made for colivara_py-1.2.5.tar.gz:

Publisher: publish.yml on tjmlabs/colivara-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file colivara_py-1.2.5-py3-none-any.whl.

File metadata

  • Download URL: colivara_py-1.2.5-py3-none-any.whl
  • Upload date:
  • Size: 13.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for colivara_py-1.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 1335842b4e9e50b3f4407aa90e95aa0905c5ce69c8814a0d29a21ecedff2f25d
MD5 46a7239773804d219d0e2c08c3e1144f
BLAKE2b-256 2cb75b2d311d0fe45e81863a2956d8bd6bb9663eca1865128f55cfafecb16b28

See more details on using hashes here.

Provenance

The following attestation bundles were made for colivara_py-1.2.5-py3-none-any.whl:

Publisher: publish.yml on tjmlabs/colivara-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page