No project description provided

These details have not been verified by PyPI

Project links

Project description

OneContext

Official OneContext Python SDK

This is the official Python SDK for the OneContext platform. Use this SDK to connect your backend applications, interactive notebooks, or command-line tools to OneContext's platform.

What is OneContext?

OneContext is a platform that enables software engineers to compose and deploy custom RAG (Reading comprehension, Answer generation, and Generative) pipelines on state-of-the-art infrastructure, all without any hassle. Just create a context and add files to it. You can then search your context using vector search, hybrid search, etc. OneContext takes care of all the infrastructure details behind the scenes (SSL intents, DNS, Kubernetes clusters, embedding models, GPUs, auto-scaling, load balancing, etc).

Quick Start

Install the package with pip:

pip install onecontext

from onecontext import OneContext

# if api_key is omitted, ONECONTEXT_API_KEY env variable is used
oc = OneContext(api_key="<ONECONTEXT_API_KEY>")

You can get an api key here.

Note you may need to set the OPENAI_API_KEY on the settings page if using OPENAI embeddigns.

Create a Context

A Context is where you store your data. It represents a sort of a "File Store", a "Knowledge Base", a "Second Brain", etc.

To create a context:

context = oc.create_context(name="my_context")

To initilise an existing context object:

context = oc.Context(name="my_context")

Upload files

Now you can enrich your context with knowledge. You can make your context an expert in anything, just add files.

If you're on the free plan, you can have just one context, with up to 10 files (of less than 50 pages each). If you're on the pro plan, you can have up to 5,000 contexts, each with up to 5,000 files.

You can add individual files

context.upload_files(['path_to_file_1.pdf', 'path_to_file_2.pdf'], max_chunk_size=400)

To upload strings directly use the upload_texts method

context.upload_texts(['Hi Bob!', 'Oh hey Mark!'])

You can also add a full directory of files

context.upload_from_directory("path_to_your_directory")

In the above code, replace "path_to_your_directory" with the actual path to your directory.

You can add metadata to files at upload time

file_paths = ['path_to_file_1.pdf', 'path_to_file_2.pdf']
files_metadata = [{"tag": "file_1_tag"}, {"tag": "file_2_tag"}]
context.upload_files(file_paths,metadata=files_metadata)

You can use this metadata to filter searches against your context. For more details, see the OneContext Structured Query Language section.

List files available in a context

files = context.list_files()
for file in files:
    print(file)

You can also generate a download link for a file using the file_id:

import requests
files = context.list_files()
file = files[0]
file_url = context.get_download_url(file_id=file.id)
response = requests.get(file_url)
path = f"./local_folder/{file.name}"
with open(path, "wb") as f:
    f.write(response.content)

Note: Download urls will be valid for 5 minutes after they have been generated.

List the chunks available in a context

chunks = context.list_chunks(limit=50)

List all the contexts

all_contexts = oc.list_contexts()

These lines of code will provide a list of all contexts available to you.

Deleting contexts

If you wish to delete any context, you can do so with:

oc.delete_context("my_context")

Search through your Context

The following piece of code will execute a query and search across all chunks present in the context which have matchin metadata:

context = oc.Context("my_context")
chunks = context.search(
    query="query_string_to_search",
    semantic_weight=0.7,
    full_text_weight=0.3,
    top_k=5,
    rrf_k=50,
    metadata_filters = {"tag" : {"$eq" : "file_1_tag"}}
)

More details on the arguments for this method:

query: Query string that will be embedded used for the search.
top_k: The maximum number of "chunks" that will be retrieved.
semantic_weight: A value representing the weight of the relevance of the semantic similarity of the data in your context.
full_text_weight: A weight value for the relevance of the actual words in the context using key word search.
rrfK: quite a technical parameter which determines how we merge the scores for semantic, and fullText weights. For more see here
metadata_filters: A dictionary of criteria used to filter results based on metadata. See the OneContext Structured Query Language section below for the syntax details

Extract Structured Output from your Context

You can get structured output directly from your context by providing a json schema or a pydantic (v2) BaseModel:

from pydantic import BaseModel, Field

class RockBandInfo(BaseModel):
    title: str = Field(description="a title of a 1970s rockband")
    lyrics: str = Field(description="lyrics to their absolute banger of a song")

context = oc.Context("my_context")


output_dict, chunks = context.extract_from_search(
    query="tell me about rockbands",
    schema=RockBandInfo, # you can pass a pydantic (v2) model or a json schema dict
    extraction_prompt="Output only JSON matching the provided schema about the rockbands",
)

rock_band = RockBandInfo.model_validate(output_dict)

extract_from_search works just like search but returns the structured output as a dictionary as well as the reference chunks.

Note that pydantic is not a dependency of onecontext; you an pass a json schema definition directly to the extract methods instead of a BaseModel.

You can also extract structured output directly from chunks without performing as search:

output_dict, chunks = context.extract_from_chunks(
    schema=RockBandInfo, # you can pass a pydantic model or a json schema dict
    extraction_prompt="Output only JSON matching the provided schema about the rockbands",
    metadata_filters = {"tag" : {"$eq" : "rockband"}}
)

rock_band = RockBandInfo.model_validate(output_dict)

OneContext Structured Query Language

OneContext allows you to use a custom "Structured Query Language" to filter the chunks in your context.

The syntax is quite similar to what you might find in no-SQL databases like MongoDB, even though it operates on a SQL database at its core.

The syntax is based around the application of operators. There are two levels of operators. You can interpret the two levels as "aggregators" and "comparators".

Aggregators

The aggregator operators you can use are:

Key	Value Description
`$and`	Returns True i.f.f. all of the conditions in this block return True.
`$or`	Returns True if any of the conditions in this block return True.

Comparators

The comparator operators you can use are:

Key	Value Description	Supplied Value Type	Returned Value Type
`$eq`	Returns True if the value returned from the DB is equal to the supplied value.	`string	int
`$neq`	Returns True if the value returned from the DB is not equal to the supplied value.	`string	int
`$gt`	Returns True if the value returned from the DB is greater than the supplied value.	`int	float`
`$lt`	Returns True if the value returned from the DB is less than the supplied value.	`int	float`
`$in`	Returns True if the value returned from the DB is contained by the supplied array.	`array<string>`	int
`$contains`	Returns True if the array value returned from the DB contains the supplied value.	`string	int

Putting it all together

Using the above building blocks, it's pretty simple to put together quite an advanced composite filter across your embeddings at runtime.

For example in Python you could define some metadata filters like the below:

metadata_filters = { "$and": [
  {"$or": [
    {"department": {"$eq":"accounts"}},
    {"department": { "$in": ["finance", "compliance"}}
  ]},
  {"tag": {"$eq": "test"}},
  {"my_score": {"$gt" : 0.5}},
  {"my_other_score" : {"$gt" :0.4}}
]}

context = oc.Context("my_context")
chunks = context.search(
    query="query_string_to_search",
    metadata_filters = metadata_filters,
)

License

onecontext is distributed under the terms of the MIT license.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.3.8

Nov 18, 2024

3.3.7

Oct 18, 2024

3.3.6

Oct 17, 2024

3.3.5

Oct 16, 2024

3.3.4

Oct 11, 2024

3.3.3

Oct 9, 2024

This version

3.3.2

Oct 9, 2024

3.3.1

Oct 8, 2024

3.3.0

Oct 8, 2024

3.2.0

Oct 7, 2024

3.1.1

Oct 3, 2024

3.1.0

Oct 3, 2024

3.0.0

Sep 18, 2024

0.1.0

Sep 4, 2024

0.0.31

Jul 18, 2024

0.0.30

Jul 4, 2024

0.0.29

Jul 4, 2024

0.0.28

Jul 4, 2024

0.0.27

Jul 4, 2024

0.0.26

Jul 4, 2024

0.0.25

Jul 2, 2024

0.0.24

Jul 1, 2024

0.0.23

Jun 26, 2024

0.0.21

Jun 18, 2024

0.0.20

Jun 18, 2024

0.0.19

Jun 16, 2024

0.0.18

Jun 13, 2024

0.0.17

Jun 12, 2024

0.0.16

Jun 10, 2024

0.0.15

Jun 10, 2024

0.0.14

May 22, 2024

0.0.13

May 16, 2024

0.0.12

May 14, 2024

0.0.11

Nov 4, 2023

0.0.10

Nov 4, 2023

0.0.9

Nov 3, 2023

0.0.8

Nov 3, 2023

0.0.7

Oct 20, 2023

0.0.6

Oct 17, 2023

0.0.5

Oct 14, 2023

0.0.4

Oct 14, 2023

0.0.3 yanked

Oct 13, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

onecontext-3.3.2.tar.gz (14.1 kB view details)

Uploaded Oct 9, 2024 Source

Built Distribution

onecontext-3.3.2-py3-none-any.whl (14.5 kB view details)

Uploaded Oct 9, 2024 Python 3

File details

Details for the file onecontext-3.3.2.tar.gz.

File metadata

Download URL: onecontext-3.3.2.tar.gz
Upload date: Oct 9, 2024
Size: 14.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.27.0

File hashes

Hashes for onecontext-3.3.2.tar.gz
Algorithm	Hash digest
SHA256	`32c74b95b000ad6685f7016b88157c69ab5c7bd16c1f8392006c307477da22f8`
MD5	`c653164f63519cba75ae5982740ad849`
BLAKE2b-256	`c9764b483a9c53e17c34e6d4a04d9afa1c165606aa70940f737d35eefff5e9ad`

See more details on using hashes here.

File details

Details for the file onecontext-3.3.2-py3-none-any.whl.

File metadata

Download URL: onecontext-3.3.2-py3-none-any.whl
Upload date: Oct 9, 2024
Size: 14.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.27.0

File hashes

Hashes for onecontext-3.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`88abed22224f6b2bcd96dcd6f74d322c80fcb0f31f420ec6d4434fac21254c9b`
MD5	`221e609a0b850bb339a4e0e4c6693ffa`
BLAKE2b-256	`b8b2a4f9bf748ed8d1766b2e9a9a308847d4f65c5511fc67cf00775987cbde87`