
OneContext





Full docs page

Check out our docs for an in-depth treatment of how our platform works!

LLM Context as a Service

OneContext makes it easy and fast to augment your LLM application with your own data in just a few API calls. Upload your data to a Knowledge Base, then query it in natural language to retrieve relevant context for your LLM application.

We manage the full document processing and retrieval pipeline so that you don't have to:

  • document ingestion, chunking, and cleaning
  • efficient vector embeddings at scale, using state-of-the-art open-source models
  • a low-latency, multi-stage query pipeline that surfaces the most relevant context for your LLM application

We keep up with the latest research to provide an accurate and fast retrieval pipeline, grounded in model evaluation and best-practice heuristics.

Multi-stage query pipeline out of the box:

  • A fast base model retrieves a large pool of candidate documents.
  • A cross-encoder reranks the retrieved documents to surface the results most relevant to the query (see the sketch below).
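For intuition, here is a conceptual sketch of the retrieve-then-rerank pattern using the open-source sentence-transformers library directly, with the same models used later in this guide. This illustrates the idea only (the toy corpus is made up); it is not OneContext's internal implementation:

from sentence_transformers import CrossEncoder, SentenceTransformer, util

corpus = [
    "Charles Babbage designed the Analytical Engine in the 1830s.",
    "Ada Lovelace wrote what is considered the first computer program.",
    "The weather in London is frequently overcast.",
]
query = "Who designed an early mechanical computer?"

# Stage 1: a fast bi-encoder casts a wide net over the corpus (high recall).
embedder = SentenceTransformer("BAAI/bge-base-en-v1.5")
corpus_embeddings = embedder.encode(corpus, convert_to_tensor=True)
query_embedding = embedder.encode(query, convert_to_tensor=True)
candidates = util.semantic_search(query_embedding, corpus_embeddings, top_k=3)[0]

# Stage 2: a cross-encoder rescores each (query, candidate) pair (high precision).
reranker = CrossEncoder("BAAI/bge-reranker-base")
pairs = [(query, corpus[hit["corpus_id"]]) for hit in candidates]
scores = reranker.predict(pairs)

# Keep the highest-scoring chunks as LLM context.
best_first = sorted(zip(scores, pairs), key=lambda item: item[0], reverse=True)
print(best_first[0][1][1])  # best-matching document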

Use Cases:

  • Question answering over a large knowledge base
  • Long-term memory for chatbots
  • Runtime context for instruction-following agents
  • Hallucination prevention and detection based on custom data

Quick Start

Install the package with pip:

pip install onecontext

Note: If you prefer to jump right in, the full example code is in quickstart.py.

from onecontext import OneContext

# if api_key is omitted, ONECONTEXT_API_KEY env variable is used
oc = OneContext(api_key="<ONECONTEXT_API_KEY>")

You can get an API key here.
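As the comment above notes, you can also omit the api_key argument and rely on the ONECONTEXT_API_KEY environment variable; a minimal sketch:

import os

from onecontext import OneContext

# The client reads ONECONTEXT_API_KEY from the environment when api_key is omitted.
os.environ["ONECONTEXT_API_KEY"] = "<ONECONTEXT_API_KEY>"
oc = OneContext()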

Create your first knowledge base

A knowledge base is a collection of files. To create a knowledge base:

knowledgebase = oc.create_knowledgebase(name="my_kb")

Create a Vector Index

We want to chunk and embed the files in our knowledge base, but first we need somewhere to store the vectors. We create a vector index and specify the embedding model the index should expect:

oc.create_index("my_vector_index", model="BAAI/bge-base-en-v1.5")

By specifying the model, we create a vector index of the appropriate dimensions (BAAI/bge-base-en-v1.5 produces 768-dimensional embeddings, for example) and ensure that embeddings from a different model are never written to this index.

Create an Ingestion Pipeline

We are ready to deploy our first ingestion pipeline.

Create an ingestion.yaml file with the following content:

steps:
  - step: KnowledgeBaseFiles
    name: input
    step_args:
      # specify the source knowledgebases to watch
      knowledgebase_names: ["my_kb"]
    inputs: []

  - step: Preprocessor
    name: preprocessor
    step_args: {}
    inputs: [input]

  - step: Chunker
    name: simple_chunker
    step_args:
      chunk_size_words: 320
      chunk_overlap: 30
    inputs: [preprocessor]

  - step: SentenceTransformerEmbedder
    name: sentence-transformers
    step_args:
      model_name: BAAI/bge-base-en-v1.5
    inputs: [ simple_chunker ]

  - step: ChunkWriter
    name: save
    step_args:
      vector_index_name: my_vector_index
    inputs: [sentence-transformers]
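Each step's inputs field names the step it consumes from, so the pipeline forms a simple chain: files landing in my_kb are preprocessed, split into 320-word chunks with a 30-word overlap, embedded with the same BAAI/bge-base-en-v1.5 model the index expects, and written to my_vector_index.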

Then deploy like so:

oc.deploy_pipeline("my_ingestion_pipeline", pipeline_yaml_path="./ingestion.yaml")

Create a Query Pipeline

To query the vector index we need to define a query pipeline.

Create a query.yaml file with the following content:

steps:
  - step: SentenceTransformerEmbedder
    name: query_embedder
    step_args:
      model_name: BAAI/bge-base-en-v1.5
      include_metadata: [ title, file_name ]
      query: "placeholder"
    inputs: [ ]

  - step: Retriever
    name: retriever
    step_args:
      vector_index_name: my_vector_index
      top_k: 100
      metadata_json: { }
    inputs: [ query_embedder ]

  - step: Reranker
    name: reranker
    step_args:
      query: "placeholder"
      model_name: BAAI/bge-reranker-base
      top_k: 5
    inputs: [ retriever ]

Here we create a simple three-step query pipeline:

  • The SentenceTransformerEmbedder step embeds the query.
  • The Retriever step performs a similarity search against the index we defined earlier. This step has high recall and is great for retrieving a large pool of candidate vectors.
  • The Reranker step uses a cross-encoder model to narrow the results down to only the most relevant chunks.
Then deploy the query pipeline:

query_pipeline = oc.deploy_pipeline("basic_query", "./query.yaml")

Uploading Files

Upload files to an existing knowledge base:

knowledgebase = oc.KnowledgeBase(name="my_kb")
knowledgebase.upload_file("babbage.pdf")
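To ingest a whole folder, you can loop over the files with the same upload_file call; a minimal sketch (the local ./docs directory is hypothetical):

from pathlib import Path

# Hypothetical local folder of PDFs; reuses the upload_file call shown above.
for path in Path("./docs").glob("*.pdf"):
    knowledgebase.upload_file(str(path))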

When a file is uploaded, any pipelines connected to the knowledge base are triggered to run.

List runs to see the current state of each run:

oc.list_runs()

Run the Query Pipeline

Once the ingestion pipeline run is complete, we can query the index for relevant chunks using the query pipeline we created earlier.

We can run the query pipeline and override any of the default step arguments defined in our pipeline at runtime by passing a dictionary of the form `{step_name: {step_arg: step_arg_value}}`:

query = "What are the consequences of inventing a computer?"
retriever_top_k = 50
top_k = 5

override_args = {
    "query_embedder": {"query": query},
    "retriever": {
        "top_k": retriever_top_k,
    },
    "reranker": {"top_k": top_k, "query": query},
}

chunks = query_pipeline.run(override_args)
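The returned chunks can then be stitched into your LLM prompt. A minimal sketch, assuming each returned chunk exposes its text via a content attribute (the exact shape of the returned objects may differ; see the docs):

# Assumes each chunk has a `content` attribute holding its text; the actual
# return type may differ, so check the OneContext docs.
context = "\n\n".join(chunk.content for chunk in chunks)
prompt = f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"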

License

onecontext is distributed under the terms of the MIT license.

