Vector Vault: Simplified vector database management and secure cloud storage for data science and machine learning workflows.

Project description

Vector Vault is designed to simplify working with vector databases. It lets users manage vector databases efficiently, with data integrated and accessed seamlessly from the cloud. It's scalable, suitable for both small and large databases, and designed with a user-friendly interface. Furthermore, it simplifies complex workflows, ensures secure and isolated data handling, and enables users to create and interact with vector databases - aka "vaults" - simply and easily.

Vector Vault was built with the goal of making complex workflows that use vector databases for informed generative AI simple and easy. By combining similarity vector search with generative AI chat, new possibilities for conversation and communication emerge. Product information can be added to a vault, and when customers ask a product question, the right information can be instantly retrieved and seamlessly used in conversation by ChatGPT for an accurate response. This allows for informed conversation, and the possibilities range from AI-automated customer support, to new ways to get news and entertainment, to AI code reviews that reference documentation, and much more.
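
For example, the customer-support pattern described above might look like the following sketch, built from the API calls covered in the rest of this README (the vault name and product text are illustrative):

import os

from vector_vault import Vault

# get_vectors() calls OpenAI internally, so your key must be set
os.environ['OPENAI_API_KEY'] = 'your_openai_api_key'

# Store product information in a dedicated vault
vault = Vault(user='your_user_id', api_key='your_api_key', vault='products')
vault.add("The X100 espresso machine has a 2L tank and a 15-bar pump.")
vault.get_vectors()
vault.save()

# When a customer asks a product question, similar info is pulled from
# the vault and given to ChatGPT as context for an accurate answer
answer = vault.get_chat("How big is the X100's water tank?", get_context=True)
print(answer)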

Vector Vault uses a proprietary Inception Architecture, allowing you to create any number of vaults, and vaults within vaults. Each vault is its own database and automatically integrates data storage in the cloud. You will need a Vector Vault account in order to get your user ID and API key for cloud access. If you don't already have one, you can sign up free at https://vectorvault.io

This Python library allows you to interact with Vector Vault through its Python-based API. It includes operations such as creating a vault, deleting a vault, adding data to the vault, getting vector embeddings for the data, saving data to the vault, interacting with OpenAI's ChatGPT model to get responses, and managing conversation history for more contextualized responses.

Interact with your Vault:

add : Add an item to the Vault, with automatic text splitting and processing for long texts. The main way to add data to the Vault
add_item : Add a single item to the Vault
add_item_with_vector : Add an item to the Vault with a vector provided - only accepts vectors of 1536 dimensions
save : Saves the vectors to the Vault and uploads any metadata
delete : Deletes the current Vault
get_vaults : Retrieves a list of vaults in the current vault
get_similar : Retrieves similar vectors for a given input text
get_vectors : Retrieves the vectors for all items in the Vault
get_chat : Retrieves a response from OpenAI's ChatGPT for a given input text, with support for handling conversation history, summarizing responses, and retrieving context-based responses by accessing similar references in the vault
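
If you already have embeddings, add_item_with_vector lets you supply them yourself instead of calling get_vectors(). A minimal sketch, assuming the text-then-vector argument order implied by the method list above:

# Sketch only: the exact parameter order is an assumption, not confirmed
# API - vectors must be 1536 dimensions, the size produced by OpenAI's
# text-embedding-ada-002
precomputed_vector = [0.0] * 1536  # replace with a real embedding

vault.add_item_with_vector('some text', precomputed_vector)
vault.save()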



Basic usage:

from vector_vault import Vault

# Create an instance of the Vault class - a new vault will be created if name does not exist
vault = Vault(user='your_user_id', api_key='your_api_key', vault='name_of_your_vault')

# Some text data we want to store
text_data = 'some data'

# Add the data to the Vault
vault.add(text_data)

# Add your OpenAI key to an environment variable
import os
os.environ['OPENAI_API_KEY'] = 'your_openai_api_key'

# Get vector embeddings for text_data
# Internally calls OpenAI with automatic rate limiting built in. Large inputs are batched and concurrently processed for the fastest possible embed time.
vault.get_vectors()

# Save the vectors and data to the Vault 
vault.save()


Now that you have saved some data to the vault, you can add more at any time, and your vault will automatically handle the adding process. These three lines execute very quickly.

# Add more data to the Vault
vault.add(more_text_data)

# Get embeddings for it - requires an OpenAI API key set as an environment variable
vault.get_vectors()

# Save to the Vault
vault.save()


vault.add() is cool. You can add any length of text, even a full book, and it will all be automatically split and processed. vault.get_vectors() is also cool, because you can vault.add() as much as you want, then when you're done, process all the vectors at once with vault.get_vectors() - it internally batch processes vector embeddings with OpenAI's text-embedding-ada-002, and comes with auto rate-limiting and concurrent requests for maximum speed.

vault.add(insanely_large_text_data)
vault.get_vectors() 
vault.save() 


When you want to use the vault later:

similar_data = vault.get_similar(text_input) # returns a list with 4 results
similar_data = vault.get_similar(text_input, n = 10) # returns 10 results

# Print each similar item 
for result in similar_data:
    print(result['data'])

Use the get_chat() function to get a response from ChatGPT

The following searches the vault for the 4 most similar results, gives those to ChatGPT as context, and asks ChatGPT to answer the question using that context

user_input = "This text is going to be used find contextually similar references in the vault"

answer = vault.get_chat(user_input, get_context=True)  
print(answer)

# The following line will just send ChatGPT the user_input and will not interact with the vault in any way
answer = vault.get_chat(user_input) 


Change Vault

In this example, we create a science vault and print a list of the vaults inside it

science_vault = Vault(user='your_user_id', api_key='your_api_key', vault='science')

print(science_vault.get_vaults())

['biology', 'physics', 'chemistry']

Access vaults within vaults

# biology vault within science vault
biology_vault = Vault(user='your_user_id', api_key='your_api_key', vault='science/biology')

# chemistry vault within science vault
chemistry_vault = Vault(user='your_user_id', api_key='your_api_key', vault='science/chemistry')

print(chemistry_vault.get_vaults())

['reactions', 'formulas', 'lab notes']

# lab notes vault within chemistry vault
lab_notes_vault = Vault(user='your_user_id', api_key='your_api_key', vault='science/chemistry/lab notes')


get_chat()

Get a chat response from OpenAI's ChatGPT. Rate limiting, auto retries, and chat history slicing are built in so you can chat with ease. Enter your text, add optional chat history, and optionally choose a summary response (default: summary=False)

  • Example Single Usage: response = vault.get_chat(text)

  • Example Chat: response = vault.get_chat(text, chat_history)

  • Example Summary: summary = vault.get_chat(text, summary=True)

  • Example Context-Based Response: response = vault.get_chat(text, get_context = True)

  • Example Context-Based Response w/ Chat History: response = vault.get_chat(text, chat_history, get_context = True)

  • Example Context-Response with Context Samples Returned: vault_response = vault.get_chat(text, get_context = True, return_context = True)


Response is a string, unless return_context=True is passed, in which case the response will be a dictionary containing the results from the vault as well as the response:

# print response:
print(vault_response['response'])

# print context:
for item in vault_response['context']['results']:
    print("\n\n", f"item {item['metadata']['item_id']}")
    print(item['data'])

Summarize:

You can summarize any text, no matter how large - even an entire book all at once. Long texts are split into the largest possible chunk sizes and a summary is generated for each chunk. When all summaries are finished, they are concatenated and returned as one.

summary = vault.get_chat(text, summary=True)
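
Conceptually, the internal flow resembles the sketch below. The chunking helper and chunk size are illustrative assumptions, not library API - get_chat(text, summary=True) already does all of this for you:

# Illustrative sketch of the split-summarize-concatenate flow described above
CHUNK_SIZE = 12000  # stand-in for "largest possible chunk size"

def summarize_large_text(vault, text):
    # Split the text into the largest chunks the model can handle
    chunks = [text[i:i + CHUNK_SIZE] for i in range(0, len(text), CHUNK_SIZE)]
    # Generate a summary for each chunk...
    summaries = [vault.get_chat(chunk, summary=True) for chunk in chunks]
    # ...then concatenate the chunk summaries and return them as one
    return ' '.join(summaries)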



Real world usage:

user_input = input("What's your question?")

# Get response from Language model
vault_response = vault.get_chat(user_input, get_context=True, return_context=True)

answer = vault_response['response']
print("Question:", user_input, "\n\nAnswer:", answer)

# show the context used to generate the answer
for item in vault_response['context']['results']:
    print("\n\n", f"item {item['metadata']['item_index']}")
    print(item['data'])

Question: What is a token broker?

Answer: A token broker is a service that generates downscoped access tokens for token consumers to access or modify specific resources...

item 33 Various workloads (token consumers) in the same network will send authenticated requests to that broker for downscoped tokens to...

item 4 Another reason to use downscoped credentials is to ensure tokens in flight...

item 37 The following is an...


user_input2 = input("What's your next question?")

history = user_input + answer

# Get response from Language model
vault_response2 = vault.get_chat(user_input2, history=history, get_context=True, model='gpt-4')

print("Question:", user_input2, "\n\nAnswer:", vault_response2)

Question: How do I use it?

Answer: You can use it by...
