embedstore

Retrieving the most relevant context for your LLMs

Context retrieval APIs to power your LLMs!

Quick start

Installation

pip install embedstore

Get API Key here

# Set the API key (replace the placeholder string with your own key)
import os
os.environ["EMBEDSTORE_API_KEY"] = "<YOUR API KEY>"
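
Hardcoding the key in source makes it easy to leak, so you may prefer to export it in your shell and only verify it from Python. The following is a minimal sketch, assuming only that the key lives in the `EMBEDSTORE_API_KEY` environment variable as shown above. The helper name `require_api_key` is hypothetical and not part of embedstore.

```python
import os


def require_api_key(var_name: str = "EMBEDSTORE_API_KEY") -> str:
    """Return the API key from the environment, or raise a clear error.

    Hypothetical helper for illustration; not part of embedstore.
    """
    key = os.environ.get(var_name, "").strip()
    if not key:
        raise RuntimeError(
            f"{var_name} is not set; export it before creating a retriever."
        )
    return key
```

Calling `require_api_key()` once at startup surfaces a missing key immediately rather than on the first query.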

Retrieve contexts

In a few lines of code, you can start retrieving relevant contexts to get better answers from your LLM. We have already embedded Podcast Transcripts and arXiv research papers, and new datasets are added every week (full list here).

from embedstore.rag.retrievers import EmbedStoreRetriever

# Initialize the retriever
arxiv_retriever = EmbedStoreRetriever(dataset_id = "arxiv_01", num_docs=3)

# Query the retriever
contexts = arxiv_retriever.query("What is the state of the art in Autonomous Driving security and safety?")

# Examine the retrieved contexts, then append them to the prompt before you call your LLM
print(contexts[0])

Read more here for detailed usage guidelines.
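
Appending the retrieved contexts to the prompt can be sketched as below. This is an illustration under stated assumptions, not part of the library: `build_prompt` and its `max_chars` budget are hypothetical, and because the type of each retrieved context is not documented here, every item is converted with `str()`.

```python
from typing import Sequence


def build_prompt(question: str, contexts: Sequence[object], max_chars: int = 4000) -> str:
    """Prepend retrieved contexts to a question (hypothetical helper).

    Contexts are added in order until the character budget would be exceeded.
    """
    parts = []
    used = 0
    for i, ctx in enumerate(contexts, start=1):
        text = str(ctx).strip()  # assumption: str() gives a usable text form
        if used + len(text) > max_chars:
            break  # stop before exceeding the character budget
        parts.append(f"[Context {i}]\n{text}")
        used += len(text)
    context_block = "\n\n".join(parts)
    return (
        "Answer the question using the context below.\n\n"
        f"{context_block}\n\n"
        f"Question: {question}"
    )
```

With the quick-start snippet, `build_prompt(question, contexts)` would produce the string to send to your LLM; the prompt wording itself is a placeholder to adapt.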

What is this?


Usage

More about `EmbedStoreRetriever` class

EmbedStoreRetriever is the single entry point for interacting with embedstore.

Parameters to initialize the class:

dataset_id (str): Dataset to query and retrieve contexts from (e.g. arxiv_01, podcasts_01)
num_docs (int): Number of contexts to retrieve from the embeddings
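
If you build retrievers from configuration, it can help to check the two parameters before constructing anything. The sketch below is hypothetical: `RetrieverSettings` and `LIVE_DATASETS` are illustrative names, the dataset IDs are the ones marked "Live" on this page, and whether the library performs its own validation is not stated here.

```python
from dataclasses import dataclass, asdict

# dataset_id values listed as "Live" in the Datasets table on this page.
LIVE_DATASETS = {"arxiv_01", "podcasts_01"}


@dataclass(frozen=True)
class RetrieverSettings:
    """Hypothetical container for the two documented constructor parameters."""

    dataset_id: str
    num_docs: int = 3

    def __post_init__(self) -> None:
        if self.dataset_id not in LIVE_DATASETS:
            raise ValueError(f"unknown or not-yet-live dataset_id: {self.dataset_id!r}")
        if not isinstance(self.num_docs, int) or self.num_docs < 1:
            raise ValueError("num_docs must be a positive integer")


settings = RetrieverSettings(dataset_id="arxiv_01", num_docs=3)
kwargs = asdict(settings)
# kwargs can then be passed on, e.g. EmbedStoreRetriever(**kwargs)
```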

Advanced filtering while querying

You can use EmbedStoreRetriever.query() to get contexts based on the user prompt and also perform filtering.

The query() function takes two inputs:

prompt (str): The user prompt that you are querying for
post_processing_config (dict): A dictionary of dataset specific filters (check here for available options)

Example : Temporal filter on Arxiv research papers

from embedstore.rag.retrievers import EmbedStoreRetriever

# Initialize the retriever
arxiv_retriever = EmbedStoreRetriever(dataset_id = "arxiv_01", num_docs=3)

# Query the retriever
user_query = "What are some recent papers about CRISPR? Give me a brief summary of major trends."

# Filter to do semantic search on papers published since 2022-12-01 (about the last 6 months as of this release)
post_processing_config = {"filters": {"publish_date_start": "2022-12-01T00:00:00Z"}}

# Call the `query()` function with the prompt and the filter config
contexts = arxiv_retriever.query(user_query, post_processing_config)
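
A fixed start date goes stale, so a rolling window may be more convenient. The following sketch computes `publish_date_start` relative to the current time, using the `%Y-%m-%dT%H:%M:%SZ` format given in the Datasets table. The helper `published_since` is hypothetical and not part of embedstore.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

DATE_FORMAT = "%Y-%m-%dT%H:%M:%SZ"  # format given in the Datasets table


def published_since(days: int, now: Optional[datetime] = None) -> dict:
    """Build a post_processing_config for papers published in the last `days` days.

    Hypothetical helper; `now` can be injected to keep the result reproducible.
    """
    now = now or datetime.now(timezone.utc)
    start = now.astimezone(timezone.utc) - timedelta(days=days)
    return {"filters": {"publish_date_start": start.strftime(DATE_FORMAT)}}


# Fixed reference time so the example is deterministic.
config = published_since(180, now=datetime(2023, 6, 1, tzinfo=timezone.utc))
print(config)
```

The returned dictionary has the same shape as `post_processing_config` in the example above and can be passed as the second argument to `query()`.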

Example : Categorical filter on Podcast Transcripts

from embedstore.rag.retrievers import EmbedStoreRetriever

# Initialize the retriever
podcast_retriever = EmbedStoreRetriever(dataset_id = "podcasts_01", num_docs=3)

# Query the retriever
user_query = "What are people saying about the new Twitter CEO?"

# Filter to search on Business and Technology podcast episodes
post_processing_config = {"filters":{"category":["Business","Technology"]}}

# Calling the `query()` function
contexts = podcast_retriever.query(user_query,post_processing_config)

Datasets

Dataset	dataset_id	Description	Status	Filters Available	More Details
Podcast Transcripts	podcasts_01	30 most recent podcasts across Finance, Business and Technology	Live	- `category` : ["Finance","Business","Technology"]	Link
arXiv	arxiv_01	2.2M arxiv papers	Live	- `category` : ['Computer Science', 'Quantitative Biology', 'Economics', 'Quantitative Finance', 'Statistics', 'Electrical Engineering and Systems Science', 'Mathematics', 'Physics'] - `publish_date_start` : To filter papers published after this date (string in `%Y-%m-%dT%H:%M:%SZ`)	-
Wikipedia	wikipedia_01	-	Launching Soon
News	news_01	-	Launching Soon
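
Since the available filters differ per dataset, a local check against the table above can catch typos before a query is sent. This is a sketch only: `AVAILABLE_FILTERS` and `check_filters` are hypothetical names, the values are copied from the table, and how the service itself treats unknown filters is not documented here.

```python
# Filter names and allowed values copied from the Datasets table.
# None means the filter takes a free-form value (here, a date string).
AVAILABLE_FILTERS = {
    "podcasts_01": {"category": {"Finance", "Business", "Technology"}},
    "arxiv_01": {
        "category": {
            "Computer Science", "Quantitative Biology", "Economics",
            "Quantitative Finance", "Statistics",
            "Electrical Engineering and Systems Science",
            "Mathematics", "Physics",
        },
        "publish_date_start": None,
    },
}


def check_filters(dataset_id: str, config: dict) -> list:
    """Return a list of problems found in config["filters"]; empty if none."""
    allowed = AVAILABLE_FILTERS.get(dataset_id)
    if allowed is None:
        return [f"no filter information for dataset {dataset_id!r}"]
    problems = []
    for name, value in config.get("filters", {}).items():
        if name not in allowed:
            problems.append(f"filter {name!r} is not listed for {dataset_id}")
            continue
        choices = allowed[name]
        if choices is not None:
            values = value if isinstance(value, (list, tuple)) else [value]
            bad = [v for v in values if v not in choices]
            if bad:
                problems.append(f"unlisted {name} values: {bad}")
    return problems
```

An empty list means the config only uses filters and values that appear in the table.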

Get involved

We are early, and we want to know how you use this library and what else you would need to get the most out of it.

Join us on our Discord, or feel free to email us at hello@embedding.store!

Project details

Release history

0.0.6: Jun 23, 2023
0.0.5: Jun 13, 2023
0.0.4: Jun 13, 2023
0.0.3 (this version): Jun 13, 2023
0.0.2: Jun 13, 2023
0.0.1: Jun 12, 2023

Download files

Source Distributions

No source distribution files are available for this release.

Built Distribution

embedstore-0.0.3-py3-none-any.whl (7.5 kB)

Uploaded Jun 13, 2023 Python 3

Hashes for embedstore-0.0.3-py3-none-any.whl

Algorithm	Hash digest
SHA256	`619220e168db89c085e1b063121638c5d5a7fbf387afadedccb1a6ca6883ed91`
MD5	`94bd26018ba7e0e478d6f4987136899a`
BLAKE2b-256	`306c2af5f3ba35d77125b7ed7fd24e90b9ba6cfb42c4ffb3ac8bcded7f0b3edf`
