RAG doc search

Overview

This package offers a lightweight and straightforward solution for implementing Retrieval-Augmented Generation (RAG) with large language models (LLMs). RAG improves prediction quality by consulting external data stores at inference time, enabling the construction of more contextually rich prompts. By combining context, conversation history, and up-to-date knowledge, RAG-backed LLMs produce more accurate and relevant responses. This package streamlines the integration of RAG capabilities into applications, making it easier to build intelligent, context-aware conversational agents, search engines, and text generation systems.

Config JSON

{
    "ai_provider": "OPENAI" | "BEDROCK",
    "embeddings_model": "<embeddings model of OpenAI or Bedrock, per ai_provider>",
    "llm": "<LLM model of OpenAI or Bedrock, per ai_provider>",
    "llm_temperature": "<LLM temperature>",
    "llm_max_output_tokens": "<LLM max output tokens>",
    "vector_store_provider": "FAISS" | "PGVector",
    "faiss_vector_embeddings_location": "./data",  // provide only if vector_store_provider is FAISS
    "faiss_index_name": "faiss-db",                // provide only if vector_store_provider is FAISS
    "name": "<name of your project>",
    "retriever": {
        // See https://python.langchain.com/docs/modules/data_connection/retrievers/vectorstore
        // for an explanation of the retriever search_type and search_args.
        "search_type": "similarity" | "mmr" | "similarity_score_threshold",
        "search_args": {
            "k": 10,
            "fetch_k": 500,
            "lambda_mult": 0.1,
            "score_threshold": 0.1
        }
    }
}
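
For illustration, a filled-in configuration for OpenAI with a local FAISS store might look like the sketch below, written as a Python dict that you could pass to config_init (shown under How to Use). The model names, temperature, and retriever settings are placeholder assumptions, not package defaults.

# A hypothetical config dict for OpenAI + FAISS; the model names, paths,
# and retriever settings are illustrative placeholders, not package defaults.
example_config = {
    "ai_provider": "OPENAI",
    "embeddings_model": "text-embedding-ada-002",
    "llm": "gpt-3.5-turbo",
    "llm_temperature": 0.0,
    "llm_max_output_tokens": 512,
    "vector_store_provider": "FAISS",
    "faiss_vector_embeddings_location": "./data",
    "faiss_index_name": "faiss-db",
    "name": "my-rag-project",
    "retriever": {
        "search_type": "mmr",
        "search_args": {"k": 10, "fetch_k": 500, "lambda_mult": 0.1},
    },
}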

Note

Ensure that the necessary environment variables are set before initializing the configuration; a sketch for setting them locally follows the list below.

  • If the AI provider is OPENAI, the ENV variable OPENAI_API_KEY is required.
  • If the AI provider is BEDROCK, the ENV variables AWS_ACCESS_KEY and AWS_SECRET_ACCESS_KEY are required; AWS_REGION is optional and defaults to 'us-east-1'.
  • If the vector store provider is PGVector, then PGVECTOR_HOST, PGVECTOR_PORT, PGVECTOR_DATABASE, PGVECTOR_USER, and PGVECTOR_PASSWORD are required.
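
These variables can come from your shell or deployment environment; for a quick local test they can also be set in-process before calling config_init, as in this sketch (the values are placeholders):

import os

# Placeholder credentials; substitute real keys from your provider.
os.environ["OPENAI_API_KEY"] = "sk-..."  # required when ai_provider is OPENAI

# For BEDROCK instead:
# os.environ["AWS_ACCESS_KEY"] = "..."
# os.environ["AWS_SECRET_ACCESS_KEY"] = "..."
# os.environ["AWS_REGION"] = "us-east-1"  # optional; defaults to 'us-east-1'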

How to Use

from rag_doc_search import config_init, get_bot_instance
from rag_doc_search.utils.config import Config
from rag_doc_search.src.enums.provider import AIProvider

# Pass your config JSON to config_init below, and set the required ENV
# variables as mentioned in the Note above.
config = config_init({})

# Get a bot instance for the configured AI provider (Bedrock or OpenAI).
bot_instance = get_bot_instance(config.ai_provider)

qa_instance = bot_instance.create_qa_instance()

result = qa_instance.invoke(input="Provide your prompt here")

print(result)
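
Building on the snippet above, a minimal interactive loop (assuming invoke accepts an input keyword and returns a printable result, as shown) could look like:

# Minimal question-answer loop over the QA instance; assumes qa_instance
# was created as in the snippet above.
while True:
    prompt = input("Question (blank to quit): ").strip()
    if not prompt:
        break
    print(qa_instance.invoke(input=prompt))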

Examples

To understand how to use the package, refer to the following examples:

  1. To create an API that answers your questions, use the example provided here (a rough sketch of this pattern also follows this list).

  2. To create a WebSocket API for streaming answers to your questions, use the example provided here.
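
For orientation, a rough sketch of the first pattern is shown below, using FastAPI as one possible framework; the framework choice is an assumption for illustration, and the linked example remains the reference implementation.

# A hypothetical minimal HTTP API around the package, using FastAPI
# (an assumption; the official example may be structured differently).
from fastapi import FastAPI
from pydantic import BaseModel

from rag_doc_search import config_init, get_bot_instance

app = FastAPI()
config = config_init({})  # supply your config JSON and ENV as described above
qa_instance = get_bot_instance(config.ai_provider).create_qa_instance()

class Question(BaseModel):
    prompt: str

@app.post("/ask")
def ask(question: Question):
    # str() assumes the result is representable as text, as in the usage snippet.
    return {"answer": str(qa_instance.invoke(input=question.prompt))}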
