A simple package for implementing RAG
Project description
GRAG - Good RAG
GRAG is a simple python package that provides an easy end-to-end solution for implementing Retrieval Augmented Generation (RAG).
The package offers an easy way for running various LLMs locally, Thanks to LlamaCpp and also supports vector stores like Chroma and DeepLake. It also makes it easy to integrage support to any vector stores easy.
Table of Content
Project Overview
- A ready to deploy RAG pipeline for document retrival.
- Basic GUI (Under Development)
- Evaluation Suite (Under Development)
- RAG enhancement using Graphs (Under Development)
Getting Started
To run the projects, make sure the instructions below are followed.
Further customization can be made on the config file, src/config.ini.
git clonethe repositorypip install .from the repository (note: add - then change directory to the cloned repo)- For Dev:
pip install -e .
Requirements
Required packages to install includes (refer to pyproject.toml):
- PyTorch
- LangChain
- Chroma
- Unstructured.io
- sentence-embedding
- instructor-embedding
LLM Models
To quantize model, run:
python -m grag.quantize.quantize
For more details, go to .\llm_quantize\readme.md Tested models:
- Llama-2 7B, 13B
- Mixtral 8x7B
- Gemma 7B
Model Compatibility
Refer to llama.cpp Supported Models (under Description) for list of compatible models.
Supported Vector Databases
1. Chroma
Since Chroma is a server-client based vector database, make sure to run the server.
- To run Chroma locally, move to
src/scriptsthen runsource run_chroma.sh. This by default runs on port 8000. - If Chroma is not run locally, change
hostandportunderchromainsrc/config.ini.
2. Deeplake
For more information refer to Documentation.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file grag-0.0.1.tar.gz.
File metadata
- Download URL: grag-0.0.1.tar.gz
- Upload date:
- Size: 28.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.27.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
93fe46e8dcfe45c5a1e724fb52190479496245f575791cd16c13f5d9a3d966b7
|
|
| MD5 |
539cb6ea42580e0b27850ee3827fb6d5
|
|
| BLAKE2b-256 |
5594d425a409a24edfdef99d8243ab0ec45da080217191996de619e623d875c4
|
File details
Details for the file grag-0.0.1-py3-none-any.whl.
File metadata
- Download URL: grag-0.0.1-py3-none-any.whl
- Upload date:
- Size: 56.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.27.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fe4b01a70ff05d409e284f2b39324c671d7746799ecc2f20c9b1ed34e818cf06
|
|
| MD5 |
70fd7a645d82f236bcda6150877d8796
|
|
| BLAKE2b-256 |
0ccd58344eba5ac8c6e00eac7cab68523872ee047f3a80fd7d013d20ca72dff0
|