Pebblo Gen-AI Data Analyzer
Project description
Pebblo enables developers to safely load data and promote their Gen AI app to deployment without worrying about the organization’s compliance and security requirements. The project identifies semantic topics and entities found in the loaded data and summarizes them on the UI or a PDF report.
Pebblo has two components.
- Pebblo Daemon - a REST api application with topic-classifier, entity-classifier and reporting features
- Pebblo Safe DataLoader - a thin wrapper to Gen-AI framework's data loaders
Pebblo Daemon
Installation
Pre-requisites
Mac OSX
brew install pango
Linux (debian/ubuntu)
sudo apt-get install libpango-1.0-0 libpangoft2-1.0-0
Install Pebblo Daemon
pip install pebblo
Run Pebblo daemon
pebblo
see troubleshooting guide for troubleshooting info.
Pebblo daemon now listens to localhost:8000
to accept Gen-AI application data snippets for inspection and reporting.
Pebblo Optional Flags
--config <file>
: Specifies a custom configuration file in yaml format.
pebblo --config config.yaml
Pebblo Safe DataLoader
Langchain
Pebblo Safe DataLoader
is natively supported in Langchain framework. It is available in Langchain versions >=0.1.7
Enable Pebblo in Langchain Application
Add PebbloSafeLoader
wrapper to the existing Langchain document loader(s) used in the RAG application. PebbloSafeLoader
is interface compatible with Langchain BaseLoader
. The application can continue to use load()
and lazy_load()
methods as it would on an Langchain document loader.
Here is the snippet of Lanchain RAG application using CSVLoader
before enabling PebbloSafeLoader
.
from langchain.document_loaders.csv_loader import CSVLoader
loader = CSVLoader(file_path)
documents = loader.load()
vectordb = Chroma.from_documents(documents, OpenAIEmbeddings())
The Pebblo SafeLoader can be enabled with few lines of code change to the above snippet.
from langchain.document_loaders.csv_loader import CSVLoader
from langchain_community.document_loaders.pebblo import PebbloSafeLoader
loader = PebbloSafeLoader(
CSVLoader(file_path),
name="acme-corp-rag-1", # App name (Mandatory)
owner="Joe Smith", # Owner (Optional)
description="Support productivity RAG application", # Description (Optional)
)
documents = loader.load()
vectordb = Chroma.from_documents(documents, OpenAIEmbeddings())
See here for samples with Pebblo enabled RAG applications and this document for more details.
Contribution
Pebblo is a open-source community project. If you want to contribute see Contributor Guidelines for more details.
License
Pebblo is released under the MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file pebblo-0.1.11-py3-none-any.whl
.
File metadata
- Download URL: pebblo-0.1.11-py3-none-any.whl
- Upload date:
- Size: 3.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2b283d07212876f924c52942f6151bcce7fd715fbdb03e2f15cc60b6e35c4973 |
|
MD5 | 3c88229484a96e4701572c85ebdc01a6 |
|
BLAKE2b-256 | ce000f7e583b597cffb740a4918501ee3da4d29ea65defe1bcecda4a6dd1afc1 |