Skip to main content

Pebblo Gen-AI Data Analyzer

Project description


GitHub MIT license Documentation

PyPI PyPI - Downloads PyPI - Python Version

Discord Twitter Follow

Pebblo enables developers to safely load data and promote their Gen AI app to deployment without worrying about the organization’s compliance and security requirements. The project identifies semantic topics and entities found in the loaded data and summarizes them on the UI or a PDF report.

Pebblo has two components.

  1. Pebblo Daemon - a REST api application with topic-classifier, entity-classifier and reporting features
  2. Pebblo Safe DataLoader - a thin wrapper to Gen-AI framework's data loaders

Pebblo Daemon

Installation

Pre-requisites

Mac OSX
brew install pango
Linux (debian/ubuntu)
sudo apt-get install libpango-1.0-0 libpangoft2-1.0-0

Install Pebblo Daemon

pip install pebblo

Run Pebblo daemon

pebblo

see troubleshooting guide for troubleshooting info.

Pebblo daemon now listens to localhost:8000 to accept Gen-AI application data snippets for inspection and reporting.

Pebblo Optional Flags

  • --config <file>: Specifies a custom configuration file in yaml format.
pebblo --config config.yaml

Pebblo Safe DataLoader

Langchain

Pebblo Safe DataLoader is natively supported in Langchain framework. It is available in Langchain versions >=0.1.7

Enable Pebblo in Langchain Application

Add PebbloSafeLoader wrapper to the existing Langchain document loader(s) used in the RAG application. PebbloSafeLoader is interface compatible with Langchain BaseLoader. The application can continue to use load() and lazy_load() methods as it would on an Langchain document loader.

Here is the snippet of Lanchain RAG application using CSVLoader before enabling PebbloSafeLoader.

    from langchain.document_loaders.csv_loader import CSVLoader

    loader = CSVLoader(file_path)
    documents = loader.load()
    vectordb = Chroma.from_documents(documents, OpenAIEmbeddings())

The Pebblo SafeLoader can be enabled with few lines of code change to the above snippet.

    from langchain.document_loaders.csv_loader import CSVLoader
    from langchain_community.document_loaders.pebblo import PebbloSafeLoader

    loader = PebbloSafeLoader(
                CSVLoader(file_path),
                name="acme-corp-rag-1", # App name (Mandatory)
                owner="Joe Smith", # Owner (Optional)
                description="Support productivity RAG application", # Description (Optional)
    )
    documents = loader.load()
    vectordb = Chroma.from_documents(documents, OpenAIEmbeddings())

See here for samples with Pebblo enabled RAG applications and this document for more details.

Contribution

Pebblo is a open-source community project. If you want to contribute see Contributor Guidelines for more details.

License

Pebblo is released under the MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pebblo_sample-0.1.11-py3-none-any.whl (3.1 MB view details)

Uploaded Python 3

File details

Details for the file pebblo_sample-0.1.11-py3-none-any.whl.

File metadata

  • Download URL: pebblo_sample-0.1.11-py3-none-any.whl
  • Upload date:
  • Size: 3.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.7

File hashes

Hashes for pebblo_sample-0.1.11-py3-none-any.whl
Algorithm Hash digest
SHA256 88890fb8d8ea34b39b6d06f3a624f1ba4b06a8fa9ca00140599fc231d26100b5
MD5 f41238321254bdfda8b636dca2a1718c
BLAKE2b-256 cc325b2af86205448c04627a7762104d666d47e06e8428951db6d719db35a71b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page