# WattElse

*Flexible RAG*
WattElse is an NLP suite developed for the needs of RTE (Réseau de Transport d'Electricité).
It is composed of two main modules:
- a simple chatbot interface to interact with any LLM -> WattElse GPT
- a Retrieval Augmented Generation (RAG) application -> WattElse DOC
Some services are used by several applications/users at the same time. To optimize resource use, these services are implemented as APIs. A description of these services is available in `wattelse/api`.

WattElse also includes helper modules that provide additional functionality such as summarization, web scraping, and document parsing.
## Installation
Before trying to install WattElse, you first need to ensure you have:
- python >= 3.10
- sqlite3 >= 3.35
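You can check both prerequisites from the interpreter you plan to use. The helper below is a small sketch that compares version tuples (the version strings in the comments are examples, not guaranteed values):

```python
import sys
import sqlite3

def version_tuple(version: str) -> tuple[int, ...]:
    """Parse a dotted version string like '3.35.5' into a comparable tuple."""
    return tuple(int(part) for part in version.split("."))

def meets_requirements(py_version: tuple[int, ...], sqlite_version: str) -> bool:
    """True if the running Python and its bundled SQLite satisfy WattElse's minimums."""
    return py_version >= (3, 10) and version_tuple(sqlite_version) >= (3, 35)

print("python :", sys.version.split()[0])
print("sqlite3:", sqlite3.sqlite_version)
print("OK" if meets_requirements(sys.version_info[:3], sqlite3.sqlite_version)
      else "upgrade needed")
```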
Clone this project:

```shell
git clone https://github.com/rte-france/wattelse.git
```

Create a virtual environment and activate it:

```shell
cd wattelse
python -m venv .venv
source .venv/bin/activate
```

Install WattElse and its dependencies:

```shell
./install.sh
```
## Environment variables

To run WattElse, you need to set the following environment variables:

- `WATTELSE_BASE_DIR`: path where WattElse data will be stored
- `WATTELSE_CLIENT_SECRET`: secret used by the Django WattElse app to communicate with internal APIs. This value should match the one defined in the `$HOME/client_registry.json` file. Refer to the API documentation for details.
- `DJANGO_SECRET_KEY`: Django secret key (see the Django documentation)
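For example, in your shell profile (the path and secret values below are placeholders, not defaults shipped with WattElse):

```shell
# Illustrative values only; adapt them to your setup
export WATTELSE_BASE_DIR="$HOME/wattelse_data"
export WATTELSE_CLIENT_SECRET="change-me"               # must match $HOME/client_registry.json
export DJANGO_SECRET_KEY="django-insecure-change-me"    # generate a real one, see below
```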
To create a Django `SECRET_KEY`, run the following code in a Python shell:

```python
from django.core.management.utils import get_random_secret_key

print(get_random_secret_key())
```
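If Django is not yet installed in the environment you are typing in, an equivalent key can be generated with the standard library alone; this mirrors what `get_random_secret_key` does (50 characters drawn from a fixed alphabet):

```python
import secrets

# Same alphabet and length as Django's get_random_secret_key()
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789!@#$%^&*(-_=+)"

def random_secret_key(length: int = 50) -> str:
    """Return a cryptographically random key suitable for DJANGO_SECRET_KEY."""
    return "".join(secrets.choice(ALPHABET) for _ in range(length))

print(random_secret_key())
```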
## Django initialization

To initialize the Django database, follow these steps:

- Go to the Django `web_app` folder:

```shell
cd wattelse/web_app
```

- Create Django tables:

```shell
python manage.py makemigrations
python manage.py migrate
```

- Create a Django superuser:

```shell
python manage.py createsuperuser
```

- Collect static files (needed when `DEBUG=False` in Django):

```shell
python manage.py collectstatic
```
## Launch WattElse

To launch WattElse with all services, go to the WattElse root folder and run:

```shell
./start_all_services.sh
```

This script starts all services in separate `screen` sessions:

- Embedding API
- RAGOrchestrator API
- Django server

If everything has been set up correctly, the Django web app should be running at http://localhost:8000.
For your first connection, you can use the superuser credentials you created earlier or create a new user.
## Stop WattElse

To stop all services, run:

```shell
./stop_all_services.sh
```
## Hardware requirements

By default, WattElse only loads an embedding model on start. It requires around 2 GB of VRAM if loaded on a GPU. To change the embedding model, see `wattelse/api` and `wattelse/api/embedding`.
The LLM used depends on the RAG config. By default, no local LLM is loaded, so you need to link the RAG config to a remote LLM (OpenAI, Azure, ...). For RAG config management, see `wattelse/rag_backend`.
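As an illustration, a remote-LLM configuration might look like the following sketch. The keys and values here are hypothetical, not WattElse's actual schema; check `wattelse/rag_backend` for the real format:

```python
# Hypothetical RAG configuration fragment; all field names are illustrative only
rag_config = {
    "llm": {
        "provider": "openai",                     # remote endpoint, no local model loaded
        "base_url": "https://api.openai.com/v1",  # or an Azure OpenAI endpoint
        "model": "gpt-4o-mini",
        "temperature": 0.2,
    },
    "retriever": {
        "top_k": 4,  # number of document chunks passed to the LLM as context
    },
}

print(rag_config["llm"]["provider"])
```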
If you want to load a local LLM using vLLM, you need to have enough VRAM to load the model. For example, the llama-3.1-8B-instruct model requires around 16GB of VRAM.
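The 16 GB figure follows from a rough rule of thumb: model weights alone take parameter count times bytes per parameter (2 bytes in fp16/bf16), before counting KV cache and activation overhead. A minimal sketch:

```python
def weight_vram_gb(n_params_billion: float, bytes_per_param: int = 2) -> float:
    """Approximate VRAM (in GB) needed for model weights alone.

    fp16/bf16 weights take 2 bytes per parameter; KV cache and activations
    add more on top, so treat this as a lower bound.
    """
    return n_params_billion * bytes_per_param

# An 8B-parameter model in fp16: ~16 GB just for the weights
print(weight_vram_gb(8))
```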
## RAG service

### Overview of main steps
```mermaid
flowchart TB
    subgraph Preprocessing["Data Preprocessing"]
        direction TB
        RawDocs["Raw Documents"]
        TextChunks["Text Chunks"]
        VectorDB[(Vector Database)]
        RawDocs --> |"Text Splitting"| TextChunks
        TextChunks --> |"Embedding Generation"| VectorDB
    end
    subgraph QueryProcessing["Query Processing"]
        direction TB
        UserQuery["User Query"]
        QueryEmbed["Query Embedding"]
        SimilaritySearch["Similarity Search"]
        ContextRetrieval["Context Retrieval"]
        UserQuery --> QueryEmbed
        QueryEmbed --> SimilaritySearch
        SimilaritySearch --> ContextRetrieval
    end
    subgraph ResponseGeneration["Response Generation"]
        direction TB
        LLM["Large Language Model"]
        Response["Generated Response"]
        ContextRetrieval --> LLM
        UserQuery --> LLM
        LLM --> Response
    end
    VectorDB --> SimilaritySearch
```
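The preprocessing and query steps above can be sketched in plain Python. The toy `embed` function below stands in for a real embedding model (WattElse delegates this to its Embedding API), and cosine similarity plays the role of the vector database's similarity search:

```python
import math

def embed(text: str) -> dict[str, float]:
    """Toy bag-of-words 'embedding'. A real system would call an embedding model."""
    vec: dict[str, float] = {}
    for word in text.lower().split():
        vec[word] = vec.get(word, 0.0) + 1.0
    return vec

def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, chunks: list[str], top_k: int = 2) -> list[str]:
    """Similarity search: rank chunks by cosine similarity to the query embedding."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:top_k]

chunks = ["grid maintenance schedule", "transformer failure report", "cafeteria menu"]
print(retrieve("transformer maintenance", chunks))
```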
### Description of components
```mermaid
flowchart TB
    subgraph Input
        User[User Interface]
    end
    subgraph "RAG Core Components"
        RAG[RAG Backend]
        LLM[LLM Service]
        Embedding[Embedding Service]
    end
    subgraph "Storage Layer"
        VectorDB[(Vector Database)]
        DocStore[(Document Store)]
    end
    subgraph "Document Processing"
        Parser[Document Parser]
        Chunker[Text Chunker]
    end
    User <--> RAG
    RAG <--> LLM
    RAG <--> Embedding
    RAG <--> VectorDB
    RAG <--> DocStore
    DocStore <--> Parser
    Parser <--> Chunker
    classDef core fill:#e1f5fe,stroke:#01579b
    classDef storage fill:#e8f5e9,stroke:#1b5e20
    classDef processing fill:#fff3e0,stroke:#e65100
    classDef input fill:#f3e5f5,stroke:#4a148c
    class RAG,LLM,Embedding core
    class VectorDB,DocStore storage
    class Parser,Chunker processing
    class User input
```
### Simplified sequence diagram for RAG
```mermaid
sequenceDiagram
    participant User
    participant UI as User Frontend
    participant LLM as LLM Service
    participant RAG as RAG Backend
    participant Embedding as Embedding Service
    participant VectorDB as Vector Database
    participant DocStore as Document Store
    participant Parser as Document Parser
    participant Chunker as Text Chunker
    User->>UI: Authenticate
    UI-->>User: Authentication OK
    UI->>UI: Create Session
    User->>UI: Upload files
    UI->>RAG: New files
    RAG->>VectorDB: Check cache
    VectorDB->>VectorDB: check file names
    VectorDB-->>RAG: Files not in cache
    RAG->>Parser: Parse files(files)
    Parser->>Chunker: Chunk(documents)
    Chunker-->>RAG: documents chunks
    RAG->>DocStore: store(files)
    RAG->>VectorDB: store(documents chunks)
    VectorDB->>Embedding: compute embeddings(documents chunks)
    Embedding->>Embedding: compute
    Embedding-->>VectorDB: computed embeddings
    VectorDB->>VectorDB: store
    VectorDB-->>RAG: storage ok
    User->>UI: <br><br><br><br>Enter Query
    UI->>RAG: Submit Query
    RAG->>Embedding: Generate Query Embedding
    Embedding-->>RAG: Query Vector
    RAG->>VectorDB: Semantic Search
    VectorDB->>VectorDB: Similarity function
    VectorDB-->>RAG: Relevant Document Extracts
    RAG->>LLM: Generate Response (Query + Context + (history))
    LLM-->>RAG: Generated Response
    RAG-->>UI: Final Answer
    UI-->>User: Displayed answer
    User-->>UI: <br>Provide feedback
    UI-->>UI: Store feedback
    User->>UI: <br><br><br>Logout / Timeout
    UI->>UI: Kill Session
    Note over DocStore,Chunker: Document Processing Pipeline
    Note over RAG,VectorDB: Retrieval Pipeline
    Note over RAG,LLM: Generation Pipeline
```
### Main code dependencies
```mermaid
flowchart TB
    wattelse["wattelse"]
    subgraph ML["Machine Learning"]
        torch["torch"]
        sklearn["scikit-learn"]
        accelerate["accelerate"]
        scipy["scipy"]
        numpy["numpy"]
    end
    subgraph LLM["LLM & RAG"]
        langchain["langchain"]
        langchain_comm["langchain-community"]
        langchain_chroma["langchain-chroma"]
        langchain_openai["langchain-openai"]
        llama_index["llama-index-core"]
        openai["openai"]
        chromadb["chromadb"]
        sent_trans["sentence-transformers"]
        vllm["vllm"]
        fschat["fschat"]
        tiktoken["tiktoken"]
    end
    subgraph Web["Web Framework"]
        django["django"]
        fastapi["fastapi"]
        streamlit["streamlit"]
        uvicorn["uvicorn"]
    end
    subgraph Doc["Document Processing"]
        docxtpl["docxtpl"]
        python_docx["python-docx"]
        python_pptx["python-pptx"]
        pymupdf["pymupdf"]
        unstructured["unstructured"]
        mammoth["mammoth"]
        xlsx2html["xlsx2html"]
    end
    subgraph Data["Data Processing"]
        pandas["pandas"]
        plotly["plotly"]
        seaborn["seaborn"]
        bs4["bs4"]
    end
    wattelse --> ML
    wattelse --> LLM
    wattelse --> Web
    wattelse --> Doc
    wattelse --> Data
```
## Project details
### File: wattelse-1.2.3.tar.gz

- Size: 200.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.10

| Algorithm | Hash digest |
|---|---|
| SHA256 | `be8e37fbab628d6a66bf6b540093de580649c7e3800c3f649aec858035294bf2` |
| MD5 | `3d49f33b6e50e32afaf25afce28edde4` |
| BLAKE2b-256 | `0dbdf3e031f068303358722b390fad6d1aee6d4046d62bb4bc062ca630d7fda6` |
### File: wattelse-1.2.3-py3-none-any.whl

- Size: 278.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.10

| Algorithm | Hash digest |
|---|---|
| SHA256 | `57ad476fa8e899e7be196e06f5b63fc7a0f5d806b9ce1f8691d17620a2bf47f3` |
| MD5 | `6e4dc6ac4de93a04bb3dc70426988764` |
| BLAKE2b-256 | `dec56cce3de2db158271f06b9435a3b6765bb89e1e8f0cc32799ab5ce4de60c0` |