query About Anything package: A Q&A application powered by LLMs and RAG on custom data.

These details have not been verified by PyPI

Project links

Project description

qA_Ap

query About Anything package

Python License

📝 Description

This package is a simple solution for implementing a Retrieval Augmented Generation (RAG) on custom documents. The database and LLM interfaces are modular.

Supports an all local setup with a flat file database and Ollama to a totally cloud based setup with a Baserow database and Cerebras (both free to use).

An optional API server (bottle.py) with custom authentication and an integrated frontend is available to query your documents simply and immediately.

Architecture

The logic of this package is to query an LLM to find Documents via RAG and expose them in the response. The main purpose would be to ask for solutions or informations for a certain need and have a nice response that presents a set of tools or solutions with direct links when possible.

A Document is defined as a qA_Ap.classes.Document class that consists of a text content, a title, any metadata and any medias.

Support for Notes (qA_Ap.classes.Note) is implemented though quite minimal at this time. Each document can have multiple Notes wich consists of a title, a text content, any metadata and any medias.

As this package is made with a frontend focus the compiles a catalog.json (containing title, short description and metadata of all documents) that should be used to do the search and filtering in the view then query the API for complete a document (and its notes) when needed.

Along with the catalog, indexes can be compiled for each unique values of a given metadata field. This is useful to get directly the list of all existing values for a document attribute. (i.e. your documents can have a tag field and indexing the tag attribute creates a list of all existing tags value that you can use to filter, display or populate a selection dropdown...)

For a better customization, the database and LLM are modulars. Implement your own qA_Ap.db.qaapDB class or qA_Ap.app.ai.AIInterface to suit your needs. Each of these classes are totally ignorant of the rest of the app and should only takes and returns native python types.

qA_Ap diagram

📦 Key Dependencies

oyaml >= 1.0
numpy >= 2.3.2
sentence-transformers >= 5.0.0
semantic-text-splitter >= 0.27.0
bottle >= 0.13.4
faiss-cpu >= 1.12.0             # install faiss if your gpu is suitable
# faiss >= 1.5.3
safer >= 5.1.0                  # required if you use the FlatFileDB or AnyFolderDB
ollama >= 0.5.3                 # required is you use the OllamaAIInterface
cerebras-cloud-sdk >= 1.46.0    # required if you use the CerebrasAIInterface

📁 Package Structure

qA_Ap # setup method and aliases to core components

qA_Ap.app # documents manipulation methods

qA_Ap.app.catalog # Catalog related functions

qA_Ap.app.ai # internal Vectorstore class for the RAG
qA_Ap.app.ai.interfaces # abstract classes for AI interface
qA_Ap.app.ai.interfaces.ollama # Ollama interface
qA_Ap.app.ai.interfaces.cerebras # Cerebras 'personal tier' interface
qA_Ap.app.ai.methods # AI related methods

qA_Ap.classes # Document and Note classes
qA_Ap.classes.errors # application errors
qA_Ap.classes.errors.db # databse errors

qA_Ap.db # abstract class for database
qA_Ap.db.baserowfreeapi # Baserow free API database class
qA_Ap.db.flatfiledb # flat file stuctured database class

qA_Ap.state # global objects used accross the app

qA_Ap.web # controls integrated frontend view
qA_Ap.web.api # API server

🛠️ How to use

You can find a raw documentation here (it will be refined)

Python Setup

Install Python v3.10+ (3.12 recommended)
Create a virtual environment: python -m venv venv
Activate the environment:
- Windows: venv\Scripts\activate
- Unix/MacOS: source venv/bin/activate
Install dependencies: pip install -r requirements.txt

`qA_Ap.init()` method

All the setup can be made with one method call:

import qA_Ap as qp

qp.init()

Customize your setup with these parameters:

database (str | ottoDB.ottoDB, optional): The database to use. Can be a path to a flat file database (uses a FlatFileDB instance) or an instance of ottoDB. Defaults to "data/qaap_db".
ai (AIInterface.AIInterface | str, optional): The AI interface to use. Can be a model name (if so it uses an OllamaAIInterface) or an instance of AIInterface. Defaults to "qwen3:0.6b".
embeddings_model (str, optional): The embeddings model to use via SentenceTransformer. Can be a local path or HuggingFace project name. Defaults to "Qwen3-Embedding-0.6B".
object_of_search (str, optional): The object of search. Will be replaced in the system_prompt. Defaults to "solutions".
system_prompt (str, optional): The system prompt to use. Defaults to qA_Ap.default_system_prompt:

You are an assistant for question-answering tasks.
Your role is to guide the user to find {object_of_search} for its need.
Use the following pieces of retrieved context to answer the question if it matches the user needs.
Do not mention any context that do not matches the question.
Use your general knowledge if the context is lacking.
Use the same language as the question and keep the answer concise.
After your answer, list all relevant documents name between brackets.

Question: {question} 
Context: {context}
Answer:

api_server (int | dict | False, optional): The port on wich to run the API server (bottle.py). If a dictionary is provided, it will be used as the server configuration. If False the server is not run. Defaults to 8080.
auth (bool, optional): Whether to enable authentication on the API server POST endpoints. Defaults to False.
frontend (bool, optional): Whether to run the integrated frontend interface. Defaults to False.

Core objects

qA_Ap.globals

The globals module contains globals variables used accross the app:

database: the current qA_Ap.db.qaapDB class instance used a database
ai_interface: The current qA_Ap.app.ai.interfaces.AIInterface class instance used to query the LLM
vectorstore: The qA_Ap.app.ai.VectorStore class instance used to store embedded documents and retrieve them by similarity search
path_to_emmbeddings_model: The path (local or HuggingFace) to the embeddings model used to vectorize the documents and the query
system_prompt: The system prompt for each LLM query (must contain the interpolated fields {context},{history} and {object_of_search})
object_of_search: The specific naming of what the LMM should find for you. Is interpolated in the system_prompt.

the qA_Ap package has four useful aliases:

qA_Ap.query

The query method does the RAG and outputs the streamed response.

def query(
   query: str,
   history: list[dict[str,str]] = None, 
   include_metadata: bool = False
) -> AIStreamResponse:

query: The user prompt
history: The optional chat history as a list of dict containing a key role that can be set to "user" or "assistant" and a key content with the corresponding message.
include_metadata: Bool to includes the complete metadata of retrieved documents at the end of the stream.

AIStreamResponse is an iterator that wraps the LMM stream response and returns the metadata as the last chunk if enabled.

qA_Ap.server

The Bottle.py server instance that runs the API (and the optional integrated frontend)

qA_Ap.compile_catalog

This method will compile each documents and write the catalog.json file.

qA_Ap.compile_attribute

This method takes an attribute_name (str) and reads each unique attribute values for the given attribute name from the catalog.json file and write the according <attribute_name>.txt.

🚀 Roadmap

These are the planned improvements and features:

⬜ Includes the Notes content in the vectorstore

⬜ Write a detailled documentation and github wiki

⬜ Develop a totally frontend solution with transformers.js

⬜ Develop a Flet interface to query your local documents

📜 License

This project is licensed under the MIT License.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.9.1

Sep 1, 2025

0.2.9

Sep 1, 2025

0.2.8

Aug 31, 2025

0.2.7

Aug 31, 2025

0.2.6

Aug 31, 2025

This version

0.2.3

Aug 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qa_ap-0.2.3.tar.gz (1.2 MB view details)

Uploaded Aug 30, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

qa_ap-0.2.3-py3-none-any.whl (98.6 kB view details)

Uploaded Aug 30, 2025 Python 3

File details

Details for the file qa_ap-0.2.3.tar.gz.

File metadata

Download URL: qa_ap-0.2.3.tar.gz
Upload date: Aug 30, 2025
Size: 1.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for qa_ap-0.2.3.tar.gz
Algorithm	Hash digest
SHA256	`79b5551dee62b4f44fbdba333467f83910a8f7c494d80e93b6531d91610087ad`
MD5	`6a191b5313d927e05826c2ed8b0cef28`
BLAKE2b-256	`f58e650798e1b22486985c961c6abd69c506c8bd9014ee313667f706bfaaf58d`

See more details on using hashes here.

File details

Details for the file qa_ap-0.2.3-py3-none-any.whl.

File metadata

Download URL: qa_ap-0.2.3-py3-none-any.whl
Upload date: Aug 30, 2025
Size: 98.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for qa_ap-0.2.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`171acee81be711bbb75582a89488e1010f404844d8ba62205c23db8ae59e679d`
MD5	`f82f3efb82c19b70b6ac4649943f9935`
BLAKE2b-256	`034dff04eef63522b11b8a311ae0f81e3bf26bf86f0c9c63e8ee790202e3f801`

See more details on using hashes here.

qA-Ap 0.2.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

qA_Ap

query About Anything package

📝 Description

Architecture

📦 Key Dependencies

📁 Package Structure

🛠️ How to use

Python Setup

qA_Ap.init() method

Core objects

qA_Ap.globals

qA_Ap.query

qA_Ap.server

qA_Ap.compile_catalog

qA_Ap.compile_attribute

🚀 Roadmap

📜 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`qA_Ap.init()` method