Skip to main content

A tool to index Confluence content for RAG

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.

Project description

confluence-rag-indexer

Index confluence into a RAG database using pg_vector.

Categorizes documents prior to indexing and allows you to skip specific classifications like "Meeting Notes".

Installation

pip install confluence-rag-indexer

Usage

usage: Confluence to Rag Indexer [-h] [--cutoff CUTOFF] [--confluence-domain CONFLUENCE_DOMAIN] [--confluence-email CONFLUENCE_EMAIL]
                                 [--confluence-api-key CONFLUENCE_API_KEY] [--openai-api-key OPENAI_API_KEY]
                                 [--ignore-classifications IGNORE_CLASSIFICATIONS [IGNORE_CLASSIFICATIONS ...]] [--postgres-url POSTGRES_URL] [-v]
                                 [space ...]

positional arguments:
  space                 The Confluence space(s)

options:
  -h, --help            show this help message and exit
  --cutoff CUTOFF       The cutoff date for Confluence content
  --confluence-domain CONFLUENCE_DOMAIN
                        The Confluence domain
  --confluence-email CONFLUENCE_EMAIL
                        The Confluence email
  --confluence-api-key CONFLUENCE_API_KEY
                        The Confluence API key
  --openai-api-key OPENAI_API_KEY
                        The OpenAI API key
  --ignore-classifications IGNORE_CLASSIFICATIONS [IGNORE_CLASSIFICATIONS ...]
                        Ignore documents with these classifications
  --postgres-url POSTGRES_URL
                        The PostgreSQL URL
  -v, --verbose

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

confluence_rag_indexer-0.2.0.tar.gz (9.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

confluence_rag_indexer-0.2.0-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file confluence_rag_indexer-0.2.0.tar.gz.

File metadata

  • Download URL: confluence_rag_indexer-0.2.0.tar.gz
  • Upload date:
  • Size: 9.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.7

File hashes

Hashes for confluence_rag_indexer-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b3472d33053257156691c2090a71fc23b67047714032c29b25cade8de5a262a1
MD5 d7b1d05f8e5b097154856c0bd32f5a33
BLAKE2b-256 0fc1eb1b2264a32dfae1fee48807f6c4ec9cfdee25f24e83ec6c22f5dda6f4e5

See more details on using hashes here.

File details

Details for the file confluence_rag_indexer-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for confluence_rag_indexer-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85b2100d3f19b3a57cfea906e035b9271de175c0d464ff7f2d8e97e3c4e452d3
MD5 59cec54d3c54665d7af09051dcf337ae
BLAKE2b-256 a9e3c20427894700d44bcf6f02902f483aa0e0a99573ef25d930257f4b5be973

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page