Skip to main content

A modular Python package for implementing Retrieval Augmented Generation chains for the chATLAS project.

Project description

chATLAS_Chains

This package implements and benchmarks various Retrieval Augmented Generation (RAG) chains for use in the chATLAS project.

Installation

From PyPI

pip install chATLAS-Chains

From source

We recommend using uv

cd chATLAS_Chains
uv sync

Environment variables

These are required for the following use cases

  1. Using an OpenAI LLM
export CHATLAS_OPENAI_KEY="your api key"
  1. Using LLMs via the Groq API
export CHATLAS_GROQ_BASE_URL="http://cs-513-ml003:3000"
export CHATLAS_GROQ_KEY="your groq api key"

note The API address is local to the CERN network. If not at CERN, you can forward it like so:

ssh -L 3000:cs-513-ml003:3000 <LXPLUS_USERNAME>@lxplus.cern.ch
export CHATLAS_GROQ_BASE_URL="http://localhost:3000"

Available Chains

  • chains.basic.basic_retrieval_chain
  • chains.basic_graph.basic_retrieval_graph
  • chains.advanced.advanced_rag

Postgres

If you want to create a local postgres server, you need to install psql. Some instructions to do this on macOS using homebrew are here:

Software install

brew install postgresql
brew services start postgresql
brew install pgvector
brew unlink pgvector && brew link pgvector

Create a user

psql -h localhost -U postgres
ALTER USER postgres WITH PASSWORD 'Set_your_password_here';
CREATE EXTENSION IF NOT EXISTS vector;

CHANGELOG

0.1.5

Fix missing retry_config argument in advanced_rag caused by early PyPI upload

0.1.4

Support for Groq-hosted models

Some new functions that go beyond the "basic RAG" workflow:

  • Reciprocal Rerank Fusion chATLAS_Chains.documents.rrf.reciprocal_rank_fusion
  • Document reranking via the Pinecone API chATLAS_Chains.documents.rerank.rerank_documents
  • Query rewriting step chATLAS_Chains.query.query_rewriting.rewrite_query

These are all usable via the new chain chATLAS_Chains.chains.advanced.advanced_rag

Added unit tests to gitlab CI/CD pipeline

0.1.3

Fixing imports

Changed output format of basic_retrieval_chain (docs key is now a list of Document objects, rather than a dict)

Unit tests for basic_retrieval_chain

0.1.2

Unit tests

First Langgraph chain

0.1.1

Initial Release


📄 License

chATLAS_Benchmark is released under Apache v2.0 license.


Made with ❤️ by the ATLAS Collaboration

For questions and support, please contact

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chatlas_chains-0.1.5.tar.gz (52.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

chatlas_chains-0.1.5-py3-none-any.whl (61.3 kB view details)

Uploaded Python 3

File details

Details for the file chatlas_chains-0.1.5.tar.gz.

File metadata

  • Download URL: chatlas_chains-0.1.5.tar.gz
  • Upload date:
  • Size: 52.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for chatlas_chains-0.1.5.tar.gz
Algorithm Hash digest
SHA256 41857dbda66f4c2e60a06b6823a4eb71fd7b62c91c8492c8beecbe3a9106f86b
MD5 b012518bdf8ee68ab0bfaf723d6cb1dd
BLAKE2b-256 c7458d6ea027a4c469100ebec701d47f60c96f99a2f9e3c83cbadbef59825936

See more details on using hashes here.

File details

Details for the file chatlas_chains-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for chatlas_chains-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 f161aedb204c1a77f9fe7472733f64f08ae90509c6967d877f85efdb1b1f26ae
MD5 9cf894b1766c36948a041d8c82026508
BLAKE2b-256 dd2f35c8a3af0a041bb1ff54e5f207be24f30e1f60592b828c842190fdc437ab

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page