Skip to main content

Neural Retrieval-Augmented Generation for GitHub code blocks

Project description

LLM Retrieval Augmented Generation

GitHub release
short alias lrag

The original version of the NN RAG project was created by Waleed Khalid at the Computer Vision Laboratory, University of Würzburg, Germany.

Overview 📖

A minimal Retrieval-Augmented Generation (RAG) pipeline for code and dataset details.
This project aims to provide LLMs with additional context from the internet or local repos, then optionally fine-tune the LLM for specific tasks.

Requirements

  • Python 3.8+ recommended
  • Pip or Conda for installing dependencies
  • (Optional) GPU with CUDA if you plan to do large-scale training

Installing Dependencies

  1. Create and activate a virtual environment (recommended):
    python -m venv venv
    source venv/bin/activate   # Linux/Mac
    venv\Scripts\activate      # Windows
    
  2. Latest Development Version

Install the latest version directly from GitHub:

pip install git+https://github.com/ABrain-One/nn-rag --upgrade

Usage

Command Line Interface

The package provides a command-line interface for extracting neural network blocks:

# Correct way to run (recommended)
python3 -m ab.rag --help

# Extract a specific block
python3 -m ab.rag --block ResNet

# Extract multiple blocks
python3 -m ab.rag --blocks ResNet VGG DenseNet

# Extract from JSON file (default)
python3 -m ab.rag

# Note: Avoid running 'python3 -m ab.rag.extract_blocks' as it may show warnings

Python API

from ab.rag import BlockExtractor, BlockValidator

# Initialize extractor
extractor = BlockExtractor()

# Warm up the index (clones repos and indexes if needed)
extractor.warm_index_once()

# Extract a single block
result = extractor.extract_single_block("ResNet")

# Extract multiple blocks
results = extractor.extract_multiple_blocks(["ResNet", "VGG"])

# Extract from JSON file (uses default nn_block_names.json)
results = extractor.extract_blocks_from_file()

# Extract with limit
results = extractor.extract_blocks_from_file(limit=10)

# Extract with custom JSON file
results = extractor.extract_blocks_from_file("custom_blocks.json")

# Extract with start_from parameter
results = extractor.extract_blocks_from_file(start_from="ResNet", limit=5)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lrag-2.0.0.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lrag-2.0.0-py3-none-any.whl (3.8 kB view details)

Uploaded Python 3

File details

Details for the file lrag-2.0.0.tar.gz.

File metadata

  • Download URL: lrag-2.0.0.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for lrag-2.0.0.tar.gz
Algorithm Hash digest
SHA256 fcdce835be05c2f41fbc1b477f2c5e298272aa83080c5d2dd940d19f4221e7df
MD5 1f93d3fb42f75c5cfa34c604cdbcfd5d
BLAKE2b-256 2aa789fd92545e30ff35a46d3ec185821f4cafaaa2b24735413472b4d28c5b0c

See more details on using hashes here.

File details

Details for the file lrag-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: lrag-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for lrag-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9fbd6d81539d9ae867f4e8087c5d7b24e0c2f63593a96da456f33a87141b1acd
MD5 e4ca2f2edcafa18b406384593cd30342
BLAKE2b-256 c5f722be08b4531f3a354b00170b0d4b1e59c4d1e6a8aa2cb483ebcb9a3c2174

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page