Skip to main content

Neural Retrieval-Augmented Generation for GitHub code blocks

Project description

LLM Retrieval Augmented Generation

GitHub release
short alias lrag

The original version of the NN RAG project was created by Waleed Khalid at the Computer Vision Laboratory, University of Würzburg, Germany.

Overview 📖

A minimal Retrieval-Augmented Generation (RAG) pipeline for code and dataset details.
This project aims to provide LLMs with additional context from the internet or local repos, then optionally fine-tune the LLM for specific tasks.

Requirements

  • Python 3.8+ recommended
  • Pip or Conda for installing dependencies
  • (Optional) GPU with CUDA if you plan to do large-scale training

Installing Dependencies

  1. Create and activate a virtual environment (recommended):
    python -m venv venv
    source venv/bin/activate   # Linux/Mac
    venv\Scripts\activate      # Windows
    
  2. Latest Development Version

Install the latest version directly from GitHub:

pip install git+https://github.com/ABrain-One/nn-rag --upgrade

Usage

Command Line Interface

The package provides a command-line interface for extracting neural network blocks:

# Correct way to run (recommended)
python3 -m ab.rag --help

# Extract a specific block
python3 -m ab.rag --block ResNet

# Extract multiple blocks
python3 -m ab.rag --blocks ResNet VGG DenseNet

# Extract from JSON file (default)
python3 -m ab.rag

# Note: Avoid running 'python3 -m ab.rag.extract_blocks' as it may show warnings

Python API

from ab.rag import BlockExtractor, BlockValidator

# Initialize extractor
extractor = BlockExtractor()

# Warm up the index (clones repos and indexes if needed)
extractor.warm_index_once()

# Extract a single block
result = extractor.extract_single_block("ResNet")

# Extract multiple blocks
results = extractor.extract_multiple_blocks(["ResNet", "VGG"])

# Extract from JSON file (uses default nn_block_names.json)
results = extractor.extract_blocks_from_file()

# Extract with limit
results = extractor.extract_blocks_from_file(limit=10)

# Extract with custom JSON file
results = extractor.extract_blocks_from_file("custom_blocks.json")

# Extract with start_from parameter
results = extractor.extract_blocks_from_file(start_from="ResNet", limit=5)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nn_rag-2.0.0.tar.gz (82.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nn_rag-2.0.0-py3-none-any.whl (93.8 kB view details)

Uploaded Python 3

File details

Details for the file nn_rag-2.0.0.tar.gz.

File metadata

  • Download URL: nn_rag-2.0.0.tar.gz
  • Upload date:
  • Size: 82.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for nn_rag-2.0.0.tar.gz
Algorithm Hash digest
SHA256 8777db716d1cfec8fb5e58cc3fa7545180072ea09a812c78faf1fd5e349c8f0a
MD5 f5c3b0b119bef2ee0d1fb7eb6bbf8670
BLAKE2b-256 33a213935d1a93e1e1f25b1519c97207b91ca721cea33ab5d5b825c51c92a196

See more details on using hashes here.

File details

Details for the file nn_rag-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: nn_rag-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 93.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for nn_rag-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 33d2b3afba148e0098218c87552489fca1b9f772e5f4a2192cf3b24cce55ef44
MD5 0e4a370966701722fd5b94d82d006068
BLAKE2b-256 72ff37c7b81b257faaff62b016968001d6a19c602c5d5904eec6f6cd8a4270a1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page