Skip to main content

Simple Python CLI tool for turning directories into a single text file for input to LLMs.

Project description

grabit

grabit is a command-line tool for recursively scanning a directory, extracting file contents, and saving or copying them to the clipboard while respecting .grabit ignore rules. The tool helps turn complex projects into LLM input for big context questions.

Features

  • Recursively extracts file contents from a directory.
  • Respects .grabit ignore files (similar to .gitignore).
  • Saves extracted content to a file or copies it to the clipboard.
  • Works on Windows, macOS, and Linux.

Installation

Install grabit using pip:

pip install grabit

Usage

Basic Usage

grabit /path/to/directory

Save Output to a File

grabit /path/to/directory -o output.txt

Copy Output to Clipboard

grabit /path/to/directory -c

Ignore Files

To ignore specific files or patterns, create a .grabit file in the root of your project. This works similarly to .gitignore.

Example .grabit:

*.log
__pycache__/
secrets.txt

Example Output

When run, grabit will generate a structured output:

## `src/main.py`:
``
print("Hello, World!")
``

## `README.md`:
``
# Project Readme
``

Versioning

Major.Minor.Patch

  • Major: Breaking changes.
  • Minor: New feature has been added, but backwards compatible.
  • Patch: Backwards compatible bug fixes.

Future Features

  • include and exclude sections in .grabit.
  • different titled configurations in the .grabit file. i.e. you could have a configuration for getting section A of your app, and another for section B of your app to make it easier to set the configs once and not worry afterwards.
  • include a directory tree in the output as well as the code.
  • tell the user the rough number of tokens that have been found across all the files.
  • predict the number of tokens in a file based on the number of characters in each file.
  • if the number of tokens is high, intelligently group files by suffix and prefix and ask the user if they'd like to include or exclude them.
  • ask the user if they'd like to include only a snippet of parts of long files, i.e. first 50 lines or so. Allow them to set this on a per file basis.
  • automatically generate or update the .grabit file to reflect the user's choices in the above two features.
  • ask the user if they want to include large directories as well, not just by prefix.
  • add an option for setting up a query on the cli that helps grabit decide how simliar files are to that query and suggest including or discluding based on that. Cursor do something like this, a vector database.
  • store the query inside of the .grabit file for faster query re-runs.
  • if the files have git history, find the git history of every collected file and add when it was last changed.
  • taking the above further, optionally add the entire git log of commits for each file, or the last N commits or what have you. This could add extra context for an LLM.
  • add an option to include the git diff of each file, or the last N diffs or what have you. This could add even more context for an LLM, an optional argument could be add that switches this on.
  • add the ablity to collect the whole git history of the repo, all the logs, and get the timeline of each log so you can see how frequently a file is changed compared to how many commits have been made in total. This extra information could be used to help decide which files to include or exclude. It could also help the user decide how important a particular file is. Especially if data on recent history is provided.
  • add tests to the package to avoid changes causing failure.
  • decide on versioning strategy and add to README.md

License

MIT License

Connor Skelland

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grabit-0.2.0.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

grabit-0.2.0-py3-none-any.whl (7.5 kB view details)

Uploaded Python 3

File details

Details for the file grabit-0.2.0.tar.gz.

File metadata

  • Download URL: grabit-0.2.0.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.12.2 Darwin/24.2.0

File hashes

Hashes for grabit-0.2.0.tar.gz
Algorithm Hash digest
SHA256 4907a4335e7ef465ddfc373e629495218a503e01d1890c416a62647b1e8c383e
MD5 cb26eac3d23487c5982ab910b0d365e4
BLAKE2b-256 70c4a9baf236f82ab764da9520b08274e60e745b91453bf7900a60c0a11844ad

See more details on using hashes here.

File details

Details for the file grabit-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: grabit-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 7.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.12.2 Darwin/24.2.0

File hashes

Hashes for grabit-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f6cdd7e35e4608b841c3722bd78ff5f4b75c2d4b5ec69d298236a90336554e6c
MD5 7c86e36c034807b420ddd1f67f273308
BLAKE2b-256 a494b10d3bf741efd50c68f34c901d592371c0d0535b952a7a349dc1086848ce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page