Skip to main content

Gptize is a tool designed to concatenate the contents of project files for ChatGPT

Project description

GPTize

GPTize is a tool for merging the contents of project files into a single text document. It is specifically designed to create datasets that can be loaded into ChatGPT for analysis or training. I, Aleksei Svetlov, created this tool because I was tired of copying file contents and paths to make GPT understand the context of my project. With GPTize, this process is now automated and streamlined.

Features

  • Exception handling for files based on .gitignore (including custom .gitignore-gptize).
  • Support for specifying a target directory with repository root .gitignore.
  • Support for various encodings when reading files.
  • Customizable output file naming based on the input file or directory name.
  • Report generation including all processed files.
  • Enhanced limit checks for file size and token count, with warnings logged instead of errors raised when limits are exceeded.

Installation

To install GPTize, simply use pip:

pip install gptize

This command will install GPTize and all its dependencies. After installation, you can use GPTize from the command line anywhere.

Usage

To run GPTize, you have several options:

Basic Usage

Simply invoke GPTize in the command line to process the current directory:

gptize

This will process all files in the current directory and generate a report with a default name like gptize-output-folder-YYYYMMDD-HHMMSS.txt.

Specifying a Directory

To process a specific directory, use:

gptize /path/to/directory

This will process all files in the specified directory and create a report named gptize-output-folder-YYYYMMDD-HHMMSS.txt.

Specifying a Single File

For processing a single file:

gptize /path/to/file.txt

This will process only the specified file and generate a report named gptize-output-file_name-YYYYMMDD-HHMMSS.txt, where file_name is the name of the input file.

Specifying Repository Root for .gitignore

If your .gitignore is located in the root of the repository but you want to process files in a different subdirectory, you can use the --repo-root option:

gptize /path/to/directory --repo-root /path/to/repo_root

This will apply the .gitignore from the repository root to files in the specified directory.

Custom Output File

If you want to specify a custom output file name, use the -o or --output option:

gptize -o custom_output.txt

This command will override the default naming convention and use custom_output.txt as the output file name.

Uploading to ChatGPT

After generating the merged file using GPTize, you can upload it to ChatGPT for improved context understanding. When making requests to ChatGPT, explicitly reference the uploaded file, for instance, using a phrase like ... based on the imported txt file. This approach significantly enhances the quality of ChatGPT's responses by providing it with specific context.

Components

  • gptizer.py: The main class for file processing.
  • main.py: The entry point of the application.
  • models.py: Data models for files and projects.
  • output_builder.py: Output constructor for report generation.
  • settings.py: Project settings.

Author and Maintainer

Aleksei Svetlov - Creator and main maintainer.

Contact Information

License

The project is distributed under the MIT License.

CHANGELOG

[0.3.0] - 2024-09-26

  • [Feature] Support for specifying a target directory with repository root .gitignore
    • Now gptize can be executed from any directory, while still applying .gitignore rules from the root of the repository.
    • Added the --repo-root argument to specify the root directory of the repository where the .gitignore is located.
    • Example usage: gptize src/py_module/ --repo-root . allows processing files in src/py_module/ while applying .gitignore rules from the repository root.
  • [Feature] Added support for a custom .gitignore for GPTize
    • Now, you can use an additional custom .gitignore-gptize file along with the repository root .gitignore.
    • The custom .gitignore-gptize can be specified and will be applied in addition to the main .gitignore.
    • Example usage: gptize src/py_module/ --repo-root . will apply both .gitignore and .gitignore-gptize.

[0.2.5] - 2023-11-25

  • [Modification] Updated File Size and Token Count Checks
    • Modified the combine_files method in gptizer.py to log a warning instead of raising an error when the total size of the combined content exceeds the MAX_FILE_SIZE_BYTES_LIMIT or MAX_TOKEN_COUNT_LIMIT defined in settings.py.

[0.2.4] - 2023-11-16

  • [Feature] Custom Output File Naming
    • Output files now include the name of the processed file or directory, enhancing traceability and identification.
  • [Enhancement] Settings Method for Custom File Names
    • Updated the Settings class with a new method to generate output file names incorporating the name of the input file or directory.
  • [Modification] Main File Processing Logic
    • Modified main.py to adopt the new output file naming scheme.
  • [Fix] Minor Bug Fixes and Performance Improvements
    • Addressed various minor bugs and optimized performance.

[0.2.3] - 2023-11-12

  • [Enhancement] Detect binary files and handle errors gracefully
    • Added binary file detection logic in load_file_content method.
    • Improved error handling for file reading.
    • Updated the OutputBuilder to handle binary files properly.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gptize-0.3.1.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

gptize-0.3.1-py3-none-any.whl (9.5 kB view details)

Uploaded Python 3

File details

Details for the file gptize-0.3.1.tar.gz.

File metadata

  • Download URL: gptize-0.3.1.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for gptize-0.3.1.tar.gz
Algorithm Hash digest
SHA256 073f2d0ba04ec67a1940aa7ccafba7182fedda2b0ac330a932eb15ac3a937efe
MD5 536acbd9d74631a29c1b8adcb1082db8
BLAKE2b-256 da89978014db5424eb47a3ec3e090db22332c217441647432d53e2e3caa83365

See more details on using hashes here.

File details

Details for the file gptize-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: gptize-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 9.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for gptize-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bc4fe0082361be7ca7fe21832eb93298e2fb1a2d10290d769aa4f0578b02cef2
MD5 d5f95f9ad5aeed21c5855d439fce27be
BLAKE2b-256 749338280487352a711bc5010d4464e59f13a980ad88d6ad537d39f058f519bf

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page