Skip to main content

A tool to combine files in a folder for LLM processing

Project description

File Combiner

File Combiner is a Python package that processes files in a folder and combines them into a single output file, suitable for use with Large Language Models (LLMs).

Installation

You can install File Combiner using pip:

pip install llm_file_combiner

Usage

After installation, you can use the following commands from anywhere in your terminal:

  1. file-combiner: Combines files in a folder
  2. folder-tree: Generates a folder structure tree
  3. file-list: Generates a list of files in a folder

file-combiner command

file-combiner /path/to/folder --output combined_output.txt --extensions .py .txt --ignore venv .git

Arguments:

  • folder_path: Path to the folder to process (required)
  • --output: Name of the output file (default: output.txt)
  • --extensions: File extensions to include (e.g., .py .txt)
  • --ignore: Folders to ignore (e.g., git node_modules) (default: ['.git', 'node_modules', 'pycache'])
  • --add-line-numbers: Add line numbers to the content of each file
  • --mode: Output mode: xml, markdown, or custom (default: xml)
  • --custom-output-template: Path to custom output template file (required if mode is custom)
  • --custom-file-template: Path to custom file content template file (required if mode is custom)

folder-tree command

The folder-tree command generates a tree structure of the specified folder.

folder-tree /path/to/folder --output folder_tree.txt --extensions .py .txt --ignore venv .git

Arguments:

  • folder_path: Path to the folder to process (required)
  • --output: Name of the output file (default: folder_tree.txt)
  • --extensions: File extensions to include (e.g., .py .txt)
  • --ignore: Folders to ignore (default: ['.git', 'node_modules', 'pycache'])

file-list command

The file-list command generates a list of files in the specified folder.

file-list /path/to/folder --output file_list.txt --extensions .py .txt --ignore venv .git

Arguments:

  • folder_path: Path to the folder to process (required)
  • --output: Name of the output file (default: file_list.txt)
  • --extensions: File extensions to include (e.g., .py .txt)
  • --ignore: Folders to ignore (default: ['.git', 'node_modules', 'pycache'])

Example Usage

  1. Basic usage of file-combiner with default XML mode:
file-combiner example_folder
  1. Using Markdown mode with specific extensions and ignoring certain folders:
file-combiner example_folder --mode markdown --output combined_output.md --extensions .py .txt --ignore venv .git
  1. Using custom templates:
file-combiner example_folder --mode custom --output custom_output.txt --custom-output-template my_output_template.txt --custom-file-template my_file_template.txt
  1. Generating a folder tree:
folder-tree example_folder --output folder_structure.txt --extensions .py .txt --ignore venv
  1. Generating a file list:
file-list example_folder --output files_in_project.txt --extensions .py .txt --ignore venv

Template System

File Combiner uses a flexible template system to format the output. There are three built-in modes:

  1. XML (default)
  2. Markdown
  3. Custom

XML Mode

The XML mode uses predefined XML templates for the output structure and individual file content. The XML mode is the default mode. To use it explicitly:

file-combiner /path/to/folder --mode xml

Markdown Mode

The Markdown mode uses predefined Markdown templates for the output structure and individual file content. To use Markdown mode:

file-combiner /path/to/folder --mode markdown --output combined_output.md

Custom Mode

The custom mode allows you to provide your own templates for both the overall output structure and individual file content. To use custom mode:

file-combiner /path/to/folder --mode custom --custom-output-template path/to/output_template.txt --custom-file-template path/to/file_template.txt

Creating Custom Templates

Custom templates use placeholders that are replaced with actual content during processing. Here are the available placeholders:

For the output template:

  • {TOTAL_FILES}: Total number of processed files
  • {DATE_GENERATED}: Date and time when the output was generated
  • {FOLDER_TREE}: Tree structure of the processed folder
  • {FILES_INCLUDED}: List of processed files
  • {FILE_CONTENTS}: Combined content of all processed files

For the file content template:

  • {FILE_PATH}: Relative path of the file
  • {FILE_NAME}: Name of the file
  • {LINES_COUNT}: Number of lines in the file
  • {MODIFIED_TIME}: Last modification time of the file
  • {FILE_CONTENT}: Content of the file

Example custom output template:

# Folder Analysis

Processed on: {DATE_GENERATED}
Total files: {TOTAL_FILES}

## Folder Structure

{FOLDER_TREE}


## Files Included
{FILES_INCLUDED}

## File Contents
{FILE_CONTENTS}

Example custom file content template:

### {FILE_NAME}

- Path: `{FILE_PATH}`
- Lines: {LINES_COUNT}
- Modified: {MODIFIED_TIME}

{FILE_CONTENT}


License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_file_combiner-1.0.3.tar.gz (7.9 kB view hashes)

Uploaded Source

Built Distribution

llm_file_combiner-1.0.3-py3-none-any.whl (7.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page