Skip to main content

A Python tool that extracts function signatures from large codebases and generates concise summaries for LLM context preparation

Project description

TLDR - Function Signature Extractor

TLDR is a Python tool that extracts function signatures from large codebases and generates concise summaries. It's particularly useful for providing context to Large Language Models (LLMs) when dealing with codebases that exceed their context window limits.

Features

  • Multi-language Support: Supports 40+ programming languages via Pygments lexer integration
  • Signature Extraction: Extracts function, class, and method signatures from code files
  • JSON Output: Produces structured JSON output for easy integration with other tools
  • Recursive Processing: Can process entire directory trees recursively
  • Atomic File Writing: Ensures data integrity with atomic file operations
  • (Optional) AI-Powered File Summaries: Generates file summaries using LLM providers (Claude, OpenAI, Grok)

Supported Languages

JavaScript/TypeScript, Python, Java, C/C++, C#, PHP, Ruby, Go, Rust, Swift, Scala, Kotlin, and many more.

Installation

# Clone the repository
git clone https://github.com/csimoes1/tldr-code.git
cd tldr-code
# Create a virtual environment (optional but recommended)
python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`
# Install dependencies
pip install -r requirements.txt

Usage

# Show usage help message
python tldr_code.py -h 

# Process a local directory tree to generate a tldr summary file
python tldr_code.py /path/to/directory

# Example: Process a GitHub repository and create a tldr summary file (example here is the Python fastapi project)
python tldr_code.py https://github.com/fastapi/fastapi 

# Example: Process a GitHub repository and store the downloaded files and the tldr summary file in a specific directory
python tldr_code.py https://github.com/fastapi/fastapi /Users/csimoes/repos/fastapi

Command Line Options

  • input (GitHub URL | /path/to/directory): Path to the repo/directory to scan (required)
  • output_filename: Output filename (optional, defaults to tldr.json)

Output Format

TLDR generates JSON files with the following structure:

{
  "directory_path": "/path/to/directory",
  "last_updated": "2025-06-16T10:30:00Z",
  "files": [
    {
      "file_path": "/path/to/file.py",
      "last_scanned": "2025-06-16T10:30:00Z",
      "signatures": [
        "class MyClass(BaseClass)",
        "def __init__(self, param1, param2)",
        "def process_data(self, data: List[str]) -> Dict[str, Any]"
      ],
      "summary": "This file implements data processing functionality..."
    }
  ]
}

Use Cases

  1. LLM Context Preparation: Quickly generate summaries of large codebases for LLM analysis
  2. Code Documentation: Automatically extract API signatures for documentation
  3. Codebase Analysis: Get an overview of code structure and functionality
  4. Code Review Assistance: Understand code changes and their impact

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests if applicable
  5. Submit a pull request

License

See LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tldr_code-0.1.2.tar.gz (23.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tldr_code-0.1.2-py3-none-any.whl (21.0 kB view details)

Uploaded Python 3

File details

Details for the file tldr_code-0.1.2.tar.gz.

File metadata

  • Download URL: tldr_code-0.1.2.tar.gz
  • Upload date:
  • Size: 23.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for tldr_code-0.1.2.tar.gz
Algorithm Hash digest
SHA256 c8e4fb732bdbc5cf7908b7d5dd2e1fc2d87a0484c5c8b57d4073d959582c1c50
MD5 d14b760b3383733a6cb7f4c1e7ba3675
BLAKE2b-256 c2038bd165c5dfca17c1de290677deb40d7fb0f7fe78d66d816b744e2747a117

See more details on using hashes here.

File details

Details for the file tldr_code-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: tldr_code-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 21.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for tldr_code-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 1855168051e32deed587c190ec9c08af763aa2baa660144d588920de6fec11f6
MD5 7033ee13fe9e889f17f8113679d8c5b7
BLAKE2b-256 a0d32de9427f9cc64895681981ade56fabbd313a35df8cb7a078075c56fd132f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page