A tool to generate comprehensive Markdown artifacts of directory structures and file contents
CodeMapper
Overview
CodeMapper is a Python tool that generates comprehensive Markdown representations of codebases (PDF, XML, and JSON outputs are planned). It bridges the gap between human developers and AI systems by providing a clear, structured view of a project's architecture and complete contents. Whether you're working with a local directory or a GitHub repository, CodeMapper creates a single, navigable document that captures the full structure and content of the project.
This tool is invaluable for rapid codebase comprehension, whether you're a developer onboarding to a new project or an AI system analyzing code structure.
For audio explanations of this project, see:
- podcasts (auto-generated by Gemini using NotebookLM)
Features
- Dual-Purpose Output: Generates content optimized for both human readers and AI analysis
- Intelligent Content Parsing:
- Respects .gitignore rules
- Handles various file types appropriately
- Comprehensive Structure Representation:
- Creates an accurate, hierarchical file tree
- Generates a smart table of contents for easy navigation
- Code-Aware Processing:
- Applies appropriate syntax highlighting for different file types
- Intelligently handles large or binary files without bloating the output
- Flexible Input Handling: Works with local directories and GitHub repositories
- Encoding Detection: Ensures accurate content reading across various file encodings
- Customizable Ignore Rules: Option to include files normally ignored by .gitignore
- Efficient Output Management: Organizes generated documents in a '_codemaps' directory
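To make the "hierarchical file tree" feature concrete, here is a minimal sketch of how such a tree could be rendered with only the standard library. The function name `render_tree` and the box-drawing layout are assumptions for illustration, not CodeMapper's actual implementation, and the sketch does not apply .gitignore rules.

```python
from pathlib import Path
from typing import List


def render_tree(root: Path, prefix: str = "") -> List[str]:
    """Return the lines of a text tree for everything under `root`.

    Hypothetical helper: directories come first, then files, each sorted by name.
    """
    entries = sorted(root.iterdir(), key=lambda p: (p.is_file(), p.name.lower()))
    lines = []
    for index, entry in enumerate(entries):
        last = index == len(entries) - 1
        connector = "└── " if last else "├── "
        suffix = "/" if entry.is_dir() else ""
        lines.append(f"{prefix}{connector}{entry.name}{suffix}")
        if entry.is_dir():
            # Keep a vertical bar under non-final entries so siblings stay connected.
            extension = "    " if last else "│   "
            lines.extend(render_tree(entry, prefix + extension))
    return lines
```

Joining the returned lines with newlines gives a tree that can be dropped into a Markdown document as a preformatted block.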
Roadmap
Requirements
- Python 3.6+
- pathspec library (for handling .gitignore rules)
- chardet library (for file encoding detection)
Installation
From PyPI
Install CodeMapper directly from PyPI:
pip install codemapper
From Source
- Clone this repository:
git clone https://github.com/shaneholloman/codemapper.git
- Navigate to the cloned directory and install the package:
cd codemapper
pip install .
Building the Package
To build the package using pyproject.toml (this requires the build package, installable with pip install build):
python -m build
This will create distribution files in the dist directory.
Usage
Run CodeMapper from the command line, providing the path to the directory or GitHub repository URL you want to analyze:
codemapper <path_to_directory_or_github_url> [--include-ignored]
Options
- <path_to_directory_or_github_url>: The path to the directory or GitHub repository URL to analyze (required)
- --include-ignored: Include files that are normally ignored by .gitignore (optional)
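The command-line shape described above can be mirrored with `argparse`. This is only a sketch of an equivalent parser; the function name `build_parser` and the attribute name `path` are assumptions, and CodeMapper's own argument handling may differ.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented command line (illustrative only)."""
    parser = argparse.ArgumentParser(prog="codemapper")
    # Required positional argument: a local directory or a GitHub repository URL.
    parser.add_argument(
        "path",
        metavar="path_to_directory_or_github_url",
        help="Directory path or GitHub repository URL to analyze",
    )
    # Optional switch: off by default, so .gitignore rules are respected.
    parser.add_argument(
        "--include-ignored",
        action="store_true",
        help="Include files normally ignored by .gitignore",
    )
    return parser


args = build_parser().parse_args(["./my_project", "--include-ignored"])
print(args.path, args.include_ignored)  # prints: ./my_project True
```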
Output
CodeMapper generates a Markdown file named <directory_name>_codemap.md in the '_codemaps' directory. This file contains:
- A comprehensive table of contents for easy navigation
- An accurate file tree representation of the directory structure
- The contents of each file, formatted with appropriate syntax highlighting
- Information about large and binary files (type and size) without their contents
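As an illustration of the last point, the sketch below shows one common way to decide whether a file's contents should be replaced by a short summary. The NUL-byte heuristic, the 1 MB threshold, and the names `looks_binary` and `summarize_or_read` are all assumptions; the README does not state how CodeMapper makes this decision or how it words the summary.

```python
from pathlib import Path

# Assumed threshold; the README does not say what counts as "large".
LARGE_FILE_BYTES = 1_000_000


def looks_binary(path: Path, sample_size: int = 8192) -> bool:
    """Heuristic: treat a file as binary if its first bytes contain a NUL byte."""
    with path.open("rb") as handle:
        return b"\x00" in handle.read(sample_size)


def summarize_or_read(path: Path) -> str:
    """Return the file's text, or a one-line summary for binary or large files."""
    size = path.stat().st_size
    if looks_binary(path):
        return f"[binary file, {size} bytes, contents omitted]"
    if size > LARGE_FILE_BYTES:
        return f"[large file, {size} bytes, contents omitted]"
    # Undecodable bytes are replaced rather than raising an error.
    return path.read_text(encoding="utf-8", errors="replace")
```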
Example usage:
codemapper https://github.com/shaneholloman/ansible-role-apache
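Because the same argument can be either a local path or a repository URL, a tool like this has to tell the two apart. The following is a hedged sketch of one way to do that with `urllib.parse`; the function name `is_github_url` and the accepted host names are assumptions, not CodeMapper's documented behavior.

```python
from urllib.parse import urlparse


def is_github_url(target: str) -> bool:
    """Return True when `target` looks like an HTTP(S) GitHub repository URL."""
    parsed = urlparse(target)
    # A repository URL needs at least an owner and a repository name in its path.
    parts = [part for part in parsed.path.split("/") if part]
    return (
        parsed.scheme in ("http", "https")
        and parsed.netloc.lower() in ("github.com", "www.github.com")
        and len(parts) >= 2
    )
```

Anything that fails this check would then be treated as a local directory path.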
Use Cases
- Rapidly understand the structure and content of new or unfamiliar projects
- Generate comprehensive documentation for code-based AI prompts
- Facilitate thorough code reviews by providing a complete one-page overview
- Assist AI systems in analyzing and understanding complex codebases
- Analyze GitHub repositories without manual cloning
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
- Thanks to the pathspec and chardet libraries for enhancing CodeMapper's functionality.
Version History
For a detailed version history, see changelog.md.
If you find CodeMapper useful, don't forget to star this repository!
File details
Details for the file codemapper-4.0.0.tar.gz.
File metadata
- Download URL: codemapper-4.0.0.tar.gz
- Upload date:
- Size: 16.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 5e40834a4e424d819ac9bdf0d9317c38e0789e98a29247a7dbb87f248adb6969 |
| MD5 | 18c2eef5469fdfeb42fe10e0e11da517 |
| BLAKE2b-256 | 483238d68517e4d1ba34126b23347c20c66417952b3a41db49739e88eda847cc |
File details
Details for the file codemapper-4.0.0-py3-none-any.whl.
File metadata
- Download URL: codemapper-4.0.0-py3-none-any.whl
- Upload date:
- Size: 14.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 86d2f76ff3925a1d796ddc2c84ad8312154ee352c204de082e71a8da55689097 |
| MD5 | 0365b4772843dcf44aff25339010da01 |
| BLAKE2b-256 | 5306c513942c69fa07656844e64ee2f9453af708ccedddee6fb7b6f36173401d |