Break down docs, build up knowledge.
Project description
Markdown Splitter: Modular Docs, Maximum Flexibility
SplitmeAI is a Python module that addresses challenges in managing large Markdown files, particularly when creating and maintaining structured static documentation websites such as Mkdocs.
Why Use SplitmeAI?
- Section Splitting: Breaks down large Markdown files into smaller, manageable sections based on specified heading levels.
- Filename Sanitization: Generates clean, unique filenames for each section, ensuring compatibility and readability.
- Reference Link Management: Extracts and appends reference-style links used within each section.
- Hierarchy Preservation: Maintains parent heading context within each split file.
- Thematic Break Handling: Recognizes and handles line breaks (
---,***,___) for intelligent content segmentation. - MkDocs Integration: Automatically generates an
mkdocs.ymlconfiguration file based on the split sections. - CLI Support: Provides a user-friendly Command-Line Interface for seamless operation.
Quick Start
Installation
Install from PyPI using any of the package managers listed below.
pip
Use pip (recommended for most users):
pip install -U splitme-ai
pipx
Install in an isolated environment with pipx:
❯ pipx install readmeai
uv
For the fastest installation use uv:
❯ uv tool install splitme
Usage
Using the CLI
Example 1: Split a Markdown file into sections:
splitme-ai \
--split.i examples/data/README-AI.md \
--split.settings.o examples/output-h2
Example 2: Generate a mkdocs static documentation site config file:
splitme-ai \
--split.i examples/data/README-AI.md \
--split.settings.o examples/output-h2 \
--split.settings.mkdocs
View the output for heading level 2 example here.
Example 3: Split on heading level 3:
splitme-ai \
--split.i examples/data/README-AI.md \
--split.settings.o examples/output-h3 \
--split.settings.hl "###"
View the output for heading level 3 example here.
Example 4: Split on heading level 4:
splitme-ai \
--split.i examples/data/README-AI.md \
--split.settings.o examples/output-h4 \
--split.settings.hl "####"
View the output for heading level 4 example here.
[!NOTE] The Official Documentation site with extensive examples and usage instructions is under development Stay tuned for updates!
Roadmap
- Enhance CLI usability and user experience.
- Integrate AI-powered content analysis and segmentation.
- Add robust chunking and splitting algorithms for LLM applications.
- Add support for additional static site generators.
- Add support for additional input and output formats.
License
Copyright © 2024 splitme-ai.
Released under the MIT license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file splitme_ai-0.1.3.tar.gz.
File metadata
- Download URL: splitme_ai-0.1.3.tar.gz
- Upload date:
- Size: 14.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
eb1c16b3dc593971bdac9a31c62250dd5287cd713717f6c9073e8b08822cb929
|
|
| MD5 |
b2efbd881d72d2f32b7fbda2e1894f65
|
|
| BLAKE2b-256 |
1ef65e2c6e242ff2d4075b1e0ef6bc781763e4807bf9e7f4c6741a29b15e8c27
|
Provenance
The following attestation bundles were made for splitme_ai-0.1.3.tar.gz:
Publisher:
ci.yml on eli64s/splitme-ai
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
splitme_ai-0.1.3.tar.gz -
Subject digest:
eb1c16b3dc593971bdac9a31c62250dd5287cd713717f6c9073e8b08822cb929 - Sigstore transparency entry: 157632389
- Sigstore integration time:
-
Permalink:
eli64s/splitme-ai@a34296fdd637ba47794a36bb742458576c5bfde9 -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/eli64s
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@a34296fdd637ba47794a36bb742458576c5bfde9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file splitme_ai-0.1.3-py3-none-any.whl.
File metadata
- Download URL: splitme_ai-0.1.3-py3-none-any.whl
- Upload date:
- Size: 18.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
18072ecead03cb13a282aea8e8482313e918c3fe9f4da1534cfc6e9249c0c495
|
|
| MD5 |
b03f4bb29bc154577a967bb9b97c5ef6
|
|
| BLAKE2b-256 |
25a3c985a5776c3dc8a430579153f5f29de93be4fc78bc7748a6adb4ab941b8f
|
Provenance
The following attestation bundles were made for splitme_ai-0.1.3-py3-none-any.whl:
Publisher:
ci.yml on eli64s/splitme-ai
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
splitme_ai-0.1.3-py3-none-any.whl -
Subject digest:
18072ecead03cb13a282aea8e8482313e918c3fe9f4da1534cfc6e9249c0c495 - Sigstore transparency entry: 157632390
- Sigstore integration time:
-
Permalink:
eli64s/splitme-ai@a34296fdd637ba47794a36bb742458576c5bfde9 -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/eli64s
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@a34296fdd637ba47794a36bb742458576c5bfde9 -
Trigger Event:
push
-
Statement type: