Skip to main content

Break down docs, build up knowledge.

Project description

splitme-ai logo

Break down your docs. Build up your knowledge.

A Markdown text splitter for modular docs and maximum flexibility.

separator

What is SplitmeAI?

SplitmeAI is a Python module that addresses challenges in managing large Markdown files, particularly when creating and maintaining structured static documentation websites such as Mkdocs.

Key Features:

  • Section Splitting: Breaks down large Markdown files into smaller, manageable sections based on specified heading levels.
  • Filename Sanitization: Generates clean, unique filenames for each section, ensuring compatibility and readability.
  • Reference Link Management: Extracts and appends reference-style links used within each section.
  • Hierarchy Preservation: Maintains parent heading context within each split file.
  • Thematic Break Handling: Recognizes and handles line breaks (---, ***, ___) for intelligent content segmentation.
  • MkDocs Integration: Automatically generates an mkdocs.yml configuration file based on the split sections.
  • CLI Support: Provides a user-friendly Command-Line Interface for seamless operation.

Quick Start

Installation

Install from PyPI using any of the package managers listed below.

 pip

Use pip (recommended for most users):

pip install -U splitme-ai

 pipx

Install in an isolated environment with pipx:

 pipx install readmeai

 uv

For the fastest installation use uv:

 uv tool install splitme

Usage

Using the CLI

Let's take a look at some examples of how to use the splitme-ai CLI.

Example 1: Split a Markdown file on heading level 2 (default setting):

splitme-ai \
    --split.i examples/data/README-AI.md \
    --split.settings.o examples/output-h2

Example 2: Split on heading level 2 and generate an mkdocs.yml configuration file:

splitme-ai \
    --split.i examples/data/README-AI.md \
    --split.settings.o examples/output-h2 \
    --split.settings.mkdocs

View the output generated for splitting on heading level 2 here.

Example 3: Split on heading level 3:

splitme-ai \
    --split.i examples/data/README-AI.md \
    --split.settings.o examples/output-h3 \
    --split.settings.hl "###"

View the output generated for splitting on heading level 3 here.

Example 4: Split on heading level 4:

splitme-ai \
    --split.i examples/data/README-AI.md \
    --split.settings.o examples/output-h4 \
    --split.settings.hl "####"

View the output generated for splitting on heading level 4 here.

[!NOTE] The Official Documentation site with extensive examples and usage instructions is under development Stay tuned for updates!


Roadmap

  • Enhance CLI usability and user experience.
  • Integrate AI-powered content analysis and segmentation.
  • Add robust chunking and splitting algorithms for LLM applications.
  • Add support for additional static site generators.
  • Add support for additional input and output formats.

Contributing

Contributions are welcome! For bug reports, feature requests, or questions, please open an issue or submit a pull request on GitHub.


License

Copyright © 2024 splitme-ai.
Released under the MIT license.

separator

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

splitme_ai-0.1.6.tar.gz (15.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

splitme_ai-0.1.6-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file splitme_ai-0.1.6.tar.gz.

File metadata

  • Download URL: splitme_ai-0.1.6.tar.gz
  • Upload date:
  • Size: 15.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for splitme_ai-0.1.6.tar.gz
Algorithm Hash digest
SHA256 c1f0728c3a19472ed50f4f1299111ae254727e90c8bc0613391c299f483232b3
MD5 a2cf1c53fb47f81581dcacb767384eaa
BLAKE2b-256 a8b41c8d04380c36b228244988239f5f31ec9193e60946b886c9b4e9286a3631

See more details on using hashes here.

Provenance

The following attestation bundles were made for splitme_ai-0.1.6.tar.gz:

Publisher: ci.yml on eli64s/splitme-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file splitme_ai-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: splitme_ai-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for splitme_ai-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 54a97881dbf2f384858d259321bc3ea516a00cb14fa1dfbfc68f062d58918b93
MD5 4aa31f1d1598f3430ebd77316ca5978b
BLAKE2b-256 43ad28b7289b12edaaf71c915856485af71bc1749c5d8c427f07f34e7eac6a1e

See more details on using hashes here.

Provenance

The following attestation bundles were made for splitme_ai-0.1.6-py3-none-any.whl:

Publisher: ci.yml on eli64s/splitme-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page