Skip to main content

A Python library for intelligently converting text into Markdown.

Project description

text2markdown 📝

text2markdown is a Python library for intelligently converting plain text into Markdown.

text2markdown is powered by the Isaacus enrichment API, which converts unstructured documents into rich, highly structured knowledge graphs that can easily be transformed into Markdown.

In all, text2markdown is capable of:

  • Identifying and formatting headings.
  • Segmenting text into nested sections.
  • Hyperlinking cross-references within texts to other sections.
  • Italicizing cited documents.
  • Detecting and formatting block quotations.
  • Striking through junk text.

Setup 📦

text2markdown can be installed with pip (or uv):

pip install text2markdown

An Isaacus API key is also required to use this library.

Usage 👩‍💻

The code snippet below demonstrates how you might use text2markdown() to intelligently convert a short document into Markdown.

from text2markdown import text2markdown

text = """\
The Smallest Document In The World
This is a generic document.

Section 1 - Background
One upon a time, there was a mayor who said:
We love Markdown so much that everyone should and must use it for everything.

Section 2 - Problem
The mayor's directive, as stated in Section 1, was sadly too difficult to enforce."""

output = text2markdown(text)
print(output)

The output should look something like this:

# The Smallest Document In The World 

This is a generic document. 

## <a id="seg-1"></a>Section 1 - Background 

One upon a time, there was a mayor who said: 

> We love Markdown so much that everyone should and must use it for everything. 

## Section 2 - Problem 

The mayor's directive, as stated in [Section 1](#seg-1), was sadly too difficult to enforce.

An asynchronous version of text2markdown() is also available, supporting all of the same features and arguments as its synchronous equivalent. It can be used like so:

from text2markdown import text2markdown_async

output = await text2markdown_async(text)
print(output)

All of the various capabilities of text2markdown can be toggled on or off using optional Boolean parameters, as shown below:

from text2markdown import text2markdown

from isaacus import Isaacus

output = text2markdown(
    text,
    link_xrefs=True,
    strike_junk=True,
    block_quotes=True,
    italicize_refs=True,
    enrichment_model="kanon-2-enricher",
    isaacus_client=Isaacus(),
)
print(output)

License 📜

This library is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text2markdown-0.1.0.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

text2markdown-0.1.0-py3-none-any.whl (9.7 kB view details)

Uploaded Python 3

File details

Details for the file text2markdown-0.1.0.tar.gz.

File metadata

  • Download URL: text2markdown-0.1.0.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for text2markdown-0.1.0.tar.gz
Algorithm Hash digest
SHA256 64378f9d8ec447e03011b65ec139bf490c1e0423ebb9d97550fe80bfb2f4389b
MD5 4b7b10be984c88734c371c09adf5fd01
BLAKE2b-256 d92cbc879ca134ab72563aa008ade87f0d344949b7d8025c51a9a9fad498fa90

See more details on using hashes here.

Provenance

The following attestation bundles were made for text2markdown-0.1.0.tar.gz:

Publisher: python-publish.yml on isaacus-dev/text2markdown

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file text2markdown-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: text2markdown-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 9.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for text2markdown-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e27e4e09df43a38ec879e4ca14985ecd03315abddae1ac0934d1b9f667d64f34
MD5 d7575178f72579c052c522cd1a624834
BLAKE2b-256 35ad58e88551eabbe3500de3c663ae7e69e62f1c2d720aaaa6b01d606fa42ef8

See more details on using hashes here.

Provenance

The following attestation bundles were made for text2markdown-0.1.0-py3-none-any.whl:

Publisher: python-publish.yml on isaacus-dev/text2markdown

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page