Skip to main content

A Python library for intelligently converting text into Markdown.

Project description

text2markdown 📝

text2markdown is a Python library for intelligently converting plain text into Markdown.

text2markdown is powered by the Isaacus enrichment API, which converts unstructured documents into rich, highly structured knowledge graphs that can easily be transformed into Markdown.

In all, text2markdown is capable of:

  • Identifying and formatting headings.
  • Segmenting text into nested sections.
  • Hyperlinking cross-references within texts to other sections.
  • Italicizing cited documents.
  • Italicizing defined terms.
  • Detecting and formatting block quotations.
  • Striking through junk text.

Setup 📦

text2markdown can be installed with pip (or uv):

pip install text2markdown

An Isaacus API key is also required to use this library.

Usage 👩‍💻

The code snippet below demonstrates how you might use text2markdown() to intelligently convert a short document into Markdown.

from text2markdown import text2markdown

text = """\
The Smallest Document In The World
This is a generic document.

Section 1 - Background
One upon a time, there was a mayor who said:
We love Markdown so much that everyone should and must use it for everything.

Section 2 - Problem
The mayor's directive, as stated in Section 1, was sadly too difficult to enforce."""

output = text2markdown(text)
print(output)

The output should look something like this:

# The Smallest Document In The World 

This is a generic document. 

## <a id="seg-1"></a>Section 1 - Background 

One upon a time, there was a mayor who said: 

> We love Markdown so much that everyone should and must use it for everything. 

## Section 2 - Problem 

The mayor's directive, as stated in [Section 1](#seg-1), was sadly too difficult to enforce.

An asynchronous version of text2markdown() is also available, supporting all of the same features and arguments as its synchronous equivalent. It can be used like so:

from text2markdown import text2markdown_async

output = await text2markdown_async(text)
print(output)

All of the various capabilities of text2markdown can be toggled on or off using optional Boolean parameters, as shown below:

from text2markdown import text2markdown

from isaacus import Isaacus

output = text2markdown(
    text,
    link_xrefs=True,
    strike_junk=True,
    block_quotes=True,
    escape_lists=True,
    italicize_refs=True,
    italicize_terms=True,
    enrichment_model="kanon-2-enricher",
    isaacus_client=Isaacus(),
)
print(output)

License 📜

This library is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text2markdown-0.1.3.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

text2markdown-0.1.3-py3-none-any.whl (9.7 kB view details)

Uploaded Python 3

File details

Details for the file text2markdown-0.1.3.tar.gz.

File metadata

  • Download URL: text2markdown-0.1.3.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for text2markdown-0.1.3.tar.gz
Algorithm Hash digest
SHA256 75c47b7c0f7ab872ca6a00204f5240f819d4e289a105c83af1dc2ba10d7a4313
MD5 b6494887213a8dddfa08dc37df05e96c
BLAKE2b-256 e11eae689e614b8cba4e8c04a6f07619710ef38eb3cef2e4e404c59de9d9c845

See more details on using hashes here.

Provenance

The following attestation bundles were made for text2markdown-0.1.3.tar.gz:

Publisher: python-publish.yml on isaacus-dev/text2markdown

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file text2markdown-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: text2markdown-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 9.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for text2markdown-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 f15a56364bfe7ebaf4de4a551f069c69131427b11fc2b9337827ee2cb82bd72b
MD5 10c220b0ff19d591f750ab46c7ec9bef
BLAKE2b-256 b0980feea99c1f22aaa345d09b532def96ec22a232254b887fd44214f6300376

See more details on using hashes here.

Provenance

The following attestation bundles were made for text2markdown-0.1.3-py3-none-any.whl:

Publisher: python-publish.yml on isaacus-dev/text2markdown

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page