Skip to main content

Convert markdown inside Docx to docx styles

Project description

MarkDownToDocxStyle

Convert Markdown inside Office Word documents

Installation

pip install markdowntodocx

Usage

to convert an existing Docx file:

see examples/example.py

from markdowntodocx.markdownconverter import convertMarkdownInFile

convertMarkdownInFile("/mypath/to/document.docx", "output_path.docx", {"Code Car":"CodeStyle"})

Styles and considerations

You have to define styles in you word document in order to use Markdown **Headers/titles**, **Hyperlinks**, **Code formatting**, **Arrays**, **Unordered List**.

This styles are either standard markdown or come from extended markdown : https://www.markdownguide.org/extended-syntax/

  • Emphasis (italic) *Text* or _Text_: converts to word italic

  • Strong Emphasis (Bold) **Text** or __Text__: converts to word bold

  • Strike through (Strike) ~~Strike~~ : converts to word strike through style

  • Highlight (==highlight==) `==Highlight==' : converts to word Yellow highlight.

  • Header # MarkdownHeader1 to ###### MarkdownHeader6:

    • Must be in alone in a paragraph. IF NOT, the rest will be erased.
    • It will use the document style named "Header" by default.
    • You can specify another style by giving the style dictionnary as last arg for both functions.
    • E.g : res, msg = convertMarkdownInFile("examples/in_document.docx", "examples/out_document.docx", {"Header":"style_name"})
  • (EXTENDED SYNTAX FOR WORD) Change font color <color:FF0000> this text will be very red because the color is in RGB format</color> or <span style="color: rgb(230, 0, 0);> REd colored text </span>

  • Inline Code `Text` (my code):

    • It will use the document style named "Code" (Caracter format) by default.
    • You can specify another style by giving the style dictionnary as last arg for both functions.
    • E.g : markdownToWordInFile("/mypath/to/document.docx", "output_path.docx", {"Code Car":"my_inline_code_style"})
  • Code Block ` ` `T e x t` ` `

my code
* It will use the document style named "Code" by default. 
* You can specify another style by giving the style dictionnary as last arg for both functions. 
* E.g : `markdownToWordInFile("/mypath/to/document.docx", "output_path.docx", {"Code":"my_block_code_style"})`
  • Mermaid support ` ` `mermaid ... ` ` ` (see mermaid https://mermaid.js.org).

    • CAUTION : By default, mermaid graph are generated using https://mermaid.ink/img. You can install a mermaid.ink server locally with https://github.com/jihchi/mermaid.ink and then specify the mermaid server when running the markdown converter. Alternatively, you can specify a mermaid_cli as a binary, typically mmdc
    • E.g : markdownToWordInFile("/mypath/to/document.docx", "output_path.docx", mermaid_server_link="https://localhost:3000/img/")
    • E.g : markdownToWordInFile("/mypath/to/document.docx", "output_path.docx", mermaid_cli="mmdc")
  • Insert Remote Image ![Image name](http://link.do.web/myimage.png):

    • It will download the image from the hyperlink and insert the picture with a width of 18cm
  • Insert Local Image ![image name](file:///home/myuser/images/test.png)

    • It will read images extensioned file and insert them as remote images
  • Hyperlink [google](https://www.google.fr) : Makes it a Word hyperlink google

    • Will also attempt to convert any valid http hyperlink to word : http://www.google.fr -> http://www.google.fr
    • If the link does not start with http, it will be treated as an internal link to a bookmark
  • (EXTENDED SYNTAX FOR WORD) Bookmark ``this will be bookmared with name bookmark1{#bookmark1}

    • You may hyperlink to it : [url text to display]{bookmark1}
  • Footnotes (BETA) :

    • Inline foot notes : this is a conundrum^[https://fr.wiktionary.org/wiki/conundrum]
    • External foot notes : ``` This paragraph will have a footnote[^1] And this paragraph will have another[^2] [^1]: This is a footnote with markdown as well bold And it can have many lines if they are indented. [^2]: This is the second footnote
    
    
  • Array to wordlist: (must be alone in a paragraph otherwise the rest of the paragraph is deleted)

|Column1|column2|Column3|
|-------|-------|-------|
|line|line|line|

-->

Column1 column2 Column3
line line line
* Cells created will use the document style named "Cell" by default. 
* You can specify another style by giving the style dictionnary as last arg for both functions. 
* E.g : `markdownToWordInFile("/mypath/to/document.docx", "output_path.docx", {"Cell":"my_cell_style"})`
  • Unordered List : (- my list or * my list or + my list) :
    • Must be in alone in a paragraph. IF NOT, the rest of the paragraph will be erased.
    • It will use the document style named "Header" by default.
    • You can specify another style by giving the style dictionnary as last arg for both functions.
    • E.g : markdownToWordInFile("/mypath/to/document.docx", "output_path.docx", {"BulletList":"my_bullet_style"})

Advanced usage : Image styles

Docx format supports image format but their is no easy way to save them in docx styles. As a workaround, a keyword arg named image_modifier exists for the function convertMarkdownInFile. It can be used like this to add a centered black shadow to all images in paragraph with a custom style named "ImageModifier":

    res, msg = convertMarkdownInFile("examples/in_document.docx", "examples/out_document.docx" ,{"Header":"Header"}, 
                                     image_modifier=['''<a:outerShdw blurRad="63500" sx="102000" sy="102000"
                                                algn="ctr" rotWithShape="0">
                                                <a:prstClr val="black">
                                                    <a:alpha val="40000" />
                                                </a:prstClr>
                                            </a:outerShdw>'''])

It will add xml to all effectLst elements of pictures in the word document in paragraph with a custom style named "ImageModifier".

To get your goal style, create an empty docx document, add a single image with desired format and save the document. Then, unzip the docx document as if it is a ZIP archive. Open the folder and then word/document.xml should display the XML. Look for your picture style and try to copy it.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

markdowntodocx-0.1.9.4.tar.gz (20.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

markdowntodocx-0.1.9.4-py3-none-any.whl (18.2 kB view details)

Uploaded Python 3

File details

Details for the file markdowntodocx-0.1.9.4.tar.gz.

File metadata

  • Download URL: markdowntodocx-0.1.9.4.tar.gz
  • Upload date:
  • Size: 20.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.4

File hashes

Hashes for markdowntodocx-0.1.9.4.tar.gz
Algorithm Hash digest
SHA256 f2990f240b1764ac17e0b1dd98ad0ed598b4a165e7a6d98acf5772c8c85a6ef8
MD5 67333b60cc7cb1566d3342cb9d1b5f47
BLAKE2b-256 e9c9ac132325c319d4031d6382b3781934b0ce300c2eda9d5b4eb115cf70d4e6

See more details on using hashes here.

File details

Details for the file markdowntodocx-0.1.9.4-py3-none-any.whl.

File metadata

File hashes

Hashes for markdowntodocx-0.1.9.4-py3-none-any.whl
Algorithm Hash digest
SHA256 481ea6a107f2e658e8c274e0cfe2b50ff913fb82c313604aaa8a917a9bfc8193
MD5 3973a1dba2f455e3ea71ff49f60828cb
BLAKE2b-256 6c14f3ca72093bbc8609534c2ca4b1ec57d53780a9eb05db601d0fd57f51689a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page