Skip to main content

Convert Markdown (.md) files to PDF ...

Project description

md2ltx

A command-line tool for converting Markdown to LateX-formatted PDF via Pandoc. Requires a pip virtual environment in Ubuntu/ Debian based OS.

1. Quickstart

1.1. Installation

pip install md2ltx; md2ltx --install_dependencies

1.2. Usage

md2ltx [source.md] [output.pdf] [--open] [--help]

source_file: Path to the input Markdown (.md) file.

output_pdf (optional): Path to the output PDF file. If omitted, a default name is derived from the source file, and the working directory is assumed to be the path.

--open: Open the resulting PDF in the system’s default viewer.

--test: Evaluates embedded python and prints the pre-pandoc processed string for the purposes of debugging.

--template template_name: Specify a built-in templates by name. Available templates: "one-column-article", "two-column-article", "report", "slides", "letter").

--help: Access documentation.


2. Templates

md2ltx can inject Markdown content into a LaTeX “template” that defines the overall look and structure of the PDF. You can choose from these built-in templates:

• "one-column-article"
• "two-column-article"
• "report"
• "slides"
• "letter"

When you run md2ltx (or Pandoc directly), you can specify the template with the “--template” flag. Pandoc then loads that template, replacing special variables like $title$, $author$, $date$, and $body$ with metadata and the converted Markdown content.

2.1. Common Fields in the YAML Metadata

• one-column-article/ two-column-article / report:

  • title: Title of your document
  • author: Author name(s)
  • date: Date displayed below the author(s)

• slides (Beamer presentations):

  • title: Presentation title
  • subtitle: (Optional) subtitle for your presentation
  • author: Presenter name(s)
  • date: Date (often included on the title slide)

• letter:

  • author: Sender’s name (also used in \signature)
  • address: Sender’s address
  • date: Date displayed in the letter
  • recipient: Recipient name or address
  • greeting: Opening phrase (e.g., “Dear John,”)
  • closing: Closing phrase (e.g., “Regards,”)

2.2. Using the Templates

Pandoc reads these fields from a YAML block at the top of your Markdown file. For example:

---
title: "My Awesome Title"
author: "John Doe"
date: "October 4, 2023"
---

# Sample Document

This is a **Markdown** document to test `compile_markdown_to_pdf` from `main.py`.

## Advantages of Markdown

- Easy to write
- Human-readable
- Widely supported

## Conclusion

Markdown is fantastic!

When you run md2ltx:

md2ltx my_document.md --template=two-column-article

Pandoc loads the chosen “two-column-article” template, substitutes $title$, $author$, $date$, and $body$, and then compiles a PDF. The same process applies to any of the provided templates.


3. Embedded Python Code with EMBED::function_name

md2ltx supports executing Python code blocks alongside your Markdown, and replacing placeholders of the form EMBED::function_name with the string returned by calling function_name() in Python.

Each code block is enclosed between [START]######## and [END]######## (with at least three “#” characters). All code in those blocks is executed once in the same environment, meaning you can define multiple functions in a single block or spread them across multiple blocks. As soon as the code is executed, any functions you defined become available for embedding.

3.1. General Steps

3.1.1. In your Markdown text, put placeholders where you want to inject dynamic data:

“Markdown is awesome! `EMBED::foo`.”

Each placeholder references a function name that you’ll define in a code block.

3.1.2. Define your code blocks somewhere in the same Markdown file, delimited by [START]…[END]:

[START]#########################################################################
    def foo() -> str:
        return "Hello from foo()"

    def bar() -> str:
        return "Hello from bar()"
[END]###########################################################################

You can define as many functions in a block as you want.

3.1.3. At build time, md2ltx collects all your code blocks, removes common indentation so Python sees them as valid top-level code, executes them, and records the function objects.

3.1.4. Every placeholder EMBED::function_name in your Markdown is then replaced by the return value of calling that function.

3.2. Requirements

• Each function you plan to embed must take no parameters and return a string (or something convertible to string, or a pandas DataFrame).
• If your function returns a pandas.DataFrame, md2ltx automatically converts it to a Markdown table (displaying only the first 5 and last 5 rows if there are more than 10).
• md2ltx does NOT automatically provide additional libraries (e.g. math, pandas, numpy, datetime, rgwfuncs) in the environment. If your code depends on these libraries, you must explicitly import them within your “[START] … [END]” blocks.
• Keep your code blocks consistently indented. By default, if each line is indented by four spaces, md2ltx will remove those four leading spaces and preserve deeper indentation so that nested code within your function remains valid Python.

3.3. Example

Here’s a short Markdown snippet:

Markdown is fantastic! `EMBED::sqrt_of_16`. Another result: `EMBED::compute_2`.

[START]#########################################################################
    import math

    def sqrt_of_16() -> str:
        val = math.sqrt(16)
        return f"The square root of 16 is {val}"
[END]###########################################################################

[START]#########################################################################
    import math

    def compute_2() -> str:
        val = math.sqrt(25)
        return f"The square root of 25 is {val}"
[END]###########################################################################

When processed:

• The code blocks are gathered and executed. Both sqrt_of_16() and compute_2() are defined.
EMBED::sqrt_of_16 is replaced by “The square root of 16 is 4.0.”
EMBED::compute_2 is replaced by “The square root of 25 is 5.0.”

Resulting output might look like:

Markdown is fantastic! The square root of 16 is 4.0. Another result: The square root of 25 is 5.0.

3.4. Example with DataFrames

You can also return a DataFrame—maybe you fetch data from rgwfuncs.load_data_from_query():

Here’s a DataFrame preview: `EMBED::fetch_data`

[START]#########################################################################
    import pandas as pd
    from rgwfuncs import load_data_from_query

    def fetch_data() -> pd.DataFrame:
        df = load_data_from_query("mydb", "SELECT * FROM mytable LIMIT 20")
        return df
[END]###########################################################################

If fetch_data() returns a DataFrame (e.g. 20 rows × N columns), md2ltx converts it to a Markdown pipe table. The final output appears as a table, truncated if there are more than 10 rows.

3.5. Multiple Blocks, Many Functions

It’s perfectly valid to define multiple functions in one code block, or spread them among several:

[START]#########################################################################
    import math

    def foo() -> str:
        return "Hello from foo"

    def bar() -> str:
        return "Hello from bar"
[END]###########################################################################

[START]#########################################################################
    def baz() -> str:
        return "Hello from baz"
[END]###########################################################################

All three functions (foo, bar, baz) become available, and you embed them by writing:

`EMBED::foo`, `EMBED::bar`, or `EMBED::baz`.

3.6. Error Handling

If your code has a Python syntax error or cannot be executed, md2ltx prints an “[Error executing combined code: …]” message in the logs and all the affected functions remain undefined. Any placeholders referencing them become “[Error: No function named 'xyz' has been defined in the code blocks]”.


4. General Pandoc Tranformations

md2ltx uses Pandoc to transform Markdown files into LaTeX, which pdflatex then uses to generate a final PDF. This workflow supports most of Markdown’s core syntax plus many Pandoc extensions. Below is a high-level overview of how Pandoc typically converts various Markdown constructs into LaTeX. For full details, refer to Pandoc’s official documentation.


4.1. Headings

Markdown

# Heading 1  
## Heading 2  
### Heading 3

Pandoc → LaTeX

\section{Heading 1}  
\subsection{Heading 2}  
\subsubsection{Heading 3}

Pandoc chooses \section, \subsection, etc. based on the heading level. It also supports underline-style Markdown headings with “===” or “---” for level-one and level-two headings.


4.2. Emphasis & Strong Emphasis

Markdown

*emphasis* or _emphasis_  
**strong emphasis** or __strong emphasis__

Pandoc → LaTeX

\emph{emphasis}  
\textbf{strong emphasis}

4.3. Inline Code

Markdown

`inline code`

Pandoc → LaTeX

\texttt{inline code}

4.4. Code Blocks

Markdown (fenced)

```  
a = 1  
b = 2  
```

Pandoc → LaTeX (by default)

\begin{verbatim}  
a = 1  
b = 2  
\end{verbatim}

With certain options, Pandoc can use different LaTeX environments (e.g., listings).


4.5. Lists

Unordered (Markdown)

- item 1  
- item 2  
- item 3

Pandoc → LaTeX

\begin{itemize}  
\item item 1  
\item item 2  
\item item 3  
\end{itemize}

Ordered (Markdown)

1. item 1  
2. item 2

Pandoc → LaTeX

\begin{enumerate}  
\item item 1  
\item item 2  
\end{enumerate}

4.6. Links & Images

Link (Markdown)

[Pandoc](https://pandoc.org)

Pandoc → LaTeX

\href{https://pandoc.org}{Pandoc}

Image (Markdown)

![Alt text](image.png)

Pandoc → LaTeX

\includegraphics{image.png}

By default, \includegraphics is placed without floats. You can add captions or figure environments using extended syntax or metadata.


4.7. Blockquotes

Markdown

> This is a blockquote.

Pandoc → LaTeX

\begin{quote}  
This is a blockquote.  
\end{quote}

4.8. Horizontal Rules

Markdown

---  
***  
___

Pandoc → LaTeX

\hrule

4.9. Footnotes (Pandoc Extension)

Markdown

This is some text with a footnote.[^1]

[^1]: This is the footnote text.

Pandoc → LaTeX

This is some text with a footnote.\footnote{This is the footnote text.}

4.10. Tables

Markdown (simple pipe table)

| Column1 | Column2 |  
|---------|---------|  
| Val1    | Val2    |  
| Val3    | Val4    |

Pandoc → LaTeX

\begin{table}  
\centering  
\begin{tabular}{ll}  
\hline  
Column1 & Column2 \\  
\hline  
Val1    & Val2    \\  
Val3    & Val4    \\  
\hline  
\end{tabular}  
\end{table}

4.11. Math & LaTeX Blocks

Inline Math

$E = mc^2$

Pandoc → LaTeX

\(E = mc^2\)

Display Math

$$  
E = mc^2  
$$

Pandoc → LaTeX

\[  
E = mc^2  
\]

4.12. Citations & Bibliographies

Pandoc can handle citations if you provide a bibliography file. A reference like [@smith2009] can become \cite{smith2009} or \autocite depending on the style and Pandoc’s command-line options.


4.14. Raw LaTeX

Pandoc passes raw LaTeX through if you’re converting to LaTeX or PDF. For example:

\newpage

remains \newpage in the output.


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

md2ltx-0.0.56.tar.gz (12.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

md2ltx-0.0.56-py3-none-any.whl (16.4 kB view details)

Uploaded Python 3

File details

Details for the file md2ltx-0.0.56.tar.gz.

File metadata

  • Download URL: md2ltx-0.0.56.tar.gz
  • Upload date:
  • Size: 12.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for md2ltx-0.0.56.tar.gz
Algorithm Hash digest
SHA256 67d6ca27e3ae61ecf188e1b23b24dfa091720b031d8d3a21226d1fc5e1bc93b9
MD5 b1648c0823ff414b91691adf03d60c88
BLAKE2b-256 6d92e5ac837b1b9670099b9df777f6ed475db27303805fafce98f2d44bc63c68

See more details on using hashes here.

File details

Details for the file md2ltx-0.0.56-py3-none-any.whl.

File metadata

  • Download URL: md2ltx-0.0.56-py3-none-any.whl
  • Upload date:
  • Size: 16.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for md2ltx-0.0.56-py3-none-any.whl
Algorithm Hash digest
SHA256 52b91a44f0a81ccf2a66b863711acb7c53fbbfdd3126eb7420637a4c79a7f368
MD5 4e07d6eca77576b2c9b02d9d69802f4d
BLAKE2b-256 b438c12ed18fbfa72c2b2ed6f7e25138db447c695f0e25e7a1646bad4bb55776

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page