Skip to main content

Convert Markdown (.md) files to PDF ...

Project description

md2ltx

A command-line tool for converting Markdown to LateX-formatted PDF via Pandoc. Requires a pip virtual environment in Ubuntu/ Debian based OS.

1. Quickstart

1.1. Installation

pip install md2ltx; md2ltx --install_dependencies

1.2. Usage

md2ltx [source.md] [output.pdf] [--open] [--help]

source_file: Path to the input Markdown (.md) file.

output_pdf (optional): Path to the output PDF file. If omitted, a default name is derived from the source file, and the working directory is assumed to be the path.

--open: Open the resulting PDF in the system’s default viewer.

--template template_name: Specify a built-in templates by name. Available templates: "one-column-article", "two-column-article", "report", "slides", "letter").

--help: Access documentation.


2. Templates

md2ltx can inject Markdown content into a LaTeX “template” that defines the overall look and structure of the PDF. You can choose from these built-in templates:

• "one-column-article"
• "two-column-article"
• "report"
• "slides"
• "letter"

When you run md2ltx (or Pandoc directly), you can specify the template with the “--template” flag. Pandoc then loads that template, replacing special variables like $title$, $author$, $date$, and $body$ with metadata and the converted Markdown content.

2.1. Common Fields in the YAML Metadata

• one-column-article/ two-column-article / report:

  • title: Title of your document
  • author: Author name(s)
  • date: Date displayed below the author(s)

• slides (Beamer presentations):

  • title: Presentation title
  • subtitle: (Optional) subtitle for your presentation
  • author: Presenter name(s)
  • date: Date (often included on the title slide)

• letter:

  • author: Sender’s name (also used in \signature)
  • address: Sender’s address
  • date: Date displayed in the letter
  • recipient: Recipient name or address
  • greeting: Opening phrase (e.g., “Dear John,”)
  • closing: Closing phrase (e.g., “Regards,”)

2.2. Using the Templates

Pandoc reads these fields from a YAML block at the top of your Markdown file. For example:

---
title: "My Awesome Title"
author: "John Doe"
date: "October 4, 2023"
---

# Sample Document

This is a **Markdown** document to test `compile_markdown_to_pdf` from `main.py`.

## Advantages of Markdown

- Easy to write
- Human-readable
- Widely supported

## Conclusion

Markdown is fantastic!

When you run md2ltx:

md2ltx my_document.md --template=two-column-article

Pandoc loads the chosen “two-column-article” template, substitutes $title$, $author$, $date$, and $body$, and then compiles a PDF. The same process applies to any of the provided templates.


3. Embedded Python Code with EMBED::function_name

md2ltx supports executing Python code blocks alongside your Markdown, and replacing placeholders of the form EMBED::function_name, with the string returned by calling function_name() in Python.

Each code block is enclosed between [START]######## and [END]######## (at least three “#” characters). All code in those blocks is executed once in the same environment, meaning you can define multiple functions in a single block or spread them across multiple blocks. As soon as the code is executed, any functions you defined become available for embedding.

3.1. General Steps

3.1.1. In your Markdown text, put placeholders where you want to inject dynamic data:

“Markdown is awesome! `EMBED::foo`.”  

Each placeholder references a function name that you’ll define in a code block.

3.1.2. Define your code blocks somewhere in the same Markdown file, delimited by [START]…[END]:

[START]#####################################################################  
    def foo() -> str:  
        return "Hello from foo()"  

    def bar() -> str:  
        return "Hello from bar()"  
[END]#####################################################################  

You can define as many functions in a block as you want.

3.1.3. At build time, md2ltx collects all your code blocks, removes common indentation so Python sees them as valid top-level code, executes them, and records the function objects.

3.1.4. Every placeholder EMBED::function_name in your Markdown is then replaced by the return value of calling that function.

3.2. Requirements

• Each function you plan to embed must take no parameters and return a string (or something convertible to string, or a pandas DataFrame).
• If your function returns a pandas.DataFrame, md2ltx auto-converts it to a Markdown table (truncated to the first 5 and last 5 rows if there are more than 10).
• md2ltx automatically provides math, pandas (as pd), numpy (as np), rgwfuncs, and datetime in the execution environment, so you don’t need to import them yourself.
• Keep your code blocks consistently indented. By default, if each line is indented by four spaces, md2ltx will remove those four leading spaces and preserve deeper indentation so that nested code within your function remains valid Python.

3.3. Example

Here’s a short Markdown snippet:

Markdown is fantastic! `EMBED::sqrt_of_16`. Another result: `EMBED::compute_2`.

[START]#########################################################################
    def sqrt_of_16() -> str:
        val = math.sqrt(16)
        return f"The square root of 16 is {val}"
[END]###########################################################################

[START]#########################################################################
    def compute_2() -> str:
        val = math.sqrt(25)
        return f"The square root of 25 is {val}"
[END]###########################################################################

When processed:

• The code blocks are aggregated and executed. Both sqrt_of_16() and compute_2() are defined.
EMBED::sqrt_of_16 is replaced by “The square root of 16 is 4.0.”
EMBED::compute_2 is replaced by “The square root of 25 is 5.0.”

Resulting output might look like:

Markdown is fantastic! The square root of 16 is 4.0. Another result: The square root of 25 is 5.0.  

3.4. Example with Dataframes

You can also return a DataFrame—maybe you fetch data from rgwfuncs.load_data_from_query():

Here’s a DataFrame preview: `EMBED::fetch_data`

[START]#########################################################################
    def fetch_data() -> pd.DataFrame:
        df = rgwfuncs.load_data_from_query("mydb", "SELECT * FROM mytable LIMIT 20")
        return df
[END]###########################################################################

If fetch_data() returns a DataFrame (20 rows × N columns), md2ltx converts it to a Markdown pipe table. The final output appears as a table truncated if there are more than 10 rows.

3.5. Multiple Blocks, Many Functions

It’s perfectly valid to define multiple functions in one code block, or spread them among several. For instance:

[START]###################
    def foo() -> str:
        return "Hello from foo"

    def bar() -> str:
        return "Hello from bar"
[END]###################

[START]###################
    def baz() -> str:
        return "Hello from baz"
[END]###################

All three functions (foo, bar, baz) become available, and you embed them by writing:

`EMBED::foo`, `EMBED::bar`, or `EMBED::baz`.

3.6. Error Handling

If your code has a Python syntax error or cannot be executed, md2ltx prints an “[Error executing combined code: …]” message in the logs and all the affected functions remain undefined. Any placeholders referencing them become “[Error: No function named 'xyz' has been defined in the code blocks]”.


4. General Pandoc Tranformations

md2ltx uses Pandoc to transform Markdown files into LaTeX, which pdflatex then uses to generate a final PDF. This workflow supports most of Markdown’s core syntax plus many Pandoc extensions. Below is a high-level overview of how Pandoc typically converts various Markdown constructs into LaTeX. For full details, refer to Pandoc’s official documentation.


4.1. Headings

Markdown

# Heading 1  
## Heading 2  
### Heading 3

Pandoc → LaTeX

\section{Heading 1}  
\subsection{Heading 2}  
\subsubsection{Heading 3}

Pandoc chooses \section, \subsection, etc. based on the heading level. It also supports underline-style Markdown headings with “===” or “---” for level-one and level-two headings.


4.2. Emphasis & Strong Emphasis

Markdown

*emphasis* or _emphasis_  
**strong emphasis** or __strong emphasis__

Pandoc → LaTeX

\emph{emphasis}  
\textbf{strong emphasis}

4.3. Inline Code

Markdown

`inline code`

Pandoc → LaTeX

\texttt{inline code}

4.4. Code Blocks

Markdown (fenced)

```  
a = 1  
b = 2  
```

Pandoc → LaTeX (by default)

\begin{verbatim}  
a = 1  
b = 2  
\end{verbatim}

With certain options, Pandoc can use different LaTeX environments (e.g., listings).


4.5. Lists

Unordered (Markdown)

- item 1  
- item 2  
- item 3

Pandoc → LaTeX

\begin{itemize}  
\item item 1  
\item item 2  
\item item 3  
\end{itemize}

Ordered (Markdown)

1. item 1  
2. item 2

Pandoc → LaTeX

\begin{enumerate}  
\item item 1  
\item item 2  
\end{enumerate}

4.6. Links & Images

Link (Markdown)

[Pandoc](https://pandoc.org)

Pandoc → LaTeX

\href{https://pandoc.org}{Pandoc}

Image (Markdown)

![Alt text](image.png)

Pandoc → LaTeX

\includegraphics{image.png}

By default, \includegraphics is placed without floats. You can add captions or figure environments using extended syntax or metadata.


4.7. Blockquotes

Markdown

> This is a blockquote.

Pandoc → LaTeX

\begin{quote}  
This is a blockquote.  
\end{quote}

4.8. Horizontal Rules

Markdown

---  
***  
___

Pandoc → LaTeX

\hrule

4.9. Footnotes (Pandoc Extension)

Markdown

This is some text with a footnote.[^1]

[^1]: This is the footnote text.

Pandoc → LaTeX

This is some text with a footnote.\footnote{This is the footnote text.}

4.10. Tables

Markdown (simple pipe table)

| Column1 | Column2 |  
|---------|---------|  
| Val1    | Val2    |  
| Val3    | Val4    |

Pandoc → LaTeX

\begin{table}  
\centering  
\begin{tabular}{ll}  
\hline  
Column1 & Column2 \\  
\hline  
Val1    & Val2    \\  
Val3    & Val4    \\  
\hline  
\end{tabular}  
\end{table}

4.11. Math & LaTeX Blocks

Inline Math

$E = mc^2$

Pandoc → LaTeX

\(E = mc^2\)

Display Math

$$  
E = mc^2  
$$

Pandoc → LaTeX

\[  
E = mc^2  
\]

4.12. Citations & Bibliographies

Pandoc can handle citations if you provide a bibliography file. A reference like [@smith2009] can become \cite{smith2009} or \autocite depending on the style and Pandoc’s command-line options.


4.14. Raw LaTeX

Pandoc passes raw LaTeX through if you’re converting to LaTeX or PDF. For example:

\newpage

remains \newpage in the output.


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

md2ltx-0.0.32.tar.gz (15.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

md2ltx-0.0.32-py3-none-any.whl (16.5 kB view details)

Uploaded Python 3

File details

Details for the file md2ltx-0.0.32.tar.gz.

File metadata

  • Download URL: md2ltx-0.0.32.tar.gz
  • Upload date:
  • Size: 15.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for md2ltx-0.0.32.tar.gz
Algorithm Hash digest
SHA256 feef2ed2d803377155858336787330aa26478b136f18054e7707d9c0b220077c
MD5 1b850b408518420120241cd39b37bb23
BLAKE2b-256 fe4b31e54eca9c28172fd36eb66e6cda8f6cf89257bff4c31ba78a194eb1031d

See more details on using hashes here.

File details

Details for the file md2ltx-0.0.32-py3-none-any.whl.

File metadata

  • Download URL: md2ltx-0.0.32-py3-none-any.whl
  • Upload date:
  • Size: 16.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for md2ltx-0.0.32-py3-none-any.whl
Algorithm Hash digest
SHA256 8fadac5e482ae0157e8c76ba4e36f322208c0bc457bf5245fd91a38bec8668bb
MD5 792a136e6c54efd7892cf41f9ef6711c
BLAKE2b-256 c258d6659eabee5445ec106eac1985df095630dbd6b3f4900263eca25e66ca05

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page