Skip to main content

Thin wrapper for pandoc.

Project description

Build Status PyPI version conda version

pypandoc provides a thin wrapper for pandoc, a universal document converter.

Installation

pypandoc uses pandoc, so it needs an available installation of pandoc. For some common cases (wheels, conda packages), pypandoc already includes pandoc (and pandoc_citeproc) in it’s prebuilt package.

If pandoc is already installed (pandoc is in the PATH), pypandoc uses the version with the higher version number and if both are the same, the already installed version. You can point to a specific version by setting the environment variable PYPANDOC_PANDOC to the full path to the pandoc binary (PYPANDOC_PANDOC=/home/x/whatever/pandoc or PYPANDOC_PANDOC=c:\pandoc\pandoc.exe). If this environment variabel is set, this is the only place where pandoc is searched for.

To use pandoc filters, you must have the relevant filter installed on your machine.

Installing via pip

Install via pip install pypandoc

Prebuilt wheels for Windows and Mac OS X include pandoc. If there is no prebuilt binary available, you have to install pandoc yourself.

If you use Linux and have your own wheelhouse, you can build a wheel which includes pandoc with python setup.py download_pandoc; python setup.py bdist_wheel. Be aware that this works only on 64bit intel systems, as we only download it from the official source.

Installing via conda

Install via conda install -c https://conda.anaconda.org/janschulz pypandoc.

You can also add the channel to your conda config via conda config --add channels https://conda.anaconda.org/janschulz. This makes it possible to use conda install pypandoc directly and also lets you update via conda update pypandoc.

Conda packages include pandoc and are available for py2.7, py3.4 and py3.5, for Windows (32bit and 64bit), Mac OS X (64bit) and Linux (64bit).

Installing pandoc

pandoc is available for many different platforms:

Usage

The basic invocation looks like this: pypandoc.convert('input', 'output format'). pypandoc tries to infer the type of the input automatically. If it’s a file, it will load it. In case you pass a string, you can define the format using the parameter. The example below should clarify the usage:

import pypandoc

output = pypandoc.convert('somefile.md', 'rst')

# alternatively you could just pass some string to it and define its format
output = pypandoc.convert('#some title', 'rst', format='md')
# output == 'some title\r\n==========\r\n\r\n'

If you pass in a string (and not a filename), convert expects this string to be unicode or utf-8 encoded bytes. convert will always return a unicode string.

It’s also possible to directly let pandoc write the output to a file. This is the only way to convert to some output formats (e.g. odt, docx, epub, epub3, pdf). In that case convert() will return an empty string.

import pypandoc

output = pypandoc.convert('somefile.md', 'docx', outputfile="somefile.docx")
assert output == ""

In addition to format, it is possible to pass extra_args. That makes it possible to access various pandoc options easily.

output = pypandoc.convert(
    '<h1>Primary Heading</h1>',
    'md', format='html',
    extra_args=['--atx-headers'])
# output == '# Primary Heading\r\n'
output = pypandoc.convert(
    '# Primary Heading',
    'html', format='md',
    extra_args=['--base-header-level=2'])
# output == '<h2 id="primary-heading">Primary Heading</h2>\r\n'

pypandoc now supports easy addition of pandoc filters.

filters = ['pandoc-citeproc']
pdoc_args = ['--mathjax',
             '--smart']
output = pd.convert(source=filename,
                    to='html5',
                    format='md',
                    extra_args=pdoc_args,
                    filters=filters)

Please pass any filters in as a list and not a string.

Please refer to pandoc -h and the official documentation for further details.

Dealing with Formatting Arguments

Pandoc supports custom formatting though -V parameter. In order to use it through pypandoc, use code such as this:

output = pypandoc.convert('demo.md', 'pdf', outputfile='demo.pdf',
  extra_args=['-V', 'geometry:margin=1.5cm'])

Note that it’s important to separate -V and its argument within a list like that or else it won’t work. This gotcha has to do with the way subprocess.Popen works.

Getting Pandoc Version

As it can be useful sometimes to check what Pandoc version is available at your system, pypandoc provides an utility for this. Example:

version = pypandoc.get_pandoc_version()

Contributing

Contributions are welcome. When opening a PR, please keep the following guidelines in mind:

  1. Before implementing, please open an issue for discussion.

  2. Make sure you have tests for the new logic.

  3. Make sure your code passes flake8 pypandoc.py tests.py

  4. Add yourself to contributors at README.md unless you are already there. In that case tweak your contributions.

Note that for citeproc tests to pass you’ll need to have pandoc-citeproc installed. If you installed a prebuilt wheel or conda package, it is already included.

Contributors

  • Valentin Haenel - String conversion fix

  • Daniel Sanchez - Automatic parsing of input/output formats

  • Thomas G. - Python 3 support

  • Ben Jao Ming - Fail gracefully if pandoc is missing

  • Ross Crawford-d’Heureuse - Encode input in UTF-8 and add Django example

  • Michael Chow - Decode output in UTF-8

  • Janusz Skonieczny - Support Windows newlines and allow encoding to be specified.

  • gabeos - Fix help parsing

  • Marc Abramowitz - Make setup.py fail hard if pandoc is missing, Travis, Dockerfile, PyPI badge, Tox, PEP-8, improved documentation

  • Daniel L. - Add extra_args example to README

  • Amy Guy - Exception handling for unicode errors

  • Florian Eßer - Allow Markdown extensions in output format

  • Philipp Wendler - Allow Markdown extensions in input format

  • Jan Schulz - Handling output to a file, Travis to work on newer version of Pandoc, return code checking, get_pandoc_version. Helped to fix the Travis build.

  • Aaron Gonzales - Added better filter handling

  • David Lukes - Enabled input from non-plain-text files and made sure tests clean up template files correctly if they fail

  • valholl - Set up licensing information correctly and include examples to distribution version

  • Cyrille Rossant - Fixed bug by trimming out stars in the list of pandoc formats. Helped to fix the Travis build.

  • Paul Osborne - Don’t require pandoc to install pypandoc.

  • Felix Yan - Added installation instructions for Arch Linux.

License

pypandoc is available under MIT license. See LICENSE for more details. pandoc itself is available under the GPL2 license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypandoc-1.1.3.zip (20.1 MB view details)

Uploaded Source

Built Distributions

pypandoc-1.1.3-cp34-none-win_amd64.whl (20.1 MB view details)

Uploaded CPython 3.4 Windows x86-64

pypandoc-1.1.3-cp34-none-win32.whl (20.1 MB view details)

Uploaded CPython 3.4 Windows x86

pypandoc-1.1.3-cp34-cp34m-macosx_10_5_x86_64.whl (29.2 MB view details)

Uploaded CPython 3.4m macOS 10.5+ x86-64

pypandoc-1.1.3-cp27-none-win_amd64.whl (20.1 MB view details)

Uploaded CPython 2.7 Windows x86-64

pypandoc-1.1.3-cp27-none-win32.whl (20.1 MB view details)

Uploaded CPython 2.7 Windows x86

pypandoc-1.1.3-cp27-none-macosx_10_5_x86_64.whl (29.2 MB view details)

Uploaded CPython 2.7 macOS 10.5+ x86-64

File details

Details for the file pypandoc-1.1.3.zip.

File metadata

  • Download URL: pypandoc-1.1.3.zip
  • Upload date:
  • Size: 20.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pypandoc-1.1.3.zip
Algorithm Hash digest
SHA256 ed2048a655f7bd2f80dc84d9fd81df113c78f3b31aca68dae17c1100a2c05569
MD5 771f376bf9c936a90159cd94235998c2
BLAKE2b-256 4168e985491370cc44a063ac5c70232bf72d7675fa7539a86fa7fa08257bda05

See more details on using hashes here.

File details

Details for the file pypandoc-1.1.3-cp34-none-win_amd64.whl.

File metadata

File hashes

Hashes for pypandoc-1.1.3-cp34-none-win_amd64.whl
Algorithm Hash digest
SHA256 92c2f48779f8c524f9da8cc4f974e5c72dc9cb1cb562e36116e977a7a91c28fb
MD5 1a2b474fb3a2cb7ba542efffb4ceb7cd
BLAKE2b-256 6d92fb9af261d4cc3f441b42c2c410afa3b3391dcb41d8745c3219b26e1b9e8a

See more details on using hashes here.

File details

Details for the file pypandoc-1.1.3-cp34-none-win32.whl.

File metadata

File hashes

Hashes for pypandoc-1.1.3-cp34-none-win32.whl
Algorithm Hash digest
SHA256 96de3ecf99155028d633507a75f1937bc007055183ec425f3419cf179e3cc92d
MD5 1b6d53eebd152d272f85a6e0d9c38159
BLAKE2b-256 278cbf094382bc882fc760b8173c864a0d31c06322ca9c423ba51beb238e8ec2

See more details on using hashes here.

File details

Details for the file pypandoc-1.1.3-cp34-cp34m-macosx_10_5_x86_64.whl.

File metadata

File hashes

Hashes for pypandoc-1.1.3-cp34-cp34m-macosx_10_5_x86_64.whl
Algorithm Hash digest
SHA256 28dbf52a44a5b931a29c3d7dd2dc1907c83f19ea3c4c60378ae5f9648db2ebc9
MD5 ee412514833e6f653ea3066c5ebee673
BLAKE2b-256 c48f863dd21dafc5bffe884ca45a9f8c81924605795cf49802e847ef9251a18f

See more details on using hashes here.

File details

Details for the file pypandoc-1.1.3-cp27-none-win_amd64.whl.

File metadata

File hashes

Hashes for pypandoc-1.1.3-cp27-none-win_amd64.whl
Algorithm Hash digest
SHA256 eef7e5e22da384b48281cf93eb7242118d96f00a32b67f962ce024a30914bfde
MD5 3fb56ff8929bdc3a623f3d19c2f87827
BLAKE2b-256 7273eb68763d79808ab97c22b9449185d14f1e5bf10758ad9109ee3537004bb0

See more details on using hashes here.

File details

Details for the file pypandoc-1.1.3-cp27-none-win32.whl.

File metadata

File hashes

Hashes for pypandoc-1.1.3-cp27-none-win32.whl
Algorithm Hash digest
SHA256 d22d376039005098d95693aa47f8ce9dba9a62947c2a45102362c911a390d833
MD5 91c42cb0e4d95a1074c7859301372b36
BLAKE2b-256 46e352e82cabbb38999c3828ac4f2e2aadd9bbeb26ae4cc153716ebc0eb3976a

See more details on using hashes here.

File details

Details for the file pypandoc-1.1.3-cp27-none-macosx_10_5_x86_64.whl.

File metadata

File hashes

Hashes for pypandoc-1.1.3-cp27-none-macosx_10_5_x86_64.whl
Algorithm Hash digest
SHA256 9db897b598a06ef68dadc70c10d0fb8b399d03b33c993bee4f5e10273d0834e8
MD5 b52ea2d61676b2ec4d610ab54cc1d2ff
BLAKE2b-256 8600842d46211676cb2e55eada357e3bc3eb935e8b6398ff64b8c287932d646f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page