Skip to main content

Combine LaTeX docs into a single PDF

Project description

Latest PyPI Version License Supported Python Versions Format Downloads

This tool automates compiling LaTeX document collections (for working papers, proceedings, etc.) into a single combined PDF file using the pdfpages package.

Create an INI file giving the name and parts of your collection and build it with the latexpages command-line utility.

latexpages will start one parallel typesetting process per core for speedup.

As each part is typeset independently, this allows to combine documents that cannot be merged into a single master-document (use of incompatible packages/options, latex-dvips-ps2pdf vs. pdflatex, etc.).

Installation

This package runs under Python 2.7 and 3.3+, use pip to install:

$ pip install latexpages

The compilation requires a TeX distribution (e.g. TeX Live or MikTeX) and either latexmk or MikTeX’s texify utility being available on your system.

The optional automatic page numbering (see below) requires the pdftk command-line utility.

Usage

Create a working directory holding your plain-text INI file. Put all your documents into subdirectories with the same name as the corresponding .tex file:

collection/
    collection.ini
    article1/
        article1.tex
        references.bib
    article2/
        article2.tex
        ...

Note: the directory names cannot contain spaces.

Edit the INI file to configure the parts, their order and various other options:

[make]
name = MY_COLL
directory = _output

[parts]
mainmatter =
  article1
  article2

The following will typeset all parts, copy their PDFs to the output directory, and combine them into a single PDF. By default, this also creates a 2-up version:

$ latexpages collection.ini

Check the example directory in the source distribution for a working complete example.

Invocation

Check the usage of the latexpages command:

$ latexpages --help
usage: latexpages [-h] [--version] [-c {latexmk,texify}] [--keep]
                  filename [processes]

Compiles and combines LaTeX docs into a single PDF file

positional arguments:
  filename             INI file configuring the parts and output options
  processes            number of parallel processes (default: one per core)

optional arguments:
  -h, --help           show this help message and exit
  --version            show program's version number and exit
  -c {latexmk,texify}  use latexmk.pl or texify (default: guess from platform)
  --keep               keep combination document(s) and their auxiliary files

Pagination

The following command goes trough all main documents and updates the page number in the first \setcounter{page}{<number>} line of the source according to the page count of the preceding documents’ compiled PDFs.

$ latexpages-paginate collection.ini

Make sure the pdftk executable from pdftk is available on your systems’ path.

To use a different pattern for finding the \setcounter lines, set the update option in the paginate section of your INI file to a suitable regular expression.

[paginate]
update = \\setcounter\{page\}\{(\d+)\}

To also update the page numbers in your table of contents, put the corresponding part name in the paginate section of your INI file.

Directory structure:

collection/
    collection.ini
    prelims/
        prelims.tex
    article1/
        article1.tex
        ...

Configuration:

[parts]
frontmatter =
  prelims
mainmatter =
  article1
  article2

[paginate]
contents = prelims

By default, latexpages-paginate will search and update \startpage{<number>} lines in the source. To use this as marker, define and use a corresponding LaTeX-command in your table of contents, e.g. \newcommand{\startpage}[1]{#1}. A complete example is in the example directory

To use a different pattern for finding the table of contents lines, change the regular expression in the replace option.

[paginate]
replace = \\startpage\{(\d+)\}

Check the usage of the latexpages-paginate command:

$ latexpages-paginate --help
usage: latexpages-paginate [-h] filename

Computes and updates start page numbers in compiled parts and contents

positional arguments:
  filename    INI file configuring the parts and paginate options

optional arguments:
  -h, --help  show this help message and exit
  --version   show program's version number and exit

Advanced options

Below are annotated INI file sections showing the default options for all available configuration settings.

The make section sets the names and file name templates for the results:

[make]
name = COLL              # name of the resulting PDF file
directory = _output      # directory to copy/put the results

two_up = __%(name)s_2up  # name of the 2-up version PDF file
make_two_up = true       # create a 2-up version (yes/no)

# templates for the name of the copied part PDF files for each
# of the three possible groups (frontmatter, mainmatter, extras)
# available substitutions:
#   (note that the percent-sign must be doubled here)
#   %%(name)s    name of the result file (see above)
#   %%(part)s    name of the part directory/filename
#   %%(index0)d  zero-based index inside group
#   %%(index1)d  one-based index inside group

frontmatter = _%%(name)s_%%(part)s
mainmatter = %%(name)s_%%(index1)02d_%%(part)s
extras = %(frontmatter)s

The parts section gives space-delimited lists of parts to compile and/or include:

[parts]
frontmatter =  # include at the beginning, roman page numbering
mainmatter =   # include after frontmatter, arabic page numbering
extras =       # compile and copy only (e.g. a separate cover page)

use_dvips =    # use latex -> dvips -> ps2pdf for these parts
               # instead of pdflatex (e.g. pstricks usage)

# pull the first mainmatter part into the roman page numbering area
first_to_front = false

The substitute section fills the template that is used to create the combination document. With the default template, this allows to set the PDF meta data:

[substitute]
# options for \usepackage{hyperref}
author =       # pdfauthor
title =        # pdftitle
subject =      # pdfsubject
keywords =     # pdfkeywords

The template section allows to customize the details of the combination document:

[template]
filename =         # use a custom template
filename_two_up =  # different template for 2-up version

class = scrartcl   # use this documentclass

# documentclass options for combination and 2-up version
options = paper=a5
options_two_up = paper=a4,landscape

# includepdfmerge options for combination and 2-up version
include = fitpaper
include_two_up = nup=2x1,openright

The compile section allows to change the invocation options of the compilation commands used.

[compile]
latexmk = -silent                   # less verbose

texify = --batch --verbose --quiet  # halt on error, less verbose
# only used with texify (latexmk calls these automatically)
dvips = -q
ps2pdf =

Finally, the paginate section controls latexpages-paginate (see above).

[paginate]
update = \\setcounter\{page\}\{(\d+)\}  # search/update regex
contents =                              # part with table of contents
replace = \\startpage\{(\d+)\}          # toc line search/update regex

See also

License

latexpages is distributed under the MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

latexpages-0.3.2.zip (31.2 kB view details)

Uploaded Source

Built Distribution

latexpages-0.3.2-py2.py3-none-any.whl (20.6 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file latexpages-0.3.2.zip.

File metadata

  • Download URL: latexpages-0.3.2.zip
  • Upload date:
  • Size: 31.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for latexpages-0.3.2.zip
Algorithm Hash digest
SHA256 a4844cacd61dc29ec543cadf135e14ad9e5953ff065229ed10efa3e9758dd0e6
MD5 d69726b17c69502c236d93857956a9d5
BLAKE2b-256 bdd64a8ca01036ce9b1c0004a2a68311857ee4a93271d75aff063a584e200163

See more details on using hashes here.

File details

Details for the file latexpages-0.3.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for latexpages-0.3.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 f5076f5ea8296fa13d87abca0d32cd193c2bb89493ddda51f3f643ff382e22ae
MD5 3054f5b5a8801430930a611dc65dff55
BLAKE2b-256 e464303781b9578acf34162126d30c5567bf9788ef778365f9e963039b3d9c5d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page