Skip to main content

Alternative e-mail parser for Paperless-ngx

Project description

Alternative e-mail parser for Paperless-ngx

This parser can be used as an alternative to the default e-mail parser provided by Paperless-ngx.

Features

  • If consumption scope isn't EVERYTHING (i.e. parse mail and attachments separately) include attachments in the archived document where possible. If an attachment can't be converted to pdf, include a corresponding note in the archived version. Signature attachments (mime type: application/x-pkcs7-signature) are always excluded.

  • Place a header in front of the pdf containing the text version of the e-mail as well as in front of the html-version.

  • Only include either the text or html version of the e-mail in the archived document. The PDF-Layout values for Paperless-ngx PdfLayout.TEXT_HTML and PdfLayout.HTML_TEXT are therefore interpreted as "if available, use text, else use html version" resp. "if available, use html, else use text version".

  • Preserve original html e-mail content as far as possible. PDF/A version is created using ghostscript preserving links included in the e-mail.

Prerequisites

All python modules used by this parser should already be included in your Paperless-ngx installation.

Ghostscript ist used for creating PDF/A version of archived file if requested, see

https://docs.paperless-ngx.com/configuration/#PAPERLESS_GS_BINARY

https://docs.paperless-ngx.com/configuration/#PAPERLESS_OCR_COLOR_CONVERSION_STRATEGY

Installation

  1. Install using PyPI

    pip install paperlessngx-mail-parser

    For docker based installations use custom container initialization as described here: https://docs.paperless-ngx.com/advanced_usage/#custom-container-initialization

    Place a script with the following content in the directory for your container initialization scripts and make it executable:

    #!/bin/bash
    pip install paperlessngx-mail-parser
    
  2. Add this parser to the PAPERLESS_APPS environment variable, e.g. in your paperless.conf: PAPERLESS_APPS="paperlessngx-mail-parser.apps.MailparserConfig"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

paperlessngx_mail_parser-2.0.2.tar.gz (19.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

paperlessngx_mail_parser-2.0.2-py3-none-any.whl (21.4 kB view details)

Uploaded Python 3

File details

Details for the file paperlessngx_mail_parser-2.0.2.tar.gz.

File metadata

File hashes

Hashes for paperlessngx_mail_parser-2.0.2.tar.gz
Algorithm Hash digest
SHA256 5762ebe875f4211e1ee1e173b121ed971da55e26a74261e41c440a1c95fa066a
MD5 24fbe84df77c15a43cb49d748abbd5ec
BLAKE2b-256 b0903bdda770605f8468f7201922d97793c4fab663e85e5e22f1c213af84203d

See more details on using hashes here.

File details

Details for the file paperlessngx_mail_parser-2.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for paperlessngx_mail_parser-2.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 c574d8fedd8fa11a212ca094da0e2299a07750af86629e7baa234c62171cb5a2
MD5 13b5d4e5279a99b45afa1633c044cabb
BLAKE2b-256 730f8260a0598d331887da62bad22a9257582bee30fe4ae65b4bb8bbad38a2ec

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page