Skip to main content

Alternative e-mail parser for Paperless-ngx

Project description

Alternative e-mail parser for Paperless-ngx

This parser can be used as an alternative to the default e-mail parser provided by Paperless-ngx.

Features

  • If consumption scope isn't EVERYTHING (i.e. parse mail and attachments separately) include attachments in the archived document where possible. If an attachment can't be converted to pdf, include a corresponding note in the archived version. Signature attachments (mime type: application/x-pkcs7-signature) are always excluded.

  • Place a header in front of the pdf containing the text version of the e-mail as well as in front of the html-version.

  • Only include either the text or html version of the e-mail in the archived document. The PDF-Layout values for Paperless-ngx PdfLayout.TEXT_HTML and PdfLayout.HTML_TEXT are therefore interpreted as "if available, use text, else use html version" resp. "if available, use html, else use text version".

  • Preserve original html e-mail content as far as possible. PDF/A version is created using ghostscript preserving links included in the e-mail.

Prerequisites

All python modules used by this parser should already be included in your Paperless-ngx installation.

Ghostscript ist used for creating PDF/A version of archived file if requested, see

https://docs.paperless-ngx.com/configuration/#PAPERLESS_GS_BINARY

https://docs.paperless-ngx.com/configuration/#PAPERLESS_OCR_COLOR_CONVERSION_STRATEGY

Installation

  1. Install using PyPI

    pip install paperlessngx-mail-parser

    For docker based installations use custom container initialization as described here: https://docs.paperless-ngx.com/advanced_usage/#custom-container-initialization

    Place a script with the following content in the directory for your container initialization scripts and make it executable:

    #!/bin/bash
    pip install paperlessngx-mail-parser
    
  2. Add this parser to the PAPERLESS_APPS environment variable, e.g. in your paperless.conf: PAPERLESS_APPS="paperlessngx-mail-parser.apps.MailparserConfig"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

paperlessngx_mail_parser-2.0.0.tar.gz (19.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

paperlessngx_mail_parser-2.0.0-py3-none-any.whl (21.4 kB view details)

Uploaded Python 3

File details

Details for the file paperlessngx_mail_parser-2.0.0.tar.gz.

File metadata

  • Download URL: paperlessngx_mail_parser-2.0.0.tar.gz
  • Upload date:
  • Size: 19.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for paperlessngx_mail_parser-2.0.0.tar.gz
Algorithm Hash digest
SHA256 77e5969ee22fd0657577fce26e472e706d637f1c12d9bd60c27b281598ce5c69
MD5 176616fc382c362c8123983f2dcde577
BLAKE2b-256 347f07533decd1ce12524c0522e034336b2cc0476137c99adb5d26450255dff8

See more details on using hashes here.

File details

Details for the file paperlessngx_mail_parser-2.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for paperlessngx_mail_parser-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 397ad4639775e80764cb0b4e730244626d0855ba20a08ec2028f54a5bc819151
MD5 4b4df20fd549813bdc0b8d7b5d2e1d0d
BLAKE2b-256 8359c36abb942d5ac019bc30f977c09a3e0a222907a3b31f466c6c1922b58ac6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page