Skip to main content

Alternative e-mail parser for Paperless-ngx

Project description

Alternative e-mail parser for Paperless-ngx

This parser can be used as an alternative to the default e-mail parser provided by Paperless-ngx.

Features

  • If consumption scope isn't EVERYTHING (i.e. parse mail and attachments separately) include attachments in the archived document where possible. If an attachment can't be converted to pdf, include a corresponding note in the archived version. Signature attachments (mime type: application/x-pkcs7-signature) are always excluded.

  • Place a header in front of the pdf containing the text version of the e-mail as well as in front of the html-version.

  • Only include either the text or html version of the e-mail in the archived document. The PDF-Layout values for Paperless-ngx PdfLayout.TEXT_HTML and PdfLayout.HTML_TEXT are therefore interpreted as "if available, use text, else use html version" resp. "if available, use html, else use text version".

  • Preserve original html e-mail content as far as possible. PDF/A version is created using ghostscript preserving links included in the e-mail.

Prerequisites

All python modules used by this parser should already be included in your Paperless-ngx installation.

Ghostscript ist used for creating PDF/A version of archived file if requested, see

https://docs.paperless-ngx.com/configuration/#PAPERLESS_GS_BINARY

https://docs.paperless-ngx.com/configuration/#PAPERLESS_OCR_COLOR_CONVERSION_STRATEGY

Installation

  1. Install using PyPI

    pip install paperlessngx-mail-parser

    For docker based installations use custom container initialization as described here: https://docs.paperless-ngx.com/advanced_usage/#custom-container-initialization

    Place a script with the following content in the directory for your container initialization scripts and make it executable:

    #!/bin/bash
    pip install paperlessngx-mail-parser
    
  2. Add this parser to the PAPERLESS_APPS environment variable, e.g. in your paperless.conf: PAPERLESS_APPS="paperlessngx-mail-parser.apps.MailparserConfig"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

paperlessngx_mail_parser-2.0.1.tar.gz (19.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

paperlessngx_mail_parser-2.0.1-py3-none-any.whl (21.4 kB view details)

Uploaded Python 3

File details

Details for the file paperlessngx_mail_parser-2.0.1.tar.gz.

File metadata

  • Download URL: paperlessngx_mail_parser-2.0.1.tar.gz
  • Upload date:
  • Size: 19.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for paperlessngx_mail_parser-2.0.1.tar.gz
Algorithm Hash digest
SHA256 dd9884a2da142fcd1851ab6fde01572a520fc1c6dc566310f5efeb450f4714a8
MD5 ea343cb240609e9d59a6f50d52667784
BLAKE2b-256 016f60573748e67e1c72993073d96ef59f6a1131e06498191dd8774076ac054f

See more details on using hashes here.

File details

Details for the file paperlessngx_mail_parser-2.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for paperlessngx_mail_parser-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ad49de9928c4462a8ce0fdcca1c2952f32505143967ef91ec49aeefb994b2f30
MD5 6369d3e3c05f50368e38a7ff7636e38f
BLAKE2b-256 a0a58b7af6842e9d8067cc15b3283d5d3561334f5665a7c67f55b1ea7e5a2d7a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page