Skip to main content

📬 Recursively extract all attachments from .mbox email archives with a single command

Project description

mbox-extractor

mbox-extractor

License Python

📬 Recursively extract all attachments from .mbox email archives with a single command

Features

  • Recursive scanning - Finds all .mbox files in any directory tree
  • Safe filenames - Sanitizes attachment names, removing illegal characters
  • No duplicates - Uses content-based hashing to prevent overwrites
  • Progress display - Visual progress bar for large mailboxes

Quick Start

uv tool install mbox-extractor
mbox-extractor /path/to/emails

Installation

Using uv (recommended)

uv tool install mbox-extractor

Using pip

pip install mbox-extractor

From source

git clone https://github.com/tsilva/mbox-extractor.git
cd mbox-extractor
uv tool install .
pre-commit install

Usage

Extract all attachments from .mbox files under a directory:

mbox-extractor /path/to/search

Attachments from each .mbox file are saved to a folder with the same name:

Found mbox: /emails/archive.mbox -> extracting to /emails/archive
Extracting archive.mbox: 100%|████████████████████| 500/500 [00:10<00:00, 48.5it/s]
Extracted 42 attachments to '/emails/archive'.

How It Works

  1. Recursively scans the given path for .mbox files
  2. Opens each mailbox and iterates through all messages
  3. Extracts attachments with sanitized, unique filenames
  4. Saves them to a folder named after the source .mbox file

Filenames are made unique by appending an 8-character MD5 hash of the file content, preventing overwrites when multiple attachments share the same name.

Requirements

  • Python 3.7+
  • tqdm (installed automatically)

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mbox_extractor-0.1.19.tar.gz (168.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mbox_extractor-0.1.19-py3-none-any.whl (4.3 kB view details)

Uploaded Python 3

File details

Details for the file mbox_extractor-0.1.19.tar.gz.

File metadata

  • Download URL: mbox_extractor-0.1.19.tar.gz
  • Upload date:
  • Size: 168.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mbox_extractor-0.1.19.tar.gz
Algorithm Hash digest
SHA256 779b845aaad2e9ce2ad8593ee2e0c9981394f69ec250f3bdf08f088f290f9c46
MD5 3377e033f0ed7bb50c5a781884f1354b
BLAKE2b-256 ab225fa42051cb1801dcb3fbf0e39e7450bd9afcfdbc9554067a74f5d6c1c6ab

See more details on using hashes here.

Provenance

The following attestation bundles were made for mbox_extractor-0.1.19.tar.gz:

Publisher: release.yml on tsilva/mbox-extractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mbox_extractor-0.1.19-py3-none-any.whl.

File metadata

File hashes

Hashes for mbox_extractor-0.1.19-py3-none-any.whl
Algorithm Hash digest
SHA256 de05a87ec0304f224416c6669277c753c5170323830a885cba811ee42ec28aa0
MD5 76842cbc642e0dc4004b8f9c57d86374
BLAKE2b-256 9b6392d90b5c1b091aa03ce57a3e3318b58cc08c2a8fde182b465b891a435788

See more details on using hashes here.

Provenance

The following attestation bundles were made for mbox_extractor-0.1.19-py3-none-any.whl:

Publisher: release.yml on tsilva/mbox-extractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page