Skip to main content

Recursively extract attachments from .mbox files

Project description

mbox-extractor

mbox-extractor

License Python

📬 Recursively extract all attachments from .mbox email archives with a single command

Features

  • Recursive scanning - Finds all .mbox files in any directory tree
  • Safe filenames - Sanitizes attachment names, removing illegal characters
  • No duplicates - Uses content-based hashing to prevent overwrites
  • Progress display - Visual progress bar for large mailboxes

Quick Start

uv tool install mbox-extractor
mbox-extractor /path/to/emails

Installation

Using uv (recommended)

uv tool install mbox-extractor

Using pip

pip install mbox-extractor

From source

git clone https://github.com/tsilva/mbox-extractor.git
cd mbox-extractor
uv tool install .

Usage

Extract all attachments from .mbox files under a directory:

mbox-extractor /path/to/search

Attachments from each .mbox file are saved to a folder with the same name:

Found mbox: /emails/archive.mbox -> extracting to /emails/archive
Extracting archive.mbox: 100%|████████████████████| 500/500 [00:10<00:00, 48.5it/s]
Extracted 42 attachments to '/emails/archive'.

How It Works

  1. Recursively scans the given path for .mbox files
  2. Opens each mailbox and iterates through all messages
  3. Extracts attachments with sanitized, unique filenames
  4. Saves them to a folder named after the source .mbox file

Filenames are made unique by appending an 8-character MD5 hash of the file content, preventing overwrites when multiple attachments share the same name.

Requirements

  • Python 3.7+
  • tqdm (installed automatically)

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mbox_extractor-0.1.4.tar.gz (168.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mbox_extractor-0.1.4-py3-none-any.whl (4.2 kB view details)

Uploaded Python 3

File details

Details for the file mbox_extractor-0.1.4.tar.gz.

File metadata

  • Download URL: mbox_extractor-0.1.4.tar.gz
  • Upload date:
  • Size: 168.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mbox_extractor-0.1.4.tar.gz
Algorithm Hash digest
SHA256 1c8fae5d98de5eb027a75337c88654aa10f2faa7458d163b9fa417414074bc9b
MD5 abc1525184ac1a9f222fd4d01b7aa046
BLAKE2b-256 fd69e86ecdf082633c050d6557a727590075fe0ed849222097847b6662150ee8

See more details on using hashes here.

Provenance

The following attestation bundles were made for mbox_extractor-0.1.4.tar.gz:

Publisher: release.yml on tsilva/mbox-extractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mbox_extractor-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: mbox_extractor-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 4.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mbox_extractor-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 2cffae48267712bf0e4c5202bb78bcf65fd84df78b4cfdd64c2e3de215ff65e7
MD5 3a498d59ea0b21f598babe48611e5682
BLAKE2b-256 b8f1cd6236c5557e428ac1c92dd9c56a7f0a53066272b5fe1d6da43c23e53313

See more details on using hashes here.

Provenance

The following attestation bundles were made for mbox_extractor-0.1.4-py3-none-any.whl:

Publisher: release.yml on tsilva/mbox-extractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page