📬 Recursively extract all attachments from .mbox email archives with a single command

mbox-extractor

Features

  • Recursive scanning - Finds all .mbox files in any directory tree
  • Safe filenames - Sanitizes attachment names, removing illegal characters
  • No duplicates - Uses content-based hashing to prevent overwrites
  • Progress display - Visual progress bar for large mailboxes
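As an illustration of the "Safe filenames" feature, here is a minimal sketch of filename sanitization. It is not taken from the project's source: the function name `sanitize_filename`, the set of characters treated as illegal, and the `fallback` parameter are all assumptions made for this example.

```python
import re

# Assumed set of characters to replace: those rejected by common
# filesystems (Windows in particular), plus ASCII control characters.
_ILLEGAL_CHARS = re.compile(r'[<>:"/\\|?*\x00-\x1f]')


def sanitize_filename(name: str, fallback: str = "attachment") -> str:
    """Return a filesystem-friendly version of an attachment name.

    Hypothetical helper: illegal characters become underscores, and
    leading/trailing spaces and dots are trimmed. If nothing usable
    remains, the fallback name is returned instead.
    """
    cleaned = _ILLEGAL_CHARS.sub("_", name).strip(" .")
    return cleaned or fallback


print(sanitize_filename("re:port/2024?.pdf"))  # re_port_2024_.pdf
```

Replacing characters rather than dropping them keeps the result recognizable, and the fallback avoids writing a file with an empty name.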

Quick Start

uv tool install mbox-extractor
mbox-extractor /path/to/emails

Installation

Using uv (recommended)

uv tool install mbox-extractor

Using pip

pip install mbox-extractor

From source

git clone https://github.com/tsilva/mbox-extractor.git
cd mbox-extractor
uv tool install .
pre-commit install

Usage

Extract all attachments from .mbox files under a directory:

mbox-extractor /path/to/search

Attachments from each .mbox file are saved to a folder next to it, named after the file without the .mbox extension:

Found mbox: /emails/archive.mbox -> extracting to /emails/archive
Extracting archive.mbox: 100%|████████████████████| 500/500 [00:10<00:00, 48.5it/s]
Extracted 42 attachments to '/emails/archive'.

How It Works

  1. Recursively scans the given path for .mbox files
  2. Opens each mailbox and iterates through all messages
  3. Extracts attachments with sanitized, unique filenames
  4. Saves them to a folder named after the source .mbox file
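The steps above can be sketched with Python's standard `mailbox` module. This is a simplified illustration under stated assumptions, not the project's actual code: the function names `find_mbox_files` and `extract_attachments` are hypothetical, and the sketch leaves out filename sanitization, hash suffixes, and the progress bar.

```python
import mailbox
from pathlib import Path


def find_mbox_files(root):
    """Step 1: recursively collect every *.mbox file under root."""
    return sorted(p for p in Path(root).rglob("*.mbox") if p.is_file())


def extract_attachments(mbox_path):
    """Steps 2-4: write each attachment to a folder named after the mbox.

    Returns the number of attachments written. Hypothetical helper that
    keeps only the base name of each attachment and does not guard
    against name collisions.
    """
    out_dir = mbox_path.with_suffix("")  # /emails/archive.mbox -> /emails/archive
    out_dir.mkdir(parents=True, exist_ok=True)
    count = 0
    box = mailbox.mbox(str(mbox_path))
    try:
        for message in box:
            # walk() visits every MIME part, including nested ones.
            for part in message.walk():
                filename = part.get_filename()
                if not filename:
                    continue  # body text or multipart container
                payload = part.get_payload(decode=True)
                if payload is None:
                    continue
                # Path(...).name drops any directory components.
                (out_dir / Path(filename).name).write_bytes(payload)
                count += 1
    finally:
        box.close()
    return count
```

A caller would loop over `find_mbox_files("/path/to/search")` and pass each result to `extract_attachments`.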

Filenames are made unique by appending an 8-character hash derived from the MD5 digest of the file content, which prevents overwrites when multiple attachments share the same name.
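A minimal sketch of this naming scheme follows. The exact format is an assumption: the helper name `unique_name`, the underscore separator, the placement of the hash before the extension, and the use of the first 8 hex characters of the digest are illustrative choices, not confirmed details of the tool.

```python
import hashlib
from pathlib import Path


def unique_name(filename: str, content: bytes) -> str:
    """Append a short content hash to a filename, before its extension.

    Hypothetical helper: uses the first 8 hex characters of the MD5
    digest, so different content under the same name yields different
    output names, while identical content maps to the same name.
    """
    digest = hashlib.md5(content).hexdigest()[:8]
    path = Path(filename)
    return f"{path.stem}_{digest}{path.suffix}"


print(unique_name("report.pdf", b"hello"))  # report_5d41402a.pdf
```

MD5 serves here only as a content fingerprint to tell files apart, not as a security measure.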

Requirements

  • Python 3.7+
  • tqdm (installed automatically)

License

MIT

Download files

Source Distribution

mbox_extractor-0.1.20.tar.gz (168.1 kB)

Uploaded Source

Built Distribution

mbox_extractor-0.1.20-py3-none-any.whl (4.3 kB)

Uploaded Python 3

File details

Details for the file mbox_extractor-0.1.20.tar.gz.

File metadata

  • Download URL: mbox_extractor-0.1.20.tar.gz
  • Upload date:
  • Size: 168.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mbox_extractor-0.1.20.tar.gz
Algorithm Hash digest
SHA256 4020460236a8aa6c24a209c2208eb61e7219b0ce742fa3118bac45227c56bfab
MD5 5762c19ff1e8fe58f4344e17edd2f31f
BLAKE2b-256 472c703d175f7a1ac7a46623c3910383ca13a3078d64fd6e73fbdbfd3c51e1d6

Provenance

The following attestation bundles were made for mbox_extractor-0.1.20.tar.gz:

Publisher: release.yml on tsilva/mbox-extractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mbox_extractor-0.1.20-py3-none-any.whl.

File hashes

Hashes for mbox_extractor-0.1.20-py3-none-any.whl
Algorithm Hash digest
SHA256 1cecd3d07f0c9aa32d89f3bbbdb83a0c6f30a9c9cf57be0c922986420be07f53
MD5 023fe109ffad1fd6c130a6df7da52207
BLAKE2b-256 d49d45d8f20122d78b9a8141bfbbcd866862994900f4380637248504cf96f290

Provenance

The following attestation bundles were made for mbox_extractor-0.1.20-py3-none-any.whl:

Publisher: release.yml on tsilva/mbox-extractor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
