mbox-extractor

📬 Recursively extract all attachments from .mbox email archives with a single command

Features

  • Recursive scanning - Finds all .mbox files in any directory tree
  • Safe filenames - Sanitizes attachment names, removing illegal characters
  • No duplicates - Uses content-based hashing to prevent overwrites
  • Progress display - Visual progress bar for large mailboxes
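
As an illustration of the "safe filenames" feature, here is a minimal sketch of filename sanitization. The character set, the `sanitize_filename` name, and the `attachment` fallback are assumptions made for this example; the tool's own rules may differ.

```python
import re

# Characters rejected by common filesystems (an assumed set, not the tool's own list).
ILLEGAL_CHARS = re.compile(r'[<>:"/\\|?*\x00-\x1f]')


def sanitize_filename(name: str, fallback: str = "attachment") -> str:
    """Replace illegal characters with underscores and trim spaces and dots."""
    cleaned = ILLEGAL_CHARS.sub("_", name).strip(" .")
    # An empty result (for example, a name made only of dots) gets a placeholder.
    return cleaned or fallback


print(sanitize_filename("report: Q1/Q2?.pdf"))  # prints report_ Q1_Q2_.pdf
```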

Quick Start

uv tool install mbox-extractor
mbox-extractor /path/to/emails

Installation

Using uv (recommended)

uv tool install mbox-extractor

Using pip

pip install mbox-extractor

From source

git clone https://github.com/tsilva/mbox-extractor.git
cd mbox-extractor
uv tool install .

Usage

Extract all attachments from .mbox files under a directory:

mbox-extractor /path/to/search

Attachments from each .mbox file are saved to a folder next to it, named after the file without the .mbox extension:

Found mbox: /emails/archive.mbox -> extracting to /emails/archive
Extracting archive.mbox: 100%|████████████████████| 500/500 [00:10<00:00, 48.5it/s]
Extracted 42 attachments to '/emails/archive'.

How It Works

  1. Recursively scans the given path for .mbox files
  2. Opens each mailbox and iterates through all messages
  3. Extracts attachments with sanitized, unique filenames
  4. Saves them to a folder named after the source .mbox file
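
The four steps above could be sketched roughly as follows, assuming Python's standard-library `mailbox` module is used to read the archives. The function names (`find_mbox_files`, `iter_attachments`, `extract_all`) are hypothetical and not taken from the project's source; sanitizing and de-duplicating names is left out to keep the sketch short.

```python
import mailbox
from pathlib import Path
from typing import Iterator, List, Tuple


def find_mbox_files(root: Path) -> List[Path]:
    """Step 1: recursively collect every .mbox file under root."""
    return sorted(p for p in root.rglob("*.mbox") if p.is_file())


def iter_attachments(mbox_path: Path) -> Iterator[Tuple[str, bytes]]:
    """Steps 2-3: yield (filename, content) for each attachment in a mailbox."""
    box = mailbox.mbox(str(mbox_path))
    try:
        for message in box:
            for part in message.walk():
                filename = part.get_filename()
                if not filename:
                    continue  # body text or multipart container, not an attachment
                payload = part.get_payload(decode=True)
                if payload is not None:
                    yield filename, payload
    finally:
        box.close()


def extract_all(root: Path) -> int:
    """Step 4: write attachments to a folder named after each .mbox file."""
    count = 0
    for mbox_path in find_mbox_files(root):
        out_dir = mbox_path.with_suffix("")  # /emails/archive.mbox -> /emails/archive
        out_dir.mkdir(parents=True, exist_ok=True)
        for filename, payload in iter_attachments(mbox_path):
            # Keep only the final path component; "attachment" is an assumed fallback.
            name = Path(filename).name or "attachment"
            (out_dir / name).write_bytes(payload)
            count += 1
    return count
```

Calling `extract_all(Path("/path/to/search"))` would return the number of attachments written under these assumptions.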

Filenames are made unique by appending an 8-character hash derived from the MD5 digest of the file content, which prevents overwrites when multiple attachments share the same name.
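
A minimal sketch of this naming scheme, assuming the hash is the first 8 hexadecimal characters of the MD5 digest and is placed before the extension with an underscore. The placement, the separator, and the `unique_filename` name are assumptions for illustration, not confirmed details of the tool.

```python
import hashlib
from pathlib import Path


def unique_filename(name: str, content: bytes) -> str:
    """Insert 8 hex characters of the content's MD5 digest before the extension."""
    digest = hashlib.md5(content).hexdigest()[:8]
    path = Path(name)
    return f"{path.stem}_{digest}{path.suffix}"


# Two attachments with the same name but different content get different names.
first = unique_filename("invoice.pdf", b"first version")
second = unique_filename("invoice.pdf", b"second version")
print(first != second)  # prints True
```

Because the suffix depends only on the content, saving the same attachment twice under this scheme produces the same filename rather than a second copy.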

Requirements

  • Python 3.7+
  • tqdm (installed automatically)

License

MIT
