Comic book archive multi-format metadata read/write/transform tool and image extractor.

Comicbox

Comicbox is a Python library and command line tool that reads, writes, and synthesizes comic book archive metadata. It understands every popular comic metadata standard, merges them into one consistent data model, converts between them, tags comics from online databases, and extracts pages and covers.

It is the metadata engine behind the Codex comic reader, but works just as well as a standalone command line tool for organizing a comic library.

✨ What Comicbox Does

Reads many archive types — CBZ, CBR, CBT, CB7, and (optionally) PDF.
Reads and writes every popular metadata standard — ComicInfo.xml, MetronInfo.xml, ComicBookInfo, CoMet, PDF metadata, and its own YAML/JSON.
Merges every source into one model — combines metadata from each embedded format and the filename into a single normalized view, then writes it back out to whichever formats you choose.
Tags comics online — looks up and matches comics against Metron and ComicVine, then writes the result.
Converts archives — repacks CBR/CBT/CB7 (and comic PDFs) to CBZ, and translates metadata between formats.
Extracts images — pulls cover art or arbitrary page ranges out of any supported archive.
Is scriptable and embeddable — a rich CLI, a Python API, and published JSON Schemas for every format.

📚 Archive Formats

Format	Read	Write
CBZ (zip)	✅	✅
CBR (rar)	✅	converts to CBZ
CBT (tar)	✅	converts to CBZ
CB7 (7z)	✅	converts to CBZ
PDF	✅	✅ embedded metadata

CBR extraction and conversion require the unrar binary on your PATH. PDF support is an optional extra.
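
As a rough illustration of how the container types in the table can be told apart, here is a minimal sketch using only the Python standard library. It is not comicbox's own detection code. The `sniff_archive_type` helper and the returned labels are hypothetical names chosen for this example. The signatures are the well-known leading bytes for RAR, 7z, and PDF files.

```python
import io
import tarfile
import zipfile
from typing import Optional

# Leading "magic" bytes for container formats with a fixed signature.
SIGNATURES = (
    (b"Rar!\x1a\x07", "CBR"),
    (b"7z\xbc\xaf\x27\x1c", "CB7"),
    (b"%PDF-", "PDF"),
)


def sniff_archive_type(data: bytes) -> Optional[str]:
    """Guess the comic container type from raw file bytes (hypothetical helper)."""
    for magic, name in SIGNATURES:
        if data.startswith(magic):
            return name
    buf = io.BytesIO(data)
    # Zip files are identified by their end-of-central-directory record.
    if zipfile.is_zipfile(buf):
        return "CBZ"
    buf.seek(0)
    try:
        # tarfile raises ReadError (a TarError) when the data is not a tar archive.
        with tarfile.open(fileobj=buf):
            return "CBT"
    except tarfile.TarError:
        return None
```

Sniffing content rather than trusting the file extension helps with mislabelled files, such as a zip archive saved with a `.cbr` extension.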

🏷️ Metadata Formats

Comicbox reads and writes all of the following, normalizing each into a common schema:

Format	Read	Write	Notes
ComicInfo.xml (ComicRack)	✅	✅	v2.1 (draft) schema
MetronInfo.xml	✅	✅	v1.0 schema
ComicBookInfo (Comic Book Lover)	✅	✅	archive comment JSON
CoMet	✅	✅
PDF metadata	✅	✅	can embed ComicInfo.xml / MetronInfo.xml
Comicbox YAML / JSON	✅	✅	native, lossless
Filename	✅	—	parses metadata out of the file name

A full cross-format tag translation table is available.
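
To make the "archive comment JSON" note for ComicBookInfo concrete, the sketch below reads that payload from a zip comment with the standard library alone. It assumes the conventional ComicBookInfo layout, where the tags sit under a `ComicBookInfo/1.0` key in the comment JSON. The `read_comicbookinfo` helper is a hypothetical name, not part of comicbox.

```python
import io
import json
import zipfile

CBI_KEY = "ComicBookInfo/1.0"  # assumed top-level key of the comment JSON


def read_comicbookinfo(fileobj) -> dict:
    """Return the ComicBookInfo payload from a zip comment, or {} if absent."""
    with zipfile.ZipFile(fileobj) as zf:
        comment = zf.comment  # bytes; empty when no comment is set
    if not comment:
        return {}
    try:
        doc = json.loads(comment.decode("utf-8"))
    except (UnicodeDecodeError, json.JSONDecodeError):
        return {}  # the comment exists but is not ComicBookInfo JSON
    return doc.get(CBI_KEY, {}) if isinstance(doc, dict) else {}


# Usage: build a tiny in-memory CBZ with a ComicBookInfo comment and read it back.
demo = io.BytesIO()
with zipfile.ZipFile(demo, "w") as zf:
    zf.writestr("001.jpg", b"fake image bytes")
    zf.comment = json.dumps({CBI_KEY: {"series": "Watchmen", "issue": 1}}).encode("utf-8")
print(read_comicbookinfo(demo))
```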

🔀 One Unified Metadata Model

Different formats spell the same idea in different ways. Comicbox reconciles them so you never have to:

Identifiers — IDs, GTINs, and URLs from every format are aggregated into a single identifiers structure, and written back out as URNs in the Notes field.
Reprints — Alternate Names, Aliases, and "is version of" relationships collapse into one reprints list.
Notes mining — the heavily-abused Notes field is parsed for embedded data (tagger, timestamps, and identifiers) that formats don't otherwise carry.
Liberal value parsing — fuzzy, caseless values for enum-like fields (Age Rating, Format, credit roles) are accepted, tidied to Title Case, and converted to each output format's own enum on write.
Filename parsing — series, issue, year, and more are extracted from a wide variety of naming conventions via comicfn2dict.
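
The liberal value parsing described above can be pictured with a small standard-library sketch. It is an assumption about the general technique, not comicbox's implementation. The canonical values and the `format_a` / `format_b` output tables are illustrative placeholders, and `normalize` and `to_output` are hypothetical names.

```python
import re

# Illustrative entries only; these are NOT comicbox's real enum tables.
CANONICAL_AGE_RATINGS = ("Everyone", "Teen", "Mature 17+", "Adults Only 18+")

# Hypothetical per-format spellings applied on write.
OUTPUT_ENUMS = {
    "format_a": {"Mature 17+": "Mature 17+"},
    "format_b": {"Mature 17+": "Mature"},
}


def _key(value: str) -> str:
    """Reduce a value to lowercase alphanumerics for fuzzy comparison."""
    return re.sub(r"[^a-z0-9]", "", value.lower())


_LOOKUP = {_key(v): v for v in CANONICAL_AGE_RATINGS}


def normalize(value: str) -> str:
    """Map a fuzzy, caseless input to its canonical spelling, else Title Case it."""
    return _LOOKUP.get(_key(value), value.strip().title())


def to_output(value: str, fmt: str) -> str:
    """Normalize first, then translate to the output format's own spelling."""
    canonical = normalize(value)
    return OUTPUT_ENUMS.get(fmt, {}).get(canonical, canonical)


print(normalize("  mature-17+ "))           # matches the canonical entry
print(to_output("MATURE 17+", "format_b"))  # translated for a hypothetical format
```

Keeping one canonical spelling in the middle means each output format needs only a single translation table, rather than one per input format.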

🌐 Online Tagging

Comicbox can identify a comic and tag it from an online database. Metron and ComicVine are supported. It searches by the series, issue, and year it knows about, ranks candidates, breaks close calls with cover-image matching, and writes the best result.

# Interactive: prompts only when the match isn't clear.
comicbox --online metron "GI Joe #007 (1952).cbz"

# Tag by an exact database id (skips searching).
comicbox --id metron:42 "comic.cbz"

# Unattended batch run: never prompts, 4 files at a time.
comicbox --online all --recurse --prompts never -j 4 ./comics/

--match controls how confidently comicbox writes without asking (ask · careful · auto · eager). --effort (minimal · balanced · thorough) trades matching accuracy for fewer API calls on fan-out sources like ComicVine; Metron doesn't fan out, so it ignores --effort and always searches at full strength. Credentials come from --auth, COMICBOX_* environment variables, the config file, or your system keyring. See comicbox -h for the full set of online, caching, and tuning options.
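
The rank-then-decide flow can be sketched as follows. This is a toy model under stated assumptions, not comicbox's matcher. The `Candidate` type, the score weights, and the `accept` and `margin` thresholds are all hypothetical. Cover-image matching is left out, so a close call simply returns `None` to signal that something else (a person, or an image comparison) must break the tie.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Candidate:
    series: str
    issue: str
    year: int


def score(query: Candidate, found: Candidate) -> float:
    """Blend series-name similarity with exact issue and year agreement."""
    name = SequenceMatcher(None, query.series.lower(), found.series.lower()).ratio()
    # Compare issue numbers without leading zeros, so "007" equals "7".
    issue = 1.0 if query.issue.lstrip("0") == found.issue.lstrip("0") else 0.0
    year = 1.0 if query.year == found.year else 0.0
    return 0.6 * name + 0.25 * issue + 0.15 * year  # hypothetical weights


def pick(query, candidates, accept=0.9, margin=0.1):
    """Return the best candidate, or None when the match is not clear-cut."""
    ranked = sorted(candidates, key=lambda c: score(query, c), reverse=True)
    if not ranked:
        return None
    best = score(query, ranked[0])
    runner_up = score(query, ranked[1]) if len(ranked) > 1 else 0.0
    if best >= accept and best - runner_up >= margin:
        return ranked[0]
    return None
```

In this model, a stricter mode would correspond to raising `accept` and `margin`, and a more eager one to lowering them.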

🖼️ Pages, Covers & Conversion

# Extract the cover image.
comicbox --extract-covers --dest-path ./out "comic.cbz"

# Extract a range of pages (zero-based) by index.
comicbox --extract-pages 0:5 --dest-path ./out "comic.cbz"

# Convert a CBR to a CBZ, carrying metadata across.
comicbox --cbz "comic.cbr"

# Convert a single-image-per-page comic PDF to CBZ without re-encoding.
comicbox --cbz --pdf-pages image "comic.pdf"

# Rename a file to comicbox's canonical filename format.
comicbox --rename "comic.cbz"
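
For readers curious what page extraction from a CBZ involves, here is a minimal standard-library sketch. It is not how comicbox is implemented. The `page_names` and `extract_pages` helpers and the suffix list are hypothetical. The sketch uses Python slice semantics (the end index is exclusive), which may differ from how comicbox interprets a range like 0:5.

```python
import io
import zipfile
from pathlib import PurePosixPath

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".gif", ".webp"}


def page_names(zf: zipfile.ZipFile) -> list:
    """List image entries in sorted name order, the usual page order."""
    return sorted(
        name
        for name in zf.namelist()
        if PurePosixPath(name).suffix.lower() in IMAGE_SUFFIXES
    )


def extract_pages(fileobj, start: int, stop: int) -> dict:
    """Return {entry name: image bytes} for zero-based pages start..stop-1."""
    with zipfile.ZipFile(fileobj) as zf:
        return {name: zf.read(name) for name in page_names(zf)[start:stop]}


# Usage: an in-memory CBZ with three pages and a metadata file.
demo = io.BytesIO()
with zipfile.ZipFile(demo, "w") as zf:
    for name in ("003.jpg", "001.jpg", "002.png", "ComicInfo.xml"):
        zf.writestr(name, name.encode())
print(list(extract_pages(demo, 0, 2)))
```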

📦 Installation

pip install comicbox

For PDF support, install the pdf extra:

pip install comicbox[pdf]

Dependencies

Comicbox needs no binary dependencies for CBZ, CBT, and CB7. Reading or converting CBR archives requires the unrar binary on your PATH.

The optional PDF extra pulls in pymupdf, which ships wheels with a bundled libmupdf for most platforms. Some platforms (e.g. Linux on ARM) may need libstdc++ plus C/C++ build tools to compile it.

Installing on ARM (AARCH64)

pymupdf has no pre-built AARCH64 wheels, so pip must build it. On some Python versions the build fails unless this environment variable is set:

PYMUPDF_SETUP_PY_LIMITED_API=0 pip install comicbox[pdf]

You will also need the build-essential and python3-dev (or equivalent) packages.

⌨️ Command Line

Comicbox ships a thorough, self-documenting CLI. Run:

comicbox -h

for the complete reference, including every metadata format key, the --print phases, and the online tagging tables. A few representative commands:

# Print the merged metadata comicbox reads from a comic.
comicbox -p "comic.cbz"

# Set a field and write it as ComicInfo.xml inside the archive.
comicbox -m "{publisher: SmallComics}" -w cix "comic.cbz"

# Recursively set a field across an entire library.
comicbox --recurse -m "{publisher: 'SC Comics'}" -w cix ./comics/

# Export and re-import metadata as a file.
comicbox --export cix "comic.cbz"
comicbox --import ComicInfo.xml -w cix "comic.cbz"

-m/--metadata accepts a compact "linear YAML" using tag names from any of the supported formats. Put a space after each colon so it parses as YAML, and quote values containing YAML special characters (:[]{},). See comicbox -h for many more -m examples, and "escaping YAML" for the escaping details.

💡 Preview before writing. Add -p to print exactly what would be written, or -n/--dry-run to perform an action without touching the filesystem.

Editing or Deleting Metadata

The cleanest way to edit or remove existing tags is to round-trip through a file:

# 1. Export the current metadata to an editable file.
comicbox --export cix "My Overtagged Comic.cbz"

# 2. Edit it.
nvim ComicInfo.xml

# 3. Preview the re-import.
comicbox --import ComicInfo.xml -p "My Overtagged Comic.cbz"

# 4. Wipe the old tags, then write the edited file back (careful!).
comicbox --delete-all-tags "My Overtagged Comic.cbz"
comicbox --import ComicInfo.xml -w cix "My Overtagged Comic.cbz"

You can also drop individual keys with -D/--delete-keys using dotted glom paths, e.g. -D series,reprints.0.series.
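
To show how a dotted path such as reprints.0.series addresses a nested value, the sketch below deletes keys from plain dicts and lists without any third-party library. It is a simplified stand-in, not glom and not comicbox's code. `delete_path` and `delete_keys` are hypothetical helpers, and the sample metadata shape is invented for the example.

```python
def delete_path(data, dotted: str) -> bool:
    """Delete the value at a dotted path; return False if the path is absent."""
    *parents, last = dotted.split(".")
    node = data
    try:
        for part in parents:
            # Numeric segments index into lists; other segments are dict keys.
            node = node[int(part)] if isinstance(node, list) else node[part]
        if isinstance(node, list):
            del node[int(last)]
        else:
            del node[last]
    except (KeyError, IndexError, ValueError, TypeError):
        return False
    return True


def delete_keys(data, spec: str) -> None:
    """Apply a comma-separated list of dotted paths, e.g. "a,b.0.c"."""
    for dotted in spec.split(","):
        delete_path(data, dotted.strip())


metadata = {
    "series": {"name": "Example"},
    "reprints": [{"series": "Other", "issue": "1"}],
    "publisher": "SmallComics",
}
delete_keys(metadata, "series,reprints.0.series")
print(metadata)
```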

🛠 API

Comicbox is primarily a library. The Comicbox class in comicbox.box is the main read interface, and comicbox.write exposes a documented write API. Auto-generated API docs are published with the HTML docs.

from comicbox.box import Comicbox

with Comicbox("comic.cbz") as cb:
    metadata = cb.to_dict()              # merged, normalized metadata
    file_type = cb.get_file_type()       # "CBZ", "PDF", ...
    mtime = cb.get_metadata_mtime()      # last metadata modification time
    cover = cb.get_cover_page()          # cover image bytes

Writing is done through the public write_metadata (single file) and bulk_write (batched) helpers:

from comicbox.write import write_metadata

result = write_metadata(
    "comic.cbz",
    # The patch is the contents under the "comicbox" root tag. The
    # root-wrapped dict Comicbox.to_dict() returns is also accepted.
    {"publisher": {"name": "SmallComics"}, "genres": ["Science Fiction"]},
    formats=["COMIC_INFO"],  # MetadataFormats names; e.g. COMIC_INFO, METRON_INFO
)
print(result.written)

Every operational error these APIs raise derives from comicbox.exceptions.ComicboxError — ArchiveError, ArchiveWriteError, MetadataError, ExportError, WriteValidationError, OnlineConfigurationError, OnlineLookupAbortedError, and UnsupportedArchiveTypeError — so consumers can except ComicboxError without swallowing unrelated programming errors.

⚙️ Configuration

Comicbox is configured by command line arguments, an optional config file, and environment variables (in that order of precedence).

Defaults live in config_default.yaml, which also documents the nested config groups (general, read, write, convert, compute, and online).
Config file — point at one with -c PATH, or place it at ~/.config/comicbox/config.yaml.
Environment variables are prefixed with COMICBOX_.
Log level is set with the LOGLEVEL environment variable:

LOGLEVEL=ERROR comicbox -p "comic.cbz"
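
The precedence order stated above (command line, then config file, then environment) can be modelled with a `ChainMap`. This is a flat, simplified sketch: comicbox's real configuration is nested into groups. The setting names and the mapping from COMICBOX_ variables to settings are hypothetical, and `env_overrides` and `resolve` are names invented for this example.

```python
from collections import ChainMap


def env_overrides(environ, prefix="COMICBOX_"):
    """Collect prefixed environment variables as lowercase setting names."""
    return {
        key[len(prefix):].lower(): value
        for key, value in environ.items()
        if key.startswith(prefix)
    }


def resolve(cli_args, config_file, environ):
    """Layer sources so earlier maps win: CLI, then config file, then environment."""
    # Options the user did not pass are dropped so lower layers can supply them.
    cli = {k: v for k, v in cli_args.items() if v is not None}
    return ChainMap(cli, config_file, env_overrides(environ))


settings = resolve(
    {"recurse": True, "dest_path": None},        # hypothetical CLI options
    {"dest_path": "./out"},                      # hypothetical config file values
    {"COMICBOX_DEST_PATH": "/tmp", "HOME": "/root"},
)
print(settings["recurse"], settings["dest_path"])
```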

📦 Related Packages

Installing comicbox also installs two small sibling libraries, each usable on its own:

comicfn2dict — parses metadata out of comic filenames into Python dicts (also used by ComicTagger).
pdffile — presents a ZipFile-like interface for PDF files (installed with the [pdf] extra).

📜 Documentation

🛠 Development

Comicbox is hosted on GitHub. Most development tasks are driven by the Makefile — run make to see what's available.

The DEBUG_TRANSFORM environment variable prints verbose schema-transform information, useful when debugging format conversions.

📄 License

Comicbox is licensed under the LGPL-3.0-only license.

Release history

This version: 4.1.0 (Jul 4, 2026)
