An API for reading comic archive contents and metadata: CBZ, CBR, CBT and PDF

These details have not been verified by PyPI

Project links

Project description

Comicbox

A comic book archive metadata reader and writer.

✨ Features

📚Comic Formats

Comicbox reads CBZ, CBR, CBT, and optionally PDF. Comicbox archives and writes CBZ archives and PDF metadata.

🏷️ Metadata Formats

Comicbox reads and writes:

ComicRack ComicInfo.xml v2.1 (draft) schema,
Comic Book Lover ComicBookInfo schema
CoMet schema.
PDF Metadata.
- Embedding ComicInfo.xml inside PDFS.
A variety of filename schemes that encode metadata.

Usefulness

Comicbox's primary purpose is a library for use by Codex comic reader. The API isn't well documented, but you can infer what it does pretty easily here: comicbox.comic_archive as the primary interface.

The command line is increasingly useful and can read and write metadata recursively and extract pages.

Limitations and Alternatives

Comicbox does not use popular metadata database APIs or have a GUI!

Comictagger is a popular alternative. It does most of what Comicbox does but also automatically tags comics with the ComicVine API and has a desktop UI.

📦 Installation

pip install comicbox

Comicbox supports PDFs as an extra when installed like:

pip install comicbox[pdf]

Dependencies

Comicbox generally works without any binary dependencies but requires unrar be on the path to convert CBR into CBZ or extract files from CBRs.

⌨️ Usage

Console

Type

comicbox -h

see the CLI help.

Examples

comicbox test.cbz -m "{Tags: a,b,c, story_arcs: {d:1,e:'',f:3}" -m "Publisher: SmallComics" -w cr

Will write those tags to comicinfo.xml in the archive.

Be sure to add spaces after colons so they are detected as valid YAML key value pairs. This is easy to forget.

But it's probably better to use the --print action to see what it's going to do before you actually write to the archive:

comicbox test.cbz -m "{Tags: a,b,c, story_arcs: {d:1,e:'',f:3}" -m "Publisher: SmallComics" -p

A recursive example:

comicbox --recurse -m "publisher: 'SC Comics'" -w cr ./SmallComicsComics/

Will recursively change the publisher to "SC Comics" for every comic found in under the SmallComicsComics directory.

Escaping YAML

the -m command line argument accepts the YAML language for tags. Certain characters like \,:;_()$%^@ are part of the YAML language. To successful include them as data in your tags, look up "Escaping YAML" documentation online

Deleting Metadata

To delete metadata from the cli you're best off exporting the current metadata, editing the file and then re-importing it with the delete previous metadata option:

# export the current metadata
comicbox --export cix "My Overtagged Comic.cbz"
# Adjust the metadata in an editor.
nvim comicinfo.xml
# Check that importing the metadata will look how you like
comicbox --import comicinfo.xml -p "My Overtagged Comic.cbz"
# Delete all previous metadata from the comic (careful!)
comicbox --delete "My Overtagged Comic.cbz"
# Import the metadata into the file and write it.
comicbox --import comicinfo.xml --write cix "My Overtagged Comic.cbz"

Quirks

The comicbox.yaml format represents the ComicInfo.xml Web tag as an identifiers.url tag. Fear not, you don't have to remember this. The CLI accepts heterogeneous tag types with the -m option, so you can type:

comicbox -p -m "Web: https://foo.com" mycomic.cbz

and the identifier tag should appear in comicbox.yaml as:

identifiers:
  nss: foo.com
  url: https://foo.com

Packages

Comicbox actually installs three different packages:

comicbox The main API and CLI script.
comicfn2dict A separate library for parsing comic filenames into dicts it also includes a CLI script.
pdffile A utility library for reading and writing PDF files with an API like Python's ZipFile

⚙️ Config

comicbox accepts command line arguments but also an optional config file and environment variables.

The variables have defaults specified in a default yaml

The environment variables are the variable name prefixed with COMICBOX_. (e.g. COMICBOX_COMICINFOXML=0)

Log Level

change logging level:

LOGLEVEL=ERROR comicbox -p <path>

🛠 Development

You may access most development tasks from the makefile. Run make to see documentation.

🤔 Motivation

I didn't like Comictagger's API, so I built this for myself as an educational exercise and to use as a library for Codex comic reader.

📋 Schemas

Comicbox supports reading and writing several comic book metadata schemas.

Filename Schema

Comicbox includes a pretty good comic archive filename parser. It can extract a number of common fields from comic archive filenames.

Location	Name
Archive	The archive filename
Import/Export	comicbox-filename.txt

PDF Schema

The pdf metadata standard. Can be exported as an xml file or written directly to the pdf itself.

Adobe PDF Namespace Adobe PDF Standard § 14.3.3 Document Information Dictionary

PDF metadata is only read or written from and to PDF files.

Location	Name
Archive	PDF internal
Import/Export	pdf-metadata.xml

Reading Embedded Metadata from `keywords`

Comicbox will read most any metadata standard it supports from the keywords field. If that fails it will consider the keywords field as a comma delimited "Tags" field.

Writing ComicInfo.xml to `keywords`

By default Comicbox will write ComicInfo XML to the keywords field (e.g. -w pdf)

Codex supports this because it uses Comicbox. Other comic readers do not support PDF embedded ComicInfo.xml, but since they already have ComicInfo.xml parsers it's possible that they might someday.

If Comicbox JSON is included in the write formats (e.g. -w pdf,json) Comicbox will write comicbox.json to the keywords field instead. It is unlikely that any other comic reader other than Codex will ever support this.

CoMet Schema

An old and uncommon comic metadata standard from a defunct comic book reader.

CoMet Specification

Location	Name
Archive	comet.xml
Import/Export	comet.xml

ComicBookInfo Schema (Comic Book Lover)

The Comic Book Lover schema. A rare but still encountered JSON schema. It probably survives because Comictagger supports writing it.

ComicBookInfo

Location	Name
Archive	Zip & Rar Comments
Import/Export	comic-book-info.json

ComicInfo Schema (Comic Rack)

The Comic Rack schema. The de facto standard of comic book metadata. The Comic Rack reader is defunct, but the Anansi Project now publishes the ComicInfo spec and has compatibly and conservatively extended it.

Anansi ComicInfo v2.1 Spec Also, an unofficial, undocumented Mylar extension to ComicInfo.xml that encodes multiple Story Arcs and Story Arc Numbers as CSV values.

Location	Name
Archive	comicinfo.xml
Import/Export	comicinfo.xml

ComicTagger Schema

The most useful comic book metadata writer is ComicTagger. It supports the ComicVine API, is extensible to other APIs, and features a nice desktop GUI. Internally, Comictagger keeps a metadata object to work with the schemas it supports. This schema allows the import and export of that schema.

Comictaggger genericmetadata.py

This schema may only be useful to developers. The author of ComicTagger offers no promises as to the stability of this API and I am very lazy, so the chances of this drifting out of date are anyone's guess. It was included because it was easy to do.

Location	Name
Archive	comictagger.json
Import/Export	comictagger.json

Comicbox Schema

The comicbox internal data structure which acts as a superset of the above schemas to allow interpolating.

Comicbox JSON Schema

JSON Format

Location	Name
Archive	comicbox.json
Import/Export	comicbox.json

YAML Format

YAML is a superset of JSON, so the JSON schema applies here.

Location	Name
Archive	comicbox.yaml
Import/Export	comicbox.yaml

CLI Format

The Comicbox CLI uses "flow style" YAML, which is an all on one line format to enter metadata on the command line.

Specifying metadata on the command line like this is additive.

Location	Name
Comicbox CLI	-m --metadata
Archive	comicbox-cli.yaml
Import/Export	comicbox-cli.yaml

Environment variables

There is a special environment variable DEBUG_TRANSFORM that will print verbose schema transform information

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.2.1

Oct 9, 2024

1.2.0

Aug 6, 2024

1.1.10

Jul 8, 2024

1.1.9

Jun 22, 2024

1.1.8

Jun 3, 2024

1.1.7

May 7, 2024

1.1.6

Apr 26, 2024

1.1.5

Apr 20, 2024

1.1.4

Mar 22, 2024

1.1.3

Mar 17, 2024

1.1.2

Mar 12, 2024

1.1.1

Mar 2, 2024

1.1.0

Feb 28, 2024

1.1.0a0 pre-release

Feb 28, 2024

1.0.0

Jan 15, 2024

1.0.0rc9 pre-release

Jan 14, 2024

1.0.0rc8 pre-release

Jan 14, 2024

1.0.0rc7 pre-release

Jan 11, 2024

1.0.0rc6 pre-release

Jan 3, 2024

1.0.0rc5 pre-release

Jan 1, 2024

1.0.0rc4 pre-release

Dec 28, 2023

1.0.0rc3 pre-release

Dec 26, 2023

1.0.0rc2 pre-release

Dec 26, 2023

1.0.0rc1 pre-release

Dec 21, 2023

1.0.0rc0 pre-release

Sep 18, 2023

0.10.1

Jun 9, 2023

0.10.0

Jun 9, 2023

0.9.1

May 20, 2023

0.9.0

May 17, 2023

0.8.0

May 15, 2023

0.7.1

May 14, 2023

0.7.0

May 11, 2023

0.6.7

Mar 30, 2023

0.6.6

Mar 30, 2023

0.6.5

Feb 20, 2023

0.6.4

Jan 13, 2023

0.6.3

Jan 5, 2023

0.6.2

Jan 4, 2023

0.6.1

Nov 8, 2022

0.6.0

Oct 17, 2022

0.6.0a0 pre-release

Oct 17, 2022

0.5.5

Aug 12, 2022

0.5.4

Jul 28, 2022

0.5.3

Jul 26, 2022

0.5.2

May 3, 2022

0.5.1

Apr 22, 2022

0.5.0

Apr 18, 2022

0.4.1 yanked

Apr 16, 2022

Reason this release was yanked:

many bugs getting pages

0.4.0

Apr 13, 2022

0.3.4

Mar 30, 2022

0.3.3

Mar 29, 2022

0.3.2

Mar 26, 2022

0.3.1

Feb 23, 2022

0.3.0

Feb 2, 2022

0.2.2

Jan 11, 2022

0.2.1

Nov 22, 2021

0.2.0

Nov 5, 2021

0.1.7

Oct 11, 2021

0.1.6

Oct 9, 2021

0.1.5

Oct 13, 2020

0.1.4

Sep 6, 2020

0.1.3

Aug 31, 2020

0.1.2

Aug 31, 2020

0.1.0

Aug 8, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

comicbox-1.2.1.tar.gz (82.0 MB view details)

Uploaded Oct 9, 2024 Source

Built Distribution

comicbox-1.2.1-py3-none-any.whl (105.2 kB view details)

Uploaded Oct 9, 2024 Python 3

File details

Details for the file comicbox-1.2.1.tar.gz.

File metadata

Download URL: comicbox-1.2.1.tar.gz
Upload date: Oct 9, 2024
Size: 82.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.3 CPython/3.12.7 Linux/5.15.0-1057-aws

File hashes

Hashes for comicbox-1.2.1.tar.gz
Algorithm	Hash digest
SHA256	`283203289ea20cafa0c0cc2f2c1fcddea68642b74d6afd6c19b7157ea07c211e`
MD5	`e73f69e5659e9af3850b779225cb6412`
BLAKE2b-256	`0da72b5a1c6bbef2751779cc9d001080d5f7da95e0b381ff4e5c4b74a8a814de`

See more details on using hashes here.

File details

Details for the file comicbox-1.2.1-py3-none-any.whl.

File metadata

Download URL: comicbox-1.2.1-py3-none-any.whl
Upload date: Oct 9, 2024
Size: 105.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.3 CPython/3.12.7 Linux/5.15.0-1057-aws

File hashes

Hashes for comicbox-1.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`af49c68dd21b0945e49e9c1a42783deb46abe7ead640522cd8685b7798fb406b`
MD5	`2ef593c231627bfbdd8e0b94dc1f9bf5`
BLAKE2b-256	`3087726ab6fa7016ff940876b55daa7f3ad85da33837ace54310ee9db2818e23`

See more details on using hashes here.

comicbox 1.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Comicbox

✨ Features

📚Comic Formats

🏷️ Metadata Formats

Usefulness

Limitations and Alternatives

📦 Installation

Dependencies

⌨️ Usage

Console

Examples

Escaping YAML

Deleting Metadata

Quirks

Packages

⚙️ Config

Log Level

🛠 Development

🤔 Motivation

📋 Schemas

Filename Schema

PDF Schema

Reading Embedded Metadata from keywords

Writing ComicInfo.xml to keywords

CoMet Schema

ComicBookInfo Schema (Comic Book Lover)

ComicInfo Schema (Comic Rack)

ComicTagger Schema

Comicbox Schema

JSON Format

YAML Format

CLI Format

Environment variables

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Reading Embedded Metadata from `keywords`

Writing ComicInfo.xml to `keywords`