Skip to main content

ISCC CLI - Creates ISCC Codes from Media Files

Project description

iscc-cli - Command Line Tool

Linux Build Status Windows Build Status Version

A command line tool that creates ISCC Codes for digital media files based on the reference implementation.

Table of Contents

Background

The International Standard Content Code is a proposal for an open standard for decentralized content identification. ISCC Codes are generated algorithmically from the content itself and offer many powerful features like content similarity clustering and partial integrity checks. If you want to learn more about the ISCC please check out https://iscc.codes.

Requirements

NOTE: Requires JAVA to be installed and on your path!

iscc-cli is tested on Linux and Windows with Python 3.5/3.6/3.7.

This tool depends on tika-python. Tika is used for extracting metadata and content from media files before generating ISCC Codes. On first execution of the iscc command line tool it will automatically download and launch the Java Tika Server in the background (this may take some time). Consecutive runs will access the existing Tika instance. You may explicitly pre-launch the Tika server with $ iscc init

Install

The ISCC command line tool is published with the package name iscc-cli on the Python Package Index and can be installed with pip:

$ pip3 install iscc-cli

Usage

Getting Help

Show help overview by calling iscc without any arguments:

$ iscc
Usage: iscc [OPTIONS] COMMAND [ARGS]...

Options:
  --version  Show the version and exit.
  --help     Show this message and exit.

Commands:
  gen*   Generate ISCC Code for a single media file.
  batch  Batch create ISCC Codes.
  init   Inititalize and check Tika server.
  sim    Estimate Similarity of ISCC Codes.

Get help for a specific command by entering iscc <command>:

$ iscc gen
Usage: iscc gen [OPTIONS] FILE

  Generate ISCC Code for FILE.

Options:
  -t, --title TEXT  Title for Meta-ID creation.
  -e, --extra TEXT  Extra text for Meta-ID creation.
  -v, --verbose     Enables verbose mode.
  -h, --help        Show this message and exit.

Generating ISCC Codes

The gen command generates an ISCC Code for a single file:

$ iscc gen tests/demo.jpg
ISCC:CCTcjug7rM3Da-CYDfTq7Qc7Fre-CDYkLqqmQJaQk-CRAPu5NwQgAhv

The gen command is default so you can skip it and simply do $ iscc tests/demo.jpg

To get a more detailed result use the -v (--verbose) option:

$ iscc -v tests/demo.jpg
ISCC:CCTcjug7rM3Da-CYDfTq7Qc7Fre-CDYkLqqmQJaQk-CRAPu5NwQgAhv
Norm Title: concentrated cat
Tophash:    7a8d0c513142c45f417e761355bf71f11ad61d783cd8958ffc0712d00224a4d0
Filepath:   tests/demo.jpg
GMT:        image

See iscc batch for help on how to generate ISCC codes for multiple files at once.

Similarity of ISCC Codes

The sim command computes estimated similarity of two ISCC Codes:

$ iscc sim CCUcKwdQc1jUM CCjMmrCsKWu1D
Estimated Similarity of Meta-ID: 87.50 %

You may also compare full four-component ISCC Codes.

Maintainers

@titusz

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

You may also want join our developer chat on Telegram at https://t.me/iscc_dev.

Change Log

[0.3.0] - 2019-06-01

  • Add sim command similarity comparison of ISCC Codes

[0.2.0] - 2019-05-31

  • Add support for doc, docx and rtf documents
  • Update to ISCC 1.0.4 (fixes whitespace bug)

[0.1.0] - 2019-05-31

  • Basic ISCC Code creation
  • Supported file types: jpg, png, pdf, epub

License

MIT © 2019 Titusz Pan

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iscc-cli-0.3.0.tar.gz (8.6 kB view hashes)

Uploaded Source

Built Distribution

iscc_cli-0.3.0-py3-none-any.whl (10.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page