Skip to main content

Reduces JSON, YAML, and NDJSON volume by collapsing repeated structures while preserving the schema.

Project description

JSON's Razor — Cut the fat

tests

Reduces JSON, YAML, and NDJSON volume by collapsing repeated structures while preserving the schema.

Large structured data files are hard to parse — not because the structure is complex, but because repetition obscures it. A list of 10,000 objects with identical shape tells you nothing more than a list of 1. JSON's Razor collapses that repetition to its minimum essential form: one representative example of each repeated structure, at every level of nesting.

The output is valid, parseable data in the same format as input — not a summary, not a schema definition. It just has far less volume.


Install

pip install json-razor

Usage

cat big.json | json-razor                    # stdin → stdout
json-razor big.json                          # file input → stdout
json-razor big.json -o small.json            # file input → file output
json-razor big.yaml                          # auto-detected as YAML
json-razor app.log --format ndjson           # NDJSON log file

Options

Flag Default Description
--keep N 1 Number of examples to keep per repeated structure
--depth N unlimited Stop collapsing below this nesting depth
--format auto Force format: json, yaml, or ndjson
--truncate N 100 Max string length before truncating

How it works

Arrays — collapsed to one item. Mixed-type arrays keep one of each distinct type.

// input
[{"id": 1, "name": "alice"}, {"id": 2, "name": "bob"}, {"id": 3, "name": "carol"}]

// output
[{"id": 1, "name": "alice"}]

Mixed types — one representative per JSON type (null, bool, number, string, array, object).

// input
[1, "hello", {"id": 1}, null, true, [1, 2, 3]]

// output
[1, "hello", {"id": 1}, null, true, [1]]

Nested structures — collapsed recursively at every level.

NDJSON — collapsed across lines; one representative line kept.

Nulls and empty values — always preserved (null, [], {}).

Long strings — truncated to a configurable preview.


Supported formats

Format Auto-detected from
JSON .json
YAML .yaml, .yml
NDJSON .ndjson

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

json_razor-0.1.1.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

json_razor-0.1.1-py3-none-any.whl (4.6 kB view details)

Uploaded Python 3

File details

Details for the file json_razor-0.1.1.tar.gz.

File metadata

  • Download URL: json_razor-0.1.1.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for json_razor-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8607ce15d022fb69a72f5f3dd635f6558094bb97e89a7c2138524db37b3eef8a
MD5 6899a90a929298e39077238af69a49dc
BLAKE2b-256 7709b9ecf6e476ed825f07eb801d09706dee5a548c1e16737c88d9d55e2fc2f0

See more details on using hashes here.

Provenance

The following attestation bundles were made for json_razor-0.1.1.tar.gz:

Publisher: release.yml on rick-does/json-razor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file json_razor-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: json_razor-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 4.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for json_razor-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8bead2a643362ae0fe3e07f18d13b3da4452af35bac865199133fda21f4366a5
MD5 8503cd39ab2f801b6ab72b02539c314e
BLAKE2b-256 f4d37bc153b74d261c8d30ae5fe82ece084cfe2c49828e202d7eb8500b382cc6

See more details on using hashes here.

Provenance

The following attestation bundles were made for json_razor-0.1.1-py3-none-any.whl:

Publisher: release.yml on rick-does/json-razor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page