Skip to main content

Tools for inspecting, comparing, & cleaning JSON-Lines files

Project description

# JLTool - Tools for JSON-Lines Records by Cathal Garvey, ©2016, Released under terms of the GNU AGPLv3 or later

The [JSON-Lines format]( is a clean alternative to comma-separated values as a means to store data records in a scaleable, flat manner, for cases where a database is too much but a flat JSON file is inefficient.

JLTool is a tool for working with JSON-Lines records; it assists in schema validation, duplicate detection, de-duplication and normalisation, and ‘grepping’ using objectpath queries.

[ObjectPath]( is supported for most operations; in particular, for fetching a unique, representative ID from objects for purposes of deduplication or diffing documents. For grepping, ObjectPath can be used to query for matching rows.

Installing JLTool with python3 install or pip install jl will install the jltool command-line tool, which is the primary intended purpose. However, for operations on files the subcommands of jltool are all available in the jltool import if desired. Just open it in ipython and take a look at the docs on the command functions for more information.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jltool-1.0.0.tar.gz (6.3 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page