Tools for inspecting, comparing, & cleaning JSON-Lines files
Project description
# JLTool - Tools for JSON-Lines Records by Cathal Garvey, ©2016, Released under terms of the GNU AGPLv3 or later
The [JSON-Lines format](http://jsonlines.org) is a clean alternative to comma-separated values as a means to store data records in a scaleable, flat manner, for cases where a database is too much but a flat JSON file is inefficient.
JLTool is a tool for working with JSON-Lines records; it assists in schema validation, duplicate detection, de-duplication and normalisation, and ‘grepping’ using objectpath queries.
[ObjectPath](http://objectpath.org/) is supported for most operations; in particular, for fetching a unique, representative ID from objects for purposes of deduplication or diffing documents. For grepping, ObjectPath can be used to query for matching rows.
Installing JLTool with python3 setup.py install or pip install jl will install the jltool command-line tool, which is the primary intended purpose. However, for operations on files the subcommands of jltool are all available in the jltool import if desired. Just open it in ipython and take a look at the docs on the command functions for more information.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.