NewlineJSON

Streaming newline delimited JSON I/O.

Example

Calling newlinejson.open() returns a file-like object that behaves like Python’s io.TextIOWrapper:

import newlinejson as nlj

# Copy every record from the source file to the destination file
with nlj.open('sample-data/dictionaries.json') as src, \
        nlj.open('out.json', 'w') as dst:
    for line in src:
        dst.write(line)

# Print the raw text that was written
with open('out.json') as f:
    print(f.read())
{"field2": "l1f2", "field3": "l1f3", "field1": "l1f1"}
{"field2": "l2f2", "field3": "l2f3", "field1": "l2f1"}
{"field2": "l3f2", "field3": "l3f3", "field1": "l3f1"}
{"field2": "l4f2", "field3": "l4f3", "field1": "l4f1"}
{"field2": "l5f2", "field3": "l5f3", "field1": "l5f1"}

Command Line Interface

Rather than provide another utility, the CLI is accessed from python -m newlinejson:

$ python -m newlinejson --help
Usage: newlinejson [OPTIONS] COMMAND [ARGS]...

  NewlineJSON commandline interface.

  Common simple ETL commands for homogeneous data.

Options:
  --version  Show the version and exit.
  --help     Show this message and exit.

Commands:
  csv2nlj  Convert a CSV to newline JSON dictionaries.
  insp     Open a file and launch a Python interpreter.
  nlj2csv  Convert newline JSON dictionaries to a CSV.

The included utilities are for working with homogeneous data, meaning that every line has the same fields. The goal is to provide simple data translation tools rather than a more comprehensive suite.
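To make "homogeneous" concrete, here is a minimal standard-library sketch of the CSV to newline JSON direction. It is not the module's csv2nlj implementation: the helper name csv_to_nlj and the sample values are made up for illustration, and only csv.DictReader and json.dumps are used.

```python
import csv
import io
import json


def csv_to_nlj(src, dst):
    """Write one JSON object per line for each CSV row (hypothetical helper).

    Every row takes its keys from the CSV header, so every output line has
    the same fields. Values stay strings because CSV carries no type info.
    """
    count = 0
    for row in csv.DictReader(src):
        dst.write(json.dumps(row) + '\n')
        count += 1
    return count


# In-memory demonstration with made-up values.
src = io.StringIO('field1,field2\nl1f1,l1f2\nl2f1,l2f2\n')
dst = io.StringIO()
n_rows = csv_to_nlj(src, dst)
print(dst.getvalue(), end='')
```

Data whose lines carry different fields does not map cleanly onto a single CSV header, which is why the utilities restrict themselves to the homogeneous case.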

Can’t I do everything this module does with one function?

Pretty much - this is the simplest newline delimited JSON API:

import json

def reader(stream):
    for line in stream:
        yield json.loads(line)

with open('sample-data/lists.json') as src, open('outfile.json', 'w') as dst:
    for line in reader(src):
        # Terminate each record so the output stays newline delimited
        dst.write(json.dumps(line) + '\n')

But it doesn’t handle failures, and it has to be rewritten every time it is needed. That means it should be packaged, which means it needs unit tests, and it may as well be a little more Pythonic, and now we’re back to this module. It’s easier to import newlinejson and know that it will work than to solve the exact same problem multiple times.
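One failure the one-function reader does not handle is a malformed line, which aborts the whole loop. The sketch below shows one possible way to deal with that using only the standard library. It is an assumption about what "handling failures" could mean, not a description of how newlinejson does it; the name tolerant_reader and the skip_bad_lines flag are hypothetical.

```python
import io
import json


def tolerant_reader(stream, skip_bad_lines=True):
    """Yield decoded objects, optionally skipping lines that fail to parse."""
    for lineno, line in enumerate(stream, 1):
        if not line.strip():
            continue  # ignore blank lines
        try:
            yield json.loads(line)
        except ValueError as exc:  # json.JSONDecodeError subclasses ValueError
            if not skip_bad_lines:
                # Re-raise with the line number so the bad record can be found
                raise ValueError('line {}: {}'.format(lineno, exc))
            # Otherwise drop the bad line and keep streaming


sample = io.StringIO('{"a": 1}\nnot json\n\n[1, 2]\n')
records = list(tolerant_reader(sample))
print(records)
```

Even this small addition brings decisions about blank lines, error reporting, and whether to skip or fail, which is the kind of detail that tends to get re-solved each time the function is copied.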

Why is this better than MsgPack, Protobuf, or any other packed-binary format?

It probably isn’t. If you’re looking for a module to incorporate into a high-capacity data pipeline or a bandwidth-limited environment, you definitely want a packed-binary format. If you’re working with a small amount of local data to produce a one-off product, proofing a workflow, or adding I/O capabilities to a command-line application that reads from stdin and writes to stdout, this module is pretty easy to work with.

The goal of this module is to fill a gap in the Python ecosystem in an easy-to-use and intuitive manner, not to provide highly optimized I/O. If Python’s built-in JSON library isn’t fast enough but newline delimited JSON is the right answer to your problem, one of the many faster JSON libraries can be used instead, either globally with newlinejson.core.JSON_LIB = module or per call by passing json_lib=module as a keyword argument to open(), load(), etc.
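Swapping libraries this way presumably relies on the replacement exposing the same loads()/dumps() interface as the standard json module. The following standard-library sketch illustrates that pattern only; it is not newlinejson's code. The load function, the JSON_LIB variable, and fake_lib below are local stand-ins defined for this example.

```python
import io
import json
import types

# Module-level default, playing the role of a global setting.
JSON_LIB = json


def load(stream, json_lib=None):
    """Decode every non-blank line of `stream` with `json_lib.loads`.

    Any object exposing `loads()` works, e.g. a faster third-party module.
    """
    lib = json_lib or JSON_LIB
    return [lib.loads(line) for line in stream if line.strip()]


# Stand-in for an alternative library: wraps the stdlib and records calls.
calls = []


def counting_loads(text):
    calls.append(text)
    return json.loads(text)


fake_lib = types.SimpleNamespace(loads=counting_loads, dumps=json.dumps)

data = '{"field1": "l1f1"}\n{"field1": "l2f1"}\n'
default_result = load(io.StringIO(data))                     # uses JSON_LIB
custom_result = load(io.StringIO(data), json_lib=fake_lib)   # per-call override
```

The per-call keyword is convenient for a single hot path, while the global setting changes the default everywhere at once.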

Installing

Via pip:

$ pip install NewlineJSON

From master:

$ git clone https://github.com/geowurster/NewlineJSON.git
$ cd NewlineJSON
$ python setup.py install

Developing

Install, then run the tests and the style check:

$ pip install virtualenv
$ git clone https://github.com/geowurster/NewlineJSON
$ cd NewlineJSON
$ virtualenv venv
$ source venv/bin/activate
$ pip install -e .[test]
$ py.test tests --cov newlinejson --cov-report term-missing
$ pep8 --max-line-length=95 newlinejson

License

See LICENSE.txt

Changelog

See CHANGES.md

Project details

Release history

Version	Release date
1.0 (this version)	Sep 22, 2015
0.3.2	May 22, 2015
0.3.1	May 20, 2015
0.3	May 18, 2015
0.2	Mar 8, 2015
0.1.0	Jan 8, 2015

Download files

Source Distribution

NewlineJSON-1.0.tar.gz (12.3 kB)

Uploaded Sep 22, 2015. Tags: Source

Built Distribution

NewlineJSON-1.0-py2.py3-none-any.whl (12.8 kB)

Uploaded Sep 22, 2015. Tags: Python 2, Python 3

File details

Details for the file NewlineJSON-1.0.tar.gz.

File metadata

Download URL: NewlineJSON-1.0.tar.gz
Upload date: Sep 22, 2015
Size: 12.3 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for NewlineJSON-1.0.tar.gz
Algorithm	Hash digest
SHA256	`4e483a6398a1956d7735471664dcd05f8218ccadb4d969fbc49166a3ae1434ba`
MD5	`9ce893d710a8e574d4ab7f5737103cd7`
BLAKE2b-256	`ec4c04f0e60c8cc22581b1362a40e7c1342288517f0583aee221a49a50a075ef`

File details

Details for the file NewlineJSON-1.0-py2.py3-none-any.whl.

File metadata

Download URL: NewlineJSON-1.0-py2.py3-none-any.whl
Upload date: Sep 22, 2015
Size: 12.8 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No

File hashes

Hashes for NewlineJSON-1.0-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`433498a1d5b4c4d28bca8153dd8ce77f7ac68cfde57fa58a7a8c80a930ebfd41`
MD5	`1c454f73cbf389135aeace8550e825a4`
BLAKE2b-256	`336944ed82ab29d71c0365fbf5982fdce552304741b0463fb4ddf89790abb257`
