Stream JSON and JSON-Lines lazily.
Project description
json-streams
Read and write JSON lazy, especially json-arrays.
Handles both the JSON format:
[
{
"a": 1
},
{
"a": 2
}
]
As well as JSON LINES format:
{"a":1}
{"a": 2}
Uses orjson
if present, otherwise standard json
.
Usage
Installation
# Using standard json
pip install json-streams
# Using orjson
pip install json-streams[orjson]
Note
This library prefers files opened in binary mode.
Therefore does all dumps
-methods return bytes
.
All loads
methods handles str
, bytes
and bytesarray
arguments.
Examples
Allows you to use json.load
and json.dump
with
both json and json-lines files as well as dumping generators.
import json_streams
# This command tries to guess format and opens the file
data = json_streams.load_from_file("data.json") # or data.jsonl
# Write to file, again guessing format
json_streams.dump_to_file(data, "data.jsonl")
from json_streams import json_iter, jsonl_iter
# Open and read the file without guessing
data = json_iter.load_from_file("data.json")
# Process file
# Write to file without guessing
jsonl_iter.dump_to_file(data, "data.jsonl")
import json_streams
def process(data):
for entry in data:
# process
yield entry
def read_process_and_write(filename_in, filename_out):
json_streams.dump_to_file(
process(
json_streams.load_from_file(filename_in)
),
filename_out
)
You can also use json_streams as a sink, that you can send data to.
import json_streams
with open("out.json", "bw") as fp:
# guessing format
with json_streams.sink(fp) as sink:
for data in data_source():
sink.send(data)
Changelog
Development
After cloning the repo, just run
$ make dev
$ make test
to setup a virtual environment, install dev dependencies and run the unit tests.
Note: If you run the command in a activated virtual environment, that environment is used instead.
Deployment
Push a tag in the format v\d+.\d+.\d+
to main
-branch, to build & publish package to PyPi.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for json_streams-0.11.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b375443011fa6d367faa3c5847c74d87ed16af2f12d2a961e85a27cbaa853596 |
|
MD5 | b5bad18a586e8fb92820af7a35f517ba |
|
BLAKE2b-256 | 70cd0e13948a6d42fd1e4a5daa92b9a4d6d259f2566b1aaca5dbd02402c4d309 |