Stream JSON and JSON-Lines lazily.
json-arrays
Read and write JSON lazily, especially JSON arrays.
Handles both the JSON format:
[
  {
    "a": 1
  },
  {
    "a": 2
  }
]
as well as the JSON Lines format:
{"a":1}
{"a": 2}
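To make the difference between the two layouts concrete, here is a minimal sketch that uses only the standard json module, not json_arrays itself. The variable names are illustrative assumptions.

```python
import json

# The same two records, first as one JSON array, then as JSON Lines.
as_array = '[{"a": 1}, {"a": 2}]'
as_lines = '{"a":1}\n{"a": 2}\n'

# A JSON array is a single document; here it is parsed as a whole.
records_from_array = json.loads(as_array)

# JSON Lines holds one document per line, so each line parses on its own.
records_from_lines = [
    json.loads(line) for line in as_lines.splitlines() if line.strip()
]

print(records_from_array == records_from_lines)
```

Because every line of a JSON Lines file is a complete document, it can be read and written record by record, which is what makes lazy processing straightforward.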
Also supports streaming from gzipped files.
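As a rough illustration of what streaming from a gzipped file means, the sketch below reads a gzipped JSON Lines file record by record with the standard library only. The helper name iter_gzipped_jsonl and the temporary file are assumptions made for this example and are not part of json_arrays.

```python
import gzip
import json
import os
import tempfile
from typing import Any, Iterator


def iter_gzipped_jsonl(path: str) -> Iterator[Any]:
    """Yield one parsed record per non-empty line of a gzipped JSON Lines file."""
    with gzip.open(path, "rb") as fp:
        for line in fp:  # the file is decompressed incrementally while iterating
            if line.strip():
                yield json.loads(line)


# Write a small gzipped file into a temporary directory, then stream it back.
tmp_dir = tempfile.mkdtemp()
gz_path = os.path.join(tmp_dir, "data.jsonl.gz")
with gzip.open(gz_path, "wb") as fp:
    fp.write(b'{"a":1}\n{"a": 2}\n')

streamed = list(iter_gzipped_jsonl(gz_path))
print(streamed)
```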
Uses orjson if present, otherwise the standard json module.
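The usual way to prefer an optional dependency is a guarded import. The following is a sketch of that general pattern, not the actual json_arrays source; the helper names dumps_bytes and loads_any are hypothetical.

```python
import json

try:
    import orjson  # optional third-party dependency
except ImportError:
    orjson = None


def dumps_bytes(obj) -> bytes:
    """Serialize obj to bytes, using orjson when it is installed."""
    if orjson is not None:
        return orjson.dumps(obj)  # orjson.dumps already returns bytes
    return json.dumps(obj).encode("utf-8")


def loads_any(data):
    """Parse str, bytes or bytearray input with whichever backend is available."""
    if orjson is not None:
        return orjson.loads(data)
    return json.loads(data)


print(loads_any(dumps_bytes({"a": 1})))
```

Callers see the same behavior either way; only the speed differs depending on whether orjson is installed.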
Usage
Installation
# Using standard json
pip install json-arrays
# Using orjson
pip install json-arrays[orjson]
Note
This library prefers files opened in binary mode.
Therefore, all dumps methods return bytes.
All loads methods handle str, bytes and bytearray arguments.
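The same convention can be illustrated with the standard json module alone. This sketch does not call json_arrays, whose exact dumps and loads signatures are not shown here; the in-memory buffer stands in for a file opened in binary mode.

```python
import io
import json

record = {"a": 1}

# Encode to bytes so the result can be written to a binary-mode file.
payload = json.dumps(record).encode("utf-8")

buffer = io.BytesIO()  # stands in for open("data.json", "wb")
buffer.write(payload)

# json.loads accepts str, bytes and bytearray alike.
inputs = [payload.decode("utf-8"), payload, bytearray(payload)]
decoded = [json.loads(item) for item in inputs]
print(decoded)
```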
Examples
Allows you to use json.load and json.dump with both json and json-lines files, as well as dumping generators.
import json_arrays
# This command tries to guess format and opens the file
data = json_arrays.load_from_file("data.json") # or data.jsonl
# Write to file, again guessing format
json_arrays.dump_to_file(data, "data.jsonl")
from json_arrays import json_iter, jsonl_iter
# Open and read the file without guessing
data = json_iter.load_from_file("data.json")
# Process file
# Write to file without guessing
jsonl_iter.dump_to_file(data, "data.jsonl")
import json_arrays

def process(data):
    for entry in data:
        # process
        yield entry

def read_process_and_write(filename_in, filename_out):
    json_arrays.dump_to_file(
        process(
            json_arrays.load_from_file(filename_in)
        ),
        filename_out
    )
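The pipeline above works lazily because each stage is a generator that is only advanced when the writer asks for the next record. The sketch below mimics that flow with the standard library; read_jsonl and write_jsonl are hypothetical stand-ins, not json_arrays functions, and the in-memory buffers replace real files.

```python
import io
import json
from typing import Any, Iterable, Iterator


def read_jsonl(fp) -> Iterator[Any]:
    """Hypothetical stand-in for a lazy loader: parse one line at a time."""
    for line in fp:
        if line.strip():
            yield json.loads(line)


def process(records: Iterable[dict]) -> Iterator[dict]:
    for entry in records:
        entry["a"] *= 10  # placeholder transformation
        yield entry


def write_jsonl(records: Iterable[Any], fp) -> int:
    """Hypothetical stand-in for a lazy dumper: pull and write record by record."""
    count = 0
    for record in records:
        fp.write(json.dumps(record).encode("utf-8") + b"\n")
        count += 1
    return count


source = io.BytesIO(b'{"a":1}\n{"a": 2}\n')
target = io.BytesIO()
written = write_jsonl(process(read_jsonl(source)), target)
print(written, target.getvalue())
```

At no point is the full list of records held in memory; only the record currently being processed exists.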
You can also use json_arrays as a sink that you can send data to.
import json_arrays

with open("out.json", "bw") as fp:
    # guessing format
    with json_arrays.sink(fp) as sink:
        for data in data_source():
            sink.send(data)
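One way such a send-based sink can be built is with a generator used as a coroutine, wrapped in a context manager. This is a sketch of that general technique under the assumption of JSON Lines output; it is not the json_arrays implementation, and jsonl_sink is a hypothetical name.

```python
import io
import json
from contextlib import contextmanager


def _jsonl_writer(fp):
    """Generator-based coroutine: each value sent in is written as one line."""
    while True:
        record = yield
        fp.write(json.dumps(record).encode("utf-8") + b"\n")


@contextmanager
def jsonl_sink(fp):
    """Hypothetical sink for illustration only."""
    writer = _jsonl_writer(fp)
    next(writer)  # advance to the first yield so send() can be used
    try:
        yield writer
    finally:
        writer.close()  # stop the coroutine when the with-block exits


out = io.BytesIO()  # stands in for a file opened in binary mode
with jsonl_sink(out) as sink:
    for data in ({"a": 1}, {"a": 2}):
        sink.send(data)

print(out.getvalue())
```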
Release Notes
This project keeps a CHANGELOG.
Development
This project uses pdm. After cloning the repo, just run
make dev
make test
to set up a virtual environment, install dev dependencies and run the unit tests.
Note: If you run the commands in an activated virtual environment, that environment is used instead.
Deployment
Push a tag in the format v\d+.\d+.\d+ to the main branch to build and publish the package to PyPI.