No project description provided
Project description
What is this?
A JSON-based DSL for describing paths in your project.
Why is this?
My etl/data analysis projects are littered with code like,
import os
DATA_DIR = "data"
CLEAN_DIR = os.path.join(DATA_DIR, "clean")
RAW_DIR = os.path.join(DATA_DIR, "raw")
TARGET_HTML = os.path.join(RAW_DIR, "something.html")
OUTPUT_FILE = os.path.join(CLEAN_DIR, "something.csv")
with open(TARGET_HTML) as fp:
csv = process(fp)
with open(OUTPUT_FILE) as fp:
write_csv(fp)
It’s fine for one file, but when you have a whole ELT pipeline tucked into a Makefile, the duplication leads to fragility and violates DRY. It’s a REALLY common pattern in file-based processing. This package and format lets you do create a .paths.json file like,
{
"__ENV": {"VERSION": "0.0.1"},
"DATA_DIR": ["data", "$$VERSION"],
"CLEAN_DIR": ["$DATA_DIR", "clean"],
"RAW_DIR": ["$DATA_DIR", "raw"],
"SOMETHING_HTML": ["$RAW_DIR", "something.html"],
"SOMETHING_CSV": ["$RAW_DIR", "something.csv"]
}
Then, from your python scripts,
from pathsjson.automagic import PATHS
print("Processing:", PATHS['SOMETHING_HTML'])
with PATHS.resolve('SOMETHING_HTML').open() as fp:
csv = process(fp)
with PATHS.resolve('SOMETHING_CSV').open("w") as fp:
write_csv(fp)
Installation
pip install pathsjson
Validation
There is a .paths.json schema. It’s validated with JSON-Schema.
More details
Read the docs: here.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pathsjson-0.0.3.tar.gz
(12.0 kB
view details)
File details
Details for the file pathsjson-0.0.3.tar.gz
.
File metadata
- Download URL: pathsjson-0.0.3.tar.gz
- Upload date:
- Size: 12.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 61370bbfdfe99e068cd5582338d6d13af021031144bebecbfcf612b9c638488a |
|
MD5 | f328d2497cb1c9db37c6b541fe867332 |
|
BLAKE2b-256 | 808e5066f27ada75500358cc2a69924fb999b0312e4937d09e756e392bd180ef |