Skip to main content

Easiest-possible IO for basic file types.

Project description

dummio

IO for dummies! A unified save/load interface using the most common and recommendable default options for IO between various object types and file types. For example, instead of

with open(file_path, 'r', encoding='utf-8') as file:
    data = json.load(file)

you can simply

data = dummio.json.load(file_path)

Users may pass additional keyword arguments to the underlying IO methods.

Direct IO vs cloud paths

Most dummio IO calls "just work" against cloud paths like s3://bucket/key, gs://bucket/key, or az://container/key. For example, dummio.json.load("s3://bucket/key") will read a json file from an S3 bucket. Notes:

  • Shout-out: universal-pathlib powers much of the cloud-iteroperability on our backend.
  • Warning: Although we manually run demo/cloud.py to ensure basic functionality, current CI unit testing does not cover cloud interactions.

Standardized IO interface

In some coding applications it is desirable to pass an IO module as an argument to a function. Here it is convenient to pass a dummio submodule, since all dummio submodules have the same save and load interface, having equivalent signatures (except for differences hidden in **kwargs).

Supported object and file types

So far we support:

  • text, pickle, and dill
  • simple dictionaries:
    • json
    • orjson
    • yaml
  • pandas dataframes:
    • csv
    • feather
    • parquet
  • numpy arrays (thin wrapper on numpy.save/load)
  • onnx.ModelProto instances
  • pydantic models (relying on the built-in json serialization methods)
  • mashumaro models inheriting the json or yaml serialization mixins

Filepaths passed to save and load methods can be of type str, pathlib.Path, or universal_pathlib.UPath.

Dependencies

universal-pathlib is our only required dependency.

For other dependencies, such as pandas, calling from dummio.pandas import df_parquet will raise a helpful message to install pandas if you have not already done so.

Examples

Basic IO methods can be accessed directly as dummio.text, dummio.json, etc:.

import dummio

text = "hello world"
data = {"key": text}
path = "io_example_file"

# Text
dummio.text.save(text, path=path)
assert text == dummio.text.load(path)

# YAML
dummio.yaml.save(data)
assert data == dummio.yaml.load(path)

See demo/cloud.py for more many other examples.

Installation

We're on pypi, so pip install dummio.

If working directly on this repo, consider using the simplest-possible virtual environment.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dummio-1.10.0.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dummio-1.10.0-py3-none-any.whl (18.1 kB view details)

Uploaded Python 3

File details

Details for the file dummio-1.10.0.tar.gz.

File metadata

  • Download URL: dummio-1.10.0.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.6

File hashes

Hashes for dummio-1.10.0.tar.gz
Algorithm Hash digest
SHA256 354fb3999606e1032d433c1ee6751a21b2dd3104897b68c39d74f0ecad56306a
MD5 a739b559e78a981a98996aa762f453da
BLAKE2b-256 404642ff2e5f25fed59945a41691ac13628e0bd31309a59bc5d4c1dfd3467284

See more details on using hashes here.

File details

Details for the file dummio-1.10.0-py3-none-any.whl.

File metadata

  • Download URL: dummio-1.10.0-py3-none-any.whl
  • Upload date:
  • Size: 18.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.6

File hashes

Hashes for dummio-1.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6d2b6893a3ea187c6be52bd78145820affb7c712f4bf146438e78d24e2e1df3d
MD5 0aec13adfb0ca2688ccbe1274490f663
BLAKE2b-256 9506db8b675db8cbf31bb8112a8abffe26fafb59d67ad6842cd4bfffa4682bcb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page