Easiest-possible IO for basic file types.

dummio

IO for dummies! A unified save/load interface that uses the most common, recommended default options for IO between various object types and file types. For example, instead of

with open(file_path, 'r', encoding='utf-8') as file:
    data = json.load(file)

you can simply

data = dummio.json.load(file_path)

Users may pass additional keyword arguments to the underlying IO methods.
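The keyword-argument pass-through described above can be pictured with a small standard-library sketch. This is not dummio's actual source: the functions save and load below are illustrative stand-ins that only assume the call shape shown in this README (data first, path as a keyword) and forward any extra keyword arguments to json.dump and json.load.

```python
import json
import tempfile
from pathlib import Path
from typing import Any, Union

PathLike = Union[str, Path]


def save(data: Any, *, path: PathLike, **kwargs: Any) -> None:
    """Write data as UTF-8 JSON; extra kwargs are forwarded to json.dump."""
    with open(path, "w", encoding="utf-8") as file:
        json.dump(data, file, **kwargs)


def load(path: PathLike, **kwargs: Any) -> Any:
    """Read UTF-8 JSON; extra kwargs are forwarded to json.load."""
    with open(path, "r", encoding="utf-8") as file:
        return json.load(file, **kwargs)


# Round trip in a temporary directory, passing json.dump options through.
with tempfile.TemporaryDirectory() as tmp:
    example_path = Path(tmp) / "example.json"
    save({"b": 1, "a": 2}, path=example_path, indent=2, sort_keys=True)
    raw_text = example_path.read_text(encoding="utf-8")
    round_trip = load(example_path)
```

The caller never touches open() or the encoding, yet can still reach options of the underlying serializer, here indent and sort_keys.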

Direct IO vs cloud paths

Most dummio IO calls "just work" against cloud paths like s3://bucket/key, gs://bucket/key, or az://container/key. For example, dummio.json.load("s3://bucket/key") will read a json file from an S3 bucket. Notes:

  • Shout-out: universal-pathlib powers much of the cloud interoperability on our backend.
  • Warning: Although we manually run demo/cloud.py to ensure basic functionality, current CI unit testing does not cover cloud interactions.

Standardized IO interface

Some applications need to pass an IO module as an argument to a function. A dummio submodule is convenient here, since all dummio submodules share the same save and load interface with equivalent signatures (except for differences hidden in **kwargs).
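As a rough illustration of passing an IO module as an argument, the sketch below defines a function that accepts anything exposing save and load. The names IOModule, JsonIO, PickleIO, and round_trip are hypothetical and built only on the standard library so the example runs without dummio; they assume the save(data, path=...) and load(path) shape used elsewhere in this README.

```python
import json
import pickle
import tempfile
from pathlib import Path
from typing import Any, Protocol


class IOModule(Protocol):
    """Anything with a matching save/load pair (hypothetical protocol)."""

    def save(self, data: Any, *, path: Path) -> None: ...

    def load(self, path: Path) -> Any: ...


class JsonIO:
    """Stand-in for a JSON IO submodule."""

    def save(self, data: Any, *, path: Path) -> None:
        Path(path).write_text(json.dumps(data), encoding="utf-8")

    def load(self, path: Path) -> Any:
        return json.loads(Path(path).read_text(encoding="utf-8"))


class PickleIO:
    """Stand-in for a pickle IO submodule."""

    def save(self, data: Any, *, path: Path) -> None:
        Path(path).write_bytes(pickle.dumps(data))

    def load(self, path: Path) -> Any:
        return pickle.loads(Path(path).read_bytes())


def round_trip(data: Any, path: Path, io: IOModule) -> Any:
    """Save and reload data using whichever IO object the caller supplies."""
    io.save(data, path=path)
    return io.load(path)


payload = {"key": "hello world"}
with tempfile.TemporaryDirectory() as tmp:
    from_json = round_trip(payload, Path(tmp) / "payload.json", JsonIO())
    from_pickle = round_trip(payload, Path(tmp) / "payload.pkl", PickleIO())
```

With dummio installed, the intent is that a submodule such as dummio.json could be passed in place of the stand-in objects, since the function only relies on the shared save and load names.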

Supported object and file types

So far we support:

  • text, pickle, and dill
  • simple dictionaries:
    • json
    • yaml
  • pandas dataframes:
    • csv
    • feather
    • parquet
  • numpy arrays (thin wrapper on numpy.save/load)
  • onnx.ModelProto instances
  • pydantic models (relying on the built-in json serialization methods)
  • mashumaro models inheriting the json or yaml serialization mixins

File paths passed to save and load methods can be of type str, pathlib.Path, or upath.UPath (from universal-pathlib).

Dependencies

universal-pathlib is our only required dependency.

Other dependencies are optional. For example, if pandas is not installed, the statement from dummio.pandas import df_parquet raises a helpful message telling you to install it.
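One common way to produce such a message is to wrap the import and re-raise with installation instructions. The sketch below shows that general pattern only; the helper name require and the message wording are assumptions, not dummio's actual implementation.

```python
import importlib
from types import ModuleType
from typing import Optional


def require(module_name: str, *, pip_name: Optional[str] = None) -> ModuleType:
    """Import a module, or raise an ImportError that says how to install it."""
    try:
        return importlib.import_module(module_name)
    except ImportError as err:
        package = pip_name or module_name
        raise ImportError(
            f"{module_name!r} is required for this feature. "
            f"Install it with: pip install {package}"
        ) from err


# An installed module is returned as usual.
json_module = require("json")

# A missing module yields an actionable message instead of a bare failure.
try:
    require("package_that_is_not_installed_anywhere")
except ImportError as err:
    message = str(err)
```

Deferring the import this way keeps the base install light while still telling users exactly what to add when they reach for an optional feature.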

Examples

Basic IO methods can be accessed directly as dummio.text, dummio.json, etc.:

import dummio

text = "hello world"
data = {"key": text}
path = "io_example_file"

# Text
dummio.text.save(text, path=path)
assert text == dummio.text.load(path)

# YAML
dummio.yaml.save(data, path=path)
assert data == dummio.yaml.load(path)

See demo/cloud.py for many other examples.

Installation

We're on PyPI, so just run pip install dummio.

If working directly on this repo, consider using the simplest-possible virtual environment.

Source Distribution

dummio-1.9.0.tar.gz (10.2 kB view details)

Uploaded Source

Built Distribution

dummio-1.9.0-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file dummio-1.9.0.tar.gz.

File metadata

  • Download URL: dummio-1.9.0.tar.gz
  • Size: 10.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.11

File hashes

Hashes for dummio-1.9.0.tar.gz
Algorithm Hash digest
SHA256 8e58f35da0220afb57fa7e75cef76767fb70ad8dca53451027f99d6c6b1c186a
MD5 dee21f9dc5b9cd6f95743475a5509043
BLAKE2b-256 a4bed62c5c6187c4193e5471338bafe31c7cf66cee567270d250c1f9766c4db6

File details

Details for the file dummio-1.9.0-py3-none-any.whl.

File metadata

  • Download URL: dummio-1.9.0-py3-none-any.whl
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.11

File hashes

Hashes for dummio-1.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 de4bab950e6f70f668376b2f2455c63a809463ffa9764ab3f0c975036124cc6a
MD5 6da6469525f1139119d213fa74b6a8e0
BLAKE2b-256 0f067ae5e6efd0b6c316b8e5f841c0065e070bd94ecb3bdb0445040f612c6113
