Easiest-possible IO for basic file types.

dummio

IO for dummies! A unified save/load interface that uses the most common, recommended default options for IO between various object types and file types. For example, instead of

import json

with open(file_path, 'r', encoding='utf-8') as file:
    data = json.load(file)

you can simply

data = dummio.json.load(file_path)

Users may pass additional keyword arguments to the underlying IO methods.
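As a rough illustration of how keyword arguments can reach the underlying IO call, here is a minimal stdlib-only sketch. The functions save_json and load_json are hypothetical stand-ins, not dummio's actual implementation; they only show the forwarding pattern with json.dump and json.load.

```python
import json
import tempfile
from pathlib import Path
from typing import Any, Union


def save_json(data: Any, path: Union[str, Path], **kwargs: Any) -> None:
    """Hypothetical wrapper: extra keyword arguments go straight to json.dump."""
    with open(path, "w", encoding="utf-8") as file:
        json.dump(data, file, **kwargs)


def load_json(path: Union[str, Path], **kwargs: Any) -> Any:
    """Hypothetical wrapper: extra keyword arguments go straight to json.load."""
    with open(path, "r", encoding="utf-8") as file:
        return json.load(file, **kwargs)


with tempfile.TemporaryDirectory() as tmp_dir:
    example_path = Path(tmp_dir) / "example.json"
    # indent and sort_keys are forwarded to json.dump unchanged.
    save_json({"b": 1.5, "a": 2}, example_path, indent=2, sort_keys=True)
    written_text = example_path.read_text(encoding="utf-8")
    # parse_float is forwarded to json.load unchanged.
    loaded = load_json(example_path, parse_float=str)
```

With a wrapper of this shape, the caller keeps the short call signature for the common case and only spells out options when the defaults are not enough.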

Direct IO vs cloud paths

Most dummio IO calls "just work" against cloud paths like s3://bucket/key, gs://bucket/key, or az://container/key. For example, dummio.json.load("s3://bucket/key") will read a JSON file from an S3 bucket. Notes:

  • Shout-out: universal-pathlib powers much of the cloud interoperability on our backend.
  • Warning: Although we manually run demo/cloud.py to ensure basic functionality, current CI unit testing does not cover cloud interactions.

Standardized IO interface

In some coding applications it is desirable to pass an IO module as an argument to a function. Here it is convenient to pass a dummio submodule, since all dummio submodules have the same save and load interface, having equivalent signatures (except for differences hidden in **kwargs).
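The pattern of passing an IO module as an argument might look like the following sketch. To keep it runnable without extra installs, text_io and json_io are hypothetical stand-ins built from the standard library; they mimic a shared save/load interface and are not dummio's own submodules.

```python
import json
import tempfile
from pathlib import Path
from types import SimpleNamespace
from typing import Any


def _save_text(data: str, path: Path) -> None:
    Path(path).write_text(data, encoding="utf-8")


def _load_text(path: Path) -> str:
    return Path(path).read_text(encoding="utf-8")


def _save_json(data: Any, path: Path) -> None:
    Path(path).write_text(json.dumps(data), encoding="utf-8")


def _load_json(path: Path) -> Any:
    return json.loads(Path(path).read_text(encoding="utf-8"))


# Hypothetical stand-ins for IO submodules that share one save/load interface.
text_io = SimpleNamespace(save=_save_text, load=_load_text)
json_io = SimpleNamespace(save=_save_json, load=_load_json)


def round_trip(io_module: Any, data: Any, path: Path) -> Any:
    """Save and reload data with whichever IO object is passed in."""
    io_module.save(data, path=path)
    return io_module.load(path)


with tempfile.TemporaryDirectory() as tmp_dir:
    text_result = round_trip(text_io, "hello world", Path(tmp_dir) / "example.txt")
    json_result = round_trip(json_io, {"key": "hello world"}, Path(tmp_dir) / "example.json")
```

Because round_trip only relies on save and load, swapping the file format is a one-argument change at the call site.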

Supported object and file types

So far we support:

  • text, pickle, and dill
  • simple dictionaries:
    • json
    • yaml
  • pandas dataframes:
    • csv
    • feather
    • parquet
  • numpy arrays (thin wrapper on numpy.save/load)
  • onnx.ModelProto instances
  • pydantic models (relying on the built-in json serialization methods)
  • mashumaro models inheriting the json or yaml serialization mixins

Filepaths passed to save and load methods can be of type str, pathlib.Path, or upath.UPath (from the universal-pathlib package).

Dependencies

universal-pathlib is our only required dependency.

For optional dependencies such as pandas, running from dummio.pandas import df_parquet raises a helpful message prompting you to install pandas if it is not already installed.
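One common way to produce such a message is to wrap the import and re-raise with install instructions. The sketch below shows that general pattern only; the helper name require and the message wording are assumptions for illustration, not dummio's actual code.

```python
import importlib
from types import ModuleType
from typing import Optional


def require(module_name: str, install_name: Optional[str] = None) -> ModuleType:
    """Import a module, or raise an ImportError that says how to install it."""
    try:
        return importlib.import_module(module_name)
    except ImportError as error:
        package = install_name or module_name
        raise ImportError(
            f"'{module_name}' is required for this feature. "
            f"Install it with: pip install {package}"
        ) from error


# An installed module is returned as usual.
json_module = require("json")

# A missing module produces the friendlier message.
try:
    require("surely_not_an_installed_module_xyz")
    message = ""
except ImportError as error:
    message = str(error)
```

Deferring the import this way keeps the base install light, since the optional package is only needed when the matching submodule is actually used.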

Examples

Basic IO methods can be accessed directly as dummio.text, dummio.json, etc.:

import dummio

text = "hello world"
data = {"key": text}
path = "io_example_file"

# Text
dummio.text.save(text, path=path)
assert text == dummio.text.load(path)

# YAML
dummio.yaml.save(data, path=path)
assert data == dummio.yaml.load(path)

See demo/cloud.py for many other examples.

Installation

We're on PyPI, so pip install dummio.

If working directly on this repo, consider using the simplest-possible virtual environment.
