Skip to main content

A Unified File I/O Library for Python

Project description

⚓ Minato

Actions Status Python version License pypi version

A Unified File I/O Library for Python

Minato is a Python library that provides a unified and simple interface to work with local and remote files, as well as compressed and archived files. With Minato, you can seamlessly read and write files from various sources like local filesystem, HTTP(S), Amazon S3, Google Cloud Storage, and Hugging Fase Hub. It also supports reading and writing compressed files such as gzip, bz2, and lzma, as well as directly accessing files inside archives like zip and tar.

One of Minato's key features is its built-in caching mechanism, which allows you to cache remote files locally, and manage the cache with a provided CLI. The cache is automatically updated based on ETag headers, ensuring that you always work with the latest version of the files.

Features

  • Unified file I/O for local and remote files (HTTP(S), S3, GCP, Hugging Face Hub)
  • Support for reading and writing compressed files (gzip, bz2, lzma)
  • Direct access to files inside archives (zip, tar)
  • Local caching of remote files with cache management CLI
  • Automatic cache updates based on ETag headers

Installation

Install Minato using pip:

pip install minato                   # minimal installation for only local/http(s) file I/O
pip install minato[s3]               # for Amazon S3
pip install minato[gcs]              # for Google Cloud Storage
pip install minato[huggingface-hub]  # for Hugging Face Hub
pip install minato[all]              # for all supported file I/O

Usage

Quick Start

Here's a simple example demonstrating how to read and write files on online storage:

import minato

# Write a file to an S3 bucket
s3_path = "s3://your_bucket/path/to/file"
with minato.open(s3_path, "w") as f:
    f.write("Create a new file on AWS S3!")

Access cached online resources in local storage:

# Cache a remote file and get its local path
remote_path = "http://example.com/path/to/archive.zip!inner/path/to/file"
local_filename = minato.cached_path(remote_path)

Access files inside archives like zip by connecting the archive path and inner file path with an exclamation mark (!) like above.

Automatically decompress files with gzip / lzma / bz2 compression:

with minato.open("data.txt.gz", "rt", decompress=True) as f:
    content = f.read()

In the example above, Minato will automatically detect the file format based on the file's content or filename and decompress the file accordingly.

Cache Management

 poetry run minato --help
usage: minato

positional arguments:
  {cache,list,remove,update}
    cache               cache remote file and return cached local file path
    list                show list of cached files
    remove              remove cached files
    update              update cached files

optional arguments:
  -h, --help            show this help message and exit
  --version             show program's version number and exit

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

minato-0.12.2.tar.gz (24.1 kB view details)

Uploaded Source

Built Distribution

minato-0.12.2-py3-none-any.whl (34.8 kB view details)

Uploaded Python 3

File details

Details for the file minato-0.12.2.tar.gz.

File metadata

  • Download URL: minato-0.12.2.tar.gz
  • Upload date:
  • Size: 24.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.8.2 Linux/5.15.0-1041-azure

File hashes

Hashes for minato-0.12.2.tar.gz
Algorithm Hash digest
SHA256 1616fcb0f272e9302cea43e8a2449cc38efd974b774bd9c57ffcef7f16b4ee61
MD5 750c81a17e8d98863acd2aff3eedd20a
BLAKE2b-256 db57a3d814449596f16fb580d1f9842a1bba80dac3cb162afeeaa409aac09145

See more details on using hashes here.

File details

Details for the file minato-0.12.2-py3-none-any.whl.

File metadata

  • Download URL: minato-0.12.2-py3-none-any.whl
  • Upload date:
  • Size: 34.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.8.2 Linux/5.15.0-1041-azure

File hashes

Hashes for minato-0.12.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8ba0b07c72ca0cb2a456d0e6696ee47c4046b8af720e85e29a46af6409861508
MD5 3d09e9d4ba107d3de080e1a1bca380e4
BLAKE2b-256 8ab1cb8c92a99a7b1f92f3d08aa0b3f14b93710ae41743cd112b0532a9aae879

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page