Skip to main content

DataLad FUSE extension package

Project description

DataLad FUSE extension package

codecov.io tests docs

datalad-fuse provides commands for reading files in a DataLad dataset from their remote web URLs without having to download them in their entirety first. Instead, fsspec is used to sparsely download and locally cache the files as needed.

Installation

datalad-fuse requires Python 3.6 or higher. Just use pip for Python 3 (You have pip, right?) to install it:

python3 -m pip install datalad-fuse

In addition, use of the datalad fusefs command requires FUSE to be installed; on Debian-based systems, this can be done with:

sudo apt-get install fuse

Commands

datalad fsspec-cache-clear [<options>]

Clears the local download cache for a dataset.

Options

  • -d <DATASET>, --dataset <DATASET> — Specify the dataset to operate on. If no dataset is given, an attempt is made to identify the dataset based on the current working directory.

  • -r, --recursive — Clear the caches of subdatasets as well.

datalad fsspec-head [<options>] <path>

Shows leading lines/bytes of an annexed file by fetching its data from a remote URL.

Options

  • -d <DATASET>, --dataset <DATASET> — Specify the dataset to operate on. If no dataset is given, an attempt is made to identify the dataset based on the current working directory.

  • -n <INT>, --lines <INT> — How many lines to show (default: 10)

  • -c <INT>, --bytes <INT> — How many bytes to show

datalad fusefs [<options>] <mount-path>

Create a read-only FUSE mount at <mount-path> that exposes the files in the given dataset. Opening a file under the mount that is not locally present in the dataset will cause its contents to be downloaded from the file's web URL as needed.

When the command finishes, fsspec-cache-clear may be run depending on the value of the datalad.fusefs.cache-clear configuration option. If it is set to "visited", then any (sub)datasets that were accessed in the FUSE mount will have their caches cleared; if it is instead set to "recursive", then all (sub)datasets in the dataset being operated on will have their caches cleared.

Options

  • --allow-other — Allow all users to access files in the mount. This requires setting user_allow_other in /etc/fuse.conf.

  • -d <DATASET>, --dataset <DATASET> — Specify the dataset to operate on. If no dataset is given, an attempt is made to identify the dataset based on the current working directory.

  • -f, --foreground — Run the FUSE process in the foreground; use Ctrl-C to exit. This option is currently required.

  • --mode-transparent — Expose the dataset's .git directory in the mount

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datalad-fuse-0.4.0.tar.gz (52.5 kB view details)

Uploaded Source

Built Distribution

datalad_fuse-0.4.0-py3-none-any.whl (25.5 kB view details)

Uploaded Python 3

File details

Details for the file datalad-fuse-0.4.0.tar.gz.

File metadata

  • Download URL: datalad-fuse-0.4.0.tar.gz
  • Upload date:
  • Size: 52.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.0

File hashes

Hashes for datalad-fuse-0.4.0.tar.gz
Algorithm Hash digest
SHA256 2fde26084bc1eeb87044c5315f46a928951b4d9f65397861fbf2fd57f4330a91
MD5 9428f6bb4593c2d9cfd35aa070d2fba0
BLAKE2b-256 daf786e204bc3a529ae2a92e3faa014bc97b263cbb9aeb4ba5b28c3b38dac34f

See more details on using hashes here.

File details

Details for the file datalad_fuse-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: datalad_fuse-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 25.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.0

File hashes

Hashes for datalad_fuse-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cc898ebdb58131174d090e5908aeb96c6bd8e5efdaf981202dcce81f009bd8ed
MD5 b5be5819bf577de7df80128fec69f109
BLAKE2b-256 75f7ff29f70a86f87d5bd0f37eb3150601b96198d111c3edd75aa51f87bf0b8f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page