Skip to main content

Grid File Access Library — Python rewrite of gfal2, based on fsspec (HTTP/HTTPS by default, optional XRootD)

Project description

gfal

CI Codecov PyPI PyPI Downloads Conda Python License All Contributors Lines of code Lines of tests Tests

Grid File Access Library — Python rewrite of gfal2

Documentation: lobis.github.io/gfal

A pip-installable Python-only rewrite of the gfal2-util CLI tools, built on fsspec — no C library required. Supports HTTP/HTTPS out of the box, with XRootD support via fsspec-xrootd when XRootD bindings are available.

gfal is both a Python library (sync + async) and a command-line tool. Use it to stat, list, copy, checksum, and manage files on local, HTTP/WebDAV, and XRootD storage from Python or the terminal.

Installation

pip install gfal

This installs the CLI and the Python library with local-file and HTTP/HTTPS support. For XRootD (root://) support, you additionally need XRootD bindings:

# Conda (recommended)
conda install -c conda-forge xrootd

# Or install the full bundle from the lobis channel
conda install -c lobis -c conda-forge gfal

See the installation docs for RPM packages, native repositories, and CERN CA certificate setup.

Python library

import gfal gives you an async-first client and a synchronous wrapper — same methods, same API:

Synchronous

import gfal

client = gfal.GfalClient()

# Stat
info = client.stat("/tmp/data.txt")
print(f"size={info.size}, is_file={info.is_file()}")

# List a directory
for entry in client.ls("/tmp/mydir"):
    print(f"{entry.size:>10}  {entry.info['name']}")

# Copy with checksum verification
client.copy(
    "https://example.com/data.root",
    "file:///tmp/data.root",
    options=gfal.CopyOptions(
        overwrite=True,
        checksum=gfal.ChecksumPolicy("ADLER32"),
    ),
)

# Checksum
print(client.checksum("/tmp/data.root", "MD5"))

Asynchronous

import asyncio
import gfal


async def main():
    client = gfal.AsyncGfalClient()

    info = await client.stat("/tmp/data.txt")
    print(f"size={info.size}")

    entries = await client.ls("/tmp/mydir")
    for entry in entries:
        print(entry.info["name"])

    await client.copy(
        "root://server//eos/data/file.root",
        "file:///tmp/file.root",
    )


asyncio.run(main())

Key features

Method Description
stat(url) POSIX-style metadata (StatResult)
exists(url) Check existence
ls(url) List directory (returns StatResult list or names)
copy(src, dst) Copy files/directories with optional checksum, TPC, dry-run
start_copy(src, dst) Background copy returning a TransferHandle
checksum(url, algo) Compute ADLER32, MD5, SHA256, etc.
open(url, mode) Open remote file for reading/writing
mkdir(url) Create directories (with parents=True for -p)
rm(url) Remove files/directories
rename(src, dst) Rename within same filesystem
chmod(url, mode) Change permissions
getxattr / setxattr / listxattr / xattrs Extended attributes

Error handling

All operations raise typed exceptions inheriting from GfalError (OSError):

try:
    client.stat("file:///nonexistent")
except gfal.GfalFileNotFoundError:
    print("File not found")
except gfal.GfalPermissionError:
    print("Permission denied")
except gfal.GfalError as e:
    print(f"Error (errno={e.errno}): {e}")

Available exceptions: GfalFileNotFoundError, GfalPermissionError, GfalFileExistsError, GfalNotADirectoryError, GfalIsADirectoryError, GfalTimeoutError.

For the full Python API reference, see the Python API documentation.

CLI quick start

After installation, the gfal command is available on your PATH:

# Stat a remote file (HTTPS)
gfal stat https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C

# List a directory (XRootD)
gfal ls -l root://eospublic.cern.ch//eos/opendata/phenix/emcal-finding-pi0s-and-photons/

# Compute a checksum
gfal sum https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C MD5

# Download a file
gfal cp https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C file:///tmp/single_cluster_r5.C

# Peek at the contents
gfal cat https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C | head -n 5

Local paths work as bare paths or file:// URIs. See EOS public examples for more.

CLI reference

Commands

Command Description
gfal ls List directory contents
gfal cp Copy files or directories
gfal rm Remove files or directories
gfal stat Display file status
gfal mkdir Create directories
gfal cat Print file contents to stdout
gfal save Write stdin to a remote file
gfal rename Rename / move a file
gfal chmod Change file permissions
gfal sum Compute file checksums
gfal xattr Get or set extended attributes
gfal completion Generate shell completions (bash, zsh, fish)

Common options (all commands)

Option Description
-v / --verbose Verbose output (stackable: -vv, -vvv)
-t N / --timeout N Global timeout in seconds (default: 1800)
-E CERT / --cert CERT Path to client certificate (PEM)
--key KEY Path to client key (PEM)
--no-verify Disable TLS certificate verification
--log-file FILE Write log output to a file

gfal ls

gfal ls [OPTIONS] URI [URI ...]
Option Description
-l Long listing (permissions, size, date)
-a / --all Show hidden files
-d / --directory List the entry itself, not its contents
-H / --human-readable Human-readable sizes (e.g. 1.2M)
-r / --reverse Reverse sort order
--sort Sort by: name (default), size, time, extension, version, none
-S Sort by size (shorthand for --sort=size)
--time-style Timestamp format: locale, iso, long-iso, full-iso
--full-time Equivalent to --time-style=full-iso
--color Colorise output: auto (default), always, never
--xattr ATTR Show an extended attribute column
gfal ls -lH root://server//eos/data/
gfal ls -la /tmp/mydir/
gfal ls -l --sort=size --reverse root://server//eos/data/

gfal cp

gfal cp [OPTIONS] SRC [SRC ...] DST
Option Description
-f / --force Overwrite destination if it exists
-r / -R / --recursive Copy directories recursively
-p / --parent Create parent directories at destination
-K ALG / --checksum ALG Verify checksum (ADLER32, MD5, SHA256, …); use ALG:value to supply expected hash
--checksum-mode both (default), source, target
--compare Skip if destination matches: size, size_mtime, checksum, none
--parallel N Concurrent transfers during recursive copy
--preserve-times Preserve modification timestamps (default)
--no-preserve-times Don't preserve timestamps
--dry-run Show what would be copied without copying
--from-file FILE Read source URIs from a file (one per line)
--abort-on-failure Stop after the first failed transfer
-T N / --transfer-timeout N Per-file timeout in seconds
--tpc Attempt third-party copy, fall back to streaming
--tpc-only Require third-party copy (fail if unsupported)
--tpc-mode TPC direction: pull (default) or push
--scitag N WLCG SciTag flow identifier (65–65535)
# Simple copy
gfal cp file:///tmp/src.txt https://server/dst.txt

# Force overwrite with ADLER32 verification
gfal cp -f -K ADLER32 root://server//path/file.root file:///tmp/file.root

# Recursive copy, create parents, with parallel transfers
gfal cp -r -p --parallel 4 root://server//eos/srcdir/ file:///tmp/dstdir/

# Skip if destination already has same size
gfal cp --compare size root://server//eos/data/ file:///tmp/data/

# Third-party copy between two servers
gfal cp --tpc root://src-server//path/file root://dst-server//path/file

# Dry-run preview
gfal cp -r --dry-run root://server//eos/data/ file:///tmp/backup/

gfal rm

gfal rm [OPTIONS] URI [URI ...]
Option Description
-r / -R / --recursive Remove directories and their contents
--dry-run Show what would be deleted
--from-file FILE Read URIs to delete from a file
--just-delete Skip the stat check and delete directly
gfal rm file:///tmp/old.txt
gfal rm -r root://server//eos/old_dir/
gfal rm --dry-run root://server//eos/dir/

gfal stat

gfal stat URI [URI ...]

Prints POSIX-style stat information (size, permissions, timestamps):

  File: 'root://server//eos/data/file.root'
  Size: 1048576        regular file
Access: (0644/-rw-r--r--)      Uid: 1000   Gid: 1000
Access: 2025-06-01 12:34:56.000000
Modify: 2025-06-01 12:34:56.000000
Change: 2025-06-01 12:34:56.000000

gfal mkdir

gfal mkdir [OPTIONS] URI [URI ...]
Option Description
-p / --parents Create intermediate directories; no error if exists
-m MODE Permissions in octal (default: 755)
gfal mkdir root://server//eos/user/j/jdoe/newdir
gfal mkdir -p root://server//eos/user/j/jdoe/a/b/c

gfal sum

gfal sum URI ALGORITHM

Supported: ADLER32, CRC32, CRC32C, MD5, SHA1, SHA256, SHA512.

gfal sum file:///tmp/file.root ADLER32
# file:///tmp/file.root 0a1b2c3d

gfal cat

gfal cat URI [URI ...]

Prints file contents to stdout. Multiple files are concatenated.

gfal save

echo "hello" | gfal save root://server//eos/user/j/jdoe/hello.txt

Reads from stdin and writes to the given URI.

gfal rename

gfal rename SOURCE DESTINATION

gfal chmod

gfal chmod MODE URI [URI ...]

MODE is an octal permission string, e.g. 0644 or 755.

gfal xattr

# Get an attribute
gfal xattr root://server//eos/data/file.root xroot.checksum

# Set an attribute
gfal xattr root://server//eos/data/file.root user.tag=important

Shell completion

Generate shell completion scripts:

# Bash
gfal completion bash >> ~/.bashrc

# Zsh
gfal completion zsh >> ~/.zshrc

# Fish
gfal completion fish > ~/.config/fish/completions/gfal.fish

Authentication

X.509 proxy (XRootD / HTTPS): If X509_USER_PROXY is set or a proxy exists at /tmp/x509up_u<uid>, it is used automatically. Override with -E/--key.

HTTPS client certificates: Pass --cert and --key for mutual TLS authentication.

Supported protocols

Scheme Description Requirements
file:// or bare path Local filesystem Built-in
http:// / https:// HTTP/WebDAV Built-in
dav:// / davs:// WebDAV (converted to HTTP) Built-in
root:// XRootD xrootd + fsspec-xrootd

Development

git clone https://github.com/lobis/gfal.git
cd gfal

python -m venv .venv
source .venv/bin/activate

pip install -e ".[dev]"
pytest tests/

Contributors ✨

Thanks goes to these wonderful people (emoji key):

Luis Antonio Obis Aparicio
Luis Antonio Obis Aparicio

💻 📖 🚧 🚇 🤔

This project follows the all-contributors specification. Contributions of any kind welcome!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gfal-0.1.52.tar.gz (272.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gfal-0.1.52-py3-none-any.whl (76.0 kB view details)

Uploaded Python 3

File details

Details for the file gfal-0.1.52.tar.gz.

File metadata

  • Download URL: gfal-0.1.52.tar.gz
  • Upload date:
  • Size: 272.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gfal-0.1.52.tar.gz
Algorithm Hash digest
SHA256 50e585a78eeac16ac081a1ea171b12755371449561013b3bfff99df58bce30d4
MD5 7ee683c568229e3e75637e1a8943caa6
BLAKE2b-256 fe9892ae9db564f80d67253fb573836d7bb022789bf75a7e531b92c76df7572a

See more details on using hashes here.

Provenance

The following attestation bundles were made for gfal-0.1.52.tar.gz:

Publisher: ci.yml on lobis/gfal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file gfal-0.1.52-py3-none-any.whl.

File metadata

  • Download URL: gfal-0.1.52-py3-none-any.whl
  • Upload date:
  • Size: 76.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gfal-0.1.52-py3-none-any.whl
Algorithm Hash digest
SHA256 1f39ba9e036e3a49164246dccf45e81d4b6055fb6c3232e464a7a46bf03e2d2b
MD5 49618f1e79182448a70962824b7fbf30
BLAKE2b-256 42924204d94661178c07da930a2a98e90155516a823ed8d37f47fc809ff62265

See more details on using hashes here.

Provenance

The following attestation bundles were made for gfal-0.1.52-py3-none-any.whl:

Publisher: ci.yml on lobis/gfal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page