Skip to main content

Grid File Access Library — Python rewrite of gfal2, based on fsspec (HTTP/HTTPS by default, optional XRootD client bindings)

Project description

gfal

CI Codecov PyPI PyPI Downloads Conda Python License All Contributors Lines of code Lines of tests Tests

Grid File Access Library — Python rewrite of gfal2

Documentation: lobis.github.io/gfal

A pip-installable Python-only rewrite of the gfal2-util CLI tools, built on fsspec — no C library required. Supports HTTP/HTTPS out of the box, with the lightweight XRootD fsspec adapter included by default and optional S3 and SSH/SFTP backends via their corresponding fsspec drivers.

gfal is both a Python library (sync + async) and a command-line tool. Use it to stat, list, copy, checksum, and manage files on local, HTTP/WebDAV, and XRootD storage from Python or the terminal.

Installation

pip install gfal

This installs the CLI and the Python library with local-file, HTTP/HTTPS, and the lightweight fsspec-xrootd adapter.

For a fully pip-managed XRootD (root://) client stack:

pip install "gfal[xrootd]"

This adds the PyPI xrootd bindings on top of the base install.

On grid systems where XRootD Python bindings are already provided and centrally managed, prefer the site package manager or conda for those bindings and keep gfal itself lean:

# Conda
conda install -c conda-forge xrootd

# Or install the full conda bundle from the lobis channel
conda install -c lobis -c conda-forge gfal

Optional protocol backends are installed separately:

# pip
pip install "gfal[s3]"
pip install "gfal[ssh]"

# conda
conda install -c conda-forge s3fs boto3
conda install -c conda-forge paramiko sshfs

S3 and SSH support are intentionally not included in the base conda package for every user.

Because gfal is built on fsspec, many other fsspec-supported protocols can also work with the same CLI and Python API once their backend library is installed. The project currently tests HTTP/HTTPS, XRootD, S3, and SSH/SFTP most explicitly, but adding support for other fsspec backends is generally straightforward when needed.

See the installation docs for RPM packages, native repositories, and CERN HTTPS options. On systems that do not trust CERN CAs by default, either install the CERN Root CA for verified https:// access, use matching root:// URLs where possible, or pass --no-verify for an insecure smoke test.

Python library

import gfal gives you an async-first client and a synchronous wrapper — same methods, same API:

Synchronous

import gfal

client = gfal.GfalClient()
phenix_file = (
    "https://eospublic.cern.ch/eos/opendata/phenix/"
    "emcal-finding-pi0s-and-photons/single_cluster_r5.C"
)
phenix_dir = (
    "https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/"
)
atlas_file = (
    "https://eospublic.cern.ch/eos/opendata/atlas/rucio/data16_13TeV/"
    "DAOD_PHYSLITE.37019892._000001.pool.root.1"
)

# Stat
info = client.stat(phenix_file)
print(f"size={info.size}, is_file={info.is_file()}")

# List a directory
for entry in client.ls(phenix_dir):
    print(f"{entry.size:>10}  {entry.info['name']}")

# Copy with checksum verification
client.copy(
    atlas_file,
    "file:///tmp/atlas-phy.root",
    options=gfal.CopyOptions(
        overwrite=True,
        checksum=gfal.ChecksumPolicy("ADLER32"),
    ),
)

# Checksum
print(client.checksum("file:///tmp/atlas-phy.root", "MD5"))

Asynchronous

import asyncio
import gfal


async def main():
    client = gfal.AsyncGfalClient()
    phenix_file = (
        "https://eospublic.cern.ch/eos/opendata/phenix/"
        "emcal-finding-pi0s-and-photons/single_cluster_r5.C"
    )
    phenix_dir = (
        "https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/"
    )
    atlas_file = (
        "https://eospublic.cern.ch/eos/opendata/atlas/rucio/data16_13TeV/"
        "DAOD_PHYSLITE.37019892._000001.pool.root.1"
    )

    info = await client.stat(phenix_file)
    print(f"size={info.size}")

    entries = await client.ls(phenix_dir)
    for entry in entries:
        print(entry.info["name"])

    await client.copy(
        atlas_file,
        "file:///tmp/atlas-phy.root",
    )


asyncio.run(main())

For a medium-sized public source family, the first 37 files matching DAOD_PHYSLITE.37019892.* in https://eospublic.cern.ch/eos/opendata/atlas/rucio/data16_13TeV/ add up to about 5.0 GiB as measured on April 22, 2026.

Key features

Method Description
stat(url) POSIX-style metadata (StatResult)
exists(url) Check existence
ls(url) List directory (returns StatResult list or names)
copy(src, dst) Copy files/directories with optional checksum, TPC, dry-run
start_copy(src, dst) Background copy returning a TransferHandle
checksum(url, algo) Compute ADLER32, MD5, SHA256, etc.
open(url, mode) Open remote file for reading/writing
mkdir(url) Create directories (with parents=True for -p)
rm(url) Remove files/directories
rename(src, dst) Rename within same filesystem
chmod(url, mode) Change permissions
getxattr / setxattr / listxattr / xattrs Extended attributes

Error handling

All operations raise typed exceptions inheriting from GfalError (OSError):

try:
    client.stat("file:///nonexistent")
except gfal.GfalFileNotFoundError:
    print("File not found")
except gfal.GfalPermissionError:
    print("Permission denied")
except gfal.GfalError as e:
    print(f"Error (errno={e.errno}): {e}")

Available exceptions: GfalFileNotFoundError, GfalPermissionError, GfalFileExistsError, GfalNotADirectoryError, GfalIsADirectoryError, GfalTimeoutError.

For the full Python API reference, see the Python API documentation.

CLI quick start

After installation, the gfal command is available on your PATH:

# Stat a remote file (HTTPS)
gfal stat https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C

# List a directory (XRootD)
gfal ls -l root://eospublic.cern.ch//eos/opendata/phenix/emcal-finding-pi0s-and-photons/

# Compute a checksum
gfal sum https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C MD5

# Download a file
gfal cp https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C file:///tmp/single_cluster_r5.C

# Peek at the contents
gfal cat https://eospublic.cern.ch/eos/opendata/phenix/emcal-finding-pi0s-and-photons/single_cluster_r5.C | head -n 5

Local paths work as bare paths or file:// URIs. See EOS public examples for more.

S3 / S3-compatible storage

S3 support is a first-class optional backend, useful for AWS S3, MinIO, Ceph RGW, and similar deployments.

Install support with:

# pip
pip install "gfal[s3]"

# conda
conda install -c conda-forge s3fs boto3

Authentication uses the standard AWS environment variables or ~/.aws/credentials:

export AWS_ACCESS_KEY_ID=...
export AWS_SECRET_ACCESS_KEY=...
export AWS_DEFAULT_REGION=us-east-1

# For MinIO or other S3-compatible endpoints
export AWS_ENDPOINT_URL=https://minio.example.com

Examples:

# List a bucket
gfal ls s3://my-bucket/

# List a single object
gfal ls s3://my-bucket/data/file.root

# Upload
gfal cp /tmp/data.root s3://my-bucket/data/data.root

# Download
gfal cp s3://my-bucket/data/data.root /tmp/data.root

# Checksum
gfal sum s3://my-bucket/data/data.root MD5

SSH / SFTP

Install support with:

# pip
pip install "gfal[ssh]"

# conda
conda install -c conda-forge paramiko sshfs

Examples:

# Stat a remote file
gfal stat sftp://user@host/path/to/file.txt

# Copy local file to remote
gfal cp /tmp/data.root sftp://user@host/data/data.root

# Download from SFTP
gfal cp sftp://user@host/data/data.root /tmp/data.root

# List a remote directory
gfal ls sftp://user@host/data/

Other fsspec protocols such as gs://, abfs://, or similar backends may also work once their driver is installed, even if they are not currently covered as heavily as HTTP/XRootD/S3/SSH in this repository.

CLI reference

Commands

Command Description
gfal ls List directory contents
gfal cp Copy files or directories
gfal rm Remove files or directories
gfal stat Display file status
gfal mkdir Create directories
gfal cat Print file contents to stdout
gfal save Write stdin to a remote file
gfal rename Rename / move a file
gfal chmod Change file permissions
gfal sum Compute file checksums
gfal xattr Get or set extended attributes
gfal completion Generate shell completions (bash, zsh, fish)

Common options (all commands)

Option Description
-v / --verbose Verbose output (stackable: -vv, -vvv)
-t N / --timeout N Global timeout in seconds (default: 1800)
-E CERT / --cert CERT Path to client certificate (PEM)
--key KEY Path to client key (PEM)
--authz-token TOKEN Append an EOS authz token to EOS URLs
--no-verify Disable TLS certificate verification
--log-file FILE Write log output to a file

For CERN HTTPS endpoints that fail certificate verification, install the CERN Root CA for normal verified use, use root:// where possible to avoid HTTPS, or pass --no-verify only for an explicit insecure test.

For scoped EOS access (any EOS instance — eospublic, eospilot, eosatlas, …), generate a zteos64: token on the EOS side with eos token and pass it to gfal with --authz-token or export it as EOSAUTHZ. See EOS authz token workflow for the full recipe, including how to mint the token and which endpoint to use.

gfal ls

gfal ls [OPTIONS] URI [URI ...]
Option Description
-l Long listing (permissions, size, date)
-a / --all Show hidden files
-d / --directory List the entry itself, not its contents
-H / --human-readable Human-readable sizes (e.g. 1.2M)
-r / --reverse Reverse sort order
--sort Sort by: name (default), size, time, extension, version, none
-S Sort by size (shorthand for --sort=size)
--time-style Timestamp format: locale, iso, long-iso, full-iso
--full-time Equivalent to --time-style=full-iso
--color Colorise output: auto (default), always, never
--xattr ATTR Show an extended attribute column
gfal ls -lH root://server//eos/data/
gfal ls -la /tmp/mydir/
gfal ls -l --sort=size --reverse root://server//eos/data/

gfal cp

gfal cp [OPTIONS] SRC [SRC ...] DST
Option Description
-f / --force Overwrite destination if it exists
-r / -R / --recursive Copy directories recursively
-p / --parent Create parent directories at destination
-K ALG / --checksum ALG Verify checksum (ADLER32, MD5, SHA256, …); use ALG:value to supply expected hash
--checksum-mode both (default), source, target
--compare Skip if destination matches: size, size_mtime, checksum, none
--parallel N Concurrent transfers during recursive copy
--preserve-times Preserve modification timestamps (default)
--no-preserve-times Don't preserve timestamps
--dry-run Show what would be copied without copying
--from-file FILE Read source URIs from a file (one per line)
--abort-on-failure Stop after the first failed transfer
-T N / --transfer-timeout N Per-file timeout in seconds
--tpc Attempt third-party copy, fall back to streaming
--tpc-only Require third-party copy (fail if unsupported)
--tpc-mode TPC direction: pull (default) or push
--scitag N WLCG SciTag flow identifier (65–65535)
# Simple copy
gfal cp file:///tmp/src.txt https://server/dst.txt

# Force overwrite with ADLER32 verification
gfal cp -f -K ADLER32 root://server//path/file.root file:///tmp/file.root

# Recursive copy, create parents, with parallel transfers
gfal cp -r -p --parallel 4 root://server//eos/srcdir/ file:///tmp/dstdir/

# Skip if destination already has same size
gfal cp --compare size root://server//eos/data/ file:///tmp/data/

# Third-party copy between two servers
gfal cp --tpc root://src-server//path/file root://dst-server//path/file

# Dry-run preview
gfal cp -r --dry-run root://server//eos/data/ file:///tmp/backup/

gfal rm

gfal rm [OPTIONS] URI [URI ...]
Option Description
-r / -R / --recursive Remove directories and their contents
--dry-run Show what would be deleted
--from-file FILE Read URIs to delete from a file
--just-delete Skip the stat check and delete directly
gfal rm file:///tmp/old.txt
gfal rm -r root://server//eos/old_dir/
gfal rm --dry-run root://server//eos/dir/

gfal stat

gfal stat URI [URI ...]

Prints POSIX-style stat information (size, permissions, timestamps):

  File: 'root://server//eos/data/file.root'
  Size: 1048576        regular file
Access: (0644/-rw-r--r--)      Uid: 1000   Gid: 1000
Access: 2025-06-01 12:34:56.000000
Modify: 2025-06-01 12:34:56.000000
Change: 2025-06-01 12:34:56.000000

gfal mkdir

gfal mkdir [OPTIONS] URI [URI ...]
Option Description
-p / --parents Create intermediate directories; no error if exists
-m MODE Permissions in octal (default: 755)
gfal mkdir root://server//eos/user/j/jdoe/newdir
gfal mkdir -p root://server//eos/user/j/jdoe/a/b/c

gfal sum

gfal sum URI ALGORITHM

Supported: ADLER32, CRC32, CRC32C, MD5, SHA1, SHA256, SHA512.

gfal sum file:///tmp/file.root ADLER32
# file:///tmp/file.root 0a1b2c3d

gfal cat

gfal cat URI [URI ...]

Prints file contents to stdout. Multiple files are concatenated.

gfal save

echo "hello" | gfal save root://server//eos/user/j/jdoe/hello.txt

Reads from stdin and writes to the given URI.

gfal rename

gfal rename SOURCE DESTINATION

gfal chmod

gfal chmod MODE URI [URI ...]

MODE is an octal permission string, e.g. 0644 or 755.

gfal xattr

# Get an attribute
gfal xattr root://server//eos/data/file.root xroot.checksum

# Set an attribute
gfal xattr root://server//eos/data/file.root user.tag=important

Shell completion

Generate shell completion scripts:

# Bash
gfal completion bash >> ~/.bashrc

# Zsh
gfal completion zsh >> ~/.zshrc

# Fish
gfal completion fish > ~/.config/fish/completions/gfal.fish

Authentication

X.509 proxy (XRootD / HTTPS): If X509_USER_PROXY is set or a proxy exists at /tmp/x509up_u<uid>, it is used automatically. Override with -E/--key.

HTTPS client certificates: Pass --cert and --key for mutual TLS authentication.

Supported protocols

Scheme Description Requirements
file:// or bare path Local filesystem Built-in
http:// / https:// HTTP/WebDAV Built-in
dav:// / davs:// WebDAV (converted to HTTP) Built-in
root:// XRootD Built-in adapter; xrootd client bindings required

Development

git clone https://github.com/lobis/gfal.git
cd gfal

python -m venv .venv
source .venv/bin/activate

pip install -e ".[dev]"
pytest tests/

Contributors ✨

Thanks goes to these wonderful people (emoji key):

Luis Antonio Obis Aparicio
Luis Antonio Obis Aparicio

💻 📖 🚧 🚇 🤔

This project follows the all-contributors specification. Contributions of any kind welcome!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gfal-0.1.61.tar.gz (318.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gfal-0.1.61-py3-none-any.whl (93.4 kB view details)

Uploaded Python 3

File details

Details for the file gfal-0.1.61.tar.gz.

File metadata

  • Download URL: gfal-0.1.61.tar.gz
  • Upload date:
  • Size: 318.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gfal-0.1.61.tar.gz
Algorithm Hash digest
SHA256 5e62b21b142b5b01ef1928d2f0209664c3dae1431b9cf56d1ece94a169891e7f
MD5 ea029c7c89700809e2a11d8ac2b4724f
BLAKE2b-256 7bdc3646a1e1370d114f4a53293f5d2a7bdf3777a7acfbaee4ca6e66b79f0420

See more details on using hashes here.

Provenance

The following attestation bundles were made for gfal-0.1.61.tar.gz:

Publisher: ci.yml on lobis/gfal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file gfal-0.1.61-py3-none-any.whl.

File metadata

  • Download URL: gfal-0.1.61-py3-none-any.whl
  • Upload date:
  • Size: 93.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gfal-0.1.61-py3-none-any.whl
Algorithm Hash digest
SHA256 d00d7398c2e095b65962205c10e075ce232737eaf64c86910eac018e47951df5
MD5 48ab84552b1f211fb8221ed2b6eaa8ca
BLAKE2b-256 ff3055ae6a09347e658e3966795dab922819f43a444dfe49172a2b909669c45e

See more details on using hashes here.

Provenance

The following attestation bundles were made for gfal-0.1.61-py3-none-any.whl:

Publisher: ci.yml on lobis/gfal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page