Skip to main content

HED validation, summary, and analysis tools for annotating events and experimental metadata.

Project description

PyPI - Status Python3 Maintainability Code Coverage DOI Docs

HEDTools - Python

Python tools for validation, analysis, and transformation of HED (Hierarchical Event Descriptors) tagged datasets.

Overview

HED (Hierarchical Event Descriptors) is a framework for systematically describing both laboratory and real-world events as well as other experimental metadata. HED tags are comma-separated path strings that provide a standardized vocabulary for annotating events and experimental conditions.

Key Features:

  • Validate HED annotations against schema specifications
  • Analyze and summarize HED-tagged datasets
  • Full HED support in BIDS (Brain Imaging Data Structure)
  • HED support in NWB (Neurodata Without Borders) when used the ndx-hed extension.
  • Platform-independent and data-neutral
  • Command-line tools and Python API

Note: Table remodeling tools have been moved to a separate package. See table-remodeler on PyPI or visit https://www.hedtags.org/table-remodeler for more information.

Quick start

Online tools (no installation required)

For simple validation or transformation tasks, use the online tools at https://hedtools.org/hed - no installation needed!

Browser-based validation (no data upload) is available at https://www.hedtags.org/hed-javascript

A development version of the online tools is available at: https://hedtools.org/hed_dev

Python installation

Requirements: Python 3.10 or higher

Install from PyPI:

pip install hedtools

Or install from GitHub (latest):

pip install git+https://github.com/hed-standard/hed-python/@main

Development installation

For development work or to access optional features, install from the cloned repository:

# Clone the repository
git clone https://github.com/hed-standard/hed-python.git
cd hed-python

# Install in editable mode with base dependencies
pip install -e .

# Install with optional dependency groups
pip install -e ".[dev]"       # Development tools (ruff, typos)
pip install -e ".[docs]"      # Documentation tools (sphinx, furo)
pip install -e ".[test]"      # Testing tools (coverage)
pip install -e ".[examples]"  # Jupyter notebook support

# Install all optional dependencies
pip install -e ".[dev,docs,test,examples]"

Optional dependency groups:

  • dev - Code quality tools: ruff (linter + formatter), typos, mdformat
  • docs - Documentation generation: sphinx, furo theme, myst-parser
  • test - Code coverage reporting: coverage
  • examples - Jupyter notebook support: jupyter, notebook, ipykernel

Basic usage

from hed import HedString, get_printable_issue_string, load_schema_version


# Load the latest HED schema
schema = load_schema_version("8.4.0")

# Create and validate a HED string
hed_string = HedString("Sensory-event, Visual-presentation, (Onset, (Red, Square))", schema)
issues = hed_string.validate()

if issues:
    print(get_printable_issue_string(issues, title="Validation issues found"))
else:
    print("HED string is valid!")

Command-line tools

HEDTools provides a unified command-line interface with git-like subcommands:

# Main command (new unified interface)
hedpy --help

# Validate a BIDS dataset
hedpy validate bids-dataset /path/to/bids/dataset

# Validate a HED string
hedpy validate string "Sensory-event, (Red, Square)" -sv 8.4.0

# Extract sidecar template from a BIDS dataset
hedpy extract bids-sidecar /path/to/dataset

# Validate HED schemas
hedpy schema validate /path/to/schema.xml

# Convert schema between formats (XML, MEDIAWIKI, TSV, JSON)
hedpy schema convert /path/to/schema.xml

Legacy commands (deprecated, use hedpy instead):

validate_bids /path/to/dataset
hed_validate_schemas /path/to/schema.xml

Note: The run_remodel command has been removed. Table remodeling functionality is now available in the separate table-remodeler package.

Note: The visualization tools such as the word cloud visualization have been moved to a separate hed-vis project.

For more examples, see the user guide.

Jupyter notebook examples

Note: Example notebooks are available in the GitHub repository only, not in the PyPI package.

The examples/ directory contains Jupyter notebooks demonstrating common HED workflows with BIDS datasets:

  • validate_bids_dataset.ipynb - Validate HED annotations in a BIDS dataset
  • summarize_events.ipynb - Summarize event file contents and value counts
  • sidecar_to_spreadsheet.ipynb - Convert JSON sidecars to spreadsheet format
  • merge_spreadsheet_into_sidecar.ipynb - Merge spreadsheet annotations into JSON sidecars
  • extract_json_template.ipynb - Generate JSON sidecar templates from event files
  • find_event_combinations.ipynb - Find unique combinations of event values
  • validate_bids_dataset_with_libraries.ipynb - Validate with HED library schemas

To use these notebooks:

# Clone the repository to get the examples
git clone https://github.com/hed-standard/hed-python.git
cd hed-python

# Install HEDTools with Jupyter support
pip install -e .[examples]

# Launch Jupyter
jupyter notebook examples/

See examples/README.md for more details.

Experimental features

String-based search (hed.models.string_search and hed.models.schema_lookup) is a middle-ground search facility that operates on raw HED strings without requiring pre-parsed HedString objects or a loaded schema. It supports the full QueryHandler query syntax (logical operators, grouping, wildcards) and optionally accepts a pre-generated schema lookup dictionary to enable ancestor-aware matching on short-form strings.

This facility is still experimental. Its API — including class names, function signatures, and module structure — may change in future releases without notice. These modules are not exported from the top-level hed package; import them directly:

from hed.models.string_search import StringQueryHandler, string_search
from hed.models.schema_lookup import generate_schema_lookup

See the search details documentation for a full comparison of all three search implementations and performance benchmarks.

Documentation

📖 Full Documentation: https://www.hedtags.org/hed-python

Building docs locally

# Install documentation dependencies
pip install -e .[docs]

# Build the documentation
cd docs
sphinx-build -b html . _build/html

To view the built documentation open docs/_build/html/index.html in your browser

Code Formatting

This project uses ruff format for consistent code formatting.

# Check if code is properly formatted (without making changes)
ruff format --check .

# Check and show what would change
ruff format --check --diff .

# Format all Python code in the repository
ruff format .

# Format specific files or directories
ruff format hed/
ruff format tests/

Configuration: Formatter settings are in pyproject.toml under [tool.ruff.format] with a line length of 120 characters.

Exclusions: Ruff automatically excludes .venv/, __pycache__/, auto-generated files (hed/_version.py), and external repos (spec_tests/hed-examples/, spec_tests/hed-schemas/).

CI integration: All code is automatically checked for formatting in GitHub Actions. Run ruff format . before committing to ensure your code passes CI checks.

Related repositories

The HED ecosystem consists of several interconnected repositories:

Repository Description
hed-python Python validation and analysis tools (this repo)
hed-web Web interface and deployable Docker services
hed-resources Tutorials and other HED resources
hed-specification Official HED specification documents
hed-schemas Official HED schema repository
table-remodeler Table transformation and remodeling tools
hed-vis HED visualization tools (word clouds, etc.)
ndx-hed HED support for NWB
hed-javascript JavaScript HED validation tools

Contributing

We welcome contributions! Here's how you can help:

  1. Report issues: Use GitHub Issues for bug reports and feature requests
  2. Submit pull requests (PRs): PRs should be from a non-main fork and target the main branch
  3. Improve documentation: Help us make HED easier to use
  4. Share examples: Contribute example code and use cases

Development setup:

# Clone the repository
git clone https://github.com/hed-standard/hed-python.git
cd hed-python

# Install in development mode with all dependencies
pip install -e .[dev,test,docs,examples]

# Or just core + test dependencies
pip install -e .[test]

# Run tests
python -m unittest discover tests

# Run specific test file
python -m unittest tests/path/to/test_file.py

# Test notebooks (requires examples dependencies)
python -m unittest tests.test_notebooks

For detailed contribution guidelines, please see CONTRIBUTING.md.

Configuration

Schema caching

By default, HED schemas are cached in ~/.hedtools/ (location varies by OS).

# Change the cache directory
import hed
hed.schema.set_cache_directory('/custom/path/to/cache')

Starting with hedtools 0.2.0, local copies of recent schema versions are bundled within the package for offline access.

Citation

If you use HEDTools in your research, please cite:

@software{hedtools,
  author = {Ian Callanan, Robbins, Kay and others},
  title = {HEDTools: Python tools for HED},
  year = {2026},
  publisher = {GitHub},
  url = {https://github.com/hed-standard/hed-python},
  doi = {10.5281/zenodo.8056010}
}

License

HEDTools is licensed under the MIT License. See LICENSE for details.

Support

Funding

Partial support for this project was provided by NIH 1R01MH126700-01A1.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hedtools-1.1.0.tar.gz (4.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hedtools-1.1.0-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file hedtools-1.1.0.tar.gz.

File metadata

  • Download URL: hedtools-1.1.0.tar.gz
  • Upload date:
  • Size: 4.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for hedtools-1.1.0.tar.gz
Algorithm Hash digest
SHA256 062f21beecff4b465918c8ff2ea4af6c9b51b5edb63e70f890eaab5d8dca935f
MD5 6bf01bc67c928a1741abaf4d1cf8183c
BLAKE2b-256 fd94db8655a22bfcfdd70c0c6dca4c0c20d504f8247b45d2542dc1de2506f44b

See more details on using hashes here.

File details

Details for the file hedtools-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: hedtools-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for hedtools-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dff0562756523350350277a656f6a4399f9ca57ede7611feb1441e0fa6b788bc
MD5 26054f69c793df18d5a14580c6b2fe47
BLAKE2b-256 d3ca815db5a5076325527ffe3a350b2c0af334b06697499ce0fe595fba2f12a5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page