Skip to main content

Dataset fetcher for neuroscience research (OpenNeuro, BIDS, etc.)

Project description

SciTeX Dataset

Unified access to neuroscience datasets for AI-powered research

PyPI version Tests License: AGPL-3.0

SciTeX Dataset provides a unified interface to discover and fetch metadata from major neuroscience data repositories.

Part of SciTeX.

Data Sources

Repository Description Data Types
OpenNeuro Open platform for sharing neuroimaging data MRI, EEG, MEG, iEEG, PET
DANDI BRAIN Initiative data archive Electrophysiology, Ophys
PhysioNet Physiological signal databases ECG, EEG, clinical data

Quick Start

pip install scitex-dataset

Python API

from scitex_dataset import fetch_all_datasets, format_dataset

# Fetch datasets from OpenNeuro
datasets = fetch_all_datasets(max_datasets=10)

# Format for analysis
for ds in datasets:
    formatted = format_dataset(ds)
    print(f"{formatted['id']}: {formatted['name']} ({formatted['n_subjects']} subjects)")

CLI

# Fetch OpenNeuro datasets
scitex-dataset openneuro -n 100 -o datasets.json -v

# Search across repositories
scitex-dataset search "epilepsy EEG" --source openneuro

# Database operations
scitex-dataset db init
scitex-dataset db sync openneuro
scitex-dataset db query "modality:eeg"

MCP Server

SciTeX Dataset includes an MCP (Model Context Protocol) server, enabling AI agents like Claude to discover and query neuroscience datasets.

# Add to Claude Code MCP config
scitex-dataset mcp install

# Or run directly
scitex-dataset mcp start

Available MCP Tools:

Tool Description
dataset_openneuro_fetch Fetch datasets from OpenNeuro
dataset_openneuro_search Search OpenNeuro by query
dataset_dandi_fetch Fetch datasets from DANDI Archive
dataset_dandi_search Search DANDI by query
dataset_physionet_fetch Fetch datasets from PhysioNet
dataset_physionet_search Search PhysioNet by query
dataset_search Unified search across all repositories
dataset_stats Get repository statistics

With SciTeX Session

import scitex as stx
from scitex_dataset import fetch_all_datasets, format_dataset

@stx.session
def main(logger=stx.INJECTED):
    datasets = fetch_all_datasets(max_datasets=100, logger=logger)
    formatted = [format_dataset(ds) for ds in datasets]
    stx.io.save(formatted, "openneuro_datasets.json")
    return 0

if __name__ == "__main__":
    main()

Why SciTeX Dataset?

  • Unified Interface: One API for OpenNeuro, DANDI, PhysioNet, and more
  • AI-Ready: MCP server enables LLMs to discover relevant datasets
  • Metadata Focus: Fast metadata queries without downloading full datasets
  • SciTeX Integration: Works seamlessly with @stx.session for reproducible research

SciTeX
AGPL-3.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scitex_dataset-0.1.3.tar.gz (53.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scitex_dataset-0.1.3-py3-none-any.whl (43.9 kB view details)

Uploaded Python 3

File details

Details for the file scitex_dataset-0.1.3.tar.gz.

File metadata

  • Download URL: scitex_dataset-0.1.3.tar.gz
  • Upload date:
  • Size: 53.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scitex_dataset-0.1.3.tar.gz
Algorithm Hash digest
SHA256 bb8fce238ba36e0b090cf2c6c1c72fef886c52d777277f431daae6c33080e490
MD5 a7f108f0b78ea33a64b85d23cd1ad519
BLAKE2b-256 8377085baec0761bfd73068543352a02ca8313263313bbe36b659ceab2e3014f

See more details on using hashes here.

Provenance

The following attestation bundles were made for scitex_dataset-0.1.3.tar.gz:

Publisher: publish-pypi.yml on ywatanabe1989/scitex-dataset

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file scitex_dataset-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: scitex_dataset-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 43.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scitex_dataset-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 3b74453d02a2785c65d594a7d21dedb3cf014fbdcee43c15c38da7768c6d672f
MD5 b1d05b37add20d6613074473dd8e412d
BLAKE2b-256 99cd34740e37181702e6dbc6f48ae11781ce14502d6cde5a278dbe4c45d58a93

See more details on using hashes here.

Provenance

The following attestation bundles were made for scitex_dataset-0.1.3-py3-none-any.whl:

Publisher: publish-pypi.yml on ywatanabe1989/scitex-dataset

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page