Skip to main content

Dataset fetcher for neuroscience research (OpenNeuro, BIDS, etc.)

Project description

SciTeX Dataset

Unified access to neuroscience datasets for AI-powered research

PyPI version Tests License: AGPL-3.0

SciTeX Dataset provides a unified interface to discover and fetch metadata from major neuroscience data repositories.

Part of SciTeX.

Data Sources

Repository Description Data Types
OpenNeuro Open platform for sharing neuroimaging data MRI, EEG, MEG, iEEG, PET
DANDI BRAIN Initiative data archive Electrophysiology, Ophys
PhysioNet Physiological signal databases ECG, EEG, clinical data

Quick Start

pip install scitex-dataset

Python API

from scitex_dataset import fetch_all_datasets, format_dataset

# Fetch datasets from OpenNeuro
datasets = fetch_all_datasets(max_datasets=10)

# Format for analysis
for ds in datasets:
    formatted = format_dataset(ds)
    print(f"{formatted['id']}: {formatted['name']} ({formatted['n_subjects']} subjects)")

CLI

# Fetch OpenNeuro datasets
scitex-dataset openneuro -n 100 -o datasets.json -v

# Search across repositories
scitex-dataset search "epilepsy EEG" --source openneuro

# Database operations
scitex-dataset db init
scitex-dataset db sync openneuro
scitex-dataset db query "modality:eeg"

MCP Server

SciTeX Dataset includes an MCP (Model Context Protocol) server, enabling AI agents like Claude to discover and query neuroscience datasets.

# Add to Claude Code MCP config
scitex-dataset mcp install

# Or run directly
scitex-dataset mcp start

Available MCP Tools:

Tool Description
dataset_openneuro_fetch Fetch datasets from OpenNeuro
dataset_openneuro_search Search OpenNeuro by query
dataset_dandi_fetch Fetch datasets from DANDI Archive
dataset_dandi_search Search DANDI by query
dataset_physionet_fetch Fetch datasets from PhysioNet
dataset_physionet_search Search PhysioNet by query
dataset_search Unified search across all repositories
dataset_stats Get repository statistics

With SciTeX Session

import scitex as stx
from scitex_dataset import fetch_all_datasets, format_dataset

@stx.session
def main(logger=stx.INJECTED):
    datasets = fetch_all_datasets(max_datasets=100, logger=logger)
    formatted = [format_dataset(ds) for ds in datasets]
    stx.io.save(formatted, "openneuro_datasets.json")
    return 0

if __name__ == "__main__":
    main()

Why SciTeX Dataset?

  • Unified Interface: One API for OpenNeuro, DANDI, PhysioNet, and more
  • AI-Ready: MCP server enables LLMs to discover relevant datasets
  • Metadata Focus: Fast metadata queries without downloading full datasets
  • SciTeX Integration: Works seamlessly with @stx.session for reproducible research

SciTeX
AGPL-3.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scitex_dataset-0.1.1.tar.gz (50.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scitex_dataset-0.1.1-py3-none-any.whl (43.8 kB view details)

Uploaded Python 3

File details

Details for the file scitex_dataset-0.1.1.tar.gz.

File metadata

  • Download URL: scitex_dataset-0.1.1.tar.gz
  • Upload date:
  • Size: 50.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scitex_dataset-0.1.1.tar.gz
Algorithm Hash digest
SHA256 2e73190d7a15219374657f55395d9d322b71c2c7ae57658b7b934efdc90c9f14
MD5 4bfd9719e05b0361228a0922896df243
BLAKE2b-256 6b7f22b0a5b8fefbdda1f065ab302c8cf8b47712698f0dfa54b1efa00067da6b

See more details on using hashes here.

Provenance

The following attestation bundles were made for scitex_dataset-0.1.1.tar.gz:

Publisher: publish-pypi.yml on ywatanabe1989/scitex-dataset

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file scitex_dataset-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: scitex_dataset-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 43.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scitex_dataset-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 89bce15a9b7a19ff8aa7e44c4c3b43e19e2e11a77ab34c72f14d8e1064aba012
MD5 65a33ba387281e05e2b1adc7e57c552b
BLAKE2b-256 c2e869aa73febcdd4fb0880613073b8f158acfe126fbd2947f31d0b9dedca2aa

See more details on using hashes here.

Provenance

The following attestation bundles were made for scitex_dataset-0.1.1-py3-none-any.whl:

Publisher: publish-pypi.yml on ywatanabe1989/scitex-dataset

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page