Skip to main content

Python Client for Crucible - National Archive for NSRC Observations

Project description

nano-crucible : National Archive for NSRC Observations

PyPI version PyPI downloads GitHub release Python Version License GitHub stars Discord Docs

A Python client library and CLI tool for Crucible - the Molecular Foundry's data lakehouse for scientific research. Crucible stores experimental and synthetic data from DOE Nanoscale Science Research Centers (NSRCs), along with comprehensive metadata about samples, projects, instruments, and users.

🔬 What is Crucible?

Crucible is the centralized data infrastructure for the Molecular Foundry and other DOE Nanoscale Science Research Centers, providing:

  • Unified data storage for experimental and synthetic data
  • Rich metadata capture to associate with datasets
  • Sample provenance tracking with parent-child relationships

✨ Features

Python API

  • Dataset Management: Create, query, update, and download datasets
  • Sample Tracking: Manage samples with hierarchical relationships and provenance
  • Metadata: Store and retrieve scientific metadata and experimental parameters
  • Linking: Connect datasets, samples, and create relationships programmatically

🖥️ Command-Line Interface

  • crucible config: One-time setup and configuration management
  • crucible dataset: Create, list, update, and manage datasets
  • crucible sample: Create, list, and manage samples with relationships
  • crucible file: List, inspect, and download files attached to datasets
  • crucible instrument: Create, list, and update instruments
  • crucible project: Manage projects and users (admin)
  • crucible open: Open resources in the Crucible Web Explorer with one command

🆕 What's New in 2.1.1

  • API v2 by default: CrucibleClient now targets https://crucible.lbl.gov/api/v2. v1 still works but emits a DeprecationWarning — run crucible config set api_url https://crucible.lbl.gov/api/v2 to upgrade.
  • crucible file subcommand: list, inspect, and download files attached to datasets, with lookup by dataset_mfid or SHA-256 hash.
  • Chunked GCS uploads: large files upload in 32 MiB chunks with per-chunk CRC32C and incremental SHA-256, verified at /complete.
  • Public samples: Sample.public is now a first-class field and crucible sample create --public is supported.
  • Email-based user lookup: crucible project add-user / remove-user accept emails directly — no client-side ORCID resolution needed.
  • Richer display: linked resources surfaced above metadata, alphabetized scientific metadata, MFID labels, and centered QR codes.

📦 Installation

From PyPI (Recommended)

pip install nano-crucible

From GitHub (Latest Development)

pip install git+https://github.com/MolecularFoundryCrucible/nano-crucible

With Optional Dependencies

# Install with parser support (includes ASE for LAMMPS and MatEnsemble parsers)
pip install nano-crucible[parsers]

# Install everything
pip install nano-crucible[all]

# For development
pip install nano-crucible[dev]

For Development

git clone https://github.com/MolecularFoundryCrucible/nano-crucible.git
cd nano-crucible
pip install -e ".[dev]"

🚀 Quick Start

Configuration

First, configure your API credentials:

crucible config init

Get your API key at: https://crucible.lbl.gov/api/v2/user_apikey

Alternative (without terminal access): Initialize the client directly with your credentials:

from crucible import CrucibleClient
client = CrucibleClient(
    api_url="https://crucible.lbl.gov/api/v2",
    api_key="your-api-key-here"
)

Python API

from crucible import CrucibleClient
from crucible.models import Dataset

# Initialize client
client = CrucibleClient()

# Create a dataset
dataset = Dataset(
    dataset_name="My Experiment",
    measurement="XRD",
    project_id="my-project",
    public=False
)

result = client.datasets.create(
    dataset=dataset,
    scientific_metadata={"temperature_C": 300},
    keywords=["experiment", "test"]
)

print(f"Dataset created: {result['dsid']}")

Command-Line Interface

# Configure API credentials
crucible config init

# Create a dataset with files
crucible dataset create -i data.csv -n "My Dataset" -m "XRD" -pid my-project

# List datasets in a project
crucible dataset list -pid my-project

# Get dataset details
crucible dataset get DATASET_ID

# Create a sample
crucible sample create -n "Sample A" -pid my-project

# Link sample to dataset
crucible sample add-dataset SAMPLE_ID -d DATASET_ID

# Open a resource in your browser
crucible open DATASET_ID

🤝 Contributing

We welcome contributions! Areas where you can help:

  • New parsers for additional data formats
  • Bug reports and feature requests
  • Documentation improvements
  • Example notebooks and tutorials

📄 License

This project is licensed under the BSD-3-Clause License - see the LICENSE file for details.

🔗 Links

💬 Support

For issues, questions, or feature requests:


nano-crucible is developed and maintained by the Data Group at the Molecular Foundry at Lawrence Berkeley National Laboratory.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nano_crucible-2.1.2.tar.gz (135.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nano_crucible-2.1.2-py3-none-any.whl (164.7 kB view details)

Uploaded Python 3

File details

Details for the file nano_crucible-2.1.2.tar.gz.

File metadata

  • Download URL: nano_crucible-2.1.2.tar.gz
  • Upload date:
  • Size: 135.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for nano_crucible-2.1.2.tar.gz
Algorithm Hash digest
SHA256 53e63d4f5c0fa39f13d024c8c8d759f8d2572fac46a454443afb10eb34083376
MD5 077d68296a89e2ba38bc20c1fab57b99
BLAKE2b-256 dbb724e181172b4ba3ee55bfa7e3fd00a1da6f2f69651ef87a103a15a84c8c70

See more details on using hashes here.

Provenance

The following attestation bundles were made for nano_crucible-2.1.2.tar.gz:

Publisher: manual-publish-to-pypi.yml on MolecularFoundryCrucible/nano-crucible

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nano_crucible-2.1.2-py3-none-any.whl.

File metadata

  • Download URL: nano_crucible-2.1.2-py3-none-any.whl
  • Upload date:
  • Size: 164.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for nano_crucible-2.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a599f8da76d7f79e410f80a537dcc2bc0709bb8c6aa9f701ee36414d2c0681e9
MD5 4ab1f532d70436bdf754cc019b089e7e
BLAKE2b-256 57d9421062cb9b4ccc420792e9889ac0504c72448bf56eeb2288b6ead5dc02e4

See more details on using hashes here.

Provenance

The following attestation bundles were made for nano_crucible-2.1.2-py3-none-any.whl:

Publisher: manual-publish-to-pypi.yml on MolecularFoundryCrucible/nano-crucible

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page