Skip to main content

Python reader and verifier for the ISOF v1.0/1.1/1.2 format, open standard for geochemical isotopic, geochemistry, physicochemistry and dissolved molecules data exchange

Project description

isof

Python reader and verifier for the ISOF v1.0 / v1.1 / v1.2 (Isotopic Secure Open Format) format, an open standard for exchanging geochemical and analytical data.

The ISOF format allows exchanging in a single file:

  • Isotopic ratios with full analytical metadata (since v1.0)
  • Elemental concentrations with normalized and original units (v1.2)
  • Physicochemistry parameters such as pH, Eh, temperature (v1.2)
  • Dissolved molecules and ions with regulatory compliance (v1.2)
  • Analytical methods, pipelines and purification yields
  • Traceable modifications through SHA-256 (level 1) or ECDSA P-256 + IsoFind PKI (level 2) signatures
  • Optional end-to-end encryption of the scientific payload via X25519 + ChaCha20-Poly1305 (v1.2)

Sovereignty and Confidentiality: Signature verification and decryption are 100% local. No data is sent to a third-party server.

import isof

report = isof.load("analyse_bolivie.isof")

if report.is_authentic():
    print(f"Signed by: {report.signature.signed_by}")

df = report.to_pandas()
print(df[["sample_name", "element", "ratio", "ratio_2se"]])

The ISOF format is used by IsoFind, but this parser is independent and can read any file compliant with the ISOF specification.


Installation

pip install isof

With pandas support:

pip install isof[pandas]

Requires Python ≥ 3.9.


Usage

Load a file

import isof

report = isof.load("analyse.isof")
print(report)
# <ISOfDocument v1.2 — 12 échantillon(s) — IGE Grenoble>

From a JSON string (API, database):

with open("analyse.isof") as f:
    text = f.read()

report = isof.loads(text)

Verify integrity

# Simple check
if report.is_authentic():
    print("Data integrity confirmed")

# Detailed result
result = report.verify()
print(result.valid)      # bool
print(result.level)      # 1 (SHA-256) or 2 (IsoFind PKI)
print(result.signer)     # organisation or certificate CN
print(result.signed_at)  # ISO 8601 timestamp
print(result.reason)     # None if valid, error message otherwise

Two signature levels coexist in the format:

Level Mechanism Guarantee
1 SHA-256 over the data Integrity, file has not been modified since export
2 ECDSA P-256 + IsoFind PKI Authenticity, signed by a laboratory certified by IsoFind

Verification works offline: IsoFind certificates are embedded in the package.

Decrypt an encrypted file (v1.2)

ISOF v1.2 introduces optional end-to-end encryption of the scientific payload. A file may be readable and signature-verifiable while its samples, methods and yields remain opaque until the intended recipient decrypts it.

report = isof.load("mission_defense.isof")

if report.is_encrypted:
    # The envelope remains readable (created_by, project, signature)
    # but samples/methods/... are empty collections until decryption
    print("Opaque scientific payload, private key required")
    priv_pem = open("my_private_key.pem").read()
    report = report.decrypt(priv_pem)

# After decryption, use the document as usual
df = report.to_pandas()

The encryption uses a hybrid envelope:

  • X25519 ECDH wraps a random 32-byte session key for the recipient
  • ChaCha20-Poly1305 (or AES-256-GCM) encrypts the canonicalized scientific payload
  • HKDF-SHA256 with context ISOF-v1.2-key-wrap derives the wrapping key

The private key may be provided as PEM PKCS#8, raw 32 bytes, or base64 of the raw bytes. Encryption uses no network: the entire operation is local.

Access data

# Sample list
for sample in report.samples:
    print(sample.id, sample.name, sample.classification)
    for iso in sample.isotope_data:
        print(f"  {iso.element} {iso.system} = {iso.ratio} ± {iso.ratio_2se}")
    for geo in sample.geochem_data:
        print(f"  {geo.element}: {geo.display_value} {geo.display_unit}")
    for phys in sample.physico_data:
        print(f"  {phys.parameter} = {phys.value}")
    for mol in sample.molecules_data:
        print(f"  {mol.nom} ({mol.cas}): {mol.valeur} {mol.unite}")

# Look up a sample by identifier
s = report.sample("BOL-24-01")

# Look up a physicochemistry parameter
ph_record = s.physico_parameter("pH")
if ph_record and ph_record.value < 5.0:
    print("Acidic water")

# Filter — covers isotope ratios AND geochem concentrations
sources   = report.filter_samples(classification="source")
sb_samples = report.filter_samples(element="Sb")
combined  = report.filter_samples(element="Pb", material_type="Ore")

# Regulatory alerts (v1.2)
for sample, molecule in report.non_compliant_molecules():
    print(f"Alert {sample.name}: {molecule.nom} exceeds "
          f"{molecule.seuil_ref} {molecule.seuil_ref_unit}")

# Metadata
print(report.created_by.organisation)
print(report.project.reference)

Purification yields

# Yields for a sample
yields = report.yields_for_sample("BOL-24-01")
for y in yields:
    print(f"{y.element}: {y.value_pct}%")

# Contamination alerts (yield > 105%)
suspects = report.suspicious_yields()
for y in suspects:
    print(f"Possible contamination — {y.sample_id} / {y.element}: {y.value_pct}%")

Methods and pipelines

# Preparation methods
for key, method in report.methods.items():
    print(f"{key}{method.name} ({method.type})")
    if method.yield_min_pct:
        print(f"  Expected yield: {method.yield_min_pct}{method.yield_max_pct}%")

# Pipelines
for key, pipeline in report.pipelines.items():
    print(f"{pipeline.name} ({pipeline.element})")
    for stage in pipeline.stages:
        print(f"  {stage.order}. {stage.label}")

Export to pandas

The DataFrame export now takes a family parameter that selects which measurement family to flatten:

# Isotope ratios (default, v1.0 behavior)
df = report.to_pandas()
# One row per isotopic measurement
df[["sample_name", "element", "ratio", "ratio_2se", "instrument"]]

# Elemental concentrations (v1.2)
df_geo = report.to_pandas(family="geochem")
df_geo[["sample_name", "element", "value_normalized", "display_value", "display_unit"]]

# Physicochemistry parameters (v1.2)
df_phys = report.to_pandas(family="physico")
df_phys[["sample_name", "parameter", "value", "uncertainty", "method"]]

# Dissolved molecules (v1.2)
df_mol = report.to_pandas(family="molecules")
df_mol[["sample_name", "nom", "cas", "valeur", "unite", "conforme"]]

# Standard pandas filtering
pb_data = df[df["element"] == "Pb"]
sources = df[df["classification"] == "source"]
alerts  = df_mol[df_mol["conforme"] == False]

CSV export

report.to_csv("isotopes.csv")                       # default family
report.to_csv("geochem.csv",   family="geochem")
report.to_csv("physico.csv",   family="physico")
report.to_csv("molecules.csv", family="molecules")

Error handling

from isof.exceptions import (
    ISOfParseError, ISOfVersionError,
    ISOfSignatureError, ISOfEncryptionError,
)

try:
    report = isof.load("file.isof")
except ISOfVersionError as e:
    print(f"Version {e.found} not supported, please update isof")
except ISOfParseError as e:
    print(f"Invalid file: {e}")

# Corrupted vs. absent signature — two distinct cases
result = report.verify()
if result.level == 0:
    print("No signature in this file")
elif not result.valid:
    print(f"Signature present but invalid: {result.reason}")

# Decryption errors
try:
    clear = report.decrypt(my_private_key_pem)
except ISOfEncryptionError as e:
    print(f"Decryption failed: {e}")

ISOF format

Structure of an .isof document (JSON):

{
  "isof_version": "1.2",
  "created_at": "2025-03-10T14:32:00Z",
  "created_by": { "software", "operator", "organisation" },
  "project": { "name", "reference", "client", "classification" },
  "samples": [ {
      "id", "name", "matrix", ...,
      "isotope_data":   [ ... ],  ← v1.0+
      "geochem_data":   [ ... ],  ← v1.2, optional
      "physico_data":   [ ... ],  ← v1.2, optional
      "molecules_data": [ ... ]   ← v1.2, optional
  } ],
  "methods": { ... },
  "pipelines": { ... },
  "purification": { ... },
  "assignments": [ ... ],
  "signature": { ... },          ← optional
  "encryption": { ... }          ← optional, v1.2
}

Full specification: isofind.tech/isof-spec


Development

git clone https://github.com/ColinFerrari/isof
cd isof
pip install -e ".[dev]"
pytest tests/ -v

License

MIT, see LICENSE.

This package is maintained by Colin Ferrari. The ISOF format is an open standard, third-party contributions and implementations are welcome.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

isof-1.2.0.tar.gz (65.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

isof-1.2.0-py3-none-any.whl (39.7 kB view details)

Uploaded Python 3

File details

Details for the file isof-1.2.0.tar.gz.

File metadata

  • Download URL: isof-1.2.0.tar.gz
  • Upload date:
  • Size: 65.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for isof-1.2.0.tar.gz
Algorithm Hash digest
SHA256 78c43b0dadd953d3b81c961beb4772a041e96b5882cafd16b11219b0a92ba107
MD5 94dea0b90cf79ada22f33389df78285f
BLAKE2b-256 8bae65a22859dddf30f9f197f5dc396fa503c91b81ca415685d593f86bf4198d

See more details on using hashes here.

File details

Details for the file isof-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: isof-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 39.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for isof-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7ca7a9bd34c416a6a84b6049fd0bae5115662bb431e4552b3a244dff16a6b103
MD5 4169c5c503679f07428e0ad71cc4110e
BLAKE2b-256 b900f4e3ce95cbe04d38d2a6a9d1904946990a35aaba9ce637d6779343a8966c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page