Skip to main content

Crystal presets database for gemmological and mineralogical visualization

Project description

mineral-database

PyPI version Python License: MIT

Mineral Database - A comprehensive database of mineral crystal habits with CDL notation and gemmological properties for visualization applications.

Part of the Gemmology Project.

Installation

pip install mineral-database

Quick Start

from mineral_database import get_preset, list_presets, search_presets

# Get a specific mineral preset
diamond = get_preset('diamond')
print(diamond['cdl'])      # 'cubic[m3m]:{111}@1.0 + {110}@0.2'
print(diamond['system'])   # 'cubic'
print(diamond['hardness']) # 10

# List all presets
all_presets = list_presets()

# List presets by crystal system
cubic_presets = list_presets('cubic')

# Search presets
garnet_matches = search_presets('garnet')

Features

  • 216 CDL expressions across 159 mineral families (116 natural + 19 synthetic + 20 simulant + 4 composite)
  • CDL v2.0 notation for crystal habit visualization, including amorphous, nested growth, and aggregate expressions
  • FGA-standard properties (RI, SG, optical character, etc.)
  • SQLite backend for fast queries
  • Full-text search across mineral names and properties
  • Backwards compatible with original CRYSTAL_PRESETS dict API

Database Contents

The database includes presets for all major crystal systems:

System Count Examples
Cubic ~25 Diamond, Garnet, Fluorite, Pyrite
Hexagonal ~8 Beryl, Emerald, Aquamarine, Apatite
Trigonal ~15 Quartz, Ruby, Sapphire, Tourmaline
Tetragonal ~5 Zircon, Rutile, Cassiterite
Orthorhombic ~10 Topaz, Peridot, Tanzanite
Monoclinic ~10 Kunzite, Epidote, Gypsum
Triclinic ~5 Turquoise, Kyanite, Labradorite
Amorphous ~7 Opal, Pearl, Malachite, Turquoise, Sodalite, Lazurite
Twins ~15 Japan Law, Spinel Macle, Iron Cross

CDL v2.0 Expressions

All 140 CDL expressions parse successfully with CDL v2.0, including:

  • Amorphous expressions for materials without crystalline structure: opal (amorphous[opalescent]:{botryoidal}), pearl, malachite, turquoise, sodalite, lazurite, rhodochrosite (banded form)
  • Nested growth expressions using the > operator: scepter quartz, phantom quartz, diamond phantom
  • Aggregate expressions using the ~ operator: quartz cluster, amethyst geode druse, pyrite cluster, fluorite cluster, calcite parallel growth
  • Doc comments (#!) on major expressions documenting system, habit, and species information

Database Completeness (Updated 2026-01-25)

The mineral database has been comprehensively enriched with FGA-standard gemmological data:

  • 159 mineral families (116 natural + 19 synthetic + 20 simulant + 4 composite) with complete crystallographic data
  • 92/95 minerals (96.8%) with complete RI (refractive index)
  • 94/95 minerals (98.9%) with complete SG (specific gravity)
  • 94/95 minerals (98.9%) with optical character classification
  • 93/95 minerals (97.9%) with pleochroism data
  • 94/95 minerals (98.9%) with lustre, cleavage, and fracture data
  • 60/95 minerals (63.2%) with dispersion values
  • 40+ gemstones with comprehensive treatments and diagnostic inclusions

Data Completeness by Crystal System

System Minerals RI SG Optical Character
Cubic 26 92.3% 100% 100%
Trigonal 19 100% 100% 100%
Monoclinic 15 100% 100% 100%
Orthorhombic 13 92.3% 92.3% 92.3%
Hexagonal 8 100% 100% 100%
Triclinic 7 100% 100% 100%
Tetragonal 6 100% 100% 100%
Amorphous 1 100% 100% 100%

Data Sources

  • FGA (Fellowship of the Gemmological Association) curriculum materials
  • Mindat.org mineralogical database
  • GIA (Gemological Institute of America) gem encyclopedia
  • Webmineral.com physical property database
  • International Gem Society reference materials

All data has been cross-validated against multiple authoritative sources and validated for consistency with crystal system optical properties.

API Reference

Query Functions

from mineral_database import (
    get_preset,           # Get preset dict by name
    get_mineral,          # Get Mineral object by name
    list_presets,         # List preset names
    list_preset_categories,  # List categories
    search_presets,       # Full-text search
    filter_minerals,      # Filter by criteria
    get_presets_by_form,  # Get by crystal form
    count_presets,        # Total count
)

Backwards Compatibility

The package provides dict-like CRYSTAL_PRESETS for code migration:

from mineral_database import CRYSTAL_PRESETS

# All dict operations work
preset = CRYSTAL_PRESETS['diamond']
preset = CRYSTAL_PRESETS.get('ruby')
'garnet' in CRYSTAL_PRESETS
for name in CRYSTAL_PRESETS:
    print(name)

Mineral Object

from mineral_database import get_mineral, Mineral

mineral = get_mineral('ruby')
print(mineral.id)           # 'ruby'
print(mineral.name)         # 'Ruby'
print(mineral.system)       # 'trigonal'
print(mineral.chemistry)    # 'Al2O3:Cr'
print(mineral.hardness)     # 9
print(mineral.ri)           # '1.762-1.770'
print(mineral.localities)   # ['Myanmar', 'Mozambique', ...]

Property Formatting

from mineral_database import (
    INFO_GROUPS,
    get_info_properties,
    get_property_label,
    format_property_value,
)

# Get FGA-style properties
props = get_info_properties('ruby', 'fga')
# Returns: {'name': 'Ruby', 'ri': '1.762-1.770', 'sg': 4.0, ...}

# Format for display
label = get_property_label('sg')  # 'SG'
value = format_property_value('sg', 4.0)  # '4'

CLI Usage

# List all presets
mineral-db --list

# List by crystal system
mineral-db --list cubic

# Show preset details
mineral-db --info diamond

# Search presets
mineral-db --search garnet

# Output as JSON
mineral-db --json ruby

# Show categories
mineral-db --categories

Database Schema

The SQLite database stores minerals with full gemmological properties:

# Core fields (all minerals)
id, name, cdl, system, point_group, chemistry, hardness, description

# Optional gemmological properties
sg, ri, birefringence, optical_character, dispersion, lustre,
cleavage, fracture, pleochroism, twin_law, phenomenon

# List fields (JSON-encoded)
localities, forms, colors, treatments, inclusions

Building the Database

For development, you can rebuild the database from source:

# From legacy Python dict
python scripts/build_db.py --from-legacy crystal_presets.py -o minerals.db

# From YAML files
python scripts/build_db.py --from-yaml data/source/minerals -o minerals.db

# Export to YAML for editing
python scripts/build_db.py --export-yaml minerals.db -o data/source/minerals

Development

# Clone and install
git clone https://github.com/gemmology-dev/mineral-database.git
cd mineral-database
pip install -e ".[dev]"

# Run tests
pytest

# Build database
python scripts/build_db.py --from-legacy /path/to/crystal_presets.py -o src/mineral_database/data/minerals.db

License

MIT License - see LICENSE for details.

Related Projects

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gemmology_mineral_database-2.3.0.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gemmology_mineral_database-2.3.0-py3-none-any.whl (1.3 MB view details)

Uploaded Python 3

File details

Details for the file gemmology_mineral_database-2.3.0.tar.gz.

File metadata

File hashes

Hashes for gemmology_mineral_database-2.3.0.tar.gz
Algorithm Hash digest
SHA256 1ab458c673fe9aa9f9be8e92d4f9fa0b7d1984fcdc640d7c7cb6225c23e4c319
MD5 a72448acffe2df6a5707fdff787cbd5c
BLAKE2b-256 118e4933af4cdfaacbbb340e3082f5533341cfdc869bb58df7ff8ed6766cb17e

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemmology_mineral_database-2.3.0.tar.gz:

Publisher: pypi-publish.yml on gemmology-dev/mineral-database

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file gemmology_mineral_database-2.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for gemmology_mineral_database-2.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a29a7b69679892ee7e6f04132196e490cb96d66871cf4de07a3e785ec5e0c5d6
MD5 9c4eff7e214485f946ec1c12952bc6aa
BLAKE2b-256 9b2c198f85eabc2f3821417895c7bc6e388ba34379f37ab0f41ad1a3425ec9ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemmology_mineral_database-2.3.0-py3-none-any.whl:

Publisher: pypi-publish.yml on gemmology-dev/mineral-database

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page