Skip to main content

Python package containing a variety of distance functions for crystals, as well as evaluation metrics for crystal generation.

Project description

xtalmet

License: MIT Formatter: Ruff Linter: Ruff Packaging: uv Docs: Sphinx GitHub issues CI Status

The xtalmet package offers a variety of distance functions for comparing crystal structures. These include both binary and continuous as well as compositional and structural functions. It also enables you to evaluate a set of crystals using uniqueness and novelty metrics derived from these distances.

Motivation

A central challenge in materials science is the efficient discovery of functional crystals from the vast chemical space. Recently, the inverse design of crystals using advanced machine learning generative models has emerged as a promising approach due to their ability to rapidly generate numerous candidates. However, while these models have become increasingly sophisticated, their evaluation metrics have remained largely unchanged since the seminal work of Xie et al [1] and Zeni et al [2]. To effectively guide model development, these evaluation metrics must also continue to improve. We aim to refine two primary metrics, uniqueness and novelty, by revising the underlying distance function used to compare crystal structures.

[1] Tian Xie et al. Crystal diffusion variational autoencoder for periodic material generation. International Conference on Learning Representations 2022.

[2] Claudio Zeni et al. A generative model for inorganic materials design. Nature 2025.

Installation

The latest stable version can be installed via pip:

pip install xtalmet

Usage

Two primary features of xtalmet are the calculation of distances between crystals and the uniqueness/novelty evaluation. For the former usage, suppose you have two crystals xtal_1 and xtal_2 (pymatgen.core.Structure) whose distance you want to measure. You can do so with one line of code:

from xtalmet.distance import d_amd
distance = d_amd(xtal_1, xtal_2)

Here, d_amd is a type of continuous distance based on structural fingerprints. For a complete list of available distances, please refer to the documentation.

For the uniqueness/novelty evaluation, imagine that you want to assess a set of crystals gen_xtals (list[pymatgen.core.Structure]) generated from a model trained on the MP20 dataset. This can be done with just a few lines of code:

from xtalmet.evaluator import Evaluator
evaluator = Evaluator(gen_xtals)
uniqueness = evaluator.uniqueness(distance="d_amd")
novelty = evaluator.novelty(train_xtals="mp20", distance="d_amd")

A more detailed tutorial notebook is provided in the examples directory.

Acknowledgements

Although we have argued that the progress on evaluation metrics is slower than that on generative models, we are not the sole contributors to this field. Our work builds upon several influential studies. We particularly acknowledge the following contributions:

  • Zeni et al. [2]: For their implementation of the S.U.N. (Stability / Uniqueness / Novelty) metric.

  • Baird et al. [3]: For their code for uniqueness and novelty, and their framework for creating a Python benchmarking package.

  • Widdowson et al. [4, 5]: For their work on a continuous structural distance between crystals.

Other studies, while not directly related, were also inspiring in shaping our approach to measuring distances between crystals.
These include the work of Onwuli et al. [6] on distances between compositional embeddings, as well as Hargreaves et al.'s [7] development of optimal transport-based methods for inorganic compositions.

[3] Baird et al. matbench-genmetrics: A Python library for benchmarking crystal structure generative models using time-based splits of Materials Project structures. Journal of Open Source Software 2024.

[4] Widdowson et al. Resolving the data ambiguity for periodic crystals. Advances in Neural Information Processing Systems 2022.

[5] Widdson et al. Average Minimum Distances of periodic point sets - foundational invariants for mapping periodic crystals. MATCH Communications in Mathematical and in Computer Chemistry 2022.

[6] A. Onwuli et al. Element similarity in high-dimensional materials representations. Digital Discovery 2023.

[7] Hargreaves et al. The earth mover’s distance as a metric for the space of inorganic compositions. Chemistry of Materials 2020.

Citation

If you find xtalmet useful in your research, please consider citing:

@misc{negishi2025continuousuniquenessnoveltymetrics,
      title={Continuous Uniqueness and Novelty Metrics for Generative Modeling of Inorganic Crystals}, 
      author={Masahiro Negishi and Hyunsoo Park and Kinga O. Mastej and Aron Walsh},
      year={2025},
      eprint={2510.12405},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2510.12405}, 
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xtalmet-0.1.0.tar.gz (13.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xtalmet-0.1.0-py3-none-any.whl (15.9 kB view details)

Uploaded Python 3

File details

Details for the file xtalmet-0.1.0.tar.gz.

File metadata

  • Download URL: xtalmet-0.1.0.tar.gz
  • Upload date:
  • Size: 13.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.22

File hashes

Hashes for xtalmet-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7b5d2f0e08f52a66542a7c641cbbe2eed6c8a8aa9da2f48c163327148d8961f6
MD5 afe74aca895d4e4186e3afab6a3fc2b3
BLAKE2b-256 089508ca44326d5412ea4a21f30a8d91ff493ac59c23d4a37190a77e900dbb09

See more details on using hashes here.

File details

Details for the file xtalmet-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: xtalmet-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 15.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.22

File hashes

Hashes for xtalmet-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a4ed1fbc3eaa16962e0ab74dbe728f1f6e030cadc94d5ece5899245692b82651
MD5 3e6d7e8b1931fca26e5a04fbbb1352ed
BLAKE2b-256 e8cddedcecb333f9236918e1c37bd5e9376bf0d1dad21a4005726cf1571dcd56

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page