Skip to main content

quasigraph: Chemical and Geometric Descriptor Toolkit for Machine Learning Models.

Project description

PyPI - License PyPI DOI

Quasigraph is an open-source toolkit designed for generating chemical and geometric descriptors to be used in machine learning models.

Installation

The easiest method to install quasigraph is by utilizing pip:

$ pip install quasigraph

Getting started

from ase.build import molecule
from quasigraph import QuasiGraph

# Initialize an Atoms object for methanol (CH3OH) using ASE's molecule function
atoms = molecule('CH3OH')

# Instantiate a QuasiGraph object containing chemical and coordination numbers
qgr = QuasiGraph(atoms)

# Convert the QuasiGraph object into a pandas DataFrame
df = qgr.get_dataframe()

# Convert the QuasiGraph object into a vector
vector = qgr.get_vector()

Descriptor

The descriptor can be separated into two parts, a chemical part and a geometric part.

Chemical part

The chemical part of the descriptor employs the Mendeleev library, incorporating atomic details like the valence electron concentration, covalent radius, atomic radius, Pauling electronegativity and electron affinitity for every element within the object.

For example, for methanol (CH3OH) we have the table:

VEC covalent_radius en_pauling
0 4 0.75 2.55
1 6 0.63 3.44
2 1 0.32 2.2
3 1 0.32 2.2
4 1 0.32 2.2
5 1 0.32 2.2

Geometric part

The geometric part involves identifying all bonds and computing the coordination numbers for each atom, indicated as CN. Additionally, the generalized coordination number (GCN)[^1] is determined by summing the coordination numbers of the neighboring ligands for each atom and normalizing this sum by the highest coordination number found in the molecule.

Figure 1 - Schematic representation of the methanol molecule, indicating the chemical symbol and coordination number (CN) for every atom.

For example, for methanol (CH3OH) we have the geometric data, as shown in Fig. 1.

CN GCN
4 1.25
2 1.25
1 1.00
1 0.50
1 1.00
1 1.00

License

This is an open source code under MIT License.

Acknowledgements

We thank financial support from FAPESP (Grant No. 2022/14549-3), INCT Materials Informatics (Grant No. 406447/2022-5), and CNPq (Grant No. 311324/2020-7).

[^1]: Calle-Vallejo, F., Martínez, J. I., García-Lastra, J. M., Sautet, P. & Loffreda, D. Fast Prediction of Adsorption Properties for Platinum Nanocatalysts with Generalized Coordination Numbers, Angew. Chem. Int. Ed. 53, 8316-8319 (2014).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quasigraph-26.5.1.tar.gz (158.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quasigraph-26.5.1-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file quasigraph-26.5.1.tar.gz.

File metadata

  • Download URL: quasigraph-26.5.1.tar.gz
  • Upload date:
  • Size: 158.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for quasigraph-26.5.1.tar.gz
Algorithm Hash digest
SHA256 569945ec83d85b67b72cb3029883d92297a27d17b591415916a4982b1c5deb8f
MD5 1a6486892dff9544c309f1d01bae8416
BLAKE2b-256 9e03ea6ca8ad358cb30b810c9f385a1b29ae90fce736ab3618470882c2906b70

See more details on using hashes here.

File details

Details for the file quasigraph-26.5.1-py3-none-any.whl.

File metadata

  • Download URL: quasigraph-26.5.1-py3-none-any.whl
  • Upload date:
  • Size: 8.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for quasigraph-26.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0a26d9ecaa0480707228ef5e8020bce8abace3eded76a45c961e5811506c05b8
MD5 3aab56fe118730759d36fdde400dc567
BLAKE2b-256 4649c78f5b3f32971df92ad54ad40457d4c18e1e104d300e3b5b17ab6bbca17b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page