Skip to main content

quasigraph: Chemical and Geometric Descriptor Toolkit for Machine Learning Models.

Project description

PyPI - License PyPI DOI

Quasigraph is an open-source toolkit designed for generating chemical and geometric descriptors to be used in machine learning models.

Installation

The easiest method to install quasigraph is by utilizing pip:

$ pip install quasigraph

Getting started

from ase.build import molecule
from quasigraph import QuasiGraph

# Initialize an Atoms object for methanol (CH3OH) using ASE's molecule function
atoms = molecule('CH3OH')

# Instantiate a QuasiGraph object containing chemical and coordination numbers
qgr = QuasiGraph(atoms)

# Convert the QuasiGraph object into a pandas DataFrame
df = qgr.get_dataframe()

# Convert the QuasiGraph object into a vector
vector = qgr.get_vector()

Descriptor

The descriptor can be separated into two parts, a chemical part and a geometric part.

Chemical part

The chemical part of the descriptor employs the Mendeleev library, incorporating atomic details like the valence electron concentration, covalent radius, atomic radius, Pauling electronegativity and electron affinitity for every element within the object.

For example, for methanol (CH3OH) we have the table:

VEC covalent_radius en_pauling
0 4 0.75 2.55
1 6 0.63 3.44
2 1 0.32 2.2
3 1 0.32 2.2
4 1 0.32 2.2
5 1 0.32 2.2

Geometric part

The geometric part involves identifying all bonds and computing the coordination numbers for each atom, indicated as CN. Additionally, the generalized coordination number (GCN)[^1] is determined by summing the coordination numbers of the neighboring ligands for each atom and normalizing this sum by the highest coordination number found in the molecule.

Figure 1 - Schematic representation of the methanol molecule, indicating the chemical symbol and coordination number (CN) for every atom.

For example, for methanol (CH3OH) we have the geometric data, as shown in Fig. 1.

CN GCN
4 1.25
2 1.25
1 1.00
1 0.50
1 1.00
1 1.00

License

This is an open source code under MIT License.

Acknowledgements

We thank financial support from FAPESP (Grant No. 2022/14549-3), INCT Materials Informatics (Grant No. 406447/2022-5), and CNPq (Grant No. 311324/2020-7).

[^1]: Calle-Vallejo, F., Martínez, J. I., García-Lastra, J. M., Sautet, P. & Loffreda, D. Fast Prediction of Adsorption Properties for Platinum Nanocatalysts with Generalized Coordination Numbers, Angew. Chem. Int. Ed. 53, 8316-8319 (2014).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quasigraph-26.5.0.tar.gz (157.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quasigraph-26.5.0-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file quasigraph-26.5.0.tar.gz.

File metadata

  • Download URL: quasigraph-26.5.0.tar.gz
  • Upload date:
  • Size: 157.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for quasigraph-26.5.0.tar.gz
Algorithm Hash digest
SHA256 50c2f36855fc4510bacc75a8b708b0a988460a2f90a32005b8fed6ad2d68810a
MD5 5a55ee0999e97ec341b09c3ce4ab79e2
BLAKE2b-256 ba4096ba31bc1f8a8b8c418ffe8f7cf21efd139322e1f6c4d3c0f2ad8360e6ee

See more details on using hashes here.

File details

Details for the file quasigraph-26.5.0-py3-none-any.whl.

File metadata

  • Download URL: quasigraph-26.5.0-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for quasigraph-26.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4b246b1f96fc9c42ed50990d1695c9a0f62aa92b3ab3ab9cb89286774f6aecfd
MD5 5d8ceb994f3df8d42e99dafae2a5c8e1
BLAKE2b-256 49ff7e8b1ae7c52af6227110690fdd02b0ae357403bcea77172efa0596735584

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page