Skip to main content

Stiv Llenga (Master Strange) created this simple package to assist master students and other researchers who are working on generating molecular representations and manipulating arrays in various forms.

Project description

master_strange_mol_rep

Stiv Llenga (a.k.a. Master Strange) created this simple package to assist master students and other researchers who are working on generating molecular representations and manipulating arrays in various forms. For more information, please contact the author via email (stiv.llenga@h-its.org)

The following information describes the purpose and how to use the functions included with msc_strange_mol_rep:

a) Generating SOAP representation :

Please read this paper if you need to refresh your memory on what SOAP representation is : (https://doi.org/10.48550/arXiv.1209.3140).

Command:

ARRAY = create_soap(path,rcut_i=8,processors=1)

INPUT:

  • --path -> (Str) The full path to your xyz files;

  • --rcut_i -> (Float) Cutoff radius in Angstrom (Å);

  • --processors -> (Int) The total number of processors.

OUTPUT:

  • ARRAY -> The SOAP ndarray. The length of the longest molecular SOAP array is used to pad all of the molecular SOAP.

Comments:

This command generates SOAP representations for all molecules in the path-specified directory. The xyz format is required for molecular structures.

b) Generating Coulomb Matrix representation :

Please read this paper if you need to refresh your memory on what CM representation is : (https://doi:10.1103/PhysRevLett.108.058301).

Command:

ARRAY = create_cm(path, processors=1)

INPUT:

  • --path -> (Str) The full path to your xyz files;

  • --processors -> (Int) The total number of processors.

OUTPUT:

  • ARRAY -> The CM ndarray. For all molecules, the coulomb matrix is flattened and the length of the longest molecular CM array is used to pad all of the molecular CM.

Comments:

This command generates CM representations for all molecules in the path-specified directory. The xyz format is required for molecular structures.

c) Generating the Spectrum of London and Axillrod-Teller-Muto potential (SLATM) representation :

Please read this paper if you need to refresh your memory on what SLATM representation is : (https://arxiv.org/pdf/1807.04259.pdf).

Command:

ARRAY = create_slatm(path)

INPUT:

  • --path -> (Str) The full path to your xyz files

OUTPUT:

  • ARRAY -> The SLATM ndarray. The length of the longest molecular SLATM array is used to pad all of the molecular arrays.

Comments:

This command generates SLATM representations for all molecules in the path-specified directory. The xyz format is required for molecular structures.

d) Generating the MIBOC representation (Only for CCC) :

Command:

ARRAY = create_MIBOC(path,basis_set='def2-TZVP', charge=0, spin=0)

INPUT:

  • --path -> (Str) The full path to your xyz files;

  • --basis_set -> (Str) The basis set you want to use;

  • --charge -> (Int) The system's charge;

  • --spin -> (Int) The system's spin (Nr_spin_alpha - Nr_spin_beta).

OUTPUT:

  • ARRAY -> The MIBOC ndarray. The length of the longest molecular MIBOC array is used to pad all of the molecular arrays.

Comments:

This command generates MIBOC representations for all molecules in the path-specified directory. The xyz format is required for molecular structures. Please use the number of electrons with spin up minus electrons with spin down when calculating the spin (not 2S+1).

e) Generating the molecular QR representation (Only for CCC) :

Command:

create_qr(array,path)

INPUT:

  • --array -> (Str) An array of molecular representations;

  • --path -> (Str) The location on your computer where the QR images will be saved.

OUTPUT:

All QR images are saved locally.

Comments:

This command generates QR representations for all molecules arrays (any of them) and saves the results in the path specified.

f) Dimension reduction by Principal Component Analysis:

Command:

pca = dim_red(data,nr_dim=2)

INPUT:

  • --data -> (Str) The ndarray array whose dimensions should be reduced (for example, N*m array);
  • --nr_dim -> (Int) The total number of final dimensions (2 or 3).

OUTPUT:

  • --pca -> The new low-dimensional arrays (for example, N2 or N3 array)

Comments:

This command can be used to reduce the dimensionality of an array.

g) Making arrays of same size:

i) Command:

new_array = zero_pad_inner(array)

INPUT:

  • --array -> (ndarray or list) The ndarray whose components are not all of the same size.

OUTPUT:

  • --new_array -> The ndarray whose components are all of the same size.

Comments:

Adjusts the sizes of all of an ndarray's elements to fix the dimensions of the array.

ii) Command:

new_array = zero_pad_two_ndarrays(big_array,small_array)

INPUT:

  • --big_array -> The N*m-dimensional ndarray;

  • --small_array -> The ndarray with dimensions N*b, where b < m.

OUTPUT:

  • --new_array -> The modified small array with N*m dimensions.

Comments:

When two arrays must be the same size for various reasons, this command changes the smaller array's size to match that of the larger array.

h) Cosine similarity analysis:

Command:

similarity = cosine_similarity(array_one,array_two)

INPUT:

  • --array_one -> (array) The first ndarray;
  • --array_two -> (array) The second ndarray.

OUTPUT:

  • --similarity -> The similarity of array_one and array_two.

Comments:

Returns an ndarray containing the similarities of each array.

i) Set up the necessary dependencies:

Command:

install_dependencies()

j) Add in the necessary libraries:

Command:

define_modules()

HOW TO USE IT:

a) Install the package (pip install msc-strange-mol-rep==0.0.1)

b) Import mol_rep (from msc_strange_mol_rep import mol_rep)

c) Use the functions inside mol_rep (mol_rep.#)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

master_strange_mol_rep-0.0.1.tar.gz (5.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

master_strange_mol_rep-0.0.1-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file master_strange_mol_rep-0.0.1.tar.gz.

File metadata

  • Download URL: master_strange_mol_rep-0.0.1.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for master_strange_mol_rep-0.0.1.tar.gz
Algorithm Hash digest
SHA256 c8c6fe79c2ac83330e61250a67e69346d53ae17bfd4b2968281c272ae58f0d29
MD5 4f9f07de626a790c404a02a10b595fe1
BLAKE2b-256 fe83b63f3accf5b51899e3bae1e542147681e5f4c58f5a29efd4075474964f61

See more details on using hashes here.

File details

Details for the file master_strange_mol_rep-0.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for master_strange_mol_rep-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 642e5ce1946c6b4131490c1f10f7343d140dca9df59ae7c4b713f48d592f9aa9
MD5 5726d62c2af73748dd2d682b886e6ddd
BLAKE2b-256 66836b7ab7ed838afe1c077a3785b1455ccf14bd66b20a7205db077f8278c44a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page