No project description provided
Project description
This repo contains code for extracting the molecules in https://data.hpc.imperial.ac.uk/resolve/?doi=4618 into an AtomLite database.
Why?
Because the original format of the published data is an out-of-date format, namely an stk JSON dump.
How?
The easiest thing to do is
pip install cage-json-extractor
Now you can download the files
cages.tar.gz - https://data.hpc.imperial.ac.uk/resolve/?doi=4618&file=3&access=
cage_prediction.db - https://data.hpc.imperial.ac.uk/resolve/?doi=4618&file=2&access=
And run
tar xf cages.tar.gz
cage_json_extractor cages/amine2aldehyde3.json cage_prediction.db amine2aldehyde3.db
Now if we want to extract all the shape persistent 4+6 cages we can run
extract_cages amine2aldehyde3.db FourPlusSix --output_directory extracted_cages
This will create a folder extracted_cages which holds a sub-folder for every shape persistent 4+6 cage in amine2aldehyde3.db. In the sub-folder you will find the .mol file of the cage and its building blocks.
Enjoy! (and sorry I deprecated the .json files)
=)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for cage-json-extractor-1.0.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 803c8e31bff5afc9c7fe6fc625b57d495b30012d7c8563329944042dc065bd6e |
|
MD5 | a0477b5e758e6123f84c505f19672cf5 |
|
BLAKE2b-256 | a90c06d01cea0eb47e2f147a498938101dd63d6d7475e79f96c9f8aa24cec574 |
Hashes for cage_json_extractor-1.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7bce3dc1aa6137710d9432050bb2d1970a66e49e1ca9569a4f9bcd046005d3b5 |
|
MD5 | 9c184c372d7d35b8ada712b32d99a1af |
|
BLAKE2b-256 | 4eb787f83e9a94b9af8b497881ed32b1492fcf7cb8209ef2ada3e8a6658d5419 |