Skip to main content

A python script to parallelize basic MolVS functions

Project description

chemo_sanitizer

A set of script using RDKit and MolVS to proceed to the standardization , sanitization and different format conversion. Parallelized.

In this state it take only InChI as inputs and returns sanitized SMILES, InChI, InChIKeys, Short InChIkeys, MolecularFormula, ExactMass and XLogP.

Requirements

Install the conda environment by

conda env create -f environment.yml

Specific requirement

Note that some modules of MolVS are not loaded by default. You will need to edit the MolVS init.py in ~/opt/anaconda3/lib/python3.7/site-packages/molvs/ accordingly. Add these lines to the init.py

from .fragment import LargestFragmentChooser, FragmentRemover
from .charge import Uncharger

Usage

Go to the src folder Then add input and output file path as first and second argument, InChI column header as third argument and finally the number of cpus you want to use.

Example :

cd src
python chemosanitizer.py ~/translatedStructureRdkit.tsv ./test.tsv structureTranslated 6

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chemosanitizer-0.0.2.tar.gz (9.2 kB view details)

Uploaded Source

File details

Details for the file chemosanitizer-0.0.2.tar.gz.

File metadata

  • Download URL: chemosanitizer-0.0.2.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0.post20200714 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3

File hashes

Hashes for chemosanitizer-0.0.2.tar.gz
Algorithm Hash digest
SHA256 92e46fa5a7763a3d97585c91b9fd887e2fa2d3f34428feb57d00da7838b84f31
MD5 3453ece11f9ccd4862857f052a9e42bf
BLAKE2b-256 bece5df5d5c3436eaafe1a3940a29fdcebb2e1fbd522ac77838f739f3b2a4a99

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page