ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.
Features
HTML, XML and PDF document readers
Chemistry-aware natural language processing pipeline
Chemical named entity recognition
Rule-based parsing grammars for property and spectra extraction
Table parser for extracting tabulated data
Document processing to resolve data interdependencies
Installation
To install ChemDataExtractor, simply run:
pip install chemdataextractor
Or, if you use Anaconda, run:
conda install -c chemdataextractor chemdataextractor
Alternatively, try one of the other installation options.
Documentation
Full documentation is available at http://chemdataextractor.org/docs
License
ChemDataExtractor is licensed under the MIT license, a permissive, business-friendly license for open source software.
Source Distribution
Hashes for ChemDataExtractor-c-1.0.0.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 71ade0baa176fd8fde7118908482b10c109ad5e9effba40a994108b82f6ca2be
MD5 | 8a7ec543333c75457f77509a124d6b1b
BLAKE2b-256 | 4e7e08b993538190b0967f92a835b59ff6d92a575a790039a075e24080c3dcdb
Built Distribution
Hashes for ChemDataExtractor_c-1.0.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | c68e212b3e64acc2e50dc68b9203c0db8b9a5b1a2e20e9322dfd278f9300dd7a
MD5 | b0a2a1ffaf58bdc848a193b14e038db7
BLAKE2b-256 | b7807c76a54ce372bc2d0ec45543bd28d2db850a90355f3629374cbada388af1
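The digests listed above can be used to check that a downloaded file was not corrupted or altered. The following is a minimal sketch using only Python's standard `hashlib` module. The helper names `file_digests` and `matches_expected` are hypothetical and chosen for illustration, and the sketch assumes that the BLAKE2b-256 entry is a BLAKE2b hash with a 32-byte digest.

```python
import hashlib

# Digests copied from the listing above for ChemDataExtractor-c-1.0.0.tar.gz.
EXPECTED_SDIST = {
    "sha256": "71ade0baa176fd8fde7118908482b10c109ad5e9effba40a994108b82f6ca2be",
    "blake2b-256": "4e7e08b993538190b0967f92a835b59ff6d92a575a790039a075e24080c3dcdb",
}


def file_digests(path, chunk_size=1 << 20):
    """Return hex digests of the file at `path`, read in chunks (hypothetical helper)."""
    hashers = {
        "sha256": hashlib.sha256(),
        # Assumption: "BLAKE2b-256" means BLAKE2b with a 32-byte (256-bit) digest.
        "blake2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are never loaded fully into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def matches_expected(path, expected=EXPECTED_SDIST):
    """Return True only if every expected digest matches the file (hypothetical helper)."""
    actual = file_digests(path)
    return all(actual[name] == digest.lower() for name, digest in expected.items())
```

Calling `matches_expected("ChemDataExtractor-c-1.0.0.tar.gz")` on the downloaded source distribution should return `True` if the file is intact. To check the wheel instead, pass a dictionary built from its digests as the `expected` argument.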