A toolkit for extracting chemical information from the scientific literature.
Project description
ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.
Features
HTML, XML and PDF document readers
Chemistry-aware natural language processing pipeline
Chemical named entity recognition
Rule-based parsing grammars for property and spectra extraction
Table parser for extracting tabulated data
Document processing to resolve data interdependencies
Installation
To install ChemDataExtractor, simply run:
pip install chemdataextractor
Or if you are an Anaconda user, run:
conda install -c chemdataextractor chemdataextractor
Alternatively, try one of the other installation options.
Documentation
Full documentation is available at http://chemdataextractor.org/docs
License
ChemDataExtractor is licensed under the MIT license, a permissive, business-friendly license for open source software.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ChemDataExtractor-c-1.0.0.tar.gz
.
File metadata
- Download URL: ChemDataExtractor-c-1.0.0.tar.gz
- Upload date:
- Size: 304.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 71ade0baa176fd8fde7118908482b10c109ad5e9effba40a994108b82f6ca2be |
|
MD5 | 8a7ec543333c75457f77509a124d6b1b |
|
BLAKE2b-256 | 4e7e08b993538190b0967f92a835b59ff6d92a575a790039a075e24080c3dcdb |
File details
Details for the file ChemDataExtractor_c-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: ChemDataExtractor_c-1.0.0-py3-none-any.whl
- Upload date:
- Size: 182.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c68e212b3e64acc2e50dc68b9203c0db8b9a5b1a2e20e9322dfd278f9300dd7a |
|
MD5 | b0a2a1ffaf58bdc848a193b14e038db7 |
|
BLAKE2b-256 | b7807c76a54ce372bc2d0ec45543bd28d2db850a90355f3629374cbada388af1 |