A toolkit for extracting chemical information from the scientific literature.
Project description
ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.
Features
HTML, XML and PDF document readers
Chemistry-aware natural language processing pipeline
Chemical named entity recognition
Rule-based parsing grammars for property and spectra extraction
Table parser for extracting tabulated data
Document processing to resolve data interdependencies
Installation
To install ChemDataExtractor, simply run:
pip install chemdataextractor
Alternatively, try one of the other installation options.
Documentation
Full documentation is available at http://chemdataextractor.org/docs
License
ChemDataExtractor is licensed under the MIT license.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file ChemDataExtractor-1.2.0.tar.gz
.
File metadata
- Download URL: ChemDataExtractor-1.2.0.tar.gz
- Upload date:
- Size: 184.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 67f396d96358dbcff48c7d42fd70363d36cb5ca461789f78f7d7e026e9d82ba4 |
|
MD5 | 04d6debb9910bc8f3e97ef0975966d15 |
|
BLAKE2b-256 | 3c839c531ef3c64b7457753dadeb4179a250a704c861efca9644b47286ff3b46 |