Extract entities from text documents and transform them to schema.org resources
Project description
doc2sdo
Extract entities from text documents and transform them to schema.org resources.
Installation
pip install doc2sdo
Usage
From the command line
doc2sdo path/to/your.pdf
From Python
from pathlib import Path
import sys
from doc2sdo import doc2sdo
for thing in doc2sdo(Path("/path/to/your.pdf")):
thing.resource.graph.serialize(sys.stdout.buffer)
Development
Prerequisites
Install dependencies
script/bootstrap
Run tests
script/test
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
doc2sdo-1.0.0.tar.gz
(20.9 kB
view hashes)
Built Distribution
doc2sdo-1.0.0-py3-none-any.whl
(25.6 kB
view hashes)