Skip to main content

Transform unstructured document collections to structured Linked Data

Project description

Ferenda is a python library and framework for transforming unstructured document collections into structured Linked Data. It helps with downloading documents, parsing them to add explicit semantic structure and RDF-based metadata, finding relationships between documents, and publishing the results, including through a REST-based HTTP API.

https://badge.fury.io/py/ferenda.png https://travis-ci.org/staffanm/ferenda.png?branch=master https://ci.appveyor.com/api/projects/status/aqdo3c39cdof8opa/branch/master https://coveralls.io/repos/staffanm/ferenda/badge.png?branch=master Code Health https://pypip.in/d/ferenda/badge.png

Quick start

This example uses ferenda’s project framework to download the 50 latest RFCs and W3C standards, parse documents into structured, RDF-enabled XHTML documents, loads all RDF metadata into a triplestore and generates a web site of static HTML5 files that are usable offline:

pip install ferenda
ferenda-setup myproject
cd myproject
./ferenda-build.py ferenda.sources.tech.RFC enable
./ferenda-build.py ferenda.sources.tech.W3Standards enable
./ferenda-build.py all all --downloadmax=50 --staticsite --fulltextindex=False
open data/index.html

The same functionality can also be accessed through a python API, if you want to use ferenda as part of a larger system. It’s also possible to just use the parts of ferenda that you need (eg. only the downloading and parsing features).

More information

See http://ferenda.readthedocs.org/ for in-depth documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ferenda-0.3.0.tar.gz (835.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ferenda-0.3.0-py2.py3-none-any.whl (842.6 kB view details)

Uploaded Python 2Python 3

File details

Details for the file ferenda-0.3.0.tar.gz.

File metadata

  • Download URL: ferenda-0.3.0.tar.gz
  • Upload date:
  • Size: 835.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for ferenda-0.3.0.tar.gz
Algorithm Hash digest
SHA256 7638d56a40ccfb8e4fbc532ca274b7a89589d6408e1709ba9118525116c8aaf3
MD5 0dea0c445ffd871821fc58e67d8851c3
BLAKE2b-256 a8edf5f3b650f3ceb9a7e6e27f7a215a9a1a9b1e5ba02fff9f8a3b9b67c6f50d

See more details on using hashes here.

File details

Details for the file ferenda-0.3.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for ferenda-0.3.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 889d1eb5ac17a61cb0aa946a3c566ae9998f1a1eac8ea17dd4e87c164f1c052a
MD5 f76c3bb402a268478293dd70b5d38bbb
BLAKE2b-256 ac1ca608e8fb4d830ceb4247759d5ac67c92c4b5df9a7b0b9226b9b839ff887c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page