Skip to main content

Heritage Connector NLP

Project description

heritage-connector-nlp

Text processing for the Heritage Connector: a set of NLP utilities for the Heritage sector.

--- IN DEVELOPMENT ---

Includes:

  • information extraction (NER, NEL, relation classification)
  • labelling (Label Studio)
  • test suite for models

Usage

Label Studio

Setting up (first time):

  1. Run label-studio start labelling --init, which will start up Label Studio and take you to a configuration wizard.
  2. Select Named Entity Recognition from the top menu, and fill in the entity types you want to annotate

Running: Run label-studio start labelling from the root directory.

Useful parameters:

  • --sampling=uniform: have Label Studio show documents in a random order
  • --label-config label_studio_config_sample.xml: load config from a file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hc-nlp-0.1.0.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

hc_nlp-0.1.0-py3-none-any.whl (10.1 kB view details)

Uploaded Python 3

File details

Details for the file hc-nlp-0.1.0.tar.gz.

File metadata

  • Download URL: hc-nlp-0.1.0.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.3

File hashes

Hashes for hc-nlp-0.1.0.tar.gz
Algorithm Hash digest
SHA256 cf008a8cf58f05615482b65679b7eb5256f98fd9abafe0c73e0207ac17ac0d7f
MD5 ee9897f295fe27d1ae16f86681ebeb9e
BLAKE2b-256 058b62b3c7d96070d599db57405812598281bcb4742fde9cf26e97a7d5a13ce8

See more details on using hashes here.

File details

Details for the file hc_nlp-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: hc_nlp-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 10.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.3

File hashes

Hashes for hc_nlp-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 91e0653acb6a16920d9c74ad483f67d3b39ed96e8e6347102ff01ba0b0912439
MD5 c56c27e5b10e09cea50cb50261c888f3
BLAKE2b-256 6e86da0ed9c6cb8f1cc49b7138a51a4b75c8cb8c090f4940f35ba1a590e4df51

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page