Skip to main content

Heritage Connector NLP

Project description

heritage-connector-nlp

Text processing for the Heritage Connector: a set of NLP utilities for the Heritage sector.

For more information see https://doi.org/10.1002/ail2.23.

--- IN DEVELOPMENT ---

Includes:

  • low-data extensions for information extraction (NER, NEL, relation classification)
  • labelling (Label Studio)
  • test suite for models

Usage

Label Studio

Setting up (first time):

  1. Run label-studio start labelling --init, which will start up Label Studio and take you to a configuration wizard.
  2. Select Named Entity Recognition from the top menu, and fill in the entity types you want to annotate

Running: Run label-studio start labelling from the root directory.

Useful parameters:

  • --sampling=uniform: have Label Studio show documents in a random order
  • --label-config label_studio_config_sample.xml: load config from a file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hc-nlp-0.3.7.tar.gz (17.3 kB view details)

Uploaded Source

Built Distribution

hc_nlp-0.3.7-py3-none-any.whl (14.6 kB view details)

Uploaded Python 3

File details

Details for the file hc-nlp-0.3.7.tar.gz.

File metadata

  • Download URL: hc-nlp-0.3.7.tar.gz
  • Upload date:
  • Size: 17.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for hc-nlp-0.3.7.tar.gz
Algorithm Hash digest
SHA256 a0a2b25619d8f7a5fe6df661eab5ddd5bc4870f5d75efed66b106727f0930481
MD5 e354789e62b3c32ad53ae0fb6b18ffb0
BLAKE2b-256 97eb6576d6238f4dbd93586752609ef5562e859965a306623ab9f353e14e6943

See more details on using hashes here.

File details

Details for the file hc_nlp-0.3.7-py3-none-any.whl.

File metadata

  • Download URL: hc_nlp-0.3.7-py3-none-any.whl
  • Upload date:
  • Size: 14.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for hc_nlp-0.3.7-py3-none-any.whl
Algorithm Hash digest
SHA256 19d3b0579738e455d183ca0fd2f178d10750c10c846e00431c3b76e003708273
MD5 af491525e538b3147e0a6141134e736d
BLAKE2b-256 c523990d28cd7e2ebe39b4e4694230cc014d01011ea608c80bf7d1ef1b08df26

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page