Skip to main content

Heritage Connector NLP

Project description

heritage-connector-nlp

Text processing for the Heritage Connector: a set of NLP utilities for the Heritage sector.

For more information see https://doi.org/10.1002/ail2.23.

--- IN DEVELOPMENT ---

Includes:

  • low-data extensions for information extraction (NER, NEL, relation classification)
  • labelling (Label Studio)
  • test suite for models

Usage

Label Studio

Setting up (first time):

  1. Run label-studio start labelling --init, which will start up Label Studio and take you to a configuration wizard.
  2. Select Named Entity Recognition from the top menu, and fill in the entity types you want to annotate

Running: Run label-studio start labelling from the root directory.

Useful parameters:

  • --sampling=uniform: have Label Studio show documents in a random order
  • --label-config label_studio_config_sample.xml: load config from a file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hc-nlp-0.3.6.tar.gz (16.2 kB view details)

Uploaded Source

Built Distribution

hc_nlp-0.3.6-py3-none-any.whl (14.0 kB view details)

Uploaded Python 3

File details

Details for the file hc-nlp-0.3.6.tar.gz.

File metadata

  • Download URL: hc-nlp-0.3.6.tar.gz
  • Upload date:
  • Size: 16.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for hc-nlp-0.3.6.tar.gz
Algorithm Hash digest
SHA256 f8e505a092e04ac6dd09b0b2582da0eec9c8e336c75dda1645fb5ddac29b9cdb
MD5 baf04cdcb6e6eb057350081108a23994
BLAKE2b-256 4bdc44d53ff137c7bfcdc4c2a240edf338ae6e89b679499fa508f2648b9f4f12

See more details on using hashes here.

File details

Details for the file hc_nlp-0.3.6-py3-none-any.whl.

File metadata

  • Download URL: hc_nlp-0.3.6-py3-none-any.whl
  • Upload date:
  • Size: 14.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for hc_nlp-0.3.6-py3-none-any.whl
Algorithm Hash digest
SHA256 59c95732676e9aeb6b596daa05095173744d58cfe6f1549c76e23671abb016f9
MD5 4a71bad977618d62d479833448a2d253
BLAKE2b-256 23c8438474ad82ba0365270b7017814c852da9a66ee869e30470b94fd5270da1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page