Heritage Connector NLP

heritage-connector-nlp

Text processing for the Heritage Connector: a set of NLP utilities for the Heritage sector.

For more information see https://doi.org/10.1002/ail2.23.

--- IN DEVELOPMENT ---

Includes:

  • low-data extensions for information extraction: named entity recognition (NER), named entity linking (NEL) and relation classification
  • labelling (Label Studio)
  • test suite for models

Usage

Label Studio

Setting up (first time):

  1. Run label-studio start labelling --init, which will start up Label Studio and take you to a configuration wizard.
  2. Select Named Entity Recognition from the top menu, and fill in the entity types you want to annotate.

Running: Run label-studio start labelling from the root directory.

Useful parameters:

  • --sampling=uniform: have Label Studio show documents in a random order
  • --label-config label_studio_config_sample.xml: load config from a file
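
As an alternative to filling in entity types through the wizard, a config file for --label-config can be generated from a list of entity types. The following is a minimal sketch, not part of this package: the helper names build_ner_config and build_start_command, the entity types PERSON, ORG and LOC, and the control names "label" and "text" are all assumptions chosen for illustration. It builds a Label Studio NER config (a View containing Labels and Text tags) and assembles the start command from the parameters listed above, without running it.

```python
import shlex
import xml.etree.ElementTree as ET


def build_ner_config(entity_types):
    """Return a Label Studio NER labelling config as an XML string.

    The control names "label" and "text" are illustrative assumptions.
    """
    view = ET.Element("View")
    labels = ET.SubElement(view, "Labels", name="label", toName="text")
    for entity in entity_types:
        ET.SubElement(labels, "Label", value=entity)
    # "$text" tells Label Studio to read the document from the task's "text" field.
    ET.SubElement(view, "Text", name="text", value="$text")
    return ET.tostring(view, encoding="unicode")


def build_start_command(project="labelling", config_path=None, uniform_sampling=False):
    """Assemble the label-studio command as an argument list (does not run it)."""
    args = ["label-studio", "start", project]
    if uniform_sampling:
        args.append("--sampling=uniform")
    if config_path:
        args += ["--label-config", config_path]
    return args


# Hypothetical entity types; replace with the ones you want to annotate.
config_xml = build_ner_config(["PERSON", "ORG", "LOC"])
command = build_start_command(
    config_path="label_studio_config_sample.xml",
    uniform_sampling=True,
)

print(config_xml)
print(shlex.join(command))
```

Saving config_xml to label_studio_config_sample.xml and running the printed command from the root directory should then start Label Studio with those entity types, assuming the installed Label Studio version accepts the parameters described above.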
