Skip to main content

Heritage Connector NLP

Project description

heritage-connector-nlp

Text processing for the Heritage Connector: a set of NLP utilities for the Heritage sector.

--- IN DEVELOPMENT ---

(note about spaCy: the master branch and all releases after 0.2.1 use spaCy v3, which is currently in nightly and not meant for production use.)

Includes:

  • information extraction (NER, NEL, relation classification)
  • labelling (Label Studio)
  • test suite for models

Usage

Label Studio

Setting up (first time):

  1. Run label-studio start labelling --init, which will start up Label Studio and take you to a configuration wizard.
  2. Select Named Entity Recognition from the top menu, and fill in the entity types you want to annotate

Running: Run label-studio start labelling from the root directory.

Useful parameters:

  • --sampling=uniform: have Label Studio show documents in a random order
  • --label-config label_studio_config_sample.xml: load config from a file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hc-nlp-0.3.3.tar.gz (11.0 kB view details)

Uploaded Source

Built Distribution

hc_nlp-0.3.3-py3-none-any.whl (10.6 kB view details)

Uploaded Python 3

File details

Details for the file hc-nlp-0.3.3.tar.gz.

File metadata

  • Download URL: hc-nlp-0.3.3.tar.gz
  • Upload date:
  • Size: 11.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

File hashes

Hashes for hc-nlp-0.3.3.tar.gz
Algorithm Hash digest
SHA256 80143e6329e236bdba704ff605a2efe57f480f4214faff855faaea08b3c2d7e8
MD5 dd4c9670c0065aae56150546ee3be697
BLAKE2b-256 9dcfedff819171d5c0fc2cd68e2434b36587ba982a008b0dfc07bff9bff4127f

See more details on using hashes here.

File details

Details for the file hc_nlp-0.3.3-py3-none-any.whl.

File metadata

  • Download URL: hc_nlp-0.3.3-py3-none-any.whl
  • Upload date:
  • Size: 10.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

File hashes

Hashes for hc_nlp-0.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 e39e0babf1ed557d4c01ff3bddd9cb4f7d458d79e56001e61444e999a4c73148
MD5 ceb804d38e5d365bea90ce0f8c3d853d
BLAKE2b-256 fa1b323ab73bd236fe847715bfdc71c6d3397b9399bc4e55e992480a11ba0f8b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page