Skip to main content

Heritage Connector NLP

Project description

heritage-connector-nlp

Text processing for the Heritage Connector: a set of NLP utilities for the Heritage sector.

--- IN DEVELOPMENT ---

Includes:

  • information extraction (NER, NEL, relation classification)
  • labelling (Label Studio)
  • test suite for models

Usage

Label Studio

Setting up (first time):

  1. Run label-studio start labelling --init, which will start up Label Studio and take you to a configuration wizard.
  2. Select Named Entity Recognition from the top menu, and fill in the entity types you want to annotate

Running: Run label-studio start labelling from the root directory.

Useful parameters:

  • --sampling=uniform: have Label Studio show documents in a random order
  • --label-config label_studio_config_sample.xml: load config from a file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hc-nlp-0.2.0.tar.gz (10.2 kB view details)

Uploaded Source

Built Distribution

hc_nlp-0.2.0-py3-none-any.whl (9.9 kB view details)

Uploaded Python 3

File details

Details for the file hc-nlp-0.2.0.tar.gz.

File metadata

  • Download URL: hc-nlp-0.2.0.tar.gz
  • Upload date:
  • Size: 10.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.9.0

File hashes

Hashes for hc-nlp-0.2.0.tar.gz
Algorithm Hash digest
SHA256 9a269f52de00eab5c12692feb870cbd9bb9c77e67f5c170b53db8c64faf4c454
MD5 f087f65cda765932169ed96925cb79fa
BLAKE2b-256 a16b752fdc9f63ff8eb33e7e75d3b0f86cfa5616732992b04ba74be20960c951

See more details on using hashes here.

File details

Details for the file hc_nlp-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: hc_nlp-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 9.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.9.0

File hashes

Hashes for hc_nlp-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dec5b013b8ead7936cba900a196973e8c67bf23b70dc1ecf25d4b1b1e019fd52
MD5 f83340a5bc6405d4587d3dcec0c9a84a
BLAKE2b-256 ed471684fb031c73e281a565c92db7b9715c3d1fcd631f37697fc64784be4f06

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page