Scikit-Learn like Named Entity Recognition modules
Project description
sequence-learn
Sklearn-like API for Sequence Learning tasks like Named Entity Recognition.
sequence-learn
takes as input embedded token lists, which you can produce using e.g. Spacy or NLTK for tokenization and Sklearn or Hugging Face for the embedding procedure. The labels are on token-level, i.e., for each token, you must provide some information in a simple list.
Example
# some token-level embedding, e.g. based on character embeddings
x = [[
[0, 0, 0, 0, 0, 0, 0, 1, 0, 1, 1, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 1, 0, 0],
],[
[0, 0, 0, 0, 0, 0, 0, 1, 0, 1, 1, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 1, 0, 0],
[1, 0, 0, 0, 0, 0, 0, 1, 0, 1, 1, 0, 0]
]]
# token-level labels, where OUTSIDE means that this token contains no label
y = [["OUTSIDE", "LABEL-1"],
["LABEL-2","LABEL-1","OUTSIDE"]]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for sequencelearn-0.0.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0be1a8209188c7b362893588a3734ee3d9b6396fa663f0a5fe5adc1f67b13a3 |
|
MD5 | 0dfb49aca53b7485b3bfe6ef2c906179 |
|
BLAKE2b-256 | f66a3c09ce545373b514ceb779be150dc8f30c1021fbe3955cf67265229071f5 |