SpaCy annotator for Named Entity Recognition (NER) using ipywidgets.
Project description
spacy-annotator
SpaCy annotator for Named Entity Recognition (NER) using ipywidgets. The annotator allows users to quickly assign (custom) labels to one or more entities in the text, including noisy-prelabelling!
Features:
- The annotator supports pandas dataframe: it adds annotations in a separate 'annotation' column of the dataframe;
- Why not use transformers to label your data for you? If a model is passed into the annotator, it is used to identify entities and pre-fill the annotator for you.
- The annotations adhere to spaCy format and are ready to serve as input to a spaCy NER model.
No additional code required!
Blog post: medium/enrico.alemani/spacy-annotator
Installation
pip install spacy-annotator
Example: annotations using spaCy model
For code, see spacy_annotator demo notebook.
Contributors
dayalstrub-cma - Refactored code to class, added displacy visualisation and entity ruler.
LeafmanZ - Added to_spacy
method.
Contributing
- Fork the repo on GitHub;
- Clone the project to your own machine;
- Commit changes to your own branch; and
- Push your work back up to your own fork;
- Submit a Pull request so that I can review your changes.
Dependencies
Spacy-annotator works with SpaCy 3.X, and ipywidgets 7.X.
References
spacy-annotator is based on spaCy and pigeon (see also PigeonXT).
Many thanks to them for making their awesome libraries publicly available. Another interesting project is Doccano.
Note: spaCy is a great library and, most importantly, free to use. So please also consider using the https://prodi.gy/ annotator to keep supporting the spaCy deveopment.
Changelog
2024-03-25: Update ipywidget requirements to >=8
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file spacy_annotator-2.1.4.tar.gz
.
File metadata
- Download URL: spacy_annotator-2.1.4.tar.gz
- Upload date:
- Size: 6.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 380a20d90af21d2a8918735ca45bdd643b18d7cbef6b735c6a8bb49803e42f54 |
|
MD5 | c11ea41259adc424562e14e90bdae207 |
|
BLAKE2b-256 | 29c33a92c28d59792829129e7f5290f8703af044e0db9150385fdd4665d397db |