Various Ner dataset for multiple domains and languages
Project description
=============================== Datasets for Entity Recognition
This repository implements standerdized access to NER datasets from several domains and languages annotated with a variety of entity types, useful for named entity recognition (NER) tasks.
Datasets for NER
.. |check| unicode:: 0x2714
The following table shows the list of datasets for language specific entity recognition. the data are also listed below and more will be added in the future.
============== =============== ======================= =============================== ================================== Dataset Domain License Language Reference ============== =============== ======================= =============================== ================================== CONLL 2003 News en CONLL 2002 nl-es ============== =============== ======================= =============================== ==================================
Licenses
Notes on licenses:
The data set are under various type of licences. I do not have the time to worry about the licences now
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for ner_dataset-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 325baebd72dfa19a25e09d1957d595f3b4c0e5f606daf5058e26697554c97e47 |
|
MD5 | 9385f50deec7c99382248b7d9d381b5b |
|
BLAKE2b-256 | ea36bdc2fa2df79ea8d29e52a07e4337fb29c9e2c1d5ca7621be1e02eed5481b |