Various Ner dataset for multiple domains and languages
Project description
=============================== Datasets for Entity Recognition
This repository implements standerdized access to NER datasets from several domains and languages annotated with a variety of entity types, useful for named entity recognition (NER) tasks.
Datasets for NER
.. |check| unicode:: 0x2714
The following table shows the list of datasets for language specific entity recognition. the data are also listed below and more will be added in the future.
============== =============== ======================= =============================== ================================== Dataset Domain License Language Reference ============== =============== ======================= =============================== ================================== CONLL 2003 News en CONLL 2002 nl-es ============== =============== ======================= =============================== ==================================
Licenses
Notes on licenses:
The data set are under various type of licences. I do not have the time to worry about the licences now
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file ner_dataset-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: ner_dataset-0.0.2-py3-none-any.whl
- Upload date:
- Size: 12.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.8.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 325baebd72dfa19a25e09d1957d595f3b4c0e5f606daf5058e26697554c97e47 |
|
MD5 | 9385f50deec7c99382248b7d9d381b5b |
|
BLAKE2b-256 | ea36bdc2fa2df79ea8d29e52a07e4337fb29c9e2c1d5ca7621be1e02eed5481b |