A fast and simple NER tool
Quickner
A simple, fast, and easy-to-use NER annotator for Python.
Installation
pip install quickner
Usage
from quickner import Quickner
# Initialize the annotator
annotator = Quickner() # or Quickner("config.toml")
# Annotate the texts using the config file
annotator.process() # or annotator.process(True) to save the annotated data to a file
Configuration
The configuration file is a TOML file with the following structure:
# Configuration file for the NER tool
[general]
# Mode to run the tool, modes are:
# Annotation from the start
# Annotation from already annotated texts
# Load annotations and add new entities
[logging]
level = "debug" # level of logging (debug, info, warning, error, fatal)
[texts]
[texts.input]
filter = false # if true, only texts in the filter list will be used
path = "texts.csv" # path to the texts file
[texts.filters]
accept_special_characters = ".,-" # list of special characters to accept in the text (if special_characters is true)
alphanumeric = false # if true, only strictly alphanumeric texts will be used
case_sensitive = false # if true, case sensitive search will be used
max_length = 1024 # maximum length of the text
min_length = 0 # minimum length of the text
numbers = false # if true, texts with numbers will not be used
punctuation = false # if true, texts with punctuation will not be used
special_characters = false # if true, texts with special characters will not be used
[annotations]
format = "spacy" # format of the output file (jsonl, spacy, brat, conll)
[annotations.output]
path = "annotations.jsonl" # path to the output file
[entities]
[entities.input]
filter = true # if true, only entities in the filter list will be used
path = "entities.csv" # path to the entities file
save = true # if true, the entities found will be saved in the output file
[entities.filters]
accept_special_characters = ".-" # list of special characters to accept in the entity (if special_characters is true)
alphanumeric = false # if true, only strictly alphanumeric entities will be used
case_sensitive = false # if true, case sensitive search will be used
max_length = 20 # maximum length of the entity
min_length = 0 # minimum length of the entity
numbers = false # if true, entities with numbers will not be used
punctuation = false # if true, entities with punctuation will not be used
special_characters = true # if true, entities with special characters will not be used
[entities.excludes]
# path = "excludes.csv" # path to entities to exclude from the search
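The filter keys are easier to follow with a concrete case. The sketch below is one reading of the comments in `[texts.filters]` and `[entities.filters]`, not quickner's actual implementation. The `Filters` class and the `is_kept` helper are hypothetical names introduced for illustration, the length bounds are assumed to be inclusive, and "special character" is assumed to mean any character that is neither alphanumeric nor whitespace. `case_sensitive` is left out because, per its comment, it affects the search rather than which items are kept.

```python
import string
from dataclasses import dataclass


@dataclass
class Filters:
    # Field names mirror the keys of [texts.filters] / [entities.filters].
    accept_special_characters: str = ""
    alphanumeric: bool = False
    max_length: int = 1024
    min_length: int = 0
    numbers: bool = False
    punctuation: bool = False
    special_characters: bool = False


def is_kept(item: str, f: Filters) -> bool:
    """Return True if `item` passes every enabled filter (hypothetical helper)."""
    # Assumption: both length bounds are inclusive.
    if not (f.min_length <= len(item) <= f.max_length):
        return False
    if f.alphanumeric and not item.isalnum():
        return False
    if f.numbers and any(ch.isdigit() for ch in item):
        return False
    if f.punctuation and any(ch in string.punctuation for ch in item):
        return False
    if f.special_characters:
        # Assumption: "special" means neither alphanumeric nor whitespace;
        # characters listed in accept_special_characters are let through.
        allowed = set(f.accept_special_characters)
        if any(not (ch.isalnum() or ch.isspace() or ch in allowed) for ch in item):
            return False
    return True


# Values taken from the [entities.filters] section above.
entity_filters = Filters(
    accept_special_characters=".-",
    max_length=20,
    special_characters=True,
)

print(is_kept("scikit-learn", entity_filters))  # hyphen is in the accepted list
print(is_kept("C#", entity_filters))            # '#' is not in the accepted list
print(is_kept("a" * 21, entity_filters))        # longer than max_length
```

Under these assumptions, the first call prints `True` and the other two print `False`.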
License
Mozilla Public License, Version 2.0
Contributing
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.
Authors
- Omar MHAIMDAT