Skip to main content

Ce package highlight les entités dans un pdf

Project description

NER-API

Ce package permet le traitemnt du texte , l'extraction des entités(inclus code swift et code imo), ainsi que le highlighting des ces entités présente dans un fichier pdf

Installation

pip install nerforpdf

Usage/Exemples

import nerforpdf as nfp
nerforpdf.text_preprocessing.text_preprocessing(text,accented=True,stopw=True,punctuation=True,lowercase=True,lemmatize=True,spelling=True,expand_contraction=True,urls=True)

cette fonction permet de traiter le text en utilisant les foltres présents comme argument

import nerforpdf as nfp
nerforpdf.text_preprocessing.spacy_preprocessing(text,lowercase=True,stopw=True,punctuation=True,alphabetic=True,lemmatize=True,)

Permet de faire du traitement du texte à l'aide de spacy

import nerforpdf as nfp
nerforpdf.highlight_pdf.output(input_file)

cette fonction prend en argument le chemin vers un fichier pdf , extrait les entités(code swift et imo inclus),les highlight , et enregistre le pdf highlighté dans le dossier courant sous le nom "output.pdf"

API Reference

get_entities(text)

Prend un texte(String) et retourne ses entités

highlight_pdf(pdf)

Prend le pdf encodé en base64 et retourne le pdf highlighté encodé en base64 ainsi que les entités détectées

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nerforpdf-0.0.4.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

nerforpdf-0.0.4-py3-none-any.whl (5.7 kB view details)

Uploaded Python 3

File details

Details for the file nerforpdf-0.0.4.tar.gz.

File metadata

  • Download URL: nerforpdf-0.0.4.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for nerforpdf-0.0.4.tar.gz
Algorithm Hash digest
SHA256 1109de1acc09164b914960c6add520011f9a68828ff5da11a75768bb2e4d09e8
MD5 9d23426dbd62d5875180eefcf72a28d6
BLAKE2b-256 93df2ba71c08bee81fbdec74f29e299c557a92c205b755ff73d4a5f8cf5b1e8b

See more details on using hashes here.

File details

Details for the file nerforpdf-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: nerforpdf-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 5.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for nerforpdf-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 53c86af4b033d6ee211a2723c46c6772e3047b670f2d74485b91b5e53cd60651
MD5 3b12f31d3f7370275d15382bbcda36ae
BLAKE2b-256 75c4500f09e6ae700c203a932878f1ff675b1fbd835e48804b521970d577532d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page