This package highlights entities in a PDF.

NER-API

This package handles text preprocessing, entity extraction (including SWIFT codes and IMO codes), and highlighting of those entities in a PDF file.

Installation

pip install nerforpdf

Usage/Examples

import nerforpdf as nfp
nfp.text_preprocessing.text_preprocessing(text, accented=True, stopw=True, punctuation=True, lowercase=True, lemmatize=True, spelling=True, expand_contraction=True, urls=True)

This function preprocesses the text by applying the filters passed as arguments.
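To give a rough idea of what filters of this kind usually do, here is a minimal standard-library sketch. It is not the nerforpdf implementation: the function name `preprocess_sketch` is hypothetical, only four of the flags are mirrored, and the meaning given to each flag (for example, that `accented=True` strips accents) is an assumption.

```python
import re
import string
import unicodedata

# Assumption: a "url" is anything starting with http://, https:// or www.
URL_RE = re.compile(r"https?://\S+|www\.\S+")


def preprocess_sketch(text, accented=True, punctuation=True,
                      lowercase=True, urls=True):
    """Hypothetical stand-in showing typical text filters."""
    if urls:
        # Replace URLs with a space so neighbouring words stay separated.
        text = URL_RE.sub(" ", text)
    if accented:
        # Decompose accented characters, then drop the combining marks.
        text = unicodedata.normalize("NFKD", text)
        text = "".join(ch for ch in text if not unicodedata.combining(ch))
    if punctuation:
        # Remove ASCII punctuation characters.
        text = text.translate(str.maketrans("", "", string.punctuation))
    if lowercase:
        text = text.lower()
    # Collapse the whitespace left behind by the removals.
    return " ".join(text.split())


print(preprocess_sketch("Café à Paris: voir https://example.com !"))
```

Stop-word removal, lemmatization, spelling correction and contraction expansion are left out of the sketch because they need language resources beyond the standard library.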

import nerforpdf as nfp
nfp.text_preprocessing.spacy_preprocessing(text, lowercase=True, stopw=True, punctuation=True, alphabetic=True, lemmatize=True)

Performs text preprocessing using spaCy.

import nerforpdf as nfp
nfp.highlight_pdf.output(input_file)

This function takes the path to a PDF file, extracts the entities (including SWIFT and IMO codes), highlights them, and saves the highlighted PDF in the current directory as "output.pdf".
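How the package detects SWIFT and IMO codes is not documented here. As an illustration only, the sketch below finds them with regular expressions based on the public formats: a SWIFT/BIC code is 8 or 11 characters (bank, country, location, optional branch), and an IMO number is seven digits whose last digit is a check digit. The names `SWIFT_RE`, `IMO_RE`, `imo_checksum_ok` and `find_codes` are hypothetical.

```python
import re

# SWIFT/BIC: 4-letter bank code, 2-letter country code,
# 2-character location code, optional 3-character branch code.
SWIFT_RE = re.compile(r"\b[A-Z]{4}[A-Z]{2}[A-Z0-9]{2}(?:[A-Z0-9]{3})?\b")
# IMO ship number: the letters "IMO" followed by seven digits.
IMO_RE = re.compile(r"\bIMO\s?(\d{7})\b")


def imo_checksum_ok(digits):
    """Weight the first six digits by 7..2; the sum's last digit must equal digit 7."""
    total = sum(int(d) * w for d, w in zip(digits[:6], range(7, 1, -1)))
    return total % 10 == int(digits[6])


def find_codes(text):
    """Return SWIFT-shaped strings and checksum-valid IMO numbers found in text."""
    swift = SWIFT_RE.findall(text)
    imo = [m.group(1) for m in IMO_RE.finditer(text)
           if imo_checksum_ok(m.group(1))]
    return {"swift": swift, "imo": imo}


print(find_codes("Pay to DEUTDEFF500 for vessel IMO 9074729."))
```

The SWIFT pattern checks shape only, so any eight-letter uppercase word also matches; a real detector would need an extra filter such as a country-code list.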

API Reference

get_entities(text)

Takes a text (string) and returns its entities.

highlight_pdf(pdf)

Takes a base64-encoded PDF and returns the highlighted PDF, also base64-encoded, along with the detected entities.
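Since this function exchanges PDFs as base64 text, a caller has to encode the file bytes before the call and decode the result afterwards. The sketch below shows that round trip with the standard library only; the helper names `encode_pdf` and `decode_pdf` are hypothetical, and the exact string type the function expects (str or bytes) is an assumption.

```python
import base64


def encode_pdf(pdf_bytes):
    """Encode raw PDF bytes as an ASCII base64 string."""
    return base64.b64encode(pdf_bytes).decode("ascii")


def decode_pdf(pdf_b64):
    """Decode a base64 string back into raw PDF bytes."""
    return base64.b64decode(pdf_b64)


# Stand-in bytes; in practice these would come from open(path, "rb").read().
raw = b"%PDF-1.4\n%%EOF\n"
encoded = encode_pdf(raw)
assert decode_pdf(encoded) == raw
```

The decoded bytes can then be written to disk with a file opened in binary mode.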

Screenshots

Images: highlighted PDF and original PDF.
