Um leitor de documentos em Python para extrair campos, baseado em expressões regulares
Project description
document-reader
Leitor de documentos em Python para extrair campos, baseado em expressões regulares.
Instalação
pip install document-reader
Uso
from document_reader import Document, Field
doc = Document("pdf_file.pdf")
doc.register_fields(
Field(name="contract", regex=r"\d+/.*?/\d+", page=0),
Field(name="nup", regex=r"\d{5}\.\d{6}/\d{4}-\d{2}", page=1),
)
data = doc.open()
print(data)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
document_reader-0.0.5.tar.gz
(3.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file document_reader-0.0.5.tar.gz.
File metadata
- Download URL: document_reader-0.0.5.tar.gz
- Upload date:
- Size: 3.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
794658479721725f34fcf7da7d14f1f721c5aa0a6250dfb4e7d8c41dcf22ffe2
|
|
| MD5 |
74462df297849a536515f25e0bf0c221
|
|
| BLAKE2b-256 |
7fbb8b52f1791bd0b89e5952b699207c57886b59a65d0f9eba424959bff97daf
|
File details
Details for the file document_reader-0.0.5-py3-none-any.whl.
File metadata
- Download URL: document_reader-0.0.5-py3-none-any.whl
- Upload date:
- Size: 3.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ad7cf1da2e1cca73ef519af349141acc16b7da6870e94fbe9995ad1104926f0f
|
|
| MD5 |
f6466c45b556941ea77b35ee8ce3c7da
|
|
| BLAKE2b-256 |
b83fa05129e90824f487afa80938fedcc12a427a03201ff147cce7ad56a36e65
|