An amazing aquaparser-parser.
Project description
Aqua-parser
Description
Aqua-parser is a package for extracting data from structured reports in pdf format.
How to use
First of all, you need to install the package:
pip install aqua-parser
Next, the package must be imported into your project:
import aquaparser
To extract the data, you just need to pass the file to the function:
measurement = aquaparser.parse('document.pdf')
The function will return you the dataclass "Measurement" object:
@dataclass
class Measurement:
title: MeasurementTitle
toc: list[MeasurementTOC]
@dataclass
class MeasurementTitle:
measurement_object: str
project: str
report_date: datetime
responsible_person: str
@dataclass
class MeasurementTOC:
smd: str
status: Optional[str]
value_description: Optional[str]
single_value: Optional[str]
trial_object: Optional[str]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
aqua-parser-0.2.1.tar.gz
(5.7 kB
view hashes)
Built Distribution
Close
Hashes for aqua_parser-0.2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cc6ef2c738d105dcfbae6ef21c694f516fb883aad0a427594be1ff2772fd7c79 |
|
MD5 | d60b51b16b0ac769cfaeb15ab3099275 |
|
BLAKE2b-256 | b45b704eda078c4bf1226f56c31efe1c61ab70b6e2e1e9efbd81339b3976292d |