An amazing aquaparser-parser.
Project description
Aqua-parser
Description
Aqua-parser is a package for extracting data from structured reports in pdf format.
How to use
First of all, you need to install the package:
pip install aqua-parser
Next, the package must be imported into your project:
import aquaparser
To extract the data, you just need to pass the file to the function:
measurement = aquaparser.parse('document.pdf')
The function will return you the dataclass "Measurement" object:
@dataclass
class Measurement:
title: MeasurementTitle
toc: list[MeasurementTOC]
@dataclass
class MeasurementTitle:
measurement_object: str
project: str
report_date: datetime
responsible_person: str
@dataclass
class MeasurementTOC:
smd: str
status: Optional[str]
value_description: Optional[str]
single_value: Optional[str]
trial_object: Optional[str]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
aqua-parser-0.2.0.tar.gz
(5.7 kB
view hashes)
Built Distribution
Close
Hashes for aqua_parser-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1c855e2594b927a4d92db8cc3a990508a0aa50611c7238a6d878589abf709c13 |
|
MD5 | 235d3bad8f06615694b0bd0b74357f27 |
|
BLAKE2b-256 | c564aeb7320f2d7e38f0ad25efb433592b55072c4ee86ef2154ec9ecc1db35a4 |