Skip to main content

An amazing aquaparser-parser.

Project description

Aqua-parser

Description

Aqua-parser is a package for extracting data from structured reports in pdf format.

How to use

First of all, you need to install the package:

pip install aqua-parser

Next, the package must be imported into your project:

import aquaparser

To extract the data, you just need to pass the file to the function:

measurement = aquaparser.parse('document.pdf')

The function will return you the dataclass "Measurement" object:

@dataclass
class Measurement:
    title: MeasurementTitle
    toc: list[MeasurementTOC]


@dataclass
class MeasurementTitle:
    measurement_object: str
    project: str
    report_date: datetime
    responsible_person: str


@dataclass
class MeasurementTOC:
    smd: str
    status: Optional[str]
    value_description: Optional[str]
    single_value: Optional[str]
    trial_object: Optional[str]

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aqua-parser-0.2.1.tar.gz (5.7 kB view hashes)

Uploaded Source

Built Distribution

aqua_parser-0.2.1-py3-none-any.whl (5.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page