Python binding to Java Archery Framework
Project description
PyArchery
This repository is a binging of Archery for Python language.
Description
In today's data-driven landscape, navigating the complexities of semi-structured documents poses a significant challenge for organizations. These documents, characterized by diverse formats and a lack of standardization, often require specialized skills for effective manipulation and analysis. However, we propose a novel framework to address this challenge. By leveraging innovative algorithms and machine learning techniques, Archery offers a solution that gives you control over the data extraction process with tweakable and repeatable settings. Moreover, by automating the extraction process, it not only saves time but also minimizes errors, particularly beneficial for industries dealing with large volumes of such documents. Crucially, this framework integrates with machine learning workflows, unlocking new possibilities for data enrichment and predictive modeling. By leveraging determinist algorithms, this framework is perfect to prepare your data for training processes in a predictive and reproductible manner. Aligned with the paradigm of data as a service, it offers a scalable and efficient means of managing semi-structured data, thereby expanding the toolkit of data services available to organizations.
Visit our full documentation and learn more about how it works, try our tutorials and find a full list of plugins and models.
Getting Started
Dependencies
- The Java Developer Kit, version 17.
- Python 3.8.2 or above.
- Pip 20.0.2 or above.
- Poetry 1.7.1 or above.
- Just 1.24.0 or above.
Install and setup locally
Run the following command line:
pip install pyjarchery
Run the examples
To run the tutorial1:
python examples/tutorial1.py
To run the tutorial2:
python examples/tutorial2.py
Documentation
The following links will give you documentation about some background information, takes you through some implementation details, and then focuses on step-by-step instructions for getting the most out of Any2Json:
- Using PyArchery: here.
Contribute
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.
Authors
- Romuald Rousseau, romuald.rousseau@servier.com
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pyjarchery-0.1.7.tar.gz.
File metadata
- Download URL: pyjarchery-0.1.7.tar.gz
- Upload date:
- Size: 1.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
18b5dea2fb75c15414f100e99ad937a0dff91807366326a56a00e25d3be2f59f
|
|
| MD5 |
83f7ebb48d3dc75a432efd8ce58c5ecb
|
|
| BLAKE2b-256 |
30446541631be41cd653bffffb93fb558b6f4ba046fc511552e6f0c8fa49626d
|
File details
Details for the file pyjarchery-0.1.7-py3-none-any.whl.
File metadata
- Download URL: pyjarchery-0.1.7-py3-none-any.whl
- Upload date:
- Size: 25.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dd3896ccda4361f323f62432531a354b4e54d3a9d01f77aa32d0f836ea33086a
|
|
| MD5 |
0d11bd1a7711c2e64a15bf1d0e81126d
|
|
| BLAKE2b-256 |
85c7cd7c15c116641c7e2a021308ba1fac56e4c57019d03748c4e243f226c511
|