Open parser for all.
Project description
🌊 OpenParse
OpenParse provides an API to accurately extract your unstructured data (e.g. PDF, images, charts) into structured format.
:seedling: Set up your OpenParser API key
OpenParse is still in private beta. If you are interested in testing our document models, please reach out at info@cambioml.com for a FREE API key.
To set up your API key CAMBIO_API_KEY
, you will need to :
- create a
.env
file in your root folder; - add the following one line to your `.env file:
CAMBIO_API_KEY=17b************************
:computer: Installation
conda create -n openparse python=3.10 -y
conda activate openparse
pip3 install open_parser
:bashfile usage
To use OpenParse via curl
requests, you can run the following bash command from the root folder of this repository:
bash parse.sh <your apiKey> <file path> <prompt for parse (optional, default="")>
For example, to extract a table from a PDF file, you can run the following command:
bash parse.sh gl************************************** /path/to/your/file.pdf "Return the table in a JSON format with each box's key and value."
:scroll: Examples
OpenParse can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run OpenParse within 10 lines of code!
Prompt to Extract Key-values into JSON from W2 (PDF)
Do you want to extract key-values from a W2 PDF into JSON format? Check out this notebook (3-min read)!
Extract a Table from an Image into Markdown Format
Are you a financial analyst who need to extract ACCURATE number from a table in an image or a PDF. Check out this notebook (3-min read)!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file open_parser-0.0.7.tar.gz
.
File metadata
- Download URL: open_parser-0.0.7.tar.gz
- Upload date:
- Size: 2.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.10.14 Darwin/23.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6a247ed14f1a71166860fbf5f6ceba56ee811e40fa71aab3c0365042495dc1e7 |
|
MD5 | 1beabe523953a9b98d579f0b01dc5672 |
|
BLAKE2b-256 | c28398d8c616378298470094485f901eb85213b581aad841c82bd25535cec1cb |
File details
Details for the file open_parser-0.0.7-py3-none-any.whl
.
File metadata
- Download URL: open_parser-0.0.7-py3-none-any.whl
- Upload date:
- Size: 3.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.10.14 Darwin/23.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0bcff7d6e33cb47162bf0755b6e0dba780b98cbbfa1962f072112d0468a55f08 |
|
MD5 | b65ac016232691edd9f3b907f11e2599 |
|
BLAKE2b-256 | 73524e277e141c81f130fe55e958eca5c4255ccfdae6ac3da5d3efed0a15267a |