Skip to main content

Open parser for all.

Project description

🌊 OpenParse

OpenParse provides an API to accurately extract your unstructured data (e.g. PDF, images, charts) into structured format.

:seedling: Set up your OpenParser API key

OpenParse is still in private beta. If you are interested in testing our document models, please reach out at info@cambioml.com for a FREE API key.

To set up your API key CAMBIO_API_KEY, you will need to :

  1. create a .env file in your root folder;
  2. add the following one line to your `.env file:
    CAMBIO_API_KEY=17b************************
    

:computer: Installation

conda create -n openparse python=3.10 -y
conda activate openparse
pip3 install open_parser

:bashfile usage

To use OpenParse via curl requests, you can run the following bash command from the root folder of this repository:

bash parse.sh <your apiKey> <file path> <prompt for parse (optional, default="")>

For example, to extract a table from a PDF file, you can run the following command:

bash parse.sh gl**************************************  /path/to/your/file.pdf "Return the table in a JSON format with each box's key and value."

:scroll: Examples

OpenParse can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run OpenParse within 10 lines of code!

Prompt to Extract Key-values into JSON from W2 (PDF)

Do you want to extract key-values from a W2 PDF into JSON format? Check out this notebook (3-min read)!

Extract a Table from an Image into Markdown Format

Are you a financial analyst who need to extract ACCURATE number from a table in an image or a PDF. Check out this notebook (3-min read)!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

open_parser-0.0.7.tar.gz (2.7 kB view details)

Uploaded Source

Built Distribution

open_parser-0.0.7-py3-none-any.whl (3.2 kB view details)

Uploaded Python 3

File details

Details for the file open_parser-0.0.7.tar.gz.

File metadata

  • Download URL: open_parser-0.0.7.tar.gz
  • Upload date:
  • Size: 2.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.10.14 Darwin/23.4.0

File hashes

Hashes for open_parser-0.0.7.tar.gz
Algorithm Hash digest
SHA256 6a247ed14f1a71166860fbf5f6ceba56ee811e40fa71aab3c0365042495dc1e7
MD5 1beabe523953a9b98d579f0b01dc5672
BLAKE2b-256 c28398d8c616378298470094485f901eb85213b581aad841c82bd25535cec1cb

See more details on using hashes here.

File details

Details for the file open_parser-0.0.7-py3-none-any.whl.

File metadata

  • Download URL: open_parser-0.0.7-py3-none-any.whl
  • Upload date:
  • Size: 3.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.10.14 Darwin/23.4.0

File hashes

Hashes for open_parser-0.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 0bcff7d6e33cb47162bf0755b6e0dba780b98cbbfa1962f072112d0468a55f08
MD5 b65ac016232691edd9f3b907f11e2599
BLAKE2b-256 73524e277e141c81f130fe55e958eca5c4255ccfdae6ac3da5d3efed0a15267a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page