Parser for all.
Project description
🌊 AnyParser
AnyParser provides an API to accurately extract your unstructured data (e.g. PDF, images, charts) into structured format.
:seedling: Set up your AnyParser API key
You can generate your keys at the Playground Account Page with up to 2 keys and 100 total free pages per account.
⚠️ Note: The free API is limited to 10 pages/call.
If you're interested in more AnyParser usage and applications, please reach out at info@cambioml.com for details.
To set up your API key CAMBIO_API_KEY
, you will need to :
- create a
.env
file in your root folder; - add the following one line to your `.env file:
CAMBIO_API_KEY=0cam************************
:computer: Installation
conda create -n any-parse python=3.10 -y
conda activate any-parse
pip3 install any-parser
If you want to run pdf_to_markdown.ipynb, install the following:
- Mac:
brew install poppler
- Linux:
sudo apt update sudo apt install poppler-utils
- Windows:
choco install poppler
:scroll: Examples
AnyParser can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run AnyParser within 10 lines of code!
Extract all text and layout from PDF into Markdown Format
Are you an AI engineer who need to ACCURATELY extract both the text and its layout (e.g. table of content or markdown headers hierarchy) from a PDF. Check out this notebook demo (3-min read)!
Extract a Table from an Image into Markdown Format
Are you a financial analyst who need to extract ACCURATE number from a table in an image or a PDF. Check out this notebook (3-min read)!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for any_parser-0.0.13-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5e1f1c83b9475af8ab39eb5121105fef84872df25cb7be53679f3fae55847bd5 |
|
MD5 | bfe2adf6169f6649df2730abd3559fbe |
|
BLAKE2b-256 | 390c23a7cf9e98aa952acb27f2643839a83da5d47be9169e5b8b4c5f49bc66bd |