Infer schema and extract data from unstructured files

Project description

LlamaExtract (Experimental)

LlamaExtract is an API created by LlamaIndex to efficiently infer schemas and extract data from unstructured files.

LlamaExtract directly integrates with LlamaIndex.

Note: LlamaExtract is currently experimental and may change in the future.

Read below for some quickstart information, or see the full documentation.

Getting Started

First, log in and get an API key from https://cloud.llamaindex.ai.
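To keep the key out of source code, one option is to export it as LLAMA_CLOUD_API_KEY (the environment variable named in the quickstart) and read it at startup. The following is a minimal sketch; `load_api_key` is a hypothetical helper written for illustration and is not part of llama-extract.

```python
import os


def load_api_key(env_var: str = "LLAMA_CLOUD_API_KEY") -> str:
    """Return the API key stored in `env_var`, or raise if it is missing.

    Hypothetical helper for illustration; not part of llama-extract.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set {env_var} to your API key first.")
    return key
```

Failing early with a clear message avoids a less obvious authentication error later, when the first API call is made.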

Install the package:

pip install llama-extract

Now you can easily infer schemas and extract data from your files:

import nest_asyncio

# Allow nested event loops, e.g. when running inside a Jupyter notebook
nest_asyncio.apply()

from llama_extract import LlamaExtract

extractor = LlamaExtract(
    api_key="llx-...",  # can also be set in your env as LLAMA_CLOUD_API_KEY
    num_workers=4,  # if multiple files passed, split in `num_workers` API calls
    verbose=True,
)

# Infer schema
schema = extractor.infer_schema(
    "my_schema", ["./my_file1.pdf", "./my_file2.pdf"]
)

# Extract data
results = extractor.extract(schema.id, ["./my_file1.pdf", "./my_file2.pdf"])
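The `num_workers` comment above says that multiple files are split across `num_workers` API calls. As a rough illustration of what such a split can look like, the sketch below distributes a file list round-robin into batches. This is an assumption made for illustration only: `split_into_batches` is a hypothetical function, and the library's actual batching logic may differ.

```python
from typing import List


def split_into_batches(paths: List[str], num_workers: int) -> List[List[str]]:
    """Distribute `paths` round-robin across at most `num_workers` batches.

    Hypothetical illustration; not the library's internal implementation.
    """
    if num_workers < 1:
        raise ValueError("num_workers must be at least 1")
    batches = [paths[i::num_workers] for i in range(num_workers)]
    # Drop empty batches when there are fewer files than workers.
    return [batch for batch in batches if batch]


files = ["a.pdf", "b.pdf", "c.pdf", "d.pdf", "e.pdf"]
print(split_into_batches(files, 2))
# → [['a.pdf', 'c.pdf', 'e.pdf'], ['b.pdf', 'd.pdf']]
```

With a single file, any `num_workers` value yields one batch, so the setting only matters when several files are passed.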

Examples

Several end-to-end examples can be found in the examples folder.

Documentation

https://docs.cloud.llamaindex.ai/

Download files

Source Distribution

llama_extract-0.0.5.tar.gz (5.6 kB)

Uploaded Source

Built Distribution

llama_extract-0.0.5-py3-none-any.whl (6.4 kB)

Uploaded Python 3

File details

Details for the file llama_extract-0.0.5.tar.gz.

File metadata

  • Download URL: llama_extract-0.0.5.tar.gz
  • Upload date:
  • Size: 5.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.0 Linux/6.8.0-1014-azure

File hashes

Hashes for llama_extract-0.0.5.tar.gz
Algorithm Hash digest
SHA256 91ecdbe69df5a292b88dee1041d0a702591b7cc1d79a55006f431d77a90f66ed
MD5 cc276cb68f5a4b9e68e417739c496d0a
BLAKE2b-256 26b1a49cd4fa1ac2f0c515d67537ce715e4c66729f9327c8f17ea5ba942d5322
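
A downloaded file can be checked against the published digest before installing it. The sketch below uses only Python's standard `hashlib`; the function names and the local file path in the usage note are hypothetical and chosen for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, assuming the source distribution was saved to the current directory, `matches_published_hash("llama_extract-0.0.5.tar.gz", "91ecdbe69df5a292b88dee1041d0a702591b7cc1d79a55006f431d77a90f66ed")` should return `True` for an unmodified download.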

File details

Details for the file llama_extract-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: llama_extract-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 6.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.0 Linux/6.8.0-1014-azure

File hashes

Hashes for llama_extract-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 8bf7ec8c7db3052fefb0afcdac785e87824593a65fe408eed18726b0fd1d88fa
MD5 34283d0005c19a7fbcd8b47bd147ca43
BLAKE2b-256 fd2f0d5f4f46aad941f157ecb73d22596d56664d735494f348fa48cef48486e5
