Skip to main content

llama-index readers pdf_table integration

Project description

PDF Table Loader

pip install llama-index-readers-pdf-table

This loader reads the tables included in the PDF.

Users can input the PDF file and the pages from which they want to extract tables, and they can read the tables included on those pages.

Usage

Here's an example usage of the PDFTableReader. pages parameter is the same as camelot's pages. Therefore, you can use patterns such as all, 1,2,3, 10-20, and so on.

from llama_index.readers.pdf_table import PDFTableReader
from pathlib import Path

reader = PDFTableReader()
pdf_path = Path("/path/to/pdf")
documents = reader.load_data(file=pdf_path, pages="80-90")

Example

This loader is designed to be used as a way to load data into LlamaIndex.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_pdf_table-0.4.1.tar.gz (15.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_readers_pdf_table-0.4.1-py3-none-any.whl (15.1 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_readers_pdf_table-0.4.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_pdf_table-0.4.1.tar.gz
Algorithm Hash digest
SHA256 8eccee6f0f395ac1afd2a76b62f39c109a105bdb55939cce35031a4dbf64ca12
MD5 e8bbbc53a90d7f21e5efd1d7b5cfe518
BLAKE2b-256 42f964e160b57649f7b9f341ef6ff6a5d4ee01f283ca659bfe09eb9e72e57afa

See more details on using hashes here.

File details

Details for the file llama_index_readers_pdf_table-0.4.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_pdf_table-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 223ba1c5189851590bbdfa0fa54e8116550bcccbf5c14b74596f3d42168f3a01
MD5 a91ce69e1e8a689d3e4fd1142269cf89
BLAKE2b-256 d15f919dac3188a53cbb1b300db2751154c20b6ec22d59f6f0a874db54b40ad3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page