Skip to main content

LlamaIndex integration for Anyparser

Project description

Anyparser LlamaIndex: Seamless Integration of Anyparser with LlamaIndex

https://anyparser.com

Integrate Anyparser's powerful content extraction capabilities with LlamaIndex for enhanced AI workflows. This integration package enables seamless use of Anyparser's document processing and data extraction features within your LlamaIndex applications, making it easier than ever to build sophisticated AI pipelines.

Installation

pip install anyparser-llamaindex

Setup

Before running the examples, make sure to set your Anyparser API credentials as environment variables:

export ANYPARSER_API_KEY="your-api-key"
export ANYPARSER_API_URL="https://anyparserapi.com"

Anyparser LlamaIndex Examples

This examples directory contains examples demonstrating different ways to use the Anyparser LlamaIndex integration.

python examples/01_basic_usage.py
python examples/02_single_file_json.py
python examples/03_single_file_markdown.py
python examples/04_multiple_files_json.py
python examples/05_multiple_files_markdown.py
python examples/06_load_folder.py
python examples/07_ocr_markdown.py
python examples/08_ocr_json.py
python examples/09_web_crawler.py

Features Demonstrated

Document Processing

  • Different output formats (markdown, JSON)
  • Multiple file handling
  • Folder processing
  • Metadata handling

Web Crawling

  • Basic crawling with depth and scope control
  • Advanced URL and content filtering
  • Crawling strategies (BFS, LIFO)
  • Rate limiting and robots.txt respect

Notes

  • All examples use async/await for better performance
  • Error handling is included in all examples
  • Each example includes detailed comments explaining the options used
  • OCR examples support multiple languages
  • Crawler examples demonstrate various filtering and control options

Features Demonstrated

  • Different output formats (markdown, JSON)
  • OCR capabilities with language support
  • OCR performance presets
  • Image extraction
  • Table extraction
  • Metadata handling
  • Error handling
  • Async/await usage

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anyparser_llamaindex-0.0.2.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

anyparser_llamaindex-0.0.2-py3-none-any.whl (8.0 kB view details)

Uploaded Python 3

File details

Details for the file anyparser_llamaindex-0.0.2.tar.gz.

File metadata

  • Download URL: anyparser_llamaindex-0.0.2.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for anyparser_llamaindex-0.0.2.tar.gz
Algorithm Hash digest
SHA256 e651a773ffe227a36f295492d1866ce535fa2fe930ca3e7ea8feb69ec5e7ad07
MD5 fa795f43c185a07e9ba12b9e95d0acd4
BLAKE2b-256 b2e237d8f4d093369da5ea80fc8c417282e3abec089da4e71255dd8b3ad75a13

See more details on using hashes here.

File details

Details for the file anyparser_llamaindex-0.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for anyparser_llamaindex-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 fcbee509886933ad0ce533774553b68c3987d5e71014fa9d7293ec06eef250f6
MD5 e4cb8c4a3580c7254415d7de15942ec3
BLAKE2b-256 9507a7c1e973844d622522ade358ea363840f01d5368fd19b9a3f04e6e55bcce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page