Skip to main content

A comprehensive document and web page conversion toolkit powered by Llama AI

Project description

llama-d5

PyPI version License Python Version CI Status

Llama D5 (llama-d5) is a document processing toolkit within the LlamaSearch AI ecosystem. It appears to focus on document conversion (potentially from various formats to PDF, indicated by llamadoc2pdf) and capturing document representations (like screenshots, suggested by llama_screenshot).

Key Features

  • Document Conversion: Includes components for converting document formats, possibly centered around PDF output (llamadoc2pdf/).
  • Document Capture: Functionality to capture visual representations of documents, like screenshots (llama_screenshot.py).
  • Core Module: Likely orchestrates the conversion and capture processes (core.py).
  • Configurable: Supports configuration options (config.py).

Installation

pip install llama-d5
# Or install directly from GitHub for the latest version:
# pip install git+https://github.com/llamasearchai/llama-d5.git

Usage

(Usage examples for document conversion and screenshot generation will be added here.)

# Placeholder for Python client usage
# from llama_d5 import DocProcessor, ConversionConfig

# config = ConversionConfig.load("config.yaml")
# processor = DocProcessor(config)

# # Convert a document to PDF
# pdf_path = processor.convert_to_pdf(input_file="document.docx")
# print(f"PDF saved to: {pdf_path}")

# # Take a screenshot
# screenshot_path = processor.take_screenshot(url="https://example.com", output_file="webpage.png")
# print(f"Screenshot saved to: {screenshot_path}")

Architecture Overview

graph TD
    A[Input Document / URL] --> B{Core Processor (core.py)};
    B -- Conversion Task --> C{Document Converter (llamadoc2pdf/)};
    C --> D[Output PDF];
    B -- Capture Task --> E{Screenshot Tool (llama_screenshot.py)};
    E --> F[Output Image (Screenshot)];

    G[Configuration (config.py)] -- Configures --> B;
    G -- Configures --> C;
    G -- Configures --> E;

    style B fill:#f9f,stroke:#333,stroke-width:2px
  1. Input: Accepts a document file or URL.
  2. Core Processor: Manages the request and routes it to the appropriate tool.
  3. Converter: Handles conversion of input documents into PDF format.
  4. Screenshot Tool: Captures a visual representation (screenshot) of a document or webpage.
  5. Output: Produces either a PDF file or an image file.
  6. Configuration: Settings control the behavior of the conversion and capture tools.

Configuration

(Details on configuring input/output formats, screenshot resolution, conversion options, etc., will be added here.)

Development

Setup

# Clone the repository
git clone https://github.com/llamasearchai/llama-d5.git
cd llama-d5

# Install in editable mode with development dependencies
pip install -e ".[dev]"

Testing

pytest tests/

Contributing

Contributions are welcome! Please refer to CONTRIBUTING.md and submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_d5-0.1.0.tar.gz (60.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_d5-0.1.0-py3-none-any.whl (48.0 kB view details)

Uploaded Python 3

File details

Details for the file llama_d5-0.1.0.tar.gz.

File metadata

  • Download URL: llama_d5-0.1.0.tar.gz
  • Upload date:
  • Size: 60.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for llama_d5-0.1.0.tar.gz
Algorithm Hash digest
SHA256 e9365733bd6c9817dd242438f57abcc96bc33be8069731163ee59770a5c264c2
MD5 973a05ae2ee019b74a356f340418e307
BLAKE2b-256 47cab60347441e6ac8efe41ddfca6740361ea26cb523bd216a3dcdbba5d29eb9

See more details on using hashes here.

File details

Details for the file llama_d5-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: llama_d5-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 48.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for llama_d5-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 141874f4dbad85610a6bfe1ea0f192181f319f633c7e8d33ecc928873bf20135
MD5 15b21a92df38169935af0bea18c57c1b
BLAKE2b-256 b33e934765249b8a57958a616a2f89e3f661204a95a4ff3f5d07f339e7557e6e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page