A comprehensive document and web page conversion toolkit powered by Llama AI
Project description
llama-d5
Llama D5 (llama-d5) is a document processing toolkit within the LlamaSearch AI ecosystem. It appears to focus on document conversion (potentially from various formats to PDF, indicated by llamadoc2pdf) and capturing document representations (like screenshots, suggested by llama_screenshot).
Key Features
- Document Conversion: Includes components for converting document formats, possibly centered around PDF output (
llamadoc2pdf/). - Document Capture: Functionality to capture visual representations of documents, like screenshots (
llama_screenshot.py). - Core Module: Likely orchestrates the conversion and capture processes (
core.py). - Configurable: Supports configuration options (
config.py).
Installation
pip install llama-d5
# Or install directly from GitHub for the latest version:
# pip install git+https://github.com/llamasearchai/llama-d5.git
Usage
(Usage examples for document conversion and screenshot generation will be added here.)
# Placeholder for Python client usage
# from llama_d5 import DocProcessor, ConversionConfig
# config = ConversionConfig.load("config.yaml")
# processor = DocProcessor(config)
# # Convert a document to PDF
# pdf_path = processor.convert_to_pdf(input_file="document.docx")
# print(f"PDF saved to: {pdf_path}")
# # Take a screenshot
# screenshot_path = processor.take_screenshot(url="https://example.com", output_file="webpage.png")
# print(f"Screenshot saved to: {screenshot_path}")
Architecture Overview
graph TD
A[Input Document / URL] --> B{Core Processor (core.py)};
B -- Conversion Task --> C{Document Converter (llamadoc2pdf/)};
C --> D[Output PDF];
B -- Capture Task --> E{Screenshot Tool (llama_screenshot.py)};
E --> F[Output Image (Screenshot)];
G[Configuration (config.py)] -- Configures --> B;
G -- Configures --> C;
G -- Configures --> E;
style B fill:#f9f,stroke:#333,stroke-width:2px
- Input: Accepts a document file or URL.
- Core Processor: Manages the request and routes it to the appropriate tool.
- Converter: Handles conversion of input documents into PDF format.
- Screenshot Tool: Captures a visual representation (screenshot) of a document or webpage.
- Output: Produces either a PDF file or an image file.
- Configuration: Settings control the behavior of the conversion and capture tools.
Configuration
(Details on configuring input/output formats, screenshot resolution, conversion options, etc., will be added here.)
Development
Setup
# Clone the repository
git clone https://github.com/llamasearchai/llama-d5.git
cd llama-d5
# Install in editable mode with development dependencies
pip install -e ".[dev]"
Testing
pytest tests/
Contributing
Contributions are welcome! Please refer to CONTRIBUTING.md and submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llama_d5-0.1.0.tar.gz.
File metadata
- Download URL: llama_d5-0.1.0.tar.gz
- Upload date:
- Size: 60.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e9365733bd6c9817dd242438f57abcc96bc33be8069731163ee59770a5c264c2
|
|
| MD5 |
973a05ae2ee019b74a356f340418e307
|
|
| BLAKE2b-256 |
47cab60347441e6ac8efe41ddfca6740361ea26cb523bd216a3dcdbba5d29eb9
|
File details
Details for the file llama_d5-0.1.0-py3-none-any.whl.
File metadata
- Download URL: llama_d5-0.1.0-py3-none-any.whl
- Upload date:
- Size: 48.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
141874f4dbad85610a6bfe1ea0f192181f319f633c7e8d33ecc928873bf20135
|
|
| MD5 |
15b21a92df38169935af0bea18c57c1b
|
|
| BLAKE2b-256 |
b33e934765249b8a57958a616a2f89e3f661204a95a4ff3f5d07f339e7557e6e
|