Skip to main content

Easily display PDFs in Gradio. Forked from Freddy Boulton's gradio_pdf repo: https://github.com/freddyaboulton/gradio-pdf

Project description

gradio_pdf_redaction

Static Badge Static Badge Static Badge

Easily display PDFs in Gradio. Forked from Freddy Boulton's gradio_pdf repo: https://github.com/freddyaboulton/gradio-pdf

Installation

pip install gradio_pdf_redaction

Usage

import gradio as gr
from gradio_pdf_redaction import PDF
from pdf2image import convert_from_path
from transformers import pipeline
from pathlib import Path

dir_ = Path(__file__).parent

p = pipeline(
    "document-question-answering",
    model="impira/layoutlm-document-qa",
)

def qa(question: str, doc: str) -> str:
    img = convert_from_path(doc)[0]
    output = p(img, question)
    return sorted(output, key=lambda x: x["score"], reverse=True)[0]['answer']


EXAMPLES = [
    ["What is the total gross worth?", str(dir_ / "invoice_2.pdf")],
    ["Whos is being invoiced?", str(dir_ / "sample_invoice.pdf")],
]


def create_demo():
    return gr.Interface(
        qa,
        [gr.Textbox(label="Question"), PDF(label="Document")],
        gr.Textbox(),
        examples=EXAMPLES,
        cache_examples=False,
    )


# Required for `gradio cc dev` hot reload — must be a module-level Blocks/Interface.
demo = create_demo()

if __name__ == "__main__":
    demo.launch()

PDF

Initialization

name type default description
value
typing.Any
None None
height
int | None
None None
label
str | None
None None
info
str | None
None None
show_label
bool | None
None None
container
bool
True None
scale
int | None
None None
min_width
int | None
None None
interactive
bool | None
None None
visible
bool
True None
elem_id
str | None
None None
elem_classes
list[str] | str | None
None None
render
bool
True None
load_fn
typing.Optional[typing.Callable[..., typing.Any]][
    typing.Callable[..., typing.Any][Ellipsis, typing.Any],
    None,
]
None None
every
float | None
None None
starting_page
int | None
1 None

Events

name description
change
upload

User function

The impact on the users predict function varies depending on whether the component is used as an input or output for an event (or both).

  • When used as an Input, the component only impacts the input signature of the user function.
  • When used as an output, the component only impacts the return signature of the user function.

The code snippet below is accurate in cases where the component is used as both an input and an output.

  • As output: Is passed, the preprocessed input data sent to the user's function in the backend.
  • As input: Should return, the output data received by the component from the user's function in the backend.
def predict(
    value: str
) -> str | None:
    return value

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gradio_pdf_redaction-0.0.25.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gradio_pdf_redaction-0.0.25-py3-none-any.whl (376.1 kB view details)

Uploaded Python 3

File details

Details for the file gradio_pdf_redaction-0.0.25.tar.gz.

File metadata

  • Download URL: gradio_pdf_redaction-0.0.25.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for gradio_pdf_redaction-0.0.25.tar.gz
Algorithm Hash digest
SHA256 94f601e2c571e13e69331f2091b209ae1235fe3bdf971a2e11cd6e7d6bc1f2ec
MD5 0afa51771600c5f7dfb2b05c9fe5dd86
BLAKE2b-256 459e4e7bb89f51aa9c251cc0c28594e1189ae8b50e0a89812cc7494aeca79cf4

See more details on using hashes here.

File details

Details for the file gradio_pdf_redaction-0.0.25-py3-none-any.whl.

File metadata

File hashes

Hashes for gradio_pdf_redaction-0.0.25-py3-none-any.whl
Algorithm Hash digest
SHA256 d7f252b4af765dfcd0487e7debb3af0fcbba4324eca7cb2f2495c0b34b8dbd2b
MD5 91c0a187ff08e1bf46b67bd3b37320ae
BLAKE2b-256 820025ff0100d98d2879de5cbb8056357cd16dd9812be44c94d103fd0759f6ef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page