
Extract text from simple PDF documents

Project description


tags: [gradio-custom-component, text extraction, pdf to string]
title: gradio_simpletextextractfrompdf
short_description: extract text from simple pdf documents
colorFrom: blue
colorTo: yellow
sdk: gradio
pinned: false
app_file: space.py

gradio_simpletextextractfrompdf


Installation

pip install gradio_simpletextextractfrompdf

Usage

import gradio as gr
from gradio_simpletextextractfrompdf import SimpleTextExtractFromPDF

def first_200_chars(text):
    return text[:200]


demo = gr.Interface(
    fn=first_200_chars,
    inputs=SimpleTextExtractFromPDF(),
    outputs=gr.Textbox(label="First 200 characters of the extracted text"),
    title="Simple Text Extract From PDF",
    description="Extract text from a PDF file or URL",
)


if __name__ == "__main__":
    demo.launch()

SimpleTextExtractFromPDF

Initialization

Each entry below gives the parameter name, its type and default in parentheses, and then its description.

  • value (str | None, default None): Default text to provide in the textbox. If a function is provided, it is called each time the app loads to set the initial value of this component.
  • every (Timer | float | None, default None): Continuously calls `value` to recalculate it if `value` is a function (has no effect otherwise). Can be a Timer whose tick resets `value`, or a float giving the regular interval for the reset Timer.
  • label (str | I18nData | None, default None): The label for this component, displayed above the component if `show_label` is `True`, and also used as the header if there is a table of examples for this component. If None and used in a `gr.Interface`, the label is the name of the parameter this component corresponds to.
  • inputs (Component | Sequence[Component] | set[Component] | None, default None): No description given.
  • show_label (bool | None, default None): If True, the label is displayed.
  • scale (int | None, default None): Relative size compared to adjacent Components. For example, if Components A and B are in a Row, and A has scale=2 and B has scale=1, A will be twice as wide as B. Should be an integer. scale applies in Rows, and to top-level Components in Blocks where fill_height=True.
  • min_width (int, default 160): Minimum pixel width; the component wraps if there is not enough screen space to satisfy this value. If a certain scale value would make this Component narrower than min_width, min_width is respected first.
  • interactive (bool | None, default None): If True, the component is rendered as an editable textbox; if False, editing is disabled. If not provided, this is inferred from whether the component is used as an input or an output.
  • visible (bool, default True): If False, the component is hidden.
  • elem_id (str | None, default None): An optional string assigned as the id of this component in the HTML DOM. Can be used for targeting CSS styles.
  • elem_classes (list[str] | str | None, default None): An optional list of strings assigned as the classes of this component in the HTML DOM. Can be used for targeting CSS styles.
  • render (bool, default True): If False, the component is not rendered in the Blocks context. Use this when the intention is to assign event listeners now but render the component later.
  • key (int | str | tuple[int | str, ...] | None, default None): In a gr.render, Components with the same key across re-renders are treated as the same component, not a new component. Properties set in `preserved_by_key` are not reset across a re-render.
  • preserved_by_key (list[str] | str | None, default "value"): A list of parameters from this component's constructor. Inside a gr.render() function, if a component is re-rendered with the same key, these (and only these) parameters are preserved in the UI (if they have been changed by the user or an event listener) instead of being re-rendered from the values provided to the constructor.
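As an illustration of how the constructor parameters above might be combined, here is a minimal sketch. The specific values (label text, width, id, and class names) and the helper name `make_component` are made up for the example; only the parameter names come from the list above. The package import is done lazily inside the helper so the settings can be inspected without Gradio installed.

```python
from __future__ import annotations

# Keyword arguments drawn from the parameter list; the values are illustrative only.
COMPONENT_KWARGS = {
    "label": "PDF file or URL",
    "show_label": True,
    "min_width": 320,
    "elem_id": "pdf-text",
    "elem_classes": ["pdf-input"],
}


def make_component(**overrides):
    """Hypothetical helper: build the component with the defaults above, plus overrides."""
    # Lazy import so this module loads even where the package is not installed.
    from gradio_simpletextextractfrompdf import SimpleTextExtractFromPDF

    return SimpleTextExtractFromPDF(**{**COMPONENT_KWARGS, **overrides})
```

Calling `make_component(interactive=False)` would, under these assumptions, build the same component with editing disabled.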

Events

  • submit
  • upload
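The source lists the `submit` and `upload` events without describing them. The sketch below assumes they follow Gradio's usual listener convention (`component.event(fn, inputs=..., outputs=...)`) and that `upload` fires after a file is uploaded while `submit` fires when a value is submitted; neither assumption is confirmed by this page. `word_count` and `build_demo` are hypothetical names.

```python
def word_count(text):
    """Count whitespace-separated words; a missing value counts as zero words."""
    return len((text or "").split())


def build_demo():
    """Hypothetical wiring of both events to the same handler inside gr.Blocks."""
    # Imported lazily so word_count can be used and tested without Gradio installed.
    import gradio as gr
    from gradio_simpletextextractfrompdf import SimpleTextExtractFromPDF

    with gr.Blocks() as demo:
        pdf_text = SimpleTextExtractFromPDF(label="PDF file or URL")
        count = gr.Number(label="Word count")
        # Assumption: both listeners accept the standard (fn, inputs, outputs) arguments.
        pdf_text.upload(word_count, inputs=pdf_text, outputs=count)
        pdf_text.submit(word_count, inputs=pdf_text, outputs=count)
    return demo


if __name__ == "__main__":
    build_demo().launch()
```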

User function

The impact on the user's predict function depends on whether the component is used as an input or an output for an event (or both).

  • When used as an input, the component only impacts the input signature of the user function.
  • When used as an output, the component only impacts the return signature of the user function.

The code snippet below is accurate in cases where the component is used as both an input and an output.

  • As input: the function is passed the extracted text as a string.
  • As output: the function should return a string, and the component's value is set to it.
def predict(
    value: str | None
) -> str | None:
    return value
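Because the signature above allows `None`, a user function may want to guard against a missing value before slicing or searching the text. The following is a minimal sketch of that pattern; the function name `summarize_text`, the `limit` parameter, and the whitespace cleanup are illustrative choices, not part of the component.

```python
from __future__ import annotations


def summarize_text(value: str | None, limit: int = 200) -> str | None:
    """Return a short preview of the extracted text, or None if there is none."""
    if value is None:
        return None
    # Collapse runs of spaces and newlines into single spaces before truncating.
    cleaned = " ".join(value.split())
    return cleaned[:limit]
```

A function like this could be passed as `fn` to `gr.Interface` in place of `first_200_chars` from the usage example.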
