Konfuzio Software Development Kit

Konfuzio SDK

The Konfuzio Software Development Kit (Konfuzio SDK) provides a Python API to interact with the Konfuzio Server.

Features

The SDK allows you to retrieve visual and text features to build your own document models. Konfuzio Server serves as a UI to define the data structure, manage training and test data, and deploy your models as an API.

| Function | Public Host Free* | On-Site (Paid) |
| --- | --- | --- |
| OCR Text | :heavy_check_mark: | :heavy_check_mark: |
| OCR Handwriting | :heavy_check_mark: | :heavy_check_mark: |
| Text Annotation | :heavy_check_mark: | :heavy_check_mark: |
| PDF Annotation | :heavy_check_mark: | :heavy_check_mark: |
| Image Annotation | :heavy_check_mark: | :heavy_check_mark: |
| Table Annotation | :heavy_check_mark: | :heavy_check_mark: |
| Download HOCR | :heavy_check_mark: | :heavy_check_mark: |
| Download Images | :heavy_check_mark: | :heavy_check_mark: |
| Download PDF with OCR | :heavy_check_mark: | :heavy_check_mark: |
| Deploy AI models | :heavy_multiplication_x: | :heavy_check_mark: |

* Subject to a fair use policy: usage may eventually be throttled to 10 pages per hour.

| Resource | Description |
| --- | --- |
| :ledger: Docs | Read the docs |
| :floppy_disk: Installation | How to install the Konfuzio SDK |
| :mortar_board: Tutorials | See what the Konfuzio SDK can do with our Notebooks & Scripts |
| :bulb: Explanations | Links to teaching material about the Konfuzio SDK |
| :gear: API Reference | Python classes, methods, and functions |
| :heart: Contributing | Learn how to contribute! |
| :bug: Issue Tracker | Report and monitor Konfuzio SDK issues |
| :telescope: Changelog | Review the release notes |
| :newspaper: MIT License | Review the license |

Installation

As a developer, register for free on our public host: https://app.konfuzio.com

Then use pip to install the Konfuzio SDK and run init:

pip install konfuzio_sdk

konfuzio_sdk init

The init command creates a Token to connect to the Konfuzio Server and stores the variables KONFUZIO_USER, KONFUZIO_TOKEN and KONFUZIO_HOST in a .env file in your working directory.
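
To confirm that init wrote the three variables named above, a small check along these lines may help. This is a minimal sketch that uses only the standard library and assumes the .env file holds plain KEY=VALUE lines; the helper names read_env_file and missing_keys are hypothetical and not part of the SDK.

```python
from pathlib import Path

# Variable names taken from the installation notes above.
REQUIRED_KEYS = ("KONFUZIO_USER", "KONFUZIO_TOKEN", "KONFUZIO_HOST")


def read_env_file(path):
    """Parse simple KEY=VALUE lines, skipping blank lines and comments.

    Assumption: the file uses plain KEY=VALUE syntax, optionally quoted.
    """
    values = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(values):
    """Return the required variable names that are absent or empty."""
    return [key for key in REQUIRED_KEYS if not values.get(key)]
```

Calling missing_keys(read_env_file(".env")) in the working directory returns an empty list when all three variables are present.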

Find the full installation guide here or set up PyCharm as described here.

CLI

The CLI provides a basic command to create a new Project:

konfuzio_sdk create_project YOUR_PROJECT_NAME

On success, the CLI prints "Project {YOUR_PROJECT_NAME} (ID {YOUR_PROJECT_ID}) was created successfully!".

You can also download any Project by its ID:

konfuzio_sdk export_project YOUR_PROJECT_ID
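
When scripting the two commands together, the Project ID can be read from the success message and passed on to the export command. The sketch below assumes the message has exactly the format quoted above, which may differ between SDK versions; parse_created_project and export_command are hypothetical helper names, not SDK functions.

```python
import re

# Pattern derived from the success message quoted in this section.
CREATED_PATTERN = re.compile(
    r"Project (?P<name>.+) \(ID (?P<project_id>\d+)\) was created successfully!"
)


def parse_created_project(message):
    """Extract (name, project_id) from the create_project success message."""
    match = CREATED_PATTERN.search(message)
    if match is None:
        raise ValueError("Message does not look like a create_project success line")
    return match.group("name"), int(match.group("project_id"))


def export_command(project_id):
    """Build the argument list for the export_project CLI command shown above."""
    return ["konfuzio_sdk", "export_project", str(project_id)]
```

The returned list can be handed to subprocess.run once the SDK is installed and initialized.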

Tutorials

You can find detailed examples of how to set up and run document AI pipelines in our Tutorials, including:

Basics

Here we show how to use the Konfuzio SDK to retrieve data hosted on a Konfuzio Server instance.

from konfuzio_sdk.data import Project, Document

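# Initialize the Project.
# Names declared with a bare type annotation (such as YOUR_PROJECT_ID) are
# placeholders: assign your own values before running.
YOUR_PROJECT_ID: int
my_project = Project(id_=YOUR_PROJECT_ID)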

# Get any Document online
DOCUMENT_ID_ONLINE: int
doc: Document = my_project.get_document_by_id(DOCUMENT_ID_ONLINE)

# Get the Annotations in a Document
doc.annotations()

# Filter Annotations by Label
MY_OWN_LABEL_NAME: str
label = my_project.get_label_by_name(MY_OWN_LABEL_NAME)
doc.annotations(label=label)

# Or get all Annotations that belong to one Category
YOUR_CATEGORY_ID: int
category = my_project.get_category_by_id(YOUR_CATEGORY_ID)
label.annotations(categories=[category])

# Force a Project update. To save time Documents will only be updated if they have changed.
my_project.get(update=True)
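
Once Annotations have been retrieved as shown above, a common next step is to group their text by Label. The sketch below is duck-typed and does not import the SDK: it assumes each Annotation exposes `label.name` and `offset_string` attributes, which should be checked against the API Reference for your SDK version. The stand-in objects and their values are purely illustrative.

```python
from collections import defaultdict
from types import SimpleNamespace


def group_texts_by_label(annotations):
    """Map each Label name to the list of annotated texts.

    Assumption: every item has `label.name` and `offset_string` attributes.
    """
    grouped = defaultdict(list)
    for annotation in annotations:
        grouped[annotation.label.name].append(annotation.offset_string)
    return dict(grouped)


# Illustrative stand-ins for Annotations; in practice pass doc.annotations().
demo_annotations = [
    SimpleNamespace(label=SimpleNamespace(name="Date"), offset_string="2023-06-09"),
    SimpleNamespace(label=SimpleNamespace(name="Total"), offset_string="140.20"),
    SimpleNamespace(label=SimpleNamespace(name="Date"), offset_string="2023-06-10"),
]
demo_grouped = group_texts_by_label(demo_annotations)
```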


Download files

Source Distribution

konfuzio_sdk-0.2.21.dev20230609211212.tar.gz (140.2 kB)

SHA256: 8708f8bb02fc322b3689ba3e09442616d2bdecd765096178315c95f2de7850e8
MD5: 486daece03cbe57e0a8cc072c90a9313
BLAKE2b-256: 4e782ab49d98d5d5b8feb7b7e32f89496902d26a90f41c5d2d6d9d22f1974ad1

Built Distribution

konfuzio_sdk-0.2.21.dev20230609211212-py3-none-any.whl

SHA256: 999a74bbdfeeb1bc4ac3dd263f1c7a32dab9d9d1e4bd42ecf6d79a012e88ed93
MD5: ff825132bca20d7b943e54d1de3a7d4c
BLAKE2b-256: 8e1e18ea643ec305756c8bd11814763cf1097e5e2c3b10904c185c4df3289e9a
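
A downloaded file can be checked against the SHA256 digests listed above before installing it. The following is a minimal sketch using Python's standard hashlib module; the function names and the chunk size are illustrative choices, not part of the SDK.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Return True if the file's SHA256 digest equals the published one."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, pass the path of the downloaded .tar.gz file together with its published SHA256 value to matches_expected and install only if it returns True.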
