Python Client SDK for Unstructured API
Project description
Python SDK for the Unstructured API
This is a Python client for the Unstructured API.
SDK Installation
pip install unstructured-client
Usage
Only the files
parameter is required. See the general partition page for all available parameters.
from unstructured_client import UnstructuredClient
from unstructured_client.models import shared
from unstructured_client.models.errors import SDKError
s = UnstructuredClient(api_key_auth="YOUR_API_KEY")
filename = "sample-docs/layout-parser-paper-fast.pdf"
with open(filename, "rb") as f:
# Note that this currently only supports a single file
files=shared.Files(
content=f.read(),
file_name=filename,
)
req = shared.PartitionParameters(
files=files,
# Other partition params
strategy='ocr_only',
languages=["eng"],
)
try:
resp = s.general.partition(req)
print(resp.elements[0])
except SDKError as e:
print(e)
# {
# 'type': 'UncategorizedText',
# 'element_id': 'fc550084fda1e008e07a0356894f5816',
# 'metadata': {
# 'filename': 'layout-parser-paper-fast.pdf',
# 'filetype': 'application/pdf',
# 'languages': ['eng'],
# 'page_number': 1
# }
# }
Change the base URL
If you are self hosting the API, or developing locally, you can change the server URL when setting up the client.
# Using a local server
s = unstructured_client.UnstructuredClient(
server_url="http://localhost:8000",
api_key_auth=api_key,
)
# Using your own server
s = unstructured_client.UnstructuredClient(
server_url="https://your-server",
api_key_auth=api_key,
)
Custom HTTP Client
The Python SDK makes API calls using the (requests)[https://pypi.org/project/requests/] HTTP library. In order to provide a convenient way to configure timeouts, cookies, proxies, custom headers, and other low-level configuration, you can initialize the SDK client with a custom requests.Session
object.
For example, you could specify a header for every request that this sdk makes as follows:
import unstructured_client
import requests
http_client = requests.Session()
http_client.headers.update({'x-custom-header': 'someValue'})
s = unstructured_client.UnstructuredClient(client: http_client)
Maturity
This SDK is in beta, and there may be breaking changes between versions without a major version update. Therefore, we recommend pinning usage to a specific package version. This way, you can install the same version each time without breaking changes unless you are intentionally looking for the latest version.
Contributions
While we value open-source contributions to this SDK, this library is generated programmatically. Feel free to open a PR or a Github issue as a proof of concept and we'll do our best to include it in a future release!
SDK Created by Speakeasy
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for unstructured-client-0.15.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6769871507b80f8ab37114eb0e7289aec9b80204cc22e5de8bcbb07c02d43e19 |
|
MD5 | f6f63b4991c1ff39a6ada83336647fdb |
|
BLAKE2b-256 | 173c69202085e278b1b8f51b07ec7efea3cfffc5aa0de764a9e8b92279db7c4f |
Hashes for unstructured_client-0.15.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c1a4f1dd8128eb08c2b07b404912111147f0135bdd2e6736b1f9b5a451b13c7d |
|
MD5 | 2be6080ce89cbf2ce8806698b6420950 |
|
BLAKE2b-256 | 663cbbf7fda1ee3b1f70de6bc16670038e373c4d131aeb56a11c2ed49023143e |