Skip to main content

SDK for the Datalab document intelligence API

Project description

Datalab SDK

A Python SDK for the Datalab API - a document intelligence platform powered by marker and surya.

See the full documentation at https://documentation.datalab.to.

Installation

pip install datalab-python-sdk

Quick Start

Authentication

Get your API key from https://www.datalab.to/app/keys:

export DATALAB_API_KEY="your_api_key_here"

Basic Usage

from datalab_sdk import DatalabClient

client = DatalabClient() # use env var from above, or pass api_key="your_api_key_here"

# Convert PDF to markdown
result = client.convert("document.pdf")
print(result.markdown)

# OCR a document
ocr_result = client.ocr("document.pdf")
print(ocr_result.pages)  # Get all text as string

Workflows

Workflows allow you to chain multiple document processing steps together. Each workflow consists of one or more steps that can depend on previous steps.

Note: All workflow operations require authentication. Make sure you have set your DATALAB_API_KEY environment variable or pass api_key when creating the client (see Authentication section above).

For more Workflow tips, see our examples here and documentation.

CLI Usage

The SDK includes a command-line interface:

# Convert document to markdown
datalab convert document.pdf

# OCR with JSON output
datalab ocr document.pdf --output-format json

# Workflow commands
datalab create-workflow --help
datalab execute-workflow --help
datalab get-execution-status --help
datalab list-workflows --help
datalab get-workflow --help
datalab visualize-workflow --help

License

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datalab_python_sdk-0.1.14.tar.gz (2.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datalab_python_sdk-0.1.14-py3-none-any.whl (20.4 kB view details)

Uploaded Python 3

File details

Details for the file datalab_python_sdk-0.1.14.tar.gz.

File metadata

  • Download URL: datalab_python_sdk-0.1.14.tar.gz
  • Upload date:
  • Size: 2.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.15 {"installer":{"name":"uv","version":"0.9.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for datalab_python_sdk-0.1.14.tar.gz
Algorithm Hash digest
SHA256 fbfcddbb2e7acddca5efd5352356e1a44e4f0c19fb76e103857348e3299c5193
MD5 4565a1c6dcec50edc7088044403fdd93
BLAKE2b-256 3ad03cbc2fdeb856a18facf889e75fba16138bf0bec657f40635e56b55ff6d44

See more details on using hashes here.

File details

Details for the file datalab_python_sdk-0.1.14-py3-none-any.whl.

File metadata

  • Download URL: datalab_python_sdk-0.1.14-py3-none-any.whl
  • Upload date:
  • Size: 20.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.15 {"installer":{"name":"uv","version":"0.9.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for datalab_python_sdk-0.1.14-py3-none-any.whl
Algorithm Hash digest
SHA256 94d80f1967007fbe415d000c4e5de4ea7575ff091e3c4f1144a94a39dc299ff7
MD5 7884aeddf9ba5610ecc877dca00c1c47
BLAKE2b-256 09fa1c3f24fb863bfbc9a2cd08839f097ff9380fa41eb99a3ab90c0f017e6e7a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page