Docling LangChain integration
Project description
Docling LangChain integration
A Docling integration for LangChain.
Installation
Simply install langchain-docling
from your package manager, e.g. pip:
pip install langchain-docling
Development setup
To develop for Docling Core, you need Python >=3.9 <=3.13 and uv. You can then install from your local clone's root dir:
uv sync
Usage
Basic usage
Basic usage of DoclingLoader
looks as follows:
from langchain_docling import DoclingLoader
FILE_PATH = ["https://arxiv.org/pdf/2408.09869"] # Docling Technical Report
loader = DoclingLoader(file_path=FILE_PATH)
docs = loader.load()
Advanced usage
When initializing a DoclingLoader
, you can use the following parameters:
file_path
: source as single str (URL or local file) or iterable thereofconverter
(optional): any specific Docling converter instance to useconvert_kwargs
(optional): any specific kwargs for conversion executionexport_type
(optional): export mode to use:ExportType.DOC_CHUNKS
(default) orExportType.MARKDOWN
md_export_kwargs
(optional): any specific Markdown export kwargs (for Markdown mode)chunker
(optional): any specific Docling chunker instance to use (for doc-chunk mode)meta_extractor
(optional): any specific metadata extractor to use
Docs and examples
For more details and usage examples, check out this page.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file langchain_docling-1.0.0.tar.gz
.
File metadata
- Download URL: langchain_docling-1.0.0.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
c46af24e8d714a245913c9f71b39660c1f664a02dbe36e38ba0997483b8a78d3
|
|
MD5 |
a2fe779b3eba7fd1582f644594d29562
|
|
BLAKE2b-256 |
7520c1212f64b51bdb0d0c2e219c4c6d199f8b4128191cd254c46ae04dd327d4
|
Provenance
The following attestation bundles were made for langchain_docling-1.0.0.tar.gz
:
Publisher:
pypi.yml
on docling-project/docling-langchain
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1
-
Predicate type:
https://docs.pypi.org/attestations/publish/v1
-
Subject name:
langchain_docling-1.0.0.tar.gz
-
Subject digest:
c46af24e8d714a245913c9f71b39660c1f664a02dbe36e38ba0997483b8a78d3
- Sigstore transparency entry: 234519981
- Sigstore integration time:
-
Permalink:
docling-project/docling-langchain@e881cf34905d93bec8aa81dcc92cd4f283110c59
-
Branch / Tag:
refs/tags/v1.0.0
- Owner: https://github.com/docling-project
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com
-
Runner Environment:
github-hosted
-
Publication workflow:
pypi.yml@e881cf34905d93bec8aa81dcc92cd4f283110c59
-
Trigger Event:
release
-
Statement type:
File details
Details for the file langchain_docling-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: langchain_docling-1.0.0-py3-none-any.whl
- Upload date:
- Size: 5.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
10d37812774e49da58ee7025bdf925bbdfc50691ea925453cd5e2cff6587abe4
|
|
MD5 |
77a01c92de21162af931e4908a88150a
|
|
BLAKE2b-256 |
70b5b528f0e223cc1841f11cd5b894f7cedc98b09b25bfbf074b733bc98f6435
|
Provenance
The following attestation bundles were made for langchain_docling-1.0.0-py3-none-any.whl
:
Publisher:
pypi.yml
on docling-project/docling-langchain
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1
-
Predicate type:
https://docs.pypi.org/attestations/publish/v1
-
Subject name:
langchain_docling-1.0.0-py3-none-any.whl
-
Subject digest:
10d37812774e49da58ee7025bdf925bbdfc50691ea925453cd5e2cff6587abe4
- Sigstore transparency entry: 234519984
- Sigstore integration time:
-
Permalink:
docling-project/docling-langchain@e881cf34905d93bec8aa81dcc92cd4f283110c59
-
Branch / Tag:
refs/tags/v1.0.0
- Owner: https://github.com/docling-project
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com
-
Runner Environment:
github-hosted
-
Publication workflow:
pypi.yml@e881cf34905d93bec8aa81dcc92cd4f283110c59
-
Trigger Event:
release
-
Statement type: