Unified ingestion tool for Open Knowledge Format (OKF) bundles — validate, load into a portable DuckDB catalog, and semantically search.
Project description
okf (Python binding)
Python binding of okf-ingest — a unified ingestion tool for Open Knowledge Format (OKF) bundles. Validate a bundle, load it into a portable DuckDB catalog, and semantically search it.
pip install okf-ingest
okf validate ./bundle
okf ingest ./bundle --db catalog.duckdb
okf embed catalog.duckdb # uses local Ollama nomic-embed-text by default
okf rag catalog.duckdb --query "how is revenue computed?" -k 5
import okf
con, summary = okf.ingest("./bundle", db_path="catalog.duckdb")
okf.embed(con) # pluggable embedder
okf.rag_search(con, "revenue", k=5)
The catalog format is shared with the R binding (okf on CRAN-style install),
so you can ingest/embed in one language and query from the other. See the
project README and docs/ for the
full spec-conformance notes and architecture.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file okf_ingest-0.1.0.tar.gz.
File metadata
- Download URL: okf_ingest-0.1.0.tar.gz
- Upload date:
- Size: 9.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3af7f1ae3a82476561d5a44d440f8f737fe36d6a3c097105d3fa9426b3fcd2de
|
|
| MD5 |
9d94f1dea6415651f384d4688885eaab
|
|
| BLAKE2b-256 |
eff1280ab8a4b0e7e91aed0242af839a9d8cda27ea790972056b7a167b48810a
|
File details
Details for the file okf_ingest-0.1.0-py3-none-any.whl.
File metadata
- Download URL: okf_ingest-0.1.0-py3-none-any.whl
- Upload date:
- Size: 9.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
22436013131b37cbabb5717921f8fe140fbc81c1ccfe7e7ec536c5cb6f68f083
|
|
| MD5 |
c8fe8e9a2bdf6ebd741e535e3f39bb48
|
|
| BLAKE2b-256 |
280e9a7f1f87d084d178ef56156f05e10f3c07040decb8d2ded1d8738c96917e
|