Async client for SVO semantic chunker microservice.
Project description
svo-client
Async Python client for the SVO semantic chunker. Canonical API: SvoChunkerClient (adapter / JsonRpcClient only).
Install
pip install -e .
# or: pip install svo-client
Quick start
import asyncio
from svo_client import SvoChunkerClient
async def main():
async with SvoChunkerClient(
protocol="mtls",
host="127.0.0.1",
port=8009,
cert="mtls_certificates/client/svo-chunker.crt",
key="mtls_certificates/client/svo-chunker.key",
ca="mtls_certificates/ca/ca.crt",
check_hostname=False,
) as client:
chunks = await client.chunk(
text="Your text.",
chunk_type="Comment",
aggregation_mode="paragraph",
respect_structural_blocks=True,
language="en",
)
print(len(chunks))
asyncio.run(main())
Current Chunker Parameters
chunk_type="Comment"is passed as RPC fieldtype="Comment"for comments, docstrings, or other pre-extracted blocks. The server owns the Comment keep-block policy.aggregation_modeis optional. Supported values are"sentence","none", and"paragraph". When omitted, the client does not send the field.respect_structural_blocksis optional. UseTrueorFalseto sendrespect_structural_blocks; omit it to preserve the server default.
from svo_client import build_file_chunk_params
from svo_client.file_processor import file_chunk_from_local_path
chunks = await client.chunk(
text="# module comment",
chunk_type="Comment",
aggregation_mode="paragraph",
respect_structural_blocks=True,
)
batch = await client.chunk_batch(
["first paragraph", "second paragraph"],
aggregation_mode="paragraph",
respect_structural_blocks=False,
)
params = build_file_chunk_params(
filename="comments.py",
file_content_base64=encoded,
chunk_type="Comment",
aggregation_mode="paragraph",
respect_structural_blocks=True,
)
result = await client.file_chunk(params)
result = await file_chunk_from_local_path(
client,
"comments.py",
chunk_type="Comment",
aggregation_mode="paragraph",
respect_structural_blocks=True,
)
CLI: svo-client or python -m svo_client — thin wrapper; local file chunking orchestration is file_chunk_from_local_path in svo_client.file_processor. Text chunking: chunk (default) and chunk text; file chunking: chunk file or file-chunk (alias); ws prints inbound WebSocket JSON lines to stdout. --help lists all flags. RPC discovery: svo-chunker (list / help / run).
svo-client chunk text -i sample.txt -o out.json --format json --chunk-type Comment
svo-client file-chunk -i sample.md -o out.jsonl --aggregation-mode paragraph --respect-structural-blocks
svo-client file-chunk -i comments.py --no-download --chunk-type Comment --no-respect-structural-blocks
Configuration precedence (CLI → SVO_CHUNKER_CLIENT_* → JSON), --config, and CLI details (ws, --quiet-status, file-chunk status on stderr): svo_client/README.md.
Tests
.venv/bin/python -m pytest tests/ -q
Live integration (chunker on 127.0.0.1:8009): port must be listening or free for managed spawn; see tests/conftest.py.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file svo_client-3.1.1.tar.gz.
File metadata
- Download URL: svo_client-3.1.1.tar.gz
- Upload date:
- Size: 65.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
901121d880ff08eed74183bf3aa62d56eb7825755dc51d300ec1c5cb8c98ca5a
|
|
| MD5 |
8889819953a9ebf4bfc582682bc9f9a1
|
|
| BLAKE2b-256 |
a9d0c4901a0b9cd773a56b12a96479a50e4d28a78bf7c9c88a1bbb8a2af56667
|
File details
Details for the file svo_client-3.1.1-py3-none-any.whl.
File metadata
- Download URL: svo_client-3.1.1-py3-none-any.whl
- Upload date:
- Size: 86.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a8705a34bc952c0365cd0e5cff6d17c203485c560616f93f31672cf8a02fee2f
|
|
| MD5 |
02ac6b65d0fa9c5e83d1a2d7d3f1cf0d
|
|
| BLAKE2b-256 |
664464a841ef244b4c32e05d2abcc5a74caa0452369a39bb2abc31236c4f6662
|