Open-source knowledge-base ingestion and retrieval service.

These details have not been verified by PyPI

Project description

ByKC

Beyond Knowledge Core · Enterprise Knowledge Hub

From document ingestion to multi-hop reasoning — a knowledge foundation for enterprise AI agents.

English | 中文

Project Positioning

ByKC (Beyond Knowledge Core) is an open-source enterprise knowledge reasoning engine — the company's "digital senior expert". It weaves information scattered across messaging apps, email, meetings, and code into a living knowledge network, so newcomers can tap into the experience of senior staff from day one.

Technically, ByKC aims to improve the accuracy of existing RAG knowledge bases on complex QA scenarios.

The core flow of traditional RAG is "retrieve top-K chunks → concatenate → generate", which has systemic gaps when facing real business questions:

Traditional RAG Problem	ByKC Approach
Compound questions fail — "How do A and B differ?" needs separate retrievals to compare; a single retrieval mixes targets and yields low-relevance hits	Sub-question decomposition — Splits a compound question into independent sub-questions, each issuing its own retrieval. Single-target retrievals produce more relevant hits Example: "How do product A and B differ in battery life and weight?" → split into 4 sub-retrievals: "A battery life", "A weight", "B battery life", "B weight"
Multi-hop reasoning breaks — "What projects does Zhang San's manager own?" has chained dependencies; one retrieval cannot fetch the full answer	Step-by-step iterative retrieval — Runs retrieval in iterative rounds along the reasoning chain, rebuilding the next query from the previous round's result, advancing until the full evidence chain is gathered Example: "What projects does Zhang San's manager own?" → round 1 retrieves "Who is Zhang San's manager" → gets "Li Si" → round 2 retrieves "Projects owned by Li Si"
Cross-KB silos — Information lives in multiple knowledge bases; a single retrieval only hits one source	KBs unified as agent tools — Exposes multiple knowledge bases as a standard agent tool set. The agent calls them in parallel and aggregates results uniformly Example: "What's the company's remote work policy?" → calls HR-policy, IT-security, and finance-reimbursement KBs in parallel, then aggregates into a complete answer

To deliver these capabilities, ByKC adopts a layered architecture — the reasoning engines connect to knowledge storage through an adapter layer. You can use the built-in knowledge base end-to-end, or layer ByKC directly on top of your existing RAG infrastructure:

graph TD
    subgraph engine [QA Reasoning Engines]
        direction LR
        Fast[<b>Fast Engine</b>]
        Instant[<b>Instant Engine</b>]
    end

    subgraph adapter [Adapter Layer]
        Dispatcher[<b>ServiceToolDispatcher</b>]
    end

    subgraph storage [Knowledge Storage]
        subgraph builtin [ByKC Built-in KB]
            KB["knowledge_base
Document mgmt · Metadata · Hybrid search"]
            Build["knowledge_build
Parse · Chunk · Embed"]
            subgraph infra [Infrastructure]
                OG[OpenGauss]
                MIO[MinIO]
            end
        end
        External["Existing Enterprise KB
RAG / Vector DB / Doc System"]
    end

    Fast --> Dispatcher
    Instant --> Dispatcher
    Dispatcher --> builtin
    Dispatcher --> External
    KB --> infra
    Build --> infra

    style engine fill:#dbeafe,stroke:#3b82f6,stroke-width:2px,color:#000
    style adapter fill:#fef3c7,stroke:#f59e0b,stroke-width:2px,color:#000
    style storage fill:#d1fae5,stroke:#10b981,stroke-width:2px,color:#000
    style builtin fill:#ecfdf5,stroke:#10b981,color:#000
    style infra fill:#f3f4f6,stroke:#6b7280,color:#000
    style Fast fill:#bfdbfe,stroke:#3b82f6,color:#000
    style Instant fill:#bfdbfe,stroke:#3b82f6,color:#000
    style Dispatcher fill:#fde68a,stroke:#f59e0b,color:#000
    style External fill:#fee2e2,stroke:#ef4444,color:#000
    style OG fill:#f3f4f6,stroke:#6b7280,color:#000
    style MIO fill:#f3f4f6,stroke:#6b7280,color:#000
    style KB fill:#ecfdf5,stroke:#10b981,color:#000
    style Build fill:#ecfdf5,stroke:#10b981,color:#000

Tech Stack

Layer	Technology
API Framework	FastAPI 0.115+ · Pydantic v2
AI Orchestration	LangGraph 0.2+ · LangChain
LLM / Embedding	OpenAI-compatible API (any compatible endpoint)
Document Parsing	PyMuPDF (PDF) · python-docx (Word) · python-pptx (PPT) · openpyxl (Excel)
Database	OpenGauss (PostgreSQL-compatible, with vector search)
Full-text Search	OpenGauss FTS · Jieba (Chinese tokenization)
Object Storage	MinIO (S3-compatible)
Cache / Service Registry	Redis
Runtime	Python 3.12+ · uv

Core Features

Dual-mode QA engines — Fast Engine handles simple questions; Instant Engine handles multi-hop complex questions. One codebase, switch by scenario.
AgentOverride hot-swap — Each reasoning node (decomposer, retrieval agent, aggregator, generator) supports independent replacement of prompt / middleware / tools, so the same engine adapts to legal, customer service, R&D, and other domains.
Knowledge bases as a tool set — ServiceToolDispatcher automatically converts remote knowledge-base APIs into LangGraph tools (search / listDir / glob / readFile). The QA engine is not bound to any specific storage and works with any compatible service.
Metadata management and structured retrieval — Supports custom metadata fields, file-level metadata write/read, metadata field enumeration, and Agent DSL-based structured filtering. The same knowledge base can run full-text, vector, or hybrid retrieval.

Core Design

Dual Engines: Fast vs Instant

	Fast Engine	Instant Engine
Scenario	Factual lookup, definition, single information point	Comparison, multi-condition filtering, cross-document synthesis
Flow	Linear: rewrite → retrieve → generate	Graph: decompose → parallel agents → aggregate → final answer
Latency	Low (single-round LLM + single retrieval)	Higher (multi-round tool calls, parallel sub-questions offset some latency)
Example	"What is the reimbursement process?"	"How do products A and B differ in reliability and performance?"

AgentOverride: Per-Node Behavior Configuration

Every agent node in both engines can be configured independently via AgentOverride — no engine code changes needed:

from by_qa.qa.common.config import AgentOverride

config = {
    "agents": {
        # Instant engine nodes:
        #   decomposer_agent / single_hop_agent / multi_hop_agent /
        #   multi_hop_summary_agent / aggregator_agent
        "single_hop_agent": AgentOverride(
            prompt="You are a legal document assistant. Always cite the original clause number...",
            middleware=[YourCustomMiddleware()],
            tools=[your_extra_tool],
        ),
        # Fast engine nodes:
        #   rewriter_agent / answer_agent
        "answer_agent": AgentOverride(
            prompt="Answer in three sentences or fewer, friendly tone...",
        ),
    }
}

Same engine code: legal scenarios swap in a strict-citation prompt, customer service injects a concise style, R&D adds a code search tool.

ServiceToolDispatcher: Knowledge Base → Agent Tools

The QA engine never accesses a database directly; it converts remote knowledge-base services into LangGraph tools:

dispatcher = ServiceToolDispatcher(knowledge_bases=[
    KnowledgeBaseConfig(
        kb_code="product_docs",
        service_name="by-qa-manager",
        operations={
            OperationType.KNOWLEDGE_SEARCH: "/api/v1/knowledgeItems/search",
            OperationType.LIST_DIR: "/api/v1/listDir",
            OperationType.GLOB: "/api/v1/glob",
            OperationType.READ_FILE: "/api/v1/readFile",
        }
    )
])
tools = dispatcher.build_tools()
# → [search_knowledge, list_directory, glob_search, read_file]

Works with any service that implements the same protocol — not tied to ByKC's own knowledge_base module
Agents get full knowledge-exploration powers: search, list directories, glob, read original content
Parallel retrieval across multiple KBs, results unified by relevance

Quick Start

Requirements

Python 3.12+
uv (recommended) or pip
Docker (required by the knowledge base module)

Install

# pip install (recommended)
pip install by-qa[all]

# Or via uv
uv pip install by-qa[all]

# Install on demand
pip install by-qa[knowledge]   # knowledge base only
pip install by-qa[qa]          # QA engines only

From source:

git clone https://github.com/beyonai/ByKC.git && cd ByKC
uv sync --all-extras

Start Middleware

make kb-stack-up   # Bring up OpenGauss + MinIO + Redis in one shot

Configure

cp .env.example .env

Key variables:

LLM_BASE_URL=http://your-llm/v1       # OpenAI-compatible endpoint
LLM_API_KEY=your-key
LLM_STANDARD_MODEL=gpt-4o             # Primary reasoning
LLM_LIGHTWEIGHT_MODEL=gpt-4o-mini     # Decomposition / rewrite

EMBEDDING_BASE_URL=http://your-embedding
EMBEDDING_MODEL_NAME=bge-m3
EMBEDDING_DIMENSION=1024

Run

by-qa

Visit /health to see loaded modules and /docs for the knowledge base API reference.

End-to-End Walkthrough

After installing by-qa via pip/uv, you can run the full pipeline:

# Install
pip install by-qa[all]
# Or
uv pip install by-qa[all]

# Configure env vars (LLM, embedding, middleware addresses)
cp .env.example .env && vi .env

# Start the service
by-qa

Once the service is up, call the APIs in order to build knowledge and ask questions:

# 1. Create a knowledge base
curl -X POST http://127.0.0.1:8000/api/v1/knowledgeBases/create \
  -H "Content-Type: application/json" \
  -d '{"knName": "my-docs", "knDescription": "Product docs"}'
# → {"resultObject": {"knCode": "74", ...}}

# 2. Import a file (PDF/Word/PPT/Excel/Markdown/CSV supported)
curl -X POST http://127.0.0.1:8000/api/v1/knowledgeItems/import \
  -F "knCode=74" \
  -F "filePath=/docs/handbook.md" \
  -F "fileContent=@handbook.md"

# 3. Trigger parsing → chunking → embedding (async background)
curl -X POST http://127.0.0.1:8000/api/v1/fileToMarkdownIndex \
  -H "Content-Type: application/json" \
  -d '{"knCode": "74", "filePath": "/docs/handbook.md"}'

# 4. Check build status (status=complete means ready)
curl -X POST http://127.0.0.1:8000/api/v1/fileBuildStatus \
  -H "Content-Type: application/json" \
  -d '{"knCode": "74", "filePath": "/docs/handbook.md"}'

# 5. Verify retrieval
curl -X POST http://127.0.0.1:8000/api/v1/knowledgeItems/search \
  -H "Content-Type: application/json" \
  -d '{"knCodeList": ["74"], "query": "how to use", "topK": 3, "searchMode": "mixedRecall"}'

If you need to define business metadata fields, write file metadata, run pure structured filtering, or add where filters to full-text / vector / hybrid retrieval, see the separate Metadata and Retrieval Extension API.

After knowledge is built, use the QA scripts in the repo to ask questions:

# Instant engine (multi-hop, for complex questions)
python examples/e2e_kb_qa/run_instant_qa.py --query "What's the difference between A and B?"

# Fast engine (linear pipeline, for simple questions)
python examples/e2e_kb_qa/run_instant_qa.py --mode fast --query "What is the reimbursement process?"

# More options
python examples/e2e_kb_qa/run_instant_qa.py --help

Usage

Knowledge Base API

The knowledge base documentation is split into two parts:

Base knowledge-base APIs: document and directory management, knowledge build, original-content reading, and file download. See Knowledge Module API
Metadata and retrieval extension APIs: metadata field management, file-level metadata maintenance, structured retrieval, and DSL-filtered retrieval. See Metadata and Retrieval Extension API

Current built-in capabilities include:

Document and directory management: knowledge bases, directories, file import, content read, original file download
Knowledge build: parsing, chunking, embedding, build status tracking
Metadata management: metadata field definition, batch creation, deletion, file-level metadata update/read, global field enumeration
Retrieval modes: full-text, vector, and hybrid retrieval
Structured filtering: Agent DSL filters over custom fields and system fields such as fileName, fileType, fileSize, mimeType, createdAt, updatedAt, and filePath
File-level recall: searchFile aggregates multi-chunk hits by file, which is useful when you want to shortlist files before reading the original content

QA Engines (Code-Level Integration)

The QA engines are code-level entry points for upper-layer agent frameworks or business services to integrate:

from by_qa.qa.engines.instant.engine import InstantQAEngine
from by_qa.qa.engines.fast.engine import FastQAEngine
from by_qa.qa.common.models import CoreInput
from by_qa.qa.common.config import AgentOverride

retrieval_config = {
    "knowledge_bases": [{
        "kb_code": "my_kb",
        "kb_name": "Product docs",
        "service_name": "by-qa-manager",
        "base_url": "http://127.0.0.1:8000",
        "operations": {
            "knowledgeSearch": "/api/v1/knowledgeItems/search",
            "listDir": "/api/v1/listDir",
            "readFile": "/api/v1/readFile",
        }
    }]
}

# Simple question → Fast
async with FastQAEngine({"retrieval": retrieval_config}) as engine:
    async for event in engine.stream_search(CoreInput(query="What is the reimbursement process?")):
        if event.type == "token":
            print(event.data["content"], end="")

# Complex question → Instant + customized agent behavior
async with InstantQAEngine({
    "retrieval": retrieval_config,
    "agents": {"single_hop_agent": AgentOverride(prompt="When citing original text, include the page number...")}
}) as engine:
    async for event in engine.stream_search(CoreInput(query="How do the breach clauses in the two contracts differ?")):
        if event.type == "token":
            print(event.data["content"], end="")

Evaluation

A standardized evaluation framework is built in. Currently supports the FRAMES multi-hop QA benchmark:

uv sync --extra eval --extra qa
uv run python -m eval.cli download frames
uv run python -m eval.cli ingest frames --kb-base-url http://127.0.0.1:8000
uv run python -m eval.cli run frames --mode instant --sample 10

Adding a new dataset: implement the DatasetSpec protocol under eval/datasets/<name>/.

Project Structure

src/by_qa/
├── main.py                 # FastAPI entry, dynamic module registration
├── config.py               # Pydantic Settings
├── core/                   # ModelConfigProvider protocol, logging, service discovery
├── knowledge_base/
│   ├── api/                # REST routes
│   ├── services/           # KB management, ingestion, metadata, retrieval
│   ├── repositories/       # OpenGauss data access
│   └── infrastructure/     # DB connection pool, MinIO client
├── knowledge_build/
│   └── services/           # Document parsing, semantic chunking, embedding
├── knowledge_common/       # Cross-module shared schemas
└── qa/
    ├── common/
    │   ├── base_engine.py          # BaseQAEngine abstract base class
    │   ├── config.py               # QAEngineConfig / AgentOverride / QARetrievalConfig
    │   ├── models.py               # StreamEvent / CoreInput / CoreOutput
    │   ├── operation_registry.py   # Tool registry
    │   └── middleware/             # ToolCallGuardMiddleware
    ├── agents/                     # Reusable subgraphs
    │   ├── query_decomposer.py
    │   ├── single_hop_react.py
    │   ├── multi_hop_react.py
    │   ├── subanswer_aggregator.py
    │   └── answer_synthesizer.py
    ├── engines/
    │   ├── instant/                # Instant engine
    │   └── fast/                   # Fast engine
    ├── services/                   # LLMService, CheckpointerFactory
    └── tools/                      # ServiceToolDispatcher

Roadmap

AgentOverride extensions: support MCP Server, external Tools, and Skills as agent capability extensions, for seamless integration with the external tool ecosystem
Knowledge base metadata foundation: custom metadata fields, file-level metadata maintenance, structured retrieval, and full-text / vector / hybrid retrieval modes
Structured anchors: tag unstructured documents with business master data, automatically linking documents to business entities
Compounding flywheel: user graph → enterprise graph, turning personal knowledge sediment into reusable organizational assets

License

MIT License

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.12

Jun 1, 2026

0.1.11

May 25, 2026

0.1.10

May 20, 2026

0.1.9

May 15, 2026

0.1.8

May 8, 2026

0.1.7

Apr 30, 2026

0.1.6

Apr 30, 2026

0.1.5

Apr 23, 2026

0.1.4

Apr 18, 2026

0.1.3

Apr 18, 2026

0.1.2

Apr 15, 2026

0.1.1

Apr 11, 2026

0.1.0

Apr 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

by_qa-0.1.12.tar.gz (348.5 kB view details)

Uploaded Jun 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

by_qa-0.1.12-py3-none-any.whl (187.9 kB view details)

Uploaded Jun 1, 2026 Python 3

File details

Details for the file by_qa-0.1.12.tar.gz.

File metadata

Download URL: by_qa-0.1.12.tar.gz
Upload date: Jun 1, 2026
Size: 348.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for by_qa-0.1.12.tar.gz
Algorithm	Hash digest
SHA256	`d77e594cd579076830e73c0409bb77e8216f5c25fa62fc274d8c70119d355db5`
MD5	`b317f270244f1f140a5ef4933df160fe`
BLAKE2b-256	`f1d7db8757b1b6768b940c173a0bc20bac2939a487973f1fdc7ba58d71bd6ce4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for by_qa-0.1.12.tar.gz:

Publisher: release.yml on beyonai/ByKC

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: by_qa-0.1.12.tar.gz
- Subject digest: d77e594cd579076830e73c0409bb77e8216f5c25fa62fc274d8c70119d355db5
- Sigstore transparency entry: 1690335370
- Sigstore integration time: Jun 1, 2026
Source repository:
- Permalink: beyonai/ByKC@3ee00f8411dbc4e4eecd247e0e9a2d456dc0abc7
- Branch / Tag: refs/tags/v0.1.12
- Owner: https://github.com/beyonai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@3ee00f8411dbc4e4eecd247e0e9a2d456dc0abc7
- Trigger Event: push

File details

Details for the file by_qa-0.1.12-py3-none-any.whl.

File metadata

Download URL: by_qa-0.1.12-py3-none-any.whl
Upload date: Jun 1, 2026
Size: 187.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for by_qa-0.1.12-py3-none-any.whl
Algorithm	Hash digest
SHA256	`638ff7e0a4ba7dcec60687ce8a838e0c7bab237efdcb6f766f9f433551190bb2`
MD5	`7a78159c13a2844a300e018364c28bad`
BLAKE2b-256	`48087c77e09d2597e6557aac55f43f58b6b33e7df4dd6ae250432bf97c583ea4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for by_qa-0.1.12-py3-none-any.whl:

Publisher: release.yml on beyonai/ByKC

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: by_qa-0.1.12-py3-none-any.whl
- Subject digest: 638ff7e0a4ba7dcec60687ce8a838e0c7bab237efdcb6f766f9f433551190bb2
- Sigstore transparency entry: 1690335384
- Sigstore integration time: Jun 1, 2026
Source repository:
- Permalink: beyonai/ByKC@3ee00f8411dbc4e4eecd247e0e9a2d456dc0abc7
- Branch / Tag: refs/tags/v0.1.12
- Owner: https://github.com/beyonai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@3ee00f8411dbc4e4eecd247e0e9a2d456dc0abc7
- Trigger Event: push

by-qa 0.1.12

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

ByKC

Project Positioning

Tech Stack

Core Features

Core Design

Dual Engines: Fast vs Instant

AgentOverride: Per-Node Behavior Configuration

ServiceToolDispatcher: Knowledge Base → Agent Tools

Quick Start

Requirements

Install

Start Middleware

Configure

Run

End-to-End Walkthrough

Usage

Knowledge Base API

QA Engines (Code-Level Integration)

Evaluation

Project Structure

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance