
BotRun LlamaIndex Knowledge Base

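An intelligent knowledge base system built on LlamaIndex and Qdrant, optimized for Traditional Chinese and Taiwanese usage, with multi-format document processing and semantic search.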

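📋 Project Overview

BotRun LlamaIndex Knowledge Base is an enterprise-grade knowledge base solution with the following features:

Intelligent document processing: supports multiple formats, including TXT, MD, PDF, and CSV
Hybrid Search: combines semantic search (Dense Vectors) with keyword matching (Sparse Vectors)
- https://qdrant.tech/articles/hybrid-search/
- Dense Vectors: good at capturing the semantic nuances of text
- Sparse Vectors: precisely identify keywords
Semantic search (Dense): uses Google GenAI Embedding (gemini-embedding-001) for semantic understanding
Keyword search (Sparse): integrates FastEmbed BM25 for exact keyword matching
ReAct Agent: integrates an intelligent agent for interactive queries
Traditional Chinese optimization: specifically tuned for Taiwanese usage and Traditional Chinese
Modular architecture: uses the Constructor Dependency Injection design pattern
Vector storage: a high-performance hybrid vector database based on Qdrant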

🚀 Quick Start

Requirements

Python 3.11+
Poetry (package management)
Qdrant service (local or remote)
Google GenAI API Key

Installation

Clone the project:

git clone <repository-url>
cd botrun-llama-kb

Install dependencies:

poetry install

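Set environment variables:

Create a .env file or set the following environment variables:

# Google GenAI API Key (required)
GOOGLE_API_KEY=your_google_api_key_here

# Qdrant configuration (adjust to match your Qdrant service)
QDRANT_HOST=localhost          # default: localhost
QDRANT_PORT=6333              # default: 6333
QDRANT_API_KEY=               # if authentication is required
QDRANT_PREFIX=/qdrant         # API path prefix
QDRANT_HTTPS=false            # whether to use HTTPS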
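
The variables above line up with the keys of the qdrant_config dictionary used in the usage examples (host, port, api_key, prefix, https). As a rough illustration of how such a mapping could be read from the environment, here is a minimal sketch. The function name load_qdrant_config_from_env is hypothetical and is not the package's get_qdrant_config; the host and port defaults follow the comments above, while treating empty values as None and the accepted spellings of "true" are assumptions.

```python
import os
from typing import Any, Mapping


def load_qdrant_config_from_env(env: Mapping[str, str] = os.environ) -> dict[str, Any]:
    """Hypothetical helper: build a Qdrant config dict from environment variables."""
    # Assumption: empty strings mean "not set" for the optional values.
    api_key = env.get("QDRANT_API_KEY", "").strip()
    prefix = env.get("QDRANT_PREFIX", "").strip()
    https_raw = env.get("QDRANT_HTTPS", "false").strip().lower()
    return {
        "host": env.get("QDRANT_HOST", "localhost"),
        "port": int(env.get("QDRANT_PORT", "6333")),
        "api_key": api_key or None,
        "prefix": prefix or None,
        # Assumption: these spellings count as "true"; anything else is False.
        "https": https_raw in {"1", "true", "yes"},
    }


if __name__ == "__main__":
    print(load_qdrant_config_from_env())
```

Passing the environment as a parameter keeps the helper easy to exercise with a plain dictionary instead of the real process environment.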

Usage

Programmatic usage

Depending on the scenario, there are three main operating modes:

1. Full rebuild mode (create from scratch, or wipe and rebuild)

import asyncio
from botrun_llama_kb.knowledge_base_factory import create_kb_store_from_local_dir, get_qdrant_config

async def full_rebuild():
    # Create the knowledge base instance
    kb_store = await create_kb_store_from_local_dir(
        directory="path/to/your/documents",
        qdrant_config=get_qdrant_config(),  # read automatically from environment variables
        embedding_model="gemini-embedding-001",
        agent_model="gemini-2.5-flash"
    )

    # ⚠️  Completely clears the knowledge base and cache (deletes all data!)
    await kb_store.clear_knowledge_base()

    # Refresh the knowledge base (reprocesses all files)
    await kb_store.refresh_knowledge_base()

    # Test query
    result = await kb_store.query_knowledge_base("your question")
    print(result)

asyncio.run(full_rebuild())

2. Incremental update mode (check for updates and sync)

import asyncio
from botrun_llama_kb.knowledge_base_factory import create_kb_store_from_local_dir

async def incremental_update():
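    # Create the knowledge base instance
    kb_store = await create_kb_store_from_local_dir(
        directory="path/to/your/documents"
    )

    # 🔄 Smart refresh (creates the knowledge base if it does not exist, otherwise checks for updates)
    await kb_store.refresh_knowledge_base()

    # Test query
    result = await kb_store.query_knowledge_base("your question")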
    print(result)

asyncio.run(incremental_update())

3. Fast load mode (use the existing index directly)

import asyncio
from botrun_llama_kb.knowledge_base_factory import create_kb_store_from_local_dir

async def fast_load():
    # Create the knowledge base instance
    kb_store = await create_kb_store_from_local_dir(
        directory="path/to/your/documents"  # the directory determines the collection name
    )

    # ⚡ Load the existing vector index directly (fastest)
    await kb_store.load_from_existing_collection()

    # Test query
    result = await kb_store.query_knowledge_base("your question")
    print(result)

asyncio.run(fast_load())

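Usage scenarios

Mode	When to use	Pros	Cons
Full rebuild	First-time setup, large document changes, corrupted cache	Ensures data integrity, clears stale cache	Longest processing time
Incremental update	Day-to-day use, documents added or removed	Detects updates intelligently, uses the cache to speed things up	Moderate processing time
Fast load	Development and testing, production service startup	Fastest startup, no reprocessing	Requires an existing index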
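
The table can be read as a small decision rule. The sketch below encodes it as a plain function so the choice can be made in code. Mode, choose_mode, and the three boolean inputs are hypothetical names introduced here for illustration and are not part of the package; how a caller detects a changed document set or a damaged cache is left open.

```python
from enum import Enum


class Mode(Enum):
    FULL_REBUILD = "full_rebuild"
    INCREMENTAL_UPDATE = "incremental_update"
    FAST_LOAD = "fast_load"


def choose_mode(index_exists: bool, documents_changed: bool, cache_corrupted: bool = False) -> Mode:
    """Pick an operating mode following the usage-scenario table (illustrative only)."""
    # No index yet, or a damaged cache: rebuild everything from scratch.
    if not index_exists or cache_corrupted:
        return Mode.FULL_REBUILD
    # Index exists but documents were added or removed: sync incrementally.
    if documents_changed:
        return Mode.INCREMENTAL_UPDATE
    # Nothing changed: load the existing index as-is.
    return Mode.FAST_LOAD
```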

Advanced configuration example

import asyncio
from botrun_llama_kb.knowledge_base_factory import create_kb_store_from_local_dir

async def advanced_usage():
    # Custom Qdrant configuration
    qdrant_config = {
        "host": "your-qdrant-host.com",
        "port": 443,
        "api_key": "your-api-key",
        "prefix": "/qdrant",
        "https": True
    }

    # Create the knowledge base instance
    kb_store = await create_kb_store_from_local_dir(
        directory="path/to/your/documents",
        qdrant_config=qdrant_config,
        embedding_model="gemini-embedding-001",  # or "text-embedding-004"
        agent_model="gemini-2.5-flash"          # or "gemini-2.5-pro"
    )

    # Choose the operating mode you need
    # await kb_store.clear_knowledge_base()        # full wipe
    # await kb_store.refresh_knowledge_base()      # smart refresh
    # await kb_store.load_from_existing_collection()  # fast load

    # Query (supports Traditional Chinese and Taiwanese usage)
    result = await kb_store.query_knowledge_base("your question", top_k=5)
    print(result)

asyncio.run(advanced_usage())

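🏗️ System Architecture

Core modules

botrun_llama_kb/
├── adapters/                          # File source adapters
│   ├── file_source_adapter.py         # Abstract base class
│   └── local_directory_adapter.py     # Local directory implementation
├── knowledge_base_store.py            # Abstract knowledge base interface
├── knowledge_base_qdrant_store.py     # Qdrant implementation
├── knowledge_base_factory.py          # Factory-pattern builder
└── constants.py                       # Constant definitions

Design patterns

Abstract Factory Pattern: FileSourceAdapter supports different data sources
Strategy Pattern: KnowledgeBaseStore supports different implementations
Dependency Injection: dependencies are injected through the constructor
Factory Method: knowledge_base_factory unifies the construction flow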
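
To make the constructor-injection idea concrete, here is a minimal, self-contained sketch of the general shape: an abstract source, a store that receives it through its constructor, and a factory function that wires the two together. Every class, method, and function name below (DocumentSource, InMemorySource, KeywordStore, create_store) is invented for illustration; none of them is the package's actual interface.

```python
from abc import ABC, abstractmethod


class DocumentSource(ABC):
    """Illustrative stand-in for an abstract file source (hypothetical interface)."""

    @abstractmethod
    def list_documents(self) -> list[str]:
        """Return the text of every available document."""


class InMemorySource(DocumentSource):
    """One concrete source; a directory-backed source would be another."""

    def __init__(self, documents: list[str]) -> None:
        self._documents = list(documents)

    def list_documents(self) -> list[str]:
        return list(self._documents)


class KeywordStore:
    """Illustrative store: its source is injected through the constructor."""

    def __init__(self, source: DocumentSource) -> None:
        self._source = source

    def search(self, term: str) -> list[str]:
        # Naive substring match, only to show the store using its dependency.
        return [doc for doc in self._source.list_documents() if term in doc]


def create_store(documents: list[str]) -> KeywordStore:
    """Factory function: builds the dependencies and wires them in one place."""
    return KeywordStore(InMemorySource(documents))
```

Because the store depends only on the abstract source, a test can pass in an in-memory source while production code passes a different one, without changing the store.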

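Core components

FileSourceAdapter: scans and loads files
KnowledgeBaseStore: core knowledge base operations interface
QdrantVectorStore: hybrid vector storage and retrieval (Dense + Sparse)
GoogleGenAIEmbedding: dense vectorization (gemini-embedding-001, 3072 dimensions)
FastEmbed BM25: sparse vectorization (Qdrant/bm25 keyword matching)
SemanticSplitterNodeParser: semantic chunking
Hybrid Query Engine: hybrid search query engine
ReActAgent: intelligent query agent (supports hybrid search)

Batch processing and fault tolerance

The system uses IngestionPipeline to process large numbers of files in batches, with fault tolerance and checkpoint-based resumption:

Batch processing flow:
File loading → IngestionPipeline (batch: 50 files per batch)
    ├── Document ID generation (MD5: file_path + page_label)
    ├── Duplicate detection (SimpleDocumentStore)
    ├── Semantic chunking (SemanticSplitterNodeParser)
    ├── Vectorization (GoogleGenAI Embedding)
    ├── Retry on failure (up to 3 times, 60 seconds apart)
    └── Cache persistence (saved immediately after each batch completes)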
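
The document ID step in the flow above hashes the file path together with the page label, which gives each page a stable ID across runs. A minimal sketch of that idea follows. The function name make_doc_id is hypothetical, and joining the two fields by plain concatenation with UTF-8 encoding is an assumption; the source only states that MD5 is applied to file_path + page_label.

```python
import hashlib


def make_doc_id(file_path: str, page_label: str = "") -> str:
    """Hypothetical helper: derive a stable document ID from path and page label."""
    # Assumption: plain concatenation, UTF-8 encoded.
    raw = f"{file_path}{page_label}".encode("utf-8")
    # MD5 is used here as a fingerprint, not for security.
    return hashlib.md5(raw, usedforsecurity=False).hexdigest()
```

The same path and page always produce the same ID, which is what allows duplicate detection to skip pages that were already processed.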

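Fault-tolerance parameters:

BATCH_SIZE=50: number of files processed per batch
MAX_RETRIES=3: maximum number of retries for a failed batch
RETRY_DELAY=60: delay between retries (seconds)
num_workers=1: sequential processing, to avoid multiprocessing issues
Cache directory: .pipeline_cache/storage_{collection_name}/
Resumable processing: incremental processing based on doc_id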
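
The parameters above describe a batch loop with a fixed-delay retry and a cache save after each batch. The following is a minimal sketch of that control flow, not the package's implementation. The function process_in_batches and its callback parameters are hypothetical, and it assumes MAX_RETRIES counts total attempts per batch (the source could also mean three retries after the first attempt).

```python
import time
from typing import Callable, Sequence

BATCH_SIZE = 50
MAX_RETRIES = 3   # assumption: total attempts per batch
RETRY_DELAY = 60  # seconds


def process_in_batches(
    files: Sequence[str],
    process_batch: Callable[[Sequence[str]], None],
    save_cache: Callable[[], None],
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Process files batch by batch, retrying a failed batch after a fixed delay."""
    for start in range(0, len(files), BATCH_SIZE):
        batch = files[start:start + BATCH_SIZE]
        for attempt in range(1, MAX_RETRIES + 1):
            try:
                process_batch(batch)
                break
            except Exception:
                # Give up on the last attempt; otherwise wait and retry.
                if attempt == MAX_RETRIES:
                    raise
                sleep(RETRY_DELAY)
        # Persist after every successful batch so a later crash can resume from here.
        save_cache()
```

Saving after each batch is what makes resumption cheap: after a crash, only the batch that was in flight needs to be redone.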

GoogleGenAI connection tuning:

retries=5: number of API retries
timeout=30: connection timeout (seconds)
retry_min_seconds=10: minimum retry interval
retry_max_seconds=30: maximum retry interval
retry_exponential_base=2: exponential backoff base
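
To see how these retry settings interact, the sketch below computes a clamped exponential backoff schedule: the raw delay grows as base to the power of the attempt number and is then clamped to the minimum and maximum. The function backoff_delays is hypothetical, and how the underlying client actually computes its waits (including any jitter) is an assumption not confirmed by the source.

```python
def backoff_delays(
    retries: int = 5,
    min_seconds: float = 10,
    max_seconds: float = 30,
    base: float = 2,
) -> list[float]:
    """Return the wait before each retry under clamped exponential backoff."""
    delays = []
    for attempt in range(retries):
        raw = base ** attempt  # 1, 2, 4, 8, 16, ...
        # Clamp the raw delay into [min_seconds, max_seconds].
        delays.append(max(min_seconds, min(raw, max_seconds)))
    return delays


if __name__ == "__main__":
    print(backoff_delays())  # [10, 10, 10, 10, 16]
```

Under these assumptions the minimum dominates the early retries, so with five retries the exponential term only becomes visible on the last one.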

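Hybrid search architecture

The system uses a Dense + Sparse hybrid search architecture that combines semantic understanding with keyword matching:

Query processing flow:
User query → Hybrid Query Engine
    ├── Dense Vector Search (semantic search)
    │   ├── Google GenAI Embedding (gemini-embedding-001)
    │   └── Semantic similarity matching (similarity_top_k=2)
    ├── Sparse Vector Search (keyword search)
    │   ├── FastEmbed BM25 (Qdrant/bm25)
    │   └── Exact keyword matching (sparse_top_k=12)
    └── Fusion Algorithm (result fusion)
        └── LlamaIndex built-in fusion (hybrid_top_k=3)

Core configuration parameters:

enable_hybrid=True: enables hybrid search mode
fastembed_sparse_model="Qdrant/bm25": BM25 sparse vector model
similarity_top_k=2: number of dense vector search results
sparse_top_k=12: number of sparse vector search results
hybrid_top_k=3: number of final fused results
batch_size=20: batch size tuning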
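
The fusion step merges two ranked lists whose scores live on different scales (cosine-style similarity for dense, BM25-style scores for sparse). One common way to do this is to min-max normalize each list and take a weighted sum. The sketch below shows that technique in plain Python. Whether it matches the built-in fusion the project relies on is an assumption, and the names min_max_normalize, fuse, and the alpha weight are introduced here for illustration only.

```python
def min_max_normalize(scores: dict[str, float]) -> dict[str, float]:
    """Rescale scores to [0, 1]; a list with identical scores maps to all 1.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {node_id: 1.0 for node_id in scores}
    return {node_id: (value - lo) / (hi - lo) for node_id, value in scores.items()}


def fuse(
    dense: dict[str, float],
    sparse: dict[str, float],
    alpha: float = 0.5,
    top_k: int = 3,
) -> list[tuple[str, float]]:
    """Weighted sum of normalized dense and sparse scores, best first."""
    d = min_max_normalize(dense)
    s = min_max_normalize(sparse)
    fused = {
        # A node missing from one list contributes 0 from that side.
        node_id: alpha * d.get(node_id, 0.0) + (1 - alpha) * s.get(node_id, 0.0)
        for node_id in set(d) | set(s)
    }
    # Sort by score descending, then by ID so ties are deterministic.
    return sorted(fused.items(), key=lambda item: (-item[1], item[0]))[:top_k]


if __name__ == "__main__":
    dense_hits = {"a": 0.9, "b": 0.5}
    sparse_hits = {"b": 12.0, "c": 8.0, "d": 4.0}
    print(fuse(dense_hits, sparse_hits))  # [('a', 0.5), ('b', 0.5), ('c', 0.25)]
```

Normalizing first keeps the larger raw BM25 values from drowning out the dense similarities, and alpha then controls how much each side counts.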

🔧 Advanced Configuration

Custom embedding model

# Edit in sh/kb_cli.py
GEMINI_EMBEDDING_MODEL = "gemini-embedding-001"  # or another supported model

Custom Agent model

# Edit in sh/kb_cli.py
GEMINI_AGENT_MODEL = "gemini-2.5-flash"  # or gemini-pro

Advanced Qdrant configuration

qdrant_config = {
    "host": "your-qdrant-host.com",
    "port": 443,
    "api_key": "your-api-key",
    "prefix": "/qdrant",
    "https": True
}

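📊 Supported File Formats

Format	Support level	Notes
`.txt`	✅ Full support	Plain text files
`.md`	✅ Full support	Markdown format
`.pdf`	✅ Text content	Extracts plain text content
`.csv`	✅ Tabular data	Structured data processing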
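
Before pointing the knowledge base at a directory, it can help to check which files fall inside the supported set. The sketch below filters by extension using only the standard library. The names SUPPORTED_EXTENSIONS, is_supported, and find_supported_files are hypothetical; matching extensions case-insensitively and scanning subdirectories recursively are assumptions about behavior the source does not describe.

```python
from pathlib import Path

# Extensions taken from the table above.
SUPPORTED_EXTENSIONS = {".txt", ".md", ".pdf", ".csv"}


def is_supported(filename: str) -> bool:
    """True if the file extension is in the supported set (case-insensitive)."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def find_supported_files(directory: str) -> list[Path]:
    """Recursively list supported files under a directory, sorted by path."""
    root = Path(directory)
    return sorted(
        path for path in root.rglob("*")
        if path.is_file() and is_supported(path.name)
    )
```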

🧪 Testing and Validation

The project provides an end-to-end test flow:

# Run the full test
python sh/kb_cli.py

Update `requirements.txt`

poetry export -f requirements.txt --output requirements.txt --without-hashes --without-urls

Release history

5.8.31 (Aug 3, 2025)
5.8.22 (Aug 2, 2025, this version)
