多语言轻量级高性能混合检索引擎，专为RAG全链路检索&重排设计

These details have not been verified by PyPI

Project links

GitHub

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

🌟 JiaJia-Search

多语言轻量级高性能混合检索引擎 | 专为 RAG 全链路检索 & 重排设计

🌏 语言切换 | Language Switch

👉 点击快速切换：

中文文档｜ English Documentation

📘 中文版本

📖 项目介绍

JiaJia-Search 是一款多语言、泛化性极强、生产级可用的混合检索框架，聚焦于 RAG（检索增强生成）场景的召回 + 融合 + 重排全链路优化。

框架支持独立向量生成（可直接存入向量数据库）、BM25 全文检索与向量检索并行加速、RRF 多路排序融合、CrossEncoder 高精度重排以及位置感知加权融合，所有 API 均可独立调用，满足多样化检索需求。

核心优势：无第三方服务依赖、全 ONNX CPU模型加速、全局单例模型加载、动态配置无源码侵入、极致性能与泛化性兼顾。

✨ 核心特性

独立 Embedding 生成

支持单文本 / 批量文本生成向量，直接输出 numpy 数组，无缝对接 Milvus/FAISS/Chroma 等向量数据库。
双路并行检索

BM25 全文检索 + 向量语义检索同时执行，检索性能显著提升。
RRF 智能排序融合

支持查询加权、Top 排名奖励机制，解决多路检索结果融合问题。
CrossEncoder 高精度重排

适配 bge-reranker-v2-m3 等 ONNX 模型，支持分数 Sigmoid 归一化 + 相关性等级判定。
位置感知加权融合

按检索排名动态分配权重，兼顾精准匹配与语义相关性。
全局单例模型加载

向量模型、重排模型仅加载 1 次，常驻内存复用，杜绝重复加载性能损耗。
动态配置体系

无需修改源码，一行代码配置模型路径与超参，开箱即用。
全 API 独立调用

所有模块解耦，可单独使用 Embedding / 检索 / 重排 / 融合能力，泛化性拉满。

🚀 快速安装

  官方 PyPI 安装

pip install jiajia-search

  升级最新版本

pip install --upgrade jiajia-search

⚙️ 核心配置

首次使用必须配置本地 ONNX 模型路径，框架支持动态配置，无需修改源码。

from jiajia_search import config

  全局配置（一次配置，全生命周期生效）

config.setup(

    # 向量 Embedding 模型（必填）

    vector_model_path=r"你的multilingual-e5-small-onnx模型路径",

    vector_onnx_file="model.onnx",

    # 重排 CrossEncoder 模型（必填）

    rerank_model_path=r"你的bge-reranker-v2-m3-ONNX模型路径",

    rerank_onnx_file="model_int8.onnx",

    # 可选参数

    vector_dim=384,

    rrf_k=60,

    top_k_recall=30,

    default_query_weight=2.0

)

📌 快速使用

1. 独立生成 Embedding（存入向量库）

from jiajia_search import generate_embedding, generate_embeddings_batch

  单文本生成 Embedding

text = "苹果手机多少钱"

embedding = generate_embedding(text)

print(f"向量维度: {embedding.shape}")  # (384,)

  批量文本生成 Embedding

texts = ["iPhone 15 售价", "华为手机报价", "小米手机价格"]

embeddings = generate_embeddings_batch(texts)

2. 独立 BM25 全文检索

from jiajia_search import bm25_fts_retrieval

query = "苹果手机多少钱"

documents = ["iPhone 15 官方定价", "华为 Mate 70 报价", "小米 14 价格"]

ranked_results = bm25_fts_retrieval(query, documents)

3. 独立向量语义检索

from jiajia_search import vector_index_retrieval

ranked_results = vector_index_retrieval(query, documents)

4. 独立 CrossEncoder 重排

from jiajia_search import rerank_with_normalize

norm_scores, rel_levels = rerank_with_normalize(query, documents)

print("归一化分数:", norm_scores)

print("相关性等级:", rel_levels)

5. 完整检索重排流水线（一键调用）

from jiajia_search import jiajia_search_pipeline

query = "苹果手机多少钱"

documents = [

    "iPhone 15 售价多少",

    "苹果手机官方定价",

    "华为手机最新报价",

    "小米手机价格查询",

    "二手苹果手机多少钱"

]

  启动全流程检索

final_rank=jiajia_search_pipeline(query, documents)

🧠 相关性分数标准

分数范围	相关性等级	中文释义
0.8 ~ 1.0	Highly relevant	高度相关
0.5 ~ 0.8	Moderately relevant	中等相关
0.2 ~ 0.5	Somewhat relevant	一般相关
0.0 ~ 0.2	Low relevance	低相关

🏗️ 技术架构

架构流程图（核心流程）

┌─────────────────────────────────────────────────────────────────────────────┐

│                       JiaJia-Search Hybrid Retrieval Pipeline               │

└─────────────────────────────────────────────────────────────────────────────┘

                              ┌─────────────────┐

                              │   用户查询 (User Query)  │

                              └────────┬────────┘

                                       │

                        ┌──────────────┼──────────────┐

                        ▼                             ▼

               ┌────────────────┐            ┌────────────────┐

               │   BM25 全文检索  │            │   向量语义检索   │

               │ (Jieba 分词 + FTS) │            │ (E5-ONNX 向量匹配) │

               └───────┬────────┘            └───────┬────────┘

                       │                             │

                       │ 双路并行执行 (Parallel Run)  │

                       └──────────────┬──────────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │   RRF 融合 + 排名奖励   │

                          │  RRF _K=60 + Top30 保留  │

                          │  Top1: +0.05 / Top2-3: +0.02 │

                          └───────────┬───────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ CrossEncoder 重排      │

                          │ (bge-reranker-v2-ONNX) │

                          │ Sigmoid 归一化 + 相关性等级 │

                          └───────────┬───────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ 位置感知加权融合       │

                          │ Top1-3: 75% RRF + 25% 重排分 │

                          │ Top4-10: 60% RRF + 40% 重排分 │

                          │ Top11+: 40% RRF + 60% 重排分  │

                          └───────────┬───────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ 最终排序结果          │

                          │ (含相关性等级/置信分数) │

                          └───────────────────────┘

分数归一化与融合策略

检索后端分数转换（Score Normalization）

检索后端	原始分数类型	转换方式	输出范围
BM25 全文检索	BM25 原始分数	取绝对值（abs (score)）	0 ~ 25+
向量检索	余弦相似度	直接使用（原生 0~1 分布）	0.0 ~ 1.0
CrossEncoder 重排	Logits 分数	Sigmoid 函数归一化	0.0 ~ 1.0

融合核心策略（Fusion Strategy）

JiaJia-Search 采用「双路并行召回 + 智能融合 + 精准重排」的工业级流程，核心逻辑如下：

双路并行召回：BM25 全文检索（关键词精准匹配）与向量语义检索（上下文语义匹配）同时执行，性能提升近 100%；
RRF 多路融合：使用 Reciprocal Rank Fusion（RRF）算法融合双路结果，公式为 Σ(1/(RRF_K + rank + 1))（RRF _K=60），并对 Top 排名文档追加奖励（Top1+0.05，Top2-3+0.02）；
候选集筛选：融合后保留 Top30 文档进入重排阶段，平衡召回率与计算成本；
CrossEncoder 重排：使用轻量级 ONNX 重排模型（bge-reranker-v2-m3）对候选集精细化排序，输出 0~1 归一化分数与相关性等级；
位置感知加权：根据 RRF 融合后的排名动态分配权重：

前排文档（Top1-3）：优先保留检索阶段的精准匹配结果（75% RRF 分数 + 25% 重排分数）；
中排文档（Top4-10）：均衡检索与重排结果（60% RRF 分数 + 40% 重排分数）；
后排文档（Top11+）：依赖重排模型修正语义相关性（40% RRF 分数 + 60% 重排分数）；

结果输出：最终按加权分数排序，附带相关性等级（Highly/Moderately/Somewhat/Low relevant），支持业务层快速筛选。

📦 支持模型

框架基于 ONNX 格式运行，需自行下载以下开源模型：

多语言向量 Embedding 模型：multilingual-e5-small-onnx
多语言重排模型：bge-reranker-v2-m3-ONNX

⚠️ 注意事项

模型必须为 ONNX 格式，不支持 PyTorch 原生模型；
模型采用全局单例加载，首次调用加载，后续常驻内存；
必须通过 config.setup() 配置模型路径；
所有依赖自动安装，无需手动配置环境。

修复与升级

1.修复linux安装使用问题

📄 许可证

MIT License

自由使用、修改、分发，适用于个人与商业场景。

🤝 贡献与反馈

GitHub 仓库：jiajia-search
问题反馈：Issues 板块
功能建议：欢迎提交 PR 与 Issue

🌟 JiaJia-Search

Multilingual Lightweight & High-Performance Hybrid Search Engine | Built for RAG

🌏 语言切换 | Language Switch

👉 Click to quickly switch：

中文文档｜ English Documentation

📗 English Version

📖 Introduction

JiaJia-Search is a Multilingual highly generalized, production-ready hybrid search framework focusing on full-link optimization of retrieval + fusion + reranking for RAG (Retrieval-Augmented Generation) scenarios.

It supports independent embedding generation (directly storable in vector databases), parallel acceleration of BM25 full-text search & vector search, RRF multi-way ranking fusion, CrossEncoder high-precision reranking, and position-aware weighted fusion. All APIs are independently callable to meet diverse search requirements.

Core Advantages: No third-party service dependencies, full ONNX CPU model acceleration, global singleton model loading, dynamic configuration without code intrusion, perfect balance of extreme performance and generalization.

✨ Core Features

Independent Embedding Generation

Support single/batch text vectorization, output numpy arrays directly, seamlessly connect to Milvus/FAISS/Chroma and other vector databases.
Dual-Parallel Retrieval

BM25 full-text search + vector semantic search run simultaneously, significantly improving retrieval performance.
RRF Intelligent Ranking Fusion

Support query weighting and top-rank bonus mechanism to solve multi-retrieval result fusion.
CrossEncoder High-Precision Reranking

Adapt to ONNX models such as bge-reranker-v2-m3, support Sigmoid normalization + relevance level classification.
Position-Aware Weighted Fusion

Dynamically allocate weights according to retrieval ranks, balancing exact match and semantic relevance.
Global Singleton Model Loading

Vector model & rerank model loaded only once, resident in memory for reuse, eliminating repeated loading performance loss.
Dynamic Configuration System

No code modification required, configure model paths and hyperparameters with one line of code.
Fully Independent API Calls

All modules are decoupled; embedding/retrieval/reranking/fusion can be used separately for maximum generalization.

🚀 Quick Installation

  Install from PyPI

pip install jiajia-search

  Upgrade to latest version

pip install --upgrade jiajia-search

⚙️ Core Configuration

Local ONNX model paths must be configured for first use; the framework supports dynamic configuration without code changes.

from jiajia_search import config

  Global configuration (takes effect for the entire lifecycle)

config.setup(

    # Vector Embedding Model (Required)

    vector_model_path=r"Yourmultilingual-e5-small-onnxPath",

    vector_onnx_file="model.onnx",

    # CrossEncoder Rerank Model (Required)

    rerank_model_path=r"Yourbge-reranker-v2-m3-ONNXPath",

    rerank_onnx_file="model_int8.onnx",

    # Optional Parameters

    vector_dim=384,

    rrf_k=60,

    top_k_recall=30,

    default_query_weight=2.0

)

📌 Quick Start

1. Independent Embedding Generation (For Vector DB)

from jiajia_search import generate_embedding, generate_embeddings_batch

  Single text embedding

text = "How much is an iPhone?"

embedding = generate_embedding(text)

print(f"Vector Shape: {embedding.shape}")  # (384,)

  Batch text embeddings

texts = ["iPhone 15 Price", "Huawei Phone Quote", "Xiaomi Phone Price"]

embeddings = generate_embeddings_batch(texts)

2. Standalone BM25 Retrieval

from jiajia_search import bm25_fts_retrieval

query = "How much is an iPhone?"

documents = ["iPhone 15 Official Price", "Huawei Mate 70 Quote", "Xiaomi 14 Price"]

ranked_results = bm25_fts_retrieval(query, documents)

3. Standalone Vector Retrieval

from jiajia_search import vector_index_retrieval

ranked_results = vector_index_retrieval(query, documents)

4. Standalone CrossEncoder Reranking

from jiajia_search import rerank_with_normalize

norm_scores, rel_levels = rerank_with_normalize(query, documents)

print("Normalized Scores:", norm_scores)

print("Relevance Levels:", rel_levels)

5. Full Search & Rerank Pipeline

from jiajia_search import jiajia_search_pipeline

query = "How much is an iPhone?"

documents = [

    "iPhone 15 Price",

    "Apple Phone Official Price",

    "Latest Huawei Phone Quote",

    "Xiaomi Phone Price Inquiry",

    "How much is a used iPhone?"

]

  Run full pipeline

final_rank=jiajia_search_pipeline(query, documents)

🧠 Relevance Score Standard

Score Range	Relevance Level	Meaning
0.8 ~ 1.0	Highly relevant	Highly relevant
0.5 ~ 0.8	Moderately relevant	Moderately relevant
0.2 ~ 0.5	Somewhat relevant	Somewhat relevant
0.0 ~ 0.2	Low relevance	Low relevance

🏗️ Technical Architecture

Architecture Flowchart (Core Process)

┌─────────────────────────────────────────────────────────────────────────────┐

│                       JiaJia-Search Hybrid Retrieval Pipeline               │

└─────────────────────────────────────────────────────────────────────────────┘

                              ┌─────────────────┐

                              │   User Query    │

                              └────────┬────────┘

                                       │

                        ┌──────────────┼──────────────┐

                        ▼                             ▼

               ┌────────────────┐            ┌────────────────┐

               │ BM25 Full-Text │            │ Vector Semantic│

               │ Search (Jieba) │            │ Search (E5-ONNX)│

               └───────┬────────┘            └───────┬────────┘

                       │                             │

                       │  Parallel Execution        │

                       └──────────────┬──────────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ RRF Fusion + Rank Bonus│

                          │ RRF_K=60 + Top30 Kept  │

                          │ Top1: +0.05 / Top2-3: +0.02 │

                          └───────────┬───────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ CrossEncoder Reranking│

                          │ (bge-reranker-v2-ONNX)│

                          │ Sigmoid Normalization + Relevance Level │

                          └───────────┬───────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ Position-Aware Blending│

                          │ Top1-3: 75% RRF + 25% Rerank │

                          │ Top4-10: 60% RRF + 40% Rerank │

                          │ Top11+: 40% RRF + 60% Rerank  │

                          └───────────┬───────────┘

                                      │

                                      ▼

                          ┌───────────────────────┐

                          │ Final Ranked Results  │

                          │ (Relevance Level + Confidence Score) │

                          └───────────────────────┘

Score Normalization & Fusion Strategy

Retrieval Backend Score Conversion

Backend	Raw Score Type	Conversion Method	Output Range
BM25 Full-Text Search	BM25 Raw Score	Absolute value (abs(score))	0 ~ 25+
Vector Search	Cosine Similarity	Direct use (native 0~1)	0.0 ~ 1.0
CrossEncoder Reranking	Logits Score	Sigmoid Normalization	0.0 ~ 1.0

Core Fusion Strategy

JiaJia-Search adopts an industrial-grade workflow of "Dual-Parallel Retrieval + Intelligent Fusion + Precise Reranking" with the following core logic:

Dual-Parallel Retrieval: BM25 full-text search (keyword exact match) and vector semantic search (contextual semantic match) run simultaneously, improving performance by nearly 100%;
RRF Multi-Way Fusion: Combine dual-path results using Reciprocal Rank Fusion (RRF) with the formula Σ(1/(RRF_K + rank + 1)) (RRF_K=60), and add bonuses for top-ranked documents (Top1+0.05, Top2-3+0.02);
Candidate Selection: Keep Top30 documents after fusion for reranking, balancing recall rate and computational cost;
CrossEncoder Reranking: Use a lightweight ONNX reranking model (bge-reranker-v2-m3) for fine-grained sorting of candidates, outputting 0~1 normalized scores and relevance levels;
Position-Aware Blending: Dynamically assign weights based on RRF fusion ranks:

Top1-3: Prioritize precise matching from retrieval (75% RRF score + 25% rerank score);
Top4-10: Balance retrieval and reranking results (60% RRF score + 40% rerank score);
Top11+: Rely on reranking model to correct semantic relevance (40% RRF score + 60% rerank score);

Result Output: Finally sort by weighted scores, with relevance levels (Highly/Moderately/Somewhat/Low relevant) for quick business-layer filtering.

📦 Supported Models

The framework runs on ONNX format, please download the following open-source models manually:

Multilingual Vector Embedding Model: multilingual-e5-small-onnx
Multilingual Reranking Model: bge-reranker-v2-m3-ONNX

⚠️ Notes

Models must be in ONNX format (PyTorch native models are not supported);
Global singleton loading: models are loaded once and reused in memory;
Model paths must be configured via config.setup();
All dependencies are installed automatically.

Fixes and Updates

Fixed issues related to Linux installation and usage

📄 License

MIT License

Free for personal and commercial use.

🤝 Contribution & Feedback

GitHub Repository: jiajia-search
Issue Feedback: GitHub Issues
Feature Requests: Pull Requests & Issues are welcome

Project details

These details have not been verified by PyPI

Project links

GitHub

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.1.2

Apr 9, 2026

0.1.1

Mar 25, 2026

0.1.0

Mar 25, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jiajia_search-0.1.2.tar.gz (20.4 kB view details)

Uploaded Apr 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

jiajia_search-0.1.2-py3-none-any.whl (16.4 kB view details)

Uploaded Apr 9, 2026 Python 3

File details

Details for the file jiajia_search-0.1.2.tar.gz.

File metadata

Download URL: jiajia_search-0.1.2.tar.gz
Upload date: Apr 9, 2026
Size: 20.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for jiajia_search-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`388559e792000db675c26ebc079be36a0fc9f42da01cc451edf122808945c475`
MD5	`b3f2f357e3b0bee70ad8a1627dd9b875`
BLAKE2b-256	`4eccc6030ef3c02597b8fde1093d87a9f7740e9c5abbf18622aadab45345413a`

See more details on using hashes here.

File details

Details for the file jiajia_search-0.1.2-py3-none-any.whl.

File metadata

Download URL: jiajia_search-0.1.2-py3-none-any.whl
Upload date: Apr 9, 2026
Size: 16.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for jiajia_search-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`56ccc2c98b072489d749630a351b89500d07229db4df0463a7a790d1efd5e985`
MD5	`eab31b45f4cf5596ff1bfa13138cb83b`
BLAKE2b-256	`fc5108340cd9a8ef219c44a1b99b92c87a3b6a37ba4590dd2780e17717cc9d70`

See more details on using hashes here.

jiajia-search 0.1.2

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

🌟 JiaJia-Search

🌏 语言切换 | Language Switch

📘 中文版本

📖 项目介绍

✨ 核心特性

🚀 快速安装

⚙️ 核心配置

📌 快速使用

1. 独立生成 Embedding（存入向量库）

2. 独立 BM25 全文检索

3. 独立向量语义检索

4. 独立 CrossEncoder 重排

5. 完整检索重排流水线（一键调用）

🧠 相关性分数标准

🏗️ 技术架构

架构流程图（核心流程）

分数归一化与融合策略

检索后端分数转换（Score Normalization）

融合核心策略（Fusion Strategy）

📦 支持模型

⚠️ 注意事项

修复与升级

📄 许可证

🤝 贡献与反馈

🌟 JiaJia-Search

🌏 语言切换 | Language Switch

📗 English Version

📖 Introduction

✨ Core Features

🚀 Quick Installation

⚙️ Core Configuration

📌 Quick Start

1. Independent Embedding Generation (For Vector DB)

2. Standalone BM25 Retrieval

3. Standalone Vector Retrieval

4. Standalone CrossEncoder Reranking

5. Full Search & Rerank Pipeline

🧠 Relevance Score Standard

🏗️ Technical Architecture

Architecture Flowchart (Core Process)

Score Normalization & Fusion Strategy

Retrieval Backend Score Conversion

Core Fusion Strategy

📦 Supported Models

⚠️ Notes

Fixes and Updates

📄 License

🤝 Contribution & Feedback

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes