Python bindings for memcache_hybrid implemented with pybind11
Project description
🔄Latest News
-
[2025/12] MemCache已作为vllm-ascend backend使能大模型推理加速,详情查看vllm-ascend开源社区,使用示例
-
[2025/11] MemCache项目于2025年11月开源,开源社区地址为:https://gitcode.com/Ascend/memcache
🔜 Roadmap
MemCache roadmap和版本分支策略详见: Roadmap
🎉概述
MemCache是针对LLM推理、GR推理场景设计的高性能分布式KVCache存储引擎,其主要特性包括:
- 基于对象操作的API:支持批量和非批量的put/get/exist/remove操作,支持多层结构的KV Block读写接口。
- 支持多副本:单个对象支持多副本放置到不同的LocalService,默认是单副本,支持put接口指定副本数量。
- 高带宽低时延:使用 MemFabric 作为多级内存和异构网络传输的底座,在Ascend硬件上,基于device_rdma(A2)、device_sdma(A3)、host_rdma(A2/A3) 等路径提供OneCopy跨机跨介质数据直接访问能力,满足高带宽,低时延的读写性能述求。在鲲鹏硬件上,支持host_urma(K5)。支持host_shm实现同节点共享内存通信。
- 支持扩缩容:支持LocalService动态加入和移除
- HA能力:在K8S集群中,MetaService支持多活能力,支持元数据恢复,提供尽力而为的HA能力。
🧩核心组件
MemCache包含LocalService和MetaService两大核心组件:
-
MetaService:
- 负责管理整个集群中内存池空间的分配和管理,处理LocalService的加入与退出。
- MetaService作为独立进程运行,提供两种启动方式:python API启动;二进制启动,详见 安装部署
- MetaService支持两种部署形态: 1、单点模式:MetaService由单个进程组成,部署方式简单,但存在单点故障的问题。如果MetaService进程崩溃或无法访问,系统将无法继续提供服务,直至重新恢复为止。 2、HA模式:该模式基于K8S的的ClusterIP Service和Lease资源构建,部署较为复杂,该模式会部署多个MetaService进程实例,实现多活高可用。部署详见 MetaService HA
-
LocalService:负责承担如下功能:
- 客户端:作为客户端,以whl/so形式作为共享库被应用进程加载调用API
- 内存提供者:负责提供一段连续的内存区域作为内存池空间的一部分,其内存可以被其他LocalService实例基于地址直接访问。
🔥性能表现
MemCache核心能力是提供大容量内存池和高性能的H2D、D2H、D2RH、RH2D数据访问能力,由于MemCache以 MemFabric 作为池化底座,所以支持RH2D、D2RH等OneCopy跨机跨介质数据直接访问能力,下图为RH2D对比其他中转路径的对比示意图。
基于OneCopy跨机跨介质数据直接访问的能力,MemCache在A2/A3做了相关性能测试如下: 模拟构造DeepSeek-R1模型KV大小的block,单个block size为:61x128K + 61x16K = 8784KB ≈ 8.57MB,共122个离散地址。
- 使用2个昇腾A2节点(每节点8张卡)组成双机内存池进行读写测试性能如下:
- 使用2个昇腾A3节点(每节点8张卡16Die)组成双机内存池进行读写测试性能如下:
🔍目录结构
├── LICENSE # LICENSE
├── .clang-format # 格式化配置
├── .gitmodules # git配置
├── .gitignore # git忽视配置文件
├── CMakeLists.txt # 项目的CMakeList
├── doc # 文档目录
├── example # 样例
│ ├── cpp # c++样例
│ └── python # python样例
├── script # 构建脚本
│ ├── build_and_pack_run.sh # 编译+打包脚本
│ ├── build.sh # 编译脚本
│ ├── run_ut.sh # 编译+运行ut脚本
├── test # test目录
│ ├── python # python测试用例
│ ├── k8s_deploy # k8s ha样例脚本
│ └── ut # 单元测试用例
├── src # 源码
│ ├── memcache # MemCache 源码
├── config # 配置目录
│ ├── mmc-local.conf # 本地服务配置文件模板
│ ├── mmc-meta.conf # meta服务配置文件模板
├── README.md
🚀快速入门
请访问以下文档获取简易教程。
- 编译部署:介绍组件编译和安装教程。
- 配置文件:涉及MetaService、LocalService公共配置
- 样例执行:介绍如何端到端执行样例代码,包括C++和Python样例。
- [最佳实践]:推出中...
📑学习教程
📦软件硬件配套说明
- MemCache软件依赖 MemFabric,相关配套与MemFabric相同
📌FAQ
常见问题请参考:FAQ
📝相关信息
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file memcache_hybrid-1.1.0-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 4.0 MB
- Tags: CPython 3.13, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
26c1b716a1ca3e0ce81a10f27df50099a02c89a105fb05b75aac02501fb33296
|
|
| MD5 |
aafba23c9e3e851c071798fdb17443f8
|
|
| BLAKE2b-256 |
01c37f5fe221887f1b1115245a89d036833692d3b3b6163a56af016e76c29a78
|
File details
Details for the file memcache_hybrid-1.1.0-cp313-cp313-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp313-cp313-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl
- Upload date:
- Size: 3.7 MB
- Tags: CPython 3.13, manylinux: glibc 2.26+ ARM64, manylinux: glibc 2.28+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d04211a2fc4f5df34913f36b67cdb28a186648c4a5e2e4d2bce0c18212fede55
|
|
| MD5 |
c402dae2c6ef9f48406cabadb3db849a
|
|
| BLAKE2b-256 |
f7c0576a5c873788a8250f73b9ca644f317921d323d831e309f2d6df6b785a8c
|
File details
Details for the file memcache_hybrid-1.1.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 4.0 MB
- Tags: CPython 3.12, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
08a2314f794fe0c12c144743277a5874cea74c1edb2d3d152ef8747922192525
|
|
| MD5 |
fba5d1107a67a1ddc1a8631a34cdbfb7
|
|
| BLAKE2b-256 |
9463f5910223a28dbaf0e895f733592cc2112e71c02e2d74bc2436ee75e9bd60
|
File details
Details for the file memcache_hybrid-1.1.0-cp312-cp312-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp312-cp312-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl
- Upload date:
- Size: 3.7 MB
- Tags: CPython 3.12, manylinux: glibc 2.26+ ARM64, manylinux: glibc 2.28+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9069957044bbb940c7a29dd70b12cbe1c1d34c54cd6a95697b50ec0721c97b92
|
|
| MD5 |
adb1e03118c182cdb9676a0f44f85ba4
|
|
| BLAKE2b-256 |
1a9df3f6155ec06ac47ca55a1b1ad05fd4343786e941384346ad94cedbeb5d04
|
File details
Details for the file memcache_hybrid-1.1.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 4.0 MB
- Tags: CPython 3.11, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aff9f6a0be20e558638028123533356d8f9e43d05c799bb1586bb80d3fab9908
|
|
| MD5 |
08d635fb75946b936eab38820770e00d
|
|
| BLAKE2b-256 |
83ab20ad429ec3550cfe1df226628a1ef448e0b9b89000c19e4d513307f60b83
|
File details
Details for the file memcache_hybrid-1.1.0-cp311-cp311-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp311-cp311-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl
- Upload date:
- Size: 3.7 MB
- Tags: CPython 3.11, manylinux: glibc 2.26+ ARM64, manylinux: glibc 2.28+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2c96e52ad8c0e3788a1ec31588d23ef72f9611ede5793fa51986471a0a60ad4f
|
|
| MD5 |
eabecf432bce361ec9ce60daa8d4672e
|
|
| BLAKE2b-256 |
d09ad7969a7deda5a9a4499325535fe76754716fc3714f64d0aefcf14336e6d0
|
File details
Details for the file memcache_hybrid-1.1.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 4.0 MB
- Tags: CPython 3.10, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3dd200d389baf2892f68bfbe4d2f3abe82384342b62ab6bdbb8c355a403ea9c2
|
|
| MD5 |
55c391beaeebd2f3931e08d89a3717e3
|
|
| BLAKE2b-256 |
8a7ae578448f38b17f287ba33f90f6751a1fca0114a5544572d39979870552cd
|
File details
Details for the file memcache_hybrid-1.1.0-cp310-cp310-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl.
File metadata
- Download URL: memcache_hybrid-1.1.0-cp310-cp310-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl
- Upload date:
- Size: 3.7 MB
- Tags: CPython 3.10, manylinux: glibc 2.26+ ARM64, manylinux: glibc 2.28+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b99cbb858b355cb93ae31bc2012d04baa7d0b2772004f1ddaa3be4eb70569895
|
|
| MD5 |
1bc936fc63ac8165e3910f978ca381a6
|
|
| BLAKE2b-256 |
47e19b68eb886689734728a766046cb522f987888dd434ead4a80fd10b55b037
|