
Machine-to-Memory Edge-Optimized Vector Search Database with Vulkan Compute API

Project description

M2M Vector Search Engine


Machine-to-Memory (M2M) Engine & Gaussian Splat Vector Store

A vector database with hierarchical retrieval for local-first applications, available in two explicit flavors: a minimal SQLite-style engine for the edge, and an advanced agent-style engine for intelligence workloads.


🎯 Overview

M2M Vector Search is a vector database built on Gaussian Splats with hierarchical retrieval (HRM2). Originally designed for generative exploration and Self-Organized Criticality (SOC), it has been refined into production-ready profiles for both raw speed (edge computing) and complex reasoning (agents).

Core Engine Features

| Feature | Description |
| --- | --- |
| Hierarchical Retrieval (HRM2) | Two-level clustering (Level 1 coarse, Level 2 fine) for sub-millisecond searches (sketched below). |
| Gaussian Splats | Full latent representation (μ, α, κ). |
| Local-First | No cloud dependencies; pure local Python/NumPy logic. |
| GPU Acceleration | Optional true Vulkan compute-shader acceleration for MoE routers. |
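
To make the two-level idea concrete, here is a minimal, illustrative sketch of coarse-to-fine routing in the spirit of HRM2. It is not M2M's internal implementation; it only uses NumPy and scikit-learn (already required dependencies), and the cluster counts are arbitrary.

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
data = rng.standard_normal((10_000, 640)).astype(np.float32)

# Level 1: coarse clusters over the whole store
coarse = KMeans(n_clusters=32, n_init=2, random_state=0).fit(data)

# Level 2: fine clusters inside each coarse cell
fine = {}
for c in range(32):
    members = np.where(coarse.labels_ == c)[0]
    k = min(16, len(members))
    fine[c] = (members, KMeans(n_clusters=k, n_init=1, random_state=0).fit(data[members]))

def search(query, k=10, n_probe=2):
    # Route the query to the nearest coarse cell(s), then scan only matching fine cells.
    c_dist = np.linalg.norm(coarse.cluster_centers_ - query, axis=1)
    candidates = []
    for c in np.argsort(c_dist)[:n_probe]:
        members, km = fine[c]
        f = np.argmin(np.linalg.norm(km.cluster_centers_ - query, axis=1))
        candidates.append(members[km.labels_ == f])
    cand = np.concatenate(candidates)
    d = np.linalg.norm(data[cand] - query, axis=1)
    return cand[np.argsort(d)[:k]]

print(search(rng.standard_normal(640).astype(np.float32)))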

🌟 Common Use Cases

  • Edge AI Devices: Run fast vector searches on resource-constrained devices without internet access.
  • Autonomous Agents: Use AdvancedVectorDB for agents that need to dynamically cluster and forget information over time.
  • Local RAG Pipelines: Integrate with LangChain/LlamaIndex for private document Q&A without sending data to APIs.

🌐 Omnimodal & Multimodal Ready

M2M does not care about the source of your vectors; it seamlessly stores and routes high-dimensional embeddings from any modality. By pairing M2M with state-of-the-art embedding models, you can achieve true omnimodal retrieval.

Supported Data Formats & Best Practices

| Modality | Recommended Embedding Model | Format / Best Practice |
| --- | --- | --- |
| Text | OpenAI text-embedding-3, BGE, all-MiniLM | Chunks of Markdown, raw string text, JSON blobs. Normalize text before embedding. |
| Images | OpenAI CLIP, SigLIP | .png, .jpg, .webp. Normalize vectors onto the unit sphere (S^D) to fit M2M's spherical mapping (see the snippet after this table). |
| Audio | ImageBind, Whisper-based encoders | .wav, .mp3. Embed 5-second acoustic slices to match natural HRM2 clustering. |
| Video | VideoMAE, ImageBind | Frame aggregations or temporal tokens. Group frames as "splat clusters". |
| Spatial/3D | PointNet++, 3D Gaussian Splatting | Pass raw splat features (μ, α, κ) directly for 3D routing applications. |
| Telemetry | Time2Vec | Server logs, IoT sensor data encoded to hyperspheres. |
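
As noted above for image (and other) embeddings, vectors should live on the unit hypersphere before insertion. A minimal sketch of that preprocessing step, assuming the SimpleVectorDB API shown later in this README:

import numpy as np

# L2-normalize embeddings so they lie on the unit hypersphere (S^D),
# matching M2M's spherical mapping.
embeddings = np.random.randn(1000, 640).astype(np.float32)   # stand-in for CLIP/BGE outputs
embeddings /= np.linalg.norm(embeddings, axis=1, keepdims=True) + 1e-12

query = np.random.randn(1, 640).astype(np.float32)
query /= np.linalg.norm(query, axis=1, keepdims=True) + 1e-12

# db.add(embeddings)
# results = db.search(query, k=10)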

🌓 Two Modes of Operation

Different applications need very different things: standard RAG needs a blazing-fast, "dumb" vector store, while autonomous agents need exploratory latent spaces. M2M therefore provides two compatible, interchangeable interfaces, plus a distributed cluster mode and a native knowledge-graph layer described below:

1. SimpleVectorDB

"The SQLite of Vector DBs"

Designed for raw edge computing and pure embedding retrieval. It strips away all advanced mechanics (generative sampling, memory tiering, entropy tracking) to maximize throughput and minimize RAM/VRAM footprint.

Best for: RAG workflows, embedding lookup, static local vector caches.

import numpy as np
from m2m import SimpleVectorDB

# Zero-configuration initialization
db = SimpleVectorDB(device='cpu')

# Add embeddings dynamically
db.add(np.random.randn(10000, 640).astype(np.float32))

# Blazing fast hierarchical search
results = db.search(np.random.randn(1, 640).astype(np.float32), k=10)

# Save/Load your index instantly
# db.load("vector_cache.bin")

2. AdvancedVectorDB

"The Cognitive Latent Space"

Designed for Autonomous Agents. Enables the 3-Tier Memory Manager (VRAM -> RAM -> SSD), Langevin Dynamics for generative vector exploration, and Self-Organized Criticality (SOC) to passively consolidate redundant memory.

Best for: Long-running Agents, dynamic memory systems, associative reasoning.

import numpy as np
from m2m import AdvancedVectorDB

# Initialize Full Cognitive Suite
agent_db = AdvancedVectorDB(device='vulkan')
agent_db.add(np.random.randn(50000, 640).astype(np.float32))

# 1. Standard Search
nearest = agent_db.search(np.random.randn(1, 640).astype(np.float32), k=10)

# 2. Generative Latent Exploration
# Uses Underdamped Langevin Dynamics to explicitly walk the energy manifold
creative_samples = agent_db.generate(query=np.random.randn(1, 640).astype(np.float32), n_steps=20)

# 3. Consolidate Memory via Self-Organized Criticality
# Automatically removes near-duplicate or useless splats based on access frequency
removed_count = agent_db.consolidate(threshold=0.85)
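
For intuition only, the toy sketch below shows what underdamped Langevin dynamics over a splat-defined energy landscape looks like in plain NumPy. It is a conceptual illustration, not the code behind agent_db.generate; the energy function, step size, friction, and temperature here are arbitrary choices.

import numpy as np

rng = np.random.default_rng(0)
mu = rng.standard_normal((8, 640)).astype(np.float32)   # toy splat means
kappa = 4.0                                              # shared concentration (toy value)

def grad_energy(x):
    # Energy = -log sum_i exp(-kappa/2 * ||x - mu_i||^2); gradient via softmax weights.
    d = x - mu
    sq = 0.5 * kappa * np.sum(d * d, axis=1)
    w = np.exp(-(sq - sq.min()))
    w /= w.sum()
    return kappa * (w[:, None] * d).sum(axis=0)

x = rng.standard_normal(640).astype(np.float32)          # start from a query-like point
v = np.zeros_like(x)
gamma, dt, temp = 1.0, 0.05, 0.1
for _ in range(20):                                      # n_steps=20, as in the example above
    noise = np.sqrt(2.0 * gamma * temp * dt) * rng.standard_normal(x.shape)
    v = v - gamma * v * dt - grad_energy(x) * dt + noise  # update velocity (friction + force + noise)
    x = x + v * dt                                        # walk the energy manifold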

3. M2M Cluster

"The Distributed Vector Network"

Designed for horizontal scalability and high availability. It wraps multiple M2M instances (typically SimpleVectorDB on edge devices) into a unified cluster, exposing a seamless M2MClusterClient for distributed routing & aggregation (Reciprocal Rank Fusion).

This mode natively supports a distributed HTTP architecture built on FastAPI, allowing instances to run on completely separate machines and communicate safely via HTTP REST endpoints. Out of the box it provides semantic/geo-aware sharding, least-loaded auto-balancing, and an offline SyncQueue so edge devices can buffer queries independently through network blackouts.

Best for: hybrid edge-cloud setups, sharding large datasets (>100K vectors), failure-resistant local clusters.

import numpy as np
from m2m import M2MConfig
from m2m.cluster import EdgeNode, ClusterRouter, M2MClusterClient

# Initialize nodes (can be on different machines)
config = M2MConfig(device='cpu')
edge1 = EdgeNode(edge_id="edge-1", config=config)
edge2 = EdgeNode(edge_id="edge-2", config=config)

# Setup routing and client
router = ClusterRouter()
client = M2MClusterClient(in_memory_router=router)
client.register_local_edge(edge1)
client.register_local_edge(edge2)

# Distributed ingestion (auto-shards data)
client.ingest(np.random.randn(1000, 640).astype(np.float32))

# Distributed search (queries all edges, merges with RRF)
results = client.search(np.random.randn(1, 640).astype(np.float32), k=10)
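
The example above keeps both edges in-process. For the HTTP mode described earlier, each node would run behind its own FastAPI server on a separate machine. The sketch below shows one way such a node could be wrapped; the route name, payload shape, and result conversion are assumptions for illustration, not M2M's shipped HTTP API.

from typing import List

import numpy as np
from fastapi import FastAPI
from pydantic import BaseModel
from m2m import SimpleVectorDB

app = FastAPI()
db = SimpleVectorDB(device='cpu')
db.add(np.random.randn(1000, 640).astype(np.float32))

class Query(BaseModel):
    vector: List[float]
    k: int = 10

@app.post("/search")
def search(q: Query):
    # NOTE: the exact return structure of SimpleVectorDB.search is not documented
    # in this README; converting it via np.asarray(...) is an assumption.
    results = db.search(np.asarray([q.vector], dtype=np.float32), k=q.k)
    return {"results": np.asarray(results).tolist()}

# Run with, e.g.: uvicorn node:app --host 0.0.0.0 --port 8001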

Note: Both systems utilize the same underlying SplatStore and HRM2Engine. An index built and persisted in SimpleVectorDB can be loaded natively into AdvancedVectorDB or EdgeNode, and vice-versa!
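
A small sketch of that round trip, assuming a save counterpart to the load call shown in the SimpleVectorDB example above (the save method name is an assumption, not confirmed API):

import numpy as np
from m2m import SimpleVectorDB, AdvancedVectorDB

edge_db = SimpleVectorDB(device='cpu')
edge_db.add(np.random.randn(1000, 640).astype(np.float32))
edge_db.save("vector_cache.bin")      # hypothetical: persist the shared SplatStore/HRM2 index

agent_db = AdvancedVectorDB(device='cpu')
agent_db.load("vector_cache.bin")     # same on-disk format, loaded into the advanced engine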

4. Native Entity Extractor & Gaussian Graphs

"The Zero-Dependency Knowledge Core"

M2M includes a Native Entity Extractor and Gaussian Graph Store that build a structured knowledge graph dynamically. Instead of relying on heavy external NER models (such as GLiNER), it uses n-grams, structural patterns, and the S^639 hypersphere latent space to cluster entities.

Feature Comparison: M2M Native Extractor vs GLiNER

| Feature | GLiNER (External) | M2M Native Extractor |
| --- | --- | --- |
| Dependencies | Requires extra libraries | Only numpy + sklearn |
| VRAM/RAM Usage | ~500 MB for the model | ~0 MB (reuses the HRM2 structure) |
| Inference Speed | 50-200 ms per text | 10-50 ms per text |
| Offline Use | Needs pre-downloaded weights | 100% offline |
| Integration | External wrapper needed | Natively integrated into GraphStore |

import numpy as np
from m2m.graph_splat import GaussianGraphStore
from m2m.entity_extractor import M2MEntityExtractor, M2MGraphEntityExtractor

# Fully integrated pipeline
store = GaussianGraphStore(dim=640)
extractor = M2MEntityExtractor()
graph_pipeline = M2MGraphEntityExtractor(extractor, store)

dummy_embedding = np.random.randn(640).astype(np.float32)  # placeholder document embedding

doc_id = store.add_document("Apple Inc. reported strong earnings.", dummy_embedding)
results = graph_pipeline.extract_and_store(
    text="Apple Inc. reported strong earnings.",
    doc_embedding=dummy_embedding,
    doc_id=doc_id,
    embedding_model=my_embedding_model  # any model that maps text to 640-d vectors
)
# Automatically builds MENTIONS relationships between Documents and extracted Entities.

🔗 Integrations

M2M natively supports the industry-standard frameworks for building RAG applications and Agentic workflows.

LangChain Integration

from m2m.integrations.langchain import M2MVectorStore  # assumed import path (mirrors the LlamaIndex integration below); M2MVectorStore is not part of langchain itself
from langchain.embeddings import HuggingFaceEmbeddings

embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")

vectorstore = M2MVectorStore(
    embedding_function=embeddings.embed_query,
    splat_capacity=100000,
    enable_vulkan=True
)

vectorstore.add_texts(["Document 1", "Document 2"])
results = vectorstore.similarity_search("Query", k=5)

LlamaIndex Integration

from llama_index.core import VectorStoreIndex, SimpleDirectoryReader, StorageContext
from m2m.integrations.llamaindex import M2MVectorStore

documents = SimpleDirectoryReader("./docs").load_data()

vector_store = M2MVectorStore(latent_dim=640, max_splats=100000, enable_vulkan=True)
storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)

query_engine = index.as_query_engine()
response = query_engine.query("Your search query")

🏗 Architecture


The 3-Tier Memory Hierarchy (Advanced Mode)

| Tier | Storage | Latency | Use Case |
| --- | --- | --- | --- |
| Hot | VRAM | ~0.1 ms | Active queries, highly recurrent context. |
| Warm | RAM | ~0.5 ms | Cached HRM2 embeddings, mid-term context. |
| Cold | SSD | ~10 ms | Long-term persisted cold storage. |
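
As a rough mental model of the tiering policy (not the library's memory manager), the toy sketch below promotes frequently accessed splats toward the hot tier and lets the rest settle into warm and cold; the capacities and ranking rule are arbitrary.

from collections import Counter

TIERS = ["hot", "warm", "cold"]          # VRAM -> RAM -> SSD

class ToyTierManager:
    def __init__(self, hot_capacity=2, warm_capacity=4):
        self.tier = {}                   # splat_id -> tier name
        self.hits = Counter()
        self.hot_capacity = hot_capacity
        self.warm_capacity = warm_capacity

    def access(self, splat_id):
        # Record an access, then re-rank all known splats by hit count.
        self.hits[splat_id] += 1
        self.tier.setdefault(splat_id, "cold")
        self.rebalance()

    def rebalance(self):
        ranked = [sid for sid, _ in self.hits.most_common()]
        for rank, sid in enumerate(ranked):
            if rank < self.hot_capacity:
                self.tier[sid] = "hot"
            elif rank < self.hot_capacity + self.warm_capacity:
                self.tier[sid] = "warm"
            else:
                self.tier[sid] = "cold"

mgr = ToyTierManager()
for sid in [1, 1, 1, 2, 3, 4, 5, 2]:
    mgr.access(sid)
print(mgr.tier)    # frequently-hit splats end up "hot"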

Component Breakdown

  • M2MEngine: The main router and orchestrator.
  • SplatStore: splats.py handles the physical tensors and GPU tracking.
  • HRM2Engine: hrm2_engine.py builds the two-level K-Means lookup tree.

⚖️ Comparison with other Vector DBs

| Feature | M2M Vector Search | FAISS | Pinecone | Chroma |
| --- | --- | --- | --- | --- |
| Deployment | Local / Edge | Local | Cloud / Managed | Local / Server |
| Engine Focus | Gaussian Splats, SOC | IVF, HNSW | Proprietary ANN | SQLite / HNSW |
| GPU Support | Vulkan (cross-platform) | CUDA (NVIDIA only) | N/A | N/A (mostly CPU) |
| Agentic Features | Yes (memory tiering, SOC) | No | No | No |

📊 Benchmarks


Test Configuration

| Parameter | Value |
| --- | --- |
| CPU | Dual-core local edge device |
| RAM | 2 GB available |
| Vectors | 10,000 (sklearn fallback) |
| Dimensions | 640D |

Results (Sklearn Fallback - Dense/Sub-optimal)

| System | Avg Latency | Throughput | Speedup |
| --- | --- | --- | --- |
| Linear Scan | 47.80 ms | 20.92 QPS | 1.0x (baseline) |
| M2M CPU | 81.03 ms | 12.34 QPS | 0.6x |
| M2M Vulkan | 73.45 ms | 13.61 QPS | 0.7x |
| M2M Transformed | 8.68 ms | 115.20 QPS | 5.5x |

(Reproduce local benchmarks via python benchmarks/run_benchmark.py --dataset sklearn --n-splats 10000 --n-queries 1000 --k 10 --device all)

Results (Synthetic Clustered - Ideal Heterogeneous Case)

When data naturally forms tight clusters (the ideal environment for M2M's hierarchical routing):

| System | Avg Latency | Throughput | Speedup |
| --- | --- | --- | --- |
| Linear Scan | 47.55 ms | 21.03 QPS | 1.0x (baseline) |
| M2M CPU | 78.67 ms | 12.71 QPS | 0.6x |
| M2M Vulkan | 76.98 ms | 12.99 QPS | 0.6x |
| M2M Transformed | 9.71 ms | 103.01 QPS | 4.9x |

(Reproduce local benchmarks via python benchmarks/run_benchmark.py --dataset clustered --n-splats 10000 --n-queries 1000 --k 10 --device all)


🚀 Installation

System Requirements

| Component | Minimum | Recommended |
| --- | --- | --- |
| OS | Windows 10, Linux, macOS | Linux / Windows |
| CPU | 2 cores | 4+ cores |
| RAM | 2 GB | 8+ GB |
| GPU | Optional (any Vulkan 1.0+ compatible device) | Dedicated GPU (NVIDIA/AMD) with Vulkan support |

Note on Homogeneous Distributions vs. Latency (LSH Integration): If vectors are perfectly homogeneous (a highly dense cluster without clear boundaries), the internal K-Means index struggles to separate them into distinct semantic paths. The HRM2 engine must then probe multiple overlapping clusters, pushing latency toward O(N) linear time instead of the ideal O(sqrt(N)) sub-linear speedup seen in normally distributed or distinctly grouped datasets.

Solution for Homogeneous Data: For strictly homogeneous datasets, it is highly recommended to pair M2M with Locality-Sensitive Hashing (LSH) or similar approximate pre-filters. By using LSH to quickly reduce the search space into a smaller candidate pool, you can then rely on M2M's exact scan for final ranking. Alternatively, you can use the built-in M2MDatasetTransformer to artificially induce clustered hierarchies onto your flat distributions, instantly restoring sub-millisecond search capabilities.
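
A minimal sketch of the LSH pre-filter idea, using random hyperplane hashing to shrink a flat dataset to a small candidate pool before an exact final ranking (the hash width and fallback rule here are arbitrary; M2MDatasetTransformer itself is not shown because its API is not documented in this section):

import numpy as np

rng = np.random.default_rng(0)
data = rng.standard_normal((20_000, 640)).astype(np.float32)
planes = rng.standard_normal((640, 16)).astype(np.float32)     # 16-bit random-hyperplane hash

def lsh_code(x):
    # Sign pattern against the random hyperplanes -> integer bucket id.
    bits = (x @ planes) > 0
    return bits @ (1 << np.arange(16))

codes = lsh_code(data)

def search(query, k=10):
    candidates = np.where(codes == lsh_code(query))[0]          # LSH pre-filter
    if len(candidates) < k:                                      # fall back to a full scan if the bucket is tiny
        candidates = np.arange(len(data))
    d = np.linalg.norm(data[candidates] - query, axis=1)        # exact ranking on the candidate pool
    return candidates[np.argsort(d)[:k]]

print(search(rng.standard_normal(640).astype(np.float32)))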

Prerequisites

  • Python 3.8+
  • NumPy 1.24+
  • scikit-learn 1.2+
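
From PyPI

The package is published on PyPI as m2m-vector-search (the distribution files are listed below), so a release install should normally be:

pip install m2m-vector-search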

From Source

git clone https://github.com/schwabauerbriantomas-gif/m2m-vector-search.git
cd m2m-vector-search
pip install -r requirements.txt

Verify your installation:

python scripts/validate_project.py

🛠️ Troubleshooting

Having issues? Check out our Troubleshooting Guide for solutions to common problems.


📄 License & References

Licensed under the AGPLv3.


M2M: Machine-to-Memory



Download files

Download the file for your platform.

Source Distribution

m2m_vector_search-1.5.0.tar.gz (85.3 kB)

Uploaded Source

Built Distribution


m2m_vector_search-1.5.0-py3-none-any.whl (85.1 kB)

Uploaded Python 3

File details

Details for the file m2m_vector_search-1.5.0.tar.gz.

File metadata

  • Download URL: m2m_vector_search-1.5.0.tar.gz
  • Size: 85.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for m2m_vector_search-1.5.0.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | ece1ffaf59a0d5c51b4ddb8ed2f18ea4ab821afd9bdce1e656cfd88967d746cd |
| MD5 | 0ce3056c0acdc7374f36114292e6e99b |
| BLAKE2b-256 | 88dbf57fec29b038c471f584b8c6fe5c7f2e6d55f1bf2d7f9721b64e6c38e05e |


Provenance

The following attestation bundles were made for m2m_vector_search-1.5.0.tar.gz:

Publisher: publish.yml on schwabauerbriantomas-gif/m2m-vector-search

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file m2m_vector_search-1.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for m2m_vector_search-1.5.0-py3-none-any.whl

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | 580afd1257f354fa20eff0b82e37ea458308637d7eb13837463a2be06023a8e5 |
| MD5 | 3a9c89f9e4ed2436fe95efb5fa05e9d8 |
| BLAKE2b-256 | e3d5996fb14256a173e8d1054418c4d0c7299e04ac689a295512167495bacdd7 |


Provenance

The following attestation bundles were made for m2m_vector_search-1.5.0-py3-none-any.whl:

Publisher: publish.yml on schwabauerbriantomas-gif/m2m-vector-search

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
