Portable, tier-selected memory system (record + vector + hybrid search) for agents and apps. Ports & adapters; runs from a laptop SQLite file up to a full Milvus/vLLM cluster.

These details have not been verified by PyPI

Project description

era-memory

A portable, tier-selected memory system for agents and apps — record storage, vector storage, and hybrid (vector + lexical) search with RRF fusion, recency decay, and deduplication. The retrieval logic is identical at every scale; only the adapters change.

It is ports & adapters: all logic depends on nine small interfaces (RecordStore, VectorStore, Embedder, Queue, Extractor, BlobStore, KMS, Auth, Telemetry), and one MEMORY_TIER knob selects an adapter set.

Tier	Stack	Extras
0 — laptop / offline	SQLite + sqlite-vec, in-process	`era-memory[tier0]`
1 — team / VM / Cloud SQL	Postgres + pgvector, hosted embeddings, HTTP	`era-memory[tier1]`
2 — enterprise	Milvus + vLLM + Redis Sentinel + GCP KMS	`era-memory[milvus,vllm,redis,gcp]`

The base package has zero third-party dependencies — no mandatory cloud account, GPU, or private registry to run Tier 0/1. Every backend is an optional extra.

Install

Not yet on PyPI. Until the first release is published, install from source. See INSTALL.md for the full guide (extras, embedder setup, service deploy).

git clone https://github.com/Era-Laboratories/era-memory.git && cd era-memory
pip install -e ".[tier0]"        # laptop/offline; swap for [tier1] for the Postgres service

The Tier 2 observability stack uses private Era packages and is not part of the public package — Era-internal deploys install it separately via requirements-internal.txt. Every extra listed above is fully public.

Status

M0, M1, and M2 are done. The single conformance suite passes against the in-memory, SQLite, and Postgres/pgvector backends; Tier 1 has been validated end-to-end against a live docker compose stack. 104 tests pass with Postgres (81 + 4 skipped without it). See docs/PROGRESS.md for the milestone detail and docs/era-memory-light-spec.md for the full spec.

Not yet available (tracked for the public release): PyPI package (install from source for now); Tier 2 (Milvus/vLLM/Redis) adapters. (The offline ONNX embedder is now shipped — see the embedder note above and era-memory setup.)

Deploying inside Era Labs Tools? Start with docs/HANDOVER-era-labs-tools.md. Everyone else: see INSTALL.md.

⚠️ Read this before judging retrieval quality. The default embedder is a deterministic hashed bag-of-words stand-in. It exercises the storage/search plumbing so the library runs with zero setup, but it is not semantic — a query only matches on shared literal tokens, so "what does Ada drink?" will not find "Ada prefers dark roast coffee". For real retrieval, set up a real embedder (one-time):
pip install "era-memory[localembed]"   # ONNX, CPU-only
era-memory setup                        # downloads a small model from Hugging Face, caches it
After that, a plain build_memory(...) auto-uses the cached model — no endpoint, GPU, or API key. Prefer a one-liner? build_memory(embedder="auto") downloads-if-needed and serves it. Prefer a hosted/local endpoint instead? Point the OpenAI-compatible embedder (era_memory.adapters.openai.OpenAICompatibleEmbedder, or MEMORY_EMBEDDING_URL) at any /embeddings server — OpenAI, Ollama, vLLM, LiteLLM. See Choosing a store, embedder, and dimension and INSTALL.md.

Use it as a library (in-process, no infra)

import asyncio
from era_memory import build_memory, MemoryRecord, SearchRequest, SessionPayload

async def main():
    # Tier 0, persisted to a single SQLite file (records + vectors + FTS together).
    #
    # With no `embedder=` this auto-uses a local model if you've run `era-memory setup`,
    # otherwise it falls back to the non-semantic dev embedder (see the warning above).
    # `embedder="auto"` downloads-and-serves a local model on the spot.
    mem = build_memory(tier=0, db_path="memory.db")

    await mem.store(MemoryRecord(user_id="u1", content="Ada prefers dark roast coffee"))

    # Or extract memories from a raw conversation (heuristic extractor by default):
    await mem.encode(SessionPayload(
        user_id="u1", session_id="s1",
        conversation="User: I just adopted a cat named Mochi.\nUser: She loves tuna.",
    ))

    res = await mem.search(SearchRequest(user_id="u1", query="what does Ada drink?"))
    print(res.results[0].content if res.results else "no results")

asyncio.run(main())

Drop db_path for a pure in-memory store (tests, ephemeral use).

Run it as a service (Tier 1)

docker compose up --build          # Postgres+pgvector + the HTTP API on :8080

TOKEN="change-me"
curl -s -H "Authorization: Bearer $TOKEN" -H "X-User-Id: u1" -H "Content-Type: application/json" \
  -X POST localhost:8080/api/memories -d '{"content":"Ada prefers dark roast coffee"}'

curl -s -H "Authorization: Bearer $TOKEN" -H "X-User-Id: u1" -H "Content-Type: application/json" \
  -X POST localhost:8080/api/memories/search -d '{"query":"coffee"}'

HTTP routes: GET /health, GET /ready, POST /api/memories, POST /api/memories/search. Configure via env — see docs/HANDOVER-era-labs-tools.md.

Architecture in one paragraph

Records and vectors live behind the RecordStore and VectorStore ports. At Tiers 0/1 they share one backend (SQLite file / one Postgres DB), so the dual-write collapses into a single transaction — there is no "record saved but vector failed" orphan state. At Tier 2 they are separate systems (Postgres + Milvus) and the orchestrator preserves era-core's fail-fast semantics. Hybrid search fuses a vector-ANN leg and a lexical leg via Reciprocal Rank Fusion (k=60, 0.6/0.4), then scales by importance and a 30-day recency half-life.

Choosing a store, embedder, and dimension

The embedding dimension is not fixed — the vector column is created at whatever dimension your embedder emits (vec0(dim) on SQLite, halfvec(dim) on Postgres, where halfvec indexes up to 4000 dims). Pick a store + embedder + dimension to match your deployment; nothing in the retrieval logic changes.

Deployment	Store	Embedder (license)	Dim
Laptop / offline / tests	SQLite + sqlite-vec	`all-MiniLM-L6-v2` (Apache-2.0)	384
Small team / single VM	SQLite or Postgres + pgvector	`bge-base-en-v1.5` (MIT) / `nomic-embed-text-v1.5` (Apache-2.0)	768
Production / Cloud SQL	Postgres + pgvector (`halfvec`)	`mxbai-embed-large-v1` (Apache-2.0) or OpenAI `text-embedding-3-*` (MRL→1024)	1024
Enterprise / era-core parity	Postgres + Milvus	Qwen3 family via vLLM, MRL→2048	2048

These are starting points, not the only valid choices — any model your endpoint serves works. There is one hard rule (enforced by a (model, dim) guard and a startup check): a store is pinned to a single (model, dim) for its whole life. The contract is same vector space on both write and query — which means same model, same dimension, same MRL-truncation length, and same normalization. Changing the model or dimension later means re-embedding into a new store, not a config flip. Lightweight models cap at ~1024 native dims; reaching 2048+ requires a larger model (e.g. Qwen3) MRL-truncated down — see docs/adr/0001-dimension-is-a-per-deployment-contract.md for the full reasoning.

Set the dimension with MEMORY_EMBEDDING_DIMENSIONS (or pass an Embedder whose .dimensions is the source of truth — if you do both and they disagree, wiring fails fast).

Development

python -m venv .venv && . .venv/bin/activate
pip install -e ".[dev,sqlite,postgres,openai,server,encryption]"
pytest -q                                              # 81 + 4 skipped (no Postgres)
MEMORY_TEST_PG_DSN=postgresql://postgres:era@localhost:55433/era pytest -q   # 104 (with Postgres)
ruff check src tests

A new backend is added by implementing a port and making it pass the existing tests/conformance/ suite — that is the contract.

License

Apache-2.0.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.1

Jun 21, 2026

This version

0.1.0

Jun 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

era_memory-0.1.0.tar.gz (70.8 kB view details)

Uploaded Jun 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

era_memory-0.1.0-py3-none-any.whl (53.1 kB view details)

Uploaded Jun 21, 2026 Python 3

File details

Details for the file era_memory-0.1.0.tar.gz.

File metadata

Download URL: era_memory-0.1.0.tar.gz
Upload date: Jun 21, 2026
Size: 70.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for era_memory-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`f8b10166461b4cb3390628b6f504cded24e87598c8f64cb577ae0d525f3b9692`
MD5	`e7a3280b3b2b2b2dd0b8bc9cb679954a`
BLAKE2b-256	`c12fb1e33183cb6d1a70810e902f8f149cc14f794972c5e4c6e4ffbb102dd980`

See more details on using hashes here.

Provenance

The following attestation bundles were made for era_memory-0.1.0.tar.gz:

Publisher: release.yml on Era-Laboratories/era-memory

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: era_memory-0.1.0.tar.gz
- Subject digest: f8b10166461b4cb3390628b6f504cded24e87598c8f64cb577ae0d525f3b9692
- Sigstore transparency entry: 1901842923
- Sigstore integration time: Jun 21, 2026
Source repository:
- Permalink: Era-Laboratories/era-memory@ff2dcaa7b025da11afb93d2dd5d83a60eeb1619a
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Era-Laboratories
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@ff2dcaa7b025da11afb93d2dd5d83a60eeb1619a
- Trigger Event: push

File details

Details for the file era_memory-0.1.0-py3-none-any.whl.

File metadata

Download URL: era_memory-0.1.0-py3-none-any.whl
Upload date: Jun 21, 2026
Size: 53.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for era_memory-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5390c0316a403240dc79ed8f5a5707ec6dc804161e2f0b89d886dddfa60603ba`
MD5	`ada5dd79b6ced30be5a278d4a265630e`
BLAKE2b-256	`ea247dcad2a771a0eb65559a23760820edd0aa3ad18fed72b0a775abc2b8391c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for era_memory-0.1.0-py3-none-any.whl:

Publisher: release.yml on Era-Laboratories/era-memory

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: era_memory-0.1.0-py3-none-any.whl
- Subject digest: 5390c0316a403240dc79ed8f5a5707ec6dc804161e2f0b89d886dddfa60603ba
- Sigstore transparency entry: 1901843208
- Sigstore integration time: Jun 21, 2026
Source repository:
- Permalink: Era-Laboratories/era-memory@ff2dcaa7b025da11afb93d2dd5d83a60eeb1619a
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Era-Laboratories
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@ff2dcaa7b025da11afb93d2dd5d83a60eeb1619a
- Trigger Event: push

era-memory 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

era-memory

Install

Status

Use it as a library (in-process, no infra)

Run it as a service (Tier 1)

Architecture in one paragraph

Choosing a store, embedder, and dimension

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance