Skip to main content

Memory library for seamless data ingestion, storage, and retrieval with customizable embedding models.

Project description

QMEM

Memory library for seamless data ingestion, storage, and retrieval with customizable embedding models.

QMem is a toolkit for vector search. It provides a command-line interface (CLI) and a Python library for interacting with a Qdrant database. It is designed for directness and utility, offering a guided CLI for interactive tasks and a minimal Python API for programmatic control..


Features

  • Store documents in a vector database (Qdrant(cloud) or Chroma(local))
  • Search text by semantic meaning, not just exact words
  • Apply filters to narrow search results
  • Choose embeddings from OpenAI, Gemini, Voyage, or MiniLM
  • MongoDB integration for analytics

Installation

pip install qmem

Getting Started

Step 1: Setup

Run once to configure qmem:

qmem init

You will be asked to select:

  • Database backend: Qdrant (cloud) or Chroma (local)
  • Embedding provider: OpenAI, Gemini, Voyage, or MiniLM
  • API keys and embedding model

Configuration is saved in .qmem/config.toml.


Step 2: Prepare Your Data

qmem expects a .jsonl file (one JSON object per line).

Example:

{"query": "What is the plot of Inception?", "response": "A thief enters dreams to steal secrets.", "summary": "Dream heist story.", "title": "Inception", "description": "A skilled thief enters people's dreams to steal corporate secrets.", "genre": "Sci-Fi", "year": 2010}
{"query": "Who is the main character in The Godfather?", "response": "Michael Corleone, son of Vito.", "summary": "Mafia family saga.", "title": "The Godfather", "description": "The patriarch of a crime dynasty hands control to his reluctant son.", "genre": "Crime", "year": 1972}
{"query": "What challenge does Batman face in The Dark Knight?", "response": "He must stop the Joker.", "summary": "Hero vs chaos.", "title": "The Dark Knight", "description": "Batman faces the Joker, a criminal mastermind terrorizing Gotham.", "genre": "Action", "year": 2008}
{"query": "What is Parasite about?", "response": "A poor family cons its way into a rich household.", "summary": "Class divide thriller.", "title": "Parasite", "description": "A poor family infiltrates a wealthy household with dire consequences.", "genre": "Thriller", "year": 2019}
{"query": "What is Interstellar about?", "response": "Explorers travel through a wormhole to save humanity.", "summary": "Space odyssey.", "title": "Interstellar", "description": "A group of explorers journey through a wormhole to find a new home.", "genre": "Adventure", "year": 2014}
  • query: main text to search on
  • response:response to the query
  • others: extra information

Step 3: Add Data

import qmem as qm

# Create a collection
qm.create(collection_name="testing", dim=1024, distance_metric="cosine")

# Ingest data from file
qm.ingest(file="data.jsonl", embed_field="query",payload_field="x,y,z") #what to put in payload , not added then everything goes in payload

Step 4: Search

results = qm.retrieve(
    query="who is batman",
    top_k=5,
    show=["description", "title"]
)
print(results)

Results are returned in a table showing similarity scores and payload fields.


Step 5: Use Filters

Build filters interactively (CLI):

qmem filter

Filters are saved under .qmem/filters/.

Apply a filter in Python:

filtered = qm.retrieve_filter(
    query="who is batman",
    filter_json=".qmem/filters/latest.json",
    top_k=5,
    show=["description", "title"]
)
print(filtered)

Step 6: Mirror Data to MongoDB

qm.mongo(
    collection_name="testing",
    mongo_db="final_test_db",
    mongo_collection="final_test",
    fields=["description", "title"]
)

This copies your Qdrant collection into MongoDB for downstream use.


CLI Commands

qmem init      # configure qmem
qmem filter    # create filters interactively
qmem index     # (only for Qdrant) build indices for faster filtering

Python API

  • qm.create() → Create a new collection
  • qm.ingest() → Add documents from a file
  • qm.retrieve() → Search documents by meaning
  • qm.retrieve_filter() → Search with filters
  • qm.mongo() → Copy a collection into MongoDB

Why use qmem?

  • Simple to set up and use
  • Works with both local (Chroma) and cloud (Qdrant) databases
  • AI embeddings understand meaning, not just exact words
  • Filters allow more precise retrieval
  • MongoDB integration enables analytics pipelines

Example Workflow

  1. qmem init → configure backend and embeddings
  2. qm.create(...) → create a collection
  3. qm.ingest(...) → add data from JSONL
  4. qm.retrieve(...) → search data by meaning
  5. qmem filter + qm.retrieve_filter(...) → apply filters
  6. qm.mongo(...) → mirror to MongoDB if needed

qmem makes it easy to:

  • Store your documents
  • Search them intelligently
  • Filter results by properties
  • Extend usage with MongoDB integration

Searching for “car” may also return results for “automobile” or “vehicle.”

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qmem-0.1.8.tar.gz (31.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

qmem-0.1.8-py3-none-any.whl (32.5 kB view details)

Uploaded Python 3

File details

Details for the file qmem-0.1.8.tar.gz.

File metadata

  • Download URL: qmem-0.1.8.tar.gz
  • Upload date:
  • Size: 31.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for qmem-0.1.8.tar.gz
Algorithm Hash digest
SHA256 1b00c41a9b5ff5c083665ad4b5b0eca212a78119185b2aedf97a0badee58418e
MD5 2c69840df9a5f27a85b993ce59c1a693
BLAKE2b-256 249f47e7114c7206988f456779353ba0b3e55ed5192b32c2c3ad5534e6e3f0c4

See more details on using hashes here.

File details

Details for the file qmem-0.1.8-py3-none-any.whl.

File metadata

  • Download URL: qmem-0.1.8-py3-none-any.whl
  • Upload date:
  • Size: 32.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for qmem-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 288ab7700c6b6ccfc4fcc2cc7cb5711dd598a9d5c336974181777be7a24c1a94
MD5 1182d6746c543ae02aabb4dfa8dc657c
BLAKE2b-256 1023157c8f7f1649f17578f65bced27ec4cf3436b67b499d5d84e97486da6a0b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page