Skip to main content

Memory library for seamless data ingestion, storage, and retrieval with customizable embedding models.

Project description

QMEM

Memory library for seamless data ingestion, storage, and retrieval with customizable embedding models.

QMem is a toolkit for vector search. It provides a command-line interface (CLI) and a Python library for interacting with a Qdrant database. It is designed for directness and utility, offering a guided CLI for interactive tasks and a minimal Python API for programmatic control..


Features

  • Store documents in a vector database (Qdrant(cloud) or Chroma(local))
  • Search text by semantic meaning, not just exact words
  • Apply filters to narrow search results
  • Choose embeddings from OpenAI, Gemini, Voyage, or MiniLM
  • MongoDB integration for analytics

Installation

pip install qmem

Getting Started

Step 1: Setup

Run once to configure qmem:

qmem init

You will be asked to select:

  • Database backend: Qdrant (cloud) or Chroma (local)
  • Embedding provider: OpenAI, Gemini, Voyage, or MiniLM
  • API keys and embedding model

Configuration is saved in .qmem/config.toml.


Step 2: Prepare Your Data

qmem expects a .jsonl file (one JSON object per line).

Example:

{"query": "What is the plot of Inception?", "response": "A thief enters dreams to steal secrets.", "summary": "Dream heist story.", "title": "Inception", "description": "A skilled thief enters people's dreams to steal corporate secrets.", "genre": "Sci-Fi", "year": 2010}
{"query": "Who is the main character in The Godfather?", "response": "Michael Corleone, son of Vito.", "summary": "Mafia family saga.", "title": "The Godfather", "description": "The patriarch of a crime dynasty hands control to his reluctant son.", "genre": "Crime", "year": 1972}
{"query": "What challenge does Batman face in The Dark Knight?", "response": "He must stop the Joker.", "summary": "Hero vs chaos.", "title": "The Dark Knight", "description": "Batman faces the Joker, a criminal mastermind terrorizing Gotham.", "genre": "Action", "year": 2008}
{"query": "What is Parasite about?", "response": "A poor family cons its way into a rich household.", "summary": "Class divide thriller.", "title": "Parasite", "description": "A poor family infiltrates a wealthy household with dire consequences.", "genre": "Thriller", "year": 2019}
{"query": "What is Interstellar about?", "response": "Explorers travel through a wormhole to save humanity.", "summary": "Space odyssey.", "title": "Interstellar", "description": "A group of explorers journey through a wormhole to find a new home.", "genre": "Adventure", "year": 2014}
  • query: main text to search on
  • response:response to the query
  • others: extra information

Step 3: Add Data

import qmem as qm

# Create a collection
qm.create(collection_name="testing", dim=1024, distance_metric="cosine")

# Ingest data from file
qm.ingest(file="data.jsonl", embed_field="query",payload_field="x,y,z") #what to put in payload , not added then everything goes in payload

Step 4: Search

results = qm.retrieve(
    query="who is batman",
    top_k=5,
    show=["description", "title"]
)
print(results)

Results are returned in a table showing similarity scores and payload fields.


Step 5: Use Filters

Build filters interactively (CLI):

qmem filter

Filters are saved under .qmem/filters/.

Apply a filter in Python:

filtered = qm.retrieve_filter(
    query="who is batman",
    filter_json=".qmem/filters/latest.json",
    top_k=5,
    show=["description", "title"]
)
print(filtered)

Step 6: Mirror Data to MongoDB

qm.mongo(
    collection_name="testing",
    mongo_db="final_test_db",
    mongo_collection="final_test",
    fields=["description", "title"]
)

This copies your Qdrant collection into MongoDB for downstream use.


CLI Commands

qmem init      # configure qmem
qmem filter    # create filters interactively
qmem index     # (only for Qdrant) build indices for faster filtering

Python API

  • qm.create() → Create a new collection
  • qm.ingest() → Add documents from a file
  • qm.retrieve() → Search documents by meaning
  • qm.retrieve_filter() → Search with filters
  • qm.mongo() → Copy a collection into MongoDB

Why use qmem?

  • Simple to set up and use
  • Works with both local (Chroma) and cloud (Qdrant) databases
  • AI embeddings understand meaning, not just exact words
  • Filters allow more precise retrieval
  • MongoDB integration enables analytics pipelines

Example Workflow

  1. qmem init → configure backend and embeddings
  2. qm.create(...) → create a collection
  3. qm.ingest(...) → add data from JSONL
  4. qm.retrieve(...) → search data by meaning
  5. qmem filter + qm.retrieve_filter(...) → apply filters
  6. qm.mongo(...) → mirror to MongoDB if needed

qmem makes it easy to:

  • Store your documents
  • Search them intelligently
  • Filter results by properties
  • Extend usage with MongoDB integration

Searching for “car” may also return results for “automobile” or “vehicle.”

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qmem-0.2.0.tar.gz (33.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

qmem-0.2.0-py3-none-any.whl (34.9 kB view details)

Uploaded Python 3

File details

Details for the file qmem-0.2.0.tar.gz.

File metadata

  • Download URL: qmem-0.2.0.tar.gz
  • Upload date:
  • Size: 33.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for qmem-0.2.0.tar.gz
Algorithm Hash digest
SHA256 1c742df2e3537770e2368dd1b334b21c752b8050e13295218a0f7f4928d64074
MD5 a245ae38382b1ecc76bccd45a6d2f118
BLAKE2b-256 4ab85565d776cc26f7d4cf8daf03544141ac55d2c00305d79a4a1c01d12366bd

See more details on using hashes here.

File details

Details for the file qmem-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: qmem-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 34.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for qmem-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a7af11e3b2772524901fc141c8a187ce087fc5ffeb77a7dae32b916cfd34b336
MD5 11f3202bd8676afda57c446e8c9ae4e1
BLAKE2b-256 afd4e024a0cefd4404e7354215370e1454d90f53d2f7566936d50671dfbe1d29

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page