Skip to main content

Memory library for seamless data ingestion, storage, and retrieval with customizable embedding models.

Project description

QMEM

Memory library for seamless data ingestion, storage, and retrieval with customizable embedding models.

QMem is a toolkit for vector search. It provides a command-line interface (CLI) and a Python library for interacting with a Qdrant database. It is designed for directness and utility, offering a guided CLI for interactive tasks and a minimal Python API for programmatic control..


Features

  • Store documents in a vector database (Qdrant(cloud) or Chroma(local))
  • Search text by semantic meaning, not just exact words
  • Apply filters to narrow search results
  • Choose embeddings from OpenAI, Gemini, Voyage, or MiniLM
  • MongoDB integration for analytics

Installation

pip install qmem

Getting Started

Step 1: Setup

Run once to configure qmem:

qmem init

You will be asked to select:

  • Database backend: Qdrant (cloud) or Chroma (local)
  • Embedding provider: OpenAI, Gemini, Voyage, or MiniLM
  • API keys and embedding model

Configuration is saved in .qmem/config.toml.


Step 2: Prepare Your Data

qmem expects a .jsonl file (one JSON object per line).

Example:

{"query": "What is the plot of Inception?", "response": "A thief enters dreams to steal secrets.", "summary": "Dream heist story.", "title": "Inception", "description": "A skilled thief enters people's dreams to steal corporate secrets.", "genre": "Sci-Fi", "year": 2010}
{"query": "Who is the main character in The Godfather?", "response": "Michael Corleone, son of Vito.", "summary": "Mafia family saga.", "title": "The Godfather", "description": "The patriarch of a crime dynasty hands control to his reluctant son.", "genre": "Crime", "year": 1972}
{"query": "What challenge does Batman face in The Dark Knight?", "response": "He must stop the Joker.", "summary": "Hero vs chaos.", "title": "The Dark Knight", "description": "Batman faces the Joker, a criminal mastermind terrorizing Gotham.", "genre": "Action", "year": 2008}
{"query": "What is Parasite about?", "response": "A poor family cons its way into a rich household.", "summary": "Class divide thriller.", "title": "Parasite", "description": "A poor family infiltrates a wealthy household with dire consequences.", "genre": "Thriller", "year": 2019}
{"query": "What is Interstellar about?", "response": "Explorers travel through a wormhole to save humanity.", "summary": "Space odyssey.", "title": "Interstellar", "description": "A group of explorers journey through a wormhole to find a new home.", "genre": "Adventure", "year": 2014}
  • query: main text to search on
  • response:response to the query
  • others: extra information

Step 3: Add Data

import qmem as qm

# Create a collection
qm.create(collection_name="testing", dim=1024, distance_metric="cosine")

# Ingest data from file
qm.ingest(file="data.jsonl", embed_field="query",payload_field="x,y,z") #what to put in payload , not added then everything goes in payload

Step 4: Search

results = qm.retrieve(
    query="who is batman",
    top_k=5,
    show=["description", "title"]
)
print(results)

Results are returned in a table showing similarity scores and payload fields.


Step 5: Use Filters

Build filters interactively (CLI):

qmem filter

Filters are saved under .qmem/filters/.

Apply a filter in Python:

filtered = qm.retrieve_filter(
    query="who is batman",
    filter_json=".qmem/filters/latest.json",
    top_k=5,
    show=["description", "title"]
)
print(filtered)

Step 6: Mirror Data to MongoDB

qm.mongo(
    collection_name="testing",
    mongo_db="final_test_db",
    mongo_collection="final_test",
    fields=["description", "title"]
)

This copies your Qdrant collection into MongoDB for downstream use.


CLI Commands

qmem init      # configure qmem
qmem filter    # create filters interactively
qmem index     # (only for Qdrant) build indices for faster filtering

Python API

  • qm.create() → Create a new collection
  • qm.ingest() → Add documents from a file
  • qm.retrieve() → Search documents by meaning
  • qm.retrieve_filter() → Search with filters
  • qm.mongo() → Copy a collection into MongoDB

Why use qmem?

  • Simple to set up and use
  • Works with both local (Chroma) and cloud (Qdrant) databases
  • AI embeddings understand meaning, not just exact words
  • Filters allow more precise retrieval
  • MongoDB integration enables analytics pipelines

Example Workflow

  1. qmem init → configure backend and embeddings
  2. qm.create(...) → create a collection
  3. qm.ingest(...) → add data from JSONL
  4. qm.retrieve(...) → search data by meaning
  5. qmem filter + qm.retrieve_filter(...) → apply filters
  6. qm.mongo(...) → mirror to MongoDB if needed

qmem makes it easy to:

  • Store your documents
  • Search them intelligently
  • Filter results by properties
  • Extend usage with MongoDB integration

Searching for “car” may also return results for “automobile” or “vehicle.”

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qmem-0.1.9.tar.gz (30.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

qmem-0.1.9-py3-none-any.whl (32.4 kB view details)

Uploaded Python 3

File details

Details for the file qmem-0.1.9.tar.gz.

File metadata

  • Download URL: qmem-0.1.9.tar.gz
  • Upload date:
  • Size: 30.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for qmem-0.1.9.tar.gz
Algorithm Hash digest
SHA256 d56331468092b2100484e0974e92dbcba4c81b620ecb1e282d7eada7e0f10551
MD5 bd89be7bc5dcd7a7110c329a0a340092
BLAKE2b-256 b98d5fd648b1dd809d104de3dc2218b6aa86596d474a695628721cf5c3dd9991

See more details on using hashes here.

File details

Details for the file qmem-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: qmem-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 32.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for qmem-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 821e571c64c24a654f414f59c73242bbfa24c45c41e4d6326e9358ab3607bd19
MD5 4add51067ec2a8f543f94c3614aa8c33
BLAKE2b-256 d36843252abb2b3d4cb97531e53dcba85090ceb81d7e2f233374b1da81647876

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page