
Benchmark suite for vector databases with Redis support. Forked from the original vector-db-benchmark project.

Project description

vector-db-benchmark

A comprehensive benchmarking tool for vector databases, including Redis (both RediSearch and Vector Sets), Weaviate, Milvus, Qdrant, OpenSearch, Postgres, and others.

With a single CLI one-liner you can get this and much more:

docker run --rm --network=host redis/vector-db-benchmark:latest run.py --host localhost --engines vectorsets-fp32-default --datasets glove-100-angular --parallels 100
(...)
================================================================================
BENCHMARK RESULTS SUMMARY
Experiment: vectorsets-fp32-default - glove-100-angular
================================================================================

Precision vs Performance Trade-off:
--------------------------------------------------
Precision  QPS      P50 (ms)   P95 (ms)  
--------------------------------------------------
0.86       1408.3   61.877     107.548   
0.80       2136.3   38.722     69.102    
0.72       2954.3   25.820     48.072    
0.68       3566.5   20.229     38.581    

QPS vs Precision Trade-off - vectorsets-fp32-default - glove-100-angular (up and to the right is better):

  3566 │●                                                           
       │             ●                                              
       │                                                            
  2594 │                                                            
       │                                       ●                    
       │                                                            
  1621 │                                                           ●
       │                                                            
       │                                                            
   648 │                                                            
       │                                                            
       │                                                            
     0 │                                                            
       └────────────────────────────────────────────────────────────
        0.680          0.726          0.772          0.817          
        Precision (0.0 = 0%, 1.0 = 100%)
================================================================================


There are many vector search engines available, each offering a different feature set and level of efficiency. But how do we measure performance? There is no single definition: in one use case you may care deeply about one aspect while paying little attention to others. This project is a general framework for benchmarking different engines under the same hardware constraints, so you can choose what works best for you.

Running any benchmark requires choosing an engine, a dataset, and a scenario against which it should be tested. A scenario defines, for example, whether the server runs in single-node or distributed mode, which client implementation is used, and how many client instances run in parallel.

Quick Start

Quick Start with Docker

The easiest way to run vector-db-benchmark is using Docker. We provide pre-built images on Docker Hub.

# Pull the latest image
docker pull redis/vector-db-benchmark:latest

# Run with help
docker run --rm redis/vector-db-benchmark:latest run.py --help

# Check which datasets are available
docker run --rm redis/vector-db-benchmark:latest run.py --describe datasets

# Basic Redis benchmark with local Redis
docker run --rm -v $(pwd)/results:/app/results --network=host \
  redis/vector-db-benchmark:latest \
  run.py --host localhost --engines redis-default-simple --datasets glove-25-angular

# At the end of the run, you will find the results in the `results` directory.
# Let's open the summary file and look at the precision summary:

$ jq ".precision_summary" results/*-summary.json
{
  "0.91": {
    "qps": 1924.5,
    "p50": 49.828,
    "p95": 58.427
  },
  "0.94": {
    "qps": 1819.9,
    "p50": 51.68,
    "p95": 66.83
  },
  "0.9775": {
    "qps": 1477.8,
    "p50": 65.368,
    "p95": 73.849
  },
  "0.9950": {
    "qps": 1019.8,
    "p50": 95.115,
    "p95": 106.73
  }
}
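
The precision summary can also be consumed programmatically. A minimal Python sketch: the `summary` dict below mirrors the jq output above, and `best_qps_at` is a hypothetical helper (not part of the tool) that picks the fastest configuration meeting a precision floor.

```python
# Sketch: pick the fastest search configuration that still meets a
# precision floor, from a *-summary.json "precision_summary" section.
summary = {
    "precision_summary": {
        "0.91":   {"qps": 1924.5, "p50": 49.828, "p95": 58.427},
        "0.94":   {"qps": 1819.9, "p50": 51.68,  "p95": 66.83},
        "0.9775": {"qps": 1477.8, "p50": 65.368, "p95": 73.849},
        "0.9950": {"qps": 1019.8, "p50": 95.115, "p95": 106.73},
    }
}

def best_qps_at(summary, min_precision):
    """Return (precision, qps) of the fastest config with precision >= floor."""
    candidates = [
        (float(p), stats["qps"])
        for p, stats in summary["precision_summary"].items()
        if float(p) >= min_precision
    ]
    # Among sufficiently precise configs, take the one with the highest QPS.
    return max(candidates, key=lambda pair: pair[1])

print(best_qps_at(summary, 0.93))  # fastest config with precision >= 0.93
```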

Using with Redis

For testing with Redis, start a Redis container first:

# Start Redis container
docker run -d --name redis-test -p 6379:6379 redis:8.2-bookworm

# Run benchmark against Redis

docker run --rm -v $(pwd)/results:/app/results --network=host \
  redis/vector-db-benchmark:latest \
  run.py --host localhost --engines redis-default-simple --datasets random-100

# Or use the convenience script
./docker-run.sh -H localhost -e redis-default-simple -d random-100


# Clean up Redis container when done
docker stop redis-test && docker rm redis-test

Available Docker Images

  • Latest: redis/vector-db-benchmark:latest

For detailed Docker setup and publishing information, see DOCKER_SETUP.md.

Data sets

We have a number of precomputed data sets. All data sets have been pre-split into train/test and include ground truth data for the top-100 nearest neighbors.

| Dataset | Dimensions | Train size | Test size | Neighbors | Distance |
|---|---|---|---|---|---|
| **LAION Image Embeddings (512D)** | | | | | |
| LAION-1M: subset of LAION 400M English (image embeddings) | 512 | 1,000,000 | 10,000 | 100 | Cosine |
| LAION-10M: subset of LAION 400M English (image embeddings) | 512 | 10,000,000 | 10,000 | 100 | Cosine |
| LAION-20M: subset of LAION 400M English (image embeddings) | 512 | 20,000,000 | 10,000 | 100 | Cosine |
| LAION-40M: subset of LAION 400M English (image embeddings) | 512 | 40,000,000 | 10,000 | 100 | Cosine |
| LAION-100M: subset of LAION 400M English (image embeddings) | 512 | 100,000,000 | 10,000 | 100 | Cosine |
| LAION-200M: subset of LAION 400M English (image embeddings) | 512 | 200,000,000 | 10,000 | 100 | Cosine |
| LAION-400M: from LAION 400M English (image embeddings) | 512 | 400,000,000 | 10,000 | 100 | Cosine |
| **LAION Image Embeddings (768D)** | | | | | |
| LAION-1M: 768D image embeddings | 768 | 1,000,000 | 10,000 | 100 | Cosine |
| LAION-1B: 768D image embeddings | 768 | 1,000,000,000 | 10,000 | 100 | Cosine |
| **Standard Benchmarks** | | | | | |
| GloVe-25: Word vectors | 25 | 1,183,514 | 10,000 | 100 | Cosine |
| GloVe-100: Word vectors | 100 | 1,183,514 | 10,000 | 100 | Cosine |
| Deep Image-96: CNN image features | 96 | 9,990,000 | 10,000 | 100 | Cosine |
| GIST-960: Image descriptors | 960 | 1,000,000 | 1,000 | 100 | L2 |
| **Text and Knowledge Embeddings** | | | | | |
| DBpedia OpenAI-1M: Knowledge embeddings | 1,536 | 1,000,000 | 10,000 | 100 | Cosine |
| LAION Small CLIP: Small CLIP embeddings | 512 | 100,000 | 1,000 | 100 | Cosine |
| **Yandex Datasets** | | | | | |
| Yandex T2I: Text-to-image embeddings | 200 | 1,000,000 | 100,000 | 100 | Dot |
| **Random and Synthetic** | | | | | |
| Random-100: Small synthetic dataset | 100 | 100 | 9 | 9 | Cosine |
| Random-100-Euclidean: Small synthetic dataset | 100 | 100 | 9 | 9 | L2 |
| **Filtered Search Datasets** | | | | | |
| H&M-2048: Fashion product embeddings (with filters) | 2,048 | 105,542 | 2,000 | 100 | Cosine |
| H&M-2048: Fashion product embeddings (no filters) | 2,048 | 105,542 | 2,000 | 100 | Cosine |
| ArXiv-384: Academic paper embeddings (with filters) | 384 | 2,205,995 | 10,000 | 100 | Cosine |
| ArXiv-384: Academic paper embeddings (no filters) | 384 | 2,205,995 | 10,000 | 100 | Cosine |
| Random Match Keyword-100: Synthetic keyword matching (with filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Match Keyword-100: Synthetic keyword matching (no filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Match Int-100: Synthetic integer matching (with filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Match Int-100: Synthetic integer matching (no filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Range-100: Synthetic range queries (with filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Range-100: Synthetic range queries (no filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Geo Radius-100: Synthetic geo queries (with filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Geo Radius-100: Synthetic geo queries (no filters) | 100 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Match Keyword-2048: Large synthetic keyword matching (with filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Match Keyword-2048: Large synthetic keyword matching (no filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Match Int-2048: Large synthetic integer matching (with filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Match Int-2048: Large synthetic integer matching (no filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Range-2048: Large synthetic range queries (with filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Range-2048: Large synthetic range queries (no filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Geo Radius-2048: Large synthetic geo queries (with filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Geo Radius-2048: Large synthetic geo queries (no filters) | 2,048 | 100,000 | 1,000 | 100 | Cosine |
| Random Match Keyword Small Vocab-256: Small vocabulary keyword matching (with filters) | 256 | 1,000,000 | 10,000 | 100 | Cosine |
| Random Match Keyword Small Vocab-256: Small vocabulary keyword matching (no filters) | 256 | 1,000,000 | 10,000 | 100 | Cosine |

How to run a benchmark?

Benchmarks are implemented in a server-client mode: the server runs on one machine, and the client runs on another.

Run the server

All engines are served using docker compose. The configuration files are located in the engine/servers directory.

To launch the server instance, run the following command:

cd ./engine/servers/<engine-configuration-name>
docker compose up

Containers are expected to expose all necessary ports, so the client can connect to them.

Run the client

Install dependencies:

pip install poetry
poetry install

Run the benchmark:

# Basic usage examples
python run.py --engines redis-default-simple --datasets random-100
python run.py --engines redis-default-simple --datasets glove-25-angular
python run.py --engines "*-m-16-*" --datasets "glove-*"

# Using custom engine configurations from a JSON file
python run.py --engines-file custom_engines.json --datasets glove-25-angular

# Get information about available engines (with pattern matching)
python run.py --engines "*redis*" --describe engines --verbose

# Get information about engines from a custom file  
python run.py --engines-file custom_engines.json --describe engines --verbose

# Docker usage (recommended)
docker run --rm -v $(pwd)/results:/app/results --network=host \
  redis/vector-db-benchmark:latest \
  run.py --host localhost --engines redis-default-simple --datasets random-100

# Get help
python run.py --help

The command accepts wildcards for both engines and datasets. Results of the benchmarks are stored in the ./results/ directory.
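
The wildcard behavior is that of shell-style globs. A minimal sketch of the idea using Python's `fnmatch` (the configuration names below are illustrative; the real matching and the real name list live in run.py and experiments/configurations/):

```python
from fnmatch import fnmatch

# Hypothetical configuration names, for illustration only.
available = [
    "redis-default-simple",
    "vectorsets-fp32-default",
    "qdrant-m-16-ef-128",
    "weaviate-m-16-default",
]

def expand(patterns, names):
    """Keep every name matched by at least one shell-style pattern,
    in the same spirit as --engines / --datasets matching."""
    return [n for n in names if any(fnmatch(n, p) for p in patterns)]

print(expand(["*redis*"], available))   # matches redis-default-simple only
print(expand(["*-m-16-*"], available))  # matches both m-16 HNSW configs
```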

Using Custom Engine Configurations

The benchmark tool supports two ways to specify which engine configurations to use:

1. Pattern Matching (Default)

Use the --engines flag with wildcard patterns to select configurations from the experiments/configurations/ directory:

python run.py --engines "*redis*" --datasets glove-25-angular
python run.py --engines "qdrant-m-*" --datasets random-100

2. Custom Configuration File

Use the --engines-file flag to specify a JSON file containing custom engine configurations:

python run.py --engines-file my_engines.json --datasets glove-25-angular

The JSON file should contain an array of engine configuration objects. Each configuration must have a name field and follow the same structure as configurations in experiments/configurations/:

[
  {
    "name": "my-custom-redis-config",
    "engine": "redis",
    "connection_params": {},
    "collection_params": {
      "algorithm": "hnsw",
      "data_type": "FLOAT32",
      "hnsw_config": {
        "M": 16,
        "DISTANCE_METRIC": "L2",
        "EF_CONSTRUCTION": 200
      }
    },
    "search_params": [
      {
        "parallel": 1,
        "top": 10,
        "search_params": {
          "ef": 100,
          "data_type": "FLOAT32"
        }
      }
    ],
    "upload_params": {
      "parallel": 16,
      "data_type": "FLOAT32"
    }
  }
]

Note: You cannot use both --engines and --engines-file at the same time.
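
Before launching a long run, it can help to sanity-check a custom engines file. A minimal sketch, assuming only the documented requirements (a JSON array of objects, each with a `name` field); `validate_engines` is a hypothetical helper, not part of the tool, and run.py may validate more than this:

```python
import json

def validate_engines(text):
    """Check that an --engines-file payload is a JSON array of objects,
    each carrying a unique "name" field. Returns the list of names."""
    configs = json.loads(text)
    if not isinstance(configs, list):
        raise ValueError("engines file must contain a JSON array")
    names = []
    for cfg in configs:
        name = cfg.get("name")
        if not name:
            raise ValueError("every engine configuration needs a 'name' field")
        names.append(name)
    if len(names) != len(set(names)):
        raise ValueError("duplicate engine configuration names")
    return names

print(validate_engines('[{"name": "my-custom-redis-config", "engine": "redis"}]'))
```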

How to update benchmark parameters?

Each engine has a configuration file, which is used to define the parameters for the benchmark. Configuration files are located in the experiments/configurations/ directory.

Each step of the benchmark process uses a dedicated section of the configuration:

  • connection_params - passed to the client during the connection phase.
  • collection_params - used to create the collection; indexing parameters are usually defined here.
  • upload_params - used to upload the data to the server.
  • search_params - passed to the client during the search phase. The framework allows multiple search configurations within the same experiment run.

Exact values of the parameters are individual for each engine.

How to register a dataset?

Datasets are configured in the datasets/datasets.json file. The framework will automatically download each dataset and store it in the datasets directory.
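
For illustration, an entry might look like the sketch below. The field names here are assumptions, not the verified schema, so check the existing entries in datasets/datasets.json for the exact structure:

```json
{
  "name": "my-custom-dataset",
  "vector_size": 100,
  "distance": "cosine"
}
```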

How to implement a new engine?

There are a few base classes that you can use to implement a new engine.

  • BaseConfigurator - defines methods to create collections and set up indexing parameters.
  • BaseUploader - defines methods to upload the data to the server.
  • BaseSearcher - defines methods to search the data.

See the examples in the clients directory.

Once all the necessary classes are implemented, you can register the engine in the ClientFactory.
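
A hypothetical skeleton of the three classes is sketched below. The class responsibilities come from the list above, but the constructor and method names are illustrative placeholders; mirror the actual base-class signatures from the clients directory when implementing.

```python
# Sketch of a new engine client. Method names are placeholders, not the
# verified BaseConfigurator/BaseUploader/BaseSearcher interfaces.
class MyEngineConfigurator:
    """Creates the collection and sets up indexing parameters."""
    def __init__(self, connection_params, collection_params):
        self.connection_params = connection_params
        self.collection_params = collection_params

class MyEngineUploader:
    """Uploads vectors (and optional payloads) to the server."""
    def upload_batch(self, ids, vectors, metadata):
        raise NotImplementedError("connect to the engine and insert the batch")

class MyEngineSearcher:
    """Runs vector queries and returns (id, score) pairs."""
    def search(self, vector, top):
        raise NotImplementedError("query the engine for the top-k neighbors")

# Once implemented, register the trio with the ClientFactory so run.py
# can select the engine by name.
```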

Project details


Download files

Source Distribution

vector_benchmark-0.1.0.tar.gz (166.7 kB)

  • Uploaded via: poetry/2.3.2 CPython/3.10.19 Linux/6.11.0-1018-azure
  • SHA256: 516e42965c4a0cb5ab200cec4b39d991b190bd906f18948d85e927d8295d62e0
  • MD5: 29cd756951ada60966c59cf8f8543ab6
  • BLAKE2b-256: 2c4c7d0b5c5bc6a7036cbcf406d09c016d6003099bae31da9944de32f21ebcd9

Built Distribution

vector_benchmark-0.1.0-py3-none-any.whl (87.8 kB, Python 3)

  • Uploaded via: poetry/2.3.2 CPython/3.10.19 Linux/6.11.0-1018-azure
  • SHA256: 7684f701b1d863ca1910d92f0eed4ef3d0dd27fa8e1f30cecd0ce679811962f0
  • MD5: 52c008d2ee5e3dd818cbf2773ceb25a4
  • BLAKE2b-256: 065dbbeff5810dcb7793f8db15892731c2d2eed382076b9901fbae705c9d643f
