Benchmark tool for Milvus geo search functionality

Project description

Milvus Geo Search Benchmark Tool

A comprehensive benchmark tool for evaluating Milvus geo search functionality with synthetic datasets.

Features

Dataset Generation: Create synthetic geospatial datasets with points and polygons
Ground Truth Calculation: Use Shapely for accurate spatial computations
Milvus Integration: Support for Milvus cloud and local instances with URI/token authentication
Performance Benchmarking: Measure query latency, throughput, and success rates
Accuracy Evaluation: Compare results against ground truth with precision/recall metrics
Comprehensive Reports: Generate detailed markdown and JSON evaluation reports
Flexible CLI: Easy-to-use command-line interface with configuration file support

Installation

Prerequisites

Python 3.12+
uv package manager

Install from source

git clone https://github.com/mmga-lab/milvus-geo-bench.git
cd milvus-geo-bench
make install

Set up environment

# Copy configuration templates
cp config.yaml.example config.yaml
cp .env.example .env

# Edit .env file with your Milvus credentials
export MILVUS_URI="https://your-instance.api.gcp-us-west1.zillizcloud.com"
export MILVUS_TOKEN="your-api-token"

Quick Start

Full Benchmark Workflow

Run the complete benchmark pipeline:

milvus-geo-bench full-run --config config.yaml

This will:

Generate synthetic datasets
Load data into Milvus
Execute benchmark queries
Evaluate results and generate reports

Individual Commands

Using Configuration File (Recommended)

All commands support the --config option to use configuration file defaults:

# 1. Generate Dataset (uses config.yaml defaults)
milvus-geo-bench --config config.yaml generate-dataset

# 2. Load Data to Milvus (uses config.yaml + environment variables)
milvus-geo-bench --config config.yaml load-data

# 3. Run Benchmark (uses config.yaml for all settings)
milvus-geo-bench --config config.yaml run-benchmark

# 4. Evaluate Results (uses config.yaml for output paths)
milvus-geo-bench --config config.yaml evaluate

Command Line Arguments (Override Config)

You can override specific config values with command line arguments:

# Override specific parameters while using config defaults
milvus-geo-bench --config config.yaml generate-dataset --num-points 50000
milvus-geo-bench --config config.yaml run-benchmark --concurrency 50
milvus-geo-bench --config config.yaml evaluate --format json

Legacy Usage (Without Config File)

# 1. Generate Dataset
milvus-geo-bench generate-dataset \
  --num-points 100000 \
  --num-queries 1000 \
  --output-dir ./data

# 2. Load Data to Milvus
milvus-geo-bench load-data \
  --uri $MILVUS_URI \
  --token $MILVUS_TOKEN \
  --collection geo_bench \
  --data-file ./data/train.parquet

# 3. Run Benchmark
milvus-geo-bench run-benchmark \
  --uri $MILVUS_URI \
  --token $MILVUS_TOKEN \
  --collection geo_bench \
  --queries ./data/test.parquet \
  --output ./data/results.parquet

# 4. Evaluate Results
milvus-geo-bench evaluate \
  --results ./data/results.parquet \
  --ground-truth ./data/ground_truth.parquet \
  --output ./reports/evaluation_report.md

Configuration

Environment Variables

MILVUS_URI=https://your-instance.api.gcp-us-west1.zillizcloud.com
MILVUS_TOKEN=your-api-token

Configuration File (config.yaml)

New: All commands now support --config for unified configuration!

# Dataset generation settings
dataset:
  num_points: 1000000          # Number of training points
  num_queries: 1000            # Number of test queries  
  output_dir: ./data           # Output directory
  bbox: [-180, -90, 180, 90]   # World coordinates [min_lon, min_lat, max_lon, max_lat]
  min_points_per_query: 100    # Minimum results per query

# Milvus connection settings
milvus:
  uri: ${MILVUS_URI}           # Environment variable substitution
  token: ${MILVUS_TOKEN}       # Environment variable substitution
  collection: geo_bench        # Collection name
  batch_size: 1000            # Batch size for data insertion
  timeout: 30                 # Connection timeout

# Benchmark execution settings
benchmark:
  timeout: 30                 # Query timeout in seconds
  warmup: 10                  # Number of warmup queries
  concurrency: 100            # Parallel query execution threads

# Output settings
output:
  results: ./data/results.parquet              # Benchmark results file
  report: ./reports/evaluation_report.md       # Evaluation report file

Benefits of using config files:

Consistent settings across all commands
Environment variable substitution (${VAR_NAME})
Override specific values with CLI arguments
Version control friendly configuration

Data Format

The tool uses Parquet format for all data storage:

Training Data (train.parquet)

Column	Type	Description
id	int64	Unique identifier
wkt	string	WKT Point geometry
vec	list[float64]	8-dimensional vector

Test Queries (test.parquet)

Column	Type	Description
query_id	int64	Query identifier
expr	string	ST_WITHIN expression
polygon_wkt	string	Query polygon WKT
center_lon	float64	Polygon center longitude
center_lat	float64	Polygon center latitude
radius	float64	Polygon radius

Ground Truth (ground_truth.parquet)

Column	Type	Description
query_id	int64	Query identifier
result_ids	list[int64]	Expected result IDs
result_count	int64	Number of expected results

Benchmark Results (results.parquet)

Column	Type	Description
query_id	int64	Query identifier
query_time_ms	float64	Query execution time
result_ids	list[int64]	Returned result IDs
result_count	int64	Number of returned results
success	bool	Query success status

Evaluation Metrics

The tool provides comprehensive accuracy and performance metrics:

Accuracy Metrics

Precision: Fraction of returned results that are correct
Recall: Fraction of correct results that are returned
F1-Score: Harmonic mean of precision and recall
Macro/Micro averages: Different aggregation methods

Performance Metrics

Query Latency: Response time statistics (mean, median, P95, P99)
Throughput: Queries per second
Success Rate: Percentage of successful queries

Report Formats

Markdown: Human-readable evaluation reports
JSON: Machine-readable metrics for integration

CLI Reference

Global Options

# Get help
milvus-geo-bench --help

# Use configuration file with verbose logging
milvus-geo-bench --verbose --config config.yaml <command>

# All commands support the --config option
milvus-geo-bench --config config.yaml <command> [options]

Commands

generate-dataset: Generate training and test datasets
load-data: Load data into Milvus collection
run-benchmark: Execute benchmark queries
evaluate: Evaluate results against ground truth
full-run: Execute complete benchmark workflow

Use --help with any command for detailed options.

Examples

Using Configuration File (Recommended)

# Small test run (override config values)
milvus-geo-bench --config config.yaml generate-dataset --num-points 10000 --num-queries 100

# Geographic region test (San Francisco Bay Area)
milvus-geo-bench --config config.yaml generate-dataset --bbox "-122.5,37.0,-121.5,38.0"

# High performance benchmark
milvus-geo-bench --config config.yaml run-benchmark --timeout 60 --warmup 50 --concurrency 200

# Custom output format
milvus-geo-bench --config config.yaml evaluate --format json --output ./reports/metrics.json

# Complete workflow with config
milvus-geo-bench --config config.yaml full-run

Legacy Examples (Without Config File)

# Small test run
milvus-geo-bench generate-dataset --num-points 10000 --num-queries 100

# Geographic region (San Francisco)
milvus-geo-bench generate-dataset --bbox "-122.5,37.0,-121.5,38.0"

# High performance test
milvus-geo-bench run-benchmark --uri $MILVUS_URI --token $MILVUS_TOKEN --timeout 60 --warmup 50

Development

Dependencies

The tool uses the following key dependencies:

pymilvus>=2.4.0: Milvus Python client (from TestPyPI)
shapely>=2.0.0: Geospatial computations
pandas>=2.0.0: Data processing
pyarrow>=14.0.0: Parquet file support
click>=8.0.0: CLI framework
ruff>=0.6.0: Code formatting and linting (dev)

Code Quality

The project uses ruff for code formatting and linting:

# Format code
make format

# Check code quality  
make check

# Install dependencies
make install

# Install with dev dependencies
make dev-install

# Clean generated files
make clean

Project Structure

src/milvus_geo_bench/
├── __init__.py          # CLI commands
├── dataset.py           # Dataset generation
├── milvus_client.py     # Milvus client wrapper
├── benchmark.py         # Benchmark execution
├── evaluator.py         # Result evaluation
└── utils.py             # Utility functions

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

[Add your license information here]

Support

For issues and questions:

Create an issue in the repository
Check the documentation and examples

Project details

Release history Release notifications | RSS feed

This version

0.1.0

Sep 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

milvus_geo_bench-0.1.0.tar.gz (87.5 kB view details)

Uploaded Sep 11, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

milvus_geo_bench-0.1.0-py3-none-any.whl (31.7 kB view details)

Uploaded Sep 11, 2025 Python 3

File details

Details for the file milvus_geo_bench-0.1.0.tar.gz.

File metadata

Download URL: milvus_geo_bench-0.1.0.tar.gz
Upload date: Sep 11, 2025
Size: 87.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.20

File hashes

Hashes for milvus_geo_bench-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`98f3e6a8d912b6808e76a0a989be02b7c5ffc3a247c269e5397350e90384b884`
MD5	`edb8b185a0207c07e36e1f900542f3a9`
BLAKE2b-256	`47eb454a2789cf3e9a3b84b20e4a2fce0be93f300de89b1c5df698f680a8feff`

See more details on using hashes here.

File details

Details for the file milvus_geo_bench-0.1.0-py3-none-any.whl.

File metadata

Download URL: milvus_geo_bench-0.1.0-py3-none-any.whl
Upload date: Sep 11, 2025
Size: 31.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.20

File hashes

Hashes for milvus_geo_bench-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5499fc7b63ccead273511994e91175f1d004c695b8d6e1c1dc946ea75256a9a7`
MD5	`dc0319a544716d279596c0014b983ecc`
BLAKE2b-256	`46afe4fc03c2ecece890f547fe91ce0ad4ab61a60b2c2d91ec32117d1c4ee296`

See more details on using hashes here.

milvus-geo-bench 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Milvus Geo Search Benchmark Tool

Features

Installation

Prerequisites

Install from source

Set up environment

Quick Start

Full Benchmark Workflow

Individual Commands

Using Configuration File (Recommended)

Command Line Arguments (Override Config)

Legacy Usage (Without Config File)

Configuration

Environment Variables

Configuration File (config.yaml)

Data Format

Training Data (train.parquet)

Test Queries (test.parquet)

Ground Truth (ground_truth.parquet)

Benchmark Results (results.parquet)

Evaluation Metrics

Accuracy Metrics

Performance Metrics

Report Formats

CLI Reference

Global Options

Commands

Examples

Using Configuration File (Recommended)

Legacy Examples (Without Config File)

Development

Dependencies

Code Quality

Project Structure

Contributing

License

Support

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes