PatANN is a massively parallel, distributed, and scalable in-memory/on-disk vector database library for efficient nearest neighbor search across large-scale datasets by finding vector patterns.
Project description
PatANN - Pattern-Aware Vector Database / ANN
Overview
PatANN is a pattern-aware, massively parallel, and distributed vector search framework designed for scalable and efficient nearest neighbor search, operating both in-memory and on-disk. Unlike conventional algorithms, PatANN leverages macro and micro patterns within vectors to drastically reduce search space before performing costly distance computations.
Refer to the website for technical details, algorithm overview, key innovations, benchmarks, and tutorials.
Website: https://patann.dev
While still in beta, PatANN's pattern-first approach delivers unprecedented performance advantages. As shown in our benchmarks (Figure 1), PatANN consistently outperforms leading ANN libraries including HNSW (hnswlib), Google ScaNN, Facebook FAISS variants, and others in the critical recall-throughput tradeoff.
Installation
Install PatANN using pip:
pip install patann
Usage
PatANN provides Python API as well as a command-line interface to download example code.
Command Line Interface
patann --help
Output:
PatANN Version: 0.1.0
Python Version: 3.11.5
PatANN - https://github.com/mesibo/patann
usage: patann [-h] [--help-examples] [--list-examples] [--copy-examples [DEST]]
PatANN: Pattern-Aware Approximate Nearest Neighbors Search
options:
-h, --help show this help message and exit
--help-examples Show information about examples
--list-examples List available examples
--copy-examples [DEST]
Copy examples to specified directory (default: current directory)
Examples
PatANN comes with several example scripts to help you get started:
patann --help-examples
This will display information about the included examples:
PatANN Version: 0.1.0
Python Version: 3.11.5
PatANN - https://github.com/mesibo/patann
PatANN: Efficient Approximate Nearest Neighbors Search
Example Helper Functions:
-----------------------
patann.help() - Display this help message
patann.list_examples() - List all available examples with descriptions
patann.get_examples_dir() - Get the directory path where examples are located
patann.copy_examples() - Copy all example files to your current directory
Available Examples:
-----------------
patann_sync_example.py:
- Demonstrates synchronous (blocking) usage of PatANN
- Shows how to create an index, add vectors, and perform queries in a sequential manner
- Suitable for simple applications where asynchronous operation is not needed
patann_async_example.py:
- Demonstrates basic asynchronous usage with a single query
- Shows how to set up callbacks for index building and query results
- Good starting point for understanding PatANN's asynchronous operation
patann_async_parallel_example.py:
- Demonstrates advanced usage with multiple parallel queries
- Shows how to handle multiple unique query vectors simultaneously
- Suitable for high-throughput applications needing parallel processing
To run an example after copying:
$ python patann_sync_example.py
For more information, visit: https://github.com/mesibo/patann
To list all available examples:
patann --list-examples
Output:
PatANN Version: 0.1.0
Python Version: 3.11.5
PatANN - https://github.com/mesibo/patann
Example files located at: /usr/local/lib/python3.11/site-packages/patann/examples
Available examples:
- patann_sync_example.py: Synchronous (blocking) usage of PatANN
- patann_async_example.py: Basic asynchronous usage with a single query
- patann_async_parallel_example.py: Advanced parallel query processing with multiple vectors
To copy examples to current directory, run: patann.copy_examples()
You can copy examples to your current directory either with:
patann --copy-examples
Or from within Python:
import patann
patann.copy_examples()
```
### Python API
PatANN supports both synchronous (blocking) and asynchronous (non-blocking) operation modes:
1. **Synchronous Mode**: Simple sequential operations, ideal for straightforward applications
2. **Asynchronous Mode**: High-throughput parallel operations with callbacks, ideal for production systems
## Platforms
- Linux
- macOS (Apple Silicon)
- Windows
- Android
- iOS
## Code Examples
Code examples for all platforms (Python, Kotlin, Java, Objective-C, Swift) in both asynchronous and synchronous modes are available at https://github.com/mesibo/patann
## Key Distinguishing Features
- Novel pattern-based probing for ANN search
- In-Memory, On-Disk, and Hybrid Index
- Fully asynchronous operations with built-in support for conventional synchronous execution
- Refined search, filtering, and pagination algorithms
- Unlimited scalability without pre-specified capacity
- Dynamic sharding to load balance across servers
- Cloud (in-progress) and Serverless
- SIMD-Accelerated for both x86_64 (SSE*, AVX2, AVX-512) and ARM (NEON, SVE) platforms
- OS-optimized I/O—huge pages (Linux), large pages (Windows), and super pages (macOS)
- NUMA-aware architecture
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file patann-0.1.0.tar.gz.
File metadata
- Download URL: patann-0.1.0.tar.gz
- Upload date:
- Size: 4.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4863f1ec87e132f500595d39417414127d62e7315cdb6ceaa98c4b148bdef118
|
|
| MD5 |
c9b86add9bb735e1653d0cb7c94990cd
|
|
| BLAKE2b-256 |
1947343004ccf581fcc375ade44a60b753ed67e38ec5ed86f99bce3ca25bec7d
|
File details
Details for the file patann-0.1.0-py3-none-any.whl.
File metadata
- Download URL: patann-0.1.0-py3-none-any.whl
- Upload date:
- Size: 4.7 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
772dfb6da68c03956a7d9b3a540751a5524671fe1b47213968729dbc94bb8e00
|
|
| MD5 |
663cc1cfbeecc9f1646510e9670b963e
|
|
| BLAKE2b-256 |
2ec7802bf982fcd8e8553c7665dfa5de0cdb15a6c097041278178889a622a256
|