Skip to main content

PatANN is a massively parallel, distributed, and scalable in-memory/on-disk vector database library for efficient nearest neighbor search across large-scale datasets by finding vector patterns.

Project description

PatANN

Overview

PatANN is a massively parallel, distributed, and scalable in-memory/on-disk vector database library for efficient nearest neighbor search across large-scale datasets by finding vector patterns.

PatANN leverages patterns for data partitioning like Google ScANN, implements disk-based I/O similar to DiskANN, and employs search techniques akin to HNSWlib, resulting in an algorithm that synthesizes the best features to outperform existing approaches.

Status

Beta Version: Currently uploaded for benchmarking purposes. Complete documentation and updates are under development. Not for production use yet.

Platforms

Beta Version: Restricted to Linux to prevent premature circulation of beta version Production Releases (late Feb 2024)*: Will support all platforms that are supported by mesibo

Key Features

  • Faster Index building and Searching
  • Supports both in-memory and on-disk operations
  • Dynamic sharding and load balancing across servers
  • Advanced search, filtering and pagination
  • Unlimited scalability without pre-specified capacity

Algorithmic Approach

  • Combines NSW (Navigable Small World) graph with a novel pattern based partitioning algorithm
  • Preliminary results show phenomenal performance in building index and searching
  • Potential slight variations in lower-end matching

Contributions Welcome

We are seeking help to:

  • Run additional datasets
  • Validate and improve the algorithm

Contact

For queries, please contact: support@mesibo.com

Disclaimer

Results may vary. Detailed research paper forthcoming.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

patann-0.0.4.tar.gz (582.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

patann-0.0.4-py3-none-any.whl (587.2 kB view details)

Uploaded Python 3

File details

Details for the file patann-0.0.4.tar.gz.

File metadata

  • Download URL: patann-0.0.4.tar.gz
  • Upload date:
  • Size: 582.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for patann-0.0.4.tar.gz
Algorithm Hash digest
SHA256 979f2b0a82a24721896c471173a9292151b54668aac3bb62df4c910e592b6677
MD5 ad9841cd803b26d12082f0bf8a305548
BLAKE2b-256 f98a811718f73bfb50eb08b2220b176179e44e6c484a1b14257f6a64cd685fdb

See more details on using hashes here.

File details

Details for the file patann-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: patann-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 587.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for patann-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 9529230e09639e17941ca7d904e8ed60f30b5130ff3e650832942154fd3ba987
MD5 51640e71681d56d3b2a42664fa6629b7
BLAKE2b-256 f84efaab7bfcf815c6f876267be306c4def1d0c78ee639a3297971c4a00b8dcb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page