Skip to main content

PatANN is a massively parallel, distributed, and scalable in-memory/on-disk vector database library for efficient nearest neighbor search across large-scale datasets by finding vector patterns.

Project description

PatANN

Overview

PatANN is a massively parallel, distributed, and scalable in-memory/on-disk vector database library for efficient nearest neighbor search across large-scale datasets by finding vector patterns.

PatANN leverages patterns for data partitioning like Google ScANN, implements disk-based I/O similar to DiskANN, and employs search techniques akin to HNSWlib, resulting in an algorithm that synthesizes the best features to outperform existing approaches.

Status

Beta Version: Currently uploaded for benchmarking purposes. Complete documentation and updates are under development. Not for production use yet.

Key Features

  • Faster Index building and Searching
  • Supports both in-memory and on-disk operations
  • Dynamic sharding and load balancing across servers
  • Advanced search, filtering and pagination
  • Unlimited scalability without pre-specified capacity

Algorithmic Approach

  • Combines NSW (Navigable Small World) graph with a novel pattern based partitioning algorithm
  • Preliminary results show phenomenal performance in building index and searching
  • Potential slight variations in lower-end matching

Contributions Welcome

We are seeking help to:

  • Run additional datasets
  • Validate and improve the algorithm

Contact

For queries, please contact: support@mesibo.com

Disclaimer

Results may vary. Detailed research paper forthcoming.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

patann-0.0.3.tar.gz (582.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

patann-0.0.3-py3-none-any.whl (587.1 kB view details)

Uploaded Python 3

File details

Details for the file patann-0.0.3.tar.gz.

File metadata

  • Download URL: patann-0.0.3.tar.gz
  • Upload date:
  • Size: 582.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for patann-0.0.3.tar.gz
Algorithm Hash digest
SHA256 0e700ae66a302d56aecfa3466b2651de0f0da0bf59962226a9daa99276f4d79f
MD5 eca3a49d61b447f840f285c984b5682c
BLAKE2b-256 e1c5848adac12e7fbde40160d35a7aa0a3808cb314be18c101b5ae7b6c887cc9

See more details on using hashes here.

File details

Details for the file patann-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: patann-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 587.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for patann-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 b7fca15b55d5d93db4108b65b00cd2b4991e4c9198e12ff0484f041da220e036
MD5 14a44e7e26b4c38086d64a9c8fa2f742
BLAKE2b-256 074ef89d7db13172284dbb133ba810e033ce4fb40c0ec2d4b30e51fcb083ec7f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page