Skip to main content

Retrieval provider bundle for NLWeb - includes all vector database and search providers

Project description

nlweb-retrieval

Bundle package containing all retrieval providers for NLWeb.

Included Providers

This package includes clients for:

  • Azure AI Search - Azure's vector search service with managed identity support
  • Elasticsearch - Popular open-source search and analytics engine
  • Milvus - Open-source vector database
  • Qdrant - Vector similarity search engine
  • PostgreSQL (pgvector) - PostgreSQL with vector extension
  • OpenSearch - Open-source search and analytics suite
  • Snowflake Cortex Search - Snowflake's vector search
  • Shopify MCP - Shopify Model Context Protocol
  • Cloudflare AutoRAG - Cloudflare's RAG service
  • Bing Search - Microsoft Bing web search

Installation

pip install nlweb-core nlweb-retrieval

Configuration Example

Create config.yaml:

retrieval:
  provider: elasticsearch
  import_path: nlweb_retrieval.elasticsearch_client
  class_name: ElasticsearchClient
  api_endpoint_env: ELASTICSEARCH_URL
  index_name: my_index
  use_knn: true

Azure AI Search Example

retrieval:
  provider: azure_ai_search
  import_path: nlweb_retrieval.azure_search_client
  class_name: AzureSearchClient
  api_endpoint_env: AZURE_SEARCH_ENDPOINT
  auth_method: azure_ad  # or api_key
  index_name: my-search-index

Qdrant Example

retrieval:
  provider: qdrant
  import_path: nlweb_retrieval.qdrant
  class_name: QdrantVectorClient
  database_path: ./data/qdrant_db  # for local storage
  # OR for remote:
  # api_endpoint_env: QDRANT_URL
  # api_key_env: QDRANT_API_KEY

Usage

import nlweb_core

# Initialize with config
nlweb_core.init(config_path="./config.yaml")

# Use retrieval
from nlweb_core import retriever

# Search
results = await retriever.search(
    query="example query",
    site="example.com",
    num_results=10
)

Provider Import Paths

Use these in your config file:

Provider import_path class_name
Azure AI Search nlweb_retrieval.azure_search_client AzureSearchClient
Elasticsearch nlweb_retrieval.elasticsearch_client ElasticsearchClient
Milvus nlweb_retrieval.milvus_client MilvusVectorClient
Qdrant nlweb_retrieval.qdrant QdrantVectorClient
PostgreSQL nlweb_retrieval.postgres_client PgVectorClient
OpenSearch nlweb_retrieval.opensearch_client OpenSearchClient
Snowflake nlweb_retrieval.snowflake_client SnowflakeCortexSearchClient
Shopify MCP nlweb_retrieval.shopify_mcp ShopifyMCPClient
Cloudflare nlweb_retrieval.cf_autorag_client CloudflareAutoRAGClient
Bing Search nlweb_retrieval.bing_search_client BingSearchClient

License

MIT License - Copyright (c) 2025 Microsoft Corporation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nlweb_retrieval-0.5.3.tar.gz (30.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nlweb_retrieval-0.5.3-py3-none-any.whl (38.7 kB view details)

Uploaded Python 3

File details

Details for the file nlweb_retrieval-0.5.3.tar.gz.

File metadata

  • Download URL: nlweb_retrieval-0.5.3.tar.gz
  • Upload date:
  • Size: 30.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for nlweb_retrieval-0.5.3.tar.gz
Algorithm Hash digest
SHA256 8b52f2ee2fbdd2f6d37702c9a79d1493cdbcfb415e212af97239831042e2a0bf
MD5 56bb04baff47188b577b390d9085579c
BLAKE2b-256 5c2d37746847d3e9ebd2a9ff5d96cf3caf190075327241316d3f607d678711c0

See more details on using hashes here.

File details

Details for the file nlweb_retrieval-0.5.3-py3-none-any.whl.

File metadata

File hashes

Hashes for nlweb_retrieval-0.5.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6ec7470cc1708c855c611a7c8d12a9147f239b84a64a779bb5292bb91974976d
MD5 34fc9dfef623c9bae4edb0624e2ede8b
BLAKE2b-256 72d25041cffc7ab1914198e8fda1d118b45614c788f08c0b09bc8dcde6bb615b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page