Skip to main content

Retrieval provider bundle for NLWeb - includes all vector database and search providers

Project description

nlweb-retrieval

Bundle package containing all retrieval providers for NLWeb.

Included Providers

This package includes clients for:

  • Azure AI Search - Azure's vector search service with managed identity support
  • Elasticsearch - Popular open-source search and analytics engine
  • Milvus - Open-source vector database
  • Qdrant - Vector similarity search engine
  • PostgreSQL (pgvector) - PostgreSQL with vector extension
  • OpenSearch - Open-source search and analytics suite
  • Snowflake Cortex Search - Snowflake's vector search
  • Shopify MCP - Shopify Model Context Protocol
  • Cloudflare AutoRAG - Cloudflare's RAG service
  • Bing Search - Microsoft Bing web search

Installation

pip install nlweb-core nlweb-retrieval

Configuration Example

Create config.yaml:

retrieval:
  provider: elasticsearch
  import_path: nlweb_retrieval.elasticsearch_client
  class_name: ElasticsearchClient
  api_endpoint_env: ELASTICSEARCH_URL
  index_name: my_index
  use_knn: true

Azure AI Search Example

retrieval:
  provider: azure_ai_search
  import_path: nlweb_retrieval.azure_search_client
  class_name: AzureSearchClient
  api_endpoint_env: AZURE_SEARCH_ENDPOINT
  auth_method: azure_ad  # or api_key
  index_name: my-search-index

Qdrant Example

retrieval:
  provider: qdrant
  import_path: nlweb_retrieval.qdrant
  class_name: QdrantVectorClient
  database_path: ./data/qdrant_db  # for local storage
  # OR for remote:
  # api_endpoint_env: QDRANT_URL
  # api_key_env: QDRANT_API_KEY

Usage

import nlweb_core

# Initialize with config
nlweb_core.init(config_path="./config.yaml")

# Use retrieval
from nlweb_core import retriever

# Search
results = await retriever.search(
    query="example query",
    site="example.com",
    num_results=10
)

Provider Import Paths

Use these in your config file:

Provider import_path class_name
Azure AI Search nlweb_retrieval.azure_search_client AzureSearchClient
Elasticsearch nlweb_retrieval.elasticsearch_client ElasticsearchClient
Milvus nlweb_retrieval.milvus_client MilvusVectorClient
Qdrant nlweb_retrieval.qdrant QdrantVectorClient
PostgreSQL nlweb_retrieval.postgres_client PgVectorClient
OpenSearch nlweb_retrieval.opensearch_client OpenSearchClient
Snowflake nlweb_retrieval.snowflake_client SnowflakeCortexSearchClient
Shopify MCP nlweb_retrieval.shopify_mcp ShopifyMCPClient
Cloudflare nlweb_retrieval.cf_autorag_client CloudflareAutoRAGClient
Bing Search nlweb_retrieval.bing_search_client BingSearchClient

License

MIT License - Copyright (c) 2025 Microsoft Corporation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nlweb_retrieval-0.5.0.tar.gz (30.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nlweb_retrieval-0.5.0-py3-none-any.whl (38.7 kB view details)

Uploaded Python 3

File details

Details for the file nlweb_retrieval-0.5.0.tar.gz.

File metadata

  • Download URL: nlweb_retrieval-0.5.0.tar.gz
  • Upload date:
  • Size: 30.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for nlweb_retrieval-0.5.0.tar.gz
Algorithm Hash digest
SHA256 fc06465d126a677b5662e6be6ac25fe52c941ffa19c8d8ab7d10709b0eb5ced0
MD5 d92c9608d3c0ed52773443d8947a6a2e
BLAKE2b-256 dd4d3d95946ea51f1faaacbf19743fe9ffa740921a1f20a7d6b6866c1076fe74

See more details on using hashes here.

File details

Details for the file nlweb_retrieval-0.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for nlweb_retrieval-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f31d206c0cb165e913d11e59165748b3a31c7d1777b86f98a3b0037359b41ae3
MD5 a042da15474f235671fa084aabf966c5
BLAKE2b-256 6856fa3b2f66a542b03c54b9668a8fd41097df7963eac3fa1b87c6ca14e6e966

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page