Redactive Python SDK
Project description
Redactive Python SDK
The Redactive Python SDK provides a robust and intuitive interface for interacting with the Redactive platform, enabling developers to seamlessly integrate powerful data redaction and anonymization capabilities into their Python applications.
Installation
In order to use the package to integrate with Redactive.ai, run:
pip install --upgrade redactive
There is no need to clone this repository.
If you would like to modify this package, clone the repo and install from source:
python -m pip install .
Requirements
- Python 3.11+
Usage
The library has the following components:
- AuthClient - provides functionality to interact with data sources
- SearchClient - provides functionality to search chunks with Redactive search service
- MultiUserClient - provides functionality manage multi-user search with Redactive search service
- RerankingSearchClient [Experimental] - retrieves extra results, then re-ranks them using a more precise ranking function, returning the top_k results
AuthClient
AuthClient needs to be configured with your account's API key which is available in the Apps page at Redactive Dashboard.
from redactive.auth_client import AuthClient
client = AuthClient(api_key="API-KEY")
# Establish an connection to data source
# Possible data sources: confluence, google-drive, jira, zendesk, slack, sharepoint
redirect_uri = "https://url-debugger.vercel.app"
sign_in_url = await client.begin_connection(
provider="confluence", redirect_uri=redirect_uri
)
# Navigate User to sign_in_url
# User will receive an oauth2 auth code after consenting the app's data source access permissions.
# Use this code to exchange Redactive access_token with Redactive API
response = await client.exchange_tokens(code="OAUTH2-TOKEN")
SearchClient
With the Redactive access_token, you can perform three types of searches using the Redactive Search service:
- Semantic Query Search: Retrieve relevant chunks of information that are semantically related to a user query.
- URL-based Search: Obtain all the chunks from a document by specifying its URL.
- Document Name Search: Query for all the chunks from a document based on the name of the document.
from redactive.search_client import SearchClient
client = SearchClient()
# Semantic Search: retrieve text extracts (chunks) from various documents pertaining to the user query
client.query_chunks(
access_token="REDACTIVE-USER-ACCESS-TOKEN",
semantic_query="Tell me about AI"
)
# URL-based Search: retrieve all chunks of the document at that URL
client.get_chunks_by_url(
access_token="REDACTIVE-USER-ACCESS-TOKEN",
url="https://example.com/document"
)
# Document Name Search: retrieve all chunks of a document identified by its name
client.query_chunks_by_document_name(
access_token="REDACTIVE-USER-ACCESS-TOKEN",
document_name="Project Plan"
)
Filters
Query methods, i.e. query_chunks
, query_chunks_by_document_name
, support a set of optional filters. The filters are applied in a logical 'AND' operation. If a data source provider does not support a filter-type, then no results from that provider are returned.
from datetime import datetime, timedelta
from redactive.search_client import SearchClient
from redactive.grpc.v1 import Filters
client = SearchClient()
# Query chunks from Confluence only, that are from documents created before last week, modified since last week,
# and that are from documents associated with a user's email. Include chunks from trashed documents.
last_week = datetime.now() - timedelta(weeks=1)
filters = Filters().from_dict({
"scope": ["confluence"],
"created": {
"before": last_week,
},
"modified": {
"after": last_week,
},
"userEmails": ["myEmail@example.com"],
"includeContentInTrash": True,
})
client.query_chunks(
access_token="REDACTIVE-USER-ACCESS-TOKEN",
semantic_query="Tell me about AI",
filters=filters
)
Multi-User Client
The MultiUserClient
class helps manage multiple users' authentication and access to the Redactive search service.
from redactive.multi_user_client import MultiUserClient
multi_user_client = MultiUserClient(
api_key="REDACTIVE-API-KEY",
callback_uri="https://example.com/callback/",
read_user_data=...,
write_user_data=...,
)
# Present `connection_url` in browser for user to interact with:
user_id = ...
connection_url = await multi_user_client.get_begin_connection_url(user_id=user_id, provider="confluence")
# On user return from OAuth connection flow:
sign_in_code, state = ..., ... # from URL query parameters
is_connection_successful = await multi_user_client.handle_connection_callback(
user_id=user_id,
sign_in_code=sign_in_code,
state=state
)
# User can now use Redactive search service via `MultiUserClient`'s other methods:
semantic_query = "Tell me about the missing research vessel, the Borealis"
chunks = await multi_user_client.query_chunks(user_id=user_id, semantic_query=semantic_query)
Development
The Python SDK code can be found thesdks/python
directory in Redactive Github Repository.
In order to comply with the repository style guide, we recommend running the following tools.
To format your code, run:
hatch fmt
To check type, run:
hatch run types:check
To test changes, run:
hatch test
To build Python SDK, run:
hatch build
To install local version, run:
python -m pip install -e .
Contribution Guide
Please check here
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file redactive-1.4.9.tar.gz
.
File metadata
- Download URL: redactive-1.4.9.tar.gz
- Upload date:
- Size: 14.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.0 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 64855f51c3fdc6e41cb92f6d554c37dbee3d97f7d2894618013c712df448940a |
|
MD5 | b547f17a2a0553e96079bbcbae8625d3 |
|
BLAKE2b-256 | ce776772cb6dc72ac1a62a16d4f4a79e5729158aaba5b2781830b4b1b1673d53 |
File details
Details for the file redactive-1.4.9-py3-none-any.whl
.
File metadata
- Download URL: redactive-1.4.9-py3-none-any.whl
- Upload date:
- Size: 13.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.0 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9ac622ee4b881288ece53de9dcb69f87aded5d79d15860d76e6c2578f749ef26 |
|
MD5 | 484fc484af8382e0a29233c333cdeae3 |
|
BLAKE2b-256 | b03a51b8767ed7aa55a5fa5e55df38de9751f2c5ce7a4480f27203640bea1c68 |