No project description provided
Project description
Relevance AI - The ML Platform for Unstructured Data Analysis
🌎 80% of data in the world is unstructured in the form of text, image, audio, videos, and more.
🔥 Use Relevance to unlock the value of your unstructured data:
- ⚡ Quickly analyze unstructured data with pre-trained machine learning models in a few lines of code.
- ✨ Visualize your unstructured data. Text highlights from Named entity recognition, Word cloud from keywords, Bounding box from images.
- 📊 Create charts for both structured and unstructured.
- 🔎 Drilldown with filters and similarity search to explore and find insights.
- 🚀 Share data apps with your team.
Relevance AI also acts as a platform for:
- 🔑 Vectors, storing and querying vectors with flexible vector similarity search, that can be combined with multiple vectors, aggregates and filters.
- 🔮 ML Dataset Evaluation, for debugging dataset labels, model outputs and surfacing edge cases.
🧠 Documentation
Type | Link |
---|---|
Python API | Documentation |
Python Reference | Documentation |
Cloud Dashboard | Documentation |
🛠️ Installation
Using pip:
pip install -U relevanceai
Using conda:
conda install -c relevance relevanceai
⏩ Quickstart
Login to relevanceai
:
from relevanceai import Client
client = Client()
Prepare your documents for insertion by following the below format:
- Each document should be a dictionary
- Include a field
_id
as a primary key, otherwise it's automatically generated - Suffix vector fields with
_vector_
docs = [
{"_id": "1", "example_vector_": [0.1, 0.1, 0.1], "data": "Documentation"},
{"_id": "2", "example_vector_": [0.2, 0.2, 0.2], "data": "Best document!"},
{"_id": "3", "example_vector_": [0.3, 0.3, 0.3], "data": "document example"},
{"_id": "4", "example_vector_": [0.4, 0.4, 0.4], "data": "this is another doc"},
{"_id": "5", "example_vector_": [0.5, 0.5, 0.5], "data": "this is a doc"},
]
Insert data into a dataset
Create a dataset object with the name of the dataset you'd like to use. If it doesn't exist, it'll be created for you.
ds = client.Dataset("quickstart")
ds.insert_documents(docs)
Quick tip! Our Dataset object is compatible with common dataframes methods like
.head()
,.shape()
and.info()
.
Perform vector search
query = [
{"vector": [0.2, 0.2, 0.2], "field": "example_vector_"}
]
results = ds.search(
vector_search_query=query,
page_size=3,
)
Learn more about how to flexibly configure your vector search ->
Perform clustering
Generate clusters
clusterop = ds.cluster(vector_fields=["example_vector_"])
clusterop.list_closest()
Generate clusters with sklearn
from sklearn.cluster import AgglomerativeClustering
cluster_model = AgglomerativeClustering()
clusterop = ds.cluster(vector_fields=["example_vector_"], model=cluster_model, alias="agglomerative")
clusterop.list_closest()
Learn more about how to flexibly configure your clustering ->
🧰 Config
The config object contains the adjustable global settings for the SDK. For a description of all the settings, see here.
To view setting options, run the following:
client.config.options
The syntax for selecting an option is section.key. For example, to disable logging, run the following to modify logging.enable_logging:
client.config.set_option('logging.enable_logging', False)
To restore all options to their default, run the following:
Changing the base URL
You can change the base URL as such:
client.base_url = "https://.../latest"
🚧 Development
Getting Started
To get started with development, ensure you have pytest and mypy installed. These will help ensure typechecking and testing.
python -m pip install pytest mypy
Then run testing using:
Don't forget to set your test credentials!
export TEST_PROJECT = xxx
export TEST_API_KEY = xxx
python -m pytest
mypy relevanceai
Set up precommit
pip install precommit
pre-commit install
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for RelevanceAI-dev-3.2.6.2022.9.19.0.57.37.996451.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9132f0cb8c1e58a67df008f705d4b56d19986571750977bf9c32cf88879f2c2e |
|
MD5 | 27ed2255e7f76818caa197a716628ae2 |
|
BLAKE2b-256 | 322752eb905efdd16341acd584972c1eb1cd21f91e460d6e56e7e111cfa53d6c |
Hashes for RelevanceAI_dev-3.2.6.2022.9.19.0.57.37.996451-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6ded22b46d9de4f22e211aac06a3d5679e01ff9e3b8a874c39f3f99d82966005 |
|
MD5 | 11038cf7d92ba41d2b1cdd8c9fc7207d |
|
BLAKE2b-256 | dffa009ce8e02f601f965ed273c3a618b282978d56f57aac89b6ca6e61b2c298 |