No project description provided
Project description
Relevance AI - The ML Platform for Unstructured Data Analysis
🌎 80% of data in the world is unstructured in the form of text, image, audio, videos, and more.
🔥 Use Relevance to unlock the value of your unstructured data:
- ⚡ Quickly analyze unstructured data with pre-trained machine learning models in a few lines of code.
- ✨ Visualize your unstructured data. Text highlights from Named entity recognition, Word cloud from keywords, Bounding box from images.
- 📊 Create charts for both structured and unstructured.
- 🔎 Drilldown with filters and similarity search to explore and find insights.
- 🚀 Share data apps with your team.
Relevance AI also acts as a platform for:
- 🔑 Vectors, storing and querying vectors with flexible vector similarity search, that can be combined with multiple vectors, aggregates and filters.
- 🔮 ML Dataset Evaluation, for debugging dataset labels, model outputs and surfacing edge cases.
🧠 Documentation
Type | Link |
---|---|
Python API | Documentation |
Python Reference | Documentation |
Cloud Dashboard | Documentation |
🛠️ Installation
Using pip:
pip install -U relevanceai
Using conda:
conda install -c relevance relevanceai
⏩ Quickstart
Login to relevanceai
:
from relevanceai import Client
client = Client()
Prepare your documents for insertion by following the below format:
- Each document should be a dictionary
- Include a field
_id
as a primary key, otherwise it's automatically generated - Suffix vector fields with
_vector_
docs = [
{"_id": "1", "example_vector_": [0.1, 0.1, 0.1], "data": "Documentation"},
{"_id": "2", "example_vector_": [0.2, 0.2, 0.2], "data": "Best document!"},
{"_id": "3", "example_vector_": [0.3, 0.3, 0.3], "data": "document example"},
{"_id": "4", "example_vector_": [0.4, 0.4, 0.4], "data": "this is another doc"},
{"_id": "5", "example_vector_": [0.5, 0.5, 0.5], "data": "this is a doc"},
]
Insert data into a dataset
Create a dataset object with the name of the dataset you'd like to use. If it doesn't exist, it'll be created for you.
ds = client.Dataset("quickstart")
ds.insert_documents(docs)
Quick tip! Our Dataset object is compatible with common dataframes methods like
.head()
,.shape()
and.info()
.
Perform vector search
query = [
{"vector": [0.2, 0.2, 0.2], "field": "example_vector_"}
]
results = ds.search(
vector_search_query=query,
page_size=3,
)
Learn more about how to flexibly configure your vector search ->
Perform clustering
Generate clusters
clusterop = ds.cluster(vector_fields=["example_vector_"])
clusterop.list_closest()
Generate clusters with sklearn
from sklearn.cluster import AgglomerativeClustering
cluster_model = AgglomerativeClustering()
clusterop = ds.cluster(vector_fields=["example_vector_"], model=cluster_model, alias="agglomerative")
clusterop.list_closest()
Learn more about how to flexibly configure your clustering ->
🧰 Config
The config object contains the adjustable global settings for the SDK. For a description of all the settings, see here.
To view setting options, run the following:
client.config.options
The syntax for selecting an option is section.key. For example, to disable logging, run the following to modify logging.enable_logging:
client.config.set_option('logging.enable_logging', False)
To restore all options to their default, run the following:
Changing the base URL
You can change the base URL as such:
client.base_url = "https://.../latest"
🚧 Development
Getting Started
To get started with development, ensure you have pytest and mypy installed. These will help ensure typechecking and testing.
python -m pip install pytest mypy
Then run testing using:
Don't forget to set your test credentials!
export TEST_PROJECT = xxx
export TEST_API_KEY = xxx
python -m pytest
mypy relevanceai
Set up precommit
pip install precommit
pre-commit install
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file RelevanceAI-dev-3.2.22.2023.1.3.0.10.55.984477.tar.gz
.
File metadata
- Download URL: RelevanceAI-dev-3.2.22.2023.1.3.0.10.55.984477.tar.gz
- Upload date:
- Size: 301.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b23ea5ae26377d5e40bd53907152d8a08c1638156d4e1c25c83735ff41a78270 |
|
MD5 | ed9dd1b5aa65667ee181134b21fde446 |
|
BLAKE2b-256 | ba89e8e8d75cf0dc1f0e865fb0b1d58b615511d8c516f1ca63a6d25ff6521fe0 |
File details
Details for the file RelevanceAI_dev-3.2.22.2023.1.3.0.10.55.984477-py3-none-any.whl
.
File metadata
- Download URL: RelevanceAI_dev-3.2.22.2023.1.3.0.10.55.984477-py3-none-any.whl
- Upload date:
- Size: 427.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 361e50fa9862efcaab723317cf45da975383b48c4d518df986058fb2a745f1fe |
|
MD5 | 1520dd91e06fbe521b2d3b5abed4e425 |
|
BLAKE2b-256 | 1f16d5a625dc49783873010ce4d623df4636d4d7f15590954561b0060dae014f |