SDK for the Sturdy Statistics API
Project description
Sturdy Stats SDK
This is the sdk for the Sturdy Statistics API. We host a series of public indicies trained on Earnings Calls, ArXiv, HackerNews, and various news streams that anyone can use for public data analysis. Uploading data requires signing up at https://sturdystatistics.com in order to create an api key.
Installation
Core API: pip install sturdy-stats-sdk
Regression Extension: pip install sturdy-stats-sdk[regression]
Resources
Explore our gallery to browse visualization created by the sturdy-stats-sdk. Follow along with our quickstart to hit the ground running or browse our advanced examples to perform rigorous analyses.
Technical Features
- Automatic Structuring of Unstructured Text Data
- Convert unstructured documents into structured formats, allowing seamless analysis alongside traditional tabular data. Learn More >
- Explainable Text Classification
- Gain clear insights into how text data is categorized, while enhancing transparency and trust in your analyses. Learn More >
- Effective with Small Datasets
- Achieve meaningful results even with limited data, making our solutions accessible to organizations of all sizes. Learn More >
- Powerful Search Capabilities
- Leverage our robust search API to retrieve and analyze specific information within your unstructured data. Learn More >
- Comprehensive Data Lake
- Store and analyze all your data — structured and unstructured — in one place, facilitating holistic insights. Learn More >
Quickstart
Explore Your Data
from sturdystats import Index, Job
import plotly.express as px
index = Index(id="index_99051ff1489844878fd792784d7baa90")
topic_df = index.topicSearch()
fig = px.sunburst(
topic_df,
path=["topic_group_short_title", "short_title"],
values="prevalence",
hover_data=["topic_id"]
)
Run SQL queries against your unstructured ata
topic_id = 12
df = pd.DataFrame(index.queryMeta(f"""
SELECT
quarter,
sum(sparse_list_extract({topic_id+1}, sum_topic_counts_inds, sum_topic_counts_vals)) as n_occurences
FROM doc
GROUP BY quarter
ORDER BY quarter""") )
Create a Index from scratch
from sturdystats import Index, Job
import pandas as pd
df = pd.read_parquet('data.parquet')
index = Index(API_key="XXX", name='tech_earnings_calls_2024')
res = index.upload(df.to_dict("records"))
job = index.train(params=dict(), fast=True, wait=True)
Train robust linear models.
pip install sturdy-stats-sdk[regression]
from sturdystats.model import LinearRegressor
import arviz as az
model = LinearRegression(API_key=API_KEY)
model.sample(X, Y)
az.plot_trace(model.inference_data)
Detect mislabelled datapoints.
from sturdystats.model import SturdyLogisticRegressor
import arviz as az
model = SturdyLogisticRegressor(API_key=API_KEY)
model.sample(X, Y)
az.plot_trace(model.inference_data)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sturdy_stats_sdk-1.0.49.tar.gz.
File metadata
- Download URL: sturdy_stats_sdk-1.0.49.tar.gz
- Upload date:
- Size: 13.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dd547598c3344e8d2892a63b0ab0db91d0a1eb177514c736ad314aea62709029
|
|
| MD5 |
bf342e1c74944113311a0bcf6f615462
|
|
| BLAKE2b-256 |
30278390ad45e03f564a0d2308d81feb00e99dd512c98d664132a8a82fc6b5a6
|
File details
Details for the file sturdy_stats_sdk-1.0.49-py3-none-any.whl.
File metadata
- Download URL: sturdy_stats_sdk-1.0.49-py3-none-any.whl
- Upload date:
- Size: 14.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a84fe527002762af4392dff0c0b86c2268e414f79ba57df0f6c36538b7eab9c6
|
|
| MD5 |
9b58064feb260a74b848d127bf880673
|
|
| BLAKE2b-256 |
cd7308e6d6c8669053b0928ea9dd0af7597319d6ae0bcde198cac1ac96af21d6
|