Skip to main content

SDK for the Sturdy Statistics API

Project description

Sturdy Stats SDK

This is the sdk for the Sturdy Statistics API. We host a series of public indicies trained on Earnings Calls, ArXiv, HackerNews, and various news streams that anyone can use for public data analysis. Uploading data requires signing up at https://sturdystatistics.com in order to create an api key.

Installation

Core API: pip install sturdy-stats-sdk

Regression Extension: pip install sturdy-stats-sdk[regression]

Resources

Explore our gallery to browse visualization created by the sturdy-stats-sdk. Follow along with our quickstart to hit the ground running or browse our advanced examples to perform rigorous analyses.

Technical Features

Automatic Structuring of Unstructured Text Data
Convert unstructured documents into structured formats, allowing seamless analysis alongside traditional tabular data. Learn More >
Explainable Text Classification
Gain clear insights into how text data is categorized, while enhancing transparency and trust in your analyses. Learn More >
Effective with Small Datasets
Achieve meaningful results even with limited data, making our solutions accessible to organizations of all sizes. Learn More >
Powerful Search Capabilities
Leverage our robust search API to retrieve and analyze specific information within your unstructured data. Learn More >
Comprehensive Data Lake
Store and analyze all your data — structured and unstructured — in one place, facilitating holistic insights. Learn More >

Quickstart

Explore Your Data

from sturdystats import Index, Job
import plotly.express as px

index = Index(id="index_99051ff1489844878fd792784d7baa90")
topic_df = index.topicSearch()
fig = px.sunburst(
    topic_df, 
    path=["topic_group_short_title", "short_title"],
    values="prevalence", 
    hover_data=["topic_id"]
)

Run SQL queries against your unstructured ata

topic_id = 12
df = pd.DataFrame(index.queryMeta(f"""
SELECT
    quarter,
    sum(sparse_list_extract({topic_id+1}, sum_topic_counts_inds, sum_topic_counts_vals)) as n_occurences
FROM doc 
GROUP BY quarter 
ORDER BY quarter""") )

Create a Index from scratch

from sturdystats import Index, Job
import pandas as pd

df = pd.read_parquet('data.parquet')
index = Index(API_key="XXX", name='tech_earnings_calls_2024')

res = index.upload(df.to_dict("records"))
job = index.train(params=dict(), fast=True, wait=True)

Train robust linear models.

pip install sturdy-stats-sdk[regression]

from sturdystats.model import LinearRegressor 
import arviz as az

model = LinearRegression(API_key=API_KEY)
model.sample(X, Y) 
az.plot_trace(model.inference_data)

Detect mislabelled datapoints.

from sturdystats.model import SturdyLogisticRegressor
import arviz as az

model = SturdyLogisticRegressor(API_key=API_KEY)
model.sample(X, Y) 
az.plot_trace(model.inference_data)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sturdy_stats_sdk-1.0.49.tar.gz (13.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sturdy_stats_sdk-1.0.49-py3-none-any.whl (14.9 kB view details)

Uploaded Python 3

File details

Details for the file sturdy_stats_sdk-1.0.49.tar.gz.

File metadata

  • Download URL: sturdy_stats_sdk-1.0.49.tar.gz
  • Upload date:
  • Size: 13.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for sturdy_stats_sdk-1.0.49.tar.gz
Algorithm Hash digest
SHA256 dd547598c3344e8d2892a63b0ab0db91d0a1eb177514c736ad314aea62709029
MD5 bf342e1c74944113311a0bcf6f615462
BLAKE2b-256 30278390ad45e03f564a0d2308d81feb00e99dd512c98d664132a8a82fc6b5a6

See more details on using hashes here.

File details

Details for the file sturdy_stats_sdk-1.0.49-py3-none-any.whl.

File metadata

File hashes

Hashes for sturdy_stats_sdk-1.0.49-py3-none-any.whl
Algorithm Hash digest
SHA256 a84fe527002762af4392dff0c0b86c2268e414f79ba57df0f6c36538b7eab9c6
MD5 9b58064feb260a74b848d127bf880673
BLAKE2b-256 cd7308e6d6c8669053b0928ea9dd0af7597319d6ae0bcde198cac1ac96af21d6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page