Tea: A High-level Language and Runtime System to Automate Statistical Analysis

tea-lang

[WIP] Tea: A High-level Language and Runtime System for Automating Statistical Analyses

What is Tea?

Tea is a domain-specific programming language that automates statistical test selection and execution. Tea is currently written in Python, for use in Python.

Tea is described in an academic research paper (see "How can I cite Tea?").

Users provide five pieces of information:

  • the dataset of interest,
  • the variables in the dataset they want to analyze,
  • the study design (e.g., independent, dependent variables),
  • the assumptions they make about the data based on domain knowledge (e.g., a variable is normally distributed), and
  • a hypothesis.

Tea then "compiles" these into logical constraints to select valid statistical tests. A test is considered valid if and only if all the assumptions it makes about the data (e.g., normal distribution, equal variance between groups) hold. Finally, Tea executes the valid tests.
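The selection rule above can be illustrated with a toy sketch in plain Python. It does not use Tea's API or its constraint solver. The test names, the simplified assumption labels, and the `select_valid_tests` helper are all hypothetical, chosen only to show the "valid if and only if every assumption holds" idea.

```python
# Toy illustration of "a test is valid iff all of its assumptions hold".
# NOT Tea's implementation: the names and assumption labels below are
# hypothetical and deliberately simplified.

# Each candidate test maps to the set of assumptions it makes about the data.
TEST_ASSUMPTIONS = {
    "students_t": {
        "two_groups",
        "independent_observations",
        "normal_distribution",
        "equal_variance",
    },
    "welchs_t": {
        "two_groups",
        "independent_observations",
        "normal_distribution",
    },
    "mann_whitney_u": {
        "two_groups",
        "independent_observations",
    },
}


def select_valid_tests(properties_that_hold):
    """Return the sorted names of tests whose assumptions are all satisfied."""
    return sorted(
        name
        for name, required in TEST_ASSUMPTIONS.items()
        if required <= properties_that_hold  # subset check: every assumption holds
    )


# Properties assumed by the user or verified on the data (hypothetical).
holds = {"two_groups", "independent_observations", "normal_distribution"}
valid = select_valid_tests(holds)
print(valid)
```

Here equal variance does not hold, so the sketch rules out Student's t-test and keeps the two tests that do not assume it.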

What kinds of statistical analyses are possible with Tea?

Tea currently provides a module to conduct Null Hypothesis Significance Testing (NHST).

We are actively working on expanding the kinds of analyses Tea can support. Some ideas we have: linear modeling and Bayesian inference.

How can I use Tea?

Tea is available on pip!

pip install tealang

See community examples here. If you have trouble using Tea with your use case, feel free to open an issue, and we'll try to help.

For a more guided and thorough introduction, step through the documentation and a worked example.

How can I cite Tea?

For now, please cite:

@article{JunEtAl2019:Tea,
  title={Tea: A High-level Language and Runtime System for Automating Statistical Analysis},
  author={Jun, Eunice and Daum, Maureen and Roesch, Jared and Chasins, Sarah E. and Berger, Emery D. and Just, Rene and Reinecke, Katharina},
  journal={Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology (UIST)},
  year={2019}
}

How reliable is Tea?

Tea is currently a research prototype. Our constraint solver is based on statistical texts (see our paper for more info).

If you find any bugs, please let us know (email Eunice at emjun [at] cs.washington.edu)!

I want to collaborate! Where do I begin?

This is great! We're excited to have new collaborators. :)

To contribute code, please see the docs and guidelines, then open an issue or pull request.

If you want to use Tea for a project, talk about Tea's design, or anything else, please get in touch: emjun [at] cs.washington.edu!

Where can I learn more about Tea?

Please find more information at our website.

I have ideas. I want to chat.

Please reach out! We are nice :) Email Eunice at emjun [at] cs.washington.edu!

FAQs

By the way, why Python?

Python is a common language for data science. We hope Tea can easily integrate into user workflows.

What format should my data be in?

Tea accepts data either as a CSV or a pandas DataFrame. Tea assumes data is in "long format."
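As an illustration of what long format commonly means (one row per observation, with a column naming the condition), the sketch below reshapes a small wide-format table using only the standard library. The data values, the column names (`participant`, `condition`, `score`), and the `wide_to_long` helper are made up for this example and are not part of Tea.

```python
import csv
import io

# Hypothetical wide-format data: one row per participant,
# one column per condition.
WIDE_CSV = """participant,control,treatment
p1,4.1,5.0
p2,3.8,4.6
"""


def wide_to_long(wide_text, id_column, var_name, value_name):
    """Turn each non-id column of a wide table into its own row."""
    rows = []
    for record in csv.DictReader(io.StringIO(wide_text)):
        for column, value in record.items():
            if column == id_column:
                continue  # the id is repeated on every long row instead
            rows.append(
                {
                    id_column: record[id_column],
                    var_name: column,
                    value_name: float(value),
                }
            )
    return rows


long_rows = wide_to_long(WIDE_CSV, "participant", "condition", "score")
for row in long_rows:
    print(row)
```

Each participant now appears once per condition. For data already in a pandas DataFrame, `pandas.melt` performs the same kind of reshape.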
