tea-lang

Tea: A High-level Language and Runtime System for Automating Statistical Analyses

What is Tea?

Tea is a domain specific programming language that automates statistical test selection and execution. Tea is currently written in/for Python.

Tea has an academic research paper.

Users provide 5 pieces of information:

the dataset of interest,
the variables in the dataset they want to analyze,
the study design (e.g., independent, dependent variables),
the assumptions they make about the data based on domain knowledge(e.g., a variable is normally distributed), and
a hypothesis.

Tea then "compiles" these into logical constraints to select valid statistical tests. Tests are considered valid if and only if all the assumptions they make about the data (e.g., normal distribution, equal variance between groups, etc.) hold. Tea then finally executes the valid tests.

What kinds of statistical analyses are possible with Tea?

Tea currently provides a module to conduct Null Hypothesis Significance Testing (NHST).

Since Tea, we have developed Tisane for generalized mixed-effects models (for Python) and rTisane for generalized models (for R). Both author statistical models from higher-level conceptual models. The latest DSL for supporting generalized mixed-effects models that combines insights from both Tisane and rTisane is rTisanePy (for Python). Tea and the lessons we learned from developing and using it informed all of the above!

How can I use Tea?

Tea is available on pip!

pip install tealang

See community examples here. If you have trouble using Tea with your use case, feel free to open an issue, and we'll try to help.

Step through a more guided, thorough documentation and a worked example.

How can I cite Tea?

For now, please cite:

article{JunEtAl2019:Tea,
  title={Tea: A High-level Language and Runtime System for Automating Statistical Analysis},
  author={Jun, Eunice and Daum, Maureen and Roesch, Jared and Chasins, Sarah E. and Berger, Emery D. and Just, Rene and Reinecke, Katharina},
  journal={Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology (UIST)},
  year={2019}
}

How reliable is Tea?

Our constraint solver is based on statistical texts (see our paper for more info).

If you find any bugs, please create a Github issue or let us know (email Eunice at emjun [at] cs.ucla.edu)!

Where can I learn more about Tea?

Please find more information at our website. Tea is a research prototype we have been trying our best to maintain and improve since 2019.

I want to collaborate! Where do I begin?

This is great! We're excited to have new collaborators. :)

To contribute code, please see docs and gudielines and open an issue or pull request.

If you want to use Tea for a project, talk about Tea's design, or anything else, please get in touch: emjun [at] cs.ucla.edu!

I have ideas. I want to chat.

Please reach out! We are nice :) Email Eunice at emjun [at] cs.ucla.edu!

FAQs

By the way, why Python?

Python is a common language for data science. We hope Tea can easily integrate into user workflows.

What format should my data be in?

Tea accepts data either as a CSV or a Pandas DataFrame. Tea asumes data is in "long format."

Name		Name	Last commit message	Last commit date
Latest commit History 476 Commits
.github/workflows		.github/workflows
evals		evals
examples		examples
tea		tea
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
Makefile		Makefile
README.md		README.md
SOURCES.md		SOURCES.md
documentation.md		documentation.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
tea_demo.ipynb		tea_demo.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tea-lang

Tea: A High-level Language and Runtime System for Automating Statistical Analyses

What is Tea?

What kinds of statistical analyses are possible with Tea?

How can I use Tea?

How can I cite Tea?

How reliable is Tea?

Where can I learn more about Tea?

I want to collaborate! Where do I begin?

I have ideas. I want to chat.

FAQs

By the way, why Python?

What format should my data be in?

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tea-lang

Tea: A High-level Language and Runtime System for Automating Statistical Analyses

What is Tea?

What kinds of statistical analyses are possible with Tea?

How can I use Tea?

How can I cite Tea?

How reliable is Tea?

Where can I learn more about Tea?

I want to collaborate! Where do I begin?

I have ideas. I want to chat.

FAQs

By the way, why Python?

What format should my data be in?

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages