Risk modeling and prediction

These details have not been verified by PyPI

Project links

Project description

Triage

ML/Data Science Toolkit for Social Good and Public Policy Problems

Building ML/Data Science systems requires answering many design questions, turning them into modeling choices, which in turn define and machine learning models. Questions such as cohort selection, unit of analysis determination, outcome determination, feature (explanatory variables or predictors) generation, model/classifier training, evaluation, selection, bias audits, interpretation, and list generation are often complicated and hard to make design choices around a priori. In addition, once these choices are made, they have to be combined in different ways throughout the course of a project.

Triage is designed to:

Guide users (data scientists, analysts, researchers) through these design choices by highlighting critical operational use questions.
Provide an integrated interface to components that are needed throughout a ML/data science project workflow.

Getting Started with Triage

Are you completely new to Triage? Run through a quick tutorial hosted on google colab (no setup necessary) to see what triage can do! Tutorial hosted on Google Colab
Run it locally on an example problem and data set from Donors Choose
Dirty Duck Tutorial - Want a more in-depth walk through of triage's functionality and concepts? Go through the dirty duck tutorial that you can install on your local machine with sample data
QuickStart Guide - Try Triage out with your own project and data
Triage Documentation Site - Used Triage before and want more reference documentation?
Development - Contribute to Triage development.

Installation

To install Triage locally, you need:

Ubuntu/RedHat
Python 3.9+
A PostgreSQL 9.6+ database with your source data (events, geographical data, etc) loaded.
- NOTE: If your database is PostgreSQL 11+ you will get some speed improvements. We recommend updating to a recent version of PostgreSQL.
Ample space on an available disk (or for example in Amazon Web Services's S3) to store the matrices and models that will be created for your experiments

We recommend starting with a new Python virtual environment and pip installing triage there.

$ virtualenv triage-env
$ . triage-env/bin/activate
(triage-env) $ pip install triage

If you get an error related to pg_config executable, run the following command (make sure you have sudo access):

(triage-env) $ sudo apt-get install libpq-dev python3.9-dev

Then rerun pip install triage

(triage-env) $ pip install triage

To test if triage was installed correctly, type:

(triage-env) $ triage -h

Data

Triage needs data in a postgres database and a configuration file that has credentials for the database. The Triage CLI defaults database connection information to a file stored in 'database.yaml' (example in example/database.yaml).

If you don't want to install Postgres yourself, try triage db up to create a vanilla Postgres 12 database using docker. For more details on this command, check out Triage Database Provisioner

Configure Triage for your project

Triage is configured with a config.yaml file that has parameters defined for each component. You can see some sample configuration with explanations to see what configuration looks like.

Using Triage

Via CLI:

triage experiment example/config/experiment.yaml

Import as a python package:

from triage.experiments import SingleThreadedExperiment

experiment = SingleThreadedExperiment(
    config=experiment_config, # a dictionary
    db_engine=create_engine(...), # http://docs.sqlalchemy.org/en/latest/core/engines.html
    project_path='/path/to/directory/to/save/data' # could be an S3 path too: 's3://mybucket/myprefix/'
)
experiment.run()

There are a plethora of options available for experiment running, affecting things like parallelization, storage, and more. These options are detailed in the Running an Experiment page.

Development

Triage was initially developed at University of Chicago's Center For Data Science and Public Policy and is now being maintained at Carnegie Mellon University.

To build this package (without installation), its dependencies may alternatively be installed from the terminal using pip:

pip install -r requirement/main.txt

Testing

To add test (and development) dependencies, use test.txt:

pip install -r requirement/test.txt [-r requirement/dev.txt]

Then, to run tests:

pytest

Development Environment

To quickly bootstrap a development environment, having cloned the repository, invoke the executable develop script from your system shell:

./develop

A "wizard" will suggest set-up steps and optionally execute these, for example:

(install) begin

(pyenv) installed

(python-3.9.10) installed

(virtualenv) installed

(activation) installed

(libs) install?
1) yes, install {pip install -r requirement/main.txt -r requirement/test.txt -r requirement/dev.txt}
2) no, ignore
#? 1

Contributing

If you'd like to contribute to Triage development, see the CONTRIBUTING.md document.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

5.5.1

May 12, 2025

5.5.0

May 8, 2025

5.4.0

Mar 21, 2025

5.3.3

Oct 21, 2024

5.3.2

Feb 2, 2024

5.3.1

Feb 1, 2024

5.3.0

Jan 22, 2024

5.2.3

Sep 19, 2023

5.2.2

Nov 8, 2022

5.2.1

Oct 25, 2022

5.2.0

May 20, 2022

5.1.1

Feb 8, 2022

5.1.0

Dec 7, 2021

5.0.0

Aug 27, 2021

4.4.0

Aug 26, 2021

4.3.1

Apr 22, 2021

4.2.0

Nov 11, 2020

4.1.1

Jul 9, 2020

4.1.0

Jun 30, 2020

4.0.1

Dec 20, 2019

4.0.0

Dec 16, 2019

3.3.0

Feb 20, 2019

3.2.1

Dec 10, 2018

3.2.0

Dec 10, 2018

3.1.1

Nov 2, 2018

3.0.2

Oct 8, 2018

3.0.1

Oct 5, 2018

2.2.0

May 15, 2018

2.1.0

Nov 14, 2017

2.0.0

Nov 14, 2017

1.1.1

Oct 26, 2017

1.1.0

Oct 20, 2017

1.0.4

Mar 1, 2019

1.0.0

Jul 28, 2017

0.5.0

Jul 28, 2017

0.3.0

Nov 2, 2018

0.1.0

Nov 2, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

triage-5.5.1.tar.gz (3.9 MB view details)

Uploaded May 12, 2025 Source

Built Distribution

triage-5.5.1-py2.py3-none-any.whl (314.1 kB view details)

Uploaded May 12, 2025 Python 2Python 3

File details

Details for the file triage-5.5.1.tar.gz.

File metadata

Download URL: triage-5.5.1.tar.gz
Upload date: May 12, 2025
Size: 3.9 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for triage-5.5.1.tar.gz
Algorithm	Hash digest
SHA256	`ab930a2fe3aa23768210c7040ec37265c3a91a4ec69446ceb3b64611d93b1464`
MD5	`99fac6c33cb906b1056ea363b2ff1a4e`
BLAKE2b-256	`b10389c7ed114f3bf300d088bdecca8302d5419c91d92cb826c6d9e20fb1c662`

See more details on using hashes here.

File details

Details for the file triage-5.5.1-py2.py3-none-any.whl.

File metadata

Download URL: triage-5.5.1-py2.py3-none-any.whl
Upload date: May 12, 2025
Size: 314.1 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for triage-5.5.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`edee24d5f5052cdebcf76d93672e02c1b10b959756fa9761bbbe6c1cc2b7711d`
MD5	`430b0d439ef71a3760b57443870f8e83`
BLAKE2b-256	`08d461afd059b8188f17b13b1d219c9ad8adfdc183613b9bd111e665fbcaa97b`

See more details on using hashes here.

triage 5.5.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Triage

Getting Started with Triage

Installation

Data

Configure Triage for your project

Using Triage

Development

Testing

Development Environment

Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes