lyscripts

Package containing scripts used in lynference pipelines

These details have not been verified by PyPI

Project links

GitHub Statistics

Project description

What are these `lyscripts`?

This package provides convenient scripts for performing inference and learning regarding the lymphatic spread of head & neck cancer. Essentially, it provides a command line interface (CLI) to the lymph library.

We are making these "convenience" scripts public, because doing so is one necessary requirement to making our research easily and fully reproducible. There exists another repository, lynference, where we store the pipelines that produce(d) our published results in a persistent way. Head over there to learn more about how to reproduce our work.

Installation

These scripts can be installed via pip:

pip install lyscripts

or installed from source by cloning this repo

git clone https://github.com/rmnldwg/lyscripts.git
cd lyscripts
pip install .

Usage

After installing the package, run python -m lyscripts --help to see the following output:

usage: lyscripts [-h] [-v]
                 {generate,join,enhance,clean,split,sample,evaluate,predict,plot,
                  temp_schedule}
                 ...

Utility for performing common tasks w.r.t. the inference and prediction tasks one
can use the `lymph` package for.


POSITIONAL ARGUMENTS
  {generate,join,enhance,clean,split,
   sample,evaluate,predict,plot,
   temp_schedule}

    generate                            Generate synthetic patient data for testing
                                        and validation purposes.

    join                                Join datasets from different sources (but of
                                        the same format) into one.

    enhance                             Enhance a LyProX-style CSV dataset in two ways:

                                        1. Add consensus diagnoses based on all
                                        available modalities using on of two
                                        methods: `max_llh` infers the most likely
                                        true state of involvement given only the
                                        available diagnoses. `rank` uses the
                                        available diagnositc modalities and ranks
                                        them based on their respective sensitivity
                                        and specificity.

                                        2. Complete sub- & super-level fields. This
                                        means that if a dataset reports LNLs IIa and
                                        IIb separately, this script will add the
                                        column for LNL II and fill it with the
                                        correct values. Conversely, if e.g. LNL II
                                        is reported to be healthy, we can assume the
                                        sublevels IIa and IIb would have been
                                        reported as healthy, too.

    clean                               Transform the enhanced lyDATA CSV files into
                                        a format that can be used by the lymph model
                                        using this package's utilities.

    split                               Split the full dataset into cross-validation
                                        folds according to the content of the
                                        params.yaml file.

    sample                              Learn the spread probabilities of the HMM
                                        for lymphatic tumor progression using the
                                        preprocessed data as input and MCMC as
                                        sampling method.

                                        This is the central script performing for
                                        our project on modelling lymphatic spread in
                                        head & neck cancer. We use it for model
                                        comparison via the thermodynamic integration
                                        functionality and use the sampled parameter
                                        estimates for risk predictions. This risk
                                        estimate may in turn some day guide
                                        clinicians to make more objective decisions
                                        with respect to defining the *elective
                                        clinical target volume* (CTV-N) in
                                        radiotherapy.

    evaluate                            Evaluate the performance of the trained
                                        model by computing quantities like the
                                        Bayesian information criterion (BIC) or (if
                                        thermodynamic integration was performed) the
                                        actual evidence (with error) of the model.

    predict                             This module provides functions and scripts
                                        to predict the risk of hidden involvement,
                                        given observed diagnoses, and prevalences of
                                        patterns for diagnostic modalities.

    plot                                Provide various plotting utilities for
                                        displaying results of e.g. the inference or
                                        prediction process.

    temp_schedule                       Generate inverse temperature schedules for
                                        thermodynamic integration using various
                                        different methods.

                                        Thermodynamic integration is quite sensitive
                                        to the specific schedule which is used. I
                                        noticed in my models, that within the
                                        interval $[0, 0.1]$, the increase in the
                                        expected log-likelihood is very steep.
                                        Hence, the inverse temparature $\beta$ must
                                        be more densely spaced in the beginning.

                                        This can be achieved by using a power
                                        sequence: Generate $n$ linearly spaced
                                        points in the interval $[0, 1]$ and then
                                        transform each point by computing
                                        $\beta_i^k$ where $k$ could e.g. be 5.


OPTIONAL ARGUMENTS
  -h, --help                            show this help message and exit
  -v, --version                         Display the version of lyscripts (default: False)

Each of the individual subcommands provides a help page like this respectively that detail the positional and optional arguments along with their function.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

Release history Release notifications | RSS feed

1.0.0a2 pre-release

May 28, 2024

1.0.0a1 pre-release

Apr 3, 2024

1.0.0a0 pre-release

Dec 20, 2023

0.7.3

Aug 29, 2023

0.7.2

Jul 31, 2023

0.7.1

Jul 31, 2023

0.7.0

Jun 26, 2023

0.6.9

Jun 21, 2023

0.6.8

May 30, 2023

0.6.7

May 23, 2023

0.6.6

Dec 1, 2022

0.6.5

Dec 1, 2022

0.6.4

Dec 1, 2022

0.6.3

Nov 25, 2022

0.6.2

Nov 25, 2022

0.6.1

Nov 24, 2022

0.6.0

Nov 23, 2022

0.5.11

Nov 8, 2022

0.5.10

Oct 13, 2022

0.5.9

Sep 16, 2022

0.5.8

Sep 12, 2022

This version

0.5.7

Aug 29, 2022

0.5.6

Aug 29, 2022

0.5.5

Aug 25, 2022

0.5.4

Aug 24, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lyscripts-0.5.7.tar.gz (39.0 kB view hashes)

Uploaded Aug 29, 2022 Source

Built Distribution

lyscripts-0.5.7-py3-none-any.whl (49.0 kB view hashes)

Uploaded Aug 29, 2022 Python 3

Hashes for lyscripts-0.5.7.tar.gz

Hashes for lyscripts-0.5.7.tar.gz
Algorithm	Hash digest
SHA256	`148c3aa6ac061ae84fdee2d1967215226050b690b873f5751c199b9f1e5d827e`
MD5	`c0649c6c98a5ddb29bbbffabf7441aa7`
BLAKE2b-256	`af02e75dc3916c1de7a739b9f15ed06a0869cfcf87272b56205194e82daaf55a`

Hashes for lyscripts-0.5.7-py3-none-any.whl

Hashes for lyscripts-0.5.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ba866b7c308ffa712e8845f93e62c337c05a939beb8841eced0cfcc5931a92ff`
MD5	`723125b2a1db126b963e247073ea874e`
BLAKE2b-256	`63a96b202b94ac5df3e5cf1c33432e561040ee10f7082960ab425c71f8a0930f`

lyscripts 0.5.7

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

What are these `lyscripts`?

Installation

Usage

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

lyscripts 0.5.7

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

What are these lyscripts?

Installation

Usage

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

What are these `lyscripts`?