synmod

Temporal model synthesis

These details have not been verified by PyPI

Project links

Homepage

Project description

======
synmod
======

.. image:: https://travis-ci.com/cloudbopper/synmod.svg?branch=master
    :target: https://travis-ci.com/cloudbopper/synmod
    :alt: Build Status

.. image:: https://readthedocs.org/projects/synmod/badge/?version=latest
    :target: https://synmod.readthedocs.io/en/latest/?badge=latest
    :alt: Documentation Status

``synmod`` is a Python library to generate synthetic time-series data and models.
Given the input configuration (the number of instances, number of features and sequence
length),
it randomly samples feature types and generators, and uses these to generate time-series data.
It also partitions the set of features into relevant and irrelevant, selects
temporal windows for each feature, and uses these to create a function representing
a time-series model.

``synmod`` is developed to
support anamod_, a Python library implementing algorithms designed
to analyze feature importance of black-box models.

.. _anamod: https://github.com/cloudbopper/anamod


------------------
Feature Generation
------------------

We randomly sample a mixture of discrete and continuous features, and for each feature, we randomly
sample sequence generators based on Bernoulli processes (binary features only) and Markov chains
(binary/categorical/continuous features). Each feature is associated with a randomly selected window,
so that the generating process within the window is different from that outside the window.
For some continuous features, sampled values are accumulated so as to model trends, i.e.
contiguous time periods where the value of the feature increases, decreases or
stays nearly constant, so that the ordering of values becomes significant.

.. figure:: images/sim-generator.png

    Markov chain generator for a continuous feature. A window (i.e. subsequence)
    is randomly selected for the feature. The sequence
    for a given instance is generated via a random walk through the
    chain. Different chains are used for in-window and out-of-window
    timestamps. Each state is associated with a Gaussian distribution, and the
    sampled value determines the value of the feature at a given timestep. The
    number of states per generator, state distributions and transition
    probabilities are randomly chosen.

.. figure:: images/sim-series.png

    Three generated time series for a feature using such a generator. In this
    example, the value sampled for the feature at a given timestep is used to
    increment the value of the feature to generate trends.


---------------
Synthetic Model
---------------

**[If you're opening this page on Github, see https://synmod.readthedocs.io/en/latest/readme.html for properly rendered LaTeX.]**

We create a two-level function over the features. We first partition the set of
features into *relevant* and *irrelevant*. For each feature :math:`j`, we sample a function :math:`g_j`
(average/max/slope) that aggregates the values within the window :math:`[k_1, k_2]` of that feature:

.. math::

    \bar{x}_j^{(i)} = g_j \left( x_{j, k_1} \ldots x_{j, k_2} \right)

We then take a linear combination of the aggregated values of relevant features :math:`\mathcal{R}`
to represent the *ground-truth model* :

.. math::

    y^{(i)} = \sum\limits_{j \in \mathcal{R}} \alpha_j \bar{x}_j^{(i)}

We also create an *approximate model* :math:`f \left( \mathbf{X} \right)` that includes the
irrelevant features :math:`\mathcal{I}`:

.. math::

    f \left( \mathbf{X}^{(i)} \right) = \sum\limits_{j \in \mathcal{R}} \alpha_j \bar{x}_j^{(i)} + \beta \sum\limits_{j \in \mathcal{I}} \alpha_j \bar{x}_j^{(i)}

For the purpose of downstream analysis, the ground-truth model serves to generate labels over the data,
and the approximate model represents the time-series model being analyzed.

------------
Installation
------------

To install synmod from github::

    pip install synmod@git+https://github.com/cloudbopper/synmod

-----
Usage
-----

``synmod`` may be invoked from within Python (detailed usage documentation forthcoming), or from the command-line as follows::

    python -m synmod -num_instances 100 -num_features 10 -sequence_length 20 -output_dir out1 -write_outputs 1

``synmod`` also supports other configuration options, including different model types (classifier vs. regression),
static (i.e. non-temporal) models, fraction of features chosen to be relevant, and interaction terms.
Run ``python -m synmod -h`` to see a full list of options.

-------
License
-------

``synmod`` is free, open source software, released under the MIT license. See LICENSE_ for details.

.. _LICENSE: https://github.com/cloudbopper/synmod/blob/master/LICENSE

-------
Contact
-------

`Akshay Sood`_

.. _Akshay Sood: https://github.com/cloudbopper


=========
Changelog
=========

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.1.4

Mar 23, 2022

0.1.3

Oct 26, 2021

0.1.2

Dec 24, 2020

0.1.1

Dec 9, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

synmod-0.1.4.tar.gz (878.9 kB view details)

Uploaded Mar 23, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

synmod-0.1.4-py2.py3-none-any.whl (404.7 kB view details)

Uploaded Mar 23, 2022 Python 2Python 3

File details

Details for the file synmod-0.1.4.tar.gz.

File metadata

Download URL: synmod-0.1.4.tar.gz
Upload date: Mar 23, 2022
Size: 878.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.12

File hashes

Hashes for synmod-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`ecb70dd9d013d7bfea27a35eb6df01d0e648c3f73abd2a3d548af846ad11bbe9`
MD5	`a1fd6a631e2ddc386261a1ebed5a5ac1`
BLAKE2b-256	`360a296aeeab810b02b4e1ab099c20daab38594e1f34a732f45da2d770c7c369`

See more details on using hashes here.

File details

Details for the file synmod-0.1.4-py2.py3-none-any.whl.

File metadata

Download URL: synmod-0.1.4-py2.py3-none-any.whl
Upload date: Mar 23, 2022
Size: 404.7 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.12

File hashes

Hashes for synmod-0.1.4-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`e377dd0c9374ef9da242020951013d66fdce02104ebfaff897f666d41a2c7041`
MD5	`0a569a36298e8bf7cba93239438f186c`
BLAKE2b-256	`9debfc504c62476e71f0fc988dc2a76db3953bc08e2a4c7102f48afa5ec66547`

See more details on using hashes here.

synmod 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes