Skip to main content

A small package created to aid in the calculation of Persistence Curves

Project description

PersistenceCurves

A python package for computing Persistence Curves

Brief Introduction

Computational Topology is a field of mathematics concerned with examining and utilizing the shape of a dataset to discern the shape of the underlying space. The main tool of this field is Persistent Homology. Using this tool on a dataset yields a visual summary called a Persistence Diagram, which are multisets of points. We can define a metric on these diagrams called the bottleneck distance. These diagrams are stable with respect to this distance in the sense that a small change in the dataset leads to a small change in the diagrams. The set of all persistence diagrams with this bottleneck distance forms a metric space. However, performing machine learning tasks with this space is difficult. For this reason, much work has been done towards creating useful, clever vectorizations of these diagrams that are compatible with machine learning algorithms. This package is named after the Persistence Curve framework, which is a generalization of many such vectorizations.

The Diagram Class

The sole class of this package is . This package assumes the user has already calculated the persistence diagram(s) of interest. Diagrams are collection of ordered pairs (b,d) (birth and death respectively) where d>b and d can take the value infinity. Essentially, a diagram is an array or data frame of shape (x,2). Suppose Dgm is a diagram. The code below transforms D to the Diagram class.

import persistencecurves as pc
D = pc.Diagram(Dgm =Dgm, globalmaxdeath = None, infinitedeath=None, inf_policy="keep")

Global Values

This class has various global variables such as

diagram
= Dgm
Birth} = All the Birth values of Dgm

Death
= All the Death values of Dgm
globalmaxdeath
= If one is considering multiple samples of a space where a largest possible death value, that value should be input here. For example, images have a global max death of 255.
infinitedeath
= The value signifying an infinite death. For most softwares, this value is -1. Thus if left unset, any negative death value is assumed to be infinite.
shape
= Dgm.shape.

The Built-in Curves

The bulk of this package is the curve calculation. The Diagram class comes with some pre-set Persistence Curves as well as the option for a custom curve. All of the preset curves, except landscape, have as inputs meshstart, meshstop, and num_in_mesh. These numbers follow the same concept as NumPy's linspace where meshstart is the leftmost endpoint, meshstop is the right endpoint and num_in_mesh is the number of points in the mesh. The method returns a vector of length num_in_mesh. For a Diagram arising from an image we can (and do) use:

D.Betticurve(meshstart=0,meshstop=255,num_in_mesh=256)

For landscapes, we have an additional argument k for the level.

D.landscape(k=1,meshstart=0,meshstop=255,num_in_mesh=256)

The Custom Curve

A Persistence Curve has two main components. An inner function fun and a statistic (recall that a statistic is just a function of a multi-set). As such the

customcurve
method requires an input of both.

The inner function fun should be a function of three variables b,d,and t that returns a single value. The statistic can be any function applicable to an entire array. Some examples follow:

The code below returns exactly the Betti curve.

def fun(b,d,t):
    return 1
D.custom_curve(fun = fun, statistic = np.sum, meshstart=0, meshstop=255, num_in_mesh=256)

The code below returns the 0th persistent landscape

def fun(b,d,t):
    return np.min(t-b, d-t, 0)
D.custom_curve(fun=fun, statistic=np.max, meshstart=0, meshstop=255, num_in_mesh=256)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

persistencecurves-0.0.2.tar.gz (5.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

persistencecurves-0.0.2-py3-none-any.whl (6.4 kB view details)

Uploaded Python 3

File details

Details for the file persistencecurves-0.0.2.tar.gz.

File metadata

  • Download URL: persistencecurves-0.0.2.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.1

File hashes

Hashes for persistencecurves-0.0.2.tar.gz
Algorithm Hash digest
SHA256 c74acb565f5423f2014b8435c09bec3e488cc339d4caed45a213808fad8e3fb2
MD5 373563845ddc66f7d30107d69995deac
BLAKE2b-256 75f981d94fe0d15e10b45307f0c6b31c6cacfda46f86205785a0660b3770d49f

See more details on using hashes here.

File details

Details for the file persistencecurves-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: persistencecurves-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 6.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.1

File hashes

Hashes for persistencecurves-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 f06e6988d739c0f6b2e155d950e2bfad11f2aab4d7cb7bfb52d2bdc84bbeb02d
MD5 2d24c9f9a68db398b6ebb0cff603af4d
BLAKE2b-256 687945b96e0b4aa197b77f4ab0963d890ba69a8f9595679ad87798520ae4de62

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page