A simple and efficient Python implementation of Mapper algorithm for Topological Data Analysis
Project description
tda-mapper
tda-mapper is a simple and efficient Python library implementing the Mapper algorithm for Topological Data Analysis (TDA). It enables fast computation of Mapper graphs by using vp-trees to optimize the construction of open covers, improving both performance and scalability.
For further details, please refer to the preprint and online documentation.
-
Efficient Mapper Computation: Optimized for higher-dimensional lenses.
-
Interactive Visualizations: Multiple plotting backends for flexibility.
-
Interactive App: Interactive tool for quick, in-depth data exploration.
Background
The Mapper algorithm is a well-known technique in the field of topological data analysis that allows data to be represented as a graph. Mapper is used in various fields such as machine learning, data mining, and social sciences, due to its ability to preserve topological features of the underlying space, providing a visual representation that facilitates exploration and interpretation. For an in-depth coverage of Mapper you can read the original paper.
Step 1 | Step 2 | Step 3 | Step 4 |
---|---|---|---|
Chose lens | Cover image | Run clustering | Build graph |
Citations
If you use tda-mapper in your work, please consider citing both the library, archived in a permanent Zenodo record, and the preprint, which provides a broader methodological overview. We recommend citing the specific version of the library used in your research, as well as the latest version of the preprint. For citation examples, refer to the documentation.
Quick Start
Installation
To install the latest version uploaded on PyPI
pip install tda-mapper
How to Use
Here's a minimal example using the circles dataset from scikit-learn
to
demonstrate how to use tda-mapper:
import numpy as np
from sklearn.datasets import make_circles
from sklearn.decomposition import PCA
from sklearn.cluster import DBSCAN
from tdamapper.learn import MapperAlgorithm
from tdamapper.cover import CubicalCover
from tdamapper.plot import MapperPlot
# load a labelled dataset
X, labels = make_circles(n_samples=5000, noise=0.05, factor=0.3, random_state=42)
y = PCA(2, random_state=42).fit_transform(X)
cover = CubicalCover(n_intervals=10, overlap_frac=0.3)
clust = DBSCAN()
graph = MapperAlgorithm(cover, clust).fit_transform(X, y)
# color according to labels
fig = MapperPlot(graph, dim=2, seed=42, iterations=60).plot_plotly(colors=labels)
fig.show(config={'scrollZoom': True})
Original Dataset | Mapper Graph |
---|---|
More examples can be found in the documentation.
Interactive App
You can explore a live demo of tda-mapper directly on Streamlit Cloud, or run it locally using the following:
pip install -r app/requirements.txt
streamlit run app/streamlit_app.py
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file tda_mapper-0.8.0.tar.gz
.
File metadata
- Download URL: tda_mapper-0.8.0.tar.gz
- Upload date:
- Size: 42.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f1be95bf6ecaaa23feb50b6507fee816bec2bd15e8ac63f24db986dda4b53a6f |
|
MD5 | 07c20a2a362a1068fd0327adddf24bc4 |
|
BLAKE2b-256 | 24489655b1da8c3bc70bf9ffb60b1a592e0082b06c54a14e3b3cb4f89001634d |
Provenance
The following attestation bundles were made for tda_mapper-0.8.0.tar.gz
:
Publisher:
publish.yml
on lucasimi/tda-mapper-python
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
tda_mapper-0.8.0.tar.gz
- Subject digest:
f1be95bf6ecaaa23feb50b6507fee816bec2bd15e8ac63f24db986dda4b53a6f
- Sigstore transparency entry: 150348507
- Sigstore integration time:
- Predicate type:
File details
Details for the file tda_mapper-0.8.0-py3-none-any.whl
.
File metadata
- Download URL: tda_mapper-0.8.0-py3-none-any.whl
- Upload date:
- Size: 41.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d7affc544bd06bedc507058f9025ca71c0213e7e01b848a570f33ade513bafcf |
|
MD5 | 9440086e9b8d7f590b7794e1135eaa2a |
|
BLAKE2b-256 | fc370dc3330524830ca5635563a193b32671303dc7dabc217525bc922575e941 |
Provenance
The following attestation bundles were made for tda_mapper-0.8.0-py3-none-any.whl
:
Publisher:
publish.yml
on lucasimi/tda-mapper-python
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
tda_mapper-0.8.0-py3-none-any.whl
- Subject digest:
d7affc544bd06bedc507058f9025ca71c0213e7e01b848a570f33ade513bafcf
- Sigstore transparency entry: 150348508
- Sigstore integration time:
- Predicate type: