Skip to main content

A tool for visualizing categorical data over time.

Project description

DOI

PyCatFlow

This package is a visualization tool which allows the representation of temporal developments, based on categorical data. I wrote a short article on Medium in which I outline the basic idea of PyCatFlow and provide a Tutorial for non-programmers based on a Jupyter Notebook with interactive widgets that can be run online.

Install

PyCatFlow is available on PyPi:

$ pip3 install pycatflow

Alternatively you can download the repository and install the package by running the setup.py install routine. Make sure to install the requirements as well:

pip3 install -r requirements.txt
python3 setup.py install

Additional Requirements: The visualization and export is based on the drawSvg package that in turn requires cairo to be installed as an external requirement. Platform-specific instructions for installing cairo are available on the cairo homepage.

On macOS cairo can be installed easily using homebrew:

$ brew install cairo

Basic usage

The visualization library provides many functionalities for adjusting the visual output. A simple use case is however as follows:

import pycatflow as pcf

# Loading and parsing data:
data = pcf.read_file("sample_data_ChatterBot_Requirements.csv", columns="column", nodes="items", categories="category", column_order="column order")

# Generating the visualization
viz = pcf.visualize(data, spacing=20, width=800, maxValue=20, minValue=2)
viz.savePng('sample_viz.png')
viz.saveSvg('sample_viz.svg')
viz

The code and sample data are provided in the example folder. The data contains annual snapshots of requirements of the ChatterBots framework developed and maintained by Gunther Cox.

Running the above code creates this visualization:

Sample Visualization

Credits & License

PyCatFlow was conceptualized by Marcus Burkhardt and implemented in collaboration with Herbert Natta (@herbertmn). It is inspired by the Rankflow visualization tool develped by Bernhard Rieder.

Cite as: Marcus Burkhardt, and Herbert Natta. 2021. “PyCatFlow: A Python Package for Visualizing Categorical Data over Time”. Zenodo. https://doi.org/10.5281/zenodo.5531785.

The package is released under MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycatflow-0.0.9.tar.gz (12.3 kB view details)

Uploaded Source

Built Distribution

pycatflow-0.0.9-py3-none-any.whl (11.1 kB view details)

Uploaded Python 3

File details

Details for the file pycatflow-0.0.9.tar.gz.

File metadata

  • Download URL: pycatflow-0.0.9.tar.gz
  • Upload date:
  • Size: 12.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for pycatflow-0.0.9.tar.gz
Algorithm Hash digest
SHA256 2073cbf5c31b69a7e6073c16b57c6b5eb2080b2bc4703990d4e67d70ee2cdb7b
MD5 9758083acdde104f679c7bf4f9ad13ad
BLAKE2b-256 cd111e2e14f96dd9c3c8b754ee75a4efebae2e53fe296a3c1f5392ad1fa28dcc

See more details on using hashes here.

File details

Details for the file pycatflow-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: pycatflow-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 11.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for pycatflow-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 1a43e141cc597fae8fec643d8d412ac35e88376833031bc84cbea23e730e308d
MD5 caf5d3ca07278920a973dfa144f076bf
BLAKE2b-256 29b988f8db9e23aecce7c9eeca5bf119093bb3a0bec8ee90053ad6230b0d123d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page