Skip to main content

A plotly plotting script generator and data parser

Project description

plotgen: easy data parsing and plot script generator

PyPI Latest Release License

About

Uses plotly to visualize data (e.g. tab, space, comma separated files) or pickled pandas dataframes. A lot of features are supported like reading in files with different separators, selecting and sorting columns, processing of multiple files, plotting line, scatter, bar, violin, box and gantt charts and customizing the graphs to all the needs. Many operations can be applied before plotting, like joining files, computing over rows and columns (sum, mean, median...), normalising data and filtering.

This script acts as a master script generating a plotting script. In case this script doesn't cover something that is required for a graph, it can output the final plotting script to make manual adjustments like adding annotations.

Installation

Easy installation via pip:

pip install plotgen

Manual

This is a single script project to allow the highest level of portability. Therefor manual installation is easy as you only have to put plotgen within your path and satisfy the requirements:

pip install -r requirements.txt

Usage

The script is designed to process any formatted data file (e.g. tsv, csv, pandas df) into a table of rows and columns. Every row is a data point while every column is a data trace. Columns starting with '_' are treated as special columns (e.g. _error _labels, _colour or _offset). Rows starting with '#' are treated as comments and are ignored. (Those special characters can be adjusted with the rescpective parameters).

The input parsing arguments are processed in order and can be used as many times as whished. This enables very powerful preprocessing of the data before the actual plotting takes place. One can even do very complex inter-column and inter-row computation, normalisation, adding rows/columns with the average and a lot more. This script can also be used to just process the input data and output it as a new file or pickle container.

There are global parameters which can be set once per call like all colour options, legend options, width, height or output. The next set or parameters can only be specified on an input file and affect how the file is treated and plotted like row, col, sorting options, selection and ignoring of data points or plot type (line, bar, box, violin). However some options will be inherited by the following input files e.g. row, col and plot type and some won't be inherited like all input file parsing options (sorting, selection...) or options that affect the actual plot like axis titles or ranges. Options that are inherited can also be specified before an input file.

By default plotgen will create an html plot and opens it in the default browser:

plotgen -i lines.tsv

plotgen can also just print the data it has parsed from the file:

plotgen -i lines.tsv --print

Or create a standalone plotting script out of the input data which incorporates all the options that can be set and are shown later:

plotgen -i lines.tsv --output lines.py

All these options can also be combined to print the data, open a browser plot and save it as a PDF:

plotgen -i lines.tsv --print --browser --output lines.py lines.pdf --width 1920 --height 1080

The full power becomes visisble when working with multiple files (but also the same file more than once) and processing the data and making subplots:

plotgen -i lines.tsv --row 1 --select-columns Sin -i lines.tsv --row 2 --sort-order desc --sort-by-column Sin --select-column Cos 

Selection of rows and columns is order sensitive and allows custom trace or data point ordering or duplication of traces and datapoints by repeating the names. All data row and column options also support numeric indexing by using e.g. --select-icolumns 0 1 2 etc.

The width and height of individual subplots can be simply adjusted by using colspan and rowspan:

plotgen -i lines.tsv --row 1 --rowspan 2 -i lines.tsv --row 3 --rowspan 1

It is also possible to plot many files into the same plot by just targetting the same row and col. Setting plot options like x-title, y-title or axis ranges should be done only once per subplot and ideally for the last file of the subplot.

Examples

Line Chart

plotgen -i lines.tsv --plot line --output line.png

Line Chart

Scatter Chart

plotgen -i scatter.tsv --plot line --line-mode markers --output scatter.png

Scatter Chart

Bar Chart

plotgen -i bar.tsv --plot bar --output bar.png

Bar Chart

Box Chart

plotgen -i distribution.tsv --plot box --output box.png

Box Chart

Violin Chart

plotgen -i distribution.tsv --plot violin --output violin.png

Violin Chart

Gantt Chart

plotgen -i gantt.tsv --plot bar --bar-mode stack --bar-text-position inside --orientation h --x-type date --output gantt.png

Gantt Chart

plotgen -i gantt_time.tsv --plot bar --bar-mode stack --bar-text-position outside --orientation h --x-type date --output gantt_time.png

Gantt Time Chart

Combined Subplots

plotgen -i lines.tsv --colspan 2 -i scatter.tsv --row 2 --colspan 1 --line-mode markers -i bar.tsv --col 2 --plot bar -i distribution.csv --row 3 --col 1 --plot violin -i distribution.csv --col 2 --plot box --horizontal-spacing 0.05 --vertical-spacing 0.05 --per-trace-colours --output subplot.png

Subplot Chart

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

plotgen-0.8.19-py3-none-any.whl (40.1 kB view details)

Uploaded Python 3

File details

Details for the file plotgen-0.8.19-py3-none-any.whl.

File metadata

  • Download URL: plotgen-0.8.19-py3-none-any.whl
  • Upload date:
  • Size: 40.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for plotgen-0.8.19-py3-none-any.whl
Algorithm Hash digest
SHA256 766d6fb55071732e0dd93b859b57b651b85662e5c878fb7d86b626c038bf721b
MD5 5b8d84014cec9715a0082c1bbb1334a7
BLAKE2b-256 6e02e125b507ff4bfc6cd9247ea75c98790653ca364b5904c17d5e140e4780fc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page