A plotly plotting script generator and data parser
Project description
plotgen: easy data parsing and plot script generator
About
Uses plotly to visualize data (e.g. tab, space, comma separated files) or pickled pandas dataframes. A lot of features are supported like reading in files with different separators, selecting and sorting columns, processing of multiple files, plotting line, scatter, bar, violin, box and gantt charts and customizing the graphs to all the needs. Many operations can be applied before plotting, like joining files, computing over rows and columns (sum, mean, median...), normalising data and filtering.
This script acts as a master script generating a plotting script. In case this script doesn't cover something that is required for a graph, it can output the final plotting script to make manual adjustments like adding annotations.
Installation
Easy installation via pip:
pip install plotgen
Manual
This is a single script project to allow the highest level of portability. Therefor manual installation is easy as you only have to put plotgen
within your path and satisfy the requirements:
pip install -r requirements.txt
Usage
The script is designed to process any formatted data file (e.g. tsv, csv, pandas df) into a table of rows and columns. Every row is a data point while every column is a data trace. Columns starting with '_' are treated as special columns (e.g. _error _labels, _colour or _offset). Rows starting with '#' are treated as comments and are ignored. (Those special characters can be adjusted with the rescpective parameters).
The input parsing arguments are processed in order and can be used as many times as whished. This enables very powerful preprocessing of the data before the actual plotting takes place. One can even do very complex inter-column and inter-row computation, normalisation, adding rows/columns with the average and a lot more. This script can also be used to just process the input data and output it as a new file or pickle container.
There are global parameters which can be set once per call like all colour options, legend options, width, height or output. The next set or parameters can only be specified on an input file and affect how the file is treated and plotted like row, col, sorting options, selection and ignoring of data points or plot type (line, bar, box, violin). However some options will be inherited by the following input files e.g. row, col and plot type and some won't be inherited like all input file parsing options (sorting, selection...) or options that affect the actual plot like axis titles or ranges. Options that are inherited can also be specified before an input file.
By default plotgen will create an html plot and opens it in the default browser:
plotgen -i lines.tsv
plotgen can also just print the data it has parsed from the file:
plotgen -i lines.tsv --print
Or create a standalone plotting script out of the input data which incorporates all the options that can be set and are shown later:
plotgen -i lines.tsv --output lines.py
All these options can also be combined to print the data, open a browser plot and save it as a PDF:
plotgen -i lines.tsv --print --browser --output lines.py lines.pdf --width 1920 --height 1080
The full power becomes visisble when working with multiple files (but also the same file more than once) and processing the data and making subplots:
plotgen -i lines.tsv --row 1 --select-columns Sin -i lines.tsv --row 2 --sort-order desc --sort-by-column Sin --select-column Cos
Selection of rows and columns is order sensitive and allows custom trace or data point ordering or duplication of traces and datapoints by repeating the names. All data row and column options also support numeric indexing by using e.g. --select-icolumns 0 1 2 etc.
The width and height of individual subplots can be simply adjusted by using colspan and rowspan:
plotgen -i lines.tsv --row 1 --rowspan 2 -i lines.tsv --row 3 --rowspan 1
It is also possible to plot many files into the same plot by just targetting the same row and col. Setting plot options like x-title, y-title or axis ranges should be done only once per subplot and ideally for the last file of the subplot.
Examples
Line Chart
plotgen -i lines.tsv --plot line --output line.png
Scatter Chart
plotgen -i scatter.tsv --plot line --line-mode markers --output scatter.png
Bar Chart
plotgen -i bar.tsv --plot bar --output bar.png
Box Chart
plotgen -i distribution.tsv --plot box --output box.png
Violin Chart
plotgen -i distribution.tsv --plot violin --output violin.png
Gantt Chart
plotgen -i gantt.tsv --plot bar --bar-mode stack --bar-text-position inside --orientation h --x-type date --output gantt.png
plotgen -i gantt_time.tsv --plot bar --bar-mode stack --bar-text-position outside --orientation h --x-type date --output gantt_time.png
Combined Subplots
plotgen -i lines.tsv --colspan 2 -i scatter.tsv --row 2 --colspan 1 --line-mode markers -i bar.tsv --col 2 --plot bar -i distribution.csv --row 3 --col 1 --plot violin -i distribution.csv --col 2 --plot box --horizontal-spacing 0.05 --vertical-spacing 0.05 --per-trace-colours --output subplot.png
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file plotgen-0.8.1-py3-none-any.whl
.
File metadata
- Download URL: plotgen-0.8.1-py3-none-any.whl
- Upload date:
- Size: 39.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 colorama/0.4.4 importlib-metadata/4.6.4 keyring/23.5.0 pkginfo/1.8.2 readme-renderer/34.0 requests-toolbelt/0.9.1 requests/2.25.1 rfc3986/1.5.0 tqdm/4.57.0 urllib3/1.26.5 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | def5eac95353d64ed040bcd081acb3e29d038a26fa459640354b5a92d60df4c0 |
|
MD5 | f8e23ad49e7cc6c21b7514e2c15773ea |
|
BLAKE2b-256 | e59e4c39220f4e90474c39bc92aa5babc6985525a5ff345080f121026d5901d0 |