Skip to main content

Provides plotting tools useful in datascience.

Project description

dataplot

Provides plotting tools useful in datascience.

A lightweight plotting library for data science that unifies data transformation and plotting in a single chainable API. dataplot is designed for fast exploration, teaching demos, and script-based analysis workflows.

Installation

$ pip install dataplot

Features

dataplot focuses on an analysis-first plotting workflow: data processing and visual diagnostics are written in one concise chain.

Capability What it gives you Typical API
📦 Data-as-object API Treat raw arrays/series as first-class plotting objects with metadata and settings. dp.data(...)
🔗 Composable transforms Build reproducible feature pipelines before plotting. .rolling().demean().zscore().rank()
📊 Statistical chart set Switch between distribution, trend, and diagnostic views quickly. .hist(), .plot(), .scatter(), .qqplot()
🧩 Artist-first composition Compose multiple plots into one figure in a clean, deferred style. dp.figure(artist1, artist2, ...)
🎛️ Layered settings model Configure defaults at dataset / axes / figure scope with consistent fallback. set_plot(), set_axes(), set_figure()
🐍 Scientific Python stack Keep compatibility with familiar numeric and plotting ecosystems. numpy, pandas, scipy, matplotlib, seaborn

In short: less boilerplate, clearer analysis flow, and more reusable plotting code.

Quick Start

import dataplot as dp
import numpy as np

raw = np.random.randn(300)
x = dp.data(raw, label="daily_return")

artist1 = x.hist(bins=30, alpha=0.7)
artist2 = x.qqplot(baseline="normal")

fig = dp.figure(artist1, artist2, title="Distribution diagnostics")
fig

Core Concepts

dp.data(...)

dp.data(...) is the entry point of dataplot. It converts array-like input (for example list, numpy.ndarray, pandas.Series, or another PlottableData) into a unified PlottableData object.

PlottableData is the core abstraction that carries:

  • data values
  • transformation history
  • plot-level settings (labels, style hints, etc.)

This design keeps analysis context attached to the data itself.

import dataplot as dp
import numpy as np

raw = np.random.randn(300)
x = dp.data(raw, label="daily_return")

Typical Workflow

Raw data -> dp.data(...) -> transform chain -> artist(s) -> dp.figure(...)

You can think of dataplot as a 4-step loop:

  1. Wrap your data with dp.data(...).
  2. Transform it with chainable operations (rolling, zscore, rank, ...).
  3. Render one or more Artist objects via plot methods.
  4. Compose artists into a figure and apply final figure/axes settings.

Data Operations

PlottableData supports both arithmetic operators and built-in transforms:

  • Arithmetic: + - * / **
  • Log / power family: log() / log10() / signedlog() / signedlog10() / pow() / root() / sqrt() / ...
  • Statistical transforms: rolling() / demean() / zscore() / rank(pct=True) / cumsum() / abs()
  • State management: copy() / reset() / undo_all() / resample()

Operations are chainable, which is useful for quick experimentation:

x.zscore().rolling(5).rank(pct=True)

Plot Methods

Every plot method returns an Artist object instead of drawing immediately. This enables deferred composition and clean multi-panel figure assembly:

  • Distribution: hist(...)
  • Trend / relationship: plot(...), scatter(...)
  • Goodness-of-fit diagnostics: qqplot(...), ppplot(...), ksplot(...)
  • Structure overview: corrmap(...)
artist1 = x.hist(bins=30, alpha=0.7)
artist1  # show one plot
artist2 = x.qqplot(baseline="normal")
fig = dp.figure(artist1, artist2, title="Distribution diagnostics")
fig  # show both plots

Plot Settings

Common settings:

  • title, xlabel, ylabel
  • alpha, grid, grid_alpha
  • style, figsize, dpi
  • fontdict, legend_loc
  • subplots_adjust
  • reference_lines (for example: "y=x", "y=0", "x=1")

Requirements

validating
lazyr
loggings
matplotlib
numpy
pandas
scipy
seaborn

See Also

Github repository

PyPI project

License

This project falls under the BSD 3-Clause License.

History

v0.1.12

  • Added rank-normalization support via PlottableData.ranknorm(...), using normal-quantile mapping for percentile ranks.
  • Optimized rank computation by vectorizing tie handling in PlottableData.rank(...).
  • Completed the naming migration from PlotDataSet/PlottableDatas to PlottableData/PlottableDataSet for clearer API consistency.
  • Refactored PlottableData by moving its data-processing logic into a new parent class Data, keeping only plotting settings.

v0.1.11

  • Added arithmetic operator support to MultiObject (including reflected operators), enabling element-wise math workflows for grouped PlottableData objects with length checks.
  • Improved numerical robustness in utils.math by consistently sanitizing NaN/Inf inputs and validating finite sample counts in 1D linear regression.
  • Refined PlottableData.plot() / PlottableData.scatter() x-tick handling to accept broader array-like inputs and convert them consistently.
  • Added and expanded unit tests for core/container behavior and utility modules.

v0.1.10

  • Renamed dist_or_sample= to baseline= in PlottableData.qqplot() for clearer baseline specification.
  • Removed edge_precision= from PlottableData.ppplot() and PlottableData.ksplot(), and refined probability-range handling in the related diagnostic plotting flow.
  • Improved QQ/PP plot readability by updating default axis labels and ensuring the rightmost x-axis tick label is preserved.

v0.1.9

  • New method PlottableData.rank(pct=True) for rank transformation, supporting pct=False to return raw ranks.
  • Updated scatter behavior with improved defaults and removed implicit x-axis sorting.
  • Refactored plotting APIs to remove the ax constructor argument for a cleaner artist construction flow.

v0.1.8

  • Improved naming inference in dp.data(...) when decorators from validating are involved.
  • Refined module exports around artist helpers to make wildcard-style artist imports behave consistently.
  • Refactored plot-setting fallback behavior to use dp.defaults as the global source of defaults.
  • Enhanced get_setting(...) (including overload/type-hint coverage) so default values align with the corresponding setting types, and dict defaults are handled more safely.
  • Removed legacy set_default() usage and simplified setting application paths across artists/containers.
  • Added dependency loggings and updated warning emission in figure setting flows.

v0.1.7

  • dp.data(...) can accept PlottableData objects now.
  • FigWrapper.__enter__() now returns a copy safely via _entered_copy.
  • Internal maintenance and stability refinements.

v0.1.6

  • New method PlottableData.scatter() to draw true scatter charts while keeping PlottableData.plot() as line chart behavior.
  • Improved automatic label inference for dp.data(...), plotting labels, and x-axis labels in interactive contexts.
  • Plot builders now use deferred drawing; removed dp.show() and improved axis/figure rendering in object representations.
  • Refined rendering stability with fixes for empty-axis cleanup and figure re-rendering in FigWrapper.__repr__.
  • Updated minimum required Python version to >=3.13.

v0.1.5

  • Fixed issue: unworking figure settings in the artist methods.

v0.1.4

  • Fixed issue: incorrectly displayed histogram statistics when the x-label had been modified by the user.

v0.1.3

  • Allowed users to set the plot-settings by kwargs in artist methods like PlottableData.hist(), PlottableData.plot(), etc.
  • New operation methods PlottableData.signedpow() and PlottableData.log10().
  • Renamed PlottableData.signlog() to .signedlog(); renamed PlottableData.opclear() to .undo_all(); removed PlottableData.opclear_records_only().
  • New optional parameter format_label= for PlottableData.set_plot() to decide whether to format the label when painting on the axes.
  • When defining the data classes, used dataclasses instead of attrs for a faster import.

v0.1.2

  • New methods PlottableData.corrmap(), PlottableData.ppplot(), and PlottableData.resample().
  • New optional parameter fmt= for PlottableData.plot(), PlottableData.qqplot(), PlottableData.ppplot(), and PlottableData.ksplot().
  • Bugfix.

v0.1.1

  • New module-level function dp.show().
  • New methods PlottableData.qqplot(), PlottableData.ksplot() and PlottableData.abs().
  • All the plotting method (e.g., .hist()) will now return an Artist object instead of None.
  • New plot settings: grid and grid_alpha.
  • Parameters of FigWrapper.set_figure(), AxesWrapper.set_axes() and PlottableData.set_plot() are keyword-only now.
  • The returns of .set_figure() and .set_axes() will be None (instead of self) to avoid misunderstandings.
  • New optional parameter inplace= for PlottableData.set_plot() to decide whether the changes will happen in-place (which is the only option before) or in a new copy.
  • Parameter ticks= for PlottableData.plot() can be set to a PlottableData object now.

v0.1.0

  • PlottableData now supports binary operations including +, -, *, /, and **.
  • New methods FigWrapper.set_figure() and AxesWrapper.set_axes() - use them instead of *.set_plot().
  • Simplified the usage of AxesWrapper.
  • New plot settings: subplots_adjust=, fontdict= and dpi=.
  • After this version, the required Python version is updated to >=3.11.9. Download and install v0.0.2 if the user is under lower Python version (>=3.8.13).

v0.0.2

  • Updated the meta-data.

v0.0.1

  • Initial release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dataplot-0.1.12.tar.gz (36.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dataplot-0.1.12-py3-none-any.whl (41.2 kB view details)

Uploaded Python 3

File details

Details for the file dataplot-0.1.12.tar.gz.

File metadata

  • Download URL: dataplot-0.1.12.tar.gz
  • Upload date:
  • Size: 36.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for dataplot-0.1.12.tar.gz
Algorithm Hash digest
SHA256 a9bb891f99995d766c286e58abf00ba52a00c84616439ccc29f1f53baddb6ab8
MD5 2a7e2a1fe0f501ddd6882a20c6857c70
BLAKE2b-256 6660bb09165fd2cfb83814d19d71135edad6eeaca187dffdf554d6193388f10b

See more details on using hashes here.

File details

Details for the file dataplot-0.1.12-py3-none-any.whl.

File metadata

  • Download URL: dataplot-0.1.12-py3-none-any.whl
  • Upload date:
  • Size: 41.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for dataplot-0.1.12-py3-none-any.whl
Algorithm Hash digest
SHA256 44654b8d76825ddbe45bb5b85fcdaddd6a4634d2e82abe0ec4f7ef5583b5c0c4
MD5 48b776207f9264f77c527152b0046186
BLAKE2b-256 8bf90a0ff697b0dac07b5c049e1ae9fa15d8b1f07f47217c33d728f324be203c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page