Skip to main content

Transform raw data into insights.

Project description

Pychemist

The Alchemist of Data Science

Transform raw data into insights.

Pychemist is a lightweight Python library designed to simplify and enrich your data science workflow. Inspired by the transformation mindset of an alchemist, it helps you turn raw data into golden insights. Pychemist replaces complex, repetitive code with a clean and intuitive syntax that streamlines data cleaning, transformation, and preparation to reveal clear insights. It also enables clean and well-structured presentation of results, making it easier to communicate findings effectively.


Features

  • Create lagged or lead variables for time-series and panel data
  • Run quick, readable t-tests on treatment groups
  • Filter model summaries to hide fixed effects
  • Conditional mutation of DataFrames
  • Pandas accessor (.chem) for fluent, chainable workflows

Installation

pip install pychemist

Usage

import pychemist as chem

Example 1: Conditional mutation using the df.chem.mutate DataFrame accessor.

Update the total_assets for a specific company and year to a given value:

df=df.chem.mutate('company_id == "8ga62sav" & year==2025', "total_assets", 82000000)

Example 2: Conditional mutation using the df.chem.mutate DataFrame accessor.

Set the Promotion column to 1 for managers who haven't been promoted in 3 or more years and have a performance rating of at least 4; otherwise set it to 0.

df = df.chem.mutate('YearsSinceLastPromotion >= 3 & JobRole == "Manager" & PerformanceRating >= 4', 'Promotion', 1, 0)

Example 3: Creating lagged variables using the df.chem.lag DataFrame accessor.

Create lagged versions of total assets and net income for each ticker, only when the year difference is exactly 1:

df=df.chem.lag(['total assets','net income'],'ticker','year')

Example 4: Creating leading variables using the df.chem.lead DataFrame accessor.

Create lead (future) versions of total assets and net income for each ticker, only when the year difference is exactly 1:

df=df.chem.lead(['total assets','net income'],'ticker','year')

Example 5: Creating 2-year lagged variables using the df.chem.lag DataFrame accessor.

Create lagged versions of total assets and net income for each ticker, only when the year difference is exactly 2:

df=df.chem.lag(['total assets','net income'],'ticker','year',2)

Example 6: T-test between treated and control groups

chem.ttest(df, variable="outcome", treatment="treated")

Example 7: Model summary without fixed effects

import statsmodels.formula.api as smf
model = smf.ols("y ~ x + C(firm)", data=df).fit()
print(chem.summary_no_fe(model))

MIT License Copyright (c) Jeroen van Raak (2025)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pychemist-0.0.8.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pychemist-0.0.8-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file pychemist-0.0.8.tar.gz.

File metadata

  • Download URL: pychemist-0.0.8.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pychemist-0.0.8.tar.gz
Algorithm Hash digest
SHA256 a9fd76a2cff791063b3ea0b72baace654ea83159ba482824e5a47cf02c73fe93
MD5 13fc2e28bd9356eed7d118e265e273ab
BLAKE2b-256 aebda5c1bcc558a8264b972f592e85dc6cfb3769dd0faf5d0550b0aae7c226d6

See more details on using hashes here.

Provenance

The following attestation bundles were made for pychemist-0.0.8.tar.gz:

Publisher: release.yaml on vanraak/pychemist

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pychemist-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: pychemist-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pychemist-0.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 f16ccd5b0480b1d550c3a01e9c2f1b7610b1d439ea0e6f7b157ddda6989c508b
MD5 51d0de0cd001e2d0f81e43665aa352d5
BLAKE2b-256 f330252e46c89f14f2bc7e3c5ce83fdfe73bb605d821cd1bc7f347fe5c188d57

See more details on using hashes here.

Provenance

The following attestation bundles were made for pychemist-0.0.8-py3-none-any.whl:

Publisher: release.yaml on vanraak/pychemist

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page