Skip to main content

A collection of handy tools for data handling, visualization, and reporting.

Project description

forgekit

forgekit is a python3 package designed to simplify data handling, visualization, and reporting. It provides a wide range of functions for cleaning, transforming, visualizing data, and performing machine learning tasks.

Features

  • Display and summarize DataFrames
  • Export and load CSV files
  • Handle missing data, outliers, and invalid values
  • Data scaling and transformation (min-max scaling, standardization, log transformation)
  • Generate interactive and static plots (matplotlib and Plotly)
  • Machine learning utilities (K-Means clustering, feature importance calculation, train-test splitting)

Installation

You can install forgekit by cloning the repository and installing it locally.

  1. Clone the repository:

    git clone https://github.com/yourusername/forgekit.git
    
  2. Navigate to the package directory and install it:

    cd forgekit
    pip3 install .
    
  3. Alternatively, install dependencies directly from requirements.txt:

    pip3 install -r requirements.txt
    

Usage

Here’s an example of how to use forgekit:

import pandas as pd
from forgekit import ForgeKit

# Sample data
data = {
    'Domain': ['example.com', 'example.net'],
    'Price': [10.0, 12.5]
}
df = pd.DataFrame(data)

# Display the DataFrame
ForgeKit.display_dataframe(df)

# Plot the DataFrame
ForgeKit.plot_dataframe(df, kind='bar', title="Domain Prices")

Available Functions

Data Display and Summarization:

  • display_dataframe(): Display a DataFrame with a row limit.
  • summary_stats(): Show summary statistics of a DataFrame.
  • custom_summary(): Show a custom summary of data types, missing values, and statistics.

Data Cleaning:

  • impute_missing_data(): Handle missing values with strategies like mean, median, mode, or a constant value.
  • remove_outliers(): Remove outliers using the IQR method.
  • remove_duplicates(): Remove duplicate rows in the DataFrame.
  • clean_text_columns(): Clean text columns by stripping whitespace and converting to lowercase.

Data Transformation:

  • minmax_scale(): Scale numerical data between 0 and 1.
  • standard_scale(): Standardize numerical data to have zero mean and unit variance.
  • log_transform(): Apply log transformation to reduce skewness.

Data Visualization:

  • plot_dataframe(): Generate static plots using matplotlib.
  • interactive_plot(): Generate interactive plots using Plotly.

Machine Learning Tools:

  • kmeans_clustering(): Perform K-Means clustering on the DataFrame.
  • train_test_split_data(): Split the DataFrame into training and test sets.
  • feature_importance(): Calculate feature importance using a Random Forest classifier.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Feel free to open issues or pull requests if you would like to contribute!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

forgekit-1.0.0.tar.gz (5.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

forgekit-1.0.0-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file forgekit-1.0.0.tar.gz.

File metadata

  • Download URL: forgekit-1.0.0.tar.gz
  • Upload date:
  • Size: 5.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for forgekit-1.0.0.tar.gz
Algorithm Hash digest
SHA256 bb0862d79e6b12a3dfbe2fad36e6ae9571bf114e52e5295a8638dfcd0d93f0a5
MD5 ccc9ecc7180529b0ad182bde7ccf9d3b
BLAKE2b-256 0af48f6bd437bb4b5d69a71d48dedef158bc1076fbddb4dfd469eb72d1e8ea56

See more details on using hashes here.

File details

Details for the file forgekit-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: forgekit-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for forgekit-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9aa3b1abf26390aa0f1bbe6df1fd177e04586355819d3935427c7301160f99a5
MD5 8be44c85846eef5b5bacb408ce31da64
BLAKE2b-256 316443f3a876f5b05dc72caec0c136d5df47500f8ba89ab8c9e491161d739b10

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page