Skip to main content

Python Regression Analysis.

Project description

PRegress

PRegress is a Python package for regression analysis and data visualization. It provides tools for model fitting, prediction, and various types of plots to help visualize your data particularly for regression analysis.

Features

  • Model fitting and prediction with convenient formula notation
  • Streamlined code for plotting (boxplot, histogram, scatter plot, etc.)
  • Regression analysis diagnostic tools
  • Integration with popular libraries like pandas and statsmodels

Installation

You can install the package using pip:

pip install pregress

Usage

Here are some examples of how to use the key functions in the package.

Importing the Package

To use the functions provided by the package, import it as follows:

import pregress as pr

Getting Data

There are multiple datasets available from PRegress and are easily attained using the get_data function. The datasets currently available are:

  • AirBnb.csv
  • Betas.csv
  • Charges.csv
  • Employment.csv
  • HousePrices.csv
  • HR_retention.csv
  • MarketingToys.csv
  • Sales.csv
  • Top200.csv
  • Twitter.csv
  • Youtube.csv

See Applied Linear Regression for Business Analytics with Python for details regarding these datasets. Sample import example:

import pregress as pr

# Load data from PRegress
df = pr.get_data("Betas.csv")

# Format the data (for later)
df.drop(columns = df.columns[0], inplace=True)

Model Fitting and Prediction

PRegress formula supports formula functionality similar to R. Fit a model with a formula:

# Fit model with formula 
model = pr.fit("SPY ~ .", df)

Summary types are specified using the out argument. Different summaries are available including:

  • statsmodels (default)
  • R
  • STATA
  • simple
  • ANOVA
  • coefficients (coef)
# Generate a model summary
pr.summary(model)

Making Predictions

A Statsmodels object is created by default. From this object, the predict function can be used. Since df is the dataframe used to fit the model, the following lines produce the same result.

# Make predictions
pr.predict(model, df)

# Produce fitted values
pr.predict(model)

General Plotting

Plotting code is streamlined and built on top of Seaborn and MatPlotLib. Samples provided below.

# Generate a boxplot
pr.boxplot("SPY ~ .", df)

# Generate a histogram
pr.hist(df.SPY)

# Multiple histograms
pr.hists("SPY ~ .",data = df)

# Scatter plot
pr.plotXY("MSFT ~ SPY", data = df)

# Multiple Scatter plots
pr.plots("SPY ~ .", data = df)

# Correlation Plot
pr.plot_cor(df)

Required Fixes

Based on current testing, the following fixes are required:

  1. Ensure global scope accessibility for variables.
  2. Adjust summary spacing.
  3. Provide compatibility with scikit-learn.
  4. Implement AI-generated summaries.
  5. Allow for additional plotting customization (using kwargs).
  6. Review and improve diagnostic plots.
  7. Provide support for logistic regression and other GLMs.
  8. Provide support for automatic dummy variable retrieval.
  9. Plots should work without formulas.

Contributing

We welcome contributions to PRegress! If you find a bug or have a feature request, please open an issue on GitHub. You can also contribute by:

  1. Forking the repository
  2. Creating a new branch (git checkout -b feature-branch)
  3. Committing your changes (git commit -am 'Add some feature')
  4. Pushing to the branch (git push origin feature-branch)
  5. Creating a pull request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

We would like to thank all contributors and users of PRegress for their support and feedback. Special thanks to Mintra Putlek!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pregress-1.0.6.tar.gz (197.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pregress-1.0.6-py3-none-any.whl (216.8 kB view details)

Uploaded Python 3

File details

Details for the file pregress-1.0.6.tar.gz.

File metadata

  • Download URL: pregress-1.0.6.tar.gz
  • Upload date:
  • Size: 197.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.3

File hashes

Hashes for pregress-1.0.6.tar.gz
Algorithm Hash digest
SHA256 7527ba9951fd1b27e015e1a1a08cec2ae4230ca47fa76dfb381ad90b336a40e2
MD5 7d51663c60b7f63bd3341f0eafd73f4f
BLAKE2b-256 5d5eaa080b14686ecffcb8e724d74aacee0636a9f702aaa0884594436f0302eb

See more details on using hashes here.

File details

Details for the file pregress-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: pregress-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 216.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.3

File hashes

Hashes for pregress-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 ddae0169956ffd9e7b7d31e4d3a04bcd61a2afc5c54edc3c80289941dbc3e37c
MD5 34b2b71280002731a0912df75e0736b3
BLAKE2b-256 4a6aa80e6b6e9047be542781f25d4aa1435e0917fae19f5a625a3efbf8be739a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page