Skip to main content

Automatic Feature Engineering and Selection Linear Regression Model

Project description

# `autofeat` library
### A Linear Regression Model with Automatic Feature Engineering and Selection

This library contains the `AutoFeatRegression` model with a similar interface as the `scikit-learn` models:
- `fit()` function to fit the model parameters
- `predict()` function to predict the target variable given the input
- `score()` function to calculate the goodness of the fit (R^2 value)
- `fit_transform()` and `transform()` functions, which extend the given data by the additional features that were engineered and selected by the model

When calling the `fit()` function, internally the `fit_transform()` function will be called, so if you're planing to call `transform()` on the same data anyways, just call `fit_transform()` right away. `transform()` is mostly useful if you've split your data into training and test data and did not call `fit_transform()` on your whole dataset. The `predict()` and `score()` functions can be either be given data in the format of the original dataframe that was used when calling `fit()`/`fit_transform()` or they can be given an already transformed dataframe.

The [notebook](https://github.com/cod3licious/autofeat/blob/master/autofeat_test.ipynb) contains a simple usage example - try it out! :)

For further details on the model and implementation please refer to the [paper](https://arxiv.org/abs/1901.07329) - and of course if any of this code was helpful for your research, please consider citing it:
```
@article{horn2019autofeat,
author = {Horn, Franziska and Pack, Robert and Rieger, Michael},
title = {The autofeat Python Library for Automatic Feature Engineering and Selection},
year = {2019},
journal = {arXiv preprint arXiv:1901.07329},
}
```

The code is intended for research purposes.

If you have any questions please don't hesitate to send me an [email](mailto:cod3licious@gmail.com) and of course if you should find any bugs or want to contribute other improvements, pull requests are very welcome!

## Installation
You either download the code from here and include the autofeat folder in your `$PYTHONPATH` or install (the library components only) via pip:

$ pip install autofeat

The library requires Python 3! Other dependencies: `numpy`, `pandas`, `scikit-learn`, `sympy`, and `pint`

## Acknowledgments

This project was made possible thanks to support by the [BASF](https://www.basf.com).


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autofeat-0.1.1.tar.gz (14.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autofeat-0.1.1-py3-none-any.whl (15.5 kB view details)

Uploaded Python 3

File details

Details for the file autofeat-0.1.1.tar.gz.

File metadata

  • Download URL: autofeat-0.1.1.tar.gz
  • Upload date:
  • Size: 14.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.9.1 pkginfo/1.4.1 requests/2.12.4 setuptools/39.1.0 requests-toolbelt/0.8.0 tqdm/4.15.0 CPython/2.7.6

File hashes

Hashes for autofeat-0.1.1.tar.gz
Algorithm Hash digest
SHA256 71aa3b7b67957d44de49d0c1ea429c22322d522503e768bbd5b9a3917a1cb215
MD5 0bacaf3cfc77783b71de7f0da2d93e98
BLAKE2b-256 61b51f2c01be4a74dcc1dd5a6b175aa091d0f1f95115429c23f9528ba72b5117

See more details on using hashes here.

File details

Details for the file autofeat-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: autofeat-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 15.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.9.1 pkginfo/1.4.1 requests/2.12.4 setuptools/39.1.0 requests-toolbelt/0.8.0 tqdm/4.15.0 CPython/2.7.6

File hashes

Hashes for autofeat-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4443ed2e97d64ddf66478c7a015a6ac0c417ce3452a68c1bc6642733e9cccfb0
MD5 412a0a03d244b37f5664efe55ffb0153
BLAKE2b-256 cce615bd211710559ad31704a3927e28d33114a51e35339d0cfce607467a89fb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page