lecrapaud

Framework for machine and deep learning, with regression, classification and time series analysis

These details have not been verified by PyPI

Project description

Welcome to LeCrapaud

An all-in-one machine learning framework

🚀 Introduction

LeCrapaud is a high-level Python library for end-to-end machine learning workflows on tabular data, with a focus on financial and stock datasets. It provides a simple API to handle feature engineering, model selection, training, and prediction, all in a reproducible and modular way.

✨ Key Features

🧩 Modular pipeline: Feature engineering, preprocessing, selection, and modeling as independent steps
🤖 Automated model selection and hyperparameter optimization
📊 Easy integration with pandas DataFrames
🔬 Supports both regression and classification tasks
🛠️ Simple API for both full pipeline and step-by-step usage
📦 Ready for production and research workflows

⚡ Quick Start

Install the package

pip install lecrapaud

How it works

This package provides a high-level API to manage experiments for feature engineering, model selection, and prediction on tabular data (e.g. stock data).

Typical workflow

from lecrapaud import LeCrapaud

# 1. Create the main app
app = LeCrapaud(uri=uri)

# 2. Define your experiment context (see your notebook or api.py for all options)
context = {
    "data": your_dataframe,
    "columns_drop": [...],
    "columns_date": [...],
    # ... other config options
}

# 3. Create an experiment
experiment = app.create_experiment(**context)

# 4. Run the full training pipeline
experiment.train(your_dataframe)

# 5. Make predictions on new data
predictions = experiment.predict(new_data)

Database Configuration (Required)

LeCrapaud requires access to a MySQL database to store experiments and results. You must either:

Pass a valid MySQL URI to the LeCrapaud constructor:

app = LeCrapaud(uri="mysql+pymysql://user:password@host:port/dbname")

OR set the following environment variables before using the package:
- DB_USER, DB_PASSWORD, DB_HOST, DB_PORT, DB_NAME
- Or set DB_URI directly with your full connection string.

If neither is provided, database operations will not work.

Using OpenAI Embeddings (Optional)

If you want to use the columns_pca embedding feature (for advanced feature engineering), you must set the OPENAI_API_KEY environment variable with your OpenAI API key:

export OPENAI_API_KEY=sk-...

If this variable is not set, features relying on OpenAI embeddings will not be available.

Experiment Context Arguments

Below are the main arguments you can pass to create_experiment (or the Experiment class):

Argument	Type	Description	Example/Default
`columns_binary`	list	Columns to treat as binary	`['flag']`
`columns_boolean`	list	Columns to treat as boolean	`['is_active']`
`columns_date`	list	Columns to treat as dates	`['date']`
`columns_drop`	list	Columns to drop during feature engineering	`['col1', 'col2']`
`columns_frequency`	list	Columns to frequency encode	`['category']`
`columns_onehot`	list	Columns to one-hot encode	`['sector']`
`columns_ordinal`	list	Columns to ordinal encode	`['grade']`
`columns_pca`	list	Columns to use for PCA/embeddings (requires `OPENAI_API_KEY` if using OpenAI embeddings)	`['text_col']`
`columns_te_groupby`	list	Columns for target encoding groupby	`['sector']`
`columns_te_target`	list	Columns for target encoding target	`['target']`
`data`	DataFrame	Your main dataset (required for new experiment)	`your_dataframe`
`date_column`	str	Name of the date column	`'date'`
`experiment_name`	str	Name for the training session	`'my_session'`
`group_column`	str	Name of the group column	`'stock_id'`
`max_timesteps`	int	Max timesteps for time series models	`30`
`models_idx`	list	Indices of models to use for model selection	`[0, 1, 2]`
`number_of_trials`	int	Number of trials for hyperparameter optimization	`20`
`perform_crossval`	bool	Whether to perform cross-validation	`True`/`False`
`perform_hyperopt`	bool	Whether to perform hyperparameter optimization	`True`/`False`
`plot`	bool	Whether to plot results	`True`/`False`
`preserve_model`	bool	Whether to preserve the best model	`True`/`False`
`target_clf`	list	List of classification target column indices/names	`[1, 2, 3]`
`target_mclf`	list	Multi-class classification targets (not yet implemented)	`[11]`
`target_numbers`	list	List of regression target column indices/names	`[1, 2, 3]`
`test_size`	int/float	Test set size (count or fraction)	`0.2`
`time_series`	bool	Whether the data is time series	`True`/`False`
`val_size`	int/float	Validation set size (count or fraction)	`0.2`

Note:

Not all arguments are required; defaults may exist for some.
For columns_pca with OpenAI embeddings, you must set the OPENAI_API_KEY environment variable.

Modular usage

You can also use each step independently:

data_eng = experiment.feature_engineering(data)
train, val, test = experiment.preprocess_feature(data_eng)
features = experiment.feature_selection(train)
std_data, reshaped_data = experiment.preprocess_model(train, val, test)
experiment.model_selection(std_data, reshaped_data)

⚠️ Using Alembic in Your Project (Important for Integrators)

If you use Alembic for migrations in your own project and you share the same database with LeCrapaud, you must ensure that Alembic does not attempt to drop or modify LeCrapaud tables (those prefixed with lecrapaud_).

By default, Alembic's autogenerate feature will propose to drop any table that exists in the database but is not present in your project's models. To prevent this, add the following filter to your env.py:

def include_object(object, name, type_, reflected, compare_to):
    if type_ == "table" and name.startswith("lecrapaud_"):
        return False  # Ignore LeCrapaud tables
    return True

context.configure(
    # ... other options ...
    include_object=include_object,
)

This will ensure that Alembic ignores all tables created by LeCrapaud when generating migrations for your own project.

🤝 Contributing

Reminders for Github usage

Creating Github repository

$ brew install gh
$ gh auth login
$ gh repo create

Initializing git and first commit to distant repository

$ git init
$ git add .
$ git commit -m 'first commit'
$ git remote add origin <YOUR_REPO_URL>
$ git push -u origin master

Use conventional commits
https://www.conventionalcommits.org/en/v1.0.0/#summary
Create environment

$ pip install virtualenv
$ python -m venv .venv
$ source .venv/bin/activate

Install dependencies

$ make install

Deactivate virtualenv (if needed)

$ deactivate

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.35.0

May 6, 2026

0.34.0

May 6, 2026

0.33.1

Apr 4, 2026

0.33.0

Mar 31, 2026

0.32.2

Mar 20, 2026

0.32.1

Mar 17, 2026

0.32.0

Mar 12, 2026

0.31.9

Feb 27, 2026

0.31.8

Feb 26, 2026

0.31.7

Feb 25, 2026

0.31.6

Feb 25, 2026

0.31.5

Feb 25, 2026

0.31.4

Feb 24, 2026

0.31.3

Feb 24, 2026

0.31.2

Feb 24, 2026

0.31.1

Feb 24, 2026

0.31.0

Feb 23, 2026

0.30.0

Feb 17, 2026

0.29.0

Feb 17, 2026

0.28.0

Feb 13, 2026

0.27.30

Feb 12, 2026

0.27.29

Feb 11, 2026

0.27.28

Feb 9, 2026

0.27.27

Feb 9, 2026

0.27.26

Jan 29, 2026

0.27.25

Jan 28, 2026

0.27.24

Jan 28, 2026

0.27.23

Jan 28, 2026

0.27.22

Jan 28, 2026

0.27.21

Jan 28, 2026

0.27.20

Jan 28, 2026

0.27.19

Jan 28, 2026

0.27.18

Jan 26, 2026

0.27.17

Jan 26, 2026

0.27.16

Jan 26, 2026

0.27.15

Jan 26, 2026

0.27.14

Jan 25, 2026

0.27.13

Jan 25, 2026

0.27.12

Jan 24, 2026

0.27.11

Jan 24, 2026

0.27.10

Jan 24, 2026

0.27.9

Jan 24, 2026

0.27.8

Jan 24, 2026

0.27.7

Jan 23, 2026

0.27.6

Jan 23, 2026

0.27.4

Jan 22, 2026

0.27.3

Jan 22, 2026

0.27.2

Jan 22, 2026

0.27.1

Jan 22, 2026

0.27.0

Jan 22, 2026

0.26.0

Jan 19, 2026

0.25.2

Jan 19, 2026

0.25.1

Jan 14, 2026

0.25.0

Jan 13, 2026

0.24.5

Jan 13, 2026

0.24.4

Jan 12, 2026

0.24.3

Jan 12, 2026

0.24.2

Jan 8, 2026

0.24.1

Jan 8, 2026

0.24.0

Jan 5, 2026

0.23.3

Dec 20, 2025

0.23.2

Dec 20, 2025

0.23.1

Dec 20, 2025

0.23.0

Dec 17, 2025

0.22.6

Dec 14, 2025

0.22.5

Dec 8, 2025

0.22.4

Dec 7, 2025

0.22.3

Dec 4, 2025

0.22.2

Dec 1, 2025

0.22.1

Nov 27, 2025

0.22.0

Nov 27, 2025

0.21.2

Oct 30, 2025

0.21.1

Oct 30, 2025

0.21.0

Oct 30, 2025

0.20.2

Oct 30, 2025

0.20.1

Oct 28, 2025

0.20.0

Oct 27, 2025

0.19.3

Oct 24, 2025

0.19.2

Sep 24, 2025

0.19.1

Aug 28, 2025

0.19.0

Aug 28, 2025

0.18.10

Aug 28, 2025

0.18.9

Aug 28, 2025

0.18.8

Aug 27, 2025

0.18.7

Aug 27, 2025

0.18.6

Aug 27, 2025

0.18.5

Aug 27, 2025

0.18.4

Aug 27, 2025

0.18.3

Aug 25, 2025

0.18.2

Aug 25, 2025

0.18.1

Aug 25, 2025

0.18.0

Aug 25, 2025

0.17.0

Aug 21, 2025

0.16.7

Aug 19, 2025

0.16.6

Jul 17, 2025

0.16.5

Jul 17, 2025

0.16.4

Jul 16, 2025

0.16.3

Jul 16, 2025

0.16.2

Jul 16, 2025

0.16.1

Jul 16, 2025

0.16.0

Jul 16, 2025

0.15.0

Jul 15, 2025

0.14.8

Jul 4, 2025

0.14.7

Jul 4, 2025

0.14.6

Jul 4, 2025

0.14.5

Jul 1, 2025

0.14.4

Jul 1, 2025

0.14.3

Jul 1, 2025

0.14.2

Jul 1, 2025

0.14.1

Jun 30, 2025

0.14.0

Jun 30, 2025

0.13.1

Jun 26, 2025

This version

0.13.0

Jun 26, 2025

0.12.2

Jun 26, 2025

0.12.1

Jun 26, 2025

0.12.0

Jun 26, 2025

0.11.6

Jun 26, 2025

0.11.5

Jun 26, 2025

0.11.4

Jun 26, 2025

0.11.3

Jun 26, 2025

0.11.2

Jun 26, 2025

0.11.1

Jun 26, 2025

0.11.0

Jun 26, 2025

0.10.2

Jun 26, 2025

0.10.1

Jun 26, 2025

0.10.0

Jun 26, 2025

0.9.4

Jun 25, 2025

0.9.3

Jun 25, 2025

0.9.2

Jun 25, 2025

0.9.1

Jun 25, 2025

0.9.0

Jun 25, 2025

0.8.4

Jun 25, 2025

0.8.3

Jun 25, 2025

0.8.2

Jun 25, 2025

0.8.1

Jun 25, 2025

0.8.0

Jun 25, 2025

0.7.1

Jun 25, 2025

0.7.0

Jun 25, 2025

0.6.2

Jun 25, 2025

0.5.1

Jun 20, 2025

0.5.0

Jun 20, 2025

0.4.2

Jun 19, 2025

0.4.1

Jun 19, 2025

0.3.0

Jun 19, 2025

0.2.1

Jun 19, 2025

0.2.0

Jun 19, 2025

0.1.0

Jun 19, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lecrapaud-0.13.0.tar.gz (75.7 kB view details)

Uploaded Jun 26, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lecrapaud-0.13.0-py3-none-any.whl (90.6 kB view details)

Uploaded Jun 26, 2025 Python 3

File details

Details for the file lecrapaud-0.13.0.tar.gz.

File metadata

Download URL: lecrapaud-0.13.0.tar.gz
Upload date: Jun 26, 2025
Size: 75.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for lecrapaud-0.13.0.tar.gz
Algorithm	Hash digest
SHA256	`a2751c80cc5dd39428f47cd72d0e46d3ddc8eaf51bd7ff45eb14b94a04ff2dd0`
MD5	`3f1b707530fe7c6982df6d178c35c467`
BLAKE2b-256	`87456adb5b5cf41b0ace142e80d1eb8428b0c4600cbec53dc450e04889f42986`

See more details on using hashes here.

File details

Details for the file lecrapaud-0.13.0-py3-none-any.whl.

File metadata

Download URL: lecrapaud-0.13.0-py3-none-any.whl
Upload date: Jun 26, 2025
Size: 90.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for lecrapaud-0.13.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`14368ed1da38ca1e9d93adcc749846ced80a26b940d09e13953975c2279260eb`
MD5	`30b8a25cc78bfee37b57e2c803b32718`
BLAKE2b-256	`86a4c7602b508a808bf604147093b6f4ee6ab9f49750ab77419e6daf9c9c7fe7`

See more details on using hashes here.

lecrapaud 0.13.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Welcome to LeCrapaud

🚀 Introduction

✨ Key Features

⚡ Quick Start

Install the package

How it works

Typical workflow

Database Configuration (Required)

Using OpenAI Embeddings (Optional)

Experiment Context Arguments

Modular usage

⚠️ Using Alembic in Your Project (Important for Integrators)

🤝 Contributing

Reminders for Github usage

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes