A machine learning package

These details have not been verified by PyPI

Project links

Homepage

Project description

Pygmalion in the greek mythologie is a sculptor that fell in love with one of his creations. In the myth, Aphrodite gives life to Galatea, the sculpture he fell in love with. This package is a python machine learning library that implements models for some common machine learning tasks. Everything that you need to give a mind of their own to inanimate objects.

Installing pygmalion

pygmalion can be installed through pip.

python -m pip install pygmalion

Fast prototyping of models with pygmalion

Architectures for several common machine learning tasks (regression, image classification, machine translation ...) are implemented in this package.

The inputs and outputs of the models are common python objects (such as numpy array and pandas dataframes).

In this section we are going to see how to load a dataset, train a model, display some metrics, and save a model.

>>> import pygmalion as ml
>>> import pygmalion.neural_networks as nn
>>> import pandas as pd
>>> import numpy as np
>>> import matplotlib.pyplot as plt

You can download a dataset and split it with the split function.

>>> ml.datasets.boston_housing("./")
>>> df = pd.read_csv("./boston_housing.csv")
>>> df_train, df_val, df_test = ml.utilities.split(df, weights=(0.8, 0.1, 0.1))

Creating and training a model takes few lines of code.

>>> inputs, target = [c for c in df.columns if c != "medv"], "medv"
>>> model = nn.DenseRegressor(inputs, target, hidden_layers=[32, 32])
>>> x_train, y_train = model.data_to_tensor(df_train[inputs], df_train[target])
>>> x_val, y_val = model.data_to_tensor(df_val[inputs], df_val[target])
>>> history = model.fit((x_train, y_train), (x_val, y_val), n_steps=1000, patience=100, learning_rate=1.0E-3)

Some usefull metrics can easily be evaluated.

For a regressor model, the available metrics are MSE, RMSE, R2, and the correlation between target and prediction can be visualized with the plot_fitting function.

>>> f, ax = plt.subplots()
>>> ml.utilities.plot_fitting(df_train[target], model.predict(df_train), ax=ax, label="training")
>>> ml.utilities.plot_fitting(df_val[target], model.predict(df_val), ax=ax, label="validation")
>>> ml.utilities.plot_fitting(df_test[target], model.predict(df_test), ax=ax, label="testing", color="C3")
>>> R2 = ml.utilities.R2(model.predict(df_test), df_test[target])
>>> ax.set_title(f"RÂ²={R2:.3g}")
>>> ax.set_xlabel("target")
>>> ax.set_ylabel("predicted")
>>> plt.show()

pairplot

For a classifier model you can evaluate the accuracy, and display the confusion matrix.

>>> ml.datasets.iris("./")
>>> df = pd.read_csv("./iris.csv")
>>> df_train, df_val, df_test = ml.utilities.split(df, weights=(0.7, 0.2, 0.1))
>>> inputs, target = [c for c in df.columns if c != "variety"], "variety"
>>> classes = df[target].unique()
>>> model = nn.DenseClassifier(inputs, target, classes, hidden_layers=[8, 8, 8])
>>> train_data = model.data_to_tensor(df_train[inputs], df_train[target])
>>> val_data = model.data_to_tensor(df_train[inputs], df_train[target])
>>> model.fit(train_data, val_data, n_steps=1000, patience=100)
>>> f, ax = plt.subplots()
>>> y_test, y_pred = df_test[target], model.predict(df_test)
>>> ml.utilities.plot_matrix(ml.utilities.confusion_matrix(y_test, y_pred, classes=classes), ax=ax, cmap="Greens", write_values=True, format=".2%")
>>> acc = ml.utilities.accuracy(y_pred, y_test)
>>> ax.set_title(f"Accuracy: {acc:.2%}")
>>> plt.tight_layout()
>>> plt.show()

confusion matrix

All the models can be saved directly to the disk with the save method. A model saved on the disk can then be loaded back with the load_model function.

>>> model.save("./model.pth")
>>> model = ml.utilities.load_model("./model.pth")

Implemented models

For examples of model training see the samples folder in the github page.

Neural networks

The neural networks are implemented in pytorch under the hood. Each model is a pytorch Module. The fit method of neural networks returns a train loss, validation loss, gradient scale history that can be ploted with the plot_loss functions.

>>> train_losses, val_losses, grad, best_step = model.fit(...)
>>> ml.utilities.plot_losses(train_losses, val_losses, grad, best_step)

loss history

DenseRegressor

A DenseRegressor (or multi layer perceptron regressor) predicts a scalar value given an input of several variables. An example of DenseRegressor training was demonstrated in a previous section.

DenseClassifier

A DenseClassifier (or multi layer perceptron classifier) predicts a str class value given an input of several variables. An example of DenseClassifier training was presented in a previous section.

ImageClassifier

An ImageClassifier predicts a str class given as input an image. Here below the predictions of a model trained on the fashion-MNIST dataset.

fashion-MNIST predictions

It is implemented as a Convolutional Neural Network similar to ResNet.

ImageSegmenter

An ImageSegmenter predicts a class for each pixel of the input image (semantic segmentation). Here below the predictions of a model trained on the cityscape dataset.

segmented_cityscapes

It is implemented as a Convolutional Neural Network similar to U-Net.

ImageObjectDetector

An ImageObjectDetector predict the presence and box coordinates of objects in an image. This model is an implementation of the YOLO convolutional neural network. Here below the prediction of a model trained to detect circles and squares in images generated on the fly:

segmented_cityscapes

TextClassifier

A TextClassifier classifies text inputs. It is implemented as a transformer encoder. Here below some prediction of the model on a sentiment analysis task where tweets were to be classified as positive, neutral or negative.

@JetBlue Thanks! Her flight leaves at 2 but she's arriving to the airport early. Wedding is in VT in Sept. Grateful you fly to BTV!! :)
>>> positive

@united how are conditions in BOS today? I'm in UA994. Everything appears to be in time but I wanted to check.
>>> neutral

@AmericanAir it's been almost 3 days and it's still frozen. Thanks doll ðŸ˜˜ðŸ˜‘
>>> negative

TextTranslator

A TextTranslator model predicts a string outputs for a string inputs. It is implemented as an encoder/decoder transformer. Here below some predictions of a model trained to translate arabic numerals to roman numerals.

402 >>> ['CDII']
863 >>> ['DCCCLXIII']
1275 >>> ['MCCLXXV']
798 >>> ['DCCXCVIII']
1532 >>> ['MDXXXII']
223 >>> ['CCXXIII']
90 >>> ['XC']
1261 >>> ['MCCLXI']
1032 >>> ['MXXXII']
432 >>> ['CDXXXII']

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.1.8

Nov 16, 2023

0.1.7

Aug 12, 2023

This version

0.1.6

Aug 5, 2023

0.1.5

May 11, 2023

0.1.4

Apr 11, 2023

0.1.3

Apr 11, 2023

0.1.2

Mar 31, 2023

0.1.1

Mar 31, 2023

0.1.0

Mar 31, 2023

0.0.11

Jul 25, 2021

0.0.10

Jun 10, 2021

0.0.9

Jun 10, 2021

0.0.8

Jun 6, 2021

0.0.7

Feb 6, 2021

0.0.6

Feb 5, 2021

0.0.5

Feb 5, 2021

0.0.4

Jan 17, 2021

0.0.3

Jan 13, 2021

0.0.2

Jan 13, 2021

0.0.1

Jan 13, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pygmalion-0.1.6.tar.gz (65.5 kB view hashes)

Uploaded Aug 5, 2023 Source

Built Distribution

pygmalion-0.1.6-py3-none-any.whl (99.0 kB view hashes)

Uploaded Aug 5, 2023 Python 3

Hashes for pygmalion-0.1.6.tar.gz

Hashes for pygmalion-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`f00ed4837971619e581db8f75f9e25c0a4c654e0fff1e02f16e2515f99d72680`
MD5	`f036817df10fc03b4bc1d36ce9d7c254`
BLAKE2b-256	`536f3dc65363e1435cd72b806a70781452ab27647acda9bdc9463e26e6f69a63`

Hashes for pygmalion-0.1.6-py3-none-any.whl

Hashes for pygmalion-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f0a584c76ae92f2710c1d0ca3417b5525abdf3b869001f3654da3ef291f3669b`
MD5	`45a9c2d75340e93ae7835b7ff215f1e2`
BLAKE2b-256	`478f65561f92e8ce90be296cb49d98e79c2221ba8ced0b23c19fdb02f9bc5685`