Implementation of very small scale Neural Network from scratch.
This project has been archived.
The maintainers of this project have marked this project as archived. No new releases are expected.
Project description
miniMLP
This repository contains an implementation of a small Multilayer Perceptron (MLP) Neural Network in Python, with the ability to specify the number of hidden layers, the number of neurons in each layer, the activation function, the optimizer to be used, and support for batch training, validation, and different loss functions.
Installation
To install miniMLP, simply run:
```
pip install miniMLP
```
Dependencies
The code requires the following dependencies:
- NumPy: For matrix operations and mathematical computations.
- matplotlib (optional): For visualization of training metrics, like loss curves.
Install them via pip:
```
pip install numpy matplotlib
```
Features
- Customizable Layers: Specify the number of layers, neurons per layer, and activation functions.
- Multiple Optimizers: Choose from various optimizers (SGD, Adam, Momentum, etc.).
- Flexible Loss Functions: Support for several loss functions, including MSE, Cross Entropy, MAE, and more.
- Mini-Batch Training: Efficient training using mini-batches for large datasets.
- Validation Support: Monitor validation performance during training.
- Training History: Track loss over epochs, useful for plotting and debugging.
- Learning Rate Scheduling: Support for dynamic learning rates.
- Early Stopping: Halt training when validation loss stops improving.
- Model Persistence: Save and load network weights easily.
- Evaluation Metrics: Quickly compute loss and accuracy on datasets.
- Gradient Clipping: Prevent exploding gradients during training.
- Model Summary: Display a summary of layer shapes and parameter counts.
- L1/L2 Regularization: Encourage sparse or small weights.
- Weight Initialization: Choose He or Xavier initialization per layer.
- Weight Introspection: Easily get and set model weights.
- Class Prediction Helper:
predict_classesconverts outputs to labels.
Example Usage
Creating an MLP Model
Create an instance of the MLP class by specifying the input size, output size, hidden layers, the number of neurons in each layer,
activation functions, and optimizer:
```python
import numpy as np
from miniMLP.engine import MLP
from miniMLP.activation import ActivationFunction
from miniMLP.optimizers import Adam
from miniMLP.losses import MSE
from miniMLP.layers import Layer
# Example MLP Architecture
layers = [
Layer(input_size=2, output_size=4, activation=ActivationFunction.relu),
Layer(input_size=4, output_size=6, activation=ActivationFunction.relu, init='xavier'),
Layer(input_size=6, output_size=1, activation=ActivationFunction.sigmoid)
]
# Define loss function and optimizer
loss_fn = MSE()
optimizer = Adam(learning_rate=0.001)
# Initialize MLP
mlp = MLP(layers=layers, loss_function=loss_fn, optimizer=optimizer)
# Display model architecture
mlp.summary()
```
Training the MLP
Use the train method to train the MLP, specifying the training data, validation data, learning rate,
number of epochs, batch size, and more.
```python
X_train = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y_train = np.array([[0], [1], [1], [0]])
def scheduler(epoch):
return 0.001 * (0.95 ** epoch)
# Train the model
mlp.train(
X_train, y_train,
epochs=2000, batch_size=4,
validation=False,
lr_scheduler=scheduler,
early_stopping=True,
patience=20,
clip_value=1.0,
)
```
Making Predictions
After training, use the predict method to generate predictions for new data:
```python
X_new = np.array([[1, 1], [0, 0]])
y_pred = mlp.predict(X_new)
print(y_pred)
y_labels = mlp.predict_classes(X_new)
print(y_labels)
```
Evaluating, Saving, and Loading
```python
# Evaluate on a dataset
loss, acc = mlp.evaluate(X_train, y_train)
print("Loss", loss, "Accuracy", acc)
# Save and later load the weights
mlp.save("model.pkl")
mlp.load("model.pkl")
# Access raw weights
weights = mlp.get_weights()
mlp.set_weights(weights)
```
Activation Functions
The following activation functions are supported:
- Sigmoid
- ReLU
- Tanh
- Softmax
- Leaky ReLU
- ELU
- GELU
- Softplus
- SeLU
- PReLU
- Swish
- Gaussian
Optimizers
The following optimizers are supported:
- Adam
- Stochastic Gradient Descent (SGD) with momentum and Nesterov
- RMSProp
- Momentum
- Nesterov Accelerated Gradient (NAG)
Regularizers
Built-in regularization helpers:
- L2Regularizer for weight decay
- L1Regularizer for sparsity
Loss Functions
Supported loss functions include:
- Mean Squared Error (MSE)
- Mean Absolute Error (MAE)
- Cross Entropy (for classification)
- Binary Cross Entropy
- Hinge Loss (used in SVM)
- Huber Loss (robust regression)
Example with Validation
You can also pass validation data to track model performance:
```python
X_val = np.array([[1, 1], [0, 1]])
y_val = np.array([[0], [1]])
history = mlp.train(X_train, y_train, X_val=X_val, Y_val=y_val, epochs=2000, batch_size=4, validation=True)
```
This will output the training loss and validation loss for each epoch.
Plotting Training and Validation Loss
If you track loss history during training, you can plot it using matplotlib:
```python
import matplotlib.pyplot as plt
plt.plot(history['train_loss'], label='Training Loss')
plt.plot(history['val_loss'], label='Validation Loss')
plt.xlabel('Epochs')
plt.ylabel('Loss')
plt.title('Training and Validation Loss')
plt.legend()
plt.show()
```
License
This project is licensed under the MIT License. Feel free to use and modify this code for your own projects.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file minimlp-0.1.0.tar.gz.
File metadata
- Download URL: minimlp-0.1.0.tar.gz
- Upload date:
- Size: 14.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2f2c67e333eb157c3bc4abab953636445497e361b7eb2a9b75b3c3c680e40fbc
|
|
| MD5 |
dd7235d64ebab7ab19825679c8339ff7
|
|
| BLAKE2b-256 |
6ceffc2d4188c1ca73f3e0674b78ea4126f45577f333e40c501de878c5ccf243
|
File details
Details for the file minimlp-0.1.0-py3-none-any.whl.
File metadata
- Download URL: minimlp-0.1.0-py3-none-any.whl
- Upload date:
- Size: 12.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a0e12d226b40f4bfd2a1edf005c866c398a5fc5246effdf5b458f1c81c28576f
|
|
| MD5 |
00c1820f026b0c54825e2a32e7daa213
|
|
| BLAKE2b-256 |
643831f4d0d9339446528c0dbba2de5c3a9dd67d23e89385963f5e3683fa6d39
|