NeuralEngine

A framework/library for building and training neural networks in Python. NeuralEngine provides core components for constructing, training and evaluating neural networks, with support for both CPU and GPU (CUDA) acceleration. Designed for extensibility, performance and ease of use, it is suitable for research, prototyping and production.

Features

  • Custom tensor operations (CPU/GPU support via NumPy and optional CuPy)
  • Configurable neural network layers (Linear, Flatten, etc.)
  • Built-in loss functions, metrics and optimizers
  • Model class for easy training and evaluation
  • Device management (CPU/CUDA)
  • Utilities for deep learning workflows
  • Autograd capabilities using dynamic computational graphs
  • Extensible design for custom layers, losses and optimizers
  • Flexible data type configuration and runtime type validation

Installation

Install via pip:

pip install NeuralEngine

Or clone and install locally:

pip install .

Optional CUDA Support

To enable GPU acceleration, install the CUDA extra via pip:

pip install NeuralEngine[cuda]

Or install the optional dependency directly:

pip install cupy-cuda12x

Example Usage

import neuralengine as ne

# Set device (CPU or CUDA)
ne.set_device(ne.Device.CUDA)

# Load your dataset (example: MNIST); load_mnist_data is your own helper
(x_train, y_train), (x_test, y_test) = load_mnist_data()

y_train = ne.one_hot(y_train) # Preprocess if needed
y_test = ne.one_hot(y_test)

train_data = ne.DataLoader(x_train, y_train, batch_size=10000)
test_data = ne.DataLoader(x_test, y_test, batch_size=10000, shuffle=False)

# Build your model
model = ne.Model(
    input_size=(28, 28),
    optimizer=ne.Adam(),
    loss=ne.CrossEntropy(),
    metrics=ne.ClassificationMetrics(),
    dtype=ne.DType.FLOAT16
)
model(
    ne.Flatten(),
    ne.Linear(64, activation=ne.ReLU()),
    ne.Linear(10, activation=ne.Softmax()),
)

# Train and evaluate
model.train(train_data, epochs=30)
result = model.eval(test_data)

Project Structure

neuralengine/
    __init__.py
    config.py
    tensor.py
    utils.py
    nn/
        __init__.py
        dataload.py
        layers.py
        loss.py
        metrics.py
        model.py
        optim.py
setup.py
requirements.txt
pyproject.toml
MANIFEST.in
LICENSE
README.md

Capabilities & Documentation

NeuralEngine offers the following core capabilities:

Device Management

  • ne.set_device(device): Switch between CPU and GPU (CUDA) for computation.
  • Tensor.to(device), Layer.to(device): Move tensors and layers to the specified device.
  • Device enum: ne.Device.CPU, ne.Device.CUDA.
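
A minimal sketch of device selection. The fallback pattern is an assumption: the exact failure mode when the optional CuPy backend is missing is not documented here, and .to() is assumed to return the moved tensor.

import neuralengine as ne

# Prefer the GPU, but fall back to the CPU if CUDA/CuPy is unavailable.
try:
    ne.set_device(ne.Device.CUDA)
except Exception:
    ne.set_device(ne.Device.CPU)

# Move an existing tensor explicitly (assuming .to() returns the moved tensor).
t = ne.tensor([1.0, 2.0, 3.0])
t = t.to(ne.Device.CPU)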

Tensors & Autograd

  • Custom tensor implementation supporting NumPy and CuPy backends.
  • Automatic differentiation (autograd) using dynamic computational graphs for backpropagation.
  • Supports gradients, parameter updates and custom operations.
  • Supported tensor operations:
    • Arithmetic: +, -, *, /, ** (power)
    • Matrix multiplication: @
    • Mathematical: log, sqrt, exp, abs
    • Reductions: sum, max, min, mean, var
    • Shape: transpose, reshape, concatenate, stack, slice, set_slice
    • Elementwise: masked_fill
    • Comparison: ==, !=, >, >=, <, <=
    • Type conversion: dtype (get / set)
    • Utility: zero_grad() (reset gradients)
    • Autograd: backward() (compute gradients for the computation graph)
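
A minimal autograd sketch built from the operations listed above; the .grad attribute name is an assumption about the tensor API:

import neuralengine as ne

x = ne.tensor([[1.0, 2.0], [3.0, 4.0]], requires_grad=True)
w = ne.randn(2, 2, xavier=True)

y = (x @ w) ** 2        # matrix multiplication followed by elementwise power
loss = ne.mean(y)       # reduce to a scalar for backpropagation

loss.backward()         # walk the dynamic graph and accumulate gradients
print(x.grad)           # gradient of the loss with respect to x (assumed attribute)

x.zero_grad()           # reset gradients before the next forward pass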

Layers

  • ne.Linear(out_size, *in_size, bias=True, activation=None): Fully connected layer with optional activation.
  • ne.LSTM(...): Long Short-Term Memory layer with options for attention, bidirectionality and sequence/state output. You can build deep LSTM networks by stacking multiple LSTM layers. When building encoder-decoder models, ensure that the hidden units for the decoder's first layer are set correctly (see the sketch after this list):
    • For a standard LSTM, the hidden state shape at the last timestep is (batch, hidden_units).
    • For a bidirectional LSTM, the hidden and cell state shapes become (batch, hidden_units * 2).
    • If attention is enabled, the hidden state shape is (batch, 2 * hidden_units) (self-attention); if enc_size is provided, it is (batch, hidden_units + enc_size) (cross-attention).
    • If an LSTM layer takes its initial state from a prior layer, set its hidden units to match the output shape of that layer (including adjustments for bidirectionality and attention).
  • ne.MultiplicativeAttention(units, *in_size): Soft attention mechanism for sequence models.
  • ne.MultiHeadAttention(*in_size, num_heads=1): Multi-head attention layer for transformer and sequence models.
  • ne.Embedding(embed_size, *vocab_size, timesteps=None): Embedding layer for mapping indices to dense vectors, with optional positional encoding.
  • ne.LayerNorm(*num_feat, eps=1e-7): Layer normalization for stabilizing training.
  • ne.Dropout(prob=0.5): Dropout regularization for reducing overfitting.
  • ne.Flatten(): Flattens input tensors to 2D (batch, features).
  • ne.Layer.dtype = ne.DType: Get or set the data type of a layer's parameters.
  • ne.Layer.freezed = True/False: Freeze or unfreeze layer parameters during training.
  • All layers inherit from a common base and support extensibility for custom architectures.
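
As referenced in the LSTM notes above, a sizing sketch for an encoder-decoder pair; the keyword name bidirectional is hypothetical, since the full LSTM signature is not listed here:

import neuralengine as ne

H = 64  # encoder hidden units

# Hypothetical keyword name, for illustration only.
encoder = ne.LSTM(H, bidirectional=True)  # final state: (batch, H * 2)

# A decoder initialized from the encoder's final state must match its width,
# per the bidirectional rule above: hidden_units * 2.
decoder = ne.LSTM(H * 2)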

Activations

  • ne.Sigmoid(): Sigmoid activation function.
  • ne.Tanh(): Tanh activation function.
  • ne.ReLU(alpha=0, parametric=False): ReLU, Leaky ReLU, or Parametric ReLU activation.
  • ne.SiLU(beta=False): SiLU (Swish) activation function.
  • ne.Softmax(axis=-1): Softmax activation for classification tasks.
  • All activations inherit from a common base and support extensibility for custom architectures.

Loss Functions

  • ne.CrossEntropy(binary=False, eps=1e-7): Categorical and binary cross-entropy loss for classification tasks.
  • ne.MSE(): Mean Squared Error loss for regression.
  • ne.MAE(): Mean Absolute Error loss for regression.
  • ne.Huber(delta=1.0): Huber loss, robust to outliers.
  • ne.GaussianNLL(eps=1e-7): Gaussian Negative Log Likelihood loss for probabilistic regression.
  • ne.KLDivergence(eps=1e-7): Kullback-Leibler Divergence loss for measuring distribution differences.
  • All loss functions inherit from a common base and support autograd and loss accumulation.

Optimizers

  • ne.Adam(lr=1e-3, betas=(0.9, 0.99), eps=1e-7, reg=0): Adam optimizer (switches to RMSProp if only one beta is provided).
  • ne.SGD(lr=1e-2, reg=0, momentum=0, nesterov=False): Stochastic Gradient Descent with optional momentum and Nesterov acceleration.
  • All optimizers support L2 regularization and gradient reset.

Metrics

  • ne.ClassificationMetrics(num_classes=None, acc=True, prec=False, rec=False, f1=False, eps=1e-7): Computes accuracy, precision, recall and F1 score for classification tasks.
  • ne.RMSE(): Root Mean Squared Error for regression.
  • ne.R2(eps=1e-7): R2 Score for regression.
  • ne.Perplexity(eps=1e-7): Perplexity metric for generative models.
  • All metrics store results as dictionaries, support batch evaluation and metric accumulation.

Model API

  • ne.Model(input_size, optimizer, loss, metrics, dtype): Create a model specifying input size, optimizer, loss function, metrics and data type for model layers.
  • Add layers by calling the model instance: model(layer1, layer2, ...) or using model.build(layer1, layer2, ...).
  • model.train(dataloader, epochs=10, ckpt_interval=None): Train the model on a dataset, with metric/loss reporting and optional per-epoch checkpointing.
  • model.eval(dataloader): Evaluate the model on a dataset; disables gradient tracking (via ne.NoGrad()), prints loss and metrics, and returns the output tensor.
  • Layers are set to training or evaluation mode automatically during train and eval.
  • model.save(filename, weights_only=False): Save the model architecture or model parameters to a file.
  • model.load_params(filepath): Load model parameters from a saved file.
  • ne.Model.load_model(filepath): Load a model from a saved file.
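
A short sketch of the save/load round trip described above (file names, including the extension, are illustrative):

# Continuing from the Example Usage model:
model.save("mnist_model.pkl")                       # full architecture + parameters
model.save("mnist_weights.pkl", weights_only=True)  # parameters only

restored = ne.Model.load_model("mnist_model.pkl")   # rebuild the whole model
model.load_params("mnist_weights.pkl")              # or refill an existing model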

DataLoader

  • ne.DataLoader(x, y, dtype=(None, None), batch_size=32, shuffle=True, random_seed=None, bar_size=30): Create a data loader for batching and shuffling datasets during training and evaluation.
  • Supports lists, tuples, numpy arrays, pandas dataframes and tensors as input data.
  • Provides batching, shuffling and progress bar display during iteration.
  • Extensible for custom data loading strategies.
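
A minimal sketch of constructing and iterating a loader, assuming iteration yields one (x_batch, y_batch) pair per step:

import numpy as np
import neuralengine as ne

x = np.random.rand(1000, 28, 28)
y = np.random.randint(0, 10, size=1000)

loader = ne.DataLoader(x, ne.one_hot(y), batch_size=64, shuffle=True, random_seed=42)

for x_batch, y_batch in loader:   # assumed iteration protocol
    pass  # forward pass, metrics, etc.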

Utilities

  • Tensor creation: tensor(data, requires_grad=False, dtype=None), zeros(*shape), ones(*shape), rand(*shape), randn(*shape, xavier=False), randint(low, high, *shape) and their _like variants for matching shapes.
  • Tensor operations: sum, min, max, argmax, mean, var, log, sqrt, exp, abs, concat, stack, where, clip, array(data, dtype=None) for elementwise, reduction and conversion operations.
  • Encoding: one_hot(labels, num_classes=None) for converting integer labels to one-hot encoding.
  • Autograd management: with NoGrad() context manager to disable gradient tracking in a block. @no_grad decorator to disable gradients for specific functions.
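
A brief sketch of the encoding and autograd utilities above; passing a plain Python list to one_hot, and the ne.no_grad spelling of the decorator, are assumptions:

import neuralengine as ne

labels = ne.one_hot([0, 2, 1], num_classes=3)   # 3x3 one-hot matrix

# Disable gradient tracking for a block of inference-style code.
with ne.NoGrad():
    z = ne.rand(4, 3) @ ne.randn(3, 2)

# Or disable it for an entire function.
@ne.no_grad
def standardize(t):
    return (t - ne.mean(t)) / ne.sqrt(ne.var(t))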

Type Validation

  • metaclass=ne.Typed: Metaclass for enforcing type hints on class methods, properties and subclasses. Set STRICT = True in the class body to enforce strict type checking.
  • @ne.Typed.validate: Decorator for validating function arguments and return values based on type hints.
  • ne.Typed.validation(True|False): Enable or disable type validation globally.
  • Data type enum: ne.DType.FLOAT32, ne.DType.INT8, ne.DType.UINT16, etc.
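
A small sketch of the validation hooks described above, using only the entry points listed:

import neuralengine as ne

class Scaler(metaclass=ne.Typed):
    STRICT = True  # enforce strict type checking for this class

    def scale(self, x: float, factor: float) -> float:
        return x * factor

@ne.Typed.validate
def add(a: int, b: int) -> int:
    return a + b

add(1, 2)        # passes validation
# add(1, "2")    # would be rejected by the type validator

ne.Typed.validation(False)  # disable type validation globally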

Extensibility

NeuralEngine is designed for easy extension and customization:

  • Custom Layers: Create new layers by inheriting from the Layer base class and implementing the forward(self, x) method. You can add parameters, initialization logic and custom computations as needed. All built-in layers follow this pattern, making it simple to add your own.
  • Custom Losses: Define new loss functions by inheriting from the Loss base class and implementing the compute(self, z, y) method. This allows you to integrate any custom loss logic with autograd support.
  • Custom Optimizers: Implement new optimization algorithms by inheriting from the Optimizer base class and providing your own step(self) method. You can manage optimizer state and parameter updates as required.
  • Custom Metrics: Add new metrics by inheriting from the Metric base class and implementing the compute(self, z, y) method. This allows you to track any performance measure with metric accumulation.
  • Custom DataLoaders: Extend the DataLoader class to create specialized data loading strategies. Override the __getitem__ method to define how batches are constructed.
  • All core components are modular and can be replaced or extended for research or production use.
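
A sketch of the custom-component pattern described above; exposing the base classes as ne.Layer and ne.Loss, and the parameter-registration details, are assumptions:

import neuralengine as ne

class Scale(ne.Layer):
    # Multiplies its input by a learnable per-feature gain.
    def __init__(self, num_feat):
        super().__init__()
        self.gain = ne.ones(num_feat)  # learnable parameter (registration may vary)

    def forward(self, x):
        return x * self.gain

class L1Loss(ne.Loss):
    # Mean absolute deviation between predictions z and targets y.
    def compute(self, z, y):
        return ne.mean(ne.abs(z - y))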

Contribution Guide

NeuralEngine is an open-source project, and I warmly welcome all kinds of contributions, whether it's code, documentation, bug reports, feature ideas, or sharing cool examples. If you want to help make NeuralEngine better, you're in the right place!

How to Contribute

  • Fork the repository and create a new branch for your feature, fix, or documentation update.
  • Keep it clean and consistent: Try to follow the existing code style, naming conventions and documentation patterns. Well-commented, readable code is always appreciated!
  • Add tests for new features or bug fixes if you can.
  • Document your changes: Update or add docstrings and README sections so others can easily understand your work.
  • Open a pull request describing what you've changed and why it's awesome.

What Can You Contribute?

  • New layers, loss functions, optimizers, metrics, or utility functions
  • Improvements to existing components
  • Bug fixes and performance tweaks
  • Documentation updates and tutorials
  • Example scripts and notebooks
  • Feature requests, feedback and ideas

Every contribution is reviewed for quality and consistency, but don't worry: if you have questions or need help, just open an issue or start a discussion. I'm happy to help and love seeing new faces in the community!

Thanks for making NeuralEngine better, together! 🚀

License

MIT License with attribution clause. See LICENSE file for details.

Attribution

If you use this project, please credit the original developer: Prajjwal Pratap Shah.

Special thanks to the Autograd Framework From Scratch project by Eduardo Leitão da Cunha Opice Leão, which served as a reference for tensor operations and autograd implementations.
