Perturbed Saddle-escape Descent (PSD): an optimizer for PyTorch

Project Summary
This repository implements the Perturbed Saddle-escape Descent (PSD) algorithm for escaping saddle points in non-convex optimisation problems. It contains reference NumPy implementations, framework-specific optimisers for PyTorch and TensorFlow, and utilities for reproducing the synthetic experiments reported in the accompanying manuscript.
Features
- Reference implementations of PSD, PSD-Probe and baseline gradient descent variants in pure NumPy.
- Suite of analytic test functions with gradients and Hessians.
- Synthetic data generator producing the tables and figures used in the paper (experiments.py).
- Framework-specific optimisers: PSDTorch, PSDTensorFlow, and a PSDOptimizer/PerturbedAdam package for PyTorch.
- Example training scripts for MNIST and CIFAR-10.
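As an illustration of what an analytic test function with gradient and Hessian looks like, here is a minimal sketch in the same spirit: a quartic with a strict saddle at the origin. The names (`quartic`, `quartic_grad`, `quartic_hess`) are illustrative, not the package's actual registry entries.

```python
import numpy as np

# Illustrative analytic test function (not the package's registry):
# f(x, y) = x^4/4 - x^2/2 + y^2/2 has a strict saddle at the origin
# (Hessian eigenvalues -1 and +1) and minima at (+/-1, 0).
def quartic(p):
    x, y = p
    return x**4 / 4 - x**2 / 2 + y**2 / 2

def quartic_grad(p):
    x, y = p
    return np.array([x**3 - x, y])

def quartic_hess(p):
    x, y = p
    return np.array([[3 * x**2 - 1.0, 0.0],
                     [0.0, 1.0]])
```

Pairing each function with its exact gradient and Hessian in this way is what lets the suite check both first-order convergence and saddle-escape behaviour.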
Technology Stack
The core project depends on the following libraries:
| Library | Purpose |
|---|---|
| numpy | numerical routines for reference implementations |
| torch, torchvision | deep-learning framework and datasets |
| optuna | hyper-parameter search utilities |
| matplotlib | visualisation in notebooks |
Python 3.8 or later is required.
Installation
Install the published optimiser package:
```bash
pip install psd-optimizer
```
Or install the repository in editable mode for development:
```bash
git clone https://github.com/farukalpay/PSD.git
cd PSD
pip install -e ".[dev]"
```
Usage
Using the Reference Algorithms
The core PSD routines and test functions can be imported from the psd package:

```python
import numpy as np
from psd import algorithms, functions

x0 = np.array([1.0, -1.0])
x_star, _ = algorithms.gradient_descent(x0, functions.SEPARABLE_QUARTIC.grad)
```
This structure allows you to experiment with the reference NumPy implementations directly in your projects.
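To convey the core idea behind PSD, here is a generic NumPy sketch of perturbed gradient descent: when the gradient is nearly zero (a possible saddle), inject a small random perturbation and keep descending. This is a simplified sketch of the technique, not the repository's actual implementation; all names and hyperparameters here are illustrative.

```python
import numpy as np

def grad(p):
    # Gradient of f(x, y) = (x^2 - 1)^2 + y^2, which has a strict
    # saddle at the origin and minima at (+/-1, 0).
    x, y = p
    return np.array([4 * x * (x**2 - 1), 2 * y])

def perturbed_gd(p0, grad, lr=0.05, steps=500, radius=1e-2,
                 grad_tol=1e-3, seed=0):
    """Sketch of perturbed gradient descent (hypothetical, not psd's API)."""
    rng = np.random.default_rng(seed)
    p = np.asarray(p0, dtype=float)
    for _ in range(steps):
        g = grad(p)
        if np.linalg.norm(g) < grad_tol:
            # Tiny gradient: possibly at a saddle, so jitter to escape.
            p = p + rng.uniform(-radius, radius, size=p.shape)
        else:
            p = p - lr * g
    return p
```

Started exactly at the saddle `(0, 0)`, plain gradient descent never moves (the gradient is zero there), while the perturbed variant escapes toward one of the minima near `(+/-1, 0)`.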
Generating Synthetic Data
```bash
python experiments.py
```
The command writes CSV summaries to results/ and training curves to data/.
Training with the PyTorch Optimiser
```python
from psd_optimizer import PSDOptimizer

model = ...
opt = PSDOptimizer(model.parameters(), lr=1e-3)

def closure():
    opt.zero_grad()
    output = model(x)
    loss = criterion(output, y)
    loss.backward()
    return loss

opt.step(closure)
```
Example scripts using this API are available in the notebooks/
directory.
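The closure protocol above mirrors the one used by closure-based optimisers in PyTorch (such as LBFGS): the closure re-evaluates the loss so a single `step` can query it more than once. A framework-free toy sketch of why that matters is below; `ToyPerturbedOptimizer` is hypothetical and not the real PSDOptimizer internals.

```python
import random

class ToyPerturbedOptimizer:
    """Toy closure-driven step (illustrative only, not PSDOptimizer).

    The closure recomputes the loss, letting the step compare the loss
    before and after a trial random perturbation of the parameters.
    """

    def __init__(self, params, radius=0.01, seed=0):
        self.params = params          # list of floats, mutated in place
        self.radius = radius
        self.rng = random.Random(seed)

    def step(self, closure):
        base = closure()              # loss at the current parameters
        old = self.params[:]
        # Trial perturbation of every parameter.
        self.params[:] = [p + self.rng.uniform(-self.radius, self.radius)
                          for p in old]
        if closure() > base:          # re-evaluate: perturbation was worse
            self.params[:] = old      # roll it back
        return base
```

A one-step usage example: with `params = [1.0]` and loss `params[0] ** 2`, `step` returns the pre-step loss `1.0` and never leaves the parameters with a higher loss than before.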
Training a Small Language Model
An illustrative example for fine-tuning a compact transformer with
PSDOptimizer is provided in scripts/train_small_language_model.py.
The script downloads a tiny GPT-style model from the Hugging Face Hub and
optimizes it on a short dummy corpus.
Run the example with default settings:
```bash
python scripts/train_small_language_model.py
```
Specify a different pretrained model and number of epochs:
```bash
python scripts/train_small_language_model.py --model distilgpt2 --epochs 5
```
Documentation
Further materials are available:
- notebooks/10_minute_start.ipynb: an interactive notebook showcasing the optimiser.
- docs/section_1_5_extension.md: theoretical notes on extending PSD to stochastic settings.
- notebooks/navigation.ipynb: links to all example notebooks, including advanced_usage.ipynb.
Testing
After installing the repository in editable mode, run the test suite to verify that everything works:
```bash
pytest
```
The current suite is small but helps prevent regressions.
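A regression test in this suite's spirit might check a basic invariant, for example that a single descent step never increases a convex objective. The snippet below is illustrative and uses a local `gd_step` helper rather than the package's API.

```python
# Illustrative pytest-style regression test; gd_step, f, and grad_f are
# local helpers, not part of the psd package.
def gd_step(x, grad, lr):
    # One gradient-descent step: x <- x - lr * grad(x).
    return [xi - lr * gi for xi, gi in zip(x, grad(x))]

def f(x):
    # Convex quadratic f(x) = sum_i x_i^2.
    return sum(xi * xi for xi in x)

def grad_f(x):
    return [2.0 * xi for xi in x]

def test_descent_step_decreases_quadratic():
    x0 = [1.0, -1.0]
    x1 = gd_step(x0, grad_f, lr=0.1)
    assert f(x1) < f(x0)
```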
Repository Structure
```
psd/                 # Reference implementations and framework-specific optimisers
  algorithms.py      # PSD and baseline algorithms
  functions.py       # Analytic test functions and registry
psd_optimizer/       # PyTorch optimiser package
experiments.py       # Synthetic data generation
```
Contributing
Contributions are welcome! Please open an issue or pull request on GitHub
and see CONTRIBUTING.md for guidelines. By participating you agree to
abide by the CODE_OF_CONDUCT.md.
License
This project is released under the MIT License. See LICENSE for details.
Download files
Download the file for your platform.
Source Distribution
Built Distribution
File details
Details for the file psd_optimizer-0.1.1.tar.gz.
File metadata
- Download URL: psd_optimizer-0.1.1.tar.gz
- Upload date:
- Size: 23.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 2c57dd79adfb9b7703239ae8839846de60d45642200255125b5858e56adb7bc8 |
| MD5 | 7ea197597f5715d5931f1f0a6b928a70 |
| BLAKE2b-256 | 78c265fa0a3392a3ddbbf6316675ea2c678b6a8d96a94de271e7f575a334709e |
File details
Details for the file psd_optimizer-0.1.1-py3-none-any.whl.
File metadata
- Download URL: psd_optimizer-0.1.1-py3-none-any.whl
- Upload date:
- Size: 24.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 2d59ee0288146c6d54d25edc504478f1ca8b7ffb3ad0141a6c3ed220545df2fc |
| MD5 | 87c6a57c11dc3ac4206ea67485539eea |
| BLAKE2b-256 | 3ec77cbd934655ddd8bd869173dba4927303c9ef0a5cab8edba5bba0b86ee6db |