torchcurves

PyTorch module for differentiable parametric curves with learnable coefficients

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

alexshtf

These details have not been verified by PyPI

Project links

Documentation

Project description

Torchcurves Logo

Python version

A PyTorch module for vectorized and differentiable parametric curves with learnable coefficients, such as a B-spline curve with learnable control points, for KANs, continuous embeddings, and shape constraints.

Use cases

Turns out all the above use cases have one thing in common: they can all be expressed using learnable parametric curves, and that is exactly what this library provides.

Learn

A simple "hello world" example: evaluate three two-dimensional B-spline curves at four points:

import torch
import torchcurves as tc

u = torch.rand(4, 3)        # (B, C)
curve = tc.BSplineCurve(
    num_curves=3,           # C
    dim=2,                  # D
)
y = curve(u)                # (B, C, D)

print(u.shape, "->", y.shape)            # torch.Size([4, 3]) -> torch.Size([4, 3, 2])

For more information:

Documentation site.
Example notebooks for you to try out.

Features

Differentiable: Custom autograd function ensures gradients flow properly through the curve evaluation.
Vectorized: Vectorized operations for efficient batch and multi-curve evaluation.
Efficient numerics: Clenshaw recursion for polynomials, Cox-DeBoor for splines.

Installation

With pip:

pip install torchcurves

With uv:

uv add torchcurves

Use cases

There are examples in the doc/source/examples directory showing how to build models using this library. Here we show some simple code snippets to appreciate the library.

Use case 1 - continuous embeddings

import torchcurves as tc
from torch import nn
import torch


class Net(nn.Module):
    def __init__(self, num_categorical, num_numerical, dim, num_knots=10):
        super().__init__()
        self.cat_emb = nn.Embedding(num_categorical, dim)
        self.num_emb = tc.BSplineCurve(num_numerical, dim, knots_config=num_knots)
        self.embedding_based_model = MySuperDuperModel()  # placeholder for your encoder model

    def forward(self, x_categorical, x_numerical):
        embeddings = torch.cat([
            self.cat_emb(x_categorical),
            self.num_emb(x_numerical)
        ], dim=-2)
        return self.embedding_based_model(embeddings)

MySuperDuperModel is a placeholder for your downstream architecture.

Use case 2 - monotone functions

Working on online advertising, and want to model the probability of winning an ad auction given the bid? We know higher bids must result in a higher win probability, so we need a monotone function. Turns out B-splines are monotone if their coefficient vectors are monotone. Want an increasing function? Ensure the spline coefficients are increasing, and the resulting spline will be monotone increasing.

Below is an example with an auction encoder that encodes the auction into a vector, we then transform it to an increasing vector, and use it as the coefficient vector for a B-spline curve.

import torch
from torch import nn
import torchcurves.functional as tcf


class AuctionWinModel(nn.Module):
    def __init__(self, num_auction_features, num_bid_coefficients):
        super().__init__()
        self.auction_encoder = make_auction_encoder(  # placeholder: an MLP, a transformer, etc.
            input_features=num_auction_features,
            output_features=num_bid_coefficients,
        )
        self.register_buffer(
            "spline_knots",
            tcf.uniform_augmented_knots(
                n_control_points=num_bid_coefficients,
                degree=3,
                k_min=0,
                k_max=1,
            ),
        )

    def forward(self, auction_features, bids):
        # map auction features to increasing spline coefficients
        spline_coeffs = self._make_increasing(self.auction_encoder(auction_features))

        # map bids to [0, 1] using the arctan (or any other) normalization
        mapped_bid = tcf.arctan(bids)

        # evaluate the spline at the mapped bids, treating each
        # mini-batch sample as a separate curve
        return tcf.bspline_curves(
            mapped_bid.unsqueeze(0),     # 1 x B (B curves in 1 dimension)
            spline_coeffs.unsqueeze(-1), # B x C x 1 (B curves with C coefs in 1 dimension)
            self.spline_knots,
            degree=3
        )

    def _make_increasing(self, x):
        # transform a mini-batch of vectors to a mini-batch of increasing vectors
        initial = x[..., :1]
        increments = nn.functional.softplus(x[..., 1:])
        concatenated = torch.concat((initial, increments), dim=-1)
        return torch.cumsum(concatenated, dim=-1)

make_auction_encoder is a placeholder for your encoder architecture.

Now we can train the model to predict the probability of winning auctions given auction features and bid:

import torch.nn.functional as F

for auction_features, bids, win_labels in train_loader:
    win_logits = model(auction_features, bids)
    loss = F.binary_cross_entropy_with_logits(  # or any loss we desire
        win_logits,
        win_labels
    )

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

Use case 3 - Kolmogorov-Arnold networks

A KAN [1] based on the B-spline basis, along the lines of the original paper:

import torchcurves as tc
from torch import nn

input_dim = 2
intermediate_dim = 5
num_control_points = 10

kan = nn.Sequential(
    # layer 1
    tc.BSplineCurve(input_dim, intermediate_dim, knots_config=num_control_points),
    tc.Sum(dim=-2),
    # layer 2
    tc.BSplineCurve(intermediate_dim, intermediate_dim, knots_config=num_control_points),
    tc.Sum(dim=-2),
    # layer 3
    tc.BSplineCurve(intermediate_dim, 1, knots_config=num_control_points),
    tc.Sum(dim=-2),
)

Yes, we know the original KAN paper used a different curve parametrization, B-spline + arcsinh, but the whole point of this repo is showing that KAN activations can be parametrized in arbitrary ways.

For example, here is a KAN based on Legendre polynomials of degree 5:

import torchcurves as tc
from torch import nn

input_dim = 2
intermediate_dim = 5
degree = 5

kan = nn.Sequential(
    # layer 1
    tc.LegendreCurve(input_dim, intermediate_dim, degree=degree),
    tc.Sum(dim=-2),
    # layer 2
    tc.LegendreCurve(intermediate_dim, intermediate_dim, degree=degree),
    tc.Sum(dim=-2),
    # layer 3
    tc.LegendreCurve(intermediate_dim, 1, degree=degree),
    tc.Sum(dim=-2),
)

Since KANs are the primary use case for the tc.Sum() layer, we can omit the dim=-2 argument, but it is provided here for clarity.

Advanced features

The curves we provide here typically rely on their inputs to lie in a compact interval, typically [-1, 1]. Arbitrary inputs need to be normalized to this interval. We provide two simple out-of-the-box normalization strategies described below.

Rational scaling

This is the default strategy — this strategy computes

x \to \frac{x}{\sqrt{s^2 + x^2}},

and is based on the paper

Wang, Z.Q. and Guo, B.Y., 2004. Modified Legendre rational spectral method for the whole line. Journal of Computational Mathematics, pp.457-474.

In Python it looks like this:

tc.BSplineCurve(num_curves, curve_dim, normalize_fn='rational', normalization_scale=s)

Arctan scaling

This strategy computes

x \to \frac{2}{\pi} \arctan(x / s).

This kind of scaling function, up to constants, is the CDF of the Cauchy distribution. It is useful when our inputs are assumed to be heavy tailed.

In Python it looks like this:

tc.BSplineCurve(num_curves, curve_dim, normalize_fn='arctan', normalization_scale=s)

Clamping

The inputs are simply clipped to [-1, 1] after scaling, i.e.

x \to \max(\min(1, x / s), -1)

In Python it looks like this:

tc.BSplineCurve(num_curves, curve_dim, normalize_fn='clamp', normalization_scale=s)

Custom normalization

Provide a custom function that maps its input to the designated range after scaling. Example:

def erf_clamp(x: Tensor, scale: float = 1, out_min: float = -1, out_max: float = 1) -> torch.Tensor:
    mapped = torch.special.erf(x / scale)
    return ((mapped + 1) * (out_max - out_min)) / 2 + out_min

tc.BSplineCurve(num_curves, curve_dim, normalize_fn=erf_clamp, normalization_scale=s)

Gradient checkpointing for Legendre curves

For large degrees, the backward pass can be memory-intensive. Use checkpoint_segments to trade compute for memory. Larger values create more segments (lower memory, higher compute). Set to None to disable. Checkpointing is applied only when gradients are enabled.

# Functional API
tc.functional.legendre_curves(x, coeffs, checkpoint_segments=4)

# Module API
tc.LegendreCurve(num_curves, curve_dim, degree=degree, checkpoint_segments=4)

Example: B-spline KAN with clamping

A KAN based on a clamped B-spline basis with the default scale of $s=1$:

import torchcurves as tc
from torch import nn

input_dim = 2
intermediate_dim = 5
num_control_points = 10

config = dict(knots_config=num_control_points, normalize_fn='clamp')
spline_kan = nn.Sequential(
    # layer 1
    tc.BSplineCurve(input_dim, intermediate_dim, **config),
    tc.Sum(),
    # layer 2
    tc.BSplineCurve(intermediate_dim, intermediate_dim, **config),
    tc.Sum(),
    # layer 3
    tc.BSplineCurve(intermediate_dim, 1, **config),
    tc.Sum(),
)

Legendre KAN with clamping

import torchcurves as tc
from torch import nn

input_dim = 2
intermediate_dim = 5
degree = 5

config = dict(degree=degree, normalize_fn="clamp")
kan = nn.Sequential(
    # layer 1
    tc.LegendreCurve(input_dim, intermediate_dim, **config),
    tc.Sum(),
    # layer 2
    tc.LegendreCurve(intermediate_dim, intermediate_dim, **config),
    tc.Sum(),
    # layer 3
    tc.LegendreCurve(intermediate_dim, 1, **config),
    tc.Sum(),
)

Development

Development Installation

Using uv (recommended):

# Clone the repository
git clone https://github.com/alexshtf/torchcurves.git
cd torchcurves

# Create virtual environment and install
uv venv
uv sync --all-groups

Running Tests

# Run all tests
uv run pytest

# Run with coverage
uv run pytest --cov=torchcurves

# Run specific test file
uv run pytest tests/test_bspline.py -v

Performance Benchmarks

This project includes opt-in performance benchmarks (forward and backward passes) using pytest-benchmark.

Location: benchmarks/

Run benchmarks:

# Run all benchmarks
uv run pytest benchmarks -q

# Or select only perf-marked tests if you mix them into tests/
uv run pytest -m perf -q

CUDA timing notes: We synchronize before/after timed regions for accurate GPU timings.

Compare runs and fail CI on regressions:

# Save a baseline
uv run pytest benchmarks --benchmark-save=legendre_baseline

# Compare current run to baseline (fail if mean slower by 10% or more)
uv run pytest benchmarks --benchmark-compare --benchmark-compare-fail=mean:10%

Export results:

uv run pytest benchmarks --benchmark-json=bench.json

Building the docs

# Prepare API docs
cd doc
make html

Citation

If you use this package in your research, please cite:

@software{torchcurves,
  author = {Shtoff, Alex},
  title = {torchcurves: Differentiable Parametric Curves in PyTorch},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/alexshtf/torchcurves}
}

Related software

Several well-maintained PyTorch libraries use splines in practice. They mostly target interpolation/resampling or geometric warping rather than providing a generic, drop-in learnable parametric curve layer.

ND interpolation and resampling

torch-interpol (also on PyPI) implements high-order spline interpolation for ND tensors (e.g., 2D/3D images), with TorchScript acceleration and explicit forward/backward implementations. It is primarily designed for resampling under a sampling grid / deformation-field workflows, including dimension-specific interpolation orders and boundary handling (bound). Best suited for resampling tensor data on fixed grids.
xitorch – Interp1D (repo: xitorch/xitorch) provides differentiable 1D interpolation including cubic splines (method="cspline") for non-uniform sample locations with configurable boundary conditions and extrapolation options. This is an interpolation primitive: you provide (x, y) samples and query at xq. Designed as a functional primitive for data interpolation.

Learnable continuous fields via grids

torch-cubic-spline-grids (also on PyPI) provides learnable, continuous parametrisations of 1–4D spaces using uniform grids whose coordinate system spans [0, 1] along each dimension. It supports both cubic B-spline grids (C2, not interpolating) and cubic Catmull–Rom grids (C1, interpolating), which are well suited to learning smooth spatial/temporal fields (e.g., deformation fields). Targets dense continuous fields rather than curve trajectories.

Thin-plate / polyharmonic spline warping

torch-tps (also on PyPI) implements generalized polyharmonic spline interpolation (thin-plate splines in 2D) for learning smooth mappings between Euclidean spaces from control point correspondences, with configurable spline order and regularization. Specializes in spatial warping and point-set registration.
Kornia includes TPS utilities such as get_tps_transform and warp_image_tps (see kornia.geometry.transform docs) as part of a larger differentiable computer vision and geometry toolkit, mainly targeting point/image warping operations. Focuses on image geometry transforms.

References

[1]: Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljacic, Thomas Y. Hou, Max Tegmark. "KAN: Kolmogorov–Arnold Networks." ICLR (2025).
[2]: Juergen Schmidhuber. "Learning to control fast-weight memories: An alternative to dynamic recurrent networks." Neural Computation, 4(1), pp.131-139. (1992)
[3]: Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "Attention is all you need." Advances in neural information processing systems 30 (2017).
[4]: Alex Shtoff, Elie Abboud, Rotem Stram, and Oren Somekh. "Function Basis Encoding of Numerical Features in Factorization Machines." Transactions on Machine Learning Research.
[5]: Rügamer, David. "Scalable Higher-Order Tensor Product Spline Models." In International Conference on Artificial Intelligence and Statistics, pp. 1-9. PMLR, 2024.
[6]: Steffen Rendle. "Factorization machines." In 2010 IEEE International conference on data mining, pp. 995-1000. IEEE, 2010.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

alexshtf

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

0.3.1

Apr 24, 2026

0.3.0

Apr 6, 2026

0.2.3

Apr 5, 2026

This version

0.2.2

Apr 5, 2026

0.2.1

Nov 21, 2025

0.2.0

Nov 8, 2025

0.1.1

Sep 27, 2025

0.1.0

Sep 27, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torchcurves-0.2.2.tar.gz (2.5 MB view details)

Uploaded Apr 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

torchcurves-0.2.2-py3-none-any.whl (23.6 kB view details)

Uploaded Apr 5, 2026 Python 3

File details

Details for the file torchcurves-0.2.2.tar.gz.

File metadata

Download URL: torchcurves-0.2.2.tar.gz
Upload date: Apr 5, 2026
Size: 2.5 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for torchcurves-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`72e328f8325aa00491ec5ea6ff05cb53ad4024a2d29fbb73e3589679a1fd3f8b`
MD5	`7937e2529129051586b21640c96a28cf`
BLAKE2b-256	`e43c3d2b109b6d8ad7a0e5283de49b9f80fb498c448f0f0c5f6224438a594537`

See more details on using hashes here.

Provenance

The following attestation bundles were made for torchcurves-0.2.2.tar.gz:

Publisher: release.yml on alexshtf/torchcurves

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: torchcurves-0.2.2.tar.gz
- Subject digest: 72e328f8325aa00491ec5ea6ff05cb53ad4024a2d29fbb73e3589679a1fd3f8b
- Sigstore transparency entry: 1239197859
- Sigstore integration time: Apr 5, 2026
Source repository:
- Permalink: alexshtf/torchcurves@705ca325728e331de812f0ef3b488c4564cbbf64
- Branch / Tag: refs/heads/master
- Owner: https://github.com/alexshtf
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@705ca325728e331de812f0ef3b488c4564cbbf64
- Trigger Event: workflow_dispatch

File details

Details for the file torchcurves-0.2.2-py3-none-any.whl.

File metadata

Download URL: torchcurves-0.2.2-py3-none-any.whl
Upload date: Apr 5, 2026
Size: 23.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for torchcurves-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`47d80a6402939cade13c33d3468ac874df28450d8c2fa0cfcafc7b94da428fbc`
MD5	`4f0e73084206b5cb8b0e5dd61e269dbe`
BLAKE2b-256	`680efbd3c51fb07d5d2d7e14c9d13fb44047551eb745a38049c9de736c9949ce`

See more details on using hashes here.

Provenance

The following attestation bundles were made for torchcurves-0.2.2-py3-none-any.whl:

Publisher: release.yml on alexshtf/torchcurves

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: torchcurves-0.2.2-py3-none-any.whl
- Subject digest: 47d80a6402939cade13c33d3468ac874df28450d8c2fa0cfcafc7b94da428fbc
- Sigstore transparency entry: 1239197862
- Sigstore integration time: Apr 5, 2026
Source repository:
- Permalink: alexshtf/torchcurves@705ca325728e331de812f0ef3b488c4564cbbf64
- Branch / Tag: refs/heads/master
- Owner: https://github.com/alexshtf
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@705ca325728e331de812f0ef3b488c4564cbbf64
- Trigger Event: workflow_dispatch

torchcurves 0.2.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Learn

Features

Installation

Use cases

Use case 1 - continuous embeddings

Use case 2 - monotone functions

Use case 3 - Kolmogorov-Arnold networks

Advanced features

Rational scaling

Arctan scaling

Clamping

Custom normalization

Gradient checkpointing for Legendre curves

Example: B-spline KAN with clamping

Legendre KAN with clamping

Development

Development Installation

Running Tests

Performance Benchmarks

Building the docs

Citation

Related software

ND interpolation and resampling

Learnable continuous fields via grids

Thin-plate / polyharmonic spline warping

References

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance