A Deep Learning Library with focus on fast experimentation.

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Project description

Axon

This is not a deep learning framework. It's an argument.

🚀 Philosophy: The Unapologetic Truth of Data Transformations

Axon challenges the status quo of deep learning frameworks. We believe in a world where experimentation is effortless, extending functionality is trivial, and maintenance is a breeze. Our core philosophy revolves around:

Neural Networks as Pure Data Pipelines: Forget monolithic blocks. A neural network is nothing more than a relentless, sequential pipeline of data transformations. Axon embraces this brutal simplicity, allowing you to compose and manipulate these transformations with unprecedented clarity and control.
Unrivaled Hackability: Dive deep, inspect, and mutate anything at runtime. Axon treats you like a hacker, not just a user. We expose the guts because you deserve to see them.
Fast & Easy Experimentation: Rapidly iterate on ideas without boilerplate. Focus on your unique logic, not framework intricacies. Your time is too valuable for abstraction layers that fight you.
Effortless Extensibility & Maintainability: The modular design ensures that adding new features or maintaining existing ones is straightforward and intuitive. No more archaeological digs through opaque codebases.
AI Accelerator Startup Advantage: For those building AI accelerators, Axon significantly reduces the backend operational overhead. Achieve PyTorch-like results with drastically less effort in writing custom ops. Stop writing thousands of lines of backend code when you could be innovating.
Laziness is Superior: Embrace lazy evaluation for computations. This paradigm allows for powerful optimizations, efficient resource utilization, and a more intuitive way to define complex computational graphs. Why compute what you don't need, when you don't need it?

Axon achieves this through a clear, functional approach to deep learning components:

The Model (Pipeline): Compose your model's layers into a flexible Pipeline object, managing learnable parameters and enabling dynamic architecture changes. It's your data's journey, laid bare.
The Experiment (Experiment): A dedicated tool for tracking hyperparameters, logging metrics, and managing artifacts, providing a clear overview of your experimental runs. No more guessing what worked and why.

This design promotes clean, composable systems over monolithic structures, giving you full control.

The Axon Way in 4 Steps

See how Axon's design leads to a cleaner, faster, and more hackable workflow.

Step 1: Define Hyperparameters & Initialize Experiment

import axon
from axon.experiment import Experiment
import uuid

# Define your experiment's hyperparameters
hyperparams = {
    "in_channels": 3,
    "base_channels": 64,
    "block_type": 'ResBlock',
    "learning_rate": 1e-3,
    "batch_size": 128,
    "epochs": 10,
    "optimizer": 'Adam'
}

# Initialize an experiment to track your run
exp = Experiment(id=str(uuid.uuid4()), name="MyFirstAxonExperiment")
for key, value in hyperparams.items():
    exp.log_hyperparameter(key, value)

Step 2: Build the Model (`Pipeline`)

import axon.metrics as metrics
from axon.nn import Pipeline, Conv2d, Linear

# Layers are composed using the >> operator or passed to the Pipeline constructor
model = Pipeline(
    Conv2d(hyperparams["in_channels"], hyperparams["base_channels"], kernel_size=7)) 
    >> Linear(hyperparams["base_channels"], 10)
)

# You can reset parameters at any time to start from scratch
# model.reset_parameters()

Step 3: Define YOUR Logic (`training_step`)

def my_training_step(pipeline_model, batch):
    x, y_true = batch
    y_pred = pipeline_model(x)
    loss = metrics.bce(y_pred, y_true)
    return {"loss": loss, "y_pred": y_pred}

Step 4: Run the Training Loop & Log with Experiment

# Assuming my_train_data and my_eval_data are defined
# Assuming an optimizer is initialized, e.g., optimizer = axon.optim.Adam(model.params, lr=hyperparams["learning_rate"])

for epoch in range(hyperparams["epochs"]):
    total_loss = 0
    for i, batch in enumerate(my_train_data):
        # Forward pass, backward pass, and optimization
        results = my_training_step(model, batch)
        loss = results["loss"]
        # ... (backward pass and optimizer step would go here) ...

        total_loss += loss.item()
        exp.log_metric("train_loss", loss.item(), step=epoch * len(my_train_data) + i)

    avg_train_loss = total_loss / len(my_train_data)
    exp.log_metric("avg_epoch_train_loss", avg_train_loss, step=epoch)

    # Evaluation loop (simplified)
    eval_loss = 0
    for batch in my_eval_data:
        results = my_training_step(model, batch) # Using same step for simplicity
        eval_loss += results["loss"].item()
    avg_eval_loss = eval_loss / len(my_eval_data)
    exp.log_metric("avg_epoch_eval_loss", avg_eval_loss, step=epoch)

    print(f"Epoch {epoch+1}: Train Loss = {avg_train_loss:.4f}, Eval Loss = {avg_eval_loss:.4f}")

exp.save() # Save experiment results

💣 Hackability Is Not a Feature. It's the Point.

Most libraries hide their internals like it’s some sacred artifact. Axon doesn’t do that. Everything is an object you can poke, inspect, and mutate at runtime. We tear down the walls.

🔍 Access all model parameters with pipeline.params
🧱 Access buffers (non-trainable states) with pipeline.buffers
🧠 Access individual layers by index: my_layer = pipeline[idx]
💥 Inject any arbitrary PyFunc into a pipeline step
👁️‍🗨️ Pure functional >> pipelines — you trace, log, wrap, or fuse
🧬 Plug and play — it’s just Python dicts and functional calls

Frameworks treat you like a user. Axon treats you like a hacker.

Dynamic Architecture: Swap Layers, Reset State, or Keep Going. Your Call.

Axon gives you the power to fundamentally alter your network's structure and state mid-run. This isn't just flexibility; it's a declaration of war on static, rigid architectures.

from axon.nn import Conv2d, Linear, Pipeline
import axon.core.tensor as T

# Assume 'model' is an existing Pipeline
# Assume 'input_tensor' is a Tensor

# Original first layer
print(f"Original first layer: {model[0]}")

# Create a new layer with different parameters
new_first_layer = Conv2d(3, 128, kernel_size=3)

# --- Scenario 1: Swap and reset parameters of the *entire* pipeline ---
# This is for when you want a fresh start after a major architectural change.
print("\n--- Swapping layer and resetting ALL parameters ---")
model[0] = new_first_layer
model.reset_parameters() # Re-initializes ALL layers in the pipeline
print(f"New first layer after full reset: {model[0]}")
# All weights in the model are now re-randomized.

# --- Scenario 2: Swap and *only* reset parameters of the new layer ---
# Ideal for fine-tuning or targeted modifications without disturbing learned weights elsewhere.
print("\n--- Swapping layer without full pipeline reset ---")
another_new_layer = Linear(10, 5) # Assuming this fits the pipeline
original_layer_at_idx = model[1] # Save original to compare
model[1] = another_new_layer
if isinstance(model[1], T.Module): # Check if it's a Module to reset
    model[1].reset_parameters() # Only resets the new layer's parameters
print(f"Layer at index 1 after swap (only new layer reset): {model[1]}")
# Parameters of other layers in the pipeline remain unchanged.

# --- Scenario 3: Freezing layers ---
# Lock down parts of your network. Essential for transfer learning or architectural experiments.
print("\n--- Freezing a layer ---")
print(f"Requires grad for first layer before freeze: {model[0].params[0].requires_grad}")
model[0].freeze() # Freeze the first layer
print(f"Requires grad for first layer after freeze: {model[0].params[0].requires_grad}")
# This layer's parameters will no longer be updated during backpropagation.

You want to freeze weights? Detach the gradient. You want to mutate activations on the fly? Go ahead. Want to track internal outputs? Inject a hook or just override the fn inline. It's your engine. Drive it like a maniac.

🔥 The Vision: A Fully Fused Stack & Lazy Evaluation

Axon's design enables our ultimate goal: bridging the gap between high-level expression and bare-metal performance, powered by lazy evaluation. This isn't just an optimization; it's a declaration of intent.

JIT Compiler for CUDA: The explicit pipeline (>>) is a parsable AST. We will trace and fuse it into high-performance CUDA kernels, leveraging lazy evaluation for optimal execution. This isn't just an optimization; it's a fundamental shift in how we execute deep learning.
Lazy Evaluation for Efficiency: Computations are only performed when their results are actually needed, leading to significant performance gains and reduced memory footprint, especially for complex graphs. Why waste cycles on intermediate results that might never be used?

✅ Core Features & Status

✅ Core Components: Flexible Pipeline for model definition and composition, Experiment for tracking.
✅ Transparent API: Index/slice-based layer access and explicit data flows.
✅ Core Autograd Engine: Tape-based and fully functional.
🔧 JIT Compilation Engine: AST parsing and fusion (In Development)
✅ Built-in Metrics: Accuracy, Precision, F1, etc.
✅ Core Layers, Losses, Optimizers
✅ CUDA Backend: Custom kernels for performance
✅ Lazy Evaluation: Core to the computational graph and optimization strategy.
✅ Runtime Layer Swapping & Flexible Parameter Initialization (reset_parameters()).
✅ Layer Freezing (freeze()).

🚀 Getting Started

git clone https://github.com/yushi2006/axon.git
cd axon
pip install -e .

📄 License

MIT License — free to use, modify, and commercialize with attribution.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

This version

0.1.3

Oct 3, 2025

0.1.2

Oct 3, 2025

0.1.1

Oct 3, 2025

0.1.0

Oct 3, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

axon_dl-0.1.3.tar.gz (135.9 kB view details)

Uploaded Oct 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

axon_dl-0.1.3-py3-none-any.whl (124.8 kB view details)

Uploaded Oct 3, 2025 Python 3

File details

Details for the file axon_dl-0.1.3.tar.gz.

File metadata

Download URL: axon_dl-0.1.3.tar.gz
Upload date: Oct 3, 2025
Size: 135.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for axon_dl-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`7b04b90c9dccb1a872232f59aebb27605e32c9236086d19ee1c7fb0f00892d7d`
MD5	`19e6e27dfa086cc7f003a54d9f0440ff`
BLAKE2b-256	`dc78078b3b2105d00d32899004d9946df44fad232ed2cee92a400f223c997915`

See more details on using hashes here.

File details

Details for the file axon_dl-0.1.3-py3-none-any.whl.

File metadata

Download URL: axon_dl-0.1.3-py3-none-any.whl
Upload date: Oct 3, 2025
Size: 124.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for axon_dl-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a010f68d300c107cd0910e6f0f005b2ec244c864e234f7c6dcc00851d306b256`
MD5	`65d16b969465ca0b953ca6485641142c`
BLAKE2b-256	`32df809a1bd7873890b90fc862b3c265228ed709ede242ed63cb0b6452efde87`

See more details on using hashes here.

axon-dl 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Axon

🚀 Philosophy: The Unapologetic Truth of Data Transformations

The Axon Way in 4 Steps

Step 1: Define Hyperparameters & Initialize Experiment

Step 2: Build the Model (`Pipeline`)

Step 3: Define YOUR Logic (`training_step`)

Step 4: Run the Training Loop & Log with Experiment

💣 Hackability Is Not a Feature. It's the Point.

Dynamic Architecture: Swap Layers, Reset State, or Keep Going. Your Call.

🔥 The Vision: A Fully Fused Stack & Lazy Evaluation

✅ Core Features & Status

🚀 Getting Started

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

axon-dl 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Axon

🚀 Philosophy: The Unapologetic Truth of Data Transformations

The Axon Way in 4 Steps

Step 1: Define Hyperparameters & Initialize Experiment

Step 2: Build the Model (Pipeline)

Step 3: Define YOUR Logic (training_step)

Step 4: Run the Training Loop & Log with Experiment

💣 Hackability Is Not a Feature. It's the Point.

Dynamic Architecture: Swap Layers, Reset State, or Keep Going. Your Call.

🔥 The Vision: A Fully Fused Stack & Lazy Evaluation

✅ Core Features & Status

🚀 Getting Started

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Step 2: Build the Model (`Pipeline`)

Step 3: Define YOUR Logic (`training_step`)