A Deep Learning Library with focus on fast experimentation.
Project description
Axon
This is not a deep learning framework. It's an argument.
🚀 Philosophy: The Unapologetic Truth of Data Transformations
Axon challenges the status quo of deep learning frameworks. We believe in a world where experimentation is effortless, extending functionality is trivial, and maintenance is a breeze. Our core philosophy revolves around:
- Neural Networks as Pure Data Pipelines: Forget monolithic blocks. A neural network is nothing more than a relentless, sequential pipeline of data transformations. Axon embraces this brutal simplicity, allowing you to compose and manipulate these transformations with unprecedented clarity and control.
- Unrivaled Hackability: Dive deep, inspect, and mutate anything at runtime. Axon treats you like a hacker, not just a user. We expose the guts because you deserve to see them.
- Fast & Easy Experimentation: Rapidly iterate on ideas without boilerplate. Focus on your unique logic, not framework intricacies. Your time is too valuable for abstraction layers that fight you.
- Effortless Extensibility & Maintainability: The modular design ensures that adding new features or maintaining existing ones is straightforward and intuitive. No more archaeological digs through opaque codebases.
- AI Accelerator Startup Advantage: For those building AI accelerators, Axon significantly reduces the backend operational overhead. Achieve PyTorch-like results with drastically less effort in writing custom ops. Stop writing thousands of lines of backend code when you could be innovating.
- Laziness is Superior: Embrace lazy evaluation for computations. This paradigm allows for powerful optimizations, efficient resource utilization, and a more intuitive way to define complex computational graphs. Why compute what you don't need, when you don't need it?
Axon achieves this through a clear, functional approach to deep learning components:
- The Model (
Pipeline): Compose your model's layers into a flexiblePipelineobject, managing learnable parameters and enabling dynamic architecture changes. It's your data's journey, laid bare. - The Experiment (
Experiment): A dedicated tool for tracking hyperparameters, logging metrics, and managing artifacts, providing a clear overview of your experimental runs. No more guessing what worked and why.
This design promotes clean, composable systems over monolithic structures, giving you full control.
The Axon Way in 4 Steps
See how Axon's design leads to a cleaner, faster, and more hackable workflow.
Step 1: Define Hyperparameters & Initialize Experiment
import axon
from axon.experiment import Experiment
import uuid
# Define your experiment's hyperparameters
hyperparams = {
"in_channels": 3,
"base_channels": 64,
"block_type": 'ResBlock',
"learning_rate": 1e-3,
"batch_size": 128,
"epochs": 10,
"optimizer": 'Adam'
}
# Initialize an experiment to track your run
exp = Experiment(id=str(uuid.uuid4()), name="MyFirstAxonExperiment")
for key, value in hyperparams.items():
exp.log_hyperparameter(key, value)
Step 2: Build the Model (Pipeline)
import axon.metrics as metrics
from axon.nn import Pipeline, Conv2d, Linear
# Layers are composed using the >> operator or passed to the Pipeline constructor
model = Pipeline(
Conv2d(hyperparams["in_channels"], hyperparams["base_channels"], kernel_size=7))
>> Linear(hyperparams["base_channels"], 10)
)
# You can reset parameters at any time to start from scratch
# model.reset_parameters()
Step 3: Define YOUR Logic (training_step)
def my_training_step(pipeline_model, batch):
x, y_true = batch
y_pred = pipeline_model(x)
loss = metrics.bce(y_pred, y_true)
return {"loss": loss, "y_pred": y_pred}
Step 4: Run the Training Loop & Log with Experiment
# Assuming my_train_data and my_eval_data are defined
# Assuming an optimizer is initialized, e.g., optimizer = axon.optim.Adam(model.params, lr=hyperparams["learning_rate"])
for epoch in range(hyperparams["epochs"]):
total_loss = 0
for i, batch in enumerate(my_train_data):
# Forward pass, backward pass, and optimization
results = my_training_step(model, batch)
loss = results["loss"]
# ... (backward pass and optimizer step would go here) ...
total_loss += loss.item()
exp.log_metric("train_loss", loss.item(), step=epoch * len(my_train_data) + i)
avg_train_loss = total_loss / len(my_train_data)
exp.log_metric("avg_epoch_train_loss", avg_train_loss, step=epoch)
# Evaluation loop (simplified)
eval_loss = 0
for batch in my_eval_data:
results = my_training_step(model, batch) # Using same step for simplicity
eval_loss += results["loss"].item()
avg_eval_loss = eval_loss / len(my_eval_data)
exp.log_metric("avg_epoch_eval_loss", avg_eval_loss, step=epoch)
print(f"Epoch {epoch+1}: Train Loss = {avg_train_loss:.4f}, Eval Loss = {avg_eval_loss:.4f}")
exp.save() # Save experiment results
💣 Hackability Is Not a Feature. It's the Point.
Most libraries hide their internals like it’s some sacred artifact. Axon doesn’t do that. Everything is an object you can poke, inspect, and mutate at runtime. We tear down the walls.
- 🔍 Access all model parameters with
pipeline.params - 🧱 Access buffers (non-trainable states) with
pipeline.buffers - 🧠 Access individual layers by index:
my_layer = pipeline[idx] - 💥 Inject any arbitrary PyFunc into a pipeline step
- 👁️🗨️ Pure functional
>>pipelines — you trace, log, wrap, or fuse - 🧬 Plug and play — it’s just Python dicts and functional calls
Frameworks treat you like a user. Axon treats you like a hacker.
Dynamic Architecture: Swap Layers, Reset State, or Keep Going. Your Call.
Axon gives you the power to fundamentally alter your network's structure and state mid-run. This isn't just flexibility; it's a declaration of war on static, rigid architectures.
from axon.nn import Conv2d, Linear, Pipeline
import axon.core.tensor as T
# Assume 'model' is an existing Pipeline
# Assume 'input_tensor' is a Tensor
# Original first layer
print(f"Original first layer: {model[0]}")
# Create a new layer with different parameters
new_first_layer = Conv2d(3, 128, kernel_size=3)
# --- Scenario 1: Swap and reset parameters of the *entire* pipeline ---
# This is for when you want a fresh start after a major architectural change.
print("\n--- Swapping layer and resetting ALL parameters ---")
model[0] = new_first_layer
model.reset_parameters() # Re-initializes ALL layers in the pipeline
print(f"New first layer after full reset: {model[0]}")
# All weights in the model are now re-randomized.
# --- Scenario 2: Swap and *only* reset parameters of the new layer ---
# Ideal for fine-tuning or targeted modifications without disturbing learned weights elsewhere.
print("\n--- Swapping layer without full pipeline reset ---")
another_new_layer = Linear(10, 5) # Assuming this fits the pipeline
original_layer_at_idx = model[1] # Save original to compare
model[1] = another_new_layer
if isinstance(model[1], T.Module): # Check if it's a Module to reset
model[1].reset_parameters() # Only resets the new layer's parameters
print(f"Layer at index 1 after swap (only new layer reset): {model[1]}")
# Parameters of other layers in the pipeline remain unchanged.
# --- Scenario 3: Freezing layers ---
# Lock down parts of your network. Essential for transfer learning or architectural experiments.
print("\n--- Freezing a layer ---")
print(f"Requires grad for first layer before freeze: {model[0].params[0].requires_grad}")
model[0].freeze() # Freeze the first layer
print(f"Requires grad for first layer after freeze: {model[0].params[0].requires_grad}")
# This layer's parameters will no longer be updated during backpropagation.
You want to freeze weights? Detach the gradient. You want to mutate activations on the fly? Go ahead. Want to track internal outputs? Inject a hook or just override the fn inline. It's your engine. Drive it like a maniac.
🔥 The Vision: A Fully Fused Stack & Lazy Evaluation
Axon's design enables our ultimate goal: bridging the gap between high-level expression and bare-metal performance, powered by lazy evaluation. This isn't just an optimization; it's a declaration of intent.
- JIT Compiler for CUDA: The explicit pipeline (
>>) is a parsable AST. We will trace and fuse it into high-performance CUDA kernels, leveraging lazy evaluation for optimal execution. This isn't just an optimization; it's a fundamental shift in how we execute deep learning. - Lazy Evaluation for Efficiency: Computations are only performed when their results are actually needed, leading to significant performance gains and reduced memory footprint, especially for complex graphs. Why waste cycles on intermediate results that might never be used?
✅ Core Features & Status
- ✅ Core Components: Flexible
Pipelinefor model definition and composition,Experimentfor tracking. - ✅ Transparent API: Index/slice-based layer access and explicit data flows.
- ✅ Core Autograd Engine: Tape-based and fully functional.
- 🔧 JIT Compilation Engine: AST parsing and fusion (In Development)
- ✅ Built-in Metrics: Accuracy, Precision, F1, etc.
- ✅ Core Layers, Losses, Optimizers
- ✅ CUDA Backend: Custom kernels for performance
- ✅ Lazy Evaluation: Core to the computational graph and optimization strategy.
- ✅ Runtime Layer Swapping & Flexible Parameter Initialization (
reset_parameters()). - ✅ Layer Freezing (
freeze()).
🚀 Getting Started
git clone https://github.com/yushi2006/axon.git
cd axon
pip install -e .
📄 License
MIT License — free to use, modify, and commercialize with attribution.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file axon_dl-0.1.3.tar.gz.
File metadata
- Download URL: axon_dl-0.1.3.tar.gz
- Upload date:
- Size: 135.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7b04b90c9dccb1a872232f59aebb27605e32c9236086d19ee1c7fb0f00892d7d
|
|
| MD5 |
19e6e27dfa086cc7f003a54d9f0440ff
|
|
| BLAKE2b-256 |
dc78078b3b2105d00d32899004d9946df44fad232ed2cee92a400f223c997915
|
File details
Details for the file axon_dl-0.1.3-py3-none-any.whl.
File metadata
- Download URL: axon_dl-0.1.3-py3-none-any.whl
- Upload date:
- Size: 124.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a010f68d300c107cd0910e6f0f005b2ec244c864e234f7c6dcc00851d306b256
|
|
| MD5 |
65d16b969465ca0b953ca6485641142c
|
|
| BLAKE2b-256 |
32df809a1bd7873890b90fc862b3c265228ed709ede242ed63cb0b6452efde87
|