A Cross-Library Compatible Library for PyTorch and TensorFlow that provides advanced features for ease of use

These details have not been verified by PyPI

Project links

Project description

ToTf

A Cross-Library Compatible Library for PyTorch and TensorFlow that provides advanced features for ease of use, which are not available directly in the base frameworks.

Features

🎯 TrainingMonitor

Real-time training progress tracking with automatic logging:

Progress bars with tqdm integration
CSV logging with timestamps and running averages (Keras-style)
Resource monitoring - RAM and VRAM usage tracking
Crash-resistant - auto-flush to prevent data loss
Flexible - works with any DataLoader or iterable

🔍 SmartSummary

Advanced model analysis with intelligent insights (UNIQUE features vs torchsummary/torchinfo):

Bottleneck detection - automatically identifies optimization opportunities
Gradient tracking - reveals training instabilities and vanishing/exploding gradients
Loss curve analysis - diagnose training dynamics (convergence, overfitting, divergence, plateau detection)
Comprehensive analysis - layer shapes, parameters, and memory usage
Export capabilities - save to files or export as dictionaries
Complex architectures - works with nested models, residual connections, etc.
Cross-framework support - Available for both PyTorch and TensorFlow/Keras

✨ What's New (Added NEW!)

Top-level dynamic dispatcher (ToTf): a centralized backend dispatcher now exposes core components from the package root so users can from ToTf import SmartSummary, ModelView, draw_graph, get_component instead of importing backend-specific subpackages.
get_component(name, backend_name=...): load utilities or components dynamically from the detected or selected backend (replaces importing ToTf.pytorch.* or ToTf.tenf.* directly).
draw_graph() dispatcher: call ToTf.draw_graph(...) and the package will route the call to the active backend's implementation (convenience wrapper for ModelView rendering).
Examples updated: example scripts were updated to use the top-level ToTf API and include a small sys.path guideline to run examples from ToTf/examples/ when not installed as a package.
Dry-run init & gradient checks - runs a lightweight forward/backward pass to flag zero or abnormally large initial gradients and suspicious initialization scales (Xavier/He heuristics)
Receptive Field bookkeeping - per-layer RF/jump/start propagation supporting dilations and asymmetric kernels across branches; emits warnings when inputs to merges have mismatched RF metadata
Precise activation memory profiling - optional keep_activations=True stores captured output tensors and uses actual tensor sizes for a more accurate memory profile. PyTorch also supports keep_activations_strong=True with max_saved_activation_bytes to retain tensors for deeper analysis while avoiding OOM.
Configurable thresholds - constructor options control sensitivity: grad_large_threshold, grad_zero_tol, param_ratio_bottleneck, activation_bottleneck_mb, and init/std heuristics (init_std_warn_multiply, init_std_warn_min_mult).

🛠️ Utility Functions

Framework-agnostic "missing" functions that save time and improve code quality:

Auto-Shape Flattener

lazy_flatten(tensor) - Automatically flatten tensors without manual size calculation
get_flatten_size(shape) - Calculate flattened dimensions for Conv->Linear/Dense transitions
Use case: Eliminates error-prone manual calculations when transitioning from convolutional to fully connected layers

Normalized Cross-Correlation (NCC) Loss

loss_ncc(y_true, y_pred) - NCC loss function for medical imaging and registration
ncc_score(y_true, y_pred) - NCC similarity metric (higher is better)
NCCLoss() - Keras-compatible NCC loss class (TensorFlow)
Use case: Medical imaging (ACDC, ADNI datasets), robust to intensity variations, better than MSE for registration tasks

Learning Rate Finder

find_lr(model, optimizer, ...) - Find optimal learning rate automatically
LRFinder - Full-featured LR range test class with plotting
Use case: Automatically discover the best learning rate before training (fast.ai style), avoid manual tuning

📊 ModelView (PyTorch & TensorFlow)

Publication-quality neural network architecture diagrams for both frameworks:

PyTorch support - Wraps torchview for comprehensive PyTorch model visualization
TensorFlow support - Native implementation for Keras models
High-resolution outputs - PNG, PDF, SVG for research papers (300-600 DPI)
Automatic graph layout - Beautiful visualizations with minimal configuration
Comprehensive annotations - Layer types, shapes, parameter counts
Complex architectures - Residual connections, multi-input/output, branching
Customizable styling - Colors, fonts, layouts for different aesthetics
Export formats - JSON summaries, text tables, visual diagrams
Unified API - Same interface for both PyTorch and TensorFlow
Use case: Generate publication-ready architecture diagrams for research papers, presentations, and documentation

Installation

pip install -r requirements.txt

System Requirements for ModelView

ModelView requires Graphviz and torchview for rendering diagrams:

# Install Python packages
pip install graphviz torchview

# Install system Graphviz
# Ubuntu/Debian:
sudo apt-get install graphviz

# macOS:
brew install graphviz

# Windows (via conda):
conda install -c conda-forge graphviz

Documentation

📚 Detailed Guides:

ModelView Quick Start - PyTorch - Get started with PyTorch diagrams in 5 minutes
ModelView Quick Start - TensorFlow - Get started with TensorFlow diagrams in 5 minutes
Loss Curve Analysis Guide - Diagnose training dynamics and detect overfitting
PyTorch ModelView Implementation - PyTorch-specific implementation details
ModelView Implementation Details - Technical specifications and features
Utilities Implementation - Detailed utility functions documentation
TensorFlow Implementation - TensorFlow-specific features and design

📖 Examples:

example_modelview_pytorch.py - PyTorch ModelView examples (6+ comprehensive examples)
example_modelview_tf.py - TensorFlow ModelView examples (8+ comprehensive examples)
example_smartsummary.py / example_smartsummary_tf.py - SmartSummary usage
example_loss_analysis_pytorch.py - Loss curve analysis examples (PyTorch)
example_loss_analysis_tf.py - Loss curve analysis examples (TensorFlow)
example_utils_pytorch.py / example_utils_tf.py - Utility functions

🧪 Verification:

Run python verify_modelview_pytorch.py to test PyTorch ModelView installation
Run python verify_modelview.py to test TensorFlow ModelView installation
Run pytest test/ to run full test suite

Installation
Documentation
Quick Start
TrainingMonitor Guide
SmartSummary Guide
ModelView Guide
Utility Functions Guide
Project Structure
Examples
API Reference
Comparison with Alternatives

Quick Start

PyTorch

TrainingMonitor (PyTorch)

from ToTf.pytorch import TrainingMonitor

for epoch in range(epochs):
    monitor = TrainingMonitor(train_loader, desc=f"Epoch {epoch+1}")
    
    for batch in monitor:
        loss = train_step(batch)
        monitor.log({'loss': loss.item()})

SmartSummary (PyTorch)

from ToTf.pytorch import SmartSummary

# Basic analysis
model = YourModel()
summary = SmartSummary(model, input_size=(3, 224, 224))
summary.show()

# Find bottlenecks
bottlenecks = summary.get_bottlenecks(top_n=5)
for bn in bottlenecks:
    print(f"{bn['layer']}: {', '.join(bn['reasons'])}")

# Track gradients for debugging
summary = SmartSummary(model, input_size=(3, 224, 224), track_gradients=True)
    - `keep_activations_strong` (opt-in): retain strong references to activations; use with `max_saved_activation_bytes` to limit memory
    - `grad_large_threshold`, `grad_zero_tol`: control gradient anomaly detection
    - `param_ratio_bottleneck`, `activation_bottleneck_mb`: control bottleneck detection thresholds
summary.show()

Utility Functions (PyTorch)

from ToTf.pytorch import lazy_flatten, loss_ncc, find_lr

# 1. Auto-flatten Conv output
class MyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 64, 3)
        self.fc = nn.Linear(get_flatten_size((64, 30, 30)), 10)
    
    def forward(self, x):
        x = self.conv(x)
        x = lazy_flatten(x)  # No manual calculation needed!
        return self.fc(x)

# 2. NCC loss for medical imaging
loss = loss_ncc(ground_truth, prediction)  # Robust to intensity variations

# 3. Find optimal learning rate
best_lr = find_lr(model, optimizer, criterion, train_loader)
print(f"Use learning rate: {best_lr}")

ModelView (PyTorch)

from ToTf.pytorch import ModelView, draw_graph

# Quick visualization
model = MyModel()
draw_graph(model, input_size=(3, 224, 224), save_path='model.png')

# Advanced usage with customization
view = ModelView(model, input_size=(3, 224, 224))
view.show()  # Text summary powered by torchview

# High-resolution PNG for papers (300 DPI)
view.render('architecture.png', dpi=300, show_shapes=True)

# PDF for LaTeX documents
view.render('architecture.pdf', format='pdf')

# SVG for perfect scaling
view.render('architecture.svg', format='svg')

# Horizontal layout for wide figures
view.render('architecture_wide.png', rankdir='LR', dpi=600)

# Advanced torchview features
view_detailed = ModelView(
    model, 
    input_size=(3, 224, 224),
    depth=4,  # Show nested modules  
    expand_nested=True,  # Expand Sequential blocks
    hide_inner_tensors=False  # Show all tensors
)
view_detailed.render('detailed_architecture.png', dpi=300)

# Export summary as JSON
view.save_summary_json('model_summary.json')

TensorFlow/Keras

SmartSummary (TensorFlow)

from ToTf.tenf import SmartSummary

# Basic analysis
model = tf.keras.Sequential([...])
summary = SmartSummary(model, input_shape=(224, 224, 3))
summary.show()

# Find bottlenecks
bottlenecks = summary.get_bottlenecks(top_n=5)
for bn in bottlenecks:
    print(f"{bn['layer']}: {', '.join(bn['reasons'])}")

# Track gradients for debugging
summary = SmartSummary(model, input_shape=(224, 224, 3), track_gradients=True)
    - `param_ratio_bottleneck`, `activation_bottleneck_mb`: control bottleneck detection thresholds
    - `grad_large_threshold`, `init_std_warn_multiply`, `init_std_warn_min_mult`: control dry-run init/gradient heuristics
summary.show()

ModelView (TensorFlow)

from ToTf.tenf import ModelView, draw_graph

# Quick visualization
model = tf.keras.Sequential([...])
draw_graph(model, input_shape=(224, 224, 3), save_path='model.png')

# Advanced usage with customization
view = ModelView(model, input_shape=(224, 224, 3))
view.show()  # Text summary

# High-resolution PNG for papers (300 DPI)
view.render('architecture.png', dpi=300, show_shapes=True, show_params=True)

# PDF for LaTeX documents
view.render('architecture.pdf', format='pdf')

# SVG for perfect scaling
view.render('architecture.svg', format='svg')

# Horizontal layout for wide figures
view.render('architecture_wide.png', rankdir='LR', dpi=600)

# Export summary as JSON
view.save_summary_json('model_summary.json')

Utility Functions (TensorFlow)

from ToTf.tenf import lazy_flatten, NCCLoss, find_lr

# 1. Auto-flatten Conv output
class MyModel(keras.Model):
    def __init__(self):
        super().__init__()
        self.conv = keras.layers.Conv2D(64, 3)
        self.fc = keras.layers.Dense(10)
    
    def call(self, x):
        x = self.conv(x)
        x = lazy_flatten(x)  # Automatic flattening!
        return self.fc(x)

# 2. NCC loss for medical imaging
model.compile(optimizer='adam', loss=NCCLoss())  # Keras-compatible!

# 3. Find optimal learning rate
best_lr = find_lr(model, keras.losses.CategoricalCrossentropy(), train_dataset)
print(f"Use learning rate: {best_lr}")

---

## Framework Differences

### SmartSummary: PyTorch vs TensorFlow

Both implementations provide the same core features, but with framework-specific adaptations:

| Aspect | PyTorch | TensorFlow/Keras |
|--------|---------|------------------|
| **Import** | `from ToTf.pytorch import SmartSummary` | `from ToTf.tenf import SmartSummary` |
| **Input parameter** | `input_size=(3, 224, 224)` (channels first) | `input_shape=(224, 224, 3)` (channels last) |
| **Model type** | `torch.nn.Module` | `tf.keras.Model` |
| **Device param** | `device='cuda'` ✓ | Not needed (TF auto-manages) |
| **Gradient tracking** | Uses backward hooks | Uses `tf.GradientTape` |
| **Multi-input** | Tuple of sizes | List of shapes |
| **Extra methods** | - | `compare_with_keras_summary()` |

**Key differences:**
- **Shape convention**: PyTorch uses channels-first (C, H, W), TensorFlow uses channels-last (H, W, C)
- **Automatic builds**: TensorFlow models build automatically on first forward pass
- **Multi-input models**: TensorFlow has first-class support with `input_shape=[(shape1), (shape2)]`

---

## TrainingMonitor Guide

### Basic Usage

```python
from ToTf import TrainingMonitor

for epoch in range(epochs):
    monitor = TrainingMonitor(train_loader, desc=f"Epoch {epoch+1}", log_file="train.csv")
    
    for batch in monitor:
        loss = train_step(batch)
        monitor.log({'loss': loss.item(), 'lr': optimizer.param_groups[0]['lr']})

Features

Automatic CSV Logging:

timestamp,step,loss,lr,RAM_pct,VRAM_gb
2026-02-06 10:30:15,0,0.6931,0.001,45.2,2.14
2026-02-06 10:30:16,1,0.6523,0.001,45.4,2.14

Running Averages:
Metrics are automatically averaged across steps (Keras-style), so displayed values represent cumulative running averages.

Resource Monitoring:
Automatically tracks RAM usage (%) and VRAM usage (GB if CUDA available).

SmartSummary Guide

Common Patterns

PyTorch

1. Quick Model Analysis

from ToTf.pytorch import SmartSummary

summary = SmartSummary(model, input_size=(3, 224, 224))
summary.show()

2. Find Bottlenecks

bottlenecks = summary.get_bottlenecks(top_n=5)
for bn in bottlenecks:
    print(f"⚠️ {bn['layer']}: Score {bn['score']:.1f}")
    print(f"   Issues: {', '.join(bn['reasons'])}")
    print(f"   Parameters: {bn['params']:,}")

3. Debug Training Issues

summary = SmartSummary(model, input_size=(3, 224, 224), track_gradients=True)
summary.show()  # Shows gradient variance/mean/max - useful for finding vanishing/exploding gradients

4. Export Analysis

summary.save_to_file("model_analysis.txt")
data = summary.to_dict()  # For programmatic access

5. CUDA Models

summary = SmartSummary(model, input_size=(3, 224, 224), device='cuda')

6. Without Forward Pass (Fast)

summary = SmartSummary(model)  # Just count parameters, no shape inference

7. Analyze Training Loss Curves (PyTorch)

# During training loop, collect losses
train_losses = []
val_losses = []

for epoch in range(epochs):
    train_loss = train_epoch(model, train_loader, optimizer)
    val_loss = validate_epoch(model, val_loader)
    
    train_losses.append(train_loss)
    val_losses.append(val_loss)

# Analyze training dynamics
summary = SmartSummary(model, input_size=(3, 224, 224))
result = summary.analyze_loss_curve(train_losses, val_losses)
# Detects: overfitting, divergence, oscillation, plateau, convergence

TensorFlow/Keras

1. Quick Model Analysis

from ToTf.tenf import SmartSummary

summary = SmartSummary(model, input_shape=(224, 224, 3))
summary.show()

2. Find Bottlenecks

bottlenecks = summary.get_bottlenecks(top_n=5)
for bn in bottlenecks:
    print(f"⚠️ {bn['layer']}: Score {bn['score']:.1f}")
    print(f"   Issues: {', '.join(bn['reasons'])}")
    print(f"   Parameters: {bn['params']:,}")

3. Debug Training Issues

summary = SmartSummary(model, input_shape=(224, 224, 3), track_gradients=True)
summary.show()  # Shows gradient variance/mean/max

4. Multi-Input Models

# For models with multiple inputs
summary = SmartSummary(model, input_shape=[(224, 224, 3), (100,)])
summary.show()

5. Compare with Keras Summary

summary.compare_with_keras_summary()  # Shows both summaries

6. Export Analysis

summary.save_to_file("model_analysis.txt")
data = summary.to_dict()  # For programmatic access

7. Analyze Training Loss Curves

# Collect losses during training
train_losses = [2.5, 2.1, 1.8, 1.6, 1.5, 1.45, 1.42, 1.41]
val_losses = [2.6, 2.2, 1.9, 1.7, 1.65, 1.62, 1.60, 1.59]

# Analyze training dynamics
result = summary.analyze_loss_curve(train_losses, val_losses)
# Automatically detects: overfitting, divergence, oscillation, plateau, convergence
# Provides actionable recommendations for improvement

# Programmatic use (no printing)
result = summary.analyze_loss_curve(train_losses, val_losses, verbose=False)
if result['status'] == 'Overfitting Detected':
    print("Early stopping recommended!")

Loss Curve Analysis

SmartSummary can analyze your training loss curves to diagnose training health:

Detects:

✅ Healthy Convergence - Loss decreasing and stabilizing properly
⚠️ Overfitting - Training loss decreasing while validation loss increases
⚠️ Divergence - Loss increasing (training instability)
📊 Plateau - Loss stopped improving
📈 Oscillation - High variance in loss values
🐌 Slow Convergence - Training progressing too slowly

Provides:

Trend analysis (slope, improvement percentage)
Stability metrics (coefficient of variation)
Actionable recommendations (learning rate adjustments, regularization, etc.)

Quick Example:

# During training
history = model.fit(x_train, y_train, validation_data=(x_val, y_val), epochs=20)

# Analyze the training
result = summary.analyze_loss_curve(
    train_losses=history.history['loss'],
    val_losses=history.history['val_loss']
)

print(f"Status: {result['status']}")
print(f"Improvement: {result['metrics']['improvement_percent']:.2f}%")
for rec in result['recommendations']:
    print(f"  • {rec}")

See Loss Curve Analysis Documentation for detailed examples and integration patterns.

Bottleneck Detection

SmartSummary identifies bottlenecks based on:

High parameters: Layers with >10% of total model parameters
High gradient variance: Indicates training instability (when tracking enabled)
Large outputs: Intermediate tensors >10MB

Each bottleneck gets a score - higher scores indicate more critical optimization opportunities.

ModelView Guide

Overview

ModelView generates publication-quality neural network architecture diagrams for TensorFlow/Keras models, similar to torchview for PyTorch. Perfect for research papers, presentations, and documentation.

Key Features:

🎨 High-resolution outputs (300-600 DPI) suitable for academic papers
📄 Multiple formats: PNG, PDF, SVG
🎯 Automatic graph layout with beautiful styling
📊 Shows layer types, shapes, and parameter counts
🔀 Handles complex architectures (residual, multi-input/output, branching)
⚙️ Customizable styling (colors, fonts, layouts)

Quick Start

from ToTf.tenf import ModelView, draw_graph
import tensorflow as tf

# Create your model
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, 3, activation='relu', input_shape=(28, 28, 1)),
    tf.keras.layers.MaxPooling2D(2),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation='softmax')
])

# Quickest way - one-liner
draw_graph(model, input_shape=(28, 28, 1), save_path='model.png')

Basic Usage

from ToTf.tenf import ModelView

# Initialize ModelView
view = ModelView(model, input_shape=(28, 28, 1))

# Show text summary
view.show()

# Render high-resolution PNG
view.render('model.png', dpi=300)

Output Formats

# PNG - for presentations and quick viewing
view.render('architecture.png', format='png', dpi=300)

# PDF - for LaTeX documents (vector graphics)
view.render('architecture.pdf', format='pdf')

# SVG - for perfect scaling in any size
view.render('architecture.svg', format='svg')

Layout Options

# Vertical layout (top-to-bottom) - default
view.render('model_vertical.png', rankdir='TB')

# Horizontal layout (left-to-right) - for wide figures
view.render('model_horizontal.png', rankdir='LR')

Customization

# Control what to display
view.render(
    'model_custom.png',
    show_shapes=True,        # Display tensor shapes
    show_layer_names=True,   # Display layer names
    show_params=True,        # Display parameter counts
    dpi=600                  # Extra high resolution
)

# Custom node styling
custom_node_style = {
    'shape': 'box',
    'style': 'rounded,filled',
    'fillcolor': '#f0f0f0',
    'fontname': 'Helvetica',
    'fontsize': '11',
    'color': '#333333',
    'penwidth': '2'
}

# Custom edge styling
custom_edge_style = {
    'color': '#666666',
    'penwidth': '2',
    'arrowsize': '1.0'
}

view.render(
    'model_styled.png',
    node_style=custom_node_style,
    edge_style=custom_edge_style
)

Complex Architectures

Multi-Input Models:

# Model with text and image inputs
text_input = keras.Input(shape=(100,), name='text')
image_input = keras.Input(shape=(64, 64, 3), name='image')
# ... build model ...

model = keras.Model(inputs=[text_input, image_input], outputs=outputs)

# Visualize - provide list of input shapes
view = ModelView(model, input_shape=[(100,), (64, 64, 3)])
view.render('multimodal.png', rankdir='LR', dpi=300)

Residual Networks:

# ResNet-like architecture with skip connections
inputs = keras.Input(shape=(32, 32, 3))
x = layers.Conv2D(64, 3, padding='same')(inputs)
residual = x
x = layers.Conv2D(64, 3, padding='same')(x)
x = layers.Add()([x, residual])  # Skip connection
# ... more layers ...

model = keras.Model(inputs=inputs, outputs=outputs)
view = ModelView(model, input_shape=(32, 32, 3))
view.render('resnet.png', dpi=400)

Attention/Transformer Models:

# Transformer with attention mechanism
inputs = keras.Input(shape=(50, 128))
attention = layers.MultiHeadAttention(num_heads=8, key_dim=32)(inputs, inputs)
# ... build transformer ...

view = ModelView(model, input_shape=(50, 128))
view.render('transformer.png', rankdir='TB', dpi=300)

Export and Analysis

# Get summary as dictionary
summary_dict = view.get_summary_dict()
print(f"Total layers: {summary_dict['num_layers']}")
print(f"Total parameters: {summary_dict['total_parameters']:,}")

# Save summary as JSON
view.save_summary_json('model_architecture.json')

# Text-based summary with connections
view.show(detailed=True)

Best Practices for Publications

For Research Papers (LaTeX)

# Use PDF format for LaTeX - perfect quality at any zoom
view.render('paper_figure.pdf', format='pdf', dpi=300)

# Or high-DPI PNG if journal requires raster
view.render('paper_figure.png', format='png', dpi=600)

For Presentations

# Standard resolution PNG
view.render('presentation_arch.png', dpi=300, rankdir='LR')

For Documentation/Web

# SVG for responsive scaling
view.render('docs_architecture.svg', format='svg')

# Or moderate PNG
view.render('docs_architecture.png', dpi=150)

Installation Requirements

ModelView requires Graphviz:

# Python package
pip install graphviz

# System graphviz (required!)
# Ubuntu/Debian:
sudo apt-get install graphviz

# macOS:
brew install graphviz

# Windows:
# Download from https://graphviz.org/download/
# Add to PATH after installation

Examples

See example_modelview_tf.py for comprehensive examples including:

Simple Sequential models
CNN architectures
ResNet with skip connections
Multi-input models
Transformer/Attention models
Autoencoders
Custom styling

Run examples:

cd ToTf
python example_modelview_tf.py

All generated diagrams will be in the outputs/ directory.

Utility Functions Guide

Auto-Shape Flattener

Problem: Transitioning from convolutional layers to dense/linear layers requires calculating exact flattened sizes, which is error-prone.

Solution: lazy_flatten() and get_flatten_size() handle this automatically.

PyTorch Example

from ToTf.pytorch import lazy_flatten, get_flatten_size
import torch.nn as nn

class MyCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 32, 3)
        self.conv2 = nn.Conv2d(32, 64, 3)
        self.pool = nn.MaxPool2d(2)
        
        # Calculate flatten size automatically
        # Input: 32x32 -> Conv: 30x30 -> Pool: 15x15 -> Conv: 13x13 -> Pool: 6x6
        flat_size = get_flatten_size((64, 6, 6))  # 2304
        self.fc = nn.Linear(flat_size, 10)
    
    def forward(self, x):
        x = self.pool(F.relu(self.conv1(x)))
        x = self.pool(F.relu(self.conv2(x)))
        x = lazy_flatten(x)  # Automatically flattens to [batch, 2304]
        return self.fc(x)

TensorFlow Example

from ToTf.tenf import lazy_flatten, get_flatten_size
from tensorflow import keras

class MyCNN(keras.Model):
    def __init__(self):
        super().__init__()
        self.conv1 = keras.layers.Conv2D(32, 3)
        self.conv2 = keras.layers.Conv2D(64, 3)
        self.pool = keras.layers.MaxPooling2D(2)
        
        # Calculate flatten size automatically (channels-last format)
        flat_size = get_flatten_size((6, 6, 64))  # 2304
        self.fc = keras.layers.Dense(10)
    
    def call(self, x):
        x = self.pool(tf.nn.relu(self.conv1(x)))
        x = self.pool(tf.nn.relu(self.conv2(x)))
        x = lazy_flatten(x)  # Automatically flattens
        return self.fc(x)

Normalized Cross-Correlation (NCC) Loss

Problem: Medical imaging tasks (registration, segmentation) need losses robust to intensity variations. MSE fails when images have different brightness/contrast.

Solution: NCC loss measures structural similarity, invariant to linear intensity transformations.

When to Use NCC

✅ Medical image registration (MRI, CT scans)
✅ Image alignment tasks
✅ When images have varying intensity/contrast
✅ ACDC, ADNI, and similar medical datasets
❌ Classification tasks (use CrossEntropy)
❌ When exact pixel values matter

PyTorch Example

from ToTf.pytorch import loss_ncc, ncc_score
import torch.nn as nn

# Medical image segmentation model
model = MySegmentationModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(epochs):
    for images, masks in dataloader:
        optimizer.zero_grad()
        
        predictions = model(images)
        loss = loss_ncc(masks, predictions)  # NCC loss
        
        loss.backward()
        optimizer.step()
        
        # Track similarity score (higher is better)
        score = ncc_score(masks, predictions)
        print(f"NCC Score: {score.item():.4f}")

TensorFlow Example

from ToTf.tenf import NCCLoss, ncc_score

# Method 1: Use in model.compile()
model = MySegmentationModel()
model.compile(
    optimizer='adam',
    loss=NCCLoss(),  # Keras-compatible!
    metrics=['mse']  # Can track MSE for comparison
)

model.fit(x_train, y_train, epochs=10)

# Method 2: Use as function
for images, masks in dataset:
    with tf.GradientTape() as tape:
        predictions = model(images)
        loss = loss_ncc(masks, predictions)
        score = ncc_score(masks, predictions)

NCC Characteristics:

Returns 0 when images are identical
Range: 0 to 2 (lower is better for loss)
Score range: -1 to 1 (higher is better)
Invariant to linear intensity scaling: loss_ncc(img, img*2) ≈ 0

Learning Rate Finder

Problem: Finding the optimal learning rate requires trial and error. Too high = divergence, too low = slow training.

Solution: LR Finder runs a short test with exponentially increasing LRs to find the "sweet spot".

PyTorch Example

from ToTf.pytorch import find_lr, LRFinder
import torch.nn as nn

model = MyModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # Placeholder LR
criterion = nn.CrossEntropyLoss()

# Method 1: Quick and easy
best_lr = find_lr(model, optimizer, criterion, train_loader)
print(f"Suggested LR: {best_lr}")

# Now use the found LR
optimizer = torch.optim.Adam(model.parameters(), lr=best_lr)

# Method 2: Full control
lr_finder = LRFinder(model, optimizer, criterion, device='cuda')
lr_finder.range_test(train_loader, start_lr=1e-7, end_lr=10, num_iter=100)
lr_finder.plot()  # Shows loss vs LR curve
best_lr = lr_finder.get_best_lr()

TensorFlow Example

from ToTf.tenf import find_lr, LRFinder
from tensorflow import keras

model = MyModel()
loss_fn = keras.losses.SparseCategoricalCrossentropy()

# Method 1: Quick and easy
best_lr = find_lr(model, loss_fn, train_dataset)
print(f"Suggested LR: {best_lr}")

# Now compile with optimal LR
model.compile(optimizer=keras.optimizers.Adam(learning_rate=best_lr), loss=loss_fn)

# Method 2: Full control
lr_finder = LRFinder(model, loss_fn)
lr_finder.range_test(train_dataset, start_lr=1e-7, end_lr=10, num_iter=100)
lr_finder.plot()  # Shows loss vs LR curve
best_lr = lr_finder.get_best_lr()

How it Works:

Starts with a very small LR (e.g., 1e-7)
Trains for a few iterations, gradually increasing LR
Records loss at each LR
Finds the LR where loss decreases fastest (steepest gradient)
Stops if loss starts diverging

Best LR Selection:

The tool suggests the LR at the steepest negative gradient
This is typically 1/10th to 1/3 of the LR where loss starts increasing
Always visually inspect the plot for confirmation

API Reference

TrainingMonitor

TrainingMonitor(iterable, desc="Training", log_file="train_log.csv")

Parameters:

iterable - DataLoader or any iterable to monitor
desc (str) - Description for progress bar (default: "Training")
log_file (str) - CSV file path (default: "train_log.csv")

Methods:

log(metrics: Dict[str, float]) - Log metrics and update running averages

CSV Columns:

timestamp - ISO format timestamp
step - Global step counter
<metric> - Your logged metrics (running average)
RAM_pct - RAM usage percentage
VRAM_gb - GPU memory in GB (0 if no CUDA)

SmartSummary

PyTorch API

SmartSummary(model, input_size=None, batch_size=1, device='cpu', track_gradients=False)

Parameters:

model (nn.Module) - PyTorch model to analyze
input_size (Tuple, optional) - Input shape excluding batch, e.g., (3, 224, 224)
batch_size (int) - Batch size for inference (default: 1)
device (str) - Device: 'cpu' or 'cuda' (default: 'cpu')
track_gradients (bool) - Track gradient statistics (default: False, slower)

TensorFlow/Keras API

SmartSummary(model, input_shape=None, batch_size=1, track_gradients=False)

Parameters:

model (keras.Model) - TensorFlow/Keras model to analyze
input_shape (Tuple or List[Tuple], optional) - Input shape excluding batch, e.g., (224, 224, 3) for single input or [(224, 224, 3), (100,)] for multi-input
batch_size (int) - Batch size for inference (default: 1)
track_gradients (bool) - Track gradient statistics (default: False, slower)

Common Methods (Both Frameworks)

Methods:

show(show_bottlenecks=True) - Display formatted summary table
get_bottlenecks(top_n=5) - Get list of bottleneck layers
to_dict() - Export complete analysis as dictionary
save_to_file(filename) - Save summary to text file (UTF-8)

TensorFlow-specific Methods:

compare_with_keras_summary() - Show both Keras built-in summary and SmartSummary side-by-side

Bottleneck Dictionary Keys:

layer - Layer identifier
layer_name - Layer class name
score - Bottleneck score (higher = more critical)
reasons - List of issues (e.g., "High params (91.4%)")
params - Parameter count
output_shape - Output tensor shape

Utility Functions

lazy_flatten

# PyTorch
lazy_flatten(tensor: torch.Tensor, start_dim: int = 1) -> torch.Tensor

# TensorFlow
lazy_flatten(tensor: tf.Tensor, start_dim: int = 1) -> tf.Tensor

Parameters:

tensor: Input tensor to flatten
start_dim: Dimension to start flattening from (default: 1, preserves batch)

Returns: Flattened tensor

get_flatten_size

get_flatten_size(input_shape: Tuple[int, ...]) -> int

Parameters:

input_shape: Shape of tensor (excluding batch dimension)

Returns: Total flattened size

Example:

PyTorch: get_flatten_size((64, 7, 7)) → 3136
TensorFlow: get_flatten_size((7, 7, 64)) → 3136

loss_ncc

# PyTorch
loss_ncc(y_true: torch.Tensor, y_pred: torch.Tensor, eps: float = 1e-8) -> torch.Tensor

# TensorFlow
@tf.function
loss_ncc(y_true: tf.Tensor, y_pred: tf.Tensor, eps: float = 1e-8) -> tf.Tensor

Parameters:

y_true: Ground truth tensor
y_pred: Predicted tensor
eps: Small constant for numerical stability (default: 1e-8)

Returns: NCC loss value (range: 0 to 2, lower is better)

ncc_score

ncc_score(y_true, y_pred, eps: float = 1e-8)

Parameters: Same as loss_ncc

Returns: NCC similarity score (range: -1 to 1, higher is better)

Note: ncc_score = 1.0 - loss_ncc

NCCLoss (TensorFlow only)

NCCLoss(eps: float = 1e-8, name: str = "ncc_loss")

Keras-compatible NCC Loss class that can be used with model.compile().

Example:

model.compile(optimizer='adam', loss=NCCLoss())

find_lr

# PyTorch
find_lr(
    model: nn.Module,
    optimizer: Optimizer,
    criterion: Callable,
    dataloader: DataLoader,
    device: str = 'cpu',
    start_lr: float = 1e-7,
    end_lr: float = 10.0,
    num_iter: int = 100,
    plot: bool = True
) -> float

# TensorFlow
find_lr(
    model: keras.Model,
    loss_fn: Loss,
    dataset: tf.data.Dataset,
    optimizer: Optional[Optimizer] = None,
    start_lr: float = 1e-7,
    end_lr: float = 10.0,
    num_iter: int = 100,
    plot: bool = True
) -> float

Parameters:

model: Model to analyze
optimizer/loss_fn: Optimizer (PyTorch) or loss function (TensorFlow)
dataloader/dataset: Training data
start_lr: Starting LR (default: 1e-7)
end_lr: Ending LR (default: 10.0)
num_iter: Number of iterations (default: 100)
plot: Show plot (default: True)

Returns: Suggested optimal learning rate

LRFinder

# PyTorch
LRFinder(model, optimizer, criterion, device='cpu')

# TensorFlow
LRFinder(model, loss_fn, optimizer=None)

Methods:

range_test(dataloader/dataset, start_lr, end_lr, num_iter, ...): Run LR range test
plot(skip_start, skip_end, log_lr, show_best, save_path): Plot results
get_best_lr(): Get suggested learning rate

Comparison with Alternatives

SmartSummary vs Other Tools

Feature	SmartSummary	torchsummary	torchinfo	TF model.summary()
Basic layer info	✓	✓	✓	✓
Bottleneck detection	✓	✗	✗	✗
Gradient tracking	✓	✗	✗	✗
Optimization insights	✓	✗	✗	✗
Memory estimation	✓	✓	✓	✗
Export to file/dict	✓	✗	✓	✗
Complex models	✓	Limited	✓	Limited
PyTorch support	✓	✓	✓	✗
TensorFlow support	✓	✗	✗	✓

SmartSummary is the ONLY tool with automatic bottleneck detection and gradient tracking!

Examples

PyTorch Examples

example_usage.py - TrainingMonitor examples
example_smartsummary.py - SmartSummary examples (6 detailed scenarios)
example_utils_pytorch.py - Utility functions examples (5 scenarios: lazy_flatten, NCC loss, LR finder, complete pipeline)

TensorFlow Examples

example_smartsummary_tf.py - SmartSummary examples for TensorFlow/Keras (8 detailed scenarios including multi-input models, gradient tracking, and MobileNetV2 analysis)
example_utils_tf.py - Utility functions examples (6 scenarios: lazy_flatten, NCC loss with Keras, LR finder, Functional API usage)

Testing

Run the test suites to verify everything works:

# Test TrainingMonitor (PyTorch)
python test/test_monitor.py
python test/test_integration.py

# Test SmartSummary (PyTorch)
python test/test_smartsummary.py

# Test SmartSummary (TensorFlow)
python test/test_smartsummary_tf.py

# Test Utility Functions (PyTorch)
python test/test_utils_pytorch.py

# Test Utility Functions (TensorFlow)
python test/test_utils_tf.py

# Test Cross-Framework Integration
python test/test_utils_integration.py

All tests pass with 100% success rate ✓

Backend Detection

ToTf automatically detects whether PyTorch or TensorFlow is installed:

from ToTf import get_backend
print(get_backend())  # Returns 'torch' or 'tensorflow'

Project Structure

ToTf/
├── __init__.py                     # Package exports with auto backend detection
├── backend.py                      # Backend detection (PyTorch/TensorFlow)
├── requirements.txt                # Dependencies
├── setup.py                        # Package config
│
├── Documentation/
│   ├── README.md                   # This file (main documentation)
│   ├── QUICKSTART_MODELVIEW.md     # ModelView quick start guide
│   ├── MODELVIEW_IMPLEMENTATION_SUMMARY.md  # ModelView technical details
│   ├── UTILITIES_IMPLEMENTATION_SUMMARY.md  # Utility functions documentation
│   └── TENSORFLOW_IMPLEMENTATION_SUMMARY.md # TensorFlow implementation notes
│
├── pytorch/
│   ├── __init__.py
│   ├── trainingmonitor.py         # TrainingMonitor implementation
│   ├── smartsummary.py            # SmartSummary implementation (PyTorch)
│   └── utils.py                   # Utility functions (lazy_flatten, NCC loss, LR finder)
│
├── tenf/
│   ├── __init__.py
│   ├── smartsummary.py            # SmartSummary implementation (TensorFlow)
│   ├── modelview.py               # ModelView for architecture diagrams (NEW!)
│   └── utils.py                   # Utility functions (lazy_flatten, NCC loss, LR finder)
│
├── Examples/
│   ├── example_usage.py           # TrainingMonitor examples
│   ├── example_smartsummary.py    # SmartSummary examples (PyTorch)
│   ├── example_smartsummary_tf.py # SmartSummary examples (TensorFlow)
│   ├── example_modelview_tf.py    # ModelView examples (8+ examples)
│   ├── demo_complex_architectures.py # Complex architecture demos (5 examples) (NEW!)
│   ├── example_utils_pytorch.py   # Utility functions examples (PyTorch)
│   └── example_utils_tf.py        # Utility functions examples (TensorFlow)
│
├── test/
│   ├── test_monitor.py            # TrainingMonitor tests
│   ├── test_integration.py        # Integration tests
│   ├── test_smartsummary.py       # SmartSummary tests (PyTorch)
│   ├── test_smartsummary_tf.py    # SmartSummary tests (TensorFlow)
│   ├── test_modelview_tf.py       # ModelView tests (41 tests, 9 classes) (NEW!)
│   ├── test_utils_pytorch.py      # Utility functions tests (PyTorch)
│   ├── test_utils_tf.py           # Utility functions tests (TensorFlow)
│   └── test_utils_integration.py  # Cross-framework integration tests
│
└── verify_modelview.py            # Quick ModelView verification script

License

See LICENSE file for details.

Contributing

Contributions are welcome! Please ensure all tests pass before submitting PRs.

# Run all tests
pytest test/ -v

# Run specific test suites
pytest test/test_modelview_tf.py -v      # ModelView tests (32 tests)
pytest test/test_smartsummary_tf.py -v   # SmartSummary tests
pytest test/test_utils_tf.py -v          # Utilities tests

What's New

v0.2.2 Connection Extraction Fix 🔧

✅ Fixed connection extraction - Edges now properly displayed in complex architectures
✅ Improved graph visualization - Parallel branches, cross-connections, and merge points now visible
✅ Keras API compatibility - Updated to work with latest TensorFlow/Keras node structure
✅ Verified with 41 tests - All architectures correctly visualized

v0.2.1 - Enhanced Testing

✅ Extended test coverage - Added 9 new tests for complex architectures
✅ Multiple branches & cross-connections - Inception, DenseNet, DAG structures
✅ Advanced topologies - Parallel branches, skip connections, multi-output
✅ Demo examples - 5 complex architecture visualization demos
✅ Test suite expanded - 41 total tests, 100% passing

v0.2.0 - ModelView Release

✅ ModelView for TensorFlow - Publication-quality architecture diagrams
✅ High-resolution outputs (PNG, PDF, SVG) at 300-600 DPI
✅ Support for complex architectures (ResNet, multi-input, attention)
✅ Full documentation and 8+ examples
✅ Cleaner layer labels (just type names, no redundant info)

Previous Features:

SmartSummary with bottleneck detection
TrainingMonitor with progress tracking
Utility functions (NCC loss, LR finder, auto-flatten)

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.3

Jun 5, 2026

This version

0.2.2

Jun 1, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

totf-0.2.2.tar.gz (108.4 kB view details)

Uploaded Jun 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

totf-0.2.2-py3-none-any.whl (67.2 kB view details)

Uploaded Jun 1, 2026 Python 3

File details

Details for the file totf-0.2.2.tar.gz.

File metadata

Download URL: totf-0.2.2.tar.gz
Upload date: Jun 1, 2026
Size: 108.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for totf-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`f62a0dda191761979c75c9b5bd46009f7f3e875b1972a9d66f563f49c7aa3435`
MD5	`499aa2d77da7aad226663848bb2e2dff`
BLAKE2b-256	`394b6fe455e92779191cae0e79a169dd371ef0c8b3158bbb4a28ee4d3412a22e`

See more details on using hashes here.

File details

Details for the file totf-0.2.2-py3-none-any.whl.

File metadata

Download URL: totf-0.2.2-py3-none-any.whl
Upload date: Jun 1, 2026
Size: 67.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for totf-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7f1b24c2cec603cb18c11bfcc82f09cbca9b77ff07ddcf78ea5e97e7b5fcfa67`
MD5	`68c77ed846026858260a22786f14e777`
BLAKE2b-256	`af7706fef9a8a2b43bafa1cfc2b73b0df13cf4e3574b80363f65a93054fb9273`

See more details on using hashes here.

ToTf 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ToTf

Features

🎯 TrainingMonitor

🔍 SmartSummary

✨ What's New (Added NEW!)

🛠️ Utility Functions

Auto-Shape Flattener

Normalized Cross-Correlation (NCC) Loss

Learning Rate Finder

📊 ModelView (PyTorch & TensorFlow)

Installation

System Requirements for ModelView

Documentation

Table of Contents

Quick Start

PyTorch

TrainingMonitor (PyTorch)

SmartSummary (PyTorch)

Utility Functions (PyTorch)

ModelView (PyTorch)

TensorFlow/Keras

SmartSummary (TensorFlow)

ModelView (TensorFlow)

Utility Functions (TensorFlow)

Features

SmartSummary Guide

Common Patterns

PyTorch

TensorFlow/Keras

Loss Curve Analysis

Bottleneck Detection

ModelView Guide

Overview

Quick Start

Basic Usage

Output Formats

Layout Options

Customization

Complex Architectures

Export and Analysis

Best Practices for Publications

For Research Papers (LaTeX)

For Presentations

For Documentation/Web

Installation Requirements

Examples

Utility Functions Guide

Auto-Shape Flattener

PyTorch Example

TensorFlow Example

Normalized Cross-Correlation (NCC) Loss

When to Use NCC

PyTorch Example

TensorFlow Example

Learning Rate Finder

PyTorch Example

TensorFlow Example

API Reference

TrainingMonitor

SmartSummary

PyTorch API

TensorFlow/Keras API

Common Methods (Both Frameworks)

Utility Functions

lazy_flatten

get_flatten_size

loss_ncc

ncc_score

NCCLoss (TensorFlow only)

find_lr

LRFinder