Hook-style hooks for Django bulk operations like bulk_create and bulk_update.
Project description
django-bulk-hooks
⚡ Bulk hooks for Django bulk operations and individual model lifecycle events.
django-bulk-hooks brings a declarative, hook-like experience to Django's bulk_create, bulk_update, and bulk_delete — including support for BEFORE_ and AFTER_ hooks, conditions, batching, and transactional safety. It also provides comprehensive lifecycle hooks for individual model operations.
✨ Features
- Declarative hook system:
@hook(AFTER_UPDATE, condition=...) - BEFORE/AFTER hooks for create, update, delete
- Hook-aware manager that wraps Django's
bulk_operations - NEW:
HookModelMixinfor individual model lifecycle events - Hook chaining, hook deduplication, and atomicity
- Class-based hook handlers with DI support
- Register hooks against abstract models; they apply to all concrete subclasses
- Support for both bulk and individual model operations
🚀 Quickstart
pip install django-bulk-hooks
Define Your Model
from django.db import models
from django_bulk_hooks.models import HookModelMixin
class Account(HookModelMixin):
balance = models.DecimalField(max_digits=10, decimal_places=2)
# The HookModelMixin automatically provides BulkHookManager
Create a Hook Handler
from django_bulk_hooks import hook, AFTER_UPDATE, Hook
from django_bulk_hooks.conditions import WhenFieldHasChanged
from .models import Account
class AccountHooks(Hook):
@hook(AFTER_UPDATE, model=Account, condition=WhenFieldHasChanged("balance"))
def log_balance_change(self, new_records, old_records):
print("Accounts updated:", [a.pk for a in new_records])
@hook(BEFORE_CREATE, model=Account)
def before_create(self, new_records, old_records):
for account in new_records:
if account.balance < 0:
raise ValueError("Account cannot have negative balance")
@hook(AFTER_DELETE, model=Account)
def after_delete(self, new_records, old_records):
print("Accounts deleted:", [a.pk for a in old_records])
🛠 Supported Hook Events
BEFORE_CREATE,AFTER_CREATEBEFORE_UPDATE,AFTER_UPDATEBEFORE_DELETE,AFTER_DELETE
🔄 Lifecycle Events
Individual Model Operations
The HookModelMixin automatically hooks hooks for individual model operations:
# These will hook BEFORE_CREATE and AFTER_CREATE hooks
account = Account.objects.create(balance=100.00)
account.save() # for new instances
# These will hook BEFORE_UPDATE and AFTER_UPDATE hooks
account.balance = 200.00
account.save() # for existing instances
# This will hook BEFORE_DELETE and AFTER_DELETE hooks
account.delete()
Bulk Operations
Bulk operations also hook the same hooks:
# Bulk create - hooks BEFORE_CREATE and AFTER_CREATE hooks
accounts = [
Account(balance=100.00),
Account(balance=200.00),
]
Account.objects.bulk_create(accounts)
# Bulk update - hooks BEFORE_UPDATE and AFTER_UPDATE hooks
for account in accounts:
account.balance *= 1.1
Account.objects.bulk_update(accounts) # fields are auto-detected
# Bulk delete - hooks BEFORE_DELETE and AFTER_DELETE hooks
Account.objects.bulk_delete(accounts)
Queryset Operations
Queryset operations are also supported:
# Queryset update - hooks BEFORE_UPDATE and AFTER_UPDATE hooks
Account.objects.update(balance=0.00)
# Queryset delete - hooks BEFORE_DELETE and AFTER_DELETE hooks
Account.objects.delete()
Subquery Support in Updates
When using Subquery objects in update operations, the computed values are automatically available in hooks. The system efficiently refreshes all instances in bulk for optimal performance:
from django.db.models import Subquery, OuterRef, Sum
def aggregate_revenue_by_ids(self, ids: Iterable[int]) -> int:
return self.find_by_ids(ids).update(
revenue=Subquery(
FinancialTransaction.objects.filter(daily_financial_aggregate_id=OuterRef("pk"))
.filter(is_revenue=True)
.values("daily_financial_aggregate_id")
.annotate(revenue_sum=Sum("amount"))
.values("revenue_sum")[:1],
),
)
# In your hooks, you can now access the computed revenue value:
class FinancialAggregateHooks(Hook):
@hook(AFTER_UPDATE, model=DailyFinancialAggregate)
def log_revenue_update(self, new_records, old_records):
for new_record in new_records:
# This will now contain the computed value, not the Subquery object
print(f"Updated revenue: {new_record.revenue}")
# Bulk operations are optimized for performance:
def bulk_aggregate_revenue(self, ids: Iterable[int]) -> int:
# This will efficiently refresh all instances in a single query
return self.filter(id__in=ids).update(
revenue=Subquery(
FinancialTransaction.objects.filter(daily_financial_aggregate_id=OuterRef("pk"))
.filter(is_revenue=True)
.values("daily_financial_aggregate_id")
.annotate(revenue_sum=Sum("amount"))
.values("revenue_sum")[:1],
),
)
🧠 Why?
Django's bulk_ methods bypass signals and save(). This package fills that gap with:
- Hooks that behave consistently across creates/updates/deletes
- NEW: Individual model lifecycle hooks that work with
save()anddelete() - NEW: Abstract-base hook registration; MTI support removed for simplicity and stability
- Scalable performance via chunking (default 200)
- Support for
@hookdecorators and centralized hook classes - NEW: Automatic hook hooking for admin operations and other Django features
- NEW: Proper ordering guarantees for old/new record pairing in hooks (Salesforce-like behavior)
- NEW: Automatic connection management to prevent connection leaks in long-running processes
🔌 Connection Management (Production Feature)
django-bulk-hooks automatically manages database connections for optimal resource usage. After each bulk operation, connections are immediately returned to the pool, preventing connection leaks in long-running processes.
Why This Matters
Without explicit connection cleanup, Django holds onto database connections until:
- The HTTP request ends (for web requests)
- The worker process terminates (for Celery/async tasks)
CONN_MAX_AGEexpires (default: persistent connections)
This can lead to connection pool exhaustion under concurrent load, especially in:
- ✅ Celery tasks - Critical (prevents "too many connections" errors)
- ✅ Management commands - Important (prevents long-held connections)
- ✅ Async workers - Important (better resource management)
- ✅ Web requests - Modest benefit (faster pool returns)
How It Works
# Every bulk operation automatically closes its connection
Account.objects.bulk_create(accounts) # ✅ Connection returned immediately
Account.objects.bulk_update(accounts, fields=["balance"]) # ✅ Connection returned
Account.objects.filter(balance=0).delete() # ✅ Connection returned
# Even when hooks raise exceptions, connections are cleaned up
# The finally block ensures cleanup happens no matter what
Multi-Database Support
Connection management respects your multi-database setup:
# Closes the correct connection for each database
Account.objects.using("primary").bulk_create(accounts) # ✅ Closes "primary"
Account.objects.using("replica").all() # ✅ Closes "replica"
Configuration
No configuration needed! Connection management is:
- ✅ Automatic and transparent
- ✅ Safe (handles cleanup failures gracefully)
- ✅ Idempotent (safe to call multiple times)
- ✅ Works with all connection pooling backends (PgBouncer, pgpool, etc.)
Performance Impact
Negligible - Connection cleanup just returns the connection to the pool. The real benefit is preventing connection accumulation that leads to "too many clients" database errors.
Real-World Example: Celery Task
from celery import shared_task
from .models import CreditModelResult
@shared_task
def process_credit_model(batch_ids):
"""Process credit model results in a Celery task"""
# Fetch data
results = fetch_credit_model_data(batch_ids)
# Perform bulk operations
CreditModelResult.objects.bulk_update(
results,
fields=["score", "is_qualified", "updated_at"]
)
# ✅ Connection automatically returned to pool
# If 100 tasks run concurrently, you won't hit connection limits!
# Without this feature, you'd accumulate 100+ connections until
# CONN_MAX_AGE expires or workers restart
Logging
Connection management includes debug logging:
# Set logging level to DEBUG to see connection lifecycle
import logging
logging.getLogger('django_bulk_hooks.operations.coordinator').setLevel(logging.DEBUG)
# Output:
# DEBUG: Closed database connection 'default' for Account operation
Connection cleanup failures are logged at WARNING level but don't break your operations (defensive programming).
📦 Usage Examples
Individual Model Operations
# These automatically hook hooks
account = Account.objects.create(balance=100.00)
account.balance = 200.00
account.save()
account.delete()
Bulk Operations
# These also hook hooks
Account.objects.bulk_create(accounts)
Account.objects.bulk_update(accounts) # fields are auto-detected
Account.objects.bulk_delete(accounts)
Advanced Hook Usage
class AdvancedAccountHooks(Hook):
@hook(BEFORE_UPDATE, model=Account, condition=WhenFieldHasChanged("balance"))
def validate_balance_change(self, new_records, old_records):
for new_account, old_account in zip(new_records, old_records):
if new_account.balance < 0 and old_account.balance >= 0:
raise ValueError("Cannot set negative balance")
@hook(AFTER_CREATE, model=Account)
def send_welcome_email(self, new_records, old_records):
for account in new_records:
# Send welcome email logic here
pass
Salesforce-like Ordering Guarantees
The system ensures that old_records and new_records are always properly paired, regardless of the order in which you pass objects to bulk operations:
class LoanAccountHooks(Hook):
@hook(BEFORE_UPDATE, model=LoanAccount)
def validate_account_number(self, new_records, old_records):
# old_records[i] always corresponds to new_records[i]
for new_account, old_account in zip(new_records, old_records):
if old_account.account_number != new_account.account_number:
raise ValidationError("Account number cannot be changed")
# This works correctly even with reordered objects:
accounts = [account1, account2, account3] # IDs: 1, 2, 3
reordered = [account3, account1, account2] # IDs: 3, 1, 2
# The hook will still receive properly paired old/new records
LoanAccount.objects.bulk_update(reordered) # fields are auto-detected
🧩 Integration with Other Managers
Recommended: QuerySet-based Composition (New Approach)
For the best compatibility and to avoid inheritance conflicts, use the queryset-based composition approach:
from django_bulk_hooks.queryset import HookQuerySet
from queryable_properties.managers import QueryablePropertiesManager
class MyManager(QueryablePropertiesManager):
"""Manager that combines queryable properties with hooks"""
def get_queryset(self):
# Get the QueryableProperties QuerySet
qs = super().get_queryset()
# Apply hooks on top of it
return HookQuerySet.with_hooks(qs)
class Article(models.Model):
title = models.CharField(max_length=100)
published = models.BooleanField(default=False)
objects = MyManager()
# This gives you both queryable properties AND hooks
# No inheritance conflicts, no MRO issues!
Alternative: Explicit Hook Application
For more control, you can apply hooks explicitly:
class MyManager(QueryablePropertiesManager):
def get_queryset(self):
return super().get_queryset()
def with_hooks(self):
"""Apply hooks to this queryset"""
return HookQuerySet.with_hooks(self.get_queryset())
# Usage:
Article.objects.with_hooks().filter(published=True).update(title="Updated")
Legacy: Manager Inheritance (Not Recommended)
The old inheritance approach still works but is not recommended due to potential MRO conflicts:
from django_bulk_hooks.manager import BulkHookManager
from queryable_properties.managers import QueryablePropertiesManager
class MyManager(BulkHookManager, QueryablePropertiesManager):
pass # ⚠️ Can cause inheritance conflicts
Why the new approach is better:
- ✅ No inheritance conflicts
- ✅ No MRO (Method Resolution Order) issues
- ✅ Works with any manager combination
- ✅ Cleaner and more maintainable
- ✅ Follows Django's queryset enhancement patterns
Framework needs to: Register these methods Know when to execute them (BEFORE_UPDATE, AFTER_UPDATE) Execute them in priority order Pass ChangeSet to them Handle errors (rollback on failure)
🔄 Migration (1.0.0)
- MTI (Multi-Table Inheritance) support has been removed.
- Register hooks against abstract base models to have them apply to all concrete subclasses.
- Example:
class AbstractBusiness(models.Model):
class Meta:
abstract = True
class Business(AbstractBusiness):
name = models.CharField(max_length=100)
class BusinessHook(Hook):
@hook(AFTER_UPDATE, model=AbstractBusiness)
def on_update(self, new_records, old_records, **kwargs):
...
If any model inherits from a concrete parent (true MTI), an error is raised at import time. Convert parents to abstract models instead.
📝 License
MIT © 2024 Augend / Konrad Beck
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file django_bulk_hooks-0.3.10.tar.gz.
File metadata
- Download URL: django_bulk_hooks-0.3.10.tar.gz
- Upload date:
- Size: 56.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.10 Windows/11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
21389fc94ec15b7a2e6d28427ac256ba25235ac9df06dec14535b9bd634babe3
|
|
| MD5 |
77c07c5c1e42da60342fd47e8c937df5
|
|
| BLAKE2b-256 |
cc28fe4cab01881fa102a7f2a6720ddd5bd365d2f6bad55a4809ef8cddafcb1d
|
File details
Details for the file django_bulk_hooks-0.3.10-py3-none-any.whl.
File metadata
- Download URL: django_bulk_hooks-0.3.10-py3-none-any.whl
- Upload date:
- Size: 67.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.10 Windows/11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
095336e6ad43c41e3563b87fd2832d87ef544222f0b97d07fd91fd6d2004c36e
|
|
| MD5 |
e04d7a50ca9538547d40a8fa47c1b35b
|
|
| BLAKE2b-256 |
4ed9c30e5b3e0f61dffa989c04b429e5b678d73dac394c4de24eff223ec0efbb
|