
Request coalescing for Python - eliminate redundant work and protect your systems from thundering herds

Project description

shared-call-py 🚀

Eliminate redundant work and protect your systems from thundering herds with intelligent request coalescing.

A Python implementation of request deduplication inspired by Go's singleflight pattern. When multiple concurrent requests ask for the same resource, only one does the actual work—everyone else gets the same result instantly.

Requires Python 3.12+ · License: MIT

🎯 The Problem

Modern applications face three critical challenges:

  1. Thundering Herd: When cache expires, hundreds of requests simultaneously hammer your database
  2. Rate Limit Hell: Concurrent identical API calls burn through your rate limits
  3. Database Overload: High concurrency creates connection pool exhaustion and query slowdowns

Traditional approach: Every request executes independently—wasting resources and destabilizing systems.

shared-call-py approach: Coalesce duplicate in-flight requests into a single execution. The first caller becomes the "leader" and does the work. All others wait and receive the same result.

🚀 Real-World Impact

Database Load Reduction

Scenario: 100 concurrent requests hit a database with a connection-pool limit of 10

❌ WITHOUT Request Coalescing
   Concurrent Requests:   100
   Actual DB Queries:     100
   Total Duration:        6.012s
   Avg Latency:           2232.42ms
   p99 Latency:           6010.56ms

✅ WITH Request Coalescing
   Concurrent Requests:   100
   Actual DB Queries:     1
   Total Duration:        0.065s
   Avg Latency:           60.19ms
   p99 Latency:           62.05ms

📊 PERFORMANCE IMPROVEMENT
   Total Speedup:         92.6x faster
   Avg Latency:           37.1x faster
   p99 Latency:           96.9x faster
   DB Queries Eliminated: 99
   Load Reduction:        99.0%

Cache Stampede Protection

Scenario: 100 users hit endpoint simultaneously when cache expires

❌ WITHOUT Protection:
   Duration:       2.004s
   DB Queries:     100 (all 100 hit the database!)
   Wasted Queries: 99

✅ WITH Protection (AsyncSharedCall):
   Duration:       2.005s
   DB Queries:     1 (only the leader executes)
   Coalescing Rate: 99.0%
   Queries Prevented: 99

💡 System stays stable under load!

Rate Limit Prevention

Scenario: API with a 10 requests/second limit, 100 concurrent requests

❌ WITHOUT Coalescing:
   Successful:     10
   Failed:         90 (rate limited!)
   Error handling: Required

✅ WITH Coalescing:
   Successful:     100
   Failed:         0
   API Calls Made: 1
   API Calls Saved: 99
   Rate Limit Status: ✅ No violations

📦 Installation

pip install shared-call-py

Or with Poetry:

poetry add shared-call-py

🎨 Quick Start

Async Usage (Recommended)

import asyncio
from shared_call_py import AsyncSharedCall

# Create a shared call instance
shared = AsyncSharedCall()

@shared.group()
async def fetch_user(user_id: int) -> dict:
    """Expensive database query - only executes once per unique user_id"""
    print(f"🔍 Fetching user {user_id} from database...")
    await asyncio.sleep(1)  # Simulate slow query
    return {"id": user_id, "name": f"User {user_id}"}

# Simulate 100 concurrent requests for the same user
async def main():
    tasks = [fetch_user(42) for _ in range(100)]
    results = await asyncio.gather(*tasks)
    print(f"✅ Got {len(results)} results, but only 1 database query!")

asyncio.run(main())

Output:

🔍 Fetching user 42 from database...
✅ Got 100 results, but only 1 database query!

Sync Usage

import time
from concurrent.futures import ThreadPoolExecutor

from shared_call_py import SharedCall

shared = SharedCall()

@shared.group()
def expensive_operation(x: int) -> int:
    print(f"Computing {x}...")
    time.sleep(1)
    return x * 2

# Ten threads call simultaneously - only one executes, all get the same result
with ThreadPoolExecutor(max_workers=10) as pool:
    results = list(pool.map(expensive_operation, [5] * 10))

🏗️ Use Cases

1. Protect Your Database

from shared_call_py import AsyncSharedCall

shared = AsyncSharedCall()

@shared.group()
async def get_user_profile(user_id: int):
    # Only one query executes, even with thousands of concurrent requests
    return await db.query("SELECT * FROM users WHERE id = ?", user_id)

2. Respect Rate Limits

from shared_call_py import AsyncSharedCall

shared = AsyncSharedCall()

class APIClient:
    @shared.group()
    async def fetch_data(self, endpoint: str):
        # Multiple requests coalesce into one API call
        return await self.http_client.get(endpoint)

# 1000 concurrent requests for the same endpoint = 1 API call

3. Prevent Cache Stampede

from shared_call_py import AsyncSharedCall

shared = AsyncSharedCall()

@shared.group()
async def get_popular_item():
    # When cache expires, only first request refills it
    result = await expensive_computation()
    cache.set("popular_item", result, ttl=300)
    return result

4. Deduplicate Background Jobs

from shared_call_py import AsyncSharedCall

shared = AsyncSharedCall()

@shared.group()
async def process_webhook(webhook_id: str):
    # If duplicate webhooks arrive, only process once
    return await process_payment(webhook_id)

🎛️ Advanced Features

Custom Key Functions

Control coalescing granularity with custom key functions:

from shared_call_py import AsyncSharedCall

shared = AsyncSharedCall()

# Coalesce by user_id only, ignore other parameters
@shared.group(key_fn=lambda user_id, include_details: f"user:{user_id}")
async def fetch_user(user_id: int, include_details: bool = False):
    return await db.get_user(user_id, include_details)
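The effect of a key function can be checked in isolation: calls that map to the same key coalesce, and calls that map to different keys run independently. The lambda below mirrors the decorator above.

```python
# Same key function as in the decorator above
key_fn = lambda user_id, include_details: f"user:{user_id}"

# include_details is ignored, so these two concurrent calls share one execution:
print(key_fn(42, True))   # user:42
print(key_fn(42, False))  # user:42

# A different user_id yields a different key, so it executes independently:
print(key_fn(7, False))   # user:7
```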

Statistics and Monitoring

stats = await shared.get_stats()
print(f"Hit Rate: {stats.hit_rate:.1%}")
print(f"Hits: {stats.hits}")
print(f"Misses: {stats.misses}")
print(f"Errors: {stats.errors}")
print(f"Active Calls: {stats.active}")

Cache Invalidation

# Forget a specific key
await shared.forget("user:42")

# Clear all tracked calls
await shared.forget_all()

# Reset statistics
await shared.reset_stats()

📊 Benchmarks

Run the benchmarks yourself to see detailed results and methodology:

python examples/mock_db_query.py
python examples/thundering_herd.py
python examples/ratelimit.py


🔧 How It Works

  1. First Request: Becomes the "leader" and executes the function
  2. Concurrent Requests: Wait for the leader's result via asyncio.Event or threading.Event
  3. Result Sharing: All waiters receive the same result (or error)
  4. Cleanup: Call completes, resources released
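The four steps above can be sketched in a few dozen lines of standard-library asyncio. This is a toy illustration of the pattern, not shared-call-py's actual implementation:

```python
import asyncio

class MiniSingleflight:
    """Toy coalescer illustrating the four steps (a sketch, not the library's code)."""

    def __init__(self):
        self._calls = {}  # key -> in-flight call state

    async def do(self, key, coro_fn):
        call = self._calls.get(key)
        if call is None:
            # Step 1: the first caller becomes the leader and executes
            call = {"event": asyncio.Event(), "result": None, "error": None}
            self._calls[key] = call
            try:
                call["result"] = await coro_fn()
            except Exception as exc:
                call["error"] = exc            # Step 3: errors are shared too
            finally:
                call["event"].set()            # wake every waiter
                self._calls.pop(key, None)     # Step 4: cleanup
        else:
            # Step 2: concurrent callers wait for the leader's result
            await call["event"].wait()
        if call["error"] is not None:
            raise call["error"]
        return call["result"]

async def demo():
    sf = MiniSingleflight()
    executions = 0

    async def work():
        nonlocal executions
        executions += 1
        await asyncio.sleep(0.05)
        return "payload"

    results = await asyncio.gather(*(sf.do("key", work) for _ in range(50)))
    return executions, results

executions, results = asyncio.run(demo())
print(executions, len(results))  # 1 50
```

Because the leader registers its call state before its first await, every concurrent caller on the same event loop finds the entry and waits instead of re-executing.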

Key features:

  • Thread-safe and async-safe
  • Automatic key generation from function name and arguments
  • Error propagation - all waiters receive the same exception
  • Zero dependencies - uses only Python standard library
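For intuition, automatic key generation might look something like the sketch below. This scheme is an assumption for illustration only; the library's actual key derivation may differ.

```python
# Hypothetical default key scheme (an assumption, NOT shared-call-py's actual code)
def make_key(func_name: str, args: tuple, kwargs: dict) -> str:
    # Sort kwargs so f(a=1, b=2) and f(b=2, a=1) produce the same key
    return f"{func_name}:{args!r}:{sorted(kwargs.items())!r}"

print(make_key("fetch_user", (42,), {}))  # fetch_user:(42,):[]
```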

🤝 When NOT to Use

  • Mutations: Don't coalesce write operations (POST, PUT, DELETE)
  • User-specific data: Each user needs their own result
  • Time-sensitive: When staleness matters (though you can forget() keys)
  • Side effects: Functions with important side effects beyond the return value

🛠️ Development

# Clone the repository
git clone https://github.com/yourusername/shared-call-py.git
cd shared-call-py

# Install dependencies
poetry install

# Run tests
poetry run pytest

# Run benchmarks
python examples/mock_db_query.py

📝 License

MIT License - see LICENSE file for details.

🌟 Credits

Inspired by Go's singleflight pattern and adapted for Python's async/await paradigm.

🤔 FAQ

Q: What happens if the leader fails?
A: All waiting callers receive the same exception. They can retry, which will elect a new leader.

Q: How is this different from caching?
A: Caching stores past results. Coalescing deduplicates in-flight requests. They complement each other.
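To make the distinction concrete, here is a minimal sketch layering a result cache over in-flight coalescing, using only the standard library and a shared asyncio.Task (illustrative, not shared-call-py's implementation):

```python
import asyncio

_cache = {}     # caching: stores completed results for later calls
_inflight = {}  # coalescing: shares one Task among concurrent callers

loads = 0

async def load(key):
    """Stand-in for an expensive fetch; counts how often it actually runs."""
    global loads
    loads += 1
    await asyncio.sleep(0.01)
    return key.upper()

async def get(key):
    if key in _cache:                # cache hit: past result, no work at all
        return _cache[key]
    task = _inflight.get(key)
    if task is None:                 # cache miss: first caller starts the refill
        task = asyncio.ensure_future(load(key))
        _inflight[key] = task
    value = await task               # everyone awaits the same Task
    _inflight.pop(key, None)
    _cache[key] = value
    return value

async def demo():
    burst = await asyncio.gather(*(get("item") for _ in range(20)))
    later = await get("item")        # now served from the cache
    return burst, later

burst, later = asyncio.run(demo())
print(loads, len(burst), later)  # 1 20 ITEM
```

During the burst, coalescing keeps the refill to a single `load`; afterwards, caching means later calls do no work at all.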

Q: Does this work with FastAPI/Django/Flask?
A: Yes! It's framework-agnostic. Just decorate your data-fetching functions.

Q: What about memory leaks?
A: Completed calls are automatically cleaned up. Use forget() or forget_all() for manual control.


Built with ❤️ to make Python applications faster and more resilient.



Download files

Download the file for your platform.

Source Distribution

shared_call_py-0.1.0.tar.gz (7.8 kB)

Uploaded Source

Built Distribution


shared_call_py-0.1.0-py3-none-any.whl (10.3 kB)

Uploaded Python 3

File details

Details for the file shared_call_py-0.1.0.tar.gz.

File metadata

  • Download URL: shared_call_py-0.1.0.tar.gz
  • Upload date:
  • Size: 7.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for shared_call_py-0.1.0.tar.gz
  • SHA256: 644df37a95f9080ed98474b615cc81a4ce001d6582f9a5f403de11a84c20c2fd
  • MD5: 43a362dffbee475e6e378e611faa6a33
  • BLAKE2b-256: eff883be44c0d5690f56d7b589b70347c728722bb7c3752caf42e980c754ec1e


File details

Details for the file shared_call_py-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: shared_call_py-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 10.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for shared_call_py-0.1.0-py3-none-any.whl
  • SHA256: 8db4fbe45a07d0aa1775286b558692cb6875897f604d491a5295be3a7e4743fe
  • MD5: 2144391ced7eb41e1e83f15f6b114c6f
  • BLAKE2b-256: 11d98c68f6882b8aa2ea1464e825883bed4ec74f70b1e1a7d68748561e252917

