A modern HTTP client for Python with HTTP/3, WebSocket extensions, and enhanced security features
Project description
urllib4: An Enhanced HTTP Client for Python
ai-urllib4 is a modern HTTP client for Python that builds upon the foundation of urllib3 while adding enhancements for modern web applications. It provides a powerful yet user-friendly interface for making HTTP requests with advanced features.
Features
urllib4 provides a comprehensive set of features for modern web applications:
✅ Core Features:
- Connection pooling and thread safety
- Basic URL parsing and manipulation
- HTTP header handling and collections
- Multipart form data encoding
- SSL/TLS utility functions
- File upload functionality
- HTTP/HTTPS request handling
- Proxy support
- Retry mechanisms
- Redirect handling
- Compression support
✅ Advanced Features:
- Enhanced HTTP/2 Support
- WebSocket capabilities with extensions and subprotocols
- Improved security features
- HTTP/3 (QUIC) support with Multipath QUIC
✅ AI Features:
- Adaptive Headers & Request Optimization
- Domain Memory (learns from success/failure)
- Anomaly Detection & Smart Retries
Usage Example
You can use urllib4 for your HTTP requests with a simple, intuitive API:
# Simple GET request
>>> import urllib4
>>> resp = urllib4.request("GET", "http://httpbin.org/robots.txt")
>>> resp.status
200
>>> resp.data
b"User-agent: *\nDisallow: /deny\n"
# POST request with JSON data
>>> import urllib4
>>> import json
>>> data = {"name": "John", "age": 30}
>>> resp = urllib4.request(
... "POST",
... "http://httpbin.org/post",
... headers={"Content-Type": "application/json"},
... body=json.dumps(data).encode()
... )
>>> resp.status
200
Installation
You can install urllib4 with pip:
$ pip install urllib4
For additional features, you can install optional dependencies:
# For HTTP/3 support
$ pip install ai-urllib4[http3]
# For WebSocket subprotocols
$ pip install ai-urllib4[websocket]
# For all optional features
$ pip install ai-urllib4[all]
Alternatively, you can install from source:
$ git clone https://github.com/zinzied/urllib4.git
$ cd urllib4
$ pip install -e .
Development and Testing
To set up a development environment:
$ git clone https://github.com/zinzied/urllib4.git
$ cd urllib4
$ pip install -e ".[dev]"
To run tests:
$ python -m pytest
Note that many tests are currently failing as the library is under active development.
Documentation
Documentation is currently limited to code comments and this README. As the project matures, more comprehensive documentation will be developed.
Roadmap
The following features are planned for future development:
HTTP/3 Enhancements
- Connection migration for improved reliability
- HTTP/3 priority management
- WebTransport support
WebSocket Enhancements
- Additional WebSocket extensions
- WebSocket over HTTP/2
HTTP/2 Support
import urllib4
from urllib4.http2 import inject_into_urllib4, ConnectionProfile
# Enable HTTP/2 support
inject_into_urllib4()
# Create a pool manager with a specific connection profile
http = urllib4.PoolManager(http2_profile=ConnectionProfile.HIGH_PERFORMANCE)
# Make a request (automatically uses HTTP/2 if the server supports it)
response = http.request("GET", "https://nghttp2.org")
print(f"HTTP version: {response.version_string}")
WebSocket Support
from urllib4.websocket import WebSocketConnection
# Connect to a WebSocket server with compression and health monitoring
ws = WebSocketConnection(
"wss://echo.websocket.org",
enable_compression=True,
compression_level=9,
enable_health_monitoring=True,
ping_interval=30.0,
ping_timeout=5.0,
)
# Connect to the server
ws.connect()
# Send a text message
ws.send("Hello, WebSocket!")
# Send a binary message
ws.send(b"\x01\x02\x03\x04")
# Send a structured object using a subprotocol
ws = WebSocketConnection(
"wss://echo.websocket.org",
protocols=["json"],
)
ws.connect()
ws.send({"name": "John", "age": 30}) # Automatically encoded as JSON
# Receive a message with timeout
try:
message = ws.receive(timeout=5.0)
if isinstance(message, dict):
print(f"Received JSON: {message}")
else:
print(f"Received: {message.text}")
except WebSocketTimeoutError:
print("Timed out waiting for message")
# Close the connection
ws.close()
WebSocket Extensions and Subprotocols
urllib4 supports various WebSocket extensions and subprotocols:
-
Extensions:
permessage-deflate: Compresses WebSocket messages for better performance
-
Subprotocols:
json: Automatically encodes/decodes JSON datamsgpack: Efficiently encodes/decodes MessagePack data (requiresmsgpackpackage)cbor: Efficiently encodes/decodes CBOR data (requirescbor2package)
HTTP/3 Support
import urllib4
from urllib4.http3 import HTTP3Connection, HTTP3Settings, QUICSettings, inject_into_urllib4
# Direct HTTP/3 usage
quic_settings = QUICSettings(
enable_multipath=True, # Enable Multipath QUIC
max_paths=4,
enable_0rtt=True, # Enable 0-RTT for faster connections
)
http3_settings = HTTP3Settings(
quic=quic_settings,
)
# Create an HTTP/3 connection
conn = HTTP3Connection(
"cloudflare-quic.com", # A server that supports HTTP/3
settings=http3_settings,
)
# Connect and make a request
conn.connect()
response = conn.request("GET", "/")
print(f"Status: {response.status}")
print(f"Body: {response.data.decode()[:100]}...")
# Close the connection
conn.close()
# Alternatively, inject HTTP/3 support into urllib4
inject_into_urllib4()
# Now all HTTPS requests will automatically use HTTP/3 when available
http = urllib4.PoolManager()
response = http.request("GET", "https://cloudflare-quic.com/")
AI-Powered Smart Client
SmartClient is a high-level HTTP client that uses AI to learn from response patterns and optimize requests.
from ai_urllib4 import SmartClient
from ai_urllib4.exceptions import AIInitializationError
try:
# Initialize with AI optimization and your choice of LLM backend
client = SmartClient(
ai_optimize=True,
learn_from_success=True,
api_key="YOUR_API_KEY", # Optional: enables LLM-powered expert advice
ai_provider="gemini", # Optional: "gemini" (default)
ai_model="gemini-1.5-flash"
)
except AIInitializationError as e:
print(f"AI Initialization failed: {e}")
# Fallback to basic client without AI backend
client = SmartClient(ai_optimize=True)
# AI will suggest optimal headers/timing and learn from the result
response = client.request("GET", "https://example.com/api")
# Access learned domain insights
insights = client.get_domain_insights("example.com")
print(f"Success rate for example.com: {insights['success_rate']}")
# Detect anomalies or blocks
# (e.g., detecting Cloudflare challenges or sudden protocol changes)
anomaly = client.detect_anomaly(response)
if anomaly["is_anomaly"]:
print(f"⚠️ Warning: {anomaly['reason']}")
Key features:
- Request Pattern Learning: Remembers successful timings and headers per domain.
- Response Classification: Distinguishes between normal content, challenges (WAF), and blocks.
- Header Optimization: Suggests and rotates headers based on learned history.
- Retry Intelligence: Adapts retry logic using AI strategies (e.g., increasing delay after a 403).
- LLM Backends: Supports pluggable AI expert advice via Gemini and others.
🚀 AsyncIO Support
First-class support for asyncio and anyio is now available.
import asyncio
from ai_urllib4.async_connectionpool import connection_from_url
async def main():
pool = connection_from_url("http://httpbin.org")
response = await pool.urlopen("GET", "/get")
print(response)
await pool.close()
asyncio.run(main())
Enhanced Security Features
urllib4 includes built-in support for advanced security features like HSTS, SPKI Pinning, and Certificate Transparency.
import urllib4
from urllib4.util.cert_verification import SPKIPinningVerifier, CertificateTransparencyPolicy
from urllib4.util.hsts import HSTSCache, HSTSHandler
# HSTS is enabled by default in PoolManager and SmartClient
# It automatically upgrades HTTP requests to HTTPS for known HSTS hosts
client = urllib4.PoolManager()
client.request("GET", "http://github.com") # Automatically upgrades to HTTPS
# You can also manually manage HSTS policies
hsts_cache = HSTSCache()
hsts_handler = HSTSHandler(hsts_cache)
url = "http://github.com"
secured_url = hsts_handler.secure_url(url) # Returns https://github.com if policy exists
# Create a pool manager with SPKI pinning
pins = {
"example.com": {"pin-sha256:YLh1dUR9y6Kja30RrAn7JKnbQG/uEtLMkBgFF2Fuihg="}
}
http = urllib4.PoolManager(
spki_pins=pins,
cert_transparency_policy=CertificateTransparencyPolicy.BEST_EFFORT
)
Contributing
This project is in its early stages and contributions are welcome! Here's how you can help:
- Bug Reports: If you find a bug, please open an issue with detailed information.
- Feature Requests: Have ideas for new features? Open an issue to discuss.
- Code Contributions: Pull requests are welcome for bug fixes or new features.
- Documentation: Help improve or expand the documentation.
- Testing: Help write or improve tests for the codebase.
Development Guidelines
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add some amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
Why ai-urllib4? (vs urllib3)
While ai-urllib4 is built on the solid foundation of urllib3, it introduces key differences designed for modern, high-performance applications:
| Feature | urllib3 | ai-urllib4 |
|---|---|---|
| Async Support | ❌ Synchronous only (blocking) | ✅ Native AsyncIO (async/await) |
| Configuration | Manual Tuning | 🤖 AI Smart Config (Heuristic optimization) |
| HTTP/2 & 3 | Experimental / Partial | ✅ First-Class Support (Multipath QUIC, etc.) |
| WebSocket | Limited / Extension needed | ✅ Built-in Advanced Support (Subprotocols, etc.) |
Key Differentiators:
- 🚀 AsyncIO Support:
ai-urllib4allows non-blocking requests perfect for apps usingFastAPI,Quart, or standardasyncio. - 🤖 AI-Powered Tuning: The
AISmartConfigmodule automatically suggests timeout and retry settings based on the target URL type (API vs Download). - 🔒 Modern Security: Stronger focus on Certificate Transparency and SPKI Pinning out of the box.
Security Considerations
urllib4 is designed with security in mind, providing robust SSL/TLS verification, certificate transparency checking, and HSTS support. It's suitable for use in production environments where security is a priority.
Acknowledgements
This project builds on concepts from the urllib3 project and other Python HTTP libraries. We extend our gratitude to the authors and maintainers of these projects for their foundational work.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ai_urllib4-2.1.2.tar.gz.
File metadata
- Download URL: ai_urllib4-2.1.2.tar.gz
- Upload date:
- Size: 101.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
de0149a274c30dc99a07d4f13aa742747acaf2b4e1c073e0d041c7de54d13782
|
|
| MD5 |
b14aec7e5a55a793f639251c88cacaf2
|
|
| BLAKE2b-256 |
eea521ef9f718c98fa98d45b2fcd556e8dcd648e036c3b73b7b8e58fbcd1c22d
|
File details
Details for the file ai_urllib4-2.1.2-py3-none-any.whl.
File metadata
- Download URL: ai_urllib4-2.1.2-py3-none-any.whl
- Upload date:
- Size: 116.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1977ecfa0eb701a8f1dd8be3060d1be1b2d54fbcfd9f137f31d637fd36427b44
|
|
| MD5 |
abeef580eb37b6ce99a3930f18eee72e
|
|
| BLAKE2b-256 |
2db6196480c2513228fe850a052fa0ebf1cf8588e71c52da905d1fc805775c1a
|