Large scale WebSocket with FastAPI
Project description
FastAPI Distributed Websocket
A library to implement websocket for distibuted systems based on FastAPI.
N.B.: This library is still at an early stage, use it in production at your own risk.
What it does
The main features of this libarary are:
- Easly implementing broadcasting, pub/sub, chat rooms, etc...
- Proxy websocket connections to other servers (e.g. from an api gateway)
- Authentication
- Clean exception handling
- An in memory broker for fast development
Problems of scaling websocket among multiple servers in production
Websocket is a relatively new protocol for real time communication over HTTP. It establishes a durable, stateful, full-duplex connection between clients and the server. It can be used to implement chats, real time notifications, broadcasting and pub/sub models.
Connections from clients
HTTP request/response mechanism fits very well to scale among multiple server instances in production. Any time a client makes a request, it can connect to any server instance and it's going to receive the same response. After the response has been returned to the client, it went disconnected and it can make another request without the need to hit the same instace as before. This thanks to the stateless nature of HTTP.
However, Websocket establishes a stateful connection between the client and the server and, if some error occurs and the connection went lost, we have to ensure that clients are going to hit the same server instance they were connected before, since that instance was managing the connection state.
Stateful means that there is a state that can be manipulated. In particular, a stateful connection is a connection that heavily relies on its state in order to work
Broadcasting and group messages
Another problem of scaling Websocket occurs when we need to send messages to multiple connected clients (i.e. broadcasting a message or sending a message to all clients subscribed to a specific topic).
Imagine that we have a chat server, and that when an user send a message in a specific chat room, we broadcast it to all users subscribed to that room. If we have a single server instance, all connection are managed by this instance so we can safely trust that the message will be delivered to all recipents. On the other hand, with multiple server instances, users subscribing to a chat room will probably connect to different instances. This way, if an user send a message to the chat room 'xyz' at the server A, users subscribed to the same chat room at the server B are not receiving it.
Documenting Websocket interfaces
Another common problem with Websocket, that's not even related to scaling, is about documentation. Due to the event driven nature of the Websocket protocol it does not fit well to be documented with openapi. However a new specification for asynchronous, event driven interfaces has been defined recently. The spec name is asyncapi and I'm currently studying it. I don't know if this has to be implemented here or it's better having a separate library for that, however this is surely something we have to look at.
Other problems
When I came first to think about this library, I started making a lot of research of common problems related to Websocket on stackoverflow, reddit, github issues and so on. I found some interesting resource that are however related to the implementation itself. I picked up best solutions and elaborated my owns converging all of that in this library.
Examples
Installation
$ pip install fastapi-distributed-websocket
Basic usage
This is a basic example using an in memory broker with a single server instance.
from fastapi import FastAPI, WebSocket, WebSocketDisconnect, status
from distributed_websocket import Connection, WebSocketManager
app = FastAPI()
manager = WebSocketManager('channel:1', broker_url='memory://')
...
app.on_event('startup')
async def startup() -> None:
...
await manager.startup()
app.on_event('shutdown')
async def shutdown() -> None:
...
await manager.shutdown()
@app.websocket('/ws/{conn_id}')
async def websocket_endpoint(
ws: WebSocket,
conn_id: str,
*,
topic: Optional[Any] = None,
) -> None:
connection: Connection = await manager.new_connection(ws, conn_id)
try:
while True:
msg = await connection.receive_json()
await manager.broadcast(msg)
except WebSocketDisconnect:
await manager.remove_connection(connection)
The manger.new_connection
method create a new Connection object and add it to
the manager.active_connections
list. Note that after a WebSocketDisconnect
is raised, we call remove_connection
: this method only remove the connection
object from the manager.active_connections
list, without calling connection.close
, since
the connection is already closed.
If you need to close a connection at any other time, you can use manager.close_connection
.
If you use connection.iter_json
, it already handles the WebSocketDisconnect
exception, so
you can simply call manager.remove_connection
just after the loop (see next code block).
Note that here we are using manager.broadcast
to send the message to all connections managed
by the WebSocketManager instance. However, this method only work if we have a single server
instance. If we have multiple server instances, we have to use manager.receive
, to properly
send the message to the broker.
@app.websocket('/ws/{conn_id}')
async def websocket_endpoint(
ws: WebSocket,
conn_id: str,
*,
topic: Optional[Any] = None,
) -> None:
connection: Connection = await manager.new_connection(ws, conn_id)
# This is the preferred way of handling WebSocketDisconnect
async for msg in connection.iter_json():
await manager.receive(connection, msg)
await manager.remove_connection(connection)
Proxy from an API gateway
Let's say we are developing a chat service and that all our services are behind
an API gateway. If we want to keep our websocket service behind it too, then
fastapi-distributed-websocket provides us with WebSocketProxy
.
from distributed_websocket import WebSocketProxy
# skipped other imports for brevity
app = FastAPI()
WS_TARGET_ENDPOINT = 'ws://websocket_service:8000/wshandler'
@app.websocket('/ws')
async def websocket_proxy(websocket: WebSocket):
await websocket.accept()
ws_proxy = WebSocketProxy(websocket, WS_TARGET_ENDPOINT)
await ws_proxy()
This will forward all messages from the client to the target endpoint and all messages from the target endpoint to the client.
Now let's assume that our websocket service code is the code of our previous example. Our API Gateway code will be:
from distributed_websocket import WebSocketProxy
# skipped other imports for brevity
app = FastAPI()
WS_TARGET_ENDPOINT = 'ws://websocket_service:8000/ws/{}'
@app.websocket('/ws/{conn_id}')
async def websocket_endpoint(
ws: WebSocket,
conn_id: str,
) -> None:
await websocket.accept()
ws_proxy = WebSocketProxy(websocket, WS_TARGET_ENDPOINT.format(conn_id))
await ws_proxy()
API Reference
Connection
Connection objects wrap the websocket connection and provide a simple interface
to send and receive messages. They have a topics
attribute to store subscriptions
patterns and implement pub/sub models.
async
accept(self) -> None
Accept the connection.async
close(self, code: int = 1000) -> None
Close the connection with the specified status.async
receive_json(self) -> Any
Receive a JSON message.async
send_json(self, data: Any) -> None
Send a JSON message over the connection.async
iter_json(self) -> AsyncIterator[Any]
Iterate over the messages received over the connection.
Messages
Message objects store the message type, the topic and the data. They provides
an easy serialization/deserialization mechanism.
Remeber that messages returned by connection.iter_json
are already deserialized
into dict
objects, so here we call deserialization the process of converting
a dict
object into a Message
object.
-
type: str
The message type. -
topic: str
The message topic. -
conn_id: str | list[str]
The connection id or list of connection ids that the message should be sent to. -
data: Any
The message data. -
classmethod
from_client_message(cls, *, data: Any) -> Message
Create a message from a client message. -
__serialize__(self) -> dict
Serialize the message into adict
object.
Subscriptions
You can bind topics to connection objects to implement pub/sub models, notification and so on.
The topics
attribute is a set of strings that follows the pattern matching syntax of MQTT.
This library share connection objects state between server instances, so you may find
references to terms like channel
, publish
, subscribe
and unsubscribe
referring to
the same concepts but applied to the underlying server/broker communication.
This may be confusing, but remember to keep separated the communication between the server
and the clients, that you are developing and the communication between the server and the broker,
that you usually don't deal with.
subscribe(connection: Connection, message: Message) -> None
Subscribe a connection tomessage.topic
.unsubscribe(connection: Connection, message: Message) -> None
Unsubscribe a connection frommessage.topic
.hanlde_subscription_message(connection: Connection, message: Message) -> None
Callssubscribe
orunsubscribe
depending on the message type.matches(topic: str, patterns: set[str]) -> bool
Check iftopic
matches any of the patterns inpatterns
.
Authentication
Authentication is provided with the WebSocketOAuth2PasswordBearer
class.
It inherits from FastAPI OAuth2PasswordBearer
and overrides __call__
method to accept
a WebSocket
object.
async
__call__(self, websocket: WebSocket) -> str | None
Authenticate the websocket connection and return the Authorization header value.
If the authentication fails, returnNone
if the objects has been initialized withauto_error=False
or close the connection with theWS_1008_POLICY_VIOLATION
code.
Exceptions and Exception Handling
fastapi-distributed-websocket
provides exception handling via decorators. You can use the
apposite decorators passing an exception class and a handler callable. Exception handlers
should accept only the exception object as argument.
Why this is useful?
Because sometimes the same type of exception can be raised by different parts of the application,
this way you can decorate the higer level function in the call stack to handle the exception at
any level.
A base WebSocketException
class is provided to bind connection objects to the exception, so
your handler function can easily access it.
If you need to access connection objects from the exception handler, your custom exceptions
should inherit from WebSocketException
, no matter if they are really network related or not.
-
WebSocketException(self, message: str, *, connection: Connection) -> None
-
InvalidSubscription(self, message: str, *, connection: Connection) -> None
Raised when a subscription pattern use an invalid syntax. Inherits fromWebSocketException
. -
InvalidSubscriptionMessage(self, message: str, *, connection: Connection) -> None
LikeInvalidSubscription
it could be raised for bad syntax, but it could also be raised
when the message type is not subscribe or unsubscribe. Inherits fromWebSocketException
. -
handle(exc: BaseException, handler: Callable[..., Any]) -> Callable[..., Any]
Decorator to handle exceptions. If you decorate a function with this decorator, at any time
an exception of typeexc
is raised or propagated to the function, it will be handled byhandler
.
Use this decorator only if both your handler and the function are not async. -
async
ahandle( exc: BaseException, handler: Callable[..., Coroutine[Any, Any, Any]] ) -> Callable[..., Any]
Decorator to handle exceptions, same adhandle
, but the handler is a coroutine function.
Use this if your handler is a coroutine function, while the decorated function could be
either a sync or an async function.
Broker Interfaces
Connections' state is shared between server instances using a pub/sub broker. By default,
the broker is a reids.asyncio.Redis
instance (ex aioredis.Redis
), but you can use any
other implementation. fastapi-distributed-websocket
provides an InMemoryBroker
class
for development purposes.
You can inherit from BrokerInterface
and override the methods to implement your own broker.
async
connect(self) -> Coroutine[Any, Any, None]
Connect to the broker.async
disconnect(self) -> Coroutine[Any, Any, None]
Disconnect from the broker.async
subscribe(self, channel: str) -> Coroutine[Any, Any, None]
Subscribe to a channel.async
unsubscribe(self, channel: str) -> Coroutine[Any, Any, None]
Unsubscribe from a channel.async
publish(self, channel: str, message: Any) -> Coroutine[Any, Any, None]
Publish a message to a channel.async
get_message(self, **kwargs) -> Coroutine[Any, Any, Message | None]
Get a message from the broker.
WebSocketManager
The WebSocketManager
class is where the main logic of the library is implemented.
It keeps track of the connection objects and starts the broker connection.
It spawn a main task, a listener that wait (non-blocking) for messages from the broker,
and send them to the connection objects (broadcasting or checking for subscriptions)
spawning a new task for each send.
The broker initialisation is done in the constructor while calls to broker.connect
and
broker.disconnect
are handled in the startup
and shutdown
methods.
async
new_connection( self, websocket: WebSocket, conn_id: str, topic: str | None = None ) -> Coroutine[Any, Any, Connection]
Create a new connection object, add it toself.active_connections
and return it.async
close_connection( self, connection: Connection, code: int = status.WS_1000_NORMAL_CLOSURE ) -> Coroutine[Any, Any, None]
Close a connection object and remove it fromself.active_connections
.remove_connection(self, connection: Connection) -> None
Remove a connection object fromself.active_connections
.set_conn_id(self, connection: Connection, conn_id: str) -> None
Set the connection id and notify the client.send(self, topic: str, message: Any) -> None
Send a message to all the connection objects subscribed totopic
.
It spawns a new task wrapping the coroutine resulting fromself._send
.broadcast(self, message: Any) -> None
Send a message to all the connection objects.
It spawns a new task wrapping the coroutine resulting fromself._broadcast
.send_by_conn_id(self, conn_id: str | list[str], message: Any) -> None
Send a message to all the connection objects with the given connection id.
It spawns a new task wrapping the coroutine resulting fromself._send_by_conn_id
ifconn_id
is a string or from_send_multi_by_conn_id
if it is a list.send_msg(self, message: Message) -> None
Based on the message type, it callssend
,send_by_conn_id
orbroadcast
.async
receive( self, connection: Connection, message: Any ) -> Coroutine[Any, Any, None]
Receive a message from a connection object. It passes the message down to
a private method that handle eventual subscriptions and then publish the message
to the broker.async
startup(self) -> Coroutine[Any, Any, None]
Start the broker connection and the listener task.async
shutdown(self) -> Coroutine[Any, Any, None]
Close the broker connection and the listener task.
It also takes care to cancel all the tasks spawned bysend
andbroadcast
and
close all the connection objects before.
WebSocketProxy
The WebSocketProxy
class initialise callable objects that can be
used to start proxyng websocket messages from client to a server and viceversa.
It's initialised with a two parameters:
- client: a
WebSocket
object - server_endpoint: a
str
containing the endpoint of the server
Notice that the target server could be a remote server or the same server that starts the proxy.
async
__call__(self) -> Coroutine[Any, Any, None]
Start a websocket connection to server_endpoint and spawn two tasks:
one that forwards the messages from the client to the target and the other that
forwards the messages from the target to the client.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file fastapi-distributed-websocket-0.2.0.tar.gz
.
File metadata
- Download URL: fastapi-distributed-websocket-0.2.0.tar.gz
- Upload date:
- Size: 22.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ef8753faf58f881d1036a95a2bee4a3b70fa808a0f118ff7cf14496384f2206a |
|
MD5 | c103cdcdf9fca980ec30d474320d4891 |
|
BLAKE2b-256 | 3192118bd511272aaf331ba647cac318222bcabf5cdc507022bf9f6fe74cc3a3 |
File details
Details for the file fastapi_distributed_websocket-0.2.0-py3-none-any.whl
.
File metadata
- Download URL: fastapi_distributed_websocket-0.2.0-py3-none-any.whl
- Upload date:
- Size: 19.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3c7e95cacebc534e611c68b336d7fe026d310dcb68a2879d7b1581c1f2050915 |
|
MD5 | 88fe7b8f46ea86cb319c486dfcc1f061 |
|
BLAKE2b-256 | b562db21fb5fb52ababa0001cf64b3b35681ec3b2fc37bc681b0b778fc13235d |