Integrate the Sans-I/O layer#

This guide explains how to integrate the Sans-I/O layer of websockets to add support for WebSocket in another library.

As a prerequisite, you should decide how you will handle network I/O and asynchronous control flow.

Your integration layer will provide an API for the application on one side, will talk to the network on the other side, and will rely on websockets to implement the protocol in the middle.

../../_images/data-flow.svg

Opening a connection#

Client-side#

If you’re building a client, parse the URI you’d like to connect to:

from websockets.uri import parse_uri

wsuri = parse_uri("ws://example.com/")

Open a TCP connection to (wsuri.host, wsuri.port) and perform a TLS handshake if wsuri.secure is True.

Initialize a ClientProtocol:

from websockets.client import ClientProtocol

protocol = ClientProtocol(wsuri)

Create a WebSocket handshake request with connect() and send it with send_request():

request = protocol.connect()
protocol.send_request(request)

Then, call data_to_send() and send its output to the network, as described in Send data below.

Once you receive enough data, as explained in Receive data below, the first event returned by events_received() is the WebSocket handshake response.

When the handshake fails, the reason is available in handshake_exc:

if protocol.handshake_exc is not None:
    raise protocol.handshake_exc

Else, the WebSocket connection is open.

A WebSocket client API usually performs the handshake then returns a wrapper around the network socket and the ClientProtocol.

Server-side#

If you’re building a server, accept network connections from clients and perform a TLS handshake if desired.

For each connection, initialize a ServerProtocol:

from websockets.server import ServerProtocol

protocol = ServerProtocol()

Once you receive enough data, as explained in Receive data below, the first event returned by events_received() is the WebSocket handshake request.

Create a WebSocket handshake response with accept() and send it with send_response():

response = protocol.accept(request)
protocol.send_response(response)

Alternatively, you may reject the WebSocket handshake and return an HTTP response with reject():

response = protocol.reject(status, explanation)
protocol.send_response(response)

Then, call data_to_send() and send its output to the network, as described in Send data below.

Even when you call accept(), the WebSocket handshake may fail if the request is incorrect or unsupported.

When the handshake fails, the reason is available in handshake_exc:

if protocol.handshake_exc is not None:
    raise protocol.handshake_exc

Else, the WebSocket connection is open.

A WebSocket server API usually builds a wrapper around the network socket and the ServerProtocol. Then it invokes a connection handler that accepts the wrapper in argument.

It may also provide a way to close all connections and to shut down the server gracefully.

Going forwards, this guide focuses on handling an individual connection.

From the network to the application#

Go through the five steps below until you reach the end of the data stream.

Receive data#

When receiving data from the network, feed it to the protocol’s receive_data() method.

When reaching the end of the data stream, call the protocol’s receive_eof() method.

For example, if sock is a socket:

try:
    data = sock.recv(65536)
except OSError:  # socket closed
    data = b""
if data:
    protocol.receive_data(data)
else:
    protocol.receive_eof()

These methods aren’t expected to raise exceptions — unless you call them again after calling receive_eof(), which is an error. (If you get an exception, please file a bug!)

Send data#

Then, call data_to_send() and send its output to the network:

for data in protocol.data_to_send():
    if data:
        sock.sendall(data)
    else:
        sock.shutdown(socket.SHUT_WR)

The empty bytestring signals the end of the data stream. When you see it, you must half-close the TCP connection.

Sending data right after receiving data is necessary because websockets responds to ping frames, close frames, and incorrect inputs automatically.

Expect TCP connection to close#

Closing a WebSocket connection normally involves a two-way WebSocket closing handshake. Then, regardless of whether the closure is normal or abnormal, the server starts the four-way TCP closing handshake. If the network fails at the wrong point, you can end up waiting until the TCP timeout, which is very long.

To prevent dangling TCP connections when you expect the end of the data stream but you never reach it, call close_expected() and, if it returns True, schedule closing the TCP connection after a short timeout:

# start a new execution thread to run this code
sleep(10)
sock.close()  # does nothing if the socket is already closed

If the connection is still open when the timeout elapses, closing the socket makes the execution thread that reads from the socket reach the end of the data stream, possibly with an exception.

Close TCP connection#

If you called receive_eof(), close the TCP connection now. This is a clean closure because the receive buffer is empty.

After receive_eof() signals the end of the read stream, data_to_send() always signals the end of the write stream, unless it already ended. So, at this point, the TCP connection is already half-closed. The only reason for closing it now is to release resources related to the socket.

Now you can exit the loop relaying data from the network to the application.

Receive events#

Finally, call events_received() to obtain events parsed from the data provided to receive_data():

events = connection.events_received()

The first event will be the WebSocket opening handshake request or response. See Opening a connection above for details.

All later events are WebSocket frames. There are two types of frames:

  • Data frames contain messages transferred over the WebSocket connections. You should provide them to the application. See Fragmentation below for how to reassemble messages from frames.

  • Control frames provide information about the connection’s state. The main use case is to expose an abstraction over ping and pong to the application. Keep in mind that websockets responds to ping frames and close frames automatically. Don’t duplicate this functionality!

From the application to the network#

The connection object provides one method for each type of WebSocket frame.

For sending a data frame:

These methods raise ProtocolError if you don’t set the FIN bit correctly in fragmented messages.

For sending a control frame:

send_close() initiates the closing handshake. See Closing a connection below for details.

If you encounter an unrecoverable error and you must fail the WebSocket connection, call fail().

After any of the above, call data_to_send() and send its output to the network, as shown in Send data above.

If you called send_close() or fail(), you expect the end of the data stream. You should follow the process described in Close TCP connection above in order to prevent dangling TCP connections.

Closing a connection#

Under normal circumstances, when a server wants to close the TCP connection:

  • it closes the write side;

  • it reads until the end of the stream, because it expects the client to close the read side;

  • it closes the socket.

When a client wants to close the TCP connection:

  • it reads until the end of the stream, because it expects the server to close the read side;

  • it closes the write side;

  • it closes the socket.

Applying the rules described earlier in this document gives the intended result. As a reminder, the rules are:

  • When data_to_send() returns the empty bytestring, close the write side of the TCP connection.

  • When you reach the end of the read stream, close the TCP connection.

  • When close_expected() returns True, if you don’t reach the end of the read stream quickly, close the TCP connection.

Fragmentation#

WebSocket messages may be fragmented. Since this is a protocol-level concern, you may choose to reassemble fragmented messages before handing them over to the application.

To reassemble a message, read data frames until you get a frame where the FIN bit is set, then concatenate the payloads of all frames.

You will never receive an inconsistent sequence of frames because websockets raises a ProtocolError and fails the connection when this happens. However, you may receive an incomplete sequence if the connection drops in the middle of a fragmented message.

Tips#

Serialize operations#

The Sans-I/O layer expects to run sequentially. If your interact with it from multiple threads or coroutines, you must ensure correct serialization. This should happen automatically in a cooperative multitasking environment.

However, you still have to make sure you don’t break this property by accident. For example, serialize writes to the network when data_to_send() returns multiple values to prevent concurrent writes from interleaving incorrectly.

Avoid buffers#

The Sans-I/O layer doesn’t do any buffering. It makes events available in events_received() as soon as they’re received.

You should make incoming messages available to the application immediately and stop further processing until the application fetches them. This will usually result in the best performance.