Overview
gRPC protocol is exclusively based on HTTP/2 (aka h2) protocol. Main concepts:
each request in h2 connection is a bidirectional stream of frames;
streams give ability to do multiplexing - make requests in parallel using single TCP connection;
h2 has special flow control mechanism, which can help avoid network congestion and fix problems with slow clients, slow servers and slow network;
flow control only affects
DATA
frames, any other frame can be sent without limitations;streams can be cancelled individually, all other streams in h2 connection will continue work and there is no need to drop connection and reconnect;
h2 is a binary protocol and allows headers compression using HPACK format, so it is a very strict and efficient protocol.
h2 protocol is highly configurable, for example:
flow control mechanism can use dynamically configurable initial window size, to better match different use cases and conditions;
you can set maximum frame size to control how much data you will receive in each frame;
you can limit number of concurrent streams for h2 connection.
gRPC protocol adds to h2 protocol messages encoding format and a notion about metadata. gRPC metadata == additional h2 headers. So gRPC has the same level of extensibility as HTTP has.
Messages are sent using one or several DATA
frames, depending on maximum
frame size setting and message size. Messages are encoded using simple format:
prefix + data. Prefix contains length of the data and compression flag. You
can learn gRPC wire protocol in more details here: gRPC format.
gRPC has 4 method types: unary-unary, unary-stream (e.g. download), stream-unary (e.g. upload), stream-stream. They are all the same, the only difference is how many messages are sent in each direction: exactly one (unary) or any number of messages (stream).
Cancellation
As it was said above, h2 allows you to cancel any stream without affecting other
streams, which are living in the same connection. And h2 protocol has special
frame to do this: RST_STREAM. Both client and server can cancel streams.
This feature automatically gives you ability to proactively cancel gRPC method
calls in the same way. In grpclib
you can cancel method calls immediately,
for example:
client sends request to the server
server spawns task to handle this request
client wants to cancel this request and sends
RST_STREAM
frameserver receives
RST_STREAM
frame and cancels task immediately
Most other protocols doesn’t have this feature, so they have to terminate whole TCP connection and perform reconnect for the next call. It is also not obvious how to immediately detect terminated connections on the other side, and this means that server most likely will continue result computations, when this result is not needed anymore.
Deadlines
Deadlines are basically timeouts, which are propagated from service to service, to meet initial timeout constrains. This is a simple and powerful idea.
Example:
service X receives request with
grpc-timeout: 100m
in metadata (100m means 100 milliseconds)service X immediately converts timeout into deadline:
deadline = time.monotonic() + grpc_timeout
service X spent 20ms doing some work
now service X wants to make outgoing request to service Y, so it computes how much time remains to perform this request:
new_timeout = max(deadline - time.monotonic(), 0) # == 80ms
service X performs request to service Y with metadata
grpc-timeout: 80m
service Y uses the same logic to convert timeout -> deadline -> timeout.
With this feature it is possible to cancel whole call chain simultaneously, even in case of network failures (broken connections).
grpclib
grpclib
tries to give you full control over these bidirectional h2 streams.
Note
[auto] mark below means that it is not necessary to explicitly call these methods in your code, they will be called automatically behind the scenes. They are exists to have more control.
send_request
[auto]
request accepted
send_message
recv_message
Now you have to end stream from the client-side, and you can do this in two ways:
send_message(message, end=True)
- lastDATA
frame will containEND_STREAM
flagend()
- one extra frame will be sent, as shown below. It is better to avoid this way if possible.
end
(optional, read note above)
send_initial_metadata
[auto] You can
send initial metadata
even before receiving messages from the client. RPC success
or failure in gRPC
protocol is indicated in trailers.
recv_initial_metadata
[auto]
send_message
recv_message
send_trailing_metadata
[auto]
recv_trailing_metadata
[auto]