gRPC & Protobufs
Implementing Unary and Streaming RPC Communication Patterns
Master the four gRPC service types—unary, server streaming, client streaming, and bidirectional streaming—to handle diverse data requirements.
The Architecture of Contract-First Communication
In traditional web development, we often rely on RESTful APIs where data structures are loosely defined and the documentation is frequently out of sync with the implementation. This leads to runtime errors when a field is missing or a type changes unexpectedly. gRPC addresses this problem by enforcing a contract-first approach, using Protocol Buffers as the Interface Definition Language.
A Protocol Buffer file serves as a single source of truth for both the server and the client. It defines the structure of the messages and the available service methods. By compiling this file, developers generate typed stubs in their language of choice, ensuring that data is valid before it even leaves the application layer.
Beyond type safety, gRPC leverages the HTTP/2 protocol to provide significant performance improvements over traditional HTTP/1.1 APIs. While older APIs transmit data as plain text JSON, gRPC uses a highly efficient binary serialization format. This reduces the size of the payload on the wire and minimizes the CPU cycles required for parsing.
HTTP/2 also introduces multiplexing, allowing multiple requests to be sent over a single TCP connection simultaneously. This eliminates the application-level head-of-line blocking common in HTTP/1.1 (although head-of-line blocking at the TCP layer can still occur). Understanding these underlying mechanics is essential for choosing the correct communication pattern for your microservices.
Defining the Service Schema
The first step in any gRPC project is defining the service in a .proto file. This file lists the request and response message types along with the RPC methods that operate on them. Each field in a message is assigned a unique number to ensure backward and forward compatibility as the schema evolves.
In a real-world financial system, you might define an AccountService that manages bank transactions. The schema acts as a formal agreement between the frontend, backend, and any external partners. This prevents breaking changes from leaking into production environments without being caught at compile time.
```proto
syntax = "proto3";

package finance.v1;

// Represents a bank transaction request
message TransactionRequest {
  string account_id = 1;
  double amount = 2;
  string currency = 3;
}

// Represents the status of the processed transaction
message TransactionResponse {
  string status = 1;
  string transaction_id = 2;
  double updated_balance = 3;
}

// The service definition with different RPC types
service AccountService {
  // A simple unary call
  rpc ProcessTransaction(TransactionRequest) returns (TransactionResponse);
}
```

Unary RPCs: The Foundation of Request-Response
Unary RPCs are the most common pattern in gRPC and function similarly to a traditional function call or a REST API endpoint. The client sends a single request message to the server and receives a single response back. This pattern is ideal for operations like creating a resource, performing a lookup, or processing a login request.
Under the hood, a unary call starts with the client sending a set of HTTP/2 HEADERS frames followed by a single DATA frame containing the serialized message. The server processes the request and sends back its own HEADERS, a DATA frame with the result, and finally TRAILERS. These trailers carry the status code and any additional metadata about the operation.
While it looks simple, the overhead of establishing a connection and managing the lifecycle is handled efficiently by the gRPC library. Developers do not need to worry about manually constructing headers or managing status codes. The generated client stub provides a clean interface that abstracts the network complexity away.
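To make the stub abstraction concrete, here is a sketch of what a Go client for the AccountService above might look like. The server address, account values, and the generated pb package path are assumptions, and grpc.NewClient requires a recent version of grpc-go:

```go
package main

import (
	"context"
	"log"
	"time"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"

	pb "github.com/example/finance/proto" // hypothetical generated package
)

func main() {
	// Create a client connection; in production, use TLS credentials instead.
	conn, err := grpc.NewClient("localhost:50051",
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatalf("failed to connect: %v", err)
	}
	defer conn.Close()

	client := pb.NewAccountServiceClient(conn)

	// Set a deadline so the call cannot hang indefinitely.
	ctx, cancel := context.WithTimeout(context.Background(), 2*time.Second)
	defer cancel()

	resp, err := client.ProcessTransaction(ctx, &pb.TransactionRequest{
		AccountId: "acct_123",
		Amount:    250.00,
		Currency:  "USD",
	})
	if err != nil {
		log.Fatalf("transaction failed: %v", err)
	}
	log.Printf("status=%s balance=%.2f", resp.Status, resp.UpdatedBalance)
}
```

Note that the call itself is a single method invocation; the stub handles serialization, the HTTP/2 framing, and status propagation behind the scenes.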
However, unary calls have limitations when dealing with massive datasets or long-running operations. If a response is too large, it might exceed the configured message size limit or cause memory pressure on the client. For these scenarios, we must look toward streaming patterns to break data into manageable chunks.
Implementing a Unary Service
In a Go implementation, the server satisfies an interface generated from the protobuf file. You implement the logic within a method that takes a context and the request object. The context allows you to handle timeouts and cancellations, which are critical for building resilient distributed systems.
Handling errors in unary calls involves returning a gRPC status code rather than a simple boolean or a generic error string. This allows the client to distinguish between transient network issues and permanent validation failures. Properly mapped status codes ensure that the system behaves predictably across different services.
```go
package main

import (
	"context"

	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"

	pb "github.com/example/finance/proto"
)

type server struct {
	pb.UnimplementedAccountServiceServer
}

func (s *server) ProcessTransaction(ctx context.Context, req *pb.TransactionRequest) (*pb.TransactionResponse, error) {
	if req.Amount <= 0 {
		// Return a specific gRPC error code for invalid input
		return nil, status.Error(codes.InvalidArgument, "Transaction amount must be positive")
	}

	// Logic for processing the transaction goes here
	return &pb.TransactionResponse{
		Status:         "SUCCESS",
		TransactionId:  "tx_998877",
		UpdatedBalance: 1500.50,
	}, nil
}
```

Leveraging Server and Client Streaming
Streaming patterns allow for the continuous flow of data between the client and server. In server streaming, the client sends one request and the server responds with a sequence of messages. This is particularly useful for features like live activity feeds, stock tickers, or exporting large reports where you want to show progress to the user.
The server keeps the HTTP/2 stream open and sends multiple DATA frames until the operation is complete. This avoids the need for the client to poll the server repeatedly, which saves bandwidth and reduces latency. Once all data is sent, the server closes the stream by sending the trailing metadata.
Client streaming reverses this logic by allowing the client to send a stream of messages to the server. The server waits until it has received all messages or a specific signal before sending a single response back. This pattern is essential for high-frequency data ingestion, such as uploading logs or telemetry data from an IoT device.
When using client streaming, the server must be prepared to handle a long-running stream without exhausting resources. It usually processes messages as they arrive and maintains an internal state until the client signals the end of the stream. This allows for efficient memory usage because you do not need to load the entire dataset into memory at once.
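As an illustrative sketch, assuming a hypothetical UploadTransactions client-streaming method had been added to the AccountService, a Go handler could process each message as it arrives and reply once the client closes its side of the stream:

```go
// Hypothetical client-streaming handler: the client sends many
// TransactionRequest messages; the server replies exactly once.
func (s *server) UploadTransactions(stream pb.AccountService_UploadTransactionsServer) error {
	var count int32
	var total float64

	for {
		req, err := stream.Recv()
		if err == io.EOF {
			// The client has finished sending; return the single response.
			return stream.SendAndClose(&pb.TransactionResponse{
				Status:        "BATCH_ACCEPTED",
				TransactionId: fmt.Sprintf("batch of %d, total %.2f", count, total),
			})
		}
		if err != nil {
			return err // transport failure or client cancellation
		}

		// Process each message incrementally instead of buffering them all.
		count++
		total += req.Amount
	}
}
```

Because only running totals are kept, memory usage stays flat regardless of how many messages the client sends.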
Real-Time Updates with Server Streaming
To define a streaming method, you simply add the stream keyword before the request or response type in the protobuf file. The generated code for the server will then provide a stream object that allows you to call a Send method multiple times. This abstraction makes it feel like writing to a standard output stream.
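For example, the AccountService could be extended with streaming methods like these; the method names are illustrative, not part of the schema defined earlier:

```proto
service AccountService {
  // Server streaming: one request, a stream of responses
  rpc WatchBalance(TransactionRequest) returns (stream TransactionResponse);

  // Client streaming: a stream of requests, one response
  rpc UploadTransactions(stream TransactionRequest) returns (TransactionResponse);

  // Bidirectional streaming: both sides stream independently
  rpc Chat(stream TransactionRequest) returns (stream TransactionResponse);
}
```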
Consider a monitoring system that tracks server health metrics. Instead of the dashboard asking for the current CPU usage every second, the server can push updates as they occur. This ensures that the user interface is always synchronized with the actual state of the infrastructure.
- Server Streaming: Single request, multiple responses. Use for live updates or large downloads.
- Client Streaming: Multiple requests, single response. Use for file uploads or batch processing.
- Flow Control: gRPC automatically handles backpressure to prevent the sender from overwhelming the receiver.
- Deadlines: Always set a deadline on streams to prevent them from hanging indefinitely due to network partitions.
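A minimal Go sketch of the server-streaming side, assuming a hypothetical WatchBalance method declared with a stream response and the generated stub types that would accompany it:

```go
// Hypothetical server-streaming handler: one request in, many responses out.
func (s *server) WatchBalance(req *pb.TransactionRequest, stream pb.AccountService_WatchBalanceServer) error {
	for i := 0; i < 5; i++ {
		// Respect the client's deadline or cancellation between sends.
		if err := stream.Context().Err(); err != nil {
			return status.FromContextError(err).Err()
		}

		update := &pb.TransactionResponse{
			Status:         "UPDATE",
			TransactionId:  req.AccountId,
			UpdatedBalance: 1500.50 + float64(i), // placeholder values
		}
		if err := stream.Send(update); err != nil {
			return err // the client went away or the stream broke
		}

		time.Sleep(time.Second) // simulate updates arriving over time
	}
	return nil // returning closes the stream and sends the trailers
}
```

Returning from the handler is what ends the stream; there is no explicit Close call on the server side.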
Bidirectional Streaming and Advanced Patterns
Bidirectional streaming is the most flexible and complex pattern provided by gRPC. In this mode, both the client and the server send a sequence of messages using a single persistent stream. The two streams operate independently, meaning the server can respond as soon as it gets a message or wait to collect several messages before replying.
This pattern is the foundation for highly interactive applications like collaborative document editing, multiplayer gaming, and complex chat systems. It allows for full-duplex communication where both parties can initiate data transfer at any time. This reduces the overhead of establishing new connections for every interaction.
Managing bidirectional streams requires careful attention to concurrency. Typically, you will run a dedicated goroutine or thread for reading incoming messages while the main logic handles sending outgoing data. Error handling also becomes more nuanced, as one side of the stream might fail while the other is still active.
A common pitfall is failing to handle the close signal correctly on both sides. If the client closes its stream, the server must finish its remaining tasks and close its end to avoid resource leaks. Implementing heartbeat messages is also a recommended practice to ensure that the connection remains healthy during periods of inactivity.
Architecting a Bi-Di Chat Service
In a chat application, a bidirectional stream allows users to send messages and receive incoming messages simultaneously. The server acts as a message router, identifying which stream belongs to which user and forwarding data accordingly. This creates a seamless, low-latency experience for the participants.
Bidirectional streaming is powerful but introduces significant complexity in state management. Only choose this pattern when the application requirements truly demand real-time, two-way interaction that cannot be achieved through simpler unary or one-way streaming models.
When implementing this in code, you often use a loop to continuously listen for incoming messages from the stream's Recv method. If Recv returns an EOF error, it indicates that the other party has finished sending data. This is the signal to clean up any allocated resources and terminate the local stream handler.
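Putting that together, here is a hedged sketch of an echo-style bidirectional handler, assuming a hypothetical Chat method with stream on both the request and response:

```go
// Hypothetical bidirectional handler: Recv and Send share one stream.
func (s *server) Chat(stream pb.AccountService_ChatServer) error {
	for {
		msg, err := stream.Recv()
		if err == io.EOF {
			return nil // client closed its send side; close ours too
		}
		if err != nil {
			return err
		}

		// This handler replies immediately, but a real router could buffer
		// messages or push unsolicited updates from another goroutine
		// (guarding Send with a mutex, since a stream's Send is not safe
		// for concurrent use by multiple goroutines).
		reply := &pb.TransactionResponse{
			Status:        "RECEIVED",
			TransactionId: msg.AccountId,
		}
		if err := stream.Send(reply); err != nil {
			return err
		}
	}
}
```

A production chat router would additionally register each stream in a shared map keyed by user, so that a message received on one stream can be forwarded out on another.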
Choosing the Right RPC Type
Selecting the appropriate RPC type is an architectural decision that impacts the scalability and maintainability of your system. Unary calls are the safest default choice because they are easy to understand, test, and debug. They fit most standard CRUD operations and are well-supported by load balancers and service meshes.
Streaming should be reserved for scenarios where the data size is unpredictable or the timing of the data is critical. While it offers performance benefits, streaming is harder to load balance because a single persistent connection might pin a client to one specific server instance for a long duration. This can lead to uneven traffic distribution in a large cluster.
Security is another consideration when choosing patterns. Long-lived streams require robust authentication and authorization checks at the start of the stream. Since the connection stays open, you must also consider how to handle token expiration or permission changes during the lifetime of the stream.
Ultimately, gRPC provides the tools to build diverse communication styles within a single framework. By mastering these four types, you can build microservices that are not only fast and efficient but also clearly documented and type-safe. Always prioritize the simplest pattern that meets your technical requirements to avoid unnecessary architectural debt.
Decision Matrix for Developers
Use Unary for atomic operations where the request and response fit easily in memory. This includes fetching a user profile, updating a database record, or checking the status of a specific task. It is the most robust pattern for general-purpose API development.
Use Server Streaming for read-only data that arrives over time, such as a log viewer or a notification system. Use Client Streaming for write-heavy operations where you are sending bulk data to the server. Finally, use Bidirectional Streaming for true real-time, interactive synchronization between two entities.
- Reliability: Unary is the easiest to retry automatically using standard gRPC interceptors.
- Efficiency: Streaming reduces the per-message overhead but increases server-side state complexity.
- Observability: Ensure your tracing and logging tools support streaming events to avoid blind spots in your distributed system.
- Compatibility: Some older proxy servers and load balancers may struggle with long-lived HTTP/2 streams.
