Paper deep dive

Bringing Network Coding into Multi-Robot Systems: Interplay Study for Autonomous Systems over Wireless Communications

Anil Zaher, Kiril Solovey, Alejandro Cohen

Year: 2026 | Venue: arXiv preprint | Area: cs.RO | Type: Preprint | Embeddings: 51

Intelligence

Status: succeeded | Model: google/gemini-3.1-flash-lite-preview | Prompt: intel-v1 | Confidence: 95%

Last extracted: 3/22/2026, 5:02:43 AM

Summary

This paper investigates the integration of adaptive causal network coding (AC-RLNC) into multi-robot systems (MRS) to mitigate communication-induced delays and packet losses. By comparing AC-RLNC against traditional retransmission-based protocols (SR-ARQ) and best-effort delivery (UDP), the authors demonstrate that network coding significantly improves estimation consistency in cooperative localization and enhances safety-critical deadline reliability in vehicle-to-vehicle (V2V) overtaking maneuvers, advocating for a co-design approach between autonomy algorithms and communication protocols.

Entities (5)

Adaptive Causal Random Linear Network Coding · protocol · 99%
Multi-Robot Systems · system · 98%
Cooperative Localization · task · 95%
Safety-Critical Overtaking · task · 95%
Selective-Repeat ARQ · protocol · 95%

Relation Signals (3)

Communication Layer → affects → Autonomy Pipeline

confidence 98% · the study highlights the need to jointly design autonomy algorithms and communication mechanisms

AC-RLNC → improves → Deadline Reliability

confidence 95% · Our results demonstrate that coding-based communication significantly reduces in-order delivery stalls... and improves deadline reliability

SR-ARQ → introduces → Head-of-line blocking

confidence 95% · SR-ARQ enforces ordered delivery... Under loss, this ordered release can increase in-order delay due to head-of-line blocking.

Cypher Suggestions (2)

Map the impact of protocols on system performance metrics · confidence 95% · unvalidated

MATCH (p:Protocol)-[r:IMPROVES]->(m:Metric) RETURN p.name, r.relation, m.name

Find all protocols evaluated in the study · confidence 90% · unvalidated

MATCH (p:Protocol)-[:EVALUATED_IN]->(s:Study) RETURN p.name

Abstract

Communication is a core enabler for multi-robot systems (MRS), providing the mechanism through which robots exchange state information, coordinate actions, and satisfy safety constraints. While many MRS autonomy algorithms assume reliable and timely message delivery, realistic wireless channels introduce delay, erasures, and ordering stalls that can degrade performance and compromise safety-critical decisions of the robot task. In this paper, we investigate how transport-layer reliability mechanisms that mitigate communication losses and delays shape the autonomy-communication loop. We show that conventional non-coded retransmission-based protocols introduce long delays that are misaligned with the timeliness requirements of MRS applications, and may render the received data irrelevant. As an alternative, we advocate for adaptive and causal network coding, which proactively injects coded redundancy to achieve the desired delay and throughput that enable relevant data delivery to the robotic task. Specifically, this method adapts to channel conditions between robots and causally tunes the communication rates via efficient algorithms. We present two case studies: cooperative localization under delayed and lossy inter-robot communication, and a safety-critical overtaking maneuver where timely vehicle-to-vehicle message availability determines whether an ego vehicle can abort to avoid a crash. Our results demonstrate that coding-based communication significantly reduces in-order delivery stalls, preserves estimation consistency under delay, and improves deadline reliability relative to retransmission-based transport. Overall, the study highlights the need to jointly design autonomy algorithms and communication mechanisms, and positions network coding as a principled tool for dependable multi-robot operation over wireless networks.


Full Text

50,243 characters extracted from source content.

Bringing Network Coding into Multi-Robot Systems: Interplay Study for Autonomous Systems over Wireless Communications

Anil Zaher, Kiril Solovey, and Alejandro Cohen
Viterbi Faculty of Electrical and Computer Engineering, Technion–Israel Institute of Technology, Haifa, Israel
Emails: anil.zaher@campus.technion.ac.il, {kirilsol,alecohen}@technion.ac.il

Abstract: Communication is a core enabler for multi-robot systems (MRS), providing the mechanism through which robots exchange state information, coordinate actions, and satisfy safety constraints. While many MRS autonomy algorithms assume reliable and timely message delivery, realistic wireless channels introduce delay, erasures, and ordering stalls that can degrade performance and compromise safety-critical decisions of the robot task. In this paper, we investigate how transport-layer reliability mechanisms that mitigate communication losses and delays shape the autonomy–communication loop. We show that conventional non-coded retransmission-based protocols introduce long delays that are misaligned with the timeliness requirements of MRS applications, and may render the received data irrelevant. As an alternative, we advocate for adaptive and causal network coding, which proactively injects coded redundancy to achieve the desired delay and throughput that enable relevant data delivery to the robotic task. Specifically, this method adapts to channel conditions between robots and causally tunes the communication rates via efficient algorithms. We present two case studies: cooperative localization under delayed and lossy inter-robot communication, and a safety-critical overtaking maneuver where timely vehicle-to-vehicle message availability determines whether an ego vehicle can abort to avoid a crash.
Our results demonstrate that coding-based communication significantly reduces in-order delivery stalls, preserves estimation consistency under delay, and improves deadline reliability relative to retransmission-based transport. Overall, the study highlights the need to jointly design autonomy algorithms and communication mechanisms, and positions network coding as a principled tool for dependable multi-robot operation over wireless networks.

arXiv:2603.17472v1 [cs.RO] 18 Mar 2026

I. INTRODUCTION

Distributed systems form the backbone of modern computation, enabling numerous independent computing units to work together as a cohesive whole to achieve shared goals. When these principles are extended into the physical domain, they form the foundation of multi-robot systems (MRS), where networks of mobile agents cooperate to accomplish complex objectives [1]. Recent work demonstrates the potential of such cooperation at scale. For instance, by exchanging information beyond onboard sensing, connected autonomous vehicles can improve traffic efficiency and safety through coordinated maneuvers, cooperative perception, and intersection management [2]–[5]. These capabilities rely on the exchange of state and intent information, such as position, velocity, and planned maneuvers, allowing agents to anticipate and react to hazards earlier than would be possible with local sensing alone.

Fig. 1. Illustration of a cooperative localization scenario, which we consider in our first case study (see Sec. III-A and IV-A). Each robot obtains local GPS-like measurements and inter-robot measurements of nearby robots, which are shared over an inter-robot communication channel. The colored curves show the robots' ground-truth trajectories, while the arrows depict the communicated inter-robot observations. Arrow thickness and color intensity encode the communicated measurement delay (darker and thicker arrows indicate larger delays).

Enabling such cooperation places stringent demands on communication. In many coordination and safety-critical applications, state updates must be delivered within tight latency bounds, ranging from tens to hundreds of milliseconds for cooperative awareness, and down to single-digit milliseconds for coordinated maneuvers, while meeting very high reliability targets [6], [7]. These requirements are commonly captured under the notion of ultra-reliable low-latency communication (URLLC) [8], which characterizes the reliability and timeliness needed to support dependable autonomous operation.

These latency and reliability requirements arise directly in cooperative multi-robot applications, wherein each robot continuously tracks state information and makes real-time decisions based on information received over wireless links from other robots, while usually assuming timely and reliable delivery. In practice, however, communication networks are inherently non-ideal: wireless interference, shared-medium contention, buffering, and time-varying connectivity introduce delay, packet loss, and timing variability, causing information to arrive late, out of order, or not at all. Such effects can violate the assumptions underlying estimation and decision-making algorithms, potentially degrading synchronization, estimation accuracy, and safety. This gap is visible in two representative application areas: cooperative localization and safety-critical vehicle-to-vehicle (V2V)-enabled maneuvers.

Fig. 2. Visualization of the overtaking scenario we consider in our second case study (see Sections III-B and IV-B), with three panels: the initial configuration, a successful abort under AC-RLNC, and a collision scenario under SR-ARQ. The latter two are separate runs using the AC-RLNC and SR-ARQ transport mechanisms, respectively. Ego vehicle A (red) follows the outer lane behind truck T (yellow), while oncoming vehicle B (green) approaches from the opposite direction on the same lane. Timely V2V packet reception is required for A to detect the oncoming hazard and abort. Blue dots indicate time instants at which V2V packets from vehicle B are successfully received by the ego vehicle A.

Cooperative localization is a central capability in multi-robot systems, where inter-robot measurements improve pose estimation accuracy across the team [9]. Many formulations assume ideal or near-synchronous communication, even though wireless channels routinely introduce delay and packet loss that can destabilize Kalman filter (KF) based estimators [10]. Methods that address delayed or out-of-sequence measurements, such as smoothing approaches and out-of-sequence measurement (OOSM) filtering [11], [12], provide mechanisms for incorporating late information, yet they do not examine how communication-layer behavior produces the delay and loss patterns that shape estimator performance. As a result, the interaction between communication-layer behavior and cooperative localization remains insufficiently characterized.

Similarly, V2V communication enables safety applications such as forward-collision warning and cooperative maneuvers [4]. Formal safety frameworks, e.g., [13], define timing envelopes for safe decisions, and overtaking assistance systems further illustrate how timely inter-vehicle updates reduce collision risk [14]. However, these studies typically either assume that the communication layer satisfies the required timing and reliability bounds or evaluate it only through aggregate metrics, and do not evaluate how protocol behavior influences meeting safety deadlines.

Despite the central role of communication in enabling cooperation, much of the existing work on multi-robot systems treats communication as a secondary concern rather than a co-designed component of the autonomy pipeline.
In particular, reliability is often delegated to retransmission-based transport protocols that recover specific lost packets after feedback. While such mechanisms are effective in traditional data networks, they introduce round-trip-time (RTT) recovery delays and head-of-line blocking (footnote 1), which can significantly increase in-order delivery delay under loss. This gap has been noted in prior work and motivates tighter co-design between autonomy algorithms and communication mechanisms [15].

Contribution. Ignoring the communication layer can turn an otherwise correct MRS autonomy pipeline into a brittle system: late, missing, or irregularly-timed information can corrupt estimation and can undermine safety-critical decisions. To make these consequences concrete and motivate communication-aware co-design, we present a focused case study that examines the autonomy–communication loop from two complementary perspectives.

First, we evaluate cooperative localization under practical communication conditions (Fig. 1) by comparing a basic EKF-based approach with a delay-aware iterative re-estimation method (I-ReE), inspired by smoothing and OOSM handling [11], [12]. We then quantify how delivery behavior under non-ideal links, ranging from an unreliable baseline to reliable communication protocols, including adaptive coded protocols, shapes the information available to the estimator and the resulting estimation accuracy.

Second, we develop a similar analysis for a safety-critical overtaking scenario (Fig. 2), where timely V2V message availability determines whether an ego vehicle can safely abort before a computed deadline. We examine the complete autonomy and communication loop, rather than evaluating each ingredient in isolation. Overall, we emphasize that communication-layer behavior must be accounted for as a co-designed component of the MRS autonomy pipeline.
In addition, we show that the structure of the transport protocol fundamentally shapes the information available to the autonomy pipeline. Rather than relying solely on reactive retransmissions, we investigate adaptive causal random linear network coding (AC-RLNC) as a reliability mechanism that proactively injects coded redundancy, which enables decoding based on accumulated degrees of freedom rather than specific packet identities [16]–[18]. Our results demonstrate that network coding is not merely a throughput enhancement technique, but a mechanism that can directly affect estimation consistency and safety in multi-robot systems.

Footnote 1: RTT is the time between transmitting a packet and receiving its acknowledgment; after a loss, retransmission-based protocols incur at least one RTT before recovery. Head-of-line blocking occurs when ordered delivery prevents correctly received packets from being released because an earlier packet is missing.

Organization. In Sec. II, we provide necessary background on communication. In Sec. III, we describe the system models and the two case studies: cooperative localization under communication constraints and a safety-critical overtaking scenario under timing constraints. In Sec. IV, we present the experimental setup and results for both case studies. We conclude and discuss future directions in Sec. V.

II. BACKGROUND ON COMMUNICATION SYSTEMS

This section provides the background for interpreting our case studies and experimental results. It briefly reviews the relevant communication layers, introduces the transport protocols considered herein, and summarizes key communication challenges in connected MRS.

Modern communication networks are often described using a five- or seven-layer architecture, in which each layer provides a distinct set of services to the one above it [19]. The following layers are most relevant to our setting. The physical layer governs signal transmission.
In MRS, it mainly determines which robot pairs can sustain a usable link given their positions, obstacles and occlusions in the scene, and local propagation conditions (e.g., path-loss, fading, and multipath). The link layer organizes transmissions into frames and arbitrates access to the shared channel; in robot teams it enables one-hop exchanges among nearby robots over the same wireless channel. The network layer (e.g., Internet Protocol (IP)) provides addressing and routing across multi-hop networks; in mobile teams it determines how messages are forwarded when robots are not directly connected and when relays are needed as connectivity changes. The transport layer defines end-to-end delivery behavior (e.g., best-effort vs. reliable and unordered vs. ordered delivery), shaping how information is delivered to the application. In robotics, it determines whether autonomy modules receive independent datagrams or delivery with recovery and ordering. Finally, the application layer exposes communication services to higher-level algorithms; in robotic systems, it corresponds to task-level messages such as state/intent broadcasts and shared observations.

Reliable communication performance is typically characterized through an in-order delay–throughput tradeoff. In-order delay is the end-to-end time from message generation until information is delivered in the order messages were generated, while throughput is the rate at which information is successfully delivered to the receiver, accounting for protocol overhead and losses. The tradeoff arises because pushing for higher throughput and reliability often increases queueing and ordering stalls, raising in-order delay, while reducing in-order delay often requires added redundancy (e.g., using error-correction), which can reduce effective throughput.
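To make the in-order delay definition concrete, the following is a minimal sketch, assuming that under ordered delivery a packet can be released to the application only once it and every earlier packet have arrived. The function names and the toy timestamps are illustrative and do not come from the paper.

```python
from itertools import accumulate


def in_order_release_times(arrival_times):
    """Release time of each packet under ordered delivery.

    arrival_times[k] is the time packet k reaches the receiver. Packet k is
    released only after packets 0..k have all arrived, so its release time is
    the running maximum of the arrival times.
    """
    return list(accumulate(arrival_times, max))


def in_order_delays(send_times, arrival_times):
    """In-order delay per packet: release time minus generation time."""
    releases = in_order_release_times(arrival_times)
    return [r - s for r, s in zip(releases, send_times)]


# Packet 1 is recovered late; packets 2 and 3 arrive on time but must wait.
send = [0.0, 1.0, 2.0, 3.0]
arrive = [0.5, 4.5, 2.5, 3.5]
delays = in_order_delays(send, arrive)
print(delays)  # [0.5, 3.5, 2.5, 1.5]
```

In this toy run a single late packet inflates the delay of every packet behind it, which is the ordering stall the tradeoff refers to.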
In connected multi-robot systems, where both timeliness and reliability are essential, in-order delay is therefore the notion of delay that best reflects what the autonomy pipeline actually experiences.

Along the communication stack, several factors degrade these metrics: path-loss, fading, and obstruction can cause packet erasures; medium access and contention can increase channel access time; buffers in intermediate nodes add queueing delay or overflow under high load; and multi-hop forwarding and recovery mechanisms (e.g., feedback-based retransmissions using automatic repeat request (ARQ)) can further increase delay and reduce effective throughput. In multi-robot systems, these effects primarily manifest as loss, in-order delay, and variability in delivery time.

A. Transport Protocols

Among the outlined layers, we focus on the transport layer, where end-to-end delivery protocols are commonly implemented and which directly serves the autonomy algorithms. To study how delivery behavior affects the autonomy pipeline, we describe three transport protocols that represent common delivery behaviors with distinct timing characteristics: best-effort delivery, retransmission-based reliability with in-order delivery, and adaptive coded transport. These protocols determine the timing and availability characteristics of packet delivery that drive the estimator in cooperative localization and the reliable delivery required by the abort-by-deadline mechanism in the overtaking case study.

A.1) Classical Non-Coded Transport-Layer Protocols:

The User Datagram Protocol (UDP) [20] serves as our unreliable best-effort baseline. UDP provides a minimal, connectionless transport protocol without acknowledgments, retransmissions, congestion control, or ordering guarantees. Its simplicity results in low protocol overhead and minimal added delay, but lost or out-of-order packets are not recovered.
UDP is therefore a common choice when best-effort delivery is sufficient and the application can tolerate occasional loss or reordering.

Selective-Repeat ARQ (SR-ARQ) [21] serves as our retransmission-based reliable baseline. SR-ARQ improves reliability by retransmitting only packets that are explicitly reported as missing. Because it avoids unnecessary retransmissions, SR-ARQ is highly throughput-efficient and is therefore commonly used as a baseline for reliable data transfer. SR-ARQ enforces ordered delivery using a sliding window of outstanding packets, which allows multiple packets to be in transmission at once, but the receiver releases them to the application only in order, even if later packets arrive first. Under loss, this ordered release can increase in-order delay due to head-of-line blocking.

A.2) Code-Based Transport-Layer Protocols:

Next, we consider a network coding-based transport protocol as an alternative to feedback-driven retransmissions. In our setting, adaptive coded transport is relevant because it can reduce in-order delivery stalls under loss and improve the timeliness of information delivered to the autonomy pipeline. Network coding improves robustness by transmitting coded packets that combine information from multiple original packets, rather than sending or retransmitting a specific missing packet. A common realization of this idea is Random Linear Network Coding (RLNC) [22], which serves as the foundation for the adaptive causal coded protocol (AC-RLNC) used later in this work. In RLNC, the receiver can recover the originals once it has collected enough linearly independent coded packets, which makes recovery depend on how many coded packets were received rather than which ones.
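The rank-based recovery property of RLNC can be illustrated with a small sketch. It is a toy block-RLNC example, not the AC-RLNC protocol, and it makes two simplifying assumptions: coefficients are drawn over GF(2) (so a linear combination is a plain XOR, whereas practical systems often use larger fields), and packets are single integers. The `encode` function, the `Decoder` class, and the 30% erasure rate are hypothetical choices for illustration.

```python
import random


def encode(packets, rng):
    """Return (coeffs, payload): a random nonzero XOR combination of packets.

    Bit j of coeffs is 1 when packets[j] is included in the combination.
    """
    coeffs = 0
    while coeffs == 0:
        coeffs = rng.getrandbits(len(packets))
    payload = 0
    for j, packet in enumerate(packets):
        if (coeffs >> j) & 1:
            payload ^= packet
    return coeffs, payload


class Decoder:
    """Incremental Gaussian elimination over GF(2)."""

    def __init__(self, k):
        self.k = k
        self.rows = {}  # pivot bit -> (coeffs, payload), kept fully reduced

    def receive(self, coeffs, payload):
        """Absorb a coded packet; return True if it added a degree of freedom."""
        for pivot, (c, p) in self.rows.items():
            if (coeffs >> pivot) & 1:
                coeffs ^= c
                payload ^= p
        if coeffs == 0:
            return False  # linearly dependent on packets already held
        pivot = coeffs.bit_length() - 1
        # Clear the new pivot bit from every stored row to stay fully reduced.
        for q, (c, p) in list(self.rows.items()):
            if (c >> pivot) & 1:
                self.rows[q] = (c ^ coeffs, p ^ payload)
        self.rows[pivot] = (coeffs, payload)
        return True

    def decode(self):
        """Return the original packets once rank k is reached, else None."""
        if len(self.rows) < self.k:
            return None
        return [self.rows[j][1] for j in range(self.k)]


rng = random.Random(0)
packets = [0x11, 0x22, 0x33, 0x44]  # four toy one-byte payloads
decoder = Decoder(len(packets))
while decoder.decode() is None:
    coded = encode(packets, rng)
    if rng.random() < 0.3:          # erased on the channel, never retransmitted
        continue
    decoder.receive(*coded)
decoded = decoder.decode()
```

The sender never learns which coded packets were erased; it only keeps sending fresh combinations until the receiver holds four independent ones.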
From the perspective of autonomy, the key distinction between retransmission-based transport and coded transport lies in how losses are repaired: ARQ-based schemes recover specific missing packets after feedback, coupling recovery time to the RTT and potentially inducing head-of-line blocking, whereas network coding transforms erasure recovery into a degree-of-freedom accumulation problem, where any sufficiently large set of independent coded packets enables decoding. This decouples reliability from the identity of individual packets and reduces sensitivity to burst losses (i.e., multiple consecutive packet erasures) and timing variability, properties that are particularly relevant for real-time multi-robot cooperation.

Adaptive Causal RLNC (AC-RLNC) [16] serves as our adaptive coded transport protocol. It extends RLNC for real-time settings (i.e., delay-sensitive applications where messages must be delivered within tight deadlines) by using a sliding coding window and adapting redundancy both proactively (to mask expected losses) and reactively (based on feedback). This enables robust and timely delivery without waiting for specific missing packets: once enough coded packets are received, the receiver can decode the corresponding set of original packets and release them in order. In particular, by reducing in-order stalls while maintaining reliability, AC-RLNC helps mitigate and manage the in-order delay–throughput tradeoff that is central to our setting.

B. Communication Challenges in Multi-Robot Systems

The above challenges are exacerbated in MRS. Robots operate in cluttered, dynamic environments where motion and obstacles cause time-varying degradation in link quality, while the robots seek relevant (freshest) task-oriented data delivered on a tight schedule. Teams share wireless spectrum, leading to contention that induces unpredictable timing variability in packet delivery.
Topology changes as robots move in and out of range, frequently altering which communication links exist. In addition, practical deployments are often heterogeneous: different agents may experience different link qualities, traffic loads, and communication modalities, so end-to-end behavior can vary across the team and over time. Extensive reviews highlight that such factors make multi-robot communication considerably more complex than static wireless networking, and that communication mechanisms are still rarely co-optimized for robotic task requirements [15], [23].

Crucially, in these settings we do not necessarily care about delay and throughput in isolation. What matters is whether delivered information is still useful to the autonomy pipeline. We therefore distinguish urgency, where a message is useful only if it arrives before a task-dependent deadline, and freshness, where an old update may be stale even if it is eventually delivered. These considerations motivate the case-study analysis presented in this work and point to the need for communication mechanisms that better align delivery behavior with autonomy requirements.

III. SYSTEM MODELS AND CASE STUDIES

Here, we describe the two system models considered in our case studies: (i) a multi-robot cooperative localization problem subject to communication delay and packet loss, and (ii) a safety-critical overtaking scenario in which communication timing determines collision avoidance. We formalize the motion, sensing, communication, transport-layer, and estimation models used in both studies. In both settings, we sought scenarios that are simple to describe yet capture important problems under realistic assumptions and models.

A. Cooperative Localization

This setting consists of a group of n robots moving within a shared workspace in a semi-random fashion (Fig. 1; for visual clarity, the figure depicts a simplified example with five robots).
The goal of each robot is to estimate its state by relying on information from two noisy sources: (1) self-measurements using a GPS-like sensor and (2) inter-robot measurements generated by neighboring robots and exchanged over the wireless channel, subject to communication-induced delay and loss.

Motion model. Each robot $i \in \mathcal{R}$ evolves over discrete time slots $t \in \{0, \ldots, T\}$ with sampling period $\Delta t$ and state $\mathbf{x}^i_t = (x^i_t, y^i_t, \theta^i_t) \in \mathbb{R}^3$, where $(x^i_t, y^i_t)$ denotes the planar position and $\theta^i_t$ denotes the heading. The robot follows Ackermann steering kinematics [24] with control input $\mathbf{u}^i_t = (v^i_t, \delta^i_t)$, where $v^i_t$ is the forward velocity and $\delta^i_t$ is the steering angle, and wheelbase $L$. The deterministic motion update rule is

$$\mathbf{x}^i_t = f(\mathbf{x}^i_{t-1}, \mathbf{u}^i_t) = \begin{pmatrix} x^i_{t-1} + \Delta t\, v^i_t \cos\theta^i_{t-1} \\ y^i_{t-1} + \Delta t\, v^i_t \sin\theta^i_{t-1} \\ \theta^i_{t-1} + \Delta t\, \dfrac{v^i_t \tan(\delta^i_t)}{L} \end{pmatrix} \tag{1}$$

The robots are initialized with random positions and accelerations drawn from uniform distributions within predefined bounds, and with zero initial steering angle. During the simulation, control inputs are updated in a piecewise-constant manner: every $\tilde{T}$ time slots we resample the acceleration and sample a new steering angle, and every $\hat{T}$ time slots we reset the steering angle to zero to avoid persistent circular motion. To prevent collisions, we apply a simple reactive rule: when two robots are detected to be closing on each other, both robots resample steering commands to increase separation. The velocity is updated by integrating acceleration with saturation to a predefined maximum speed before applying the model in Eq. (1).

Sensor model. Each robot $i$ obtains two measurement types. The first is a GPS-like self-measurement of its location with Gaussian noise $\varepsilon^i_{\mathrm{GPS}}(t) \sim \mathcal{N}(0, \sigma^2_{\mathrm{GPS}} I_2)$, i.e.,

$$z^{i,\mathrm{GPS}}_t = \begin{pmatrix} x^{i,\mathrm{true}}_t \\ y^{i,\mathrm{true}}_t \end{pmatrix} + \varepsilon^i_{\mathrm{GPS}}(t).$$
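The motion update in Eq. (1) and the GPS-like measurement above can be sketched in a few lines. This is a minimal illustration rather than the authors' simulator; the function names and all numeric values are assumptions chosen for the example.

```python
import math
import random


def ackermann_step(state, control, dt, wheelbase):
    """One step of Eq. (1): state = (x, y, theta), control = (v, delta)."""
    x, y, theta = state
    v, delta = control
    return (
        x + dt * v * math.cos(theta),
        y + dt * v * math.sin(theta),
        theta + dt * v * math.tan(delta) / wheelbase,
    )


def gps_measurement(true_state, sigma_gps, rng):
    """True planar position plus independent zero-mean Gaussian noise per axis."""
    x, y, _ = true_state
    return (x + rng.gauss(0.0, sigma_gps), y + rng.gauss(0.0, sigma_gps))


# Illustrative values only: 2 m/s straight ahead for one 0.1 s slot.
state = ackermann_step((0.0, 0.0, 0.0), (2.0, 0.0), dt=0.1, wheelbase=2.5)
z_gps = gps_measurement(state, sigma_gps=0.5, rng=random.Random(0))
```

With zero steering the heading stays constant and the robot advances by $\Delta t\, v$ along its heading, which gives a quick sanity check on the update.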
This measurement can be obtained by the robot at each time slot $t$, where $\sigma_{\mathrm{GPS}}$ is a fixed standard deviation parameter of the GPS measurement noise, assumed known to the robot. In addition, robot $i$ can obtain an inter-robot measurement (e.g., by using a light detection and ranging (LiDAR) sensor) of robot $j$ as

$$z^{ij}_t = \begin{pmatrix} x^{j,\mathrm{true}}_t \\ y^{j,\mathrm{true}}_t \end{pmatrix} + \varepsilon^{ij}_L(t), \quad \text{for } \varepsilon^{ij}_L(t) \sim \mathcal{N}(0, \sigma^2_L I_2),$$

i.e., $z^{ij}_t$ is robot $i$'s noisy estimate of robot $j$'s true global position, where the additive term $\varepsilon^{ij}_L(t)$ captures both sensing noise and the additional uncertainty induced by the measuring robot's pose uncertainty when expressing the detection in global coordinates. Accordingly, we parameterize the measurement-noise standard deviation as $\sigma_L = \sigma_{\mathrm{internal}} + d_{ij}(t)/(\sqrt{2}M)$, where $d_{ij}(t)$ is the Euclidean distance between robots $i$ and $j$, and $\sigma_{\mathrm{internal}}$ is a fixed constant that captures additional uncertainty introduced by representing the inter-robot observation as a noisy global-position measurement (rather than using an explicit range-and-bearing model). The term $\sqrt{2}M$ equals the workspace diagonal length (with $M$ the side-length of the square workspace) and serves as an upper-bound range scale, so $\sigma_L$ increases with distance. The value of $\sigma_L$ is not assumed to be known to the robots.

Communication channel model. The goal of the communication layer in this setting is not high throughput per se, but timely and reliable delivery of relative measurements that directly affect state estimation accuracy. Inter-robot measurement packets are sent over a wireless channel modeled as a Binary Erasure Channel (BEC), where each packet is independently erased with probability $\varepsilon \ge 0$ and otherwise delivered after a fixed one-way delay. This abstraction captures packet drops due to fading, interference, or contention, while keeping the model analytically simple.
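The channel abstraction above is simple enough to simulate directly. The sketch below assumes a hypothetical helper `bec_send` and illustrative values for the erasure probability and one-way delay; none of these names or numbers come from the paper.

```python
import random


def bec_send(send_time, erasure_prob, one_way_delay, rng):
    """Pass one packet through a binary erasure channel.

    Returns the arrival time, or None when the packet is erased.
    """
    if rng.random() < erasure_prob:
        return None
    return send_time + one_way_delay


rng = random.Random(1)
epsilon, delay = 0.2, 0.05  # illustrative values, not taken from the paper
arrivals = [bec_send(0.1 * k, epsilon, delay, rng) for k in range(10_000)]
loss_rate = sum(a is None for a in arrivals) / len(arrivals)
print(f"empirical erasure rate: {loss_rate:.3f}")
```

Because erasures are independent across packets, the empirical loss rate concentrates around the chosen erasure probability as the number of packets grows.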
Feedback is provided via a backward acknowledgment channel, i.e., by sending acknowledgment (ACK) or negative-acknowledgment (NACK) messages according to the erasure realizations. For simplicity, we assume the backward channel is reliable and experiences the same one-way delay [16]. Recall that RTT is the round-trip time between two robots (assumed constant in our model). A successfully delivered packet becomes available for estimation after a one-way delay of RTT/2 (a standard modeling simplification). When uncoded retransmission-based protocols are used, loss recovery requires waiting for feedback over a full RTT, which directly impacts in-order delivery delay.

Each packet consists of the sender ID, target ID, a synchronized timestamp of the measurement, and the measurement content, which includes robot $i$'s LiDAR-based estimate of robot $j$'s global position together with a noisy range measurement $\hat{d}_{ij}(t) = d_{ij}(t) + \varepsilon_{d,ij}(t)$, where $\varepsilon_{d,ij}(t) \sim \mathcal{N}\big(0, (d_{ij}(t)/(\sqrt{2}M))^2\big)$ (used for covariance construction). In general, in the considered communication model, packets may arrive late, out-of-order, or be erased permanently.

Transport protocols. We consider the three transport mechanisms introduced in Sec. II:
(1) UDP, without retransmission or ordering guarantees.
(2) Selective Repeat ARQ, with reliable transmission via selective packet retransmission and a sliding window of size $W_{\mathrm{SR}} = a\beta\,\mathrm{RTT}$, which can stall if the base packet is repeatedly lost.
(3) AC-RLNC, which uses a sliding coding window and adaptive redundancy (via a-priori and a-posteriori FEC). The maximum coding window size is $W_{\mathrm{AC}} = b\beta\,\mathrm{RTT}$.

Here, $a$ and $b$ are tunable parameters selected to manage the in-order delay–throughput tradeoff. We choose $a$ so that the SR-ARQ window remains sufficiently large to keep the sender pipeline full under RTT-scale feedback, and $b$ to control the coding span in AC-RLNC and thereby limit the potential decoding delay.
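The two window sizes scale with the rate multiplier $\beta$, which the text immediately following defines as $1/\max\{1-\varepsilon-\alpha, \lambda\}$. The sketch below works through one numeric instance. It assumes that RTT is expressed in time slots and that window sizes are rounded up to whole packets; both are assumptions of this example, as are the parameter values for `a`, `b`, `alpha`, and `lam`.

```python
import math


def rate_multiplier(erasure_prob, alpha, lam):
    """beta = 1 / max(1 - eps - alpha, lam); lam > 0 caps beta at 1 / lam."""
    if lam <= 0:
        raise ValueError("lam must be positive")
    return 1.0 / max(1.0 - erasure_prob - alpha, lam)


def window_size(scale, beta, rtt_slots):
    """W = scale * beta * RTT, rounded up to whole packets (rounding assumed)."""
    return math.ceil(scale * beta * rtt_slots)


# Hypothetical parameters: eps = 0.2, alpha = 0.05, lam = 0.1, RTT = 10 slots.
beta = rate_multiplier(erasure_prob=0.2, alpha=0.05, lam=0.1)
w_sr = window_size(scale=2, beta=beta, rtt_slots=10)  # a = 2 (assumed)
w_ac = window_size(scale=1, beta=beta, rtt_slots=10)  # b = 1 (assumed)
print(round(beta, 3), w_sr, w_ac)  # 1.333 27 14
```

When the erasure probability approaches one, the floor `lam` takes over and the multiplier saturates at `1 / lam` instead of growing without bound.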
The communication rate is

$$f_c = \beta f_s, \quad \text{for } \beta = \frac{1}{\max\{1 - \varepsilon - \alpha,\ \lambda\}},$$

where $f_s = 1/\Delta t$ is the simulation update rate, $\alpha$ is a small safety margin when computing $\beta$, and $\lambda > 0$ is a lower bound that prevents the computed rate from becoming excessively large under very high erasure probabilities. This adaptive rate ensures that transmission remains compatible with the effective channel capacity, maintaining stability even under high packet-loss conditions. When network quality deteriorates (i.e., $\varepsilon$ increases), $\beta$ increases accordingly, enabling more frequent packet transmissions or coded-packet repetitions. AC-RLNC benefits in particular from this mechanism, as it uses the additional transmission opportunities to send coded packets. Each such transmission contributes a degree of freedom at the receiver, increasing the likelihood of earlier decoding and thereby reducing the in-order delivery delay.

State estimation. Each robot maintains an extended Kalman filter (EKF) to estimate its state from noisy self- and inter-robot measurements. The applied control $\bar{\mathbf{u}}^i_t$ during the prediction step is modeled as a noisy measurement of the true input, $\bar{\mathbf{u}}^i_t := \mathbf{u}^i_t + \eta^i_t$, for $\eta^i_t \sim \mathcal{N}(0, \Sigma_u)$, where $\Sigma_u = \mathrm{diag}(\sigma^2_v, \sigma^2_\delta)$ is a fixed covariance whose value is not known to the robots. The EKF prediction is obtained from

$$\hat{\mathbf{x}}^i_t = f(\hat{\mathbf{x}}^i_{t-1}, \bar{\mathbf{u}}^i_t), \quad \text{and} \quad P^i_t = F^i_t P^i_{t-1} (F^i_t)^\top + Q,$$

where $F^i_t$ is the Jacobian of the motion model, and the process-noise covariance is $Q = \mathrm{diag}(\sigma^2_{\mathrm{process}}, \sigma^2_{\mathrm{process}}, \sigma^2_\theta)$, with $\sigma_{\mathrm{process}}$ and $\sigma_\theta$ denoting fixed standard deviation parameters of the translational and heading process uncertainties, assumed known to the robot.
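The prediction step above can be sketched in plain Python. This is a minimal illustration that assumes the Jacobian is taken with respect to the state and evaluated at the previous estimate, which is the usual EKF convention; the helper names and the numeric values are hypothetical.

```python
import math


def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def ekf_predict(x_hat, P, u_bar, dt, wheelbase, Q):
    """Propagate the estimate through Eq. (1) and the covariance through F P F^T + Q."""
    x, y, theta = x_hat
    v, delta = u_bar
    x_pred = [
        x + dt * v * math.cos(theta),
        y + dt * v * math.sin(theta),
        theta + dt * v * math.tan(delta) / wheelbase,
    ]
    # Jacobian of Eq. (1) with respect to (x, y, theta) at the previous estimate.
    F = [
        [1.0, 0.0, -dt * v * math.sin(theta)],
        [0.0, 1.0, dt * v * math.cos(theta)],
        [0.0, 0.0, 1.0],
    ]
    FPFt = matmul(matmul(F, P), transpose(F))
    P_pred = [[FPFt[r][c] + Q[r][c] for c in range(3)] for r in range(3)]
    return x_pred, P_pred


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
Q = [[0.01, 0.0, 0.0], [0.0, 0.01, 0.0], [0.0, 0.0, 0.001]]  # illustrative
x_pred, P_pred = ekf_predict([0.0, 0.0, 0.0], identity, (1.0, 0.0), 0.1, 2.5, Q)
```

The off-diagonal terms that appear in `P_pred` show how heading uncertainty leaks into lateral position uncertainty as the robot moves.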
Each received measurement $z$ (either a GPS self-measurement $z^{i,\mathrm{GPS}}_t$ or an inter-robot measurement $z^{ij}_t$ delivered over the communication channel) triggers the standard EKF update $K = P^- H^\top (H P^- H^\top + R)^{-1}$, $\hat{x}^+ = \hat{x}^- + K(z - H\hat{x}^-)$, and $P^+ = (I - KH)P^-$, where $H$ denotes the observation Jacobian and $R$ the measurement covariance. Note that the robots do not know the true distribution from which the LiDAR measurements are drawn in the considered setting. That is, the inter-robot measurements are generated using the baseline LiDAR uncertainty $\sigma_L$, but each robot constructs its own measurement covariance using its predicted process uncertainty. For GPS updates we use $R_{\mathrm{GPS}} = \sigma_{\mathrm{GPS}}^2 I_2$, while for inter-robot measurements the robot approximates the covariance as $R_{\mathrm{LiDAR}}(t) = \big(\sigma_{\mathrm{process}} + \hat{d}_{ij}(t)/(\sqrt{2}M)\big)^2 I_2$, where $\hat{d}_{ij}(t)$ is the (noisy) range estimate obtained from the LiDAR detection and transmitted to the receiving robot. This reflects the estimator's assumption that measurement uncertainty increases with range, with an additional offset term ($\sigma_{\mathrm{process}}$) that captures the effect of the measuring robot's pose uncertainty on the transmitted global-position measurement.

Delay-handling mechanisms. Packet delays cause measurements to arrive with information age $d := t - \tau$, where $t$ is the current time slot and $\tau$ is the time slot at which the measurement was generated. Two approaches are considered. The naive time-window approach processes each measurement immediately if its age satisfies $d \le D$, where $D$ is the estimator's delay-handling window size, and discards it otherwise. Because accepted measurements are fused in arrival order, delayed inter-robot measurements are applied out of sequence, which can degrade estimation accuracy. As a more robust alternative, we introduce the iterative re-estimation (I-ReE) approach. I-ReE is a communication-aware method that maintains a sliding window of the past $D$ states, covariances, controls, and received measurements.
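A minimal Python sketch of such a buffered filter is given below: it stores recent posteriors, controls, and measurements per time slot, and reruns the prediction and update steps in temporal order whenever a late measurement is slotted into the buffer. To stay short it uses a scalar linear Kalman filter with a random-walk model rather than the paper's EKF over vehicle poses. The class name `ReplayFilter`, its method names, and the motion and measurement models are all assumptions made for illustration.

```python
class ReplayFilter:
    """Scalar Kalman filter that replays buffered steps when late data arrives.

    ASSUMPTION: random-walk model x_t = x_{t-1} + u_t with process variance q
    and direct measurements z = x + noise with variance r. Only the
    buffering and replay logic is meant to mirror the delay-aware estimator.
    """

    def __init__(self, x0, p0, q, r, window):
        self.q, self.r, self.window = q, r, window
        self.t = 0
        self.post = {0: (x0, p0)}  # slot -> posterior (mean, variance)
        self.controls = {}         # slot -> control applied in that slot
        self.meas = {}             # slot -> measurements generated in that slot

    def _step(self, x, p, u, zs):
        x, p = x + u, p + self.q   # prediction
        for z in zs:               # one update per buffered measurement
            k = p / (p + self.r)
            x, p = x + k * (z - x), (1.0 - k) * p
        return x, p

    def advance(self, u, arrivals=()):
        """Advance one slot with control u.

        arrivals holds (tau, z) pairs received now, where tau is the slot
        in which measurement z was generated.
        """
        self.t += 1
        self.controls[self.t] = u
        start = self.t
        for tau, z in arrivals:
            age = self.t - tau
            if tau < 1 or age < 0 or age > self.window:
                continue           # older than the window: discard
            self.meas.setdefault(tau, []).append(z)
            start = min(start, tau)

        # Replay from the oldest affected slot up to the current slot.
        x, p = self.post[start - 1]
        for s in range(start, self.t + 1):
            x, p = self._step(x, p, self.controls[s], self.meas.get(s, []))
            self.post[s] = (x, p)

        # Drop history that can no longer be needed for a replay.
        for s in [s for s in self.post if s < self.t - self.window]:
            del self.post[s]
            self.controls.pop(s, None)
            self.meas.pop(s, None)
        return x, p
```

In this sketch, once all delayed measurements have been delivered, the replayed estimate coincides with the one obtained had every measurement arrived without delay, which is the consistency property the replay is meant to provide. The cost is up to `window` filter steps per slot instead of one.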
When a delayed measurement with age $d \le D$ arrives, it is inserted into the appropriate buffer, and the EKF is replayed from $t - d_{\max}$ to $t$, where $d_{\max}$ is the largest active delay. By reprocessing the prediction and update steps in temporal order, I-ReE incorporates delayed information consistently and avoids the degradation caused by out-of-sequence updates.

B. Safety-Critical Overtaking Under Timing Constraints

We consider a safety-critical overtaking scenario in which an ego vehicle must decide whether to abort an ongoing overtake based solely on the timeliness of V2V packet reception (Fig. 2). All vehicle motion here is deterministic; uncertainty arises exclusively from packet delays and communication channel erasures. The scenario involves three vehicles: the ego vehicle $A$, a slower truck $T$, and an oncoming vehicle $B$. Both $A$ and $B$ travel on the outer lane but in opposite directions, while $T$ travels on the inner lane in the same direction as $A$. All vehicles follow the Ackermann steering model (Eq. (1)). The ego vehicle begins the maneuver already travelling in the outer lane behind $T$ (Fig. 2). During the overtake, $A$ remains in this lane while $B$ approaches head-on in the same lane. If $A$ does not abort the maneuver early enough, the two vehicles may collide. Note that vehicle $A$ is assumed to have no direct sensing of $B$ (except in late situations where a collision is imminent), as vehicle $T$ obstructs its view due to the problem geometry. Instead, the ego vehicle must rely entirely on receiving a sufficient number of packets to infer that $B$ is approaching in its lane and that the overtake must be aborted.
To determine whether the ego vehicle can still safely abort the overtake given its decisions, the simulation rolls out an abort maneuver beginning at a candidate time slot: strong braking with deceleration $a_{\max}$, followed by a smooth, bounded steering input $\delta_{\mathrm{abort}}$ that guides the vehicle from the outer lane back toward the inner lane behind the truck. The steering profile produces a fast and stable inward trajectory without excessive lateral overshoot. During the rollout, full oriented-bounding-box collision checks against both the truck and the oncoming vehicle are performed. A candidate time slot is marked safe if ego vehicle $A$ completes the abort without collision; the latest such time slot defines the abort deadline.

Communication and information model. To determine whether to abort the overtake maneuver, the ego vehicle relies on messages received from vehicle $B$ via an exchange of generic V2V packets over the same delayed binary-erasure channel introduced earlier. These packets represent the minimal information needed to establish a communication session and convey situational context (e.g., presence of an oncoming vehicle, maneuver intent, or status metadata).

Vehicle $A$ considers the number $N(t)$ of packets successfully received by time $t$ from $B$, and aborts if and only if $N(t)$ reaches the threshold $\mathrm{msg}_{\mathrm{req}}$ before the abort deadline arrives. To emulate realistic V2V behavior along the road, packet loss varies over time in a piecewise-constant manner. The time-slot horizon $T$ is divided into $J$ contiguous intervals of $T'$ time slots each (such that $T/T' = J \in \mathbb{N}$), each assigned a fixed average erasure probability $\varepsilon(t')$ for $t' \in \{1, \dots, J\}$. These probabilities decrease across intervals, reflecting the improving channel conditions as $A$ and $B$ move closer. Packet successes in each interval follow a Bernoulli process with success probability $1 - \varepsilon(t')$.
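The piecewise-constant channel and the packet-count rule can be sketched in a few lines of Python. The sketch below draws one Bernoulli reception per time slot and returns the slot at which the $k$-th packet is received. It deliberately omits propagation delay and any transport protocol (neither SR-ARQ nor AC-RLNC is modeled), so it does not reproduce the paper's results. The linear interpolation of the success probability between 0.1 and 0.9 is an assumption, since only the range of the profile is reported, and all function names are illustrative.

```python
import random


def success_prob(t, horizon=160, interval=16, p_first=0.1, p_last=0.9):
    """Per-slot success probability 1 - eps(t'), constant within each interval.

    ASSUMPTION: the probability rises linearly from p_first to p_last over
    the J = horizon / interval intervals; only the endpoints are reported.
    Slots are indexed from 0 here.
    """
    num_intervals = horizon // interval
    j = min(t // interval, num_intervals - 1)
    return p_first + (p_last - p_first) * j / (num_intervals - 1)


def completion_time(k, rng, horizon=160, interval=16):
    """Slot at which the k-th packet is received, or None if never.

    One raw transmission per slot; no delay, retransmission, or coding.
    """
    received = 0
    for t in range(horizon):
        if rng.random() < success_prob(t, horizon, interval):
            received += 1
            if received >= k:
                return t
    return None


def meets_deadline(t_k, deadline=110):
    """Abort is triggered in time iff the k-th packet arrived by the deadline."""
    return t_k is not None and t_k <= deadline
```

A single run is obtained with, for instance, `meets_deadline(completion_time(25, random.Random(0)))`. Repeating this over many seeds gives an empirical estimate of the probability of triggering the abort in time under this simplified channel.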
This piecewise-constant design captures the abrupt changes in connectivity commonly observed when vehicles move between areas of weak and strong coverage.

Finally, we mention that this scenario uses the same reliable transport mechanisms described in Sec. I and Sec. III-A, namely SR-ARQ and AC-RLNC. Their behavior here impacts only the arrival pattern of packets (i.e., their in-order delivery delay and throughput). No additional protocol modeling is introduced in this subsection.

IV. EXPERIMENTAL SETUP AND RESULTS

We evaluate how communication delay, packet loss, and protocol behavior affect (i) distributed estimation accuracy in a multi-robot localization scenario and (ii) decision-making reliability in a safety-critical high-speed overtaking scenario. The parameters used in the experiments were set to reflect realistic behavior and models, and are reported in Tables I and II.

[Figure 3: four panels, each plotting Estimation Error [m] against Time Slots (0 to 2000).]
(a) Naive vs. communication-aware approaches. Curves: ε = 0, No Dly, No I-ReE; ε = 0, one-way Dly, No I-ReE; ε = 1, one-way Dly, No I-ReE; ε = 0, one-way Dly, I-ReE. Note that the blue curve overlaps with the orange curve.
(b) Estimation error under erasures and delay with UDP and I-ReE, for ε ∈ {0, 0.2, 0.4, 0.6, 0.8, 1}.
(c) Estimation error under erasures and delay with SR-ARQ and I-ReE, for ε ∈ {0, 0.2, 0.4, 0.6, 0.8, 1}.
(d) Estimation error under erasures and delay with AC-RLNC and I-ReE, for ε ∈ {0, 0.2, 0.4, 0.6, 0.8, 1}.
Fig. 3. Cooperative localization estimation error under different communication conditions, transport protocols, and delivery delays. Here ε denotes the packet erasure probability. “No Dly” denotes RTT = 0, while “one-way Dly” denotes a fixed one-way delay of RTT/2. “No I-ReE” corresponds to the naive approach that updates the EKF upon packet arrival, while “I-ReE” denotes the delay-aware iterative re-estimation method. For reliable protocols, packets additionally experience protocol-dependent in-order delivery delay due to retransmissions (SR-ARQ) or decoding (AC-RLNC). Curves without a protocol name correspond to the estimator-only comparison under the stated delay/erasure condition.

A. Cooperative Localization Results

We quantify estimation performance using a trajectory-estimation metric computed at each time slot $t$ (Fig. 3). Let $k(t) = \min(200, t + 1)$ denote the length of a trailing evaluation window over which the error is averaged. When $t \ge 199$, the evaluation window spans the most recent 200 time slots. The estimation error is defined as

$\mathrm{Err}(t) = \frac{1}{n} \sum_{i=1}^{n} \frac{1}{k(t)} \sum_{\tau = t - k(t) + 1}^{t} \big\| \hat{x}^i_\tau - x^{i,\mathrm{true}}_\tau \big\|_2,$

which averages the recent position-estimation deviation across all robots. This trailing-window metric smooths high-frequency fluctuations and enables consistent comparison across communication conditions. We report results for a single seeded realization of the randomized robot motions (and measurement noise), using the same seed across all protocol comparisons; varying the seed produced similar trends and does not materially change the conclusions.

We evaluate three communication regimes: (i) Ideal communication, i.e., no delay (RTT = 0) and no packet loss (ε = 0); (ii) Delayed communication, i.e., lossless delivery with only one-way delay and no packet loss; (iii) Lossy with delay, i.e., inter-robot measurements are subject to both delay and erasure probability ε.
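The trailing-window metric $\mathrm{Err}(t)$ defined above can be computed directly from stored trajectories. The pure-Python sketch below is one way to do so; the nested-list data layout and the function name `trailing_error` are assumptions made for illustration, with positions given as planar $(x, y)$ pairs.

```python
import math


def trailing_error(est, true, t, window=200):
    """Err(t): mean position error over robots and a trailing window.

    est[i][tau] and true[i][tau] are the estimated and true (x, y)
    positions of robot i at slot tau (ASSUMED layout, slots indexed from 0).
    """
    k = min(window, t + 1)  # k(t) = min(200, t + 1) by default
    total = 0.0
    for est_i, true_i in zip(est, true):
        window_sum = sum(
            math.dist(est_i[tau], true_i[tau])  # Euclidean norm of the deviation
            for tau in range(t - k + 1, t + 1)
        )
        total += window_sum / k
    return total / len(est)
```

For the first 199 slots the window simply grows with $t$, so early values of the metric average over fewer samples and fluctuate more than later ones.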
When no delayed measurements arrive (ideal case or full erasure), the I-ReE estimator reduces to the standard EKF, as no out-of-sequence measurements occur. Fig. 3 presents the resulting estimation error trajectories.

In the first set of experiments (Fig. 3a), we evaluate the effect of ideal and delayed communication, coupled with a naive or a communication-aware estimation approach (I-ReE). The ideal case (ε = 0, No Dly, No I-ReE) serves as the baseline and reflects the best achievable estimator performance. In this regime, I-ReE collapses to the naive EKF because no measurements arrive delayed or out of sequence. When a one-way delay is introduced without packet loss (ε = 0, one-way Dly, No I-ReE), the naive EKF exhibits a marked increase in estimation error due to out-of-order incorporation of delayed measurements. The corresponding I-ReE curve (ε = 0, one-way Dly, I-ReE) remains close to the baseline by restoring chronological consistency before applying updates. In the full-erasure condition (ε = 1, one-way Dly, No I-ReE), all inter-robot measurements are erased, so no delayed information ever arrives, and I-ReE again reduces to the naive EKF. Overall, this comparison shows that delay, rather than loss, is the key factor that motivates the need for a delay-aware estimator: whenever delayed measurements are present, I-ReE provides a substantial improvement.

Next, we assess the impact of the transport protocols under erasures and delay, combined with the communication-aware estimator (Fig. 3b–3d). In all cases, inter-robot measurements are subject to both packet erasures and protocol-dependent delivery delay. UDP exhibits monotonic degradation as erasures increase, since lost measurements are never recovered. SR-ARQ performs well under moderate erasure but degrades sharply at high loss due to retransmission stalls that delay packets beyond the usable estimation window.
AC-RLNC maintains near-ideal performance across all tested erasure probabilities below 1, owing to its adaptive coded redundancy and steady delivery of decodable packets.

Overall, the findings here show that delay is the dominant factor impairing cooperative localization and that traditional protocols face structural limitations due to either loss (UDP) or latency stalls (SR-ARQ). The coding solution AC-RLNC combined with I-ReE robustly preserves near-ideal performance in the scenarios tested. This highlights the value of aligning transport behavior with estimator design and motivates tighter communication and estimation co-design.

B. Safety-Critical Overtaking Results

The overtaking simulation evaluates whether timely V2V packet delivery allows the ego vehicle to abort the maneuver before the physically computed abort deadline. The deadline is obtained from the abort rollout described in Sec. III-B. For this scenario and these problem parameters, the abort deadline is at time slot $t = 110$. We require $\mathrm{msg}_{\mathrm{req}} = 25$ packets to be successfully received, representing a conceptual packet requirement to establish a communication session and convey situational information (e.g., heading and intent) sufficient to trigger the abort maneuver.

For each protocol, we examine the reliability-latency curve $\Pr[T_{25} \le t]$, where $T_{25}$ denotes the arrival time (in time slots) of the 25th successfully received packet. This curve captures the full distribution of completion times under the time-varying erasure process. We estimate it by Monte Carlo simulation from $N = 1000$ independent runs; across runs, packet delivery differs due to different realizations of the time-varying packet erasures. Fig. 4 shows the results. In the scenario tested, AC-RLNC achieves a rapid rise in reliability, indicating that the required packets typically arrive much earlier than the deadline.
SR-ARQ, by contrast, exhibits a long latency tail due to retransmission stalls, which significantly increase the probability of late completion. Evaluating the curve at $t = 110$ yields the probability of meeting the abort deadline for each protocol, showing a higher completion probability for AC-RLNC (about 80%) than for SR-ARQ (about 60%). A typical outcome is illustrated in Fig. 2. This result attests again to the strength of the coded solution.

[Figure 4: Reliability $\Pr[T_{25} \le t]$ (0 to 1) plotted against Time Slots $t$ (60 to 160) for SR-ARQ and AC-RLNC, with the deadline $D = 110$ marked.]
Fig. 4. Overtaking reliability–latency function $\Pr[T_{25} \le t]$, where $t$ (horizontal axis) denotes time slots and $T_{25}$ is the arrival time of the 25th successfully received packet. The value at $t = 110$ corresponds to the probability of satisfying the abort-by-deadline requirement.

V. DISCUSSION AND FUTURE WORK

Our case study shows that communication delay, loss, and transport behavior can directly affect both estimation accuracy and safety-critical decision making in connected multi-robot systems. In cooperative localization, delayed and out-of-sequence measurements degrade accuracy when fused in arrival order, while delay-aware re-estimation mitigates this effect; combined with adaptive coded transport, accurate trajectory estimates are maintained even under impaired links. In the overtaking scenario, safety reduces to meeting a strict abort deadline: retransmission-driven latency tails increase deadline misses, whereas adaptive coded delivery significantly improves the probability of triggering an abort in time in the scenarios tested. Overall, the results reinforce a central point: autonomy pipelines and communication mechanisms should be co-designed, since protocol-induced timing behavior can determine whether information remains usable and whether safety constraints are met.
Building on these insights, future work should formalize task-level information requirements such as deadlines, freshness, relevance, and prioritization, and use them to guide both estimator/controller design under delayed updates and the design of communication mechanisms, including new streaming codes that schedule and allocate redundancy according to data timeliness and task criticality. The longer-term goal is principled co-design methods that enable reliable MRS behavior under heterogeneous and time-varying networks.

Acknowledgments. The AI system ChatGPT was used for light editing and grammar enhancement, a preliminary literature review, and aesthetic enhancements in Fig. 1.

REFERENCES

[1] O. Shorinwa, T. Halsted, J. Yu, and M. Schwager, “Distributed optimization methods for multi-robot systems: Part 1—a tutorial,” IEEE Robotics Autom. Mag., vol. 31, no. 3, pp. 121–138, 2024.
[2] R. E. Stern, S. Cui, M. L. Delle Monache, R. Bhadani, M. Bunting, M. Churchill, N. Hamilton, R. Haulcy, H. Pohlmann, F. Wu, B. Piccoli, B. Seibold, J. Sprinkle, and D. B. Work, “Dissipation of stop-and-go waves via control of autonomous vehicles: Field experiments,” Transportation Research Part C, vol. 89, pp. 205–221, 2018.
[3] C. Wu, A. R. Kreidieh, K. Parvate, E. Vinitsky, and A. M. Bayen, “Flow: A modular learning framework for mixed autonomy traffic,” IEEE Transactions on Robotics, 2021.
[4] J. Harding, G. Powell, R. Yoon, J. Fikentscher, C. Doyle, D. Sade, M. Lukuc, J. Simons, and J. Wang, “Vehicle-to-vehicle communications: Readiness of V2V technology for application,” Nat. Highway Traffic Safety Admin., Tech. Rep. DOT HS 812 014, 2014.
[5] K. Dresner and P. Stone, “A multiagent approach to autonomous intersection management,” J. Artif. Intell. Res., vol. 31, 2008.
[6] M. Boban, A. Kousaridas, K. Manolakis, J. Eichinger, and W. Xu, “Connected roads of the future: Use cases, requirements, and design considerations for vehicle-to-everything communications,” IEEE Vehicular Technology Magazine, vol. 13, no. 3, pp. 110–123, 2018.
[7] 3rd Generation Partnership Project (3GPP), “Service requirements for enhanced V2X scenarios,” Tech. Rep., 2024, version 18.0.1.
[8] P. Popovski, J. J. Nielsen, C. Stefanovic, E. d. Carvalho, E. Strom, K. F. Trillingsgaard, A.-S. Bana, D. M. Kim, R. Kotaba, J. Park, and R. B. Sorensen, “Wireless access for ultra-reliable low-latency communication: Principles and building blocks,” IEEE Network, vol. 32, no. 2, pp. 16–23, 2018.
[9] S. Roumeliotis and G. Bekey, “Distributed multirobot localization,” IEEE Trans. on Robotics and Automation, vol. 18, no. 5, 2002.
[10] B. Sinopoli, L. Schenato, M. Franceschetti, K. Poolla, M. Jordan, and S. Sastry, “Kalman filtering with intermittent observations,” IEEE Transactions on Automatic Control, vol. 49, no. 9, 2004.
[11] Y. Bar-Shalom, “Update with out-of-sequence measurements in tracking: exact solution,” IEEE Trans. on Aerospace and Electronic Systems, vol. 38, no. 3, pp. 769–777, 2002.
[12] B. Kwon and P. S. Kim, “Novel unbiased optimal receding-horizon fixed-lag smoothers for linear discrete time-varying systems,” Applied Sciences, vol. 12, no. 15, 2022.
[13] S. Shalev-Shwartz, S. Shammah, and A. Shashua, “On a formal model of safe and scalable self-driving cars,” CoRR, vol. 1708.06374, 2017.
[14] I. Elleuch, A. Makni, and R. Bouaziz, “Cooperative overtaking assist. sys. based on V2V comm. and RTDB,” Computer Journal, 2019.
[15] J. Gielis, A. Shankar, and A. Prorok, “A critical review of communications in multi-robot systems,” Curr. Robotics Rep., vol. 3, 2022.
[16] A. Cohen, D. Malak, V. B. Bracha, and M. Médard, “Adaptive causal network coding with feedback,” IEEE Trans. on Comm., 2020.
[17] E. Dias, D. Raposo, H. Esfahanizadeh, A. Cohen, T. Ferreira, M. Luís, S. Sargento, and M.
Médard, “Sliding window network coding enables NeXt generation URLLC millimeter-wave networks,” IEEE Networking Letters, vol. 5, no. 3, pp. 159–163, 2023.
[18] V. A. Vasudevan, H. Esfahanizadeh, B. D. Kim, L. Landon, A. Cohen, and M. Médard, “Revisiting the interface between error and erasure correction in wireless standards,” IEEE Journal on Selected Areas in Communications, 2026.
[19] J. Day and H. Zimmermann, “The OSI reference model,” Proceedings of the IEEE, vol. 71, no. 12, pp. 1334–1340, 1983.
[20] J. Postel, “User Datagram Protocol,” RFC 768 (Standard), Internet Engineering Task Force (IETF), August 1980.
[21] E. Weldon, “An improved selective-repeat ARQ strategy,” IEEE Transactions on Communications, vol. 30, no. 3, pp. 480–486, 1982.
[22] T. Ho, M. Medard, R. Koetter, D. Karger, M. Effros, J. Shi, and B. Leong, “A random linear network coding approach to multicast,” IEEE Trans. on Information Theory, 2006.
[23] O. Shorinwa, T. Halsted, J. Yu, and M. Schwager, “Distributed optimization methods for multi-robot systems: Part 2—a survey,” IEEE Robotics & Automation Magazine, vol. 31, no. 3, pp. 154–169, 2024.
[24] S. M. LaValle, Planning Algorithms. Cambridge Univ. Press, 2006.

APPENDIX

TABLE I
COOPERATIVE LOCALIZATION SIMULATION PARAMETERS

Parameter | Symbol | Value
Number of robots | $n$ | 10
Workspace side length | $M$ | 200 m
Slot duration | $\Delta t$ | 0.1 s
Horizon | $T$ | 2000 time slots
Control update period | $\tilde{T}$ | 8 time slots
Steering reset period | $\hat{T}$ | 3 time slots
Wheelbase | $L$ | 2.5 m
GPS noise std. | $\sigma_{\mathrm{GPS}}$ | 3 m
Baseline LiDAR uncertainty | $\sigma_{\mathrm{internal}}$ | 2 m
Process noise (x, y) | $\sigma_{\mathrm{process}}$ | 1 m
Heading noise | $\sigma_\theta$ | 1°
Control noise (velocity) | $\sigma_v$ | 3 m/s
Control noise (steering) | $\sigma_\delta$ | 2°
Round-trip time | RTT | 4 time slots
Estimation window size | $D$ | 10 time slots
SR-ARQ window | $W_{SR}$ | $a\beta\,\mathrm{RTT}$, $a = 2$
AC-RLNC window | $W_{AC}$ | $b\beta\,\mathrm{RTT}$, $b = 1.5$
Protocol rate | $f_c$ | $\beta f_s$, $f_s = 1/\Delta t$
Safety margin | $\alpha$ | 0.11
Denominator floor | $\lambda$ | 0.15

Note: $\beta$ is the transmission-rate scaling factor defined in Sec. III-A and $\alpha$ is a safety-margin parameter used when computing $\beta$. The floor parameter $\lambda$ prevents denominator collapse (and thus rate blow-up) in the $\beta$ computation. $\sigma_{\mathrm{internal}}$ reflects a conservative baseline LiDAR uncertainty consistent with worst-case no-communication performance. RTT denotes the round-trip time of the communication channel.

TABLE II
SAFETY-CRITICAL OVERTAKING SIMULATION PARAMETERS

Parameter | Symbol | Value
Lane width | $w_{\mathrm{lane}}$ | 3.5 m
Car length | $L_{\mathrm{car}}$ | 4.5 m
Car width | $W_{\mathrm{car}}$ | 1.9 m
Truck length | $L_{\mathrm{truck}}$ | 12.0 m
Truck width | $W_{\mathrm{truck}}$ | 2.6 m
Wheelbase | $L_{\mathrm{wb}}$ | $0.6\,L_{\mathrm{vehicle}}$
Ego velocity | $v_A$ | 28 m/s
Truck velocity | $v_T$ | 22 m/s
Oncoming velocity | $v_B$ | 28 m/s
Maximum braking | $a_{\max}$ | 10 m/s²
Abort steering angle | $\delta_{\mathrm{abort}}$ | 10°
Slot duration | $\Delta t$ | 0.05 s
Horizon | $T$ | 160 time slots
Round-trip time | RTT | 8 time slots
Required packets | $\mathrm{msg}_{\mathrm{req}}$ | 25
Deadline | deadline | time slot 110
Erasure profile | $1 - \varepsilon(t')$ | [0.1, 0.9]
Channel intervals | $T'$ | 16 time slots
Protocol windows | $W_{SR}$, $W_{AC}$ | same as Tab. I

Note: The abort deadline and required packet count correspond to the feasibility conditions of the abort maneuver defined in Sec. III-B. The piecewise erasure profile models changing V2V link quality as the vehicles move along the trajectory. The protocol window definitions follow those used in the cooperative localization experiment (Tab. I), with $\beta = 1$.