Multi-Path Data Transfer Using Network Coding
Techniques, devices, systems, and protocols are disclosed herein that relate to data transfer between communication nodes via multiple heterogeneous paths. In various embodiments, network coding may he used to improve data flow and reliability in a multiple path scenario. Transmission control protocol (TCP) may also be used within different paths to further enhance data transfer reliability. In some embodiments, multiple levels of network coding may be provided within a transmitter in a multiple path scenario, with one level being applied across ail paths and another being applied within individual paths.
Latest MASSACHUSETTS INSTITUTE OF TECHNOLOGY Patents:
The present application claims the benefit of U.S. Provisional Patent Application No. 61/592,746 filed on Jan. 31, 2012, which is incorporated by reference herein in its entirety.
GOVERNMENT RIGHTSThis invention was made with government support under Contract No. FA8721-05-C-0002 awarded by the U.S. Air Force and under Grant No. FA9550-09-1-0196 awarded by the Air Force Office of Scientific Research. The government has certain rights in the invention.
FIELDSubject matter disclosed herein relates generally to data communications and, more particularly, to techniques, systems, and protocols for use in transferring data between locations via multiple different paths.
BACKGROUNDModern communication devices (e.g., smart phones and other handheld communicators, tablet computers with communication functionality, etc.) often possess multiple network interfaces for use with different network technologies having different connectivity characteristics. The different network technologies may each have, for example, different delay, throughput, and reliability characteristics. A mobile device that is connected to a remote source node through both a 30 network and an IEEE 802.11 wireless LAN may, for example, observe different usage costs and quality of service through each interface. It would be desirable to be able to utilize multiple available network resources to carry out a data transfer operation for a communication device to, for example, increase throughput, decrease data transit time, and/or make efficient use of available resources. It would also be desirable if this could be done in a reliable and efficient manner that takes advantage of already existing communication techniques and protocols.
SUMMARYTechniques, devices, systems, and protocols are described herein that support efficient and reliable data transfer between communication nodes via multiple different paths involving different network technologies. A novel transport layer protocol is proposed (i.e., the Multi-Path Transfer Control Protocol with Network Coding protocol or MPTCP/NC) that may he used to coordinate several parallel connections over multiple interfaces/networks. The proposed protocol incorporates some features of the multi-path TCP (MPTCP) protocol introduced by the Internet Engineering Task Force (IETF), but builds on the protocol to allow many of the benefits of network coding to be achieved in a multi-path scenario, it has been shown through simulation that the techniques described herein are capable of achieving a significant improvement in data transfer completion times and average transfer rates over other known multi-path approaches. Simulations have also shown that the techniques are more robust and reliable to link failure and variation than other multi-path schemes.
In accordance with one aspect of the concepts, systems, circuits, and techniques described herein, a machine implemented method for use in transferring data to a destination node comprises: obtaining a plurality of original data packets to be transferred to the destination node; generating first coded packets by linearly combining original data packets using network coding; distributing the first coded packets among multiple available paths leading to the destination node; generating second coded packets by linearly combining first coded packets distributed to a first path of the multiple available paths, using network coding; and transmitting the second coded packets associated with the first path to the destination node via as network associated with the first path.
In one embodiment, the method further comprises: generating second coded packets by linearly combining first coded packets distributed to a second path of the multiple available paths, using network coding; and transmitting the second coded packets associated with tie second path to the destination node via a network associated with the second path.
In one embodiment, generating first coded packets includes generating first coded packets by linearly combining original data packets that are within a sliding coding window. In one embodiment, the sliding coding window has a variable width. In one embodiment, the method further comprises: receiving acknowledgement messages from the destination node that each indicate that a new degree of freedom has been received by the destination node in connection with the data transfer; and adjusting the width of the sliding coding window based, at least in part, on received acknowledgement messages.
In one embodiment, generating second coded packets includes generating second coded packets by linearly combining first coded packets that are within a sliding coding window. In one embodiment, generating second coded packets includes generating redundant second coded packets for each set of first coded packets within the sliding coding window.
In one embodiment, each of the multiple available paths includes a transmission control protocol (TCP) layer that adds sequence numbers to first coded packets distributed to the path, wherein generating second coded packets by linearly combining first coded packets masks the sequence numbers added to the first coded packets associated with a path.
In one embodiment, each of the multiple available paths includes a transmission control protocol (TCP) layer; and distributing the first coded packets includes distributing the first coded packets among the multiple available paths based, at least in part, on TCP congestion control window dynamics.
In one embodiment, the multiple available paths are associated with different network technologies.
In one embodiment, obtaining a plurality of original data packets includes receiving the original data packets from an application layer.
In accordance with another aspect of the concepts, systems, circuits, and techniques described herein, a machine implemented method for use in processing coded packets received from a source node via multiple different paths comprises: receiving coded packets associated with a first connection via multiple different paths, the coded packets each including a linear combination of original data packets; for each coded packet associated with the first connection that is successfully received, sending an acknowledgement message to the source node without first determining whether the coded packet is linearly independent of previously received coded packets associated with the first connection; forwarding all coded packets associated with the first connection, received from all paths, to a common processing layer without decoding the coded packets; and for each coded packet associated with the first connection forwarded to the common processing layer: (a) determining whether the coded packet is linearly independent of coded packets associated with the connection that were previously forwarded to the common processing layer; and (b) sending an acknowledgement message to the source node acknowledging that a new degree of freedom has been received for the first connection if the coded packet is determined to be linearly independent.
In one embodiment, determining whether the coded packet is linearly independent includes performing a Gauss-Jordan elimination operation on a coefficient matrix.
In one embodiment, receiving coded packets associated with a first connection via multiple different paths includes receiving coded packets from patios that use different network technologies, wherein each of the different paths uses transmission control protocol (TCP) to control a transfer of packets through the path. In one embodiment, sending an acknowledgement message to the source node without first determining whether the coded packet is linearly independent from previously received coded packets is performed as part of a TCP layer, in one embodiment, sending an acknowledgement message to the source node without first determining whether the coded packet is linearly independent of previously received coded packets is performed within a first portion of the TCP layer for coded packets received via a first path and within a second portion of the TCP layer for coded packets received via a second path that is different from the first path.
In one embodiment, die method further comprises discarding a received coded packet if it is determined that the packet is not linearly independent.
In one embodiment, receiving coded packets associated with as first connection via multiple different paths includes receiving first coded packets via a first path following an IEEE 802.16 wireless networking standard and receiving second coded packets via a second path following an LTE wireless cellular standard.
In one embodiment, forwarding all coded packets associated with the first connection to a common processing layer includes forwarding the coded packets to a multiple path transfer control protocol with network coding (MPTCP-NC) layer that is higher than a TCP layer in a corresponding protocol architecture. In one embodiment, the protocol architecture includes separate protocol stacks associated with each of the multiple different paths.
In accordance with still another aspect of the concepts, systems, circuits, and techniques described herein, a communication device comprises: a first network interface unit configured for communication in a first network; a second network interface unit configured for communication in a second network that is different from the first network; and at least one processor to manage data transfer between the communication device and a destination node using multiple different paths, the at least one processor to: (a) obtain a plurality of original packets representative of data to be transferred to the destination node; (b) generate first coded packets by linearly combining original data packets using network coding; (c) distribute first coded packets to the multiple different paths to the destination node; and (d) generate second coded packets within individual paths by linearly combining first coded packets distributed to the path, using network coding.
In one embodiment, the at least one processor is configured to transmit second coded packets generated within individual paths to the destination node via corresponding networks.
In one embodiment, the at least one processor is configured to generate first coded packets by linearly combining original data packets within a sliding coding window,
In one embodiment, the at least one processor is configured to generate second coded packets for a first path by linearly combining first coded packets within a sliding coding window. In one embodiment, the at least one processor is configured to generate redundant second coded packets for the first path for each set of first coded packets within the sliding coding window.
In one embodiment, the at least one processor is configured to distribute the first coded packets to the multiple different paths based, at least in part, on TCP congestion control window dynamics.
In one embodiment, a first of the multiple different paths is associated with the first network interface unit and a second of the multiple different paths is associated with the second network interface unit.
In accordance with a further aspect of the concepts, systems, circuits, and techniques described herein, a communication device comprises: a first network interface unit configured for communication in a first network; a second network interface unit configured for communication in a second network that is different from the first network; and at least one processor to manage processing of coded data packets received from a source node via multiple different paths as part of a first connection, the coded data packets each including a linear combination of original data packets, the at least one processor to: (a) send an acknowledgement message to the source node for each coded data packet associated with the first connection that is successfully received; (b) forward coded data packets associated with the first connection, received from all paths, to a common processing layer without first decoding the coded packets; and (c) for each coded packet associated with the first connection forwarded to the common processing layer: (i) determine whether the coded packet is linearly independent of coded packets associated with the connection that were previously forwarded to the common processing layer; and (ii) send an acknowledgement message to the source node acknowledging that a new degree of freedom has been received for the first connection if the coded packet is determined to be linearly independent.
In one embodiment, the at least one processor is configured to determine whether the coded packet is linearly independent by performing a Gauss-Jordan elimination operation on a coefficient matrix.
In one embodiment, the at least one processor is configured to discard a received coded packet if it is determined that the packet is not linearly independent.
In one embodiment, a first of the multiple different paths is associated with the first network interface unit and a second of the multiple different paths is associated with the second network interface unit.
The foregoing features may be more fully understood from the following description of the drawings in which:
Techniques, devices, systems, protocols described herein relate to the transfer of data between communication nodes via multiple heterogeneous paths. In various embodiments, network coding is used to improve data flow and reliability within a multiple path scenario. Transmission control protocol (TCP) may also he implemented within each of the different paths to further enhance data transfer reliability. In some embodiments, multiple levels of network coding may be provided within a transmitter in a multiple path networking scenario, one level that is applied across all of the different paths (or sub-flows) and another level that is applied within each individual path. As will be described in greater detail, a communication protocol (MPTCP/NC) is also described for use in implementing and standardizing these techniques in some embodiments.
In each of the above-described scenarios, it may be possible to use multiple different network technologies to support a single data transfer operation (or a single connection) between communication nodes. The use of multiple different network technologies associated with multiple different paths to support a single data transfer operation or other function may be referred to as “heterogeneous networking.” As used herein, phrases such as “network technologies” and “network access technologies” encompass both wireless and wired technologies and can involve various different wireless and/or wire line standards, specifications, and/or protocols. The different wireless standards may include, for example, wireless networking standards, wireless cellular standards, wireless satellite communication standards, infrared communications standards, point to point microwave link technologies, LMDS and MMDS technologies, and/or other forms of wireless standards. Although specific heterogeneous network scenarios are shown in
Transmission control protocol (TCP) is a communication protocol that may be used to provide reliable data stream service between, for example, a source node and a destination node in a network. TCP makes use of positive acknowledgment with retransmission to improve the reliability of data transfer in both accuracy and sequence. A TCP source node may keep a record of each packet transmitted as part of a data transfer operation. After transmitting a packet, the source node may automatically retransmit the packet if an acknowledgement message is not received from the destination node within a predetermined time period. TCP is considered a part of the TCP/IP protocol suite and provides a communication service at an intermediate layer between an application program and the Internet Protocol (IP).
Internet Protocol (IP) typically uses “IP packets” as a vehicle to transfer data between nodes on a physical medium. An IP packet is a sequence of octets that includes both a header and a body. Among other things, the header of an IP packet may include destination node address information to identify a destination of the IP packet. The body of an IP packet may include the data being transferred. When an application program within a first node desires to send a large amount of data to a remote node, the program can split up the data into smaller portions and then issue a series of IP requests to an IP layer to transfer the data. Alternatively, the program can issue a single TCP request and let TCP deal with the IP layer. For various reasons, IP packets can be lost, duplicated, or delivered out of sequence in a network. TCP can detect such problems, request retransmission of lost data, and rearrange packets in a receiver that were delivered out of sequence. TCP can also operate to reduce congestion in a network. Once a TCP enabled receiver has received and reassembled a number of octets, it may pass them to an application program for use in performing a particular function. The operation of TCP can be relatively transparent to an application.
Network coding is a coding technique that may be used to, among other things, improve information flow in a network. Network coding usually involves the generation of linear combinations of data packets to form “coded packets” for transmission in a network. One form of network coding, known as Random Linear Network Coding (RLNC), uses randomly generated coefficients to form linear combinations of packets (or coded packets) to be transmitted in the network. In some cases, information describing the coefficients used to make the linear combination may be appended to the coded packet before transmission. RLNC has been shown to be a powerful technique for achieving robust, high throughput packet distribution in certain network environments. One example system and method fir implementing RLNC is described in U.S. Pat. No. 7,706,365 to Effros et al. entitled “Randomized Distributed Network Coding,” which is hereby incorporated by reference in its entirety.
In a network that uses RLNC, a source node that has a number of original data packets to transfer to a destination node may combine the packets together into a “coded packet” that is then transmitted into the network. A destination node in the network that receives the coded packet may then store the packet for eventual decoding. The destination node will typically require a particular number of coded packets before it is able to decode the coded packets to extract the original data packets. The decoding process may involve, for example, the solution of a system of linear equations having a number of independent variables (i.e., the data packets) and a number of independent equations. Before a destination node can reliably solve for the original data packets, the number of “degrees of freedom” (or DOFs) received must equal or exceed the number of original packets. The DOFs missing may in some cases be defined as the number of unknown independent variables of a system of equations less the number of independent equations. In some implementations, each coded packet successfully received at a destination node may provide another equation for use in decoding and thus reduce the number of DOFs needed by the node by one (assuming the coded packet is linearly independent of previously received coded packets). In some networks, a source node may transmit a stream of coded packets until a destination node is able to decode the original data packets.
In developing the techniques, devices, systems, and protocols described herein, it was determined that improved throughput performance and decreased communication delay could be achieved in a network by intelligently combining TCP, network coding, and heterogeneous networking. The techniques, devices, systems, and protocols are also capable of increasing data transfer reliability and decreasing management overhead when transmitting a single source-destination flow over multiple interfaces/networks. Although described herein in connection with network coding, it should be appreciated that some or all of the techniques described herein may use other similar codes in place of, or in addition to, network coding in some implementations (e.g., fountain codes, systematic codes, structured codes, etc.),
As shown in
As illustrated, the lower portion of protocol architecture 50 is divided into a number of different “sub-flows” 64, 66 that may each correspond to a different networking technology and a different path between a source node and a destination node. Each sub-flow 64, 66 within protocol architecture 50 may include a corresponding protocol stack. For example, first sub-flow 64 may include a stack with a TCP layer 56a, Fast-TCP/NC layer 58a, IP layer 60a, and network interface layer 62a. Likewise, second sub-flow 66 may include a stack with a TCP layer 56b, Fast-TCP/NC layer 58b, IP layer 60b, and network interface layer 62b. The protocol stack associated with each sub-flow 64, 66 may be used to process communications flowing through the sub-flow 64, 66 for a corresponding networking technology. Thus, if the first sub-flow 64 corresponds to an IEEE 802.11 based network, TCP layer 56a will be used to provide TCP services for the IEEE 802.11 based communications and network interface 62a will be used to interface to a corresponding IEEE 802.11 wireless medium, and so on. Although illustrated in
In a typical communication scenario, an application program within application layer 52 may determine that a large amount of data (e,g., a large data file, etc.) needs to be transferred to a remote destination node. The application program may then transfer some or all of the data to MPTCP/NC layer 54 with instructions to transfer the data to the destination node. MPTCP/NC layer 54 may then apply network coding to the data and then divide the coded data amongst the various sub-flows 64, 66 for transmission to the destination node. Thus, instead of seeing multiple TCP links, application layer 52 only needs to interface with MPTCP/NC layer 54.
In one implementation, the data transferred from application layer 52 to MPTCP/NC layer 54 will include data packets that will be referred to herein as “original packets.” As will be described in greater detail, MPTCP/NC layer 54 may process original packets received from application layer 52 using network coding to generate “coded packets.” As described previously, coded packets may be generated by forming linear combinations of original packets using, for example, randomly generated coefficients. In one approach, MPTCP/NC layer 54 may generate a new coded packet by linearly combining all original packets that are within a coding buffer of MPTCP/NC layer 54 all original packets received so far that are associated with the corresponding connection). In another approach, MPTCP/NC layer 54 may generate a new coded packet by linearly combining all original packets that are within a sliding coding window of MPTCP/NC layer 54. Each new original packet added to the coding window may drive an older original packet out of the window. The coded packets generated by MPTCP/NC layer 54 will be referred to herein as “first coded packets” to distinguish them from subsequently generated coded packets. The sliding coding window of MPTCP/NC layer 54 may have a fixed or variable length in different implementations.
In the discussion that follows, it will be assumed that two or more TCP sub-flows are properly established and managed under a common connection. In at least one implementation, the procedure for initiating and managing multiple TCP sub-flows will be the same as, or similar to, the procedures described in the MPTCP protocol proposed by the IETF (e.g., MPTCP Application Interface Considerations, by Scharf at el., draft-ietf-mptcp-api-06, 2012-10-22, which is incorporated by reference herein in its entirety), although other processes may alternatively be used. MPTCP/NC layer 54 may monitor the different TCP sub-flows 64, 66 of protocol architecture 50 to determine when a particular sub-flow is able to receive one or more first coded packets. As TCP transmits packets over the network, its window slides or expands to include new packets. When there is a chance to inject a new coded packet to one of the TCP sub-flows 64, 66, the MPTCP/NC layer 54 may produce a new random linear network, coded packet (i.e., a new “first coded packet”) and inject the new coded packet to the identified TCP sub-flow.
As described above, in one possible scenario, the MPTCP/NC layer 54 may decide to inject a new coded packet into, for example, sub-flow 64. The TCP layer 56a of sub-flow 64 may operate in a conventional. TCP manner as all operations in MPTCP/NC layer 54 and Fast-TCP/NC layer 58a may be transparent to the TCP protocol. However, it should be noted that all packets in the TCP window of TCP layer 56a of protocol architecture 50 will be coded packets (i.e., first coded packets). These packets are pushed from TCP layer 56a to Fast-TCP/NC layer 58a, which maintains another coding window. Fast-TCP/NC layer 58a may then apply a second level of network coding to first coded packets in its coding window to generate second coded packets. For each first coded packet TCP layer 56a sends to Fast-TCP/NC 58a, the latter may produce R second coded packets that are random linear combinations of all first coded packets in its coding window. The second coded packets are eventually passed to IP layer 60a, after which they may be transmitted over a corresponding network (via network interface layer 62a) to a destination node or nodes. Thus, the transmitted packets are twice-encoded packets.
Although packets received by TCP layer 56 are previously coded by MPTCP/NC layer 54, TCP will not be aware of this and will assign sequence numbers to the packets. In a conventional system, the sequence numbers will allow the destination node to determine whether any of the transmitted packets were lost in transit. Due to the nature of network coding, assuming the network coding window is designed correctly, sequence as are no longer necessary to identify packets lost in transit. That is, once enough packets have been successfully received in the destination node, the node will be able to decode the original packets, regardless of the sequence in which the coded packets were transmitted. Thus, the second layer of coding provided by Fast-TCP/NC layer 58a will serve to mask the sequence numbers of the packets added by TCP layer 56a as they are not needed.
A received packet is considered “innovative” if it is linearly independent of all previously received packets associated with the connection. If innovative, the packet will contribute a new “degree of freedom” and induce a new “seen packet” after Gaussian-Jordan elimination. Once the MPTCP/NC layer 54 receives an innovative packet from any of the sub-flows, it sends an ACK message (i.e., DATA_ACK) to acknowledge the new degree of freedom to the MPTCP/NC layer 54 of source node 82. Eventually, the MPTCP/NC layer 54 of the destination node 84 collects enough linearly independent packets to allow it to solve the linear system and recover all original packets. The original packets may then be delivered to the application level 52 of the destination node 84.
As described above, the Fast-TCP/NC layer 58 is directly below the transport layer 56. In the Fast-TCP/NC layer 58 of destination node 84, received packets are not checked for linear independence. That is, Gaussian-Jordan elimination is not performed and the seen packet concept is not employed. Instead, for each packet successfully received, regardless of whether it is linearly independent of previously received packets, Fast-TCP/NC layer 58 will automatically send an acknowledgement message badly to source node 82. This is because in any particular sub-flow TCP layer or below, there is no global knowledge of the linear space spanned by the packets received from all sub-flows. Therefore, a linear dependency check will not be efficient at the Fast-TCP/NC layer 58. Instead, Fast-TCP/NC layer 58 may deliver packets upwards to MPTCP/NC layer 54, which does have global knowledge of all packets, to handle the linear independency cheek and decoding. Furthermore, assuming that the coefficients in the packet header are properly organized and adjusted, the results of the second encoding are just other random linearly encoded packets. These packets can be decoded at the MPTCP/NC layer 54 of destination node 84, without intermediate decoding at the lower level. Since the Fast-TCP/NC layer 58 does not handle linear dependency issues, it may send an ACK every time a packet is received. This is reasonable as packet linear dependency is a problem associated with the coding operation, rather than with link congestion. The fast ACK allows the window management to be more efficient.
The MPTCP/NC layer 54 may maintain a coding window which decides the number of original packets to be coded and pushed to the TCP slab-flows 64, 66. As mentioned above, MPTCP/NC layer 54 has global knowledge of all packets received from all TCP sub-flows. Therefore, Gaussian-Jordan elimination may be carried out here, whenever a packet is received, to check linear dependency. The MPTCP/NC layer 54 of source node 82 receives a DATA_ACK for each &gee of freedom received at destination node 84 and may adjust its coding window accordingly. The design of the coding window is very flexible, as is not constrained by the TCP congestion window. For example, the left boundary of the window could slide according to the slowest TCP sub-flow, while the right side of the window can slide when the fastest sub-flow contributes a degree of freedom. This will allow packets delivered by any sub-flow to be innovative with high probability.
Within the first sub-flow 112, a number of redundant coded packets 116, 118 may be generated using coded packet c1 and coded packet c3 (and randomly generated coefficients). The redundant packets are generated to overcome potential packet losses in transit. Likewise, within the second sub-flow 114, a number of redundant coded packets 120, 122, 124 may be generated using coded packet c2, and coded packet c4 (and randomly generated coefficients). Once generated, the coded packets 116, 118, 120, 122, 124 may be transmitted to a remote destination node via corresponding network media. It should be noted that the number of redundant coded packets generated within first sub-flow 112 does not have to he the same as the number generated within second sub-flow 114 (although it could be the same in some scenarios).
The multiple different paths may be monitored to identify one or more paths that can receive a new first coded packet at a particular point in time (136). This monitoring process may continue until all of the desired data has been successfully transferred to and decoded at the destination node. In some embodiments, the monitoring process may be based upon the dynamics of a TCP congestion control window. When a path is identified, one or more first coded packets may be injected into the identified path (block 138). The injected packet may then be added to a second coding window (e.g., a Fast TCP/NC, coding window, etc.) associated with the identified path. Over time, different first coded packets may be injected into different paths using the above-described technique. For each path, one or more second coded packets may be generated by linearly combining the first coded packets that currently reside within a corresponding second coding window using network coding (block 140). Randomly generated coefficients may be used to perform the linear combination, as described above.
In some implementations, a number of redundant second coded packets may be generated for each set of first coded packets within the second coding window associated with a particular path. As before, for each new first coded packet added to a second coding window, an older first coded packet may be shifted out of the window. A new set of redundant packets may be generated for each new first coded packet added to a second coding window in some implementations. The redundant packets may be transmitted to the destination node via a corresponding network medium after they have been generated (block 142). The process of generating second coded packets within each path may be ongoing until all of the desired data has been successfully transferred to, and decoded at, the destination node. As described previously, the generation of second coded packets using first coded packets may mask any sequence numbers added to the first coded packets by corresponding TCP functionality.
Referring now to
After the ACK message has been transmitted, it may next be determined whether enough linearly independent coded packets have been successfully received for the first connection to be able to decode the received coded packets to extract the original packets (block 174). If enough linearly independent coded packets have not been received to allow decoding (block 174-N), the common processing layer may proceed to look for a new coded packet (block 168). If enough linearly independent packets have been received (block 174-Y), then a decoding procedure may be performed to extract the original packets from the received coded packets (block 176). The extracted original packets may then be forwarded to a corresponding application for use thereby (block 178). The above-described process may be repeated for each data transfer operation to be carried out between the source node and the destination node.
In Table 1 below, the proposed protocol is compared with some existing methods that use TCP connections for data transmission. In Table 1, the TCP/NC protocol is referring to that proposed in “Network Coding Meets TCP: Theory and Implementation,” by Sundararajan et al, Proceedings of the IEEE, vol. 99, no. 3, pp. 490-512, March 2011.
An analytical model will now be presented to evaluate the performance of MPTCP/NC. The model will use several key elements presented in previous sections to provide some of the relevant abstractions. First, the model will assume that the MPTCP/NC layer provides a method that eliminates the need to track specific packets sent over a sub-flow. Because degrees of freedom are delivered to a sub-flow, rather than individual uncoded packets, the analysis presented will only need to track each degree of freedom through the network. Second, the model will assume that the congestion control algorithm used is similar to that of TCP Reno. For example, the congestion window increases by the number of received acknowledgements divided by the current window size, decreases by half upon receipt of a triple-duplicate acknowledgement, and completely closes if no acknowledgement is received after a given time-out period. Third, the model will assume that the network coding operations performed by the past-TCP/NC layer eliminate the need to consider the effects of triple-duplicates on TCP s window size, instead, network coding simplifies the analysis. Since each received acknowledgement is an indication of a degree of freedom obtained by the receiver, only the effects of coded packet losses need to he considered. As a result, packet losses are interpreted as an increase in the round-trip time (RTT) between the source and destination. Finally, the concept of sending redundant coded packets will be used. For every packet contained in the TCP contention window, R linearly independent packets will be transmitted. This added redundancy significantly reduces the probability of a time-out and it will be assumed that R is large enough to reduce the probability of a time-out to zero. Each of the above abstractions contributes to the ability to analyze the performance of MPTCP/NC.
MPTCP/NC will be analyzed below using two metrics, the average throughput and the expected MPTCP/NC congestion window evolution [W]. Since the instantaneous throughput of TCP is directly related to [W], the analysis will focus on finding [W] and then take an average of [W] over time to find . MPTCP/NC's behavior will he modeled in terms of rounds. Within a round i, MPTCP/NC's expected congestion window size Wi will be determined, which is defined as the sum of each sub-flow's congestion window size for sub-flows j={1,2}:
The natural choice for determining the duration of a round is to use the round trip time (RTT) from the sender to the receiver (i.e., duration(i)=RTT), where equation (1) assumes that RTT1=RTT2. While this may work if there is a single TCP connection, each sub-flow in MPTCP/NC will typically have different round trip times making it difficult to determine which RTT to use. In order to account for this, the duration of each round will be set equal to the least common multiple (LCM) of the RTT for sub-flow 1 and sub-flow 2 (i.e., duration(i)=LCM(RTT1, RTT2)).
The window size for sub-flow j in round i is then for k=[α,β}. Equation (1) now becomes:
Wi=W┌α┐(1)+W┌β┐(2). (3)
The expected TCP congestion window size will now be analyzed for each sub-flow. The results will then be extended so that MPTCP/NC's end-to-end throughput can be determined. The analysis for the jth sub-flow in MPTCP/NC will be the same for each sub-flow. The most basic implementation of TCP will be used in the analysis and initially it will be assumed that each round's duration is equal to the RTT of sub-flow j. It will also be assumed that the window size dating round i is determined by the number of acknowledgements α obtained during round i−1. Specifically, the window size on each round is:
Assuming that Rj linearly independent packets are sent for each packet contained the TCP congestion window and a packet loss rate of pj, the expectation of the window size, E[Wi], may be expressed as:
Since the window size of TCP can increase by a maximum of one packet per round,
where equation (12) is obtained by iterating equation (11).
Now that the expected window size in round i has been found, the throughput (j) during the round can be calculated as follows:
where the minimization is necessary to account for packets that are received that do not deliver new degrees of freedom. Since the Fast-TCP/NC layer codes all packets within the TCP congestion window, delivered packets 1 through contain new degrees of freedom. If more than packets are received in the round, the MPTCP/NC layer will disregard them since they obtain no new degrees of freedom. Therefore, the throughput is adjusted to ensure that only new degrees of freedom are taken into account.
The fact that the round duration is not equal to sub-flow j's RTT (i.e., the duration of a round is equal to the LCM(RTT1, RTT2)) will now be taken into account. The expected window size in equation (12) can be adjusted to account for the shorter rounds by substituting ┌lγv┐ for i, were l is the round number when using the shorter round duration and γ={α,β}:
[W┌l/γ┐(j)]=[]+(┌l/γ┐−1)min(1,(1−pj)Rj). (14)
Substituting the above equation into equation (13), the per-round throughput for a TCP sub-flow j is obtained as follows:
This equation can be reduced if a large enough redundancy factor Rj is considered. As will be demonstrated later, the value chosen for Rj can be critical in achieving the maximum throughput If it is assumed that the network capacity between the source and the destination is larger than the maximum window size ×Rj for values of
then equation (16) becomes,
Finally, the fact that the window size of a TCP connection is limited by a maximum value needs to be addressed. This changes the expected window size during round l to be the minimum of equation (14) and . Furthermore, the per-round throughput becomes:
The model used in the above analysis of MPTCP/NC sub-flow performance makes several assumptions that, in practice, should be considered. First, it is assumed that packet losses are independent and identically distributed (i.i.d.) with loss probability pi. When the network is not congested and none of the links experience outages, this assumption is valid; but it fails when packet losses are correlated. A possible extension to the analysis presented here is to take correlated packet losses into account by modeling them using Gilbert-Elliot mode. Second, it is assumed that the number of redundant packets Rj sent for every packet contained in the TCP congestion window was sufficiently large to ignore the possibility of a time-out, which would close the TCP window to []. When Rj=1/(1−pi), this assumption is not necessarily valid. As will be shown in subsequent sections, time-outs occur frequently with i.i.d. packet losses when Rj=1/(1−pi). Specifically, a time-out occurs when the sum of received acknowledgements over two rounds, i and i+1, is less than the window size during round i with probability r(αai+αi+1<Wi). One possible direction is to model time-outs as a random walk with a threshold crossing (although time-outs are not included in the present analysis).
Third, it is assumed that RTTj remains constant. In practice, this is not true. Implementations of TCP generally use an averaged round-trip time often referred to as the “smoothed” round-trip time SRTT. Because network coding masks i.i.d. packet losses by extending the duration of RTT and network congestion or transmission delay is not taken into account, the RTT used in our analysis is effectively equal to SRTT.
It is desirable to determine average end-to-end throughput for MPTCP/NC. As shown above, the throughput for TCP/NC is dependent on the TCP congestion window size. Likewise, the average MPTCP/NC throughput is dependent on the joint window size of all sub-flows (i.e., Wi=Σj). While the approach shown here can be used for any number of TCP/NC sub-flows, results will only he provided for two sub-flows designated as sub-flow 1 and sub-flow 2 with packet loss probabilities and round-trip times of p1, RTT1 and p2, RTT2, respectively.
Using the least common multiple of RTT1 and RTT2 as the duration of each round and the values of α and β defined in (2), the expected congestion window size for MPTCP/NC is
One thing that was not taken into account is the maximum window size of each TCP/NC, sub-flow. Incorporating the maximum window size of sub-flow 1, Wmax(1), and sub-flow 2, Wmax(2), the expected MPTCP/NC window size becomes:
The equation above shows that the expected MPTCP/NC window size is monotonically increasing. As a result, it would be desirable to determine how fast it increases. This may be investigated by finding the number of rounds W it takes to get to a window size of W. The intuition behind finding the expected number of rounds it will take to reach a window size of W is that each sub-flow is geometrically distributed. Letting 0≦x≦W:
Now that the expected MPTCP/NC congestion window size has been determined, the average end-to-end throughput for MPTCP/NC may be found, which is a function of the above window size:
Assuming that nα, nβ ∈,
If nα, nβ ∉ , the above equation will contain additional terms that contain packets sent in the rounds from └nγ┘ to n γ for γ={α,β}. While these additional packets increase the throughput, their contribution is negligible for large enough n. Finally, the maximum window size of each sub-flow is considered, First, let:
r(1)=α(Wmax(1)[W1(1)]) and r(2)=β(Wmax(2)−[W1(2)]). (32)
Using equation (31) and assuming that R1>1/(1−p1) and R2>1/(1−p2), four cases for the average end-to-end throughput are generated:
An analysis has thus been provided to determine the sub-flow performance and overall MPTCP/NC performance using the expected window size W and the end-to-end average throughput as metrics. As noted above, results were provided for two sub-flows; but network coding makes it possible to use the same technique for any umber of sub-flows. In the analysis, assumptions were made regarding the congestion control algorithm, time-outs and the amount of redundancy introduced into the network to account for packet losses. In the discussion that follows, numerical results are provided which show that the amount of redundancy is critical in eliminating time-outs.
The performance of the proposed MPTCP/NC scheme was analyzed through numerical simulations. As part of the analysis, the same two-link model used in the mathematical analysis was considered, where the packet losses on a given link are i.i.d. with probability pi, i=1,2 and fixed throughout the use of the channel. In addition, the losses are independent between links, packets are not received out of order, the round trip time on each link is fixed and acknowledgments are not lost. This proposed model is suitable for a wireless scenario where congestion is not a major factor for packet losses.
In the simulations, TCP is restricted to operate in two modes: slow start and congestion avoidance. The slow start threshold (SST) for each TCP window is 64 packets, and the window can have a maximum size of Wmax=128 packets. A timeout is declared by TCP when a packet is not acknowledged (remains in the window) for two round trip times. When a timeout occurs, the window is reset to one. Furthermore, the window size is reduced to half when a triple duplicate ACK is received. Note that this never occurs in MPTCP/NC, since every received package is acknowledged. In addition, it is assumed that the transmitter sends a file of a given (fixed) size, to which it has access before transmission starts.
Three MPTCP schemes were considered:
-
- (1) Uncoded MPTCP: this is traditional MPTCP operation where packets are dynamically allocated to a given TCP subflow when requested.
- (2) MPTCP/NC with low redundancy: MPTCP/NC operation is modeled with redundancy on each link equal to Ri=(1−pi)−1, i=1,2, as discussed in the mathematical analysis above. It is assumed that the entire transmitted file is coded.
- (3) MPTCP/NC with high redundancy: MPTCP/NC is modeled with redundancy 5% above the average loss, i.e., Ri=1.05(1−oi)−1, i=1,2.
Three parameters were measured for each of these schemes considering different the sizes, packet loss probabilities and round trip times: (i) completion time, (ii) average rate, and (iii) total window size (i.e., sum of the windows of the two TCP subflows). The round trip time and packet loss probability parameters are shown in
The simulated model has several limitations. For example, it assumed that each link always supports up to 128 packets being transmitted simultaneously, regardless of the round trip time and link capacity. As mentioned previously, this model also does not consider losses due to congestion or lost acknowledgments. Furthermore, the assumption that the MPTCP/NC layer has access to the whole file being transmitted is unrealistic for several different applications, such as video streaming. Nevertheless, as will be shown in the following analysis, this model does capture some of the fundamental characteristics of MPTCP/NC's operation, clearly illustrating the benefits of coding over two TCP flows in regards using uncoded MPTCP.
In the discussion that follows, numerical results are provided for four different simulated scenarios, each with different values of RTT and link loss probability. The average results presented were taken over a set of 1000 different simulation instances done in MATLAB.
In the first scenario, the upper left hand side corner of the table in
The average completion time and average rate for different the sizes are illustrated in
The behavior of each of the three schemes can he better understood by
Even for this low loss scenario, uncoded MPTCP/NC performs poorly, with the joint window size never achieving the maximum value. Notice how for this particular instance uncoded TCP quickly “averages out”, already varying over its average joint window size after only the first second of transmission time.
The second considered scenario is more realistic, with one link representing, for example, a WiMax connection, with RTT2=80 ms and p2=10−1, and the second link having a high round trip time and low packet loss probability, given by RTT2=80 ms and p2=10−1. The second link could represent, for example, a satellite connection. Similarly to case 1,
The advantage of MPTCP/NC over uncoded MPTCP becomes clear in this scenario, with an average rate increase of over 10× for the investigated file sizes. In the uncoded MPTCP scheme, the link with low RTT is underutilized due to the high number of losses. This can be seen by the small variations of the line corresponding to encoded MPTCP in
On the other hand, MPTCP/NC is able to compensate for the packet loss rate in the link with low RTT through coding. When adding redundancy according to the average packet loss probability, time outs still occur periodically in the faster, less reliable link, as can be observed in the lowest curve in
When an additional 5% redundancy is added, time out events become very rare in the low RTT link. Consequently, this link is fully utilized, with the corresponding TCP subflow achieving its maximal window during the beginning of the transmission. Time out events still occur, as shown in
In the previous case, it was shown how uncoded MPTCP/NC tends to rely on the link with the lowest packet loss probability, regardless of the round trip time. In order to make this phenomena even more apparent, a third case will now be considered where one link is very reliable but with a very long round trip time, with RTT1=1500 ms and p1=10−3, and the second link has a low round trop time (100 times smaller) but is very unreliable, with RTT2=15 ms and p2=0.3. This corresponds to two opposite extreme points of
The average rate obtained by MPTCP/NC with 5% extra redundancy is deafly two orders of magnitude larger than uncoded MPTCP. The dependence of MPTCP on the slower link becomes obvious in
Once again it is noted that MPTCP/NC manages to reduce the number of time outs with coding. In addition, when a time out occurs, it is able to quickly recover due to the Fast-TCP/NC mechanism. As noted previously, the extra 5% redundancy also contributes significantly towards less timeouts, resulting in a significantly larger throughput in this noise scenario.
In the final scenario, both links are assumed to have the same error probability and different but comparable round trip times, with RTT1=80 ms, p1=10−1, RTT2=250 ms and p2=10−1. This scenario could represent, for example, a 3G and a WiFi link in parallel. The results for this case are depicted in
Due to the high packet loss probability in both links, the MPTCP scheme quickly “averages out”, and does not manage to increase significantly the joint window size of the TCP sub-flows. Furthermore, for large file sizes, adding 5% redundancy to MPTCP/NC above the average throughput of the channel almost doubles the achievable average rate. As discussed previously, this is due to the fact that the extra redundancy makes time out events very rare, and a joint time out for the two links will almost never occur for most file sizes. On the other hand, when R=(1−p)−1, time outs happen more frequently and can occur jointly for both links, reducing the window size to only a few packets. Nevertheless, even in this case, the Fast-TCP/NC layer manages to quickly increase the joint window size.
Based on the above analyses, it is apparent that MPTCP/NC can provide an average transmission rate that is an order of magnitude or more better than MPTCP in different cases of interest. Some conclusions that may be drawn include: (a) a little redundancy in MPTCP/NC can be very helpful, reducing time outs (by ≈2× average rate in some noisy scenarios); (b) MPTCP performs poorly when compared to the coded case, even with low or probabilities; (c) MPTCP's performance is determined by the link with lower packet loss, nearly independent of RTT (d) using coding masks packet losses as higher RTTs, as expected; and (e) MPTCP/NC is effective in leveraging links with significantly different RTTs.
Digital processor(s) 204 may include, for example, one or more general purpose microprocessors, digital signals processors (DSPs), controllers, microcontrollers, application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), programmable to is arrays (PLAs), programmable logic devices (PLDs), reduced instruction set computers (RISCs), and/or other processing devices or systems, including combinations of the above. Digital processor(s) 204 may be used to, for example, execute an operating system for a corresponding node device. Digital processor(s) 204 may also be used to, for example, execute one or more application programs for a node device. In addition, digital processor(s) 204 may be used to implement, either partially or fully, one or more of the communications related processes or techniques described herein in some implementations.
As described above, first and second wireless transceivers 208, 210 may include any type of transceivers that are capable of supporting wireless communication with one or more remote wireless entities. In various implementations, these transceivers 208, 210 may be configured in accordance with one or more wireless standards (e.g., wireless networking standards, wireless cellular standards, etc.) for use in communicating with corresponding wireless networks, systems, or devices. In this fashion, these transceivers may be used to support heterogeneous communication as described above. Although illustrated with two wireless transceivers in
Wired network interface 212 may be configured in accordance with one or more wired communication standards. Although illustrated with a sin e wired network interface 212, it should be appreciated that any number of wired interfaces may be used in other implementations. In some embodiments, wired network interface 212 may be used to support heterogeneous networking in conjunction with one or more other transceivers (wired or wireless).
Memory 206 may include any type of system, device, or component, or combination thereof, that is capable of storing digital information (e.g., digital data, computer executable instructions and/or programs, etc.) for access by a processing device or other component. This may include, for example, semiconductor memories, magnetic data storage devices, disc based storage devices, optical storage devices, read only memories (ROMs), random access memories (RAMs), non-volatile memories, flash memories, USB drives, compact disc read only memories (CD-ROMs), DVDs, Blu-Ray disks, magneto-optical disks, erasable programmable ROMs (EPROMs), electrically erasable programmable ROMs (EEPROMs), magnetic or optical cards, and/or other digital storage suitable for storing electronic instructions and/or data. In some embodiments, computer executable instructions may be stored in memory 206 that, if executed within a digital processor, can result in the performance of one or more of the methods described herein, or portions thereof.
Network encoder/decoder 216 may include a device or system for performing network encoding and/or decoding for a node device. In a source node device that will generate and transmit network ceded packets, network encoder/decoder 216 may include network encoding functionality. Likewise, in a destination node device that will receive and decode network encoded packets, network encoder/decoder 216 may include decoding functionality. In a node that may serve as both a source node and a destination node, both network encoding and decoding functionality may be provided. It should be appreciated that, although illustrated as a separate unit, network encoder/decoder 216 may be implemented, at least partially, within another component or device of a node in sonic implementations (e.g., within processor(s) 204 of
It should be appreciated that the node device architecture 200 of
Having described exemplary embodiments of the invention, it will now become apparent to one of ordinary skill in the art that other embodiments incorporating their concepts may also be used. The embodiments contained herein should not be limited to disclosed embodiments but rather should be limited only by the spirit and scope of the appended claims. All publications and references cited herein are expressly incorporated herein by reference in their entirety.
Claims
1. A machine implemented method for use in transferring data to a destination node, the method comprising:
- obtaining as plurality of original data packets to he transferred to the destination node;
- generating first coded packets by linearly combining original data packets using network coding;
- distributing the first coded packets among multiple available paths leading to the destination node;
- generating second coded packets by linearly combining first coded packets distributed to a first path of the multiple available paths, using network coding; and
- transmitting the second coded packets associated with the first path to the destination node via a network associated with the first path.
2. The method of claim 1, further comprising:
- generating second coded packets by linearly combining first coded packets distributed to a second path of the multiple available paths, using network coding; and
- transmitting the second coded packets associated with the second path to the destination node via a network associated with the second path.
3. The method of claim 1, wherein:
- generating first coded packets includes generating first coded packets by linearly combining original data packets that are within a sliding coding window.
4. The method of claim 3, further comprising:
- receiving acknowledgement messages from the destination node that each indicate that a new degree of freedom has been received by the destination node in connection with the data transfer; and
- adjusting the width of the sliding coding window based, at least in part, on received acknowledgement messages.
5. The method of claim 1, wherein:
- generating second coded packets includes generating second coded packets by linearly combining first coded packets that are within a sliding coding window.
6. The method of claim 5, wherein:
- generating second coded packets includes generating redundant second coded packets for each set of first coded packets within the sliding coding window.
7. The method of claim 1, wherein:
- each of the multiple available paths includes a transmission control protocol (TCP) layer that adds sequence numbers to first coded packets distributed to the path, wherein generating second coded packets by linearly combining first coded packets masks the sequence numbers added to the first coded packets associated with a path.
8. The method of claim 1, wherein:
- each of the multiple available paths includes a transmission control protocol (TCP) layer; and
- distributing the first coded packets includes distributing the first coded packets among the multiple available paths based, at least in part, on TCP congestion control window dynamics.
9. The method of claim 1, wherein:
- the multiple available paths are associated with different network technologies.
10. The method of claim 1, wherein:
- obtaining a plurality of original data packets includes receiving the original data packets from an application layer.
11. A machine implemented method for use in processing coded packets received from a source node via multiple different paths, the method comprising:
- receiving coded packets associated with a first connection via multiple different paths, the coded packets each including a linear combination of original data packets;
- for each coded packet associated with the first connection that is successfully received, sending an acknowledgement message to the source node without first determining whether the coded packet is linearly independent of previously received coded packets associated with the first connection;
- forwarding all coded packets associated with the first connection, received from all paths, to a common processing layer without decoding the coded packets; and
- for each coded packet associated with the first connection forwarded to the common processing layer: determining whether the coded packet is linearly independent of coded packets associated with the connection that were previously forwarded to the common processing layer; and sending an acknowledgement message to the source node acknowledging that a new degree of freedom has been received for the first connection if the coded packet is determined to be linearly independent.
12. The method of claim 11, wherein:
- determining whether the coded packet is linearly independent includes performing a Gauss-Jordan elimination operation on a coefficient matrix.
13. The method of claim 11, wherein:
- receiving coded packets associated with a first connection via multiple different paths includes receiving coded packets from paths that use different network technologies, wherein each of the different paths uses transmission control protocol (TCP) to control a transfer of packets through the path.
14. The method of claim 13, wherein:
- sending an acknowledgement message to the source node without first determining whether the coded packet is linearly independent from previously received coded packets is performed as part of a TCP layer.
15. The method of claim 14, wherein:
- sending an acknowledgement message to the source node without first determining whether the coded packet is linearly independent of previously received coded packets is performed within a first portion of the TCP layer for coded packets received via a first path and within a second portion of the TCP layer for coded packets received via a second path that is different from the first path.
16. The method of claim 11, wherein:
- receiving coded packets associated with a first connection via multiple different paths includes receiving some coded packets via a first path following a first communication standard and receiving some other coded packets via a second path following a second communication standard that is different from the first communication standard.
17. The method of claim 11, wherein:
- forwarding all coded packets associated with the first connection to a common processing layer includes forwarding the coded packets to a multiple path transfer control protocol with network coding (MPTCP-NC) layer that is higher than a TCP layer in a corresponding protocol architecture.
18. The method of claim 17, wherein:
- the protocol architecture includes separate protocol stacks associated with each of the multiple different paths.
19. A communication device comprising:
- a first network interface unit configured for communication in a first network;
- a second network interface unit configured for communication in a second network that is different from the first network; and
- at least one processor to manage data transfer between the communication device and a destination node using multiple different paths, the at least one processor to: obtain a plurality of original packets representative of data to be transferred to the destination node; generate first coded packets by linearly combining original data packets using network coding; distribute first coded packets to the multiple different paths to the destination node; and generate second coded packets within individual paths by linearly combining first coded packets distributed to the path, using network coding.
20. The communication device of claim 19, wherein the at least one processor is configured to transmit second coded packets generated within individual paths to the destination node via corresponding networks.
21. The communication device of claim 19, wherein the at least one processor is configured to generate first coded packets by linearly combining original data packets within a sliding coding windows.
22. The communication device of claim 19, wherein the at least one processor is configured to generate second coded packets for a first path by linearly combining first coded packets within a sliding coding window.
23. The communication device of claim 22, wherein the at least one processor is configured to generate redundant second coded packets for the first path for each set of first coded packets within the sliding coding window.
24. The communication device of claim 19, wherein the at least one processor is configured to distribute the first coded packets to the multiple different paths based, at least in part, on TCP congestion control window dynamics.
25. The communication device of claim 19, wherein a first of the multiple different paths is associated with the first network interface unit and a second of the multiple different paths is associated with the second network interface unit.
26. The communication device of claim 25, further comprising;
- at least one additional network interface unit configured for communication within at least one additional network.
27. A communication device comprising:
- a first network interface unit configured for communication in a first network;
- a second network interface unit configured for communication in a second network that is different from the first network; and
- at least one processor to manage processing of coded data packets received from a source node via multiple different paths as part of a first connection, the coded data packets each including a linear combination of original data packets, the at least one processor to: send an acknowledgement message to the source node for each coded data packet associated with the first connection that is successfully received; forward coded data packets associated with the first connection, received from all paths, to a common processing layer without first decoding the coded packets; and for each coded packet associated with the first connection forwarded to the common processing layer: determine whether the coded packet is linearly independent of coded packets associated with the connection that were previously forwarded to the common processing layer; and send an acknowledgement message to the source node acknowledging that a new degree of freedom has been received for the first connection if the coded packet is determined to be linearly independent.
28. The communication device of claim 27, wherein the at least one processor is configured to discard a received coded packet if it is determined that the packet is not lineally independent.
29. The communication device of claim 27, wherein
- a first of the multiple different paths is associated with the first network interface unit and a second of the multiple different paths is associated with the second network interface unit.
30. The communication device of claim 27, further comprising:
- at least one additional network interface unit configured for communication within at least one additional network.
Type: Application
Filed: Jan 30, 2013
Publication Date: Aug 1, 2013
Patent Grant number: 9537759
Applicant: MASSACHUSETTS INSTITUTE OF TECHNOLOGY (Cambridge, MA)
Inventor: Massachusetts Institute of Technology (Cambridge, MA)
Application Number: 13/754,398
International Classification: H04L 12/56 (20060101);