Transporting operations of arbitrary size over remote direct memory access

- Microsoft

The embodiments described herein generally relate to a protocol for implementing data operations, e.g., a version of SMB, atop RDMA transports. In embodiments, systems and methods use the protocol definition, which specifies new messages for negotiating an RDMA connection and for transferring SMB2 data using the negotiated communication. A new protocol message may include new header information to determine message size, number of messages, and other information for sending the SMB2 data over RDMA. The header information is used to accommodate differences in message size requirements between RDMA and SMB2. The SMB Direct protocol allows SMB2 data to be fragmented into multiple individual RDMA messages that a receiver may then logically concatenate into a single SMB2 request or SMB2 response. The SMB Direct protocol also may allow SMB2 to transfer application data via efficient RDMA direct placement and to signal the application data's availability when the transfer is complete.

Skip to: Description  ·  Claims  ·  References Cited  · Patent History  ·  Patent History
Description
COPYRIGHT NOTICE AND PERMISSION

A portion of the disclosure of this patent document may contain material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever. The following notice shall apply to this document: Copyright© 2011, Microsoft Corp.

BACKGROUND

File access protocols, such as Server Message Block (SMB) or versions thereof, e.g., SMB2, may operate as application-layer network protocols mainly used to provide shared access to files and miscellaneous communications between nodes on a network. Historically, SMB or SMB2 operated atop transmission control protocol (TCP) transports and traditional network infrastructure. While SMB2 has been very successful as a protocol for general purpose remote file access, SMB2 has not been widely adopted for remote file access where high throughput and low latency file input/output is required.

Remote Direct Memory Access (RDMA) is a direct memory access from the memory of one computer into that of another computer without involving the operating system of the other computer. This direct transfer permits high-throughput, low-latency data transfers over a network, which is especially useful in performance-critical deployments. When an application performs an RDMA Read or Write request, the application data are delivered directly from a source memory buffer to a destination memory buffer using RDMA-capable network adapters, which do not involve the central processing unit (CPU) (also referred to simply as a processor) or operating system in the transfer. These RDMA transfers reduce latency and enable fast message transfer. Unfortunately, the benefits of RDMA have not been exploited by systems using SMB2 because SMB2 has not operated with RDMA.

Although specific problems have been addressed in this Background, this disclosure is not intended in any way to be limited to solving those specific problems.

SUMMARY

Embodiments generally relate to a protocol and processing to implement data operations, such as SMB2 operations (or other versions of SMB operations or file access protocol operations, for example), atop RDMA transports. In embodiments, the protocol definition specifies new messages for negotiation of an RDMA connection and for transferring SMB2 data, for example, using the negotiated connection. In an embodiment, a protocol for implementing SMB2 operations atop RDMA transports is the SMB Direct protocol. However, other embodiments provide for other SMB protocols, SMB protocol versions, or other data operation protocols without departing from the spirit and scope of the present disclosure. According to an embodiment, a new SMB Direct message may include new header information, which may include, but is not limited to, one or more of the following: CreditsRequested, CreditsGranted, Flags, Reserved, RemainingDataLength, DataOffset, and DataLength. The header information is used because RDMA transports support receiving messages of a size fixed only by the receiver, and SMB2 message size can vary widely from about a hundred bytes to very large messages over a million bytes. The SMB2 protocol is modified to recognize the existence of the RDMA capability, while the SMB Direct protocol adds a new layer to the networking stack to allow multiple individual RDMA messages to be logically concatenated into a single request or response to accommodate both the fixed size restrictions on RDMA messages and the indeterminate size requirements inherent to SMB2 messaging. The changes to the SMB2 protocol and the addition of the SMB Direct protocol allow for the direct transfer of data between memories of the peers. In embodiments, the SMB2 server may read from or write to a client's memory using RDMA to perform direct placement of data. The server performs an RDMA Write to the client to complete an SMB2 read and performs an RDMA Read to complete an SMB2 write. While the SMB Direct protocol allows for the direct transfer of SMB/SMB2 data between memories of peers, the SMB Direct protocol may be adapted to other protocols, according to embodiments. According to embodiments, the bi-directional, peer-to-peer nature of the SMB Direct protocol lends itself to numerous types of data transfer operations.

RDMA transports also restrict the number of messages which may be processed at any time, again with a value fixed only by the receiver. To conform to this requirement of RDMA, embodiments provide for the peers to exchange or assign “credits,” which are numeric values, requested by and granted to each mutual peer, in the protocol header that specify the number of RDMA messages the sender may send to the receiver. Credits are dynamic and are managed independently by each peer. Rules for managing and making sufficient credits available to perform SMB2 exchanges are defined by the protocol in embodiments disclosed herein.

In embodiments, the provision of independent, bidirectional credits may permit each peer to send requests and responses without explicit negotiation or prior knowledge and agreement by the receiving peer. Sequenced sends, associated with RDMA, may permit the exchange of unexpectedly large messages without invoking errors in the RDMA processing and without resorting to less efficient negotiated transfers when such conditions arise.

Additional messages may be used for negotiation of the protocol version and other parameters, according to embodiments. A Negotiate Request message may include, for example, fields for CreditsRequested, Reserved, MinVersion, MaxVersion, OutboundSendSize, MaxInboundSendSize, etc. In turn, a Negotiate Response message sent in response to the request message may include, for example, fields for CreditsRequested, CreditsGranted, Version, Reserved, Status, OutboundSendSize, InboundSendSize, etc. These parameters support the negotiation of capabilities, end-to-end optimization of resources, and compatibility with future enhanced versions of the protocol.

This Summary is provided to introduce a selection of concepts in a simplified form that is further described below in the Detailed Description. This Summary is not intended to identify key or essential features of the claimed subject matter, nor is it intended to be used in any way as to limit the scope of the claimed subject matter.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present disclosure may be more readily described by reference to the accompanying drawings in which like numerals refer to like items.

FIG. 1 illustrates an example logical representation of an environment or system for exchanging SMB2 messages over RDMA, in accordance with embodiments of the present disclosure.

FIG. 2A depicts an example logical representation of a client system for sending SMB2 messages over RDMA, in accordance with embodiments of the present disclosure.

FIG. 2B illustrates a logical representation of a server system for receiving SMB2 messages over RDMA, in accordance with embodiments of the present disclosure.

FIGS. 3A-3E show logical representations of messages sent or received when exchanging data using SMB2 messages over RDMA, in accordance with embodiments of the present disclosure.

FIGS. 4A-4C depict a flow diagram illustrating the operational characteristics of a process for negotiating a communication using SMB2 over RDMA, in accordance with embodiments of the present disclosure.

FIGS. 5A-5D show a flow diagram illustrating the operational characteristics of a process for exchanging data using SMB2 over RDMA, in accordance with embodiments of the present disclosure.

FIG. 6 illustrates a flow diagram depicting the operational characteristics of a process for exchanging data using an RDMA direct data transfer, in accordance with embodiments of the present disclosure.

FIG. 7 depicts an example computing system upon which embodiments of the present disclosure may be implemented.

DETAILED DESCRIPTION

This disclosure will now more fully describe example embodiments with reference to the accompanying drawings, in which specific embodiments are shown. Other aspects may, however, be embodied in many different forms, and the inclusion of specific embodiments in this disclosure should not be construed as limiting such aspects to the embodiments set forth herein. Rather, the embodiments depicted in the drawings are included to provide a disclosure that is thorough and complete and which fully conveys the intended scope to those skilled in the art. Dashed lines may be used to show optional components or operations.

Embodiments generally relate to systems, methods, and protocols for exchanging SMB2 data over an RDMA connection. RDMA provides advantages in transporting data. For example, RDMA makes transfers from one memory to another device's or system's memory. These transfers do not involve the processor and, thus, reduce the overhead involved in transporting data. Further, without the involvement of a processor in transport management, RDMA transmits data with fewer clock cycles. Thus, RDMA provides a low-latency, high bandwidth connection.

In general, a version of SMB, such as SMB2, is an application-layer network protocol used to provide shared access to files and miscellaneous communications between nodes on a network. SMB and SMB2 are thus examples of file access operation protocols. SMB2 allows for the transfer of data in the exchange of messages between a client(s) and a server(s). The description of some embodiments herein refers to SMB2. However, in other embodiments, any version of SMB or other file access operation protocol may be used without departing from the spirit and scope of the present disclosure.

Due, generally, to differences in operation, SMB2 (or SMB) does not currently operate over RDMA but, instead, uses TCP to transport data. The present embodiments create a system and protocol for exchanging SMB2 messages over an RDMA-capable network protocol. First, a client and/or a server, which may be collectively referred to as peers, discover the capabilities or abilities of each other using a component of SMB2. In an embodiment, a request is sent by the client asking for information about the server. The responding server may reply with information about the number of network interface cards (NICs) the server has, the internet protocol (IP) address for the NICs, the speed of the NICs, whether the NICs are RDMA capable, and/or possibly an IP address. In embodiments, the asking client may use this information to determine how to interface with the server.

If a server is RDMA capable, an RDMA-capable client may then negotiate an RDMA connection with that server, according to embodiments of the present disclosure. A new interface, referred to as the RDMA interface in embodiments, is an interface between a RDMA-capable network adapter and the other system components (including an SMB client). Further, a new module is added to the stack called an SMB Direct client/server. First, the SMB Direct client, possibly in response to the capabilities request, pre-posts a receive by reserving at least a portion of a memory buffer to receive a message from the other peer. The SMB Direct client may then send an SMB Direct Negotiate Request packet. An SMB Direct Negotiate Request begins a process of establishing an SMB Direct connection between the peers. According to embodiments, the SMB Direct Negotiate Request includes one or more fields that define how the RDMA connection will function.

In embodiments, part of the SMB Direct Negotiate Request is a request for “credits.” Credits are an allotment of space in the receiving peer's memory buffer. As RDMA transfers directly from one memory to another memory, the receiving peer reserves space in a memory buffer where the sending peer may place transported data. The memory buffers are allotted in blocks (e.g., 1 Kbyte blocks). In embodiments, the allotments are set in that the block size does not change after the SMB Direct connection has been established. In other embodiments, the allotments are not set, in which the block size may change. Where the allotments are set, any transfer to the memory buffer may not exceed the block size. To transfer SMB2 data having a size greater than the block size, the sender sends two or more packets that are stored into two or more allotments of the memory buffer. To reserve the memory allocations, the sending peer requests credits, where the number of credits requested may be governed by a local policy and is not necessarily affected by the composition of the message. Each credit represents a block of the memory buffer and, thus, represents a message the requester may send to the other peer.

In embodiments of the present disclosure, the server may send a response, e.g., an SMB Direct Negotiate Response, to the requester. This SMB Direct Negotiate Response also includes various fields that define the RDMA connection. In response to the request, the SMB Direct Negotiate Response provides a number of credits to the client. Further, the SMB Direct Negotiate Response may also ask for credits that reserve an allocation in the client's memory buffer. The exchange of the SMB Direct Negotiate Request and SMB Direct Negotiate Response, in embodiments, establish the RDMA connection. Thereafter, SMB2 data may be exchanged over the connection.

According to embodiments, the exchange of SMB2 data over the RDMA connection includes the transport of at least one packet. If the SMB2 data being sent is smaller than the block size, only one packet, called an SMB Direct Data Transfer packet, may be sent. However, if SMB2 data to be sent is larger than the block size, two or more SMB Direct Data Transfer packets may be sent, according an embodiment. RDMA allows for the sequenced reception of packets. Thus, a second packet received in a communication will be placed directly after a first received packet. To utilize this advantage of RDMA, the SMB Direct Data Transfer packets include fields to announce the total amount of SMB2 data to be sent and how much data are left to be sent after the present packet. In this way, the SMB Direct peers may determine when an SMB transfer is completed, and the SMB2 data may be reassembled. The SMB Direct Protocol, therefore, provides in embodiments for quick transfers of SMB2 data with low latency and low overhead, while overcoming the issues associated with transferring SMB2 data over RDMA.

An example logical environment or system 100 for exchanging SMB2 data over an RDMA connection is shown in FIG. 1, according to embodiments disclosed herein. Connection peers (referred to also as client 102 and server 106) 102 and 106 may move SMB2 data across an RDMA connection between RDMA NICs (RNICs) 108a and 108b over network 104. A connection peer may be any computer system as described with respect to FIG. 7, for example. The connection peers are shown as a client/application server 102 and a file server 106 in FIG. 1. However, these peers are offered by way of example only. Any type of client(s) or server(s) may function as a connection peer in the embodiments. Thus, RDMA connections may be between multiple clients, multiple servers, a server farm(s), a server cluster(s), a message server(s), or between a client(s) and a server(s), for example. The client/application server 102 and file server 106 are offered as examples only for purposes of understanding the teachings of the embodiments disclosed herein.

While SMB2 data are moved over network 104, the network 104 may be as described with respect to FIG. 7, for example. Network 104, although shown as an individual single network, may be any type of network conventionally understood by those of ordinary skill in the art. In accordance with an embodiment, the network may be the global network (e.g., the Internet or World Wide Web, i.e., “Web” for short). It may also be a local area network, e.g., intranet, or a wide area network. In accordance with embodiments, communications over network 104 occur according to one or more standard packet-based formats, e.g., H.323, IP, Ethernet, and/or ATM.

Further, in embodiments, an RNIC 108(a and/or b) may be any network interface card, network adapter, and/or network interface controller that supports RDMA. There are several vendors that offer RNICs. iWARP or InfiniBand, for example, are network protocols that support RDMA. The RNICs may support RDMA, which allows direct transfers of data from memory 110a to memory 110b and vice versa. These transfers of data do not require or include the oversight of the processor 112a or 112b. Thus, the RDMA transfers are high-bandwidth, low latency, and low overhead, according to embodiments of the present disclosure. The processors 112 and memories 110 may be as described with respect to FIG. 7, for example.

While FIG. 1 illustrates a general environment for SMB2 data exchanges over RDMA, FIG. 2A depicts an example peer 102 for sending or receiving the SMB2 data over the RDMA connection, according to embodiments disclosed herein. In this example, the peer 102 is a client and/or application server. The various components of the client 102 may include software and/or hardware, according to embodiments of the present disclosure. However, for purposes of illustration, the components hereinafter will be described as software modules. In embodiments, the client 102 includes one or more of, but is not limited to, a kernel 202a, at least one user application 220, a memory buffer 222a, one or more timers 224a, and/or one or more settings 226a. In embodiments, a “kernel” is the core of an operating system, in which memory, files, and peripheral devices are managed, applications are triggered and launched, and system resources are allocated.

In embodiments, the kernel 202a may include one or more of, but is not limited to, a WIN32® File application programming interface (API) or equivalent 204, an SMB2 module (shown as an SMB2 Client 208), an SMB Direct module (shown as an SMB Direct Client 214a), and a RDMA Interface 216a. The SMB Direct module and the RDMA Interface 216a are introduced to perform the methods and processes described herein. The modules, components, and/or interfaces will be described as follows.

In embodiments, the WIN32® File API 204 may be an interface between the kernel 202a and one or more user applications 220. In an example embodiment, the WIN32® File API 204 is a set of APIs available in the MICROSOFT WINDOWS® operating systems. Almost all WINDOWS® programs interact with the WINDOWS® API to perform functions. Embodiments of the WIN32® File API 204 provide access to the resources available to a WINDOWS® system, such as, for example, file systems, devices, and/or error handling. WIN32® File API 204 may provide access to functionality additional to the kernel, according to embodiments. In embodiments, WIN32® File API 204 additionally allows a system to perform actions on remote files that use underlying file access functions that, in turn, use various networking capabilities and allow for remote access.

In embodiments, the SMB2 Client 208 manages the communication between applications and the interfaces provided by the SMB2 Server 212b. Because devices may operate at speeds that may not match the operating system, the communication between the operating system and device drivers is primarily done through I/O request packets (IRPs). These packets are similar to network packets or WINDOWS® message packets, for example. The packets are passed from an operating system(s) to specific drivers and from one driver to another. In embodiments, the SMB2 Client 208 may redirect the I/O requests to network resources and compose SMB messages to conduct the communications over the network. The SMB2 Client 208 communicates the SMB packets to the SMB Direct Client 214a to exchange the SMB packets or data over RDMA. Similarly, the SMB2 Server 212b can also use IRPs to send file requests from the SMB2 Client 208 to the server's storage, according to an embodiment.

In embodiments, the SMB Direct Module 214 is an instance created from the SMB Direct network provider interface (NPI) in the kernel 202. The SMB Direct Module 214 exposes an API (referred to as the SMB Direct NPI) to the SMB2 client and SMB2 server modules. The SMB2 client/server modules use this SMB Direct NPI to make requests to send or receive data over an SMB Direct connection. The SMB Direct module 214 implements the SMB Direct Protocol and sits between the SMB2 client/server modules and the underlying RDMA interface, according to embodiments. The SMB Direct NPI enables the SMB Direct protocol. The SMB Direct NPI may create and destroy SMB Direct connections, send and receive data over SMB Direct connections, register/unregister memory, perform RDMA Read/Write data operations from/to a peer over a SMB Direct connection, receive notifications when an SMB Direct connection is disconnected by the peer, marshal/unmarshal SMB2 packets for transmission across a SMB Direct connection, among other operations. To accomplish these tasks, an SMB Direct Module 214 is created that may manage the sending and retrieving of the SMB2 data from the memory buffers 222 as stored by the RDMA protocol. Thus, the SMB Direct Module 214 converts data from simply SMB2 data into RDMA and back from RDMA to SMB2. The SMB Direct Module 214 communicates with another new module, the RDMA Interface 216 to execute the operations in embodiments of the present disclosure.

According to embodiments, the SMB Direct Module 214 performs several functions. An SMB DirectReceiveEvent callback function notifies SMB2 Client 208 or SMB2 Server 212b that a message has been received on a SMB Direct endpoint. An SMB DirectDisconnectEvent event callback function notifies SMB2 Client 208 or SMB2 Server 212b that a connection on an endpoint has been disconnected. An SMB DirectAcceptEvent event callback function notifies the SMB2 Server 212b that an incoming connection on a listening endpoint has been accepted. An SMB DirectListen function creates a listener endpoint that listens for incoming connections on a given local address. An SMB DirectCloseEndpoint function closes an endpoint and frees any associated resources. An SMB DirectConnect function connects an endpoint to a remote SMB Direct transport address. An SMB DirectDisconnect function disconnects an endpoint from a remote SMB Direct transport address. An SMB DirectSend function sends a buffer of data to a remote SMB Direct peer. An SMB DirectRegisterMemory function allows the SMB2 Client 208 to register memory buffers for use in RDMA Read/Write operations. An SMB DirectUnregisterMemory function unregisters memory buffers that were previously registered via the SMB DirectRegisterMemory API. An SMB DirectRdmaRead function causes RDMA to read data directly from the memory of the remote peer to which the endpoint is connected. An SMB DirectRdmaWrite function causes RDMA to write data directly into the memory of the remote peer to which the endpoint is connected. These functions and the operation thereof will be explained in conjunction with FIG. 6.

In an embodiment, the RDMA Interface 216 is a new interface to interface with the vendor-specific RDMA functionality of the RNICs. The RDMA Interface 216 can give access to the functions of the RDMA device. The functions of the RDMA device can include listening to a port to receive SMB Direct packets and provide the SMB2 data to the SMB Direct Module 214. In an embodiment, the RDMA device may include a kernel mode RDMA module to manage communications over the RDMA connection. Further, the RDMA device may include an RDMA access layer and Extensions to access and listen on a port sending RDMA messages. A proxy driver may interface with the hardware driver of the RNIC, according to embodiments.

In embodiments, the user application 220 may be any software executed by a processor for a user or other software. Examples of user applications 220 include web browsers, email, etc. These user applications 220 interface with the kernel 202a to send and receive data, especially from remote storage locations, such as the file server 106.

In an embodiment, a memory buffer 222 may be any type of memory, as described with respect to FIG. 7. The memory buffer 222 may be used to receive SMB Direct messages and/or SMB2 data carried in the messages and may be used to stage outgoing SMB Direct messages before transmitting those messages. Thus, the memory buffer 222 may be partitioned into blocks as described hereinafter, according to embodiments.

The timers 224, in embodiments, are a set of clocks that may count down from a predetermined time to zero. Thus, the timers 224 represent stored data and a clock function executed by a processor. Expiration of a timer 224 may trigger one or more functions in the RDMA Interface or with the SMB Direct Module 214. In other embodiments, the timers may count from zero to a threshold or perform some other type of counting. Some of the timers may include a SendCreditGrantTimer, which is the timer that is started when SendCreditCount reaches zero and operates in the timers section 224. The remotely connected peer has until this timer expires to grant additional Send Credits. The SendCreditGrantTimer can also regulate the amount of time that the client/server waits for the peer to grant it additional Send Credits. When the client/server finds that it cannot send a packet to the peer because the value of SendCreditsCount is zero, then the client/server sets a timer that will expire in a predetermined number of seconds, according to embodiments. If the timer expires before a Send Credit becomes available, the client/server disconnects the connection. In an embodiment, an Idle Connection Timer regulates the amount of time that the client/server waits for the peer to send a packet. If no packets have been received from the peer in the last predetermined number of seconds, the client/server sends a keep alive request to the peer and sets a KeepAliveResponse timer. If no response is received before the expiration of the KeepAliveResponse timer, then the connection is disconnected, according to embodiments of the present disclosure.

A KEEPALIVE_REQUESTED flag may also be set for any SMB Direct Data transfers over the SMB Direct Connection, according to embodiments of the present disclosure. In an embodiment, the KEEPALIVE_REQUESTED flag is a request for the receiving peer to respond to the sender as soon as possible so that the sender knows that the receiver is still connected and responsive. One or more systems may try to time out the connection without a message exchange. Thus, a message with the KEEPALIVE_REQUESTED flag can maintain the connection. In alternative embodiments, a message with the KEEPALIVE_REQUESTED flag set can be used to request or receive credits.

In an embodiment, a settings 226 store stores and retrieves data relevant to exchanging SMB messages over RDMA. The settings 226 may be stored in any type of memory or storage devices as described with respect to FIG. 7. As an example, the settings may include how many seconds after being set does the Credit Replenishment Timer expire, what is the maximum sized SMB Direct Data Transfer packet that the peer is willing to receive from another peer, or what is the limit on the number of send credits the peer will grant to another peer, etc.

The SMB_DIRECT_ENDPOINT Structure may be an opaque structure that represents a SMB Direct endpoint, in embodiments. An SMB Direct endpoint is analogous in function to a network socket, for example. SMB2 Client 208 or SMB2 Server 212b may not access the members of this structure directly but through the SMB Direct Module 214, according to an embodiment. The SMB_DIRECT_ENDPOINT Structure may include data for several operations. MwReleaseList is a list of memory windows that may be released back to the remotely connected peer. These memory windows are associated with RDMA Read/Write operations that have completed. ReceiveCreditCount is the number of Receive Credits that are to be granted to the remotely connected peer. The endpoint's host may have at least this number of receives pending on the endpoint. SendCreditCount is the number of Send Credits that the endpoint's host currently has. The remotely connected peer may have at least this number of receives pending on their endpoint. PendingRdmaReadCount is the number of RDMA Read operations that have been initiated but not yet completed on this endpoint. PendingRdmaReadLimit is the maximum number of RDMA Read operations that may be simultaneously pending on the endpoint. DeferredInitiatorOpQueue is a queue of initiator operations that are deferred because the endpoint resources to issue them are not currently available, in embodiments. NdkQp is the queue pair object that represents the receive and initiator request queues. NdkReceiveCq is the NDK receive completion queue. Receive request completions are queued to this queue. NdkReceiveQueueCapacity is the maximum number of Receive requests that may be simultaneously pending on the endpoint. NdkInitiatorCq is the NDK initiator completion queue. Send, Bind, Fast-Register, Read, Write, and Invalidate request completions are queued to this queue, for example. In embodiments, NdkInitiatorQueueCapacity is the maximum number of Send, Bind, Fast-Register, Read, Write, and Invalidate requests that may be simultaneously pending on the endpoint.

FIG. 2B depicts an example peer 106 for sending or receiving the SMB2 data over the RDMA connection, in accordance with embodiments disclosed herein. In this example, the peer 106 is a file server. The various components of the file server 106 may include software and/or hardware. While embodiments describe the components as software modules, other embodiments provide for other types of modules. In embodiments, the file server 106 includes one or more of, but is not limited to, a kernel 202b, an NTFS 232, a memory buffer 222b, one or more timers 224b, and/or one or more settings 226b. Some of these functions are the same or similar to those described in FIG. 2A.

According to embodiments, the kernel 202b (and/or as described with respect to FIG. 2A) may include one or more of, but is not limited to, an input/output (I/O) manager 206b, an SMB2 Server 212b, a SMB Direct Server 214b, and an RDMA Interface 216b. Some of these functions are the same or similar to those described in FIG. 2A.

In embodiments, SMB2 Server 212b is a driver that implements the server side of the SMB2 protocol for MICROSOFT® servers. Other embodiments provide for other types of servers. SMB2 Server 212b may start and exchange data using SMB and supply the data or receive the data from the I/O Manager 206b. In embodiments, SMB2 Server 212b sends or receives the SMB2 data over a network connection. SMB2 Server 212b functions to communicate over networks for the server. Thus, SMB2 Server 212b provides the SMB2 data for transmission or receives the SMB2 data from a network transmission, according to embodiments.

In embodiments, the New Technology File System (NTFS) 232 may be a standard file system. NTFS includes support for metadata and the use of advanced data structures to improve performance, reliability, and disk space utilization, plus additional extensions such as security access control lists (ACL) and file system journaling. NTFS functions to organize and store file data for one or more clients. This file data may be provided to the client over communications with the client, such as SMB2 data transfers over RDMA.

According to embodiments, SMB Direct creates an RDMA connection to exchange SMB2 data. The protocol creates the RDMA connection through a negotiation process. After the peers 102 and 106 negotiate the RDMA connection, either peer 102 or 106 may send SMB2 data over the RDMA connection.

According to embodiments, a first SMB Direct Negotiate Request packet 300 is shown in FIG. 3A. In turn, the associated SMB Direct Negotiate Response packet 336 is shown in FIG. 3B, in embodiments. Further, the SMB Direct Data Transfer packet 338 used to transfer data is shown in FIG. 3C, according to embodiments. Each message or packet may be created, transmitted, stored, and/or received, for example. The packets or messages may each include portions or fields that store different data, in embodiments.

Turning to FIG. 3A, an SMB Direct Negotiate Request packet 300 is shown, according to embodiments of the present disclosure. The SMB Direct Negotiate Request packet 300 may include one or more of, but is not limited to, the following fields, for example: MinVersion 302, MaxVersion 304, Reserved 306, CreditsRequested 308, PreferredSendSize 310, MaxReceiveSize 312, and/or MaxSMB2MessageSize 314. The SMB Direct Negotiate Request packet 300 may include additional or fewer fields than those shown in FIG. 3A, as represented by ellipses 316.

In an embodiment, the MinVersion field 302 may include a value for the lowest SMB Direct Protocol version that the client/requester 102 supports. The MaxVersion field 304 may store the highest SMB Direct Protocol version that the client/requester 102 supports. This value may be equal to or greater than the value of the MinVersion field 302. In embodiments, the client/requester supports all of the protocol versions (inclusive) in the range between the value in the MinVersion field 302 and the value in the MaxVersion field 304. The Reserved field 306 is simply a reserved field for future unknown requirements and is not used by the client, according to an embodiment.

The CreditsRequested field 308, according to embodiments, comprises a value for the number of send credits that the client/requester 102 is requesting from the server/receiver 106. In embodiments, the value in the CreditsRequested field 308 is greater than zero (0) to ensure that the SMB2 data may be sent in a subsequent message. However, the value in the CreditsRequested field 308 may be any number and may be set based on an average usage, according to embodiments. In alternative embodiments, the value in the CreditsRequested field 308 may be based at least on the size of the SMB message to be sent. In still further embodiments, other or additional factors may be considered in requesting credits. Also, large messages may be transferred using RDMA, in embodiments.

The PreferredSendSize field 310 may include the size (possibly in bytes) of the largest SMB Direct Data Transfer packet 338 that the client/requester 102 wishes to be able to transmit to the server/receiver 106. The MaxReceiveSize field 312 includes a size (possibly in bytes) of the largest SMB Direct Data Transfer message that the client/requester 102 will accept from the server/receiver 106. This value may be equal to the block or predetermined allocation of the memory buffer 222a that the client 102 set in embodiments. Thus, no single SMB2 data transfer will exceed the memory allocation. In embodiments, this value is greater than or equal to threshold 128, which is at least the size of the SMB Direct header in the data transfer packet and a small SMB2 message. The MaxSmb2MessageSize field 314 may include a size (possibly in bytes) of the largest SMB2 Protocol message that the client/requester will accept from the server/receiver 106, according to an embodiment. This value is predetermined and set by the client 102. In embodiments, the value in the MaxSmb2MessageSize field 314 may not be greater than the total size of the memory buffer 222a. In this way, no SMB message may overflow the memory buffer 222a. However, the value of the MaxSmb2MessageSize field 314 may be less than the amount of memory in the memory buffer 222a, as determined by a user, according to an embodiment.

Turning to FIG. 3B, an SMB Direct Negotiate Response packet 336 is shown in accordance with embodiments. The SMB Direct Negotiate Response packet 336 may be sent by a peer 106 to complete the negotiation for the RDMA connection. The SMB Direct Negotiate Response packet 336 may include one or more of, but is not limited to, the following fields, for example: MinVersion 302b, MaxVersion 304b, PreferredVersion 318, Reserved 306b, CreditsRequested 308b, CreditsGranted 320a, Status 322, PreferredSendSize 310b, MaxReceiveSize 312b, and/or MaxSMB2MessageSize 314b. The SMB Direct Negotiate Response packet 336 may include additional or fewer fields than those shown in FIG. 3B, as represented by ellipses 324.

According to embodiments, the MinVersion field 302b may include a value for the lowest SMB Direct Protocol version that the server/receiver 106 supports. The MaxVersion field 304b may store the highest SMB Direct Protocol version that the server/receiver 106 supports. This value may be equal to or greater than the value of the MinVersion field 302b. In embodiments, the server/receiver 106 supports all of the protocol versions (inclusive) in the range between the value in the MinVersion field 302b and the value in the MaxVersion field 304b. The PreferredVersion field 318 stores a value for the common SMB Direct Protocol version. The value of the PreferredVersion field 318, in embodiments, is within the range specified by MinVersion field 302a and MaxVersion field 304a of the client's SMB Direct Negotiate Request packet 300. In further embodiments, the value of the PreferredVersion field 318 is between the range specified by MinVersion field 302b and the MaxVersion field 304b of the server's SMB Direct Negotiate Response packet 336. The Reserved field 306b is simply a reserved field for future unknown requirements and is not used by the server, according to embodiments.

The CreditsRequested field 308b, according to embodiments, comprises a value for the number of send credits that the server/receiver 106 is requesting from the client/requester 102. In embodiments, the value in the CreditsRequested field 308b is greater than zero (0) to ensure that the SMB2 data may be sent in a subsequent message. However, the value in the CreditsRequested field 308b may be any number and may be set based on an average usage. In alternative embodiments, the value in the CreditsRequested field 308b is based on the size of the SMB message to be sent or may be based on other factors. Large messages may be sent when one credit is requested, according to embodiments. The CreditsGranted field 320a includes the number of credits granted from the server/receiver 106 to the client/requester 102. In embodiments, the value of the CreditsGranted field 320a is greater than zero (0) to allow the client/requester 102 to send the next SMB Direct message.

In embodiments, a status field 322 includes at least one flag or value. In an embodiment, the status field 322 includes one or two values, either status success or status not supported. The success flag designates that the server/receiver 106 has accepted the client/requester's SMB Direct Negotiate Request packet 300. The not supported flag designates that the server/receiver 106 has rejected the client/requester's 102 SMB Direct Negotiate Request packet 300.

According to embodiments, the PreferredSendSize field 310b may include the size (possibly in bytes) of the largest SMB Direct Data Transfer packet 338 that the server/receiver 106 wishes to be able to transmit to the client/requester 102. The PreferredSendSize field 310b is, in embodiments, smaller than or equal to the MaxReceiveSize field 312a in the SMB Direct Negotiate Request packet 300. Thus, the packet size sent by the server/receiver 106 may not be larger than the memory buffer 222a allotment of the client/requester 102.

In embodiments, the MaxReceiveSize field 312b includes a size (possibly in bytes) of the largest SMB Direct Data Transfer message that the server/receiver 106 will accept from the client/requester 102. This value may be equal to the block or predetermined allocation of the memory buffer 222b that the server/receiver 106 set. Thus, in embodiments, no SMB2 data transfer will exceed the memory allocation. In embodiments, this value is greater than or equal to the threshold 128, which is at least the size of the SMB Direct header in the data transfer packet. The MaxSmb2MessageSize field 314b may include a size (possibly in bytes) of the largest SMB2 Protocol message that the server/receiver 106 will accept from the client/requester 102. This value is predetermined and set by the server/receiver 106. In embodiments, the value in the MaxSmb2MessageSize field 314b may not be greater than the total size of the memory buffer 222b. In this way, no SMB message may overflow the memory buffer 222b. However, the value of the MaxSmb2MessageSize field 314b may be less than the amount of memory in the memory buffer 222b, as determined by a user, according to an embodiment.

An SMB Direct Data Transfer packet 338 is shown in FIG. 3C, in accordance with embodiments disclosed herein. The SMB Direct Data Transfer packet 338 may be sent to transfer SMB2 data across the RDMA connection established during the negotiation. Either the client/requester 102 or the server/receiver 106 may send or receive the SMB2 data. As such, both the client/requester 102 and the server/receiver 106 are referred to generally as the receiver and sender. The SMB Direct Data Transfer packet 338 may include one or more of, but is not limited to, the following fields, for example: CreditsRequested 308c, CreditsGranted 320b, Reserved 306c, RemainingDataLength 326, DataOffset 328, DataLength 330, and/or SMB2 Data 332. The SMB Direct Data Transfer packet 338 may include additional or fewer fields than those shown in FIG. 3C, as represented by ellipses 334, according to embodiments.

In embodiments, the CreditsRequested field 308c comprises a value for the number of send credits that the sender is requesting from the receiver. In embodiments, the value in the CreditsRequested field 308c is greater than zero (0) to ensure that the SMB2 data may be sent in a subsequent message. However, the value in the CreditsRequested field 308c may be any number and may be set based on predicted future usage, i.e., how many more packets are to complete the transfer of the SMB2 data. The CreditsGranted field 320b includes the number of credits granted from the sender to the receiver. In embodiments, the value of the CreditsGranted field 320b can be zero because the peer is not obligated to honor a peer's request for credits. However, the peer can provide credits in the CreditsGranted field 320b to allow the client/requester 102 to send the next SMB Direct message. The Reserved field 306c is simply a reserved field for future unknown requirements and is not used by the client, according to embodiments disclosed herein.

In embodiments, the RemainingDataLength field 326 may include a number of bytes of a fragmented SMB2 message that the receiver has yet to receive. Thus, any value other than zero in this field indicates to the receiver that another SMB Direct Data Transfer packet 338 will be sent with more data. If the SMB Direct Data Transfer packet 338 carries a complete SMB2 message or the last of two or more SMB Direct Data Transfer packets 338 that carry fragmented SMB2 data, then the value in the RemainingDataLength field 326 is zero (0), according to embodiments. RDMA is capable of sending messages sequentially. As such, reassembling messages is simplified because sequential messages are reassembled in strict order of receipt. Thus, the embodiments forgo the need for a message identifier in the header or the use of other more complicated reassembly techniques, for example.

In an embodiment, the DataOffset field 328 includes a value for the offset, in bytes, from the beginning of the SMB Direct Data Transfer packet 338 to the first 8-byte aligned byte of the encapsulated SMB2 Protocol message. In embodiments, if the value of the DataLength field 330 is zero, then the DataOffset field 328 is also set to zero. If the value of DataLength is not zero, then the DataOffset field 328 is some value and may be greater than or equal to 24 bytes, which is the size of the other fields in the header, according to an embodiment. The DataLength field 330 may include the size, in bytes, of the encapsulated SMB2 Protocol message in the SMB2 Data field 332. In embodiments, if the SMB Direct Data Transfer packet 338 does not encapsulate an SMB2 Protocol message, then the value of the DataLength field 330 is set to zero. The SMB2 Data field includes any SMB2 Protocol message, according to embodiments.

An SMB2 Read/Write Request 340 is shown in FIG. 3D, in accordance with embodiments disclosed herein. The SMB2 Read/Write Request 340 may be encapsulated in the SMB Direct Data Transfer packet 338 to perform a direct RDMA read/write. A direct RDMA read/write sends only unencoded application data. The SMB2 Read/Write Request 340 may be a SMB2 Read Request if the server 106 is to perform an RDMA write to the client 102 and a SMB2 Write Request if the server 106 is to perform an RDMA read from the client 102, according to embodiments of the present disclosure. Regardless, the fields in the SMB2 Read/Write Request 340 are similar. It should be noted that the server 106 performs the RDMA transfer that the client 102 requests. The SMB2 Read/Write Request 340 may include one or more of, but is not limited to, the following fields, for example: Channel 342, ChannelInfoOffset 346, and ChannelInfoLength 348. In an embodiment, the fields may include the steering information for completing the RDMA data transfer. The Reserved field 344 is simply a reserved field for future unknown requirements and is not used, according to an embodiment. The SMB2 Read/Write Request 340 may include additional or fewer fields than those shown in FIG. 3D, as represented by ellipses 351. The fields shown are offered for purposes of illustration and are not intended to be limiting.

In embodiments, the Channel field 342 comprises a value for the channel where the data are to be found. The RDMA connection may include several channels. The channel information may include information that the peer provides to the RDMA device to accomplish the transfer, according to embodiments. For example, the channel information may contain one or more tokens, offsets, and lengths of memory segments, along with other RDMA-specific information, as shown in FIG. 3E.

In an embodiment, the ChannelInfoOffset field 346 and the ChannelInfoLength field 348 are pointers to the offset, token, and length information in the data packet. The pointers give the location in the SMB2 read or write request where the information may be located. In embodiments, the Flags 350 include any information to control or change the behavior of the RDMA direct data transfer.

Turning to FIG. 3E, an RDMA channel descriptor 352 is shown, in accordance with embodiments disclosed herein. The RDMA channel descriptor 352 may be encapsulated in the SMB Direct request, typically in the channel field 342. The RDMA channel descriptor 352 may include one or more of, but is not limited to, the following fields, for example: Offset 354, Token 356, and Length 358. In an embodiment, the fields are the steering information for completing the RDMA data transfer. The RDMA channel descriptor 352 may include additional or fewer fields than those shown in FIG. 3E, as represented by ellipses 360. The fields shown are offered for purposes of illustration. To complete a direct data transfer, RDMA is directed to the memory with the data that is to be read or written and the length of the data based on the information in the channel field 342. As the direct data transfer sends only application data, this information helps ensure a proper transfer.

According to an embodiment, the Offset 354 is the value in bytes or bits where the data begins. The Offset 354 may be measured from a memory block starting address, which may be located using information provided in the Token 356. The Length 358 is the value in bits or bytes of how long the data segment is. This steering information guides the direct data transfer, for example.

The interactions of the various software functional modules depicted in FIGS. 2A and 2B are further illustrated in the operational steps 400 depicted in FIGS. 4A-4C for negotiating an RDMA connection, in accordance with an embodiment disclosed herein. FIG. 4A shows the representation 400A of the transfers of packets 408 and 424 between a client 102 and a server 106, according to embodiments of the present disclosure. FIG. 4B is in the perspective of the client 102, while FIG. 4C is in the perspective of the server 106, according to embodiments of the present disclosure. It should be noted that the process 400 is described as the client requesting the RDMA connection and the server 106 responding. However, the reverse could be true where the server 106 requests and the client responds, according to embodiments. Further, the method may be conducted between multiple clients and/or between multiple servers, for example.

Turning to FIG. 4B, example operational steps 400B for negotiating a connection using SMB2 over RDMA are shown in accordance with an embodiment. Process 400B is initiated at START operation 402B, and process 400B proceeds to the client pre-posting a receive 404. In embodiments, the RDMA Interface 216a automatically detects and leverages the RNICs 108a/108b to see that they are available and properly configured. Next, in an embodiment, the SMB Direct NPI is exposed by the SMB Direct module 214. The SMB Direct Client 214a may open a handle to the RDMA Adapter to form an RDMA connection for the SMB2 Client 208. In an embodiment, the SMB Direct Client 214a then requests an RDMA Connection through the RDMA Interface 216a. The SMB Direct Client 214a may also create the allocations in the memory buffer 222a and determine what to request for initial send credits and what will be allowed for receive credits. The allocations in the memory buffer 222a may be of a predetermined size, in embodiments. The total amount of memory in the memory buffer 222a may also be determined, according to embodiments.

Returning to FIG. 4B, the SMB Direct Client 214a may then construct 406 the SMB Direct Negotiate Request packet 300 to send to the RDMA Interface 216a. The SMB Direct Negotiate Request packet 300 is as described in FIG. 3A, for example. As such, the SMB Direct Client 214a sets the fields in the SMB Direct Negotiate Request packet 300. Thus, the SMB Direct Client 214a determines the minimum and maximum versions of SMB Direct that the client 102 may support. The SMB Direct Client 214a determines the number of credits to request. The number of credits to request may be based on known future SMB messages that are to be sent, on predicted SMB traffic that may occur in the future, on historical use of SMB, or by some other method, in embodiments. The send size is determined by internal functions and speed considerations. The maximum receive size may be pegged to the size of the memory buffer allotment. Finally, a maximum SMB message size is determined and set (generally, the maximum SMB message size is large enough to allow a peer to send large SMB2 packets but not so large as to use all the memory). This collected information is entered into the SMB Direct Negotiate Request packet 300, according to embodiments of the present disclosure.

Next, the SMB Direct Client 214a sends 408B the SMB Direct Negotiate Request packet 300. In an embodiment, the SMB Direct Client 214a requests the RDMA Interface 216a to send the SMB Direct Negotiate Request packet 300 by a send operation. The RDMA Interface 216a communicates with the RDMA Interface 216b and sends the SMB Direct Negotiate Request packet 300 to the server 106. With the sending of the SMB Direct Negotiate Request packet 300, the SMB Direct Client 214a begins a negotiation request expiration timer 410. The negotiation request expiration timer may have a predetermined value and counts down from the value to zero. The SMB Direct Client 214a then waits for a response to the SMB Direct Negotiate Request packet 300. If it is determined 411 that the negotiation request expiration timer expires before receiving a response to the SMB Direct Negotiate Request packet 300, process 400B proceeds NO to END operation 432B where the connection is dropped.

Turning to FIG. 4C at start operation 402C, the server 106 anticipates the SMB Direct Negotiate Request packet 300 and pre-posts a receive 412, in embodiments. The SMB2 Server 212b may listen for activity and automatically discover that RDMA is available and bound on a port of the RNIC 108b, according to embodiments. SMB2 Server 212b opens a listener endpoint and determines that RDMA is available on the RNIC 108b. Next, SMB2 Server 212b then begins communications with the SMB Direct Server 214b. The SMB Direct Server 214b may open an RDMA connection and receive an RDMA Adapter Handle. The SMB Direct Server 214b then requests an RDMA Connection through the RDMA Interface 216b. The SMB Direct Server 214b may then accept the RDMA Connection, thereby informing the client 102 that the connection request succeeded. This success indication from the RDMA interface 216a triggers the client to send the negotiate request.

The SMB Direct Server 214b may also create the allocations in the memory buffer 222b and determine what to request for initial send credits and what will be allowed for receive credits, according to embodiments. The allocations in the memory buffer 222b may be of a predetermined size. The total amount of memory in the memory buffer 222b may also be determined, according to embodiments of the present disclosure.

Returning to FIG. 4C, the SMB Direct Server 214b may then start a negotiation request expiration timer 414. The negotiation request expiration timer at the server 106 may have a predetermined value and counts down from the value to zero, in embodiments. The SMB Direct Server 214b then waits to receive the SMB Direct Negotiate Request packet 300. While waiting, the SMB Direct Server 214b monitors the RDMA connection and determines 416 if the negotiation request expiration timer expires before receiving the SMB Direct Negotiate Request packet 300. If the negotiation request timer expires, process 400C proceeds NO to END operation 432C where the connection is dropped.

However, if the SMB Direct Negotiate Request packet 300 arrives at the RNIC 108b and the RDMA Interface 216b before the expiration of the negotiation request expiration timer, process 400C proceeds YES to receiving and validating 408C, by the RDMA Interface 216b, the SMB Direct Negotiate Request packet 300 and placing the packet in the memory buffer 222b. Next, the SMB Direct Server 214b is informed of the data and determines if the SMB Direct Negotiate Request packet 300 is valid 418.

In embodiments, to validate the SMB Direct Negotiate Request packet 300, the SMB Direct Server 214b reads the data from the SMB Direct Negotiate Request packet 300 and determines the following, for example: if the value of the MaxVersion field 304 is less than the value of the MinVersion field 302 or if no value in the range is supported; if the value of the CreditsRequested field 308 is zero; if the value of the MaxReceiveSize field 312 is less than a predetermined threshold (e.g., 128 bytes); or, if the value of the MaxSmb2MessageSize field 314 is less than a predetermined threshold. If any of the above are true, then the SMB Direct Negotiate Request packet 300 is not valid, and process 400C proceeds NO to END operation 432C, where SMB Direct Server 214b disconnects. If all of the checks are not true, then process 400C proceeds YES to process SMB Direct Response 420, in which the SMB Direct Negotiate Request packet 300 is sent to the SMB Direct Client 214a by the SMB Direct Server 214b.

In processing the SMB Direct Negotiate Request packet 300, the SMB Direct Server 214b sets the setting Protocol Version to equal the maximum protocol version shared by the client 102 and the server 106, in embodiments. The SMB Direct Server 214b also determines how many credits to grant. The number of credits to grant depends, in embodiments, on the available space in the memory buffer 222b or other factors. After making these determinations, the SMB Direct Server 214b generates 422 the SMB Direct Negotiate Response packet 336. In an embodiment, the SMB Direct Server 214b sets the fields in the SMB Direct Negotiate Response packet 336. For example, the PreferredVersion field is set to the value in the setting Protocol Version. The CreditsGranted 320a is set to the number of credits determined by the SMB Direct Server 214b, according to an embodiment. The other fields are filled in a similar fashion to the SMB Direct Negotiate Request packet 300. The SMB Direct Negotiate Response packet 336 is then sent 424C to the client. In embodiments, the SMB Direct Server 214b sends the SMB Direct Negotiate Response packet 336 to the RDMA Interface 216b to send.

Returning to FIG. 4B, the SMB Direct client 214a may, according to embodiments, determine 411 if the response was received before expiration of the negotiate expiration timer. If the response was received before expiration of the timer, process 400B proceeds YES to validate 424B the SMB Direct Negotiate Response packet 336. The RDMA Interface 216a receives the message into the memory buffer 222a. The SMB Direct Client 214a reads the SMB Direct Negotiate Response packet 336 and then validates the message. To validate the SMB Direct Negotiate Response packet 336, the SMB Direct Client 214a determines 428 the following, for example: if the Status field 322 is not STATUS_SUCCESS; if the PreferredVersion field 318 does not contain a value that is within the range specified by the MinVersion field 302 and MaxVersion field 304 of the client's SMB Direct Negotiate Request packet 300; if the CreditsRequested field 308b is zero; if the CreditsGranted field 320a is zero; if the PreferredSendSize field 310b is greater than the value specified by the MaxReceiveSize field 312 of the client's SMB Direct Negotiate Request packet 330; or if the MaxReceiveSize field 312b is less than a predetermined threshold (e.g., 128 bytes), according to embodiments of the present disclosure. If any of the above are true, then the SMB Direct Negotiate Response packet 336 is not valid, and process 400B proceeds NO to END operation 432B, in which the SMB Direct Client 214a disconnects. If all of the checks are not true, then process 400B proceeds YES to process SMB Direct Response 430, in which the SMB Direct Negotiate Response packet 336 is processed by the SMB Direct Client 214a, according to embodiments disclosed herein.

In processing the SMB Direct Negotiate Response packet 336, the SMB Direct Client 214a completes the following, in embodiments, for example: sets the connection's PeerTargetSendCreditsCount setting in the settings 226 equal to the CreditsRequested field 308b; sets the connection's SendCreditsCount setting in the SMB Direct connection properties equal to the CreditsGranted field 320a; sets the connection's MaxSendSize setting in the settings 226 equal to the MaxReceiveSize field 312b; sets the connection's ReceiveSize setting in the settings 226 equal to the PreferredSendSize field 310b; sets the connection's MaxOutboundFragmentedMessageSize setting in the settings 226 equal to the MaxReceiveSize field 312b; and sets the connection's Idle Connection timer to expire in a predetermined amount of time (e.g., a few hours, a few minutes, etc.) and starts the timer. Once the above steps have been performed, the RDMA connection negotiation has completed, and the client 102 and server 106 may begin exchanging SMB Direct Data Transfer packets. Process 400B then terminates at END operation 432B.

The interactions of the various software functional modules depicted in FIGS. 2A and 2B are further illustrated in the operational steps 500 depicted in FIGS. 5A-5D for exchanging SMB2 data over the established RDMA connection, in accordance with an embodiment disclosed herein. In request and response communications, SMB2 data are transmitted between the client 102 and the server 106 over the established SMB Direct connection by encapsulating the SMB2 data as the data payload of a SMB Direct Data Transfer packet 338, according to embodiments of the present disclosure. Request and response communications may be used for control channel communications, the exchange of file metadata, or for other processes, for example. FIG. 5A shows a representation 500A of the transfer of packets 516/518, 530, 532, and 546 between a client(s) 102 and a server(s) 106, according to embodiments of the present disclosure. FIGS. 5B and 5C are in the perspective of the client 102, while FIG. 5D is in the perspective of the server 106. It should be noted that the process 500 is described as the client sending SMB2 data to the server 106 either independently or in response to a request from the server. However, the reverse could be true where the server 106 responds to a request and sends data back to the client 102. Thus, the processes described herein apply where the server sends requests and requires credits, in embodiments. Further, the method may be conducted between multiple clients or between multiple servers, for example.

Turning to FIG. 5B, example operational steps 500B for exchanging data using SMB2 over RDMA are illustrated in accordance with embodiments disclosed herein. Start operation 502 is initiated, and process 500B proceeds to the client 102 establishing an RDMA connection 504. Establishing an RDMA connection may be as described with respect to FIGS. 4A-4C, according to embodiments of the present disclosure. In an embodiment, the SMB Direct Client 214a sets several connection properties including setting the MaxSendSize setting equal to the MaxReceiveSize field 312b, setting the connection's ReceiveSize setting equal to the PreferredSendSize field 310b, and setting the connection's MaxOutboundFragmentedMessageSize setting in the settings 226 equal to the MaxReceiveSize field 312b. In this way, the client 102 has established the size of packets that may be sent to the server 106, according to embodiments.

Returning to FIG. 5B, the SMB Direct Client 214a may determine the number of SMB Direct Data Transfer packets that are needed to transport an SMB2 message to the server 106, according to embodiments. To make the determination, the SMB Direct Client 214a may first retrieve the MaxSendSize (designated as R) as set during the negotiation. The SMB Direct Client 214a may then determine the size of the SMB2 protocol message(s) that is to be sent; this value is set as “S” in embodiments. The SMB Direct Client 214a also determines the number of bytes (designated as P, in embodiments) consumed by the SMB Direct Data Transfer packet 338 header and the padding (e.g., the DataOffset 328, etc.) for the SMB Direct Data Transfer packet 338 payload to begin on an 8-byte-aligned boundary. In embodiments, the SMB Direct Client 214a then determines if S is less than or equal to (R−P), and, if so, may determine the number of packets by dividing S by (R−P), according to embodiments.

Process 500B then proceeds to determining, by the SMB Direct Client 214a, if fragmentation is used 510. In an embodiment, fragmentation is used when the SMB2 protocol message does not “fit” into a single SMB Direct Data Transfer packet 338. In other words, is S greater than (R−P)? If fragmentation is not to be used, process 500B proceeds NO to send one SMB Direct Data Transfer Packet 518B. On the other hand, if fragmentation is used, process 500B proceeds YES to initialize transmission buffer bytes 512.

Returning to step 518, an SMB2 protocol message may be transmitted without fragmentation, according to embodiments. A portion of the memory buffer 222a, used for transmitting packets, is initialized. In embodiments, the first P bytes are initialized to set the header information for the SMB Direct Data Transfer packet 338 and to add zeroed padding bytes to ensure that the SMB2 payload is 8-byte aligned. Thus, in the buffer, the DataOffset field 328 is set to P, the DataLength field 330 is set to S, and RemainingDataLength field 326 is set to zero (0), according to embodiments. Information about, or requests for, credits are stored in the CreditsRequested field 308c or the CreditsGranted field 320b. Thus, the client 102 may request or grant more credits in the SMB Direct Data Transfer packet 338. In embodiments, the sender of an SMB Direct Data Transfer packet must set the CreditsRequested field to at least one (1). The next S bytes are initialized in the memory buffer 222a, and the SMB2 protocol message is stored in the SMB2 data field 332. After the SMB Direct Data Transfer packet 338 is assembled, the SMB Direct Client 214a sends 518B the packet to the RDMA Interface 216a to send through the RDMA connection to the server 106. Process 500B then proceeds through page connector B 522 to END operation 548C, and process 500B terminates.

Returning to step 512, where S is greater than (R−P), the SMB2 protocol message is divided into two or more portions, according to embodiments. The portions are then sequenced to be sent in a series of RDMA Send operations, each of which carries a portion (called a fragment) of the SMB2 protocol message. To divide the SMB2 protocol message into fragments, the size of the first payload is set to X, which is equal to or less than (R−P). X represents the number of bytes of the SMB2 protocol message that will be transmitted by at least the first RDMA Send operation, according to an embodiment. As with step 518B, a portion of the memory buffer 222a, used for transmitting packets, is initialized. In embodiments, the first P bytes are initialized to set the header information for the SMB Direct Data Transfer packet 338 plus the padding bytes. Next, the DataOffset field 328 is set to P, the DataLength field 330 is set to X, and the RemainingDataLength field 326 is set to (S−X) 514. The RemainingDataLength field 326 indicates, in embodiments, how many bytes of the SMB2 protocol message remain to be transmitted. In embodiments, when the last fragment of the SMB2 protocol message is transmitted, (S−X) will be zero (0) In the last packet, the RemainingDataLength field 326 is then set to zero, which indicates that the SMB Direct Data Transfer packet 338 carries the last fragment of the SMB2 protocol message. Information about, or requests for, credits are stored in the CreditsRequested field 308c or the CreditsGranted field 320b. Thus, in embodiments, the client 102 requests or grants more credits in the SMB Direct Data Transfer packet 338. The next X bytes of the memory buffer 222a are then initialized. The fragment of the SMB2 protocol message may then be stored in the SMB2 data field 332, according to embodiments disclosed herein.

After assembling the SMB Direct Data Transfer packet 338 with the first untransmitted X bytes of the SMB2 protocol message, the SMB Direct Data Transfer packet 338 is sent 516B to the server. The SMB Direct Client 214a sends 516B the SMB Direct Data Transfer packet 338 to the RDMA Interface 216a to send through the RDMA connection to the server 106, in embodiments. Process 500B then proceeds through page connector A 520 to optional step 526, in which the SMB Direct Client 214a determines 526 if another SMB Direct Data Transfer packet 338 containing another fragment is to be sent. Thus, in embodiments, the SMB Direct Client 214a determines if, after sending the last packet, (S−X) is 0. If (S−X) is 0 or less than zero, meaning the last fragment was sent in the last SMB Direct Data Transfer packet 338, process 500B proceeds NO to END operation 548C, and process 500B terminates. However, if (S−X) is greater than 0, then process 500B proceeds YES to determine if enough send credits exist 528. In embodiments, step 526 may be optional (as shown) because the SMB Direct Data Transfer packets 338 transporting the fragments may be pre-staged in the memory buffer 222a and sent sequentially. When the buffer 222a is empty, process 500B terminates. There is then no need to determine if a next packet is to be sent. It should be noted that when an SMB2 protocol message is fragmented, the RDMA Send operations that carry the fragments are sent sequentially and monotonically, and may not be interrupted by other RDMA Send operations that are unrelated to the fragmented SMB2 protocol message, according to embodiments. The RDMA transport ordering, at the receiver, will preserve the sequencing of the fragments, such that the receiving peer may reconstruct the original message, according to an embodiment.

Returning to step 528, the SMB Direct Client 214a determines 528 if there are enough send credits to send the next fragment. Essentially, the SMB Direct Client 214a determines if there is one send credit left. If there is a send credit, process 500B proceeds YES through page connector C 524 to step 512. In embodiments, RDMA transports have a receive buffer pre-posted by the receiver before the sender may send a packet. This rule requires coordination between the sender and receiver to ensure that senders do not attempt to send a packet before the receiver has pre-posted a receive. SMB Direct uses a system of Send Credits to achieve the desired coordination, according to embodiments of the present disclosure.

In an embodiment, a send credit granted from a server 106 or client 102 represents a single pre-posted receive on the server 106 or client 102. The peer is entitled to perform one RDMA Send operation with one send credit. The client 102 may also provide information to the server 106 about how many send credits the client 102 needs to efficiently support its workload by setting the CreditsRequested field 308c or in the form of a separate Send Credit Request (which is not to be sent in the middle of a set of SMB Direct Data Transfer packets 338 transporting fragmented SMB2 data, according to an embodiment). Send credits are granted in the CreditsGranted field 320b of the SMB Direct Data Transfer packet 338. Thus, the SMB Direct Client 214a may request more send credits by setting the value in the CreditsRequest field 308c to a higher number (possibly covering all the future fragmented packets). The SMB Direct Client 214a may then wait 530C to receive send credits by awaiting a SMB Direct Data Transfer packet 338, sent from the server 106, which has no data but includes send credits in the CreditsGranted field 320b of the SMB Direct Data Transfer packet 338, according to an embodiment. The SMB Direct Client 214a may determine at query 532C if credits are received by setting a SendCreditsGrantedTimer in the timers 224a. If the SendCreditsGrantedTimer expires 547 before credits are granted, process 500B proceeds YES to fail, or terminate, the transmission operation, and process 500B then terminates at END operation 548C. However, if credits are granted 532C, process 500B proceeds YES through page connector C 524 to step 512. According to an embodiment, if no timer is set and/or the timer has not expired 547, process 500B proceeds NO and does not fail, or terminate, the transmission operation but, instead, continues to wait 530C to receive send credits.

Turning to FIG. 5D and start operation 502D for process 500D, at the server 106, the SMB Direct Data Transfer packet 338 is received 516D. In an embodiment, the RDMA Interface 216b receives the SMB Direct Data Transfer packet 338 and sends it to the SMB Direct Server 214b. First, the SMB Direct Server 214b reads the header information. The SMB Direct Server 214b proceeds to query 536 to determine if the RemainingDataLength field 326 is something other than zero. If the RemainingDataLength field 326 is zero, then the SMB Direct Data Transfer packet 338 is the only packet, and process 500D proceeds NO to process SMB packet data 544 where the SMB Direct Server 214b processes the packet data and sends the SMB2 protocol message to SMB2 Server 212b, in an embodiment. If the RemainingDataLength field 326 is something other than zero, process 500D proceeds YES to allocate reassembly buffer 538.

In step 538, the SMB Direct Server 214b allocates a portion of the memory buffer 222b to reassemble the SMB2 protocol message in the buffer 222b, in an embodiment. Thus, the SMB Direct Server 214b may allocate enough blocks in the memory buffer 222b to accept the data in the current SMB Direct Data Transfer packet 338 and in the upcoming SMB Direct Data Transfer packets 338, which is based on the DataLength field 330 value plus the RemainingDataLength field 326 value. The SMB Direct Server 214b may then read the SMB2 data from the SMB2 data field 332 and copy the data into the allocated buffer 540, according to embodiments of the present disclosure.

Next, an embodiment provides for the SMB Direct Server 214b to wait and receive 546D the next SMB Direct Data Transfer packets 338. The data in the SMB2 data field 332 is also copied into the next portion of the memory buffer 222b, in an embodiment. In an alternative embodiment, the SMB Direct Server 214b may optionally determine if this next SMB Direct Data Transfer packet 338 is the last packet containing fragmented data 542. To make the determination, the SMB Direct Server 214b checks if the RemainingDataLength field 326 is zero. If the RemainingDataLength field 326 is zero, the SMB Direct Server 214b understands that no more packets will be received, and process 500D proceeds YES to process the SMB2 protocol message 544 from the memory buffer 222b. Process 500D then terminates at END operation 548D. On the other hand, if the RemainingDataLength field 326 is something other than zero, the SMB Direct Server 214b understands that another packet(s) will be received, and process 500D proceeds NO to receive 546D the next SMB Direct Data Transfer packet 338, according to an embodiment.

The interactions of the various software functional modules depicted in FIGS. 2A and 2B are further illustrated in the operational steps 600 depicted in FIG. 6 for performing an RDMA Direct Data Transfer in accordance with an embodiment disclosed herein. FIG. 6 shows the representation of the data transfer between a client 102 and a server 106, according to embodiments of the present disclosure.

As shown in FIG. 6, process 600 is initiated at start operation 602, and the SMB2 Client 208 creates 604 an SMB2 read/write request 340. An application 220 can request data to be transferred to the server 106 or read from the server 106. The data transfer may be directed by the SMB2 Client 208 to employ an RDMA direct data transfer, such that the SMB2 server 106 uniquely performs the actual RDMA request, in accordance with embodiments of the present disclosure. To begin the transfer, the SMB2 Client 208 registers the target memory buffers to provide or receive unencoded application data, and then generates an SMB read/write command that is to be sent and conducted over RDMA. The command may be included in the SMB2 read/write request 340 and sent to the SMB Direct Client 214a, which encapsulates 606 the SMB2 read/write request 340 in a SMB Direct Data Transfer packet. In embodiments, the SMB2 read/write request 340 is stored in the SMB2 Data field 332 in the SMB Direct Data Transfer packet. The SMB Direct Data Transfer packet may then be sent 608 over the RDMA interface 216 to the server 106. In an embodiment, the server 106 receives the SMB Direct Data Transfer packet and reads the SMB2 read/write request 340 from the SMB2 Data field 332. The SMB Direct Server 214b may then read the data from the SMB2 read/write request 340, including the channel 342, ChannelInfoOffset 346, and the ChannelInfoLength 348. This information may direct the SMB Direct Server 214b to the steering information 610 in the RDMA Channel Descriptor 352. From the steering information, the SMB Direct Server 214b may start the direct data transfer by sending or retrieving 612 unencoded application data to or from a memory location in a buffer of the client 102. Process 600 then terminates at END operation 614.

FIGS. 4A-4C, 5A-5D, and 6 illustrate example operational characteristics for negotiating a communication using a file access protocol over RDMA and exchanging data using a file access protocol over RDMA, respectively, in accordance with embodiments disclosed herein. In embodiments, operational steps depicted may be combined into other steps and/or rearranged. Further, fewer or additional steps may be used, for example.

In general, credits are an advantage to the present embodiments. A send credit represents a buffer that has been pre-posted to receive incoming data on a peer. As such, a send credit represents limited receiver resources (memory, memory regions, etc.) that have been committed to the peer so that the peer may use them to transmit data. Due to the nature of RDMA, a receive, once-posted, may not be canceled, in embodiments. For the resources that are associated with the receive to be released, the resources are used to service an incoming send. This poses a possible problem in embodiments if the receiving peer starts to run low on resources and wants to reclaim some of the resources it has dedicated to outstanding receives. Since receives may not be cancelled according to embodiments disclosed herein, reclaiming these resources relies on the cooperation of the peer.

A solution to this issue in SMB Direct is referred to as the Send Credit Revocation, according to an embodiment. An SMB Direct Data Transfer packet that is transmitted to the peer has a CreditsGranted field that specifies how many additional send credits have been granted to the peer. In alternative embodiments, by setting a CreditRevocation flag (not shown) in the SMB Direct Data Transfer packet 338, the meaning of the CreditsGranted field 320b changes to be the number of Send Credits that the peer may keep. For example, if the CreditRevocation flag were set and the value of CreditsGranted field 320b is ten (10) then ten (10) is the number of credits the receiver may retain. If the receiver currently holds more than ten (10) Send Credits, then the receiver performs a series of RDMA Send operations to use up the revoked Send Credits so that the receiver ends up with ten (10) or fewer Send Credits. The send operations that are performed to use up revoked Send Credits may include sending an empty SMB Direct Data Transfer packet 338 (a packet with no data payload). Upon receiving the incoming empty SMB Direct Data Transfer packets 338, the peer that revoked the Send Credits may release these resources or reuse them as it sees fit, according to embodiments.

While Send Credits allow two peers to synchronize their Send and Receive operations, the peers, in embodiments, attempt to avoid send credit deadlocks. Imagine a scenario in which Peer Y has a single Send Credit from peer X and where peer X has a single Send Credit from peer Y. In this scenario, each peer is entitled to perform a single send to its peer at any time. Imagine that both X and Y simultaneously perform a Send operation, using their single Send Credit in the process, but the SMB Direct Data Transfer packet 338 that is transmitted by both peers grants no additional Send Credits to the peer. The resulting state is known as a Send Credit deadlock. Both X and Y have used their last Send Credit. To be able to send additional packets to the peer, each peer needs additional Send Credits. However, Send Credits are granted via SMB Direct Data Transfer packets 338, and neither peer may perform any further send operations. The result is that neither peer may transmit any additional packets, and there is no mechanism by which they may acquire further send credits. A deadlock has therefore occurred.

A solution to this issue is simply a rule, according to embodiments of the present disclosure. When an SMB Direct peer uses its last Send Credit, the SMB Direct Data Transfer packet 338 that is being sent grants at least one (1) additional send credit to the peer, according to an embodiment. If both peers follow this rule, the deadlock does not occur because each peer will always be able to respond to the other with a grant of additional Send Credits.

Finally, FIG. 7 illustrates an example computing system 700 upon which embodiments disclosed herein may be implemented. A computer system 700, such as client/application server 102 or file server 106, for example, which has at least one processor 702 for exchanging message data as shown herein, is depicted in accordance with embodiments disclosed herein. The system 700 has a memory 704 comprising, for example, system memory, volatile memory, and nonvolatile memory. In its most basic configuration, computing system 700 is illustrated in FIG. 7 by dashed line 706. Additionally, system 700 may also include additional storage (removable and/or non-removable) including, but not limited to, magnetic or optical disks or tape. Such additional storage is illustrated in FIG. 7 by removable storage 708 and non-removable storage 710.

The term computer readable media as used herein may include computer storage media. Computer storage media may include volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. System memory 704, removable storage 708, and non-removable storage 710 are all computer storage media examples (i.e., memory storage.) Computer storage media may include, but is not limited to, RAM, ROM, electrically erasable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which may be used to store information and which may be accessed by computing system 700. Any such computer storage media may be part of system 700. The illustration in FIG. 7 is intended in no way to limit the scope of the present disclosure.

The term computer readable media as used herein may also include communication media. Communication media may be embodied by computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and includes any information delivery media. The term “modulated data signal” may describe a signal that has one or more characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media may include wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, radio frequency (RF), infrared, and other wireless media.

System 700 may also contain communications connection(s) 716 that allow the device to communicate with other devices. Additionally, system 700 may have input device(s) 714 such as a keyboard, mouse, pen, voice input device, touch input device, etc. Output device(s) 712 such as a display, speakers, printer, etc. may also be included. All of these devices are well known in the art and need not be discussed at length here. The aforementioned devices are examples and others may be used.

Having described embodiments of the present disclosure with reference to the figures above, it should be appreciated that numerous modifications may be made to the embodiments that will readily suggest themselves to those skilled in the art and which are encompassed within the scope and spirit of the present disclosure and as defined in the appended claims. Indeed, while embodiments have been described for purposes of this disclosure, various changes and modifications may be made which are well within the scope of the present disclosure.

Similarly, although this disclosure has used language specific to structural features, methodological acts, and computer-readable media containing such acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific structure, acts, features, or media described herein. For example, while specific names or naming conventions have been used in describing aspects of the embodiments, such as names for APIs, routines, etc., numerous modifications may be made to such names and/or naming conventions which are encompassed within the spirit and scope of the present disclosure. The specific structures, features, acts, names, naming conventions, and/or media described above are disclosed as example forms of implementing the claims. Aspects of embodiments allow for multiple client/application servers, multiple file servers, multiple networks, multiple connection peers, etc. Or, in other embodiments, a single client computer with a single server and a single network are used. One skilled in the art will recognize other embodiments or improvements that are within the scope and spirit of the present disclosure. Therefore, the specific structure, acts, or media are disclosed as example embodiments of implementing the present disclosure. The disclosure is defined by the appended claims.

Claims

1. A computer-implemented method for establishing a connection that exchanges data using server message block (SMB/SMB2) over remote direct memory access (RDMA), the method comprising:

Receiving, by a receiver, a first SMB Direct Data Transfer packet, wherein the first SMB Direct Data Transfer packet comprises a RemainingDataLength field and SMB2 data, wherein the RemainingDataLength field is equal to a number of bytes of a fragmented message that the receiver has yet to receive;
determining if the RemainingDataLength field is non-zero;
when the RemainingDataLength field is zero: processing the SMB2 data in the first SMB Direct Data Transfer packet;
when the RemainingDataLength field is non-zero: allocating a reassembly buffer to the connection; copying the SMB2 data from the first SMB Direct Data Transfer packet into the reassembly buffer; receiving at least one other SMB Direct Data Transfer packet; and copying the SMB2 data from the at least one other SMB Direct Data Transfer packet into the reassembly buffer.

2. The method of claim 1, wherein the SMB Direct Data Transfer packet comprises one or more fields from the group consisting of: CreditsRequested, CreditsGranted, DataOffset, and DataLength.

3. The method of claim 2, wherein the CreditsGranted field delineates how many credits were provided to a sender of the SMB Direct Data Transfer packet.

4. The method of claim 1, wherein a credit is a value for the number of RDMA messages that may be sent by the sender.

5. The method of claim 1, further comprising:

determining if the at least one other SMB Direct Data Transfer packet is a last packet;
if the at least one other SMB Direct Data Transfer packet is the last packet, processing the SMB2 data in the reassembly buffer;
if the at least one other SMB Direct Data Transfer packet is not the last packet, repeating receiving at least one other SMB Direct Data Transfer packet and copying the SMB2 data from the at least one other SMB Direct Data Transfer packet into the reassembly buffer.

6. The method of claim 5, wherein the reassembly buffer includes data from at least two SMB Direct Data Transfer packets.

7. The method of claim 5, wherein the determining if the first SMB Direct Data Transfer packet is a last packet comprises determining if the RemainingDataLength equals zero.

8. The method of claim 1, further comprising:

pre-posting a receive of a SMB Direct Negotiate Request;
setting a negotiation request expiration timer;
determining if the SMB Direct Negotiate Request was received before the negotiation request expiration timer expired;
if the SMB Direct Negotiate Request was not received before the negotiation request expiration timer expired, cancelling the receive of the SMB Direct Negotiate Request;
if the SMB Direct Negotiate Request was received before the negotiation request expiration timer expired, receiving the SMB Direct Negotiate Request, wherein the SMB Direct Negotiate Request requests to establish the connection to send SMB2 data over RDMA;
validating the SMB Direct Negotiate Request;
processing the SMB Direct Negotiate Request;
in response to processing the SMB Direct Negotiate Request, generating a SMB Direct Negotiate Response, wherein the SMB Direct Negotiate Response allows the connection and assigns at least one credit to a sender of the SMB Direct Negotiate Request to send the SMB Direct Data Transfer packet;
sending the SMB Direct Negotiate Response.

9. The method of claim 8, wherein the SMB Direct Negotiate Request comprises one or more fields from the group consisting of: MinVersion, MaxVersion, CreditsRequested, PreferredSendSize, MaxReceiveSize, and MaxSMB2MessageSize.

10. The method of claim 9, wherein the SMB Direct Negotiate Response comprises one or more fields from the group consisting of: MinVersion, MaxVersion, PreferredVersion, CreditsRequested, CreditsGranted, Status, PreferredSendSize, MaxReceiveSize, and MaxSMB2MessageSize.

11. A computer readable storage device storing computer-executable instructions that when executed by a processor perform a method for exchanging data using data operations over remote direct memory access (RDMA), the method comprising:

establishing a connection with a server;
negotiating a connection with the server, wherein the negotiation establishes a maximum number of bytes the server will receive in the connection;
determining a number of packets to send data associated with the connection to the server;
determining if fragmentation of the data is to be used;
when fragmentation of the data is not to be used: sending a first protocol packet to the server with the data;
when fragmentation of the data is to be used: initializing first bytes within a transmission buffer to send as data in the first protocol packet; setting a DataLength field and a RemainingDataLength field in the first protocol packet, wherein the RemainingDataLength field is equal to a number of bytes of a fragmented message that the server has yet to receive; sending the first protocol packet to the server; for at least a second protocol packet, instructions to repeat, comprising: initializing bytes within the transmission buffer to send as data in the at least a second protocol packet; setting the RemainingDataLength field in the at least a second protocol packet; and sending the at least a second protocol packet to the server.

12. The computer readable storage device of claim 11, wherein the determining if fragmentation is to be used comprises determining if the number of packets to send the data is greater than one.

13. The computer readable storage device of claim 12, wherein the determining a number of packets to send data associated with the connection to the server comprises determining if the maximum number of bytes the server will receive in the connection is less than a number of bytes of data to be sent.

14. The computer readable storage device of claim 11, wherein negotiating the connection with the server comprises:

pre-posting a receive;
constructing a protocol negotiate request;
sending the protocol negotiate request to the server;
setting a negotiate expiration timer;
determining if a response to the protocol negotiate request is received before the negotiate expiration timer expires;
if a response to the protocol negotiate request is not received before the negotiate expiration timer expires, resending the protocol negotiate response;
if a response to the protocol negotiate request is received before the negotiate expiration timer expires, receiving a protocol negotiate response;
validating the protocol negotiate response; and
processing the protocol negotiate response, wherein the protocol negotiate response provides at least one credit to the sender to send the protocol packet.

15. The computer readable storage device of claim 14, further comprising:

if enough credits exist for sending the at least a second protocol packet, sending the at least a second protocol packet;
if enough credits do not exist to send the at least a second protocol packet, requesting at least one more credit from the server;
determining if enough credits are received;
if enough credits are not received, terminating the connection; and
if enough credits are received, sending the at least a second protocol packet.

16. A system configured to exchange data using server message block (SMB/SMB2) over remote direct memory access (RDMA), the system comprising:

a RDMA network interface card (RNIC) operable to transfer data by a RDMA message;
memory operable to store computer program instructions executable by a processor; and
the processor, in communication with the RNIC and the memory, operable to execute a kernel, the kernel comprising: a first protocol client operable to: communicate fragmented SMB data in two or more RDMA messages, wherein each RDMA message comprises a RemainingDataLength field, and wherein the RemainingDataLength field is equal to a number of bytes of a fragmented message that a receiver has yet to receive; and create a receive buffer to exchange credits, wherein the credits determine a number of RDMA messages that a sender may send to a receiver.

17. The system of claim 16, further comprising an RDMA Interface in communication with the first protocol client, the RDMA Interface operable to open an RDMA connection through the RNIC.

18. The system of claim 17, further comprising:

a second protocol client operable to: create an SMB2 read/write request to instruct a server to perform an RDMA direct data transfer; send the SMB2 read/write request to the first protocol client; and
wherein the first protocol client is further operable to: encapsulate the SMB2 read/write request in a first protocol Data Transfer packet; and send the first protocol Data Transfer packet to the server.

19. The system of claim 18, wherein the SMB2 read/write request includes fields that identify RDMA channel descriptor information, and wherein the first protocol is the SMB Direct protocol.

20. The system of claim 19, wherein the RDMA channel descriptor information includes steering information, and wherein the server determines the steering information and performs an RDMA write or an RDMA read of unencoded application data to or from memory addresses identified by the steering information.

Referenced Cited
U.S. Patent Documents
4399504 August 16, 1983 Obermarck
4780821 October 25, 1988 Crossley
4791566 December 13, 1988 Sudama
4825354 April 25, 1989 Agrawal
4887204 December 12, 1989 Johnson
4891785 January 2, 1990 Donohoo
4914570 April 3, 1990 Peacock
5008853 April 16, 1991 Bly
5109519 April 28, 1992 Zimmet
5113519 May 12, 1992 Johnson
5202971 April 13, 1993 Henson
5218696 June 8, 1993 Baird et al.
5261051 November 9, 1993 Masden et al.
5265261 November 23, 1993 Rubin et al.
5313646 May 17, 1994 Hendricks et al.
5349642 September 20, 1994 Kingdon
5375207 December 20, 1994 Blakely et al.
5410697 April 25, 1995 Baird
5437013 July 25, 1995 Rubin et al.
5452447 September 19, 1995 Nelson
5491752 February 13, 1996 Kaufman et al.
5493728 February 20, 1996 Solton
5513314 April 30, 1996 Kandasamy
5522042 May 28, 1996 Fee et al.
5535375 July 9, 1996 Eshel et al.
5560008 September 24, 1996 Johnson et al.
5588117 December 24, 1996 Karp et al.
5628005 May 6, 1997 Hurvig
5764887 June 9, 1998 Kells et al.
5826027 October 20, 1998 Pedersen et al.
5931913 August 3, 1999 Meriwether et al.
5933602 August 3, 1999 Grover
5978802 November 2, 1999 Hurvig
5987621 November 16, 1999 Duso et al.
6085247 July 4, 2000 Parsons et al.
6092199 July 18, 2000 Dutcher
6125122 September 26, 2000 Favichia et al.
6131125 October 10, 2000 Rostoker et al.
6208952 March 27, 2001 Goertzel
6219799 April 17, 2001 Kandasamy
6243862 June 5, 2001 Lebow
6247139 June 12, 2001 Walker et al.
6275953 August 14, 2001 Vahalia et al.
6317844 November 13, 2001 Kleiman
6324581 November 27, 2001 Xu et al.
6349250 February 19, 2002 Hart et al.
6349350 February 19, 2002 Hathorn et al.
6401123 June 4, 2002 Shields et al.
6430691 August 6, 2002 Di Santo et al.
6438691 August 20, 2002 Mao
6452903 September 17, 2002 Peck et al.
6453354 September 17, 2002 Jiang et al.
6640226 October 28, 2003 Shringeri et al.
6658476 December 2, 2003 Van
6829473 December 7, 2004 Raman et al.
6883015 April 19, 2005 Geen et al.
6910082 June 21, 2005 Marcotte
6928577 August 9, 2005 Moser et al.
7020651 March 28, 2006 Ripley
7103007 September 5, 2006 Natarajan et al.
7111035 September 19, 2006 McClellan et al.
7111060 September 19, 2006 Araujo et al.
7197535 March 27, 2007 Salesky et al.
7243132 July 10, 2007 Choi
7290141 October 30, 2007 Sengodan et al.
7293192 November 6, 2007 Allen et al.
7318102 January 8, 2008 Krause et al.
7330910 February 12, 2008 Young et al.
7339885 March 4, 2008 Ahrens et al.
7380080 May 27, 2008 Hughes
7380155 May 27, 2008 Fung et al.
7383463 June 3, 2008 Hayden et al.
7386889 June 10, 2008 Shay
7388866 June 17, 2008 Fan et al.
7409420 August 5, 2008 Pullara et al.
7421502 September 2, 2008 Czap et al.
7434087 October 7, 2008 Singh
7444536 October 28, 2008 Jairath
7451221 November 11, 2008 Basani et al.
7453879 November 18, 2008 Lo
7457722 November 25, 2008 Shain et al.
7509407 March 24, 2009 Miller et al.
7526658 April 28, 2009 He et al.
7526668 April 28, 2009 Shitomi et al.
7539722 May 26, 2009 Mohamed et al.
7562129 July 14, 2009 Lee et al.
7664991 February 16, 2010 Gunda et al.
7673066 March 2, 2010 Zheng
7702745 April 20, 2010 Lin et al.
8275815 September 25, 2012 Aronovich et al.
20020019874 February 14, 2002 Borr
20020024963 February 28, 2002 Reeve
20020073211 June 13, 2002 Lin et al.
20020083130 June 27, 2002 Shimada et al.
20020152315 October 17, 2002 Kagan et al.
20020161980 October 31, 2002 Nishikawa
20030018927 January 23, 2003 Gadir et al.
20030056069 March 20, 2003 Cabrera et al.
20030058277 March 27, 2003 Bowman-Amuah
20030093643 May 15, 2003 Britt, Jr.
20030093678 May 15, 2003 Bowe et al.
20030112754 June 19, 2003 Ramani et al.
20030115341 June 19, 2003 Sinha et al.
20030126195 July 3, 2003 Reynolds et al.
20030140129 July 24, 2003 Livnat et al.
20030169859 September 11, 2003 Strathmeyer et al.
20030182282 September 25, 2003 Ripley
20040003210 January 1, 2004 Duale et al.
20040003241 January 1, 2004 Sengodan et al.
20040018829 January 29, 2004 Raman et al.
20040019660 January 29, 2004 E. et al.
20040032876 February 19, 2004 Garg et al.
20040044930 March 4, 2004 Keller et al.
20040103342 May 27, 2004 Moser et al.
20040111527 June 10, 2004 Czap, Jr. et al.
20040136325 July 15, 2004 Dobric et al.
20040160909 August 19, 2004 Sheynblat
20040215794 October 28, 2004 Lauer
20040225952 November 11, 2004 Brown et al.
20040255202 December 16, 2004 Wong et al.
20040260748 December 23, 2004 Springer et al.
20040267932 December 30, 2004 Voellm et al.
20040268118 December 30, 2004 Bazan Bejarano
20050010670 January 13, 2005 Greschler et al.
20050015511 January 20, 2005 Izmailov et al.
20050015747 January 20, 2005 Zatloukal et al.
20050021832 January 27, 2005 Bennett et al.
20050038828 February 17, 2005 Kaluskar et al.
20050041686 February 24, 2005 Roy et al.
20050055345 March 10, 2005 Ripley
20050060442 March 17, 2005 Beverly et al.
20050091212 April 28, 2005 Mohamed et al.
20050102537 May 12, 2005 Zheng
20050111030 May 26, 2005 Berkema et al.
20050114670 May 26, 2005 Bowe et al.
20050125378 June 9, 2005 Kawada
20050129045 June 16, 2005 Machulsky et al.
20050131832 June 16, 2005 Fransdonk
20050132077 June 16, 2005 Biran et al.
20050138528 June 23, 2005 Ameigeiras et al.
20050149817 July 7, 2005 Biran et al.
20050177635 August 11, 2005 Schmidt et al.
20050182850 August 18, 2005 Kohno
20050198113 September 8, 2005 Mohamed et al.
20050198247 September 8, 2005 Perry et al.
20050198359 September 8, 2005 Basani et al.
20050198380 September 8, 2005 Panasyuk et al.
20050223014 October 6, 2005 Sharma et al.
20050228884 October 13, 2005 Hawley
20050246803 November 3, 2005 Spencer
20050248803 November 10, 2005 Ohara
20050251448 November 10, 2005 Gropper
20050257022 November 17, 2005 Hughes
20050258022 November 24, 2005 Horton et al.
20050262084 November 24, 2005 Tomita
20050262103 November 24, 2005 Stakutis et al.
20060031519 February 9, 2006 Helliwell et al.
20060036744 February 16, 2006 Andrus et al.
20060041698 February 23, 2006 Han et al.
20060045005 March 2, 2006 Blackmore et al.
20060047818 March 2, 2006 Kruglick et al.
20060059118 March 16, 2006 Byrd et al.
20060080443 April 13, 2006 Kruglick et al.
20060080568 April 13, 2006 Subbaraman et al.
20060085328 April 20, 2006 Cohen et al.
20060095382 May 4, 2006 Mahlbacher
20060130107 June 15, 2006 Gonder et al.
20060168262 July 27, 2006 Frazer
20060206705 September 14, 2006 Khosravi
20060271679 November 30, 2006 Mousseau et al.
20060271692 November 30, 2006 Kruse et al.
20060271697 November 30, 2006 Kruse et al.
20060281525 December 14, 2006 Borissov
20070150558 June 28, 2007 Teodosiu et al.
20070171793 July 26, 2007 Mesut et al.
20070192326 August 16, 2007 Angal et al.
20070220155 September 20, 2007 Nalla et al.
20080010485 January 10, 2008 Shitomi et al.
20080126704 May 29, 2008 Ulrich et al.
20080151885 June 26, 2008 Horn et al.
20080172397 July 17, 2008 Maeda et al.
20080301311 December 4, 2008 Bestler
20090077097 March 19, 2009 Lacapra et al.
20090138615 May 28, 2009 Cristallo et al.
20090172085 July 2, 2009 Arthursson
20090183005 July 16, 2009 Kawell et al.
20090222582 September 3, 2009 Josefsberg et al.
20090319661 December 24, 2009 Shiozawa et al.
20090327798 December 31, 2009 D'Amato et al.
20100042715 February 18, 2010 Tham et al.
20100161855 June 24, 2010 Mohamed et al.
20100185704 July 22, 2010 George et al.
20110040826 February 17, 2011 Chadzelek et al.
20120151249 June 14, 2012 Swan et al.
20130007518 January 3, 2013 George et al.
20130066941 March 14, 2013 Kruse et al.
20130067095 March 14, 2013 Kruse et al.
20130091199 April 11, 2013 Kruse et al.
20130097211 April 18, 2013 Kruse et al.
20130304932 November 14, 2013 Kruse et al.
20140372521 December 18, 2014 George et al.
20150026248 January 22, 2015 Kruse et al.
20150281404 October 1, 2015 Kruse et al.
20150365482 December 17, 2015 Kruse et al.
Foreign Patent Documents
1155119 July 1997 CN
1299484 June 2001 CN
1758608 April 2006 CN
1767472 May 2006 CN
1882933 December 2006 CN
1906593 January 2007 CN
101207635 June 2008 CN
101217483 July 2008 CN
101594320 December 2009 CN
101605194 December 2009 CN
101926155 December 2010 CN
1 259 045 November 2002 EP
1 259 045 November 2002 EP
1643406 April 2006 EP
1 727 056 November 2006 EP
2727287 May 2014 EP
60-019341 January 1985 JP
62-297927 December 1987 JP
63-061148 March 1988 JP
63-205747 August 1988 JP
64-061148 March 1989 JP
02-101847 April 1990 JP
03-048558 March 1991 JP
03-074745 March 1991 JP
04-172039 June 1992 JP
04-229746 August 1992 JP
05-089048 April 1993 JP
05-143488 June 1993 JP
06-075890 March 1994 JP
H10-133971 May 1998 JP
62-256165 October 1998 JP
10-313342 November 1998 JP
11-055314 February 1999 JP
2001-094613 April 2001 JP
2003-016766 January 2003 JP
2003-069610 March 2003 JP
2003-281091 March 2003 JP
2003-125022 April 2003 JP
2004-005427 January 2004 JP
2004-229143 August 2004 JP
2005-322016 November 2005 JP
2007-49755 February 2007 JP
3967758 June 2007 JP
10-1994-0001700 March 1994 KR
10-0860152 September 2008 KR
WO 03/096646 November 2003 WO
WO 03105439 December 2003 WO
Other references
  • Chinese 4th Office Action in Application 200510127998.2, mailed Nov. 16, 2011, 7 pgs.
  • Chinese 5th Office Action in Application 200510127998.2, mailed Mar. 2, 2012, 8 pgs.
  • Chinese Notice of Allowance in Application 2005101279978.2, mailed Dec. 5, 2011, 4 pgs.
  • European Exam Report in Application 05111885.9 mailed Sep. 13, 2007, 5 pgs.
  • European Invitation to Correct Defects in Application No. 08008916.2 mailed Sep. 4, 2008, 6 pgs.
  • European Notice of Allowance in Application 05111885.9 mailed Jun. 11, 2008, 6 pgs.
  • European Notice of Allowance in Application 080089162 mailed Jan. 24, 2011, 6 pgs.
  • European Notice to Grant in Application 05111885.9 mailed Oct. 9, 2008, 1 page.
  • Hong Kong Certificate of Grant in Application 07105689.8 mailed Jun. 26, 2009, 2 pgs.
  • Japanese Office Action in Application 200510127997.8 mailed Aug. 3, 2011, 8 pgs.
  • Japanese Office Notice of Rejection in Application 2006-307121 mailed Aug. 12, 2011, 5 pgs.
  • Japanese Office Notice of Allowance in Application 2006-307121 mailed Feb. 14, 2012, 6 pgs.
  • Maiworm, Daniel, “Symantec Enterprise Vault”, Retrieved at http://www.cstl.com/products/Symantec/Symantec- EnterpriseVault/File System Archiving.pdf, Feb. 5, 2007, pp. 35.
  • NTFS Design Goals and. Features, Retrieved at http://wininternals.uw.hu/ch12lev1sec4.html, Retrieved Date: Oct. 11, 2010, pp. 9.
  • Oehme, et al.,“IBM Scale out File Services: Reinventing network-attached storage”, Retrieved at http://www-935.ibm.com/services/us/its/pdf/sofs-am-journal-final-07010B.pdf, vol. 52 No. 4/5 Jul./Sep. 200B, 10 Pages.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Nov. 4, 2011, 17 pgs.
  • U.S. Appl. No. 11/182,251, Office Action mailed Jan. 30, 2012, 24 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response filed Dec. 1, 2011, 13 pgs.
  • U.S. Appl. No. 11/182,989, Office Action mailed Feb. 28, 2012, 20 pgs.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed May 30, 2012, 14 pgs.
  • U.S. Appl. No. 11/182,251, Notice of Allowance mailed Jul. 12, 2012, 8 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response filed May 29, 2012, 11 pgs.
  • U.S. Appl. No. 11/182,989, Notice of Allowance mailed Jun. 21, 2012, 5 pgs.
  • Aboba et al., Extensible Authentication Protocol (EAP) [online], See Fast Connect, RFC 3748, Jun. 2004, [Retrieved Mar. 3, 2007], Retrieved from: http://www.ietf.org/rfc/rfc3748.txt?number=3748.
  • Alvisi et al., “Low-Overhead Protocols for Fault-Tolerant File Sharing”; In Proceedings of the IEEE 18th International Conference on Distributed Computing Systems; 1998; 10 pgs.
  • Almeida, “FIFS: A Framework for Implementing User-Mode File Systems in Windows NT”, Proceedings of the 3rd USENIX Windows NT Symposium, Jul. 12-15, 1999, 19 pgs.
  • Ansi, Financial Institution Message Authentication (Wholesale), Financial Services Technical Publication, ANSI X9.9-1994, Aug. 15, 1986; 38 pgs.
  • Asokan et al., Server Supported Signatures, Journal of Computer Security, Fall 1997; 13 pgs.
  • Bell Labs, Plan 9 default Fossil File System [online], [Retrieved Sep. 17, 2007], Retrieved from: http://plan9.bell-labs.com/magic/man2html/4/fossil; 4 pgs.
  • Bensaou et al., Credit-Based Fair Queueing (CBFQ): A Simple Service-Scheduling Algorithm for Packet-Switched Networks, IEEE/ACM Transactions on Networking, vol. 9, No. 5, Oct. 2001.
  • CIFS Oplock File Locking, MSDN, [Retrieved Jan. 7, 2008], Retrieved from: http://msdn2.microsoft.com/en-us/library/aa302210.aspx; 3 pgs.
  • CIFS http://msdn.microsoft.com/library/default.asp?url= /library/en-us/cifs/protocol/cifs.asp, 2 pgs.
  • CIFS or Public SMB Information on Common Internet File System http://support.microsoft.com/default.aspx?scid=kb;en-us;199072; 2 pgs.
  • Dehaese, G., The ISO 9660 File System [online], May 1995, [Retrieved Sep. 14, 2007], Retrieved from: http://users.pandora.be/it3.consultants.bvba/handouts/ISO9960.html.
  • Digital Equipment Corporation, RSX-11M/M-Plus RMS-11 User's Guide [online], Order No. AA-L669A-TC, Apr. 1983, [Retrieved Aug. 17, 2007], Retrieved from: http://www.computer.museum.uq.edu.au/RSX-11%20Manuals.html; 186 pgs.
  • Digital Equipment Corporation, Introduction to RSX-11M [online, Order No. AA-L763B-TC, RSX-11M Version 4.2, First Printing Sep. 1979, Revised Nov. 1981 and Jul. 1985, [Retrieved Aug. 9, 2007], Retrieved from: http://www.computer.museum.uq.edu.au/RSX-11%20Manuals.html; 65 pgs.
  • ECMA, Volume and File Structure for Write-Once and Rewritable Media using Non-Sequential Recording for Information Interchange 3rd Edition [online], ECMA-167, Jun. 1997, [Retrieved Aug. 9, 2007, Retrieved from: http://www.ecma-international.org/publications/files/ECMA-ST/ECMA-167.pdf; 150 pgs.
  • Gifford et al., The Cedar File System, Communications of the ACM, vol. 31, No. 3, Mar. 1998; 11 pgs.
  • Greenberg et al., “NFILE—A File Access Protocol”; Network Working Group; RFC 1037; Dec. 1997; 43 pgs.
  • Gu et al., “SABUL: A High Performance Data Transfer Protocol”; IEEE Communications Letters; 2001; 11 pgs.
  • Hartman; “The Zebra Striped Network File System”; Doctoral dissertation at the University of California at Berkeley; 1991; 159 pgs.
  • Hertel, Implementing CIFS The Common Internet File System [online], [Retrieved Jul. 13, 2007], Retrieved from: http://ubiqx.org/cifs/; 3 pgs.
  • Hiltunen et al., “Implementing Integrated Fine-Grain Customizable QoS Using Cactus”; The 29th Annual International Symposium on Fault-Tolerant Computing (Fast Abstract); Madison, WI; 1999, 2 pgs.
  • Hitz et al., File System Design for an NFS File Server Appliance [online], Network Appliance, TR 3002, 1994, [Retrieved Aug. 9, 2007], Retrieved from: http://www.netapp.com/library/tr/3002.pdf; 13 pgs.
  • Hobbit, CIFS: Common Insecurities Fail Scrutiny [online], Avian Research, Jan. 1997, Retrieved from: http://web.textfiles.com/hacking/cifs.txt; 39 pgs.
  • IBM, IBM Personal Computer Seminar Proceedings, vol. 2, No. 5, Sep. 1984; 13 pgs.
  • International Organization for Standardization, Banking—Approved algorithms for message authentication—, ISO 8731-1, Jun. 1, 1987; 4 pgs.
  • Internet Assigned Numbers Authority, Well-Known Port Numbers, http://www.jana.org/assignments/port-numbers, 233 pgs.
  • Kent et al., Security Architecture for the Internet Protocol [online], RFC 2401, Nov. 1998, [Retrieved Jun. 6, 2007], Retrieved from: http://www.ietf.rg/rfc/rfc2401.txt? number=2401, 62 pgs.
  • Kent et al., IP Authentication Header [online], RFC 2402, Nov. 1998, [Retrieved Aug. 9, 2007], Retrieved from: http://tools.ieft.org/html/rfc2402, 20 pgs.
  • Klima, “Tunnels in Hash Functions: MD5 Collisions Within a Minute”, Version 1, Mar. 2006, Version 2 Apr. 2006, Cryptology ePrint Archive, 17 pgs.
  • Krawczyk, “HMAC: Keyed-Hashing for Message Authentication”, RFC-2104, Feb. 1997, http://www.jetf.org/rfc/rfc2104.txt, 10 pgs.
  • Leach et al., CIFS Logon and Pass Through Authentication Preliminary Draft [online], Jan. 3, 1997, 22 pgs.
  • Leach et al., CIFS/E Browser Protocol Preliminary Draft [online], Jan. 10, 1997, 33 pgs.
  • Leach et al., draft-leach-cifs-print-spec-00.txt, CFIS Printing Specification Preliminary Draft [online], Jan. 31, 1997; 30 pgs.
  • Leach et al., draft-leach-cifs-rap-spec-00.txt, CFIS Remote Administration Protocol Preliminary Draft [online], Feb. 26, 1997; 39 pgs.
  • Leach, P., Naik, D., A Common Internet File System (CIFS/1.0) Protocol Preliminary Draft [online], Dec. 19, 1997. Retrieved from: http://tools.ietf.org/html/draft-leach-cifs-v1-spec-01; 132 pgs.
  • Leach, P. et a., “A Common Internet File System (CIFS/1.0) Protocol Preliminary Draft”, draft-leach-cifs-v1-spec-02, Mar. 13, 1997, http://microsoft.com/mscorp/ip/protocols/BSTD/CIFS; 160 pgs.
  • LeGrow, “Maintenance—MSRPC Update (Version 11) and SMB Update (Version 3)”; cfr-users mailing list; http://list.nfr.com/mailman/listingfo/nfr-users; May 20, 2004; 2 pgs.
  • Linn, “Generic Security Service Application Program Interface, Version 2, Update 1”, RFC 2743, Jan. 2000, http://www.ieft.org/rfc/rfc2743.txt, 90 pgs.
  • Morris, “Andrew: A Distributed Personal Computing Environment”, Communications of the ACM, vol. 29, No. 3, Mar. 1986, New York, 20 pgs.
  • Mullender, “A Distributed File Service Based on Optimistic Concurrency Control”, Proceedings of the Tenth ACM Symposium on Operating Systems Principles, Dec. 1-4, 1985, Orcas Island, WA, 14 pgs.
  • Murphy, Origins and Development of TOPS-20 [online], © 1989, 1996, [Retrieved Aug. 9, 2007], Retrieved from: http://www.opost.com/dlm/tenex/hbook.html; 28 pgs.
  • National Bureau of Standards, Computer Data Authentication, Federal Information Processing Standards Publication 113, May 30, 1985; 9 pgs.
  • Pawlowski, Brian et al. “The NFS Version 4 Protocol” (Publication date not available), 20 pgs.
  • Platform SDK: File Systems: Microsoft SMB Protocol and CIFS Protocol Overview http://msdn.microsoft.com/ library/default.asp?url=/library/ en-us/fileio/fs/ microsoftsmbprotocolandcifsprotocoloverview.asp; 1 pg.
  • Pranevich, “The Wonderful World of Linux 2.6”; 2005; 17 pgs.
  • Rivest, “The MD5 Message-Digest-Algorithm”, RFC 1321, Apr. 1992, http://www.jetf.org/rfc/rfc2104.txt, 19 pgs.
  • Rubin, F., Message Authentication Using Quadratic Residues [online], Jan. 31, 1995, [Retrieved Sep. 14, 2007], Retrieved from: http://www.mastersoftware.biz/crypt002.htm; 6 pgs.
  • Samba Team, The Samba Team are pleased to announce Samba1.9.18 [online], Jan. 7, 1998, [Retrieved Jan. 4, 2008], Retrieved from: http://de.samba.org/samba/history/samba1.9.18.html; 4 pgs.
  • Satran et al. Internet Small Computer Systems Interface (iSCSI) [online], RFC 3720, Apr. 2004, [Retrieved Mar. 3, 2007], Retrieved from: http://www.ietf.org/rfc/rfc3720.txt?number=3720; 67 pgs.
  • Satyanaryanan et al, “Scalable, Secure and Highly Available Distributed File Access”, May 1990, 12 pgs.
  • Shepler, S., NFS Version 4 Design Considerations [online], RFC 2624, Jun. 1999, [Retrieved Jan. 4, 2008], Retrieved from: http://tools.ietf.org/html/rfc2624; 22 pgs.
  • Shepler, S. et al., “Network File System (NFS) Version 4 Protocol”, Network Working Group, Request for Comments: 3530, Apr. 2003, 275 pgs.
  • Shepler, S. et al., “NFS Version 4 Protocol”, RFC 3010, Proceedings on the 2nd International System Administration and Networking Conference (SANE2000), Dec. 2000, 212 pgs.
  • Schneier, B., Applied Cryptography Protocols, Algorithm and Source Code in C Second Edition, John Wiley & Sons, Inc., © 1996; 788 pgs.
  • SMB: The Server Message Block Protocol [online], 1999, Retrieved from: http://ubiqx.org/cifs/SMB.html; 108 pgs.
  • SNIA, Common Internet File System (CIFS) Technical Reference [online], Mar. 1, 2002, Retrieved from: http://www.snia.org/techactivities/CIFS/CIFS-TR-1p00FINAL.pdf; 150 pgs.
  • Soules et al., Metadata Efficiency in a Comprehensive Versioning File System, May 2002; 33 pgs.
  • Srinivasan et al., Spritely NFS: Implementation and Performance of Cache-Consistency Protocols [online], May 1989, [Retrieved Jan. 4, 2008], Retrieved from: http://www.hpl.hp.com/techreports/Compaq-DEC/WRL-89-5.pdf; 35 pgs.
  • Talpey et al., “NFSv4 Session Extensions, draft-ietf-nfsv4-sess-01”; Internet Draft; The Internet Society; Feb. 2005; 70 pgs.
  • Tichy, W., RCS—A System for Version Control [online], Jan. 3, 1991, [Retrieved Aug. 6, 2007], Retrieved from: http://www.cs.purdue.edu/homes/trinkle/RCS/rcs.ps; 20 pgs.
  • The Java CIFS Client Library [online], [Retrieved Jul. 13, 2007], Retrieved from: http://jcifs.samba.org/, 8 pgs.
  • The Open Group; Technical Standard; Protocols for X/Open PC Interworking: SMB, Version 2; http://www.opengroup.org/onlinepubs/9697999099/toc.pdf; retrieved on Sep. 1, 1992; 534 pgs.
  • TOPS-20 [online], Wikipedia, [Retrieved Mar. 4, 2007], Retrieved from: http://en.wikipedia.org/wiki/TOPS-20; 4 pgs.
  • Tridgell, “Inside Microsoft Networking”, Jun. 25, 1998, 6 pgs.
  • Vanwasi, “Unleashing the power of P2P”; Network Magazine India; Apr. 2002; 5 pgs.
  • Zhu, “The Simple and Protected Generic Security Service Application Program Interface (GSS-API) Negotiation Mechanism”, RFC-4178, Oct. 2005, http://www.ietf.org/rfc/rfc4178.txt, 20 pgs.
  • European Search Report dated Jan. 20, 2006 in Application No. RS113279/US18298905.
  • European Search Report dated Jan. 20, 2006 in Application No. RS113280/US18225105.
  • European Search Report dated Feb. 15, 2006 in Application No. 05111729.9.
  • European Search Report dated Feb. 22, 2006 in Application No. 05111729.9.
  • Japanese Office Notice of Rejection mailed Mar. 10, 2006 in Application No. 2005-356146.
  • Japanese Office Notice of Rejection mailed May 12, 2006 in Application No. 2005-356145.
  • European Search Report dated Sep. 19, 2006 in Application No. 055111885.9.
  • Japanese Office Notice of Rejection mailed Nov. 10, 2006 in Application No. 2005-356146.
  • Korean Notice of Rejection mailed Nov. 17, 2006 in Application No. 10-2005-0124340.
  • Japanese Office Notice of Rejection mailed Apr. 3, 2007 in Application No. 2005-356145.
  • Japanese Office Notice of Rejection mailed Jan. 15, 2008 in Application No. 2005-356145.
  • European Search Report dated Jun. 18, 2008 in Application No. 08008916.2.
  • Chinese Office Action dated Apr. 29, 2010 in Application No. 200510127997.8.
  • Chinese Office Action dated Apr. 29, 2010 in Application No. 200510127998.2.
  • European Search Report dated Jan. 4, 2011 in Application No. 10012923-8.
  • Korean Notice of Preliminary Rejection mailed Jan. 21, 2011, Application No. 10-2007-80691.
  • European Search Report dated Feb. 1, 2011 in Application No. 10013021.0.
  • Chinese Second Office Action dated Mar. 3, 2011 in Application No. 200510127998.2.
  • Chinese Second Office Action dated Mar. 30, 2011 in Application No. 200510127997.8.
  • U.S. Official Action dated Mar. 13, 2009 cited in U.S. Appl. No. 11/182,251.
  • U.S. Official Action dated May 14, 2009 cited in U.S. Appl. No. 11/182,989.
  • U.S. Official Action dated Jan. 25, 2010 cited in U.S. Appl. No. 11/182,251.
  • U.S. Official Action dated Feb. 2, 2010 cited in U.S. Appl. No. 11/182,989.
  • U.S. Official Action dated Jul. 23, 2010 cited in U.S. Appl. No. 11/182,989.
  • U.S. Official Action dated Jul. 30, 2010 cited in U.S. Appl. No. 11/182,251.
  • Chinese Third Office Action dated Jul. 7, 2011 in Application No. 200510127998.2.
  • PCT International Search Report and Written Opinion in International Application PCT/US2011/063618, mailed Jun. 28, 2012, 9 pgs.
  • Chinese Notice of Allowance in Application 2005101279982, mailed Aug. 20, 2012, 4 pgs.
  • French, Steven M., “A New Network File System is Born: Comparison of SMB2, CIFS, and NFS”, retrieved Mar. 23, 2011, 14 pgs.
  • John H. Samba Team Terpstra, “Chapter 32. High Availability Part III. Advanced Configuration”, retrieved Mar. 22, 2011, 6 pages.
  • Loafman, Zach, “SMB1/SMB2; A BSD Perspective”, retrieved Mar. 22, 2011, 35 pgs.
  • MS-SMB2-Preview: Server Message Block (SMB) Version 2 Protocol Specification, copyright 2010 Microsoft Corporation, 309 pgs.
  • Periasamy, Anand Babu, “Next-Gen Linux File Systems: Change Is the New Constant”, retrieved Mar. 23, 2011, 4 pages.
  • U.S. Appl. No. 11/182,989, Office Action mailed Sep. 5, 2008, 6 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response mailed Nov. 26, 2008, 12 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response mailed Aug. 14, 2009, 11 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response mailed Oct. 19, 2009, 6 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response mailed May 3, 2010, 14 pgs.
  • U.S. Appl. No. 11/182,989, Amendment and Response mailed Oct. 29, 2010, 14 pgs.
  • U.S. Appl. No. 11/182,989, Office Action mailed Sep. 1, 2011, 19 pgs.
  • U.S. Appl. No. 11/182,251, Office Action mailed Aug. 22, 2008, 7 pgs.
  • U.S. Appl. No. 11/182,251, Office Action mailed Nov. 6, 2008, 4 pgs.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Nov. 26, 2008, 12 pgs.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Jun. 15, 2009, 14 pgs.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Sep. 14, 2009, 7 pgs.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Apr. 26, 2010, 17 pgs.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Nov. 30, 2010, 17 pgs.
  • U.S. Appl. No. 11/182,251, Office Action mailed Aug. 4, 2011, 23 pgs.
  • Microsoft Computer Dictionary, Microsoft Press, Fifth Edition, 2002, p. 486.
  • U.S. Appl. No. 11/182,251, Amendment and Response filed Oct. 11, 2012, 2 pgs.
  • U.S. Appl. No. 12/964,749, Office Action mailed Jan. 29, 2013, 17 pgs.
  • U.S. Appl. No. 13/663,827, Office Action mailed Jan. 29, 2013, 16 pgs.
  • U.S. Appl. No. 13/664,012, Office Action mailed Jan. 25, 2013, 19 pgs.
  • Chinese 1st Office Action in Application 201110329007.4, mailed Oct. 10, 2013, 13 pgs.
  • U.S. Appl. No. 13/228,732, Office Action mailed Jul. 8, 2013, 16 pgs.
  • U.S. Appl. No. 13/228,818, Office Action mailed Jul. 18, 2013, 18 pgs.
  • U.S. Appl. No. 12/964,749, Office Action mailed May 23, 2013, 13 pgs.
  • U.S. Appl. No. 13/663,827, Amendment and Response filed Aug. 6, 2013, 7 pgs.
  • U.S. Appl. No. 13/663,827, Advisory Action mailed Aug. 22, 2013, 3 pgs.
  • U.S. Appl. No. 13/663,827, Amendment and Response filed Sep. 9, 2013, 8 pgs.
  • U.S. Appl. No. 13/663,827, Notice of Allowance mailed Sep. 26, 2013, 11 pgs.
  • U.S. Appl. No. 13/664,012, Amendment and Response filed Jul. 30, 2013, 12 pgs.
  • U.S. Appl. No. 13/664,012, Notice of Allowance mailed Sep. 12, 2013, 14 pgs.
  • U.S. Appl. No. 12/964,749, Amendment and Response filed Aug. 23, 2013, 8 pgs.
  • U.S. Appl. No. 12/964,749, Notice of Allowance mailed Sep. 5, 2013, 11 pgs.
  • Indian First Exam Report in Application 3305/DE/L2005, mailed Mar. 28, 2013, 2 pgs.
  • PCT International Search Report and Written Opinion in International Application PCT/US2012/041049, mailed Jan. 17, 2013, 12 pgs.
  • PCT International Search Report and Written Opinion in International Application PCT/US2012/054038, mailed Feb. 20, 2013, 10 pgs.
  • U.S. Appl. No. 12/964,749, Amendment and Response filed Apr. 29, 2013, 9 pgs.
  • U.S. Appl. No. 13/663,827, Amendment and Response filed Apr. 29, 2013, 10 pgs.
  • U.S. Appl. No. 13/664,012, Amendment and Response filed Apr. 24, 2013, 10 pgs.
  • U.S. Appl. No. 13/664,012, Office Action mailed Apr. 30, 2013, 16 pgs.
  • U.S. Appl. No. 13/663,827, Office Action mailed May 7, 2013, 16 pgs.
  • Chinese Notice of Entering into Substantive Examination in Application 201210331041.X, mailed Mar. 6, 2013, 3 pgs.
  • PCT International Search Report and Written Opinion in International Application PCT/US2012/041703, mailed Feb. 14, 2013, 13 pgs.
  • PCT International Search Report and Written Opinion in International Application PCT/US2012/054039, mailed Feb. 27, 2013, 11 pgs.
  • U.S. Appl. No. 13/228,818, Amendment and Response filed Oct. 18, 2013, 24 pgs.
  • European Supplementary Search Report in Application 11846483.3, mailed Dec. 16, 2014, 10 pgs.
  • European Communication in Application 11846483.3, mailed Jan. 12, 2015, 1 page.
  • Chinese 3rd Office Action in Application 201110329007.4, mailed Dec. 29, 2014, 6 pgs.
  • U.S. Appl. No. 14/473,488, Notice of Allowance mailed Jan. 26, 2015, 15 pgs.
  • European Extended Search Report in Application 12829430.3, mailed Feb. 12, 2015, 6 pgs.
  • U.S. Appl. No. 13/174,271, Amendment and Response filed Apr. 10, 2014, 10 pgs.
  • U.S. Appl. No. 13/663,827, Notice of Allowance mailed Apr. 25, 2014, 10 pgs.
  • Chinese 1st Office Action in Application 201110462797.3, mailed Apr. 16, 2014, 11 pgs.
  • U.S. Appl. No. 13/664,012, Notice of Allowance mailed May 20, 2014, 13 pgs.
  • U.S. Appl. No. 13/174,271, Notice of Allowance mailed May 29, 2014, 10 pgs.
  • U.S. Appl. No. 13/228,818, Amendment and Response filed Jun. 16, 2014, 15 pgs.
  • Chinese 2nd Office Action in Application 201110329007.4, mailed Jun. 12, 2014, 8 pgs.
  • U.S. Appl. No. 13/228,732, Amendment after Allowance filed Jun. 4, 2014, 7 pgs.
  • U.S. Appl. No. 13/228,732, USPTO response to Amendment after Allowance mailed Jun. 27, 2014, 2 pgs.
  • U.S. Appl. No. 13/174,271, Notice of Allowance mailed Aug. 8, 2014, 2 pgs.
  • U.S. Appl. No. 13/228,818, Office Action mailed Aug. 15, 2014, 17 pgs.
  • New Zealand Notice of Acceptance in Application 622122, mailed Jan. 15, 2015, 1 page.
  • U.S. Appl. No. 14/473,488, Notice of Allowance mailed Feb. 12, 2015, 5 pgs.
  • U.S. Appl. No. 13/228,732, Amendment and Response filed Nov. 8, 2013, 15 pgs.
  • U.S. Appl. No. 13/228,818, Amendment and Response filed Nov. 25, 2013, 28 pgs.
  • U.S. Appl. No. 12/964,749, Notice of Allowance mailed Nov. 15, 2013, 2 pgs.
  • U.S. Appl. No. 13/663,827, Notice of Allowance mailed Jan. 13, 2014, 10 pgs.
  • U.S. Appl. No. 13/664,012, Notice of Allowance mailed Dec. 24, 2013, 11 pgs.
  • U.S. Appl. No. 13/174,271, Office Action mailed Jan. 10, 2014, 17 pgs.
  • U.S. Appl. No. 13/228,732, Notice of Allowance mailed Mar. 4, 2014, 17 pgs.
  • U.S. Appl. No. 13/664,012, Amendment and Response filed Mar. 19, 2014, 7 pgs.
  • U.S. Appl. No. 13/228,818, Office Action mailed Mar. 14, 2014, 18 pgs.
  • European Intention to Grant in Application 10013021.0, mailed Jan. 28, 2014, 7 pgs.
  • Chinese 2nd Office Action in Application 201110462797.3, mailed Sep. 24, 2014, 6 pgs.
  • Chinese 1st Office Action in Application 201210331041.X, mailed Sep. 3, 2014, 14 pgs.
  • New Zealand Examination Report in Application 622122, mailed Aug. 28, 2014, 2 pgs.
  • U.S. Appl. No. 13/174,271, Supplemental Notice of Allowance mailed Sep. 8, 2014, 2 pgs.
  • Chinese 1st Office Action in Application 201210330725.8, mailed Oct. 10, 2014, 11 pgs.
  • Chinese 1st Office Action in Application 201210434813.2, mailed Nov. 3, 2014, 10 pgs.
  • U.S. Appl. No. 13/228,818, Amendment and Response filed Nov. 17, 2014, 14 pgs.
  • U.S. Appl. No. 14/473,488, Notice of Allowance mailed Apr. 16, 2015, 2 pgs.
  • U.S. Appl. No. 13/228,818, Office Action mailed Mar. 25, 2015, 18 pgs.
  • Chinese 2nd Office Action in Application 201210331041.X, mailed Mar. 9, 2015, 10 pgs.
  • European Communication and partial Search Report in Application 12830847.5, mailed Mar. 18, 2015, 6 pgs.
  • European Communication in Application 12829430.3, mailed Mar. 3, 2015, 1 page.
  • Chinese Notice of Allowance in Application 201110462797.3, mailed Mar. 3, 2015, 4 pgs.
  • European Search Report in Application 12805065.5, mailed Jun. 1, 2015, 9 pgs.
  • Sapuntzakis, C. et al., Internet Draft: “TCP RDMA option”; Available at: draft-csapuntz-tcprdma-00.txt, Feb. 1, 2000, 20 pgs.
  • Defense Advanced Research Projects Agency, by Information Sciences Institute Univ. of So. CA., “Internet Protocol—DARPA Internet Program Protocol Specification”; Available at: https://www.rfc-editor.org/rfc/rfc791.txt, Sep. 1, 1981, 52 pgs.
  • Recio, R. et al., “A Remote Direct Memory Access Protocol Specification”, Network Working Group Request for Comments: 5040, Available at: https://tools.ietf.org/html/rfc5040, Oct. 1, 2007, 66 pgs.
  • European Partial Search Report in Application 12804233.0, mailed May 7, 2015, 7 pgs.
  • South African Notice of Allowance in Application 2014/01381, mailed May 5, 2015, 1 page.
  • European extended Search Report in Application 12804233.0, mailed Jul. 15, 2015, 18 pgs.
  • European extended Search Report in Application 12830847.5, mailed Jul. 7, 2015, 11 pgs.
  • European Communication in Application 12830847.5, mailed Jul. 24, 2015, 1 page.
  • European Communication in Application 12805065.5, mailed Jun. 18, 2015, 1 page.
  • Chinese Notice of Allowance in Application 201110329007.4, mailed Jul. 1, 2015, 6 pgs.
  • Chinese 2nd Office Action in Application 201210434813.2, mailed Jun. 30, 2015, 7 pgs.
  • Chinese 2nd Office Action in Application 201210330725.8, mailed Jul. 1, 2015, 7 pgs.
  • Chinese 1st Office Action and Search Report Issued in Chinese Patent Application No. 201280032558.7, Mailed Date: Aug. 5, 2015, 19 Pages.
  • European Communication in Application 12804233.0, mailed Jul. 31, 2015, 1 page.
  • Chinese 3rd Office Action in Application 201210331041.X, mailed Aug. 18, 2015, 7 pgs.
  • U.S. Appl. No. 14/720,083, Office Action mailed Jul. 1, 2015, 28 pgs.
  • U.S. Appl. No. 14/475,081, Office Action mailed Aug. 18, 2015, 17 pgs.
  • U.S. Appl. No. 14/475,081, Amendment and Response filed Nov. 18, 2015, 14 pgs.
  • Chinese Office Action in Application 201280032570.8, mailed Dec. 11, 2015, 15 pgs.
  • Chinese Notice of Allowance in Application 201210434813.2, mailed Oct. 10, 2015, 4 pgs.
  • U.S. Appl. No. 13/946,550, Office Action mailed Sep. 23, 2015, 24 pages.
  • European Notice of Allowance in Application 11846483.3, mailed Aug. 11, 2015, 7 pages.
  • European Office Action in Application 12829430.3, mailed Oct. 6, 2015, 4 pages.
  • Indian Office Action in Patent Application No. 3306/DEL/2005, mailed Aug. 10, 2015, 2 pages.
  • U.S. Appl. No. 13/946,550, Amendment and Response filed Dec. 23, 2015, 10 pgs.
  • U.S. Appl. No. 14/720,083, Notice of Allowance mailed Dec. 22, 2015, 8 pgs.
  • U.S. Appl. No. 14/720,083, Notice of Allowance mailed Jan. 11, 2016, 2 pgs.
  • U.S. Appl. No. 14/475,081, Office Action mailed Jan. 21, 2016, 18 pgs.
  • Australian Office Action in Application 2012275906, mailed Jan. 11, 2016, 3 pgs.
  • Israeli Communication in Application 231373, mailed Dec. 28, 2015, 3 pgs.
  • Singapore Notice of Allowance and Supplemental Examination Report in Application 11201400442S, mailed Jan. 26, 2016, 4 pgs.
Patent History
Patent number: 9331955
Type: Grant
Filed: Jun 29, 2011
Date of Patent: May 3, 2016
Patent Publication Number: 20130007180
Assignee: Microsoft Technology Licensing, LLC (Redmond, WA)
Inventors: Thomas M. Talpey (Stow, MA), Gregory Robert Kramer (Bothell, WA), David Matthew Kruse (Kirkland, WA)
Primary Examiner: Viet Vu
Assistant Examiner: Herman Belcher
Application Number: 13/172,757
Classifications
Current U.S. Class: Queuing Arrangement (370/412)
International Classification: G06F 15/16 (20060101); H04L 12/925 (20130101); H04L 1/18 (20060101); H04L 29/08 (20060101); G06F 17/30 (20060101); H04L 12/861 (20130101); H04L 12/801 (20130101);