Multi-rate shared memory architecture for frame storage and switching
Shared memory provides buffering and switching for all frames that flow through a fiber channel switch. Received frames are written to shared memory by the receiving port then read from shared memory by the transmitting port. Shared memory provides for data to be written to a buffer at one rate, and read from a buffer at a different rate, or vice versa.
The present invention is a continuation of and claims the benefit of and priority from the prior-filed U.S. Nonprovisional patent application Ser. No. 09/932,223; filed Aug. 17, 2001, entitled “Multi-rate Shared Memory Architecture for Frame Storage and Switching,” the subject matter of which hereby being specifically incorporated here by reference for all that it discloses and teaches.
BACKGROUND OF THE INVENTIONThe present invention relates, in general, to the field of fibre channel switching technology. More particularly, the present invention relates to a route caching scheme for a receive port in a fibre channel switch.
Fibre Channel is a high performance, serial interconnect standard designed for bi-directional, point-to-point communications between servers, storage systems, workstations, switches, and hubs. It offers a variety of benefits over other link-level protocols, including efficiency and high performance, scalability, simplicity, ease of use and installation, and support for popular high level protocols.
Fibre channel employs a topology known as a “fabric” to establish connections between ports. A fabric is a network of switches for interconnecting a plurality of devices without restriction as to the manner in which the switch can be arranged. A fabric can include a mixture of point-to-point and arbitrated loop topologies.
In Fibre Channel, a channel is established between two nodes where the channel's primary task is to transport data from one point to another at high speed with low latency. The Fibre channel switch provides flexible circuit/packet switched topology by establishing multiple simultaneous point-to-point connections. Because these connections are managed by the switches or “fabric elements” rather than the connected end devices or “nodes”, fabric traffic management is greatly simplified from the perspective of the device.
In a fibre channel switching environment, a shared memory within the switching element is used to store incoming frames bound for a particular exit port. A destination ID (D_ID) value in the frame header identifies the exit port associated with the incoming frame. For multiple data rates, rate matching FIFO's are used to store the frames before transmission out the receiving port.
In prior approaches, shared memory and storage structures required a large rate matching FIFO to support multiple data rates. For example, to buffer a frame that is incoming at twice the rate of the outgoing port, the FIFOs are at least one half the size of the maximum length frame. Since such FIFO's must necessarily be large to accommodate the number of ports that are coupled to shared memory, this approach requires a significant sized FIFO.
Existing architectures for buffer are also inefficient. For example, memory locations may be allocated to a particular frame, but never actually used. Buffers must handle both rate adaptive cases and same-speed cases, and if they use the same read/write method for both cases, memory is wasted. The current invention provides a different read/write methodology depending on the case.
SUMMARY OF THE INVENTIONA shared memory design in accordance with the present invention provides a solution to the aforementioned problem by providing a shared location for storing data frames prior to transmission to a receiving port in an efficient manner. The present invention can be implemented without requiring significant design changes and with only a relatively straightforward alteration to existing processes for networking in a fibre channel switching environment.
Particularly disclosed herein is a method for routing a data frame through a fibre channel fabric having a switch with a plurality of ports. The ports are operative for transmitting and receiving the data frame. A first port of the switch receives the data frame. The data frame is then written to a shared memory location. The shared memory location is coupled to at least two of the plurality of ports. A second port of the switch is identified. The second port is operative for transmitting the data frame from the shared memory location. A message is transmitted from the first port to the second port indicating the position in the shared memory location of the data frame. Finally, the data frame is read from the shared memory to the second port, wherein the switch has more than one rate for reading and writing data.
In another aspect, the present invention provides a shared memory location within a fibre channel fabric having a plurality of ports. The shared memory is coupled to the plurality of ports. The shared memory has a plurality of memory locations, each memory location having a column and also having a plurality of rows for storing a data frame. The shared memory also has a plurality of first-in, first out data storage locations, wherein a word is written from the memory location to the first-in first-out storage location before transmission to a port.
BRIEF DESCRIPTION OF THE DRAWINGSThe aforementioned and other features and objects of the present invention and the manner of attaining them will become more apparent and the invention itself will be best understood by reference to the following description of a preferred embodiment taken in conjunction with the accompanying drawings, wherein:
The present invention is illustrated and described in terms of a fibre channel fabric and particular fibre channel fabric elements (e.g., switches), however, the present invention is readily adapted to other protocols and environments. Particular examples herein specify data transmission rates and specific values and types of components to ease illustration and understanding. However, these specific examples are not limitations of the present invention unless explicitly indicted to the contrary.
The fibre channel fabric associated with switch 100 is the method for connecting the various node ports (N-Ports) of the devices together. In this way, the fabric is capable of routing fibre channel frames using only the destination identification information in the fibre channel frame header. The destination identification information identifies which N-Port receives the frame.
Fibre channel switch 100 has a plurality of ports for receiving and transferring data through the switch. In
Continuing, fibre channel switch 100 has an embedded port 145 in addition to illustrated GL_Ports 130, 135 and 140. Within the context of the invention, embedded port 145 may be used for several functions. First, it may provide a system services processor an access point for all of the fibre channel well-known addresses for both the reception and transmission of frames. Secondly, it may handle any fibre channel frame that cannot be delivered to a destination for either busy, reject, or timeout conditions. It may also be responsible for the generation, modification and/or interpretation of all Fibre Channel-Arbitrated Loop (FC-AL) initialization frames (such as LIFA, LIPA, LIHA, and LISA frames) for the GL-Ports operating in fabric loop mode. The embedded port interface is slightly different than that of an actual GL_Port module. Since the Fibre Channel-0/1 layers are not required, embedded port 145 does not implement the low-level interface for either primitive signaling or sequences. After the system services processor has completed initialization of embedded port 145, it enters and remains in the fibre channel active state.
Embedded port 145 provides a register set accessible to the system services processor for basic initialization and low-level control. Once enabled, embedded port 145 is responsible for all functions related to the transmission and reception of frames to and from shared memory 120. For data path consistency within the following description, the direction of fibre channel frame flow is referenced to shared memory 120. Thus, a transmit (TX) path actually contains paths destined to, or received by, the embedded port from shared memory 120 and a receive (RX) path contains frames generated or transmitted by embedded port 145.
Embedded port 145 creates and consumes buffers that contain complete fibre channel frames. Frames may be held in SRAM coupled to embedded port 145. SRAM will typically hold two frames, one TX frame received from shared memory 120 and one RX frame waiting to be moved into shared memory 120. All other TX frames waiting to be read by embedded port 145 and RX frames previously created by embedded port 145 are stored in shared memory buffers. In one embodiment, embedded port 145 may be allocated up to 12 shared memory buffers for storage of RX frames. Typically, TX frames utilize the shared memory buffers allocated to the GL-Ports that receive the fibre channel frames.
For example, for fast turn-around of Arbitrated Loop address initialization frames, a TX frame may be modified in place in the embedded port SRAM by software and sent via an RX path without the need to move the frame. One of the shared memory buffers allocated to an embedded port may be designated as a protected buffer. The protected buffer can be filled with an RX frame that is transmitted frequently and left intact so that the frame can be sent to an exit port without the time delay of moving the frame from the extended port SRAM to the shared memory buffer.
An embedded port receiver (RX) is used to transfer frames from a system services processor to other ports in the switch 100. The Receiver module will be substantially similar to the GL_Port RX module described hereinafter. Similarly, an embedded port transmitter (TX) is used to transfer frames from other ports via shared memory to a system services processor. The Transmitter module will be substantially similar to the GL_Port TX module also described hereinafter. An Embedded Port Front-end is used to transfer data between the Embedded Port SRAM and the TX and RX modules.
Shared memory 120 provides buffering and switching for all fibre channel frames that flow through switch 100. Received frames are written to shared memory 120 by the receiving port then read from shared memory 120 by the transmitting port. In one embodiment, shared memory 120 has 162 total frame locations shared by the GL-Ports 130, 135 and 140 and embedded port 145. In such an example, each port may be allocated as many as 12 buffers, so long as the total of 162 buffers is not exceeded.
Central Route Control module 110 provides a common route table for all ports in switch 100. Route table provides a translation from each possible Destination ID (D_ID) value to the appropriate exit port. Additionally, the route table provides for hard zoning, which is the capability for blocking traffic from certain receive ports to certain D_IDs. Each port uses an exit port request and response bus to communicate with central route table 110.
GL_Ports 130, 135 and 140 transmit and receive fibre channel frames to and from the switch and to and from the fibre channel fabric. As shown in
GL_Ports may function as an E_Port, an F_Port or an FL_Port, for example. An E_Port is an expansion port that serves as a physical interface within the fabric that is used to create multi-switch fabrics by attaching another switches E_Port through an interswitch link (ISL). An F_Port is a fabric port that operates as a physical interface within the fabric that attaches to an N_Port of a destination device through a point-to-point link connection. An FL_Port is a fabric loop port that contains arbitrated loop (AL) functions associated with the FC-AL topology. FC-AL is a fibre channel topology where ports use arbitration to establish a point-to-point circuit.
Continuing with the illustrated example of
Local route control module (LR) 200 is used by the GL_Port RX module to request the exit port for a frame based on the frame's destination ID value (D_ID). As shown in
For the processing of frame traffic, FE 210 provides independent, symmetrical RX and TX interfaces to carry frame data. These paths consist of a data bus and control signals that identify the beginning and ending frame delimiters. For the RX path, status information about the frame including CRC validation, truncated frames and other pertinent status is also included as part of the signal.
Fibre channel front-end 210 continuously monitors its receive link for the detection of a start of frame (SOF) delimiter in the fibre channel frame. When a SOF is detected, FE 210 then forwards the frame to the RX module 220. RX module 220 stores the frame into the next available shared memory buffer. RX module 220 uses LR 200 to make a destination port routing decision from the header information of the received frame. RX module 220 then combines the shared memory buffer number into a field, which may be referred to as a Qentry field, which is passed to a TX module of the destination port through QC Module 205 RX module 220 then waits for the TX module to return the buffer number via an AckQEntry field. When RX module 220 receives the AckQEntry field it indicates to FE 210 that the buffer is being consumed.
The time that is required for all this processing is less than 1 microsecond, for example, in a particular fibre channel implementation. For this example it is most likely that RX module 220 is still storing the received frame while the TX module is transmitting the same frame, creating a cut-through switching effect. If the TX did not immediately transmit the frame, it is possible that the entire frame has been written into the buffer memory when transmission commenced, providing for a store and forward type of switching function.
The TX logic continuously monitors the bus coupling QC Controller 205 with TX module 230 for QEntries. When a QEntry is received, it is placed in TX module 230. When FE 210 is able to transmit a new frame, the queue selects a QEntry for processing. The shared memory buffer number for the frame to transmit is extracted from the QEntry and the TX module initiates a shared memory read operation. The frame data is then passed from the shared memory 120 to the FE 210. FE 210 transmits the frame.
Continuing with
RX module requests an exit port by providing a D_ID from the frame header to local route control module 200 (step 400). In one embodiment, the D_ID has 24 bits, starting with the 0 bit, which is represented by a designation [23:0]. The first operation 320 of local route control module 200 is to determine if the D_ID identifies a multicast (MC), a broadcast (BC) or a well-known address (WKA) (step 410). Multicast and broadcast addresses are directed to a MC/BC/WKA table 310 to identify the exit port. Well-Known Addresses and FC-AL Loop Initialization addresses always result in the Embedded Port being selected as the Exit Port. Domain controller identifier addresses are sent to the central route table for exit port lookup unless a bit is set and the frame is not a class F frame, in which case the Embedded port is selected as the exit port.
If central route table 110 has not yet been initialized, then all frames are routed to the embedded port. All other D_ID values are forwarded to central route table 110. However, prior to forwarding a request for an exit port to central route table 110, LR 200 performs an operation 330 to determine whether an association between the requested D_ID and an exit port designation is found in route cache 300 (step 430). If the D_ID to exit port association is found in route cache 300, the exit port is available immediately. Route cache 300 improves latency by caching the most recent exit port lookups. In one embodiment, the sixteen most recent lookups are stored in cache 300. Cache 300 should be cleared when either the central route table 110 or indirect exit tort table 350 is modified.
If the D_ID is not located in route cache 300, then local route controller 200 performs an operation 340 to send the D_ID to a central route table 110. Central route table 110 retrieves the D_ID to exit port association and returns it to local route controller 200. If applicable, the D_ID to exit port association is stored in an indirect exit port table 350 (step 470). As shown in step 480, D_ID to exit port association is stored in route cache 300. In the illustrated example, D_ID to exit port association is sent to RX module over ExitPort bus 250 (step 490).
Shared memory 120 provides buffering and switching for all frames that flow through the switch 100. Received frames are written to shared memory 120 by the receiving port then read from shared memory 120 by the transmitting port. In one embodiment, shared memory 120 has 136 total buffers shared by the 16 GL_ports and the Embedded Port. For this example, each port may be allocated as many as 12 buffers, so long as the total of 136 buffers is not exceeded.
Shared memory 120 is basically a data frame buffer that is spread out over several columns of memory. As shown in
# of rows=2148 bytes/(17 columns*4 words/byte)
Since the number of rows a frame consumes is essentially the size of the buffer, the more general equation to determine the number of rows per buffer is as follows:
# of rows/buffer=# bytes/buffer #bytes/row
The number of bytes per row is determined by multiplying the number of columns by the number of bytes per word.
The first word of the frame should be written to the first available column. For example, if GL Port_7 has completed read and write cycles to Column 15, it will then get access to Column 16. After accessing Column 16, the port will next have access to Column 0, then Column 1, etc.
Continuing with the example for port_0, in the given example for N=17, the next 16 words will be written into the same row but the next column.
As stated previously, to support 17 simultaneous 4-byte accesses every 106 MHz clock period, shared memory 120 is constructed from 17 separate SRAM modules or columns. Each buffer is striped across all 17 columns and requires 32 rows per buffer, where a row consists of 1 word at the same offset in each of the 17 columns. Continuing with the example of N=17, the total memory requirement for 136 frame buffers is:
8 frames/port*17 ports*32 rows/frame=4352 rows
However, memory depth is typically provided in multiples of 512 so there are actually 4608 rows.
In one embodiment, shared memory 120 supports both 1 Gbps and 2 Gbps read and write data rates. In one important aspect of the invention, shared memory 120 allows data to be written to a buffer at one rate and read from the same buffer at another rate, in this case 1 Gbps and 2 Gbps. One skilled in the art should understand that other data rates may be used without departing from the intended scope of the invention.
Continuing with the write example of
Continuing with the example illustrated in
Continuing with the write example of
Continuing with the example illustrated in
According to the example of
Continuing with the write example of
Continuing with the example illustrated in
According to the example of
Data is read alternately from FIFO_0 and FIFO_1 to the receiving port in a manner so as to ensure the order and therefore integrity of the frame. In the illustrated example, the first nine words are read out of FIFO_0. The number of words is determined by the equation (N+1)/2 where N is equal to the number of ports. Once the first nine words are read from FIFO_0, the first seventeen words are read from FIFO_1. The number of words read from FIFO_1 is equal to the number of ports N. Then, the remaining words are read from FIFO_0, however, if there were more than N words only the first N words would be written from FIFO_0. The sequence (N+1)/2, N.sub.1, N.sub.2 . . . N.sub.x represents this procedure of reading words from FIFO_0 and FIFO_1, wherein N represents the number of ports and x represents the number of read operations to completely empty FIFO_0 and FIFO_1.
Continuing with the write example of
Continuing with the example illustrated in
According to the example of
The central route controller may insert an exit port response on the bus in any clock cycle. A local route control module will extract the exit port response from the bus if it is addressed to that GL_Port. The local route control module is used to provide the attachment to the exit port response bus for each module. The exit port response bus signals are described in Table 2.
While there have been described above the principles of the present invention in conjunction with a specific embodiment, it is to be clearly understood that the foregoing description is made only by way of example and not as a limitation to the scope of the invention. Particularly, it is recognized that the teachings of the foregoing disclosure will suggest other modifications to those persons skilled in the relevant art. Such modifications may involve other features which are already known per se and which may be used instead of or in addition to features already described herein.
Although claims have been formulated in this application to particular combinations of features, it should be understood that the scope of the disclosure herein also includes any novel feature or any novel combination of features disclosed either explicitly or implicitly or any generalization or modification thereof which would be apparent to persons skilled in the relevant art, whether or not such relates to the same invention as presently claimed in any claim and whether or not it mitigates any or all of the same technical problems as confronted by the present invention. The applicants hereby reserve the right to formulate new claims to such features and/or combinations of such features during the prosecution of the present application or of any further application derived therefrom.
Claims
1-18. (canceled)
19. A fibre channel fabric comprising:
- a fibre channel switch having a plurality of fibre channel ports embodied thereon for transmitting and receiving data frames;
- a route control module coupled to one of said plurality of fibre channel ports, said route control module for providing an exit port in response to a request from said plurality of fibre channel ports for said exit port, said request comprising a destination identification; and
- a memory location, said memory location coupled to each of said plurality of fibre channel ports, wherein said memory location stores data frames transmitted to each of said plurality of fibre channel ports, wherein said fibre channel switch has more than one rate for reading and writing data from said memory location to said each of said plurality of fibre channel ports.
20. The fibre channel fabric of claim 19, further comprising an interface control module that couples a first fibre channel port and a second fibre channel port of said fibre channel switch, wherein said interface control module provides a first control signal to said second fibre channel switch in response to a second control signal from said first fibre channel switch.
21. The fibre channel fabric of claim 19, wherein data frames that are received at a first receiving port of said plurality of ports are written into said memory, and said data frames written into said memory are read from said memory by a first transmitting port of said plurality of ports.
22. The fibre channel fabric of claim 19, wherein said memory comprises a plurality of SRAM modules.
23. The fibre channel fabric of claim 22, wherein said SRAM modules number 17 in total.
24. The fibre channel fabric of claim 19, wherein said memory supports both 1 gigabit per second and 2 gigabit per second read and write data rates.
25. The fibre channel fabric of claim 24, wherein said memory supports a different read rate and write data rate such that data can be written to said memory at a first rate and simultaneously read from said memory at a second, different rate.
26. The fibre channel fabric of claim 19 further comprising a bus coupling said one of said plurality of fibre channel ports to said memory for transmitting said data frame.
27. A method for providing a shared memory location coupled to a plurality of ports in a fibre channel switch for buffering and switching data frames from a first port to a second port, said method comprising:
- providing a plurality of columns of memory, said columns of memory comprising a plurality of rows for storing a portion of said data frames, wherein said portion is written from said first port; and
- providing a plurality of first-in first out data storage locations, wherein said portion of said data frame is stored in said data storage location before being read to said second port.
28. The method of claim 27, wherein the number of columns of memory is equal to or greater than the number of ports.
29. A method for providing a shared memory location comprising:
- receiving data frame at a first port;
- writing a first portion of said data frame to a first column of memory;
- reading said first portion of said data and thereafter writing said first portion to a first-in first out storage location; and
- reading said first portion of said data from said storage location and thereafter writing said first portion to a second port.
30. The method of claim 29, wherein said first portion of said frame comprises a word.
31. The method of claim 30, wherein said word is 34 bits in length.
32. The method of claim 30, wherein said writing to the first column of memory is at a first data rate and the reading from the first column is at a second data rate, wherein said first data rate is different than said second data rate.
33. The method of claim 32, wherein said first data rate is two gigibits per second and said second data rate is one gigibit per second.
34. The method of claim 32, wherein said first data rate is one gigibit per second and said second data rate is two gigibits per second.
35. A share d memory location within a fibre channel fabric having a plurality of ports, said shared memory coupled to said plurality of ports, said shared memory comprising:
- a plurality of memory locations, each memory location comprising a column and having a plurality of rows for storing a data frame, said data frame comprising a plurality of words; and
- a plurality of first-in first out data storage locations, wherein a word is written from said memory location to said first-in first-out storage location before transmission to a port.
36. The shared memory location of claim 35, wherein the number of memory locations is greater than or equal to the number of ports.
37. The shared memory location of claim 35, wherein the number of memory locations is an odd number.
38. The shared memory location of claim 35, wherein the memory locations are implemented using SRAM modules.
39. The shared memory location of claim 35, wherein the width of said memory location is 34 bits, wherein 34 bits is the length of said word of said data frame.
40. The shared memory location of claim 35, wherein said plurality of said first-in first-out storage locations comprises two first-in first-out storage locations.
41. The shared memory location of claim 35, wherein the depth of a buffer is determined by multiplying the number of frames per port by the number of ports by the number of rows per frame.
42. The shared memory location of claim 35, wherein the number of rows per frame is determined by dividing the frame byte size by the total resulting from multiplying the number of columns by the number of bytes per word.
Type: Application
Filed: May 30, 2006
Publication Date: Jan 4, 2007
Inventor: William Mitchem (Lafayette, CO)
Application Number: 11/444,632
International Classification: H04L 12/56 (20060101);