Method for data communication
In order to enable high speed, high bandwidth data transfer between two ASIC devices, for example in a backplane, a wide parallel input data word is divided into a smaller number of words, and each smaller word is converted to serial form and then transmitted over a respective sub-link at a high clock rate relative to the system clock. At the receiving side, the clock is recovered from the serial words, and the serial words are converted back to parallel form. An alignment process is then carried out, firstly involving detecting the positions of the bits of the words and then storing the words in a buffer FIFO register. The words are clocked out of the FIFO register in synchronism under control of the system clock once it is detected that valid words are received in the FIFO registers.
Latest Agere Systems Inc. Patents:
- Systems and methods for low latency media defect detection
- Methods and apparatus for determining threshold of one or more DFE transition latches based on incoming data eye
- Method of identifying and/or programming an integrated circuit
- Semiconductor device and a method of manufacture therefor
- Systems and methods for low latency media defect detection
This application is a continuation of U.S. patent application Ser. No. 09/808,664 filed Mar. 15, 2001, which claims priority to Great Britain Application No. 0006291.9 filed Mar. 15, 2000, each of which is hereby incorporated by reference in its entirety.
BACKGROUND1. Field of the Invention
The present invention relates to a data communication link for high speed, high bandwidth applications.
2. Description of the Related Art
In applications such as providing a data communication link between two Application Specific Integrated Circuits (ASICs) in a local backplane of a computing system, very high data rates may be required, e.g., an average data rate of at least 4.8 Gigabits per second (Gbps). The data link may be 64 bits wide.
Of the various possibilities for implementing such a link, it is possible to provide an interface that transfers data from the transmitting ASIC to the receiving ASIC as a single parallel word with a synchronising clock signal running at the system clock rate CK, say 78 MHz. However, for a data word of 64 bits to achieve a data transfer rate of 4.8 Gbps this would require 65 device pins, which for many applications would be either impractical or too costly to provide in the ASICs.
A synchronous interface could be used using a smaller number of pins, by multiplexing a 64 bit wide data word N times onto W bits (=64/N) and by providing a synchronising clock. However with a clock signal running at 78 MHz, the bandwidth would be reduced to W*CK=BW/N, which would give an unacceptably slow data transfer rate.
In order to achieve a bandwidth of 4.8 Gbps, the transfer rate may be multiplied N times. A synchronous interface which has a resultant Transfer Clock, N*CK, of less than 200 MHz may be practical. Above 200 MHz, which would be necessary to achieve the desired transfer rate of 4.8 Gbps, each data bit would be valid for a maximum of 5 ns, reducing further when rise-fall times of the interconnect signals and input/output buffers are included. The task of achieving a robust design, ensuring that all W bits are aligned such that the synchronising clock can always capture valid data bytes at the receiving ASIC, is far from trivial.
SUMMARY OF THE INVENTIONWith a view to avoiding the above noted problems, the invention provides a method for transmitting data, comprising the steps of: (a) responsive to a system clock, generating a transfer clock at a high rate relative to the system clock; (b) dividing an input word into a plurality of smaller words; and (c) transmitting the plurality of smaller words over corresponding serial sub-links in response to the transfer clock.
In an alternative embodiment, the invention further provides a method for receiving data, comprising the steps of: (a) converting received serial data words from a plurality of serial sub-links into parallel form; and (b) responsive to the received data, generating a low speed clock with a frequency nominally equal to a system clock.
The invention further provides a data transmitter having: a transfer clock generator, responsive to the system clock, generating a transfer clock at a high rate relative to the system clock; and a parallel to serial register, for dividing an input word into a plurality of smaller words and transmitting them over corresponding serial sub-links in response to the transfer clock.
In an alternative embodiment, the invention still further provides a receiver having: a plurality of serial to parallel registers coupled to corresponding serial sub-links, for converting received serial data words from the sub-links into parallel form; and a clock generator, responsive to the received data, for generating a low speed clock with a frequency nominally equal to the system clock.
BRIEF DESCRIPTION OF THE DRAWINGSA preferred embodiment of the invention will now be described with reference to the accompanying drawings wherein:
Referring to
DW=W*N
Where;
DW=Bit width of wide data word
W=Bit width of sub-data word
N=An integer value, greater than 1
The data bandwidth across the link is given as;
BW=DW*CK
Where;
BW=Bandwidth in Mega bits per second (Mbps)
CK=Transfer Clock in Mega Hertz (MHz)
For example, W=8, N=8, DW=64. The Transfer Clock, CK, is 78 MHz giving a BW of 4992 Mbps. However the invention is not limited to these specific values.
Interface 6 in ASIC 2 has a register 8 for breaking down the wide input data words, DW, into N (in this embodiment 8) smaller sub-words W (each 8 bits long). Each sub-word W is treated independently, using a Clock Data Recovery Module 10 (CDRM) macrocell. CDRM 10 has a multiplier 12 for multiplying the clock CK, W (8) times and respective parallel to serial (PISO) converters 14 for operating on each of N, W bit words. Each serial word is transmitted over a respective sub-link 16.
Referring to
For Transmit, Low Speed Data (LDTX) on line 40 will be presented to the CDRM at the rate of the Reference Clock (REFCK). The Reference Clock will be multiplied in frequency eight times by Phase Locked Loop (PLL) 50 to create High Speed Clock (HSCK) on line 52. LDTX data on line 40 will be loaded into a Parallel Serial Output (PISO) register 54 at the REFCK rate, and clocked out serially at the HSCK rate to form HDTX data on line 42.
For Receive, the High Speed Clock (HSCK) will be divided by eight at 58 to create a Low Speed Clock (LDCK) output. However, the phase of this clock must be adjusted so that its associated Low Speed Data (LDRX) is stable at the time of the active edge of LDCK. This is done by edge detection and phase adjustment circuit 60, 62 which monitors the High Speed Data (HDRX) on line 44. HDRX is also passed into a Serial Parallel Output (SIPO) register 56 to create the Low Speed Received Parallel Data (LDRX) on line 46. The output from the SIPO 56 will be enabled on the opposite edge to the active edge of its associated clock LDCK.
The number of transmit and sub-links are replicated 8 times in this example. However, there will generally only be a single PLL per CDRM macrocell.
On the receive side, the serial links are passed through CDRM macrocell 22, and a W bit word and clock will be recovered for each of the N serial links. The CDRM 22 has no knowledge of the boundary between one W bit word and the next within the serial data stream and it is therefore the first task of the Interface 20 to identify the correct bit alignment within each sub-link. Having recovered the W bit words for each sub-link, all N of the W bit words have to be aligned and synchronised to recreate the original DW width word.
The bit alignment is achieved by the transmit side sending consecutive initialisation words constructed by ASIC 2. These initialisation words (of W bits) have the property that however many times the word is shifted right or left within another word that is 2W bits wide, there is a unique position that defines the bit alignment. For example consider an initialisation word, for W=8, of “10111000”. A register 24 that is 2W words wide holds the previously received and
currently received words of W bits as shown in the above table. The initialisation word is sent at least twice followed by another synchronisation word (user defined) delimiter to indicate the start of transmission of true data. The position of the word is located in the register by means of a state machine (not shown) and this information is relayed to subsequent stages.
During transmission, each ASIC transmitting/receiving interface will respectively create/recreate a cyclic redundancy code (CRC) from the true data. The CRC words are inserted at a pre-determined interval, programmed to both transmit and receive sides. After this interval the transmitted CRC should equal the recreated CRC. If not, then either bit alignment has been lost or a corruption has occurred during the transmission of the data. This provides an Integrity Check individually on each of the serial links.
Thus, as shown in
The bit alignment and the Integrity Check are performed in each sub-link using the recovered clock generated for that serial link. There is no guarantee of any phase relationship between any of the N recovered clock (RCK[n])s, and each of the recovered clocks may be jittering (except that the recovered clocks will be within one clock cycle of one another). However, the average frequency of all recovered clocks and that of the Transfer Clock, CK, on the transmit side must be exactly the same, since the reference clock to both the transmit and receive ASICs will be driven from the same crystal oscillator. A mechanism is therefore required to re-align the N recovered sub-words and resynchronise the wide data word back to the Transfer Clock, CK. This is done by using a short First In First Out (FIFO) 28, 6 words long, at the end of each serial link.
The recovered sub-word plus a marker bit (W+1 bits) is written to the FIFO 28 by its associated recovered clock on line 48. The marker bit indicates whether that data word was Transmitted Synchronisation or Integrity Check Word. The very first word to be written by each of the links, will be a synchronisation word (marker bit set) and the second will be the first sub-word of true data. The first write will occur at a slightly different time for each link, but by the time the second write occurs, all will have written at least once. The addressing of the FIFOs may use Johnson coding, as more clearly seen in
The initial value of the address is 011 and the address scheme changes as indicated in
Thus, only a single address bit of FIFO's 28 changes per write and by ensuring that the top address bit is set on the second write, that address bit can be logically OR'd with the equivalent bit from all N links. This single bit signal, which goes high when the first word in a sub-link is received, is resynchronised via the metastability registers 70. By this time, since it is known all FIFO registers will be written to within a clock cycle of one another, all FIFO's will contain words, and the state machine 74 triggers the Word Aligner to read from all N FIFO's in parallel at the Transfer Clock rate, CK. This read should therefore occur when each of the FIFOs contain approximately four words. As the average frequency of the read and write clocks to the FIFO is the same each FIFO should always contain approximately four words. A FIFO that is at least six deep will isolate against jitter on the recovered clocks.
The very first FIFO read will all be synchronised sub-words but the second will be the recovery of the first true wide data word. The output of the FIFOs are applied to a Word Aligner register 78 which reconstitutes the original data word 80 (
The scheme outlined provides a robust high speed, high bandwidth local link by using a number of serial asynchronous links in parallel.
Thus, it will now be understood that there has been disclosed a new method and apparatus for providing a data communication link. While the invention has been particularly illustrated and described with reference to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form, details, and applications may be made therein. It is accordingly intended that the appended claims shall cover all such changes in form, details and applications which do not depart from the true spirit and scope of the invention.
Claims
1. A method for transmitting data over a communication link between a first integrated circuit having a system clock and a second integrated circuit, comprising the steps of:
- a. responsive to the system clock, generating a transfer clock at a high rate relative to the system clock;
- b. sending one or more bit alignment code words to initialize the communication link;
- c. dividing an input word into a plurality of smaller words; and
- d. transmitting the plurality of smaller words over corresponding serial sub-links in response to the transfer clock.
2. The method of claim 1, further comprising the step of:
- e. transmitting a CRC code word at predetermined intervals.
3. A method for receiving data transmitted over a communication link having a plurality of serial sub-links at an integrated circuit having a system clock, comprising the steps of:
- a. receiving a plurality of serial data words over the plurality of serial sub-links;
- b. responsive to the plurality of serial data words, generating a low speed clock with a frequency nominally equal to the system clock;
- c. storing the received serial data words for each sub-link in a plurality of buffer memories corresponding respectively to each sub-link; and
- d. reading the buffer memories in synchronism under control of the system clock in order to reconstitute each data word in parallel form.
4. The method according to claim 3, wherein the buffer memories are FIFO registers.
5. The method according to claim 4, wherein the step c. of storing received data words comprises the step of:
- e. addressing the buffer memories by changing only one bit of the address for each incremental address.
6. The method according to claim 5, wherein the step d. of reading the buffer memories comprises the steps of:
- f. comparing a predetermined bit of the address of each FIFO; and
- g. generating a trigger signal to actuate a state machine to cause reading of the FIFO registers.
7. A method for receiving data transmitted over a communication link having a plurality of serial sub-links at an integrated circuit having a system clock, comprising the steps of:
- a. receiving a plurality of serial data words over the plurality of serial sub-links;
- b. storing the received serial data words for each sub-link in a plurality of serial-to-parallel registers corresponding to the serial sub-links;
- c. responsive to the plurality of serial data words, generating a low speed clock with a frequency nominally equal to the system clock;
- d. detecting one or more edges of the received serial data words;
- e. based on the one or more detected edges, aligning the low speed clock with the plurality of serial data words; and
- f. applying the low speed clock to the plurality of serial-to-parallel registers for clocking out parallel words from the plurality of serial-to-parallel registers.
8. The method according to claim 7, further comprising the step of:
- g. storing the received bit alignment words in a bit alignment register in order to locate the position of the bits in the plurality of serial-to-parallel registers.
9. The method according to claim 8, further comprising the steps of:
- h. generating a CRC code word in response to the received data; and
- i. checking a received CRC code word against the generated CRC code word.
10. A method for communicating data over a communication link between a first integrated circuit having a first system clock and a second integrated circuit having a second system clock, comprising the steps of:
- a. responsive to the first system clock, generating a transfer clock at a high rate relative to the first system clock;
- b. sending one or more bit alignment code words from the first integrated circuit to the second integrated circuit to initialize the communication link;
- c. dividing an input word into a plurality of smaller words;
- d. transmitting the plurality of smaller words from the first integrated circuit to the second integrated circuit over a corresponding plurality of serial sub-links in response to the transfer clock;
- e. receiving, at the second integrated circuit, the plurality of smaller words over the plurality of serial sub-links;
- f. responsive to the plurality of serial data words, generating a low speed clock with a frequency nominally equal to the second system clock;
- g. storing the received serial data words for each sub-link in a plurality of buffer memories corresponding respectively to each sub-link; and
- h. reading the buffer memories in synchronism under control of the second system clock in order to reconstitute each data word in parallel form.
Type: Application
Filed: Mar 22, 2006
Publication Date: Aug 24, 2006
Applicant: Agere Systems Inc. (Allentown, PA)
Inventor: Robert Spooner (Lower Earley)
Application Number: 11/386,418
International Classification: H04J 3/06 (20060101);