Partial Response Equalizer and Related Method
A multi-phase partial response receiver supports various incoming data rates by sampling PrDFE output values at a selected one of at least two clock phases. The receiver includes a calibration circuit that performs a timing analysis of critical data paths in the circuit, and this analysis is then used to select the particular clock phase used to latch the output values. These techniques permit the multiplexer outputs from for each phase of the partial response receiver to directly drive selection of a multiplexer for the ensuing phase, i.e., by avoiding regions of instability or uncertainty in the respective multiplexer outputs.
Latest RAMBUS Inc. Patents:
The present invention relates generally to the field of communications, and more particularly to high speed electronic signaling within and between integrated circuit devices.
BACKGROUNDMulti-gigabyte data rates, common in high speed signaling systems, can be affected by dispersion-type inter-symbol interference (ISI) created by the channels and interfaces that connect integrated circuits (IC). The effects of this ISI typically have a greater impact upon “data eyes” of each transmitted signal at faster signaling rates, ultimately degrading the signal quality to the point where it becomes difficult to interpret the digital levels represented by the signal.
Receivers in these systems sometimes use a decision-feedback equalizer (DFE) to cancel this ISI. A DFE uses the feedback of one or more previously resolved symbols to offset their impact on the incoming symbol. Typically, each of the n recently received symbols is multiplied by some weighting (e.g., a coefficient), and these are used to adjust the received signal to offset ISI. The ISI associated with the prior data is thereby removed.
In some high-speed systems it can be difficult to resolve the most recent data bit(s) in time to close a tight feedback loop at high clock rates. Some receivers ignore the impact of such bit(s) on the incoming signal, and consequently fail to correct for the ISI-attributed to those bits. Other receivers employ “partial response” DFEs (PrDFEs) that produce multiple “conditional samples” of incoming data, each assuming a different threshold (based on possible states of as-yet, still unresolved previous data). The correct sample is then selected from the multiple conditional samples after the previously received bit(s) is resolved. As implied, usually only the immediately previously received bit is used for partial response equalization (i.e., to select the conditional sample), although it is possible to base partial response evaluation on two or more previously resolved bits.
While conventional in many systems and useful for a wide range of signaling rates, PrDFE receivers such as the one illustrated in
Unfortunately, while signaling rates continue to increase, the speed of digital circuitry (such as the circuitry illustrated in
The subject matter disclosed is illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which like reference numerals refer to similar elements and in which:
A signal receiving circuit for use in electrical signaling systems is disclosed in various embodiments. In one embodiment, the signal receiving circuit is a multi-phase PrDFE receiver having a feedback timing constraint of tsel<1UL This improved feedback timing constraint ensures that the feedback latency is within the minimum bit period and enables the receiver to handle relatively faster data signaling rates. As a consequence, the various embodiments presented below allows the use of PrDFE in faster signaling systems.
More particularly, a multi-phase receiver includes multiple PrDFE circuits, an output circuit and a calibration circuit. Each PrDFE circuit generates two or more conditional samples and selects one of these to produce a resolved value (e.g., a bit value) based on a previously resolved value from another one of the PrDFE circuits. The output circuit delays each resolved value out according to a clock signal. The calibration circuit may be used to perform analysis of the critical paths in the receiver to determine the speed of the receiver circuit. Based on this analysis, the calibration circuit can determine the appropriate clock phase at which the outputs of the PrDFE circuits are stable. In this manner, the latch 116 or 128 used in the prior art of
Attention now turns to a further description of the embodiments of this improved receiver. Notably, as used herein, a “receiver” should be generally understood to be a per-channel (or per lane) circuit that resolves one or more data values; a “PrDFE” circuit should be understood to be that circuitry that produces and selects between multiple conditional samples for a given phase of the input signal (e.g., “even” or “odd” phase in the case of a DDR receiver). Thus, a double data rate (DDR) PrDFE receiver may include two PrDFE circuits, a quad data rate (QDR) PrDFE receiver may include four PrDFE circuits, and so on.
The system 200 can be utilized in a number of different applications. In one embodiment, the system 200 can be a memory system. The transmitter 202 may be a memory controller and the receiver 204 can be part of a memory device (e.g., DRAM, SRAM, RRAM, etc.) or vice versa, the transmitter 202 may be part of a memory device and the receiver 204 may be part of a memory controller. The transmitter 202 and receiver 204 can be located on the same or separate ICs. Alternatively, the system 200 can be part of a memory device positioned on a single IC. Generally speaking, one contemplated application of the system 200 is to a bidirectional memory system where the transmitter is employed in each of a memory controller IC and a DRAM IC, and the receiver is also employed in the memory controller IC and the DRAM IC, for transmissions from one IC to the other. The DRAM IC may be a dedicated memory device that receives read and write commands for a specific memory address, and that multiplexes memory contents responsive to the address onto or off of one or more channels or lanes of the signal path (e.g., onto signaling pins for a parallel or serial data transmission system).
Regardless of system application, the transmitter 202 transmits data on the signal path 206 during successive time intervals. The transmitted data signal arrives at the input of the receiver 204 after a propagation time, Tp (not shown in
The receiver 204 may also include a receive-side equalizer 212 (e.g., either in addition to or in lieu of the transmit equalizer) and one or more PrDFE circuits 214; only one PrDFE circuit is used in this embodiment, although additional circuits may be used, e.g., for additional phases associated with a DDR or QDR system. Equalizer 212 equalizes the received data signal from signal path 206 to produce equalized signal Din. Equalizer 212 amplifies the received data signal using a range of amplification factors, for example, with higher frequency components typically being treated to higher amplification factors. The PrDFE circuit 214 then reduces or eliminates ISI attributed to one or more “most recently received” symbols associated with the transmission.
Depending on implementation, it may be desired to calibrate each receiver 230a-n to be calibrated separately, and to perform calibration on a static or dynamic basis. For example, in some semiconductor devices, there can be on chip variations (OCV) within different parts of a single die. The OCV can be due to PVT (pressure, voltage or temperature) variations that occur with respect to time, or that occur within different circuit elements on different parts of the die. In the case of time-based variations, it may be desired to perform calibration dynamically, e.g., every few milliseconds or at some other schedule, and in the case of circuit or geography based variations, it may be desired to perform calibration independently for each receiver. As this statement implies, in at least one embodiment, calibration may be performed for one signaling path and shared across multiple receivers.
The odd-phase PrDFE circuit 402 includes two samplers 407 and 408, used to sample the input data signal Din. The samplers 407 and 408 have threshold levels that are dependent on two possible partial responses to the preceding symbol. The threshold level of sampler 407 is set to −α level and the threshold level of sampler 408 is set to +α level. In this manner, if the preceding symbol was a ‘0,’ sampler 407 will resolve the incoming signal as being a ‘1’ or ‘0’ by determining whether the signal level is above or below the partial response level −α. Conversely, if the preceding symbol was a ‘1,’ sampler 408 will resolve the incoming signal as being a ‘1’ or ‘0’ by determining whether the signal is above or below the partial response level +α. The conditional samples output by samplers 407 and 408 are respective inputs to storage elements 412, 414 (e.g., D flip-flops or other types of storage circuits) and are provided as respective inputs to multiplexer 416. The use of the two data paths considering two possible responses is sometimes also referred to as loop unrolling.
Multiplexer 416 selects one of the conditional samples from storage elements 412, 414 based on the immediately preceding symbol, as resolved by the even-phase sample circuit 404. If the immediately preceding symbol was a logic ‘1,’ then multiplexer 416 selects the sample value output from storage element 414; conversely, if the immediately preceding symbol was a logic ‘0,’ then multiplexer 416 selects the sample value output from storage element 412. The selected sample value from multiplexer 416, or resolved value, is fed back as the selection signal that controls multiplexer 426 to select one of the two ensuing conditional values generated by samplers 418 and 420.
The even-phase PrDFE circuit 404 includes two samplers, 418 and 420, used to sample the input data signal Din at a clock phase opposite of the odd-phase receiver 402. Samplers 418 and 420 have threshold levels dependent upon the two possible partial responses to the preceding symbol. The threshold level of sampler 418 is set to −α level and the threshold level of sampler 420 is set to +α level. In this manner, if the preceding symbol was a ‘0,’ the sampler 418 will resolve the incoming signal as being a ‘1’ or ‘0’ by determining whether the signal level is above or below the partial response level −α. Conversely, if the preceding symbol was a ‘0,’ sampler 420 will resolve the incoming signal as being a ‘1’ or ‘0’ by determining whether the signal is above or below the partial response level +α.
The output circuit 406 has two storage elements to receive the resolved symbol from each PrDFE circuit (i.e., for each phase of data output). In particular, the output circuit has two storage elements 428, 430 (e.g., D flip-flops or other types of storage circuits) coupled to receive the selected sample values from odd-phase PrDFE circuit 402. Storage element 428 is clocked using a first timing choice (e.g., in response to sampling clock signal clk) and storage element 430 is clocked using a second timing choice (e.g., in response sampling clock signal clkb). Storage element 428 stores the resolved symbol from multiplexer 416 at the same phase at which multiplexer 416 receives its inputs, and storage element 430 stores the same resolved symbol at a different phase, e.g., the opposite clock phase in this example. The outputs of storage elements 428, 430 are inputs to a selection circuit or multiplexer 432. Multiplexer 432 receives a selection signal 444 that controls which of its inputs is the Dn−1 sample will then be passed as a stable output, e.g., to another storage element 434 or to some other form of output.
The output circuit 406 also has two storage elements 436, 438 (e.g., D flip-flops or other types of storage circuits) that receive the output of the even-phase PrDFE circuit 404. Storage element 436 is clocked in response to clock signal clkb and storage element 438 is clocked in response to clock signal clk. Storage element 436 stores (i.e., samples) the resolved symbol from multiplexer 426 at the same phase multiplexer 426 receives its inputs, and storage element 438 stores (i.e., samples) the resolved symbol from multiplexer 426 at a different clock phase (e.g., the opposite phase). The outputs of storage elements 436, 438 are the inputs to selection circuit or multiplexer 440. Multiplexer 440 receives a selection signal 444 that controls which of the inputs is relied upon, to represent a sampling instant where the input is guaranteed to be stable as the even-phase sample value, Dn. Again, this resolved symbol may be stored in storage element 442 or passed as some other form of data output.
In the embodiment presented above, the timing choices may each be one of the clocks used to time the sampling of one of the phases of a multi-phase input signal. In alternate embodiments, other clock signals may be used, or the timing choices may be generated and/or selected in some other manner.
The CAL (calibration) signal 444 is used in the embodiment of
It would be helpful at this point to reiterate some of the performance improvements obtained by this PrDFE design.
Common problems with semiconductor devices include silicon lot variations, which can account for timing problems in specific where data signals arrive at storage elements too early in one die and late in another die, causing premature gating, latching of erroneous values, and the like. These variations may create problems at certain frequencies but not others, and may be influenced by environmental conditions, e.g., PVT variations. These device-to-device variations may also cause devices to perform marginally even for signaling rates within specification. Whatever the cause, at some point the variations can be stray outside of the bounds of reliable device operation. These instances are referred to as process corners.
In connection with the embodiments presented herein, the proximity of specific devices to process corners can to an extent be corrected for, in a manner that allows direct cross-coupled multiplexer feedback without intervening delay elements that affect the critical path (i.e., without delay elements that might affect feedback paths 417 or 429, specifically). That is to say, once the process corners are known relative to specific signaling rates, the receiver of the embodiments presented herein can be tuned so that cross-coupled PrDFE multiplexers are sampled at a time when data is known to be reliable. In the context of the embodiment of
Conceptually speaking, the timing problem (i.e., the uncertainty problem) is introduced with respect to
Since the data signal DO (FF) is valid for up to a full clock period, there will generally be at least one edge (rising or falling) of the and potentially two clock edges (rising and falling) at which the data signal produced by the multiplexer may be sampled. However, depending on signaling rates and the process corners referenced above, an unknown one of the clock edges may produce invalid (i.e., unreliable) data.
It should be noted that the issue as to when data may be reliably sampled at the multiplexer output of each PrDFE circuit is not just a function of the total delay (e.g., the “fast” versus “slow” representations of
To address these issues, storage circuit pairs 428/430 and/or 436/438 provide for alternate clock selection, which effectively permit selective advance of the sampling of the “DO” from the multiplexer to be one-half clock cycle earlier, prior to a corresponding process comer's instability, i.e., selecting storage element 430 instead of storage element 428 ensures that sample output “DO” will be sampled in a manner before the next odd-phase sample can propagate to the selection signal used to control multiplexer 416. In the case of a slow process corner, it may be desired to allow more time before the sampling of data out of each PrDFE circuit by the output circuit 406, and storage elements 430,438 may be selected as providing a sampling instant which occurs at a time when sample outputs DO/DE are relatively stable.
When circuitry responds slow relative to the signaling rate, the CAL signal 444 is set to ‘0’ indicating that the output circuit 406 is to use only storage elements 428 and 436 (the other storage elements 430, 438 are left out of
tsel<1 UI,
where tsel represents the time for the selected sample to control the next multiplexer 426 or 416 to select one of its inputs via feedback signal 429 or 417. However, as alluded to above, in a relatively slow circuit, clocking may be selected based on the concern that the feedforward path may be too slow, i.e., it may be problematic to clock the feedforward path subject to the constraint of
tck-Q+tmuxtsu<1 UI,
where tck-Q represents the “clock to Q” time of storage elements 412, 414, 422 and 424 and tmux represents the time for a selected sample to propagate through multiplexer 416 or 426, and tsu represents the setup time of the next storage elements 428, 430, 436, and 438. [Conceptually, the situation was referenced at the bottom of
tck-Q+tmux+tsu<2 UI.
tck-Q+tmux+tsel+tsu˜1 UI,
which is to say, if the output circuit were to sample data at the opposite phase in a DDR circuit, the feedback from the next ensuing symbol (Dn+1) might be so fast as to render sampled data unreliable. [This situation was referenced in the middle of
tck-Q+tmux+tsu<1 UI,
representing that the feedforward path (702 in
0<<tck-Q+tmux+tsel+tsu<2 UI,
but the system avoids the fast data region based on its selection of the clock used to sample the output of multiplexers 432 and 440.
There is also an overlap region where the PrDFE circuits where the rate is just right, which is to say, neither the fast corner nor the slow corner is invoked. Mathematically, the overlap region is described as follows:
tck-Q+tmux+tsu<1 UI<tck-Q+tmux+tseltsu,
where the slow data rate region would mathematically be represented by tck-Q+tmux+tsu<1 UI, and the fast data rate region (associated with the feedback path) is tck-Q+tmux+tsel+tsu>1 UI. In this case, the value of the CAL signal does not matter, i.e., the receiver should operate without reliability issues irrespective of the value CAL.
In one embodiment, the determination of the appropriate setting for the CAL signal is made based on selecting the region with the larger timing margin. Mathematically, these margins can be described as follows:
Slow data rate region margin=1 UI−(tck-Q+tmux+tsu); and
Fast data rate region margin=(tck-Q+tmux+tsel+tsu)−1 UI.
These quantities can be measured, and once relative magnitude is determined, the signal CAL may be responsively set.
The above timing analysis described above with respect to
Fast data rate region or slow process corner: A>1 UI and A+B>1 UI, then CAL=0; (1)
Invalid case: A>1 UI and A+B<1 UI; (2)
Slow data rate region or fast process corner: A<1 UI and A+B<1 UI, then CAL=1; and (3)
Overlap region: A<1 UI and A+B>1 UI, then CAL=0 or 1. (4)
The calibration circuitry is configured to detect these cases and set the calibration signal accordingly.
In one embodiment, the calibration circuitry 800 has a finite state machine 802, a pulse generator 804, a replica timing circuit 806, and pulse detectors 818, 820. The finite state machine 802 controls the operation of the calibration circuitry 800 and sets the value of the CAL signal. The pulse generator 804 is used to generate timing signals that flow through the replica timing circuits. The replica timing circuit 806 contains replica circuit elements in exemplary feedforward and feedback paths of the DDR (or other) PrDFE receiver. Pulse detector 818 is coupled to the output of a replica feedforward path and pulse detector 820 is coupled to the output of a replica cascade feedback path.
Replica timing circuit 806 contains the circuit elements that significantly affect the timing of the feedforward path. There is storage element 810 responsive to sampling clock signal clk. The output of the storage element 810 is coupled to a first select circuit or multiplexer 812. The first select circuit 812 is coupled to storage element 814 (clocked at clk) and storage element 816 (clocked at clkb). The output of storage element 816 is coupled to pulse detector 818. Pulse detector 818 is configured to determine whether A<1 UI and transmits the output of the feedforward path, out1, to finite state machine 802 at the clock cycle indicated by a selection signal 828.
Replica timing circuit 806 also contains the critical path circuit elements affecting the timing of the cascade path. There is a second multiplexer 822 configured to receive a selection signal 821 the output of multiplexer 812, and an output from this multiplexer is coupled to a further (dummy) multiplexer 823 to replicate loading effects. The output of multiplexer 812 is coupled to storage element 824 (clocked at clkb), and storage element 826 (clocked at clk). Storage element 824 is coupled to pulse detector 820. Pulse detector 820 is configured to determine whether A+B<1 UI and to transmit the output of the cascade feedback path, out2 to finite state machine 802 according to signal 828.
At the time when calibration is performed (i.e., during device or board assembly and/or testing, during the initialization or power-up of the IC containing the PrDFE receiver or during the initialization of the receiver, or during periodic “dynamic” calibration (e.g., performed to compensate for PVT variations), the finite state machine 802 is first initialized; the finite state machine, in turn, causes the pulse generator 804 to generate a test signal 809 synchronized to the sampling clock domain used by the receiver(s) (i.e., one or more receivers on the same integrated circuit). Clock circuitry (not shown) generates the sampling clock signals, clk and clkb. After the test signal 809 transitions to ‘1,’ the pulse generator generates the signal 828 at a predetermined clock cycle. The signal 828 is transmitted to each of the pulse detectors 818, 820 to measure output signals, out1 and out2, and to transmit their states to the finite state machine 802. The signal 828 is generated so that the phase detectors read the output signals, out1 and out2, at the clock cycle after completion of the longest data path. The finite state machine 802 receives signals out1 and out2 and based on these values, determines the correct setting of the calibration signal, CAL.
The table shown in
In the overlap region, where A<1 UI and A+B>1 UI, the finite state machine 802 can repeat the timing analysis repeatedly for a predetermined number of times, n. At the completion of the nth pass through the replica timing circuit 806, the finite state machine 802 can set the CAL signal to the value that was detected most frequently. After the CAL signal is set, the finite state machine 802 can terminate the calibration process.
As represented by this description, in one embodiment, a calibration process (e.g., with separate dummy circuits) can be used for each receiver of a circuit (e.g., an “IC” having multiple receivers). This configuration may be useful for the system of
Several time points, T1-T4, are illustrated in
The calibration circuitry 1000 includes a storage element 832 which receives test signal 809 from pulse generator 804 and responds to clock signal clk. The output of storage element 832 is fed into the selection control for multiplexer 834. The outputs of storage element 832 and multiplexer 834 are inputs to circuit 836. Circuit 836 is effectively a modified multiplexer which produces an output which is dependent on both of its inputs, representing the average timing of both inputs plus circuit propagation delays, and the output is then both fed as the selection signal to a dummy multiplexer 842 and transmitted to storage elements 838 and 840. Storage element 838 is responsive to clock signal, clk, and storage element 840 is responsive to clock signal, clkb. The output of storage element 840 is transmitted to pulse detector 844. Pulse detector 844 detects the value of the output signal, out3, in response to the selection signal 828. The output signal, out3, is transmitted to finite state machine 802 which, in turn, determines the appropriate setting for the CAL signal as shown in
Although the embodiments herein have been shown with respect to a DDR signaling system, the technology described herein can be easily adapted to any multi-phase signaling system, such as without limitation, a quadruple or quad data rate (QDR) signaling system, an octal data rate (ODR) signaling system, and the like. By way of example, attention now turns to a discussion of a QDR signaling system employing the PrDFE technology described herein.
Referring back to
The conditional samples output by samplers 1206a-h are respective inputs to storage elements 1208a-h (e.g., D flip-flops or other types of storage circuits) and are respective inputs to a multiplexer 1210a-d. Each multiplexer 1210a-d selects one of the conditional samples from respective ones of storage elements 1208a-h based on the immediately preceding symbol. If the immediately preceding symbol was a logic ‘1’, then each multiplexer 1210a-d selects the conditional samples output from respective storage element 1208b,d,f,h; conversely, if the immediately preceding symbol was a logic ‘0’, then each multiplexer 1210a-d selects the conditional samples output from respective storage element 1208a,c,e,g.
The selected sample value (i.e., sample selected to represent the resolved symbol) from multiplexer 1210a is fed as the selection signal for multiplexer 1210b to select one of the two conditional samples generated by samplers 1206c and 1206d; the sample selected from multiplexer 1210b is fed as the selection signal for multiplexer 1210c to select one of the two conditional samples generated by samplers 1206e and 1206f; the sample selected from multiplexer 1210c is fed as the selection signal for multiplexer 1210d, to select one of the two conditional samples generated by samplers 1206g and 1206h; and sample selected from multiplexer 1210d is fed as the selection signal for multiplexer 1210a, to select one of its two conditional samples (i.e., from samplers 1206a and 1206b).
The output circuit 1212 has two storage elements (e.g., D flip-flops or other types of storage circuits) for each PrDFE circuit, coupled to receive the sample selected by the corresponding PrDFE circuit to represent the resolved symbol. The output circuit 1212 samples that symbol according to a selected one of at least two different clock phases. In particular, storage elements 1214a-b are coupled to receive the resolved symbol value of multiplexer 1210a at each of two differently phased versions of sampling clock clk1, where storage element 1214a is clocked in response to sampling clock signal clk1 and storage element 1214b is clocked in response to sampling clock signal clk1b. Multiplexer 1216a receives the outputs of these storage elements 1214a-b and responsive to the signal CAL, selects one of the values as the D0 value. As mentioned, this output value can then be stored in storage element 1218a or provided as some other form of output.
Storage elements 1214c-d are coupled to receive the resolved symbol from multiplexer 1210b at each of two phases of a sampling clock clk2, where storage element 1214c is clocked in response to sampling clock signal clk2 and storage element 1214d is clocked in response to sampling clock signal clk2b. Multiplexer 1216b receives the outputs of storage elements 1214c-d and selects one of these according to the signal CAL as the D1 value, which then can be stored in storage element 1218b.
Storage elements 1214e-f are coupled to receive the resolved symbol output from multiplexer 1210c at each of two phases of a corresponding sampling clock clk1b, where storage element 1214e is clocked in response to sampling clock signal clk1b and storage element 1214f is clocked in response to sampling clock signal clk1. Multiplexer 1216c receives the outputs of these storage elements and selects one of them as an output value D2, which then can be stored in storage element 1218c.
Similarly, storage elements 1214g-h are coupled to receive the resolved symbol from multiplexer 1210d at each of two phases of sampling clock clk2b, where storage element 1214g is clocked in response to sampling clock signal clk2b and storage element 1214h is clocked in response to sampling clock signal clk2. Multiplexer 1216d receives the outputs of storage elements 1214g-h and selects one of these according to the signal CAL for output as the D3 value, which then can be stored in storage element 1218d.
As mentioned, the CAL signal 1244 is set by calibration circuitry.
The QDR PrDFE receiver operates within the following timing constraints:
Feedback path: tsel<1 UI;
Feedforward path: tck-Q+tmux+tsu<4 UI; and
An embodiment of the calibration circuitry for the quad rate PrDFE receiver 1200 is shown in
Fast data rate or slow process corner: A>2 UI and A+B>1 UI, then CAL=0; (1)
Invalid case: A>2 UI and A+B<1 UI; (2)
Slow data rate or fast process corner: A<2 UI and A+B<1 UI, then CAL=1; (3)
Overlap region: A<2 UI, A+B>1 UI then CAL=0 or 1. (4)
In these equations, A=tck-Q+tmux+tsu and B=tsel.
An exemplary replica feedforward timing path is composed of storage element 1410, clocked according to clk1, whose output is input to multiplexer 1412. The output of multiplexer 1412, is input to storage element 1414, which is responsive to clock signal clk1, and to storage element 1416, which is responsive to clock signal clk1b. The output of this timing path is transmitted to pulse detector1 1419 which is configured to determine whether the timing delay of the feedforward path is greater or less than 2 UI based on the value of out1 at the clock cycle selected for this input. The output of pulse detectors 1419, out1, is then transmitted to the finite state machine 1402.
An exemplary replica cascade timing path is composed of storage element 1410, which is clocked according to signal clk1. The output of this storage element is then input to multiplexer 1412, which in turn generates the selection signal used to control multiplexer 1422. The output of multiplexer 1422 is transmitted both to storage element 1424 (clocked according to signal clk2), and to storage element 1426 (clocked according to signal clk2b), again with a dummy multiplexer 1423 present to replicate circuit loading effects. The output of this timing path is transmitted to pulse detector2 1420 which is configured to determine whether the timing delay of the cascade feedback path is greater or less than 1 UI The output of pulse detector2 1420, out2, is then transmitted to the finite state machine 1402.
The finite state machine 1402 determines the appropriate setting for the CAL signal based on the outputs from pulse detectors 1419 and 1420. The table in
The embodiments described herein provide versatile PrDFE receivers able to accommodate a range of data rate signaling paths. This capability is achieved with minimal overhead and expense. The additional circuitry does not alter the timings of the critical paths since it is on the replica data paths and not in the critical paths. In addition, the additional circuitry does not alter the clock distribution. There is a minimal increase in power consumption which is exhibited only at IC initialization (or at other calibration) and, thereafter, the calibration circuitry can be powered down. More importantly, response speeds provided by the PrDFE designs presented herein are permit usage of relatively smaller (shorter) unit intervals, i.e., can be used with relatively faster signaling rates without imposing a stability bottleneck.
Importantly, by providing for output timing adjustment, the designs presented herein permit direct cross-coupling of multiplexers in a PrDFE receiver, while using timing adjustment to avoid data uncertainty problems. The embodiments presented above address this by providing output latches (i.e., samplers or storage elements) that are differently clocked and may be alternatively selected. However, there are many other designs for varying output timing in a manner consistent with the principles presented above. For example, more than two timing choices may be offered, or timing choices may be made variable based on variable delays, time borrowing, or other techniques. In addition, there may be many other ways of performing the calibration referenced above, or otherwise estimating or predicting the operation of any given design relative to desired signaling rates. Various alternative designs based on the principles expressed above will no doubt be apparent to those having skill in the art.
The foregoing description, for purposes of explanation, has been described with reference to specific embodiments. However, the illustrative teachings above are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated.
The various embodiments described herein can be used in various communication systems using time-dispersive channels or signaling paths causing ISI interference, such as, without limitation, memory devices, memory systems, optical communication devices, telecommunication devices (e.g., modems), and the like. As mentioned, in one contemplated implementation, these embodiments may be used in a memory system having a memory controller and one or more DRAM memories, each of these configured if desired as discrete integrated circuits.
An output of a process for designing an integrated circuit, or a portion of an integrated circuit, having one or more circuits described herein may be a computer readable medium, such as without limitation, a magnetic tape, optical disk, magnetic disk, or the like. The computer readable medium may be encoded with data structures or other information describing circuitry that may be physically instantiated as an integrated circuit or a portion of an integrated circuit. Although various formats may be used for such encoding, these data structures are commonly written in Caltech Intermediate Format (CIF), Calma GDS II Stream Format (GDSII), or Electronic Design Interchange Format (EDIF). Those skilled in the art of integrated circuit design can develop such data structures from schematic diagrams of the type detailed above and the corresponding descriptions and encode the data structures on computer readable medium. Those skilled in the art of integrated circuit fabrication can use such encoded data to fabricate integrated circuits comprising one or more of the circuits described herein.
Claims
1. A multi-phase partial response receiver, comprising:
- at least two partial response circuits, each partial response circuit to sample an input signal according to a respective phase of the input signal and to generate a value based on a previously resolved symbol from the input signal; and
- an output circuit to receive the values from the at least two partial response circuits, to select a timing from at least two timing choices, and to output each value according to the selected timing.
2. The multi-phase partial response receiver of claim 1, wherein the output circuit further comprises for each phase of the multi-phase partial response receiver a selection circuit and at least two associated storage elements, each selection circuit to output each value according to a selected one of the at least two timing choices.
3. The multi-phase partial response receiver of claim 2, wherein:
- each storage element is to delay the value for an associated phase of the multi-phase partial response receiver, each storage element delaying the value for the associated phase of the multi-phase partial response receiver according to a respective one of at least two clock signals; and
- the selection circuit for the associated phase of the multi-phase partial response receiver further comprises a multiplexer to select between outputs of the associated storage element according to a calibration signal.
4. The multi-phase partial response receiver of claim 2, wherein each storage element is a flip flop.
5. The multi-phase partial response receiver of claim 1, wherein:
- each partial response circuit includes a multiplexer that selects one of plural conditional samples; and
- an output of each multiplexer drives conditional sample selection by the multiplexer of one of the other partial response circuits, a path between the output of the multiplexer which it drives not have any intervening storage element.
6. The multi-phase partial response receiver of claim 5, wherein the output circuit further comprises for each phase of the multi-phase partial response receiver a selection circuit and at least two associated storage elements, and wherein each timing choice is represented by a clock signal, each selection circuit to sample an output of a respective one of the multiplexers according to a selected one of the at least two clock signals, the storage elements for each selection circuit each receiving a respective one of the at least two clock signals and sampling the output of the respective multiplexer according to the respective one of the at least two clock signals, each selection circuit selecting between the outputs of the associated storage elements.
7. The multi-phase partial response receiver of claim 6, wherein each selection circuit selects between the outputs of the associated storage elements in dependence upon a calibration signal.
8. The multi-phase partial response receiver of claim 1, further comprising a calibration circuit to control the output circuit to select the one of the at least two timing choices, the calibration circuit having:
- a replica circuit to replicate a timing path in the multi-phase partial response receiver; and
- control circuitry to test the replica circuit and to responsively control the output circuit to select the one of the at least two timing choices.
9. The multi-phase partial response receiver of claim 8, wherein the control circuitry further comprises:
- a signal generator to generate a test signal for the replica timing circuit; and
- a signal detector to detect outputs of the replica timing circuit based on the test signal.
10. The multi-phase partial response receiver of claim 1, embodied as a double data rate partial response receiver.
11. The multi-phase partial response receiver of claim 1, embodied as a quad data rate partial response receiver.
12. The multi-phase partial response receiver of claim 1, embodied in a memory device.
13. The multi-phase partial response receiver of claim 1, embodied in a memory controller.
14. The multi-phase partial response receiver of claim 1, embodied in an integrated circuit.
15. The multi-phase partial response receiver of claim 1, wherein the at least two timing choices are sampling clocks, the sampling clocks having a common frequency and a phase difference that is a multiple of ninety degrees.
16. The multi-phase partial response receiver of claim 1, further comprising a calibration circuit to control the output circuit to select the one of the at least two timing choices, wherein the calibration circuitry performs calibration in dependence upon each power-on of the multi-phase partial response receiver.
17. The multi-phase partial response receiver of claim 1, further comprising a calibration circuit to control the output circuit to select the one of the at least two timing choices, wherein the calibration circuitry performs calibration on a one-time basis.
18. The multi-phase partial response receiver of claim 1, further comprising a calibration circuit to control the output circuit to select the one of the at least two timing choices, wherein the calibration circuitry performs calibration on a dynamic basis during normal data operations of the multi-phase partial response receiver.
19. The multi-phase partial response receiver of claim 18, wherein the calibration circuit includes a test path that is independent from the at least two partial response circuits, the test path for use during normal data operations of the multi-phase partial response receiver.
20. A method employed in a digital receiver that receives an input signal, comprising:
- using at least two partial response circuits to sample respective data phases of the input signal, each partial response circuit generating plural conditional samples and outputting a value selected from the plural conditional samples dependent on a previously resolved datum from another one the at least two partial response circuits; and
- outputting each symbol according to a selected one of at least two timing choices.
21. The method of claim 20, wherein:
- the method further comprises calibrating a first data path of the signaling system; and
- outputting includes sampling each value based on the calibrating of the first data path.
22. The method of claim 20, wherein:
- the method further comprises calibrating a second data path in the signaling system; and
- outputting includes sampling each value based on the calibrating of the second data path.
23. The method of claim 20, wherein each partial response circuit includes a multiplexer to select between the respective plural conditional samples, the method further comprising:
- driving selection by each one of the multiplexers using the output from the multiplexer for another one the at least two partial response circuits without any intervening storage element.
24. The method of claim 23, wherein the digital receiver further includes a test circuit that is independent from the at least two partial response circuits, the method further comprising periodically performing dynamic selection of the one of the at least two timing choices according to an output of the test circuit, the test circuit being used in parallel with normal data operations of digital receiver.
25. The method of claim 24, further comprising using a sampling clock for each timing choice.
26. A system, comprising:
- a plurality of communications channels, each communication channel conveying a sequence of data; and
- a receive device having a plurality of receivers, one receiver for each communications channel, each receiver including at least two partial response circuits, each partial response circuit to sample an input signal according to a respective phase of the input signal and to generate a value based on at least one previously resolved datum from the input signal, and an output circuit to receive the values from the at least two partial response circuits, and to output each value according to a selected one of at least two clock signals.
27. The system of claim 26, further comprising a transmit device coupled to the plurality of communications channels, the transmit device transmitting the sequences.
28. The system of claim 27, wherein the transmit device includes one of a memory controller and a memory device and the receive device includes the other of the memory controller and the memory device.
29. The system of claim 28, wherein the memory device is a dynamic random access memory.
30. The system of claim 27, wherein the plurality of receivers are coupled to at least one calibration circuit, the at least one calibration circuit controlling for each receiver the selection of selection between the at least two clocks.
31. A partial response receiver, comprising:
- a first sampler to sample an input signal according to a first clock phase of a sampling clock to generate a first sample;
- a second sampler to sample the input signal according to the first clock phase to generate a second sample;
- a select circuit that selects either the first or second sample to be output as a value according to a previously generated datum; and
- an output circuit that receives the value output from the select circuit, that samples the value according to each of the first and second clock phase to generate respective sampled values, and that chooses one of the one of the sampled values to output.
32. The partial response receiver of claim 31, further comprising:
- a calibration circuit to detect timings on at least one path and to direct the output circuit to choose the one of the sampled values based on the timings.
33. A multi-phase partial response receiver, comprising:
- means for sampling an input signal according to a respective phase of the input signal and for generating a value for each phase in dependence upon a previously resolved datum; and
- means for selecting at least one of two clock signals and for sampling each value according to a selected one of the at least two clock signals.
Type: Application
Filed: Mar 25, 2011
Publication Date: Oct 13, 2011
Patent Grant number: 8942319
Applicant: RAMBUS Inc. (Sunnyvale, CA)
Inventors: Chintan S. Thakkar (Berkeley, CA), Kun-Yung Chang (Los Altos, CA), Ting Wu (Tianjin)
Application Number: 13/072,642