Multiplication Patents (Class 708/620)
  • Publication number: 20110225220
    Abstract: A Montgomery multiplication device calculates a Montgomery product of an operand X and an operand Y with respect to a modulus M and includes a plurality of processing elements. In a first clock cycle, two intermediate partial sums are created by obtaining an input of length w?1 from a preceding processing element as w?1 least significant bits. The most significant bit is configured as either zero or one. Then, two partial sums are calculated using a word of the operand Y, a word of the modulus M, a bit of the operand X, and the two intermediate partial sums. In a second clock cycle, a selection bit is obtained and one of the two partial sums is selected based on the value of the selection bit. Then, the selected partial sum is used for calculation of a word of the Montgomery product.
    Type: Application
    Filed: March 1, 2010
    Publication date: September 15, 2011
    Inventors: Miaoqing Huang, Krzysztof Gaj
  • Patent number: 8015231
    Abstract: A data processing apparatus and method includes multiplier logic operable to multiply the first and second n-bit significands to produce a pair of 2n-bit vectors. Half adder logic is arranged to produce a plurality of carry and sum bits representing a corresponding plurality of most significant bits of the pair of 2n-bit vectors. The first adder logic then performs a first sum operation with a first rounded result and a second adder logic performs a second sum operation with a second rounded result. The required n-bit result is then derived from either the first rounded result or the second rounded result. The data processing apparatus takes advantage of a property of the half adder form to enable a rounding increment value to be injected prior to performance of the first and second sum operations without requiring full adders to be used to inject the rounding increment value.
    Type: Grant
    Filed: November 30, 2004
    Date of Patent: September 6, 2011
    Assignee: ARM Limited
    Inventors: David Raymond Lutz, Christopher Neal Hinds
  • Patent number: 8001360
    Abstract: A system and software for improving the performance of processors by incorporating an execution unit operable to decode and execute single instructions specifying a data selection operand and a first and a second register providing a plurality of data elements, the data selection operand comprising a plurality of fields each selecting one of the plurality of data elements, the execution unit operable to provide the data element selected by each field of the data selection operand to a predetermined position in a catenated result.
    Type: Grant
    Filed: January 16, 2004
    Date of Patent: August 16, 2011
    Assignee: Microunity Systems Engineering, Inc.
    Inventors: Craig Hansen, John Moussouris
  • Patent number: 7987344
    Abstract: A programmable processor and method for improving the performance of processors by incorporating an execution unit configurable to execute a plurality of instruction streams from the plurality of threads, wherein each instruction stream includes a group instruction that operates on a plurality of data elements in partitioned fields of at least one of the registers to produce a catenated result.
    Type: Grant
    Filed: January 16, 2004
    Date of Patent: July 26, 2011
    Assignee: Microunity Systems Engineering, Inc.
    Inventors: Craig Hansen, John Moussouris
  • Patent number: 7958180
    Abstract: A multiplier engine that reduces the size of the circuitry used to provide the multiplier engine, as well as increases the speed at which the multiplication algorithm is performed, are provided. The illustrative embodiments may comprise a M*8 multiplication engine having one or more 4:2 compressors that comprise only two full adders, as opposed to the three full adders in the known 5:2 compressor based architecture. The 4:2 compressors are able to achieve the same operation as the known 5:2 compressor based architecture by virtue of using the unused bits in a least significant portion of the partial product inputs to store the negate bit values. Moreover, a negate bit value that is not fused with the partial product inputs may be input to the 4:2 compressors for a bit 0 position.
    Type: Grant
    Filed: July 5, 2007
    Date of Patent: June 7, 2011
    Assignee: International Business Machines Corporation
    Inventors: Douglas H. Bradley, Owen Chiang, Sherman M. Dance
  • Patent number: 7945061
    Abstract: A real-time implementation of a subspace tracker is disclosed. Efficient architecture addresses the unique computational elements of the Fast Approximate Subspace Tracking (FAST) algorithm. Each of these computational elements can scale with the rank and size of the subspace. One embodiment of architecture described is implemented in digital hardware that performs variable rank subspace tracking using the FAST algorithm. In particular, the FAST algorithm is effectively implemented by a few processing elements, coupled with an efficient Singular Vector Decomposition (SVD), and the realization/availability of high density programmable logic devices. The architecture enables the ability to track the possibly changing dimension of the signal subspace.
    Type: Grant
    Filed: June 7, 2006
    Date of Patent: May 17, 2011
    Assignee: BAE Systems Information and Electronic Systems Integration Inc.
    Inventors: John M. Smith, Michael J. Kotrlik, Edward C. Real
  • Publication number: 20110103578
    Abstract: Systems and methods efficiently process digests, hashes or other results by performing multiplicative functions in parallel with each other. In various embodiments, successive processing stages are provided, with each stage performing parallel multiplicative functions and also combining input terms to reduce the total number of terms that remain to be processed. By progressively combining the active terms into a smaller number of terms for subsequent processing, the time needed to process a result can be significantly reduced.
    Type: Application
    Filed: October 30, 2009
    Publication date: May 5, 2011
    Applicant: GENERAL DYNAMICS C4 SYSTEMS, INC.
    Inventors: Gerardo ORLANDO, David KING, Mark KRUMPOCH
  • Patent number: 7930336
    Abstract: A plurality of specialized processing blocks in a programmable logic device, including multipliers and circuitry for adding results of those multipliers, can be configured as a larger multiplier by adding to the specialized processing blocks selectable circuitry for shifting multiplier results before adding. In one embodiment, this allows all but the final addition to take place in specialized processing blocks, with the final addition occurring in programmable logic. In another embodiment, additional compression and adding circuitry allows even the final addition to occur in the specialized processing blocks.
    Type: Grant
    Filed: December 5, 2006
    Date of Patent: April 19, 2011
    Assignee: Altera Corporation
    Inventors: Martin Langhammer, Kumara Tharmalingam
  • Patent number: 7916860
    Abstract: A scalar multiplication apparatus may include at least two encryptors, each adapted to receive an input point and a changed secret key to generate an encrypted output point; a first logic circuit adapted to receive the encrypted output points to perform a first logic operation; a second logic circuit adapted to receive a first logic operation result and a secret key to perform a second logic operation and generate the changed secret key; and a random number generator adapted to generate random number data. A scalar multiplication method may include receiving an input point and a changed secret key, generating first and second encrypted output points from the input point and changed secret key, performing a first logic operation on the first and second encrypted output points, and performing a second logic operation on a first logic operation result and a secret key and generating the changed secret key.
    Type: Grant
    Filed: March 17, 2006
    Date of Patent: March 29, 2011
    Assignee: Samsung Electronics Co. Ltd.
    Inventor: Ihor Vasyltsov
  • Patent number: 7912891
    Abstract: Provided are a high speed and low power fixed-point multiplier and method thereof. The multiplier includes: a partial product calculation unit for dividing input data into a plurality of bit groups, each bit group having a predetermined number of bits, generating partial products by independently multiplying a fixed coefficient for each bit group, and summing partial products included in a corresponding bit group, to thereby generate a summed partial products; and an adding unit for adding the summed partial products of each bit group generated from the partial product calculation unit.
    Type: Grant
    Filed: December 8, 2006
    Date of Patent: March 22, 2011
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Sang-In Cho, Cheol-Ho Shin, Kyu-Min Kang, Sung-Woo Choi, Sang-Sung Choi, Jin-Gyun Chung, Yong-Eun Kim
  • Publication number: 20110060780
    Abstract: Quantum annealing may include applying and gradually removing disorder terms to qubits of a quantum processor, for example superconducting flux qubits of a superconducting quantum processor. A problem Hamiltonian may be established by applying control signals to the qubits, an evolution Hamiltonian established by applying disorder terms, and annealing by gradually removing the disorder terms. Change in persistent current in the qubits may be compensated. Multipliers may mediate coupling between various qubits and a global signal line, for example by applying respective scaling factors. Two global signal lines may be arranged in an interdigitated pattern to couple to respective qubits of a communicatively coupled pair of qubits. Pairs of qubits may be communicatively isolated and used to measure a response of one another to defined signals.
    Type: Application
    Filed: May 19, 2009
    Publication date: March 10, 2011
    Applicant: D-Wave Systems Inc.
    Inventors: Andrew J. Berkley, Richard G. Harris, Mohammad Amin
  • Publication number: 20110060781
    Abstract: Systems and methods for performing multiplication of fixed-point fractional values with the same throughput as addition and subtraction operations, and without loss of accuracy in the result. In one embodiment, a method includes reading data from a pair of source registers that contains multiple single-width multiplicand values. Each multiplicand value in one of the source registers is paired with a corresponding multiplicand value in the other source register. For each pair of multiplicands, a double-width product is generated, then a single-width portion of the product is selected and stored in a target register. The selection of the single-width portion is performed by shifting the double-width products in funnel shifters. The immediate shifting of the double-width products to select the single-width portions allows the operation to achieve the same throughput as addition and subtraction operations.
    Type: Application
    Filed: September 8, 2009
    Publication date: March 10, 2011
    Inventor: Shigeaki Iwasa
  • Publication number: 20100325188
    Abstract: A processor including instruction support for implementing large-operand multiplication may issue, for execution, programmer-selectable instructions from a defined instruction set architecture (ISA). The processor may include an instruction execution unit comprising a hardware multiplier datapath circuit, where the hardware multiplier datapath circuit is configured to multiply operands having a maximum number of bits M.
    Type: Application
    Filed: June 19, 2009
    Publication date: December 23, 2010
    Inventors: Christopher H. Olson, Jeffrey S. Brooks, Robert T. Golla, Paul J. Jordan
  • Publication number: 20100318592
    Abstract: The technology is a division circuit with decreased circuit area. An embodiment includes an integrated circuit implementing multiplicative division of a dividend input and a divisor input. The integrated circuit includes a lookup table circuit and multiplier circuits. The lookup table circuit providing an approximation of a reciprocal of a divisor input. The multiplier circuits receive the approximation and refine a quotient output of the dividend input and a divisor input. At least one of the multiplier circuits is a squaring circuit implementing multiplication with a reduced number of intermediate partial products. The reduced number of intermediate partial products prevent the squaring circuit from multiplication of any two unequal numbers and limiting the squaring circuit to multiplication of a same number by the same number.
    Type: Application
    Filed: June 22, 2009
    Publication date: December 16, 2010
    Applicant: Synopsys, Inc.
    Inventors: Kyung-Nam Han, Alexandre Tenca, David Tran, Rick Kelly
  • Publication number: 20100306298
    Abstract: A device to perform DFT calculations, for example in a GNSS receiver, including two banks of multipliers by constant integer value, the values representing real and imaginary part of twiddle factors in the DFT. A control unit selectively routes the data through the appropriate multipliers to obtain the desired DFT terms. Unused multipliers are tied to constant input values, in order to minimize dynamic power.
    Type: Application
    Filed: May 15, 2008
    Publication date: December 2, 2010
    Applicant: Qualcomm Incorporated
    Inventors: Andrea Cenciotti, Nestor Lucas Barriola, Philip John Young
  • Publication number: 20100306292
    Abstract: A processor may have at least one multiplier unit which can be controlled to operate in a signed, an unsigned, or a mixed sign mode; a multiplier unit mode decoder coupled with the multiplier unit which receives location information of a first and second operands, wherein the multiplier mode decoder controls the multiplier unit when in the mixed sign mode depending on the location information to operate in a signed mode, an unsigned mode, or a combined signed/unsigned mode.
    Type: Application
    Filed: May 7, 2010
    Publication date: December 2, 2010
    Inventors: Michael I. Catherwood, Settu Duraisamy
  • Patent number: 7844655
    Abstract: A computer is connected to a memory. The computer to execute an encryption program in the memory. The encryption program including a multiplication portion to perform multiplication of input operands. The multiplication portion includes graph based functions to generate coefficients representing products returned from the multiplication portion to generate encryption keys. Other embodiments are described and claimed.
    Type: Grant
    Filed: June 28, 2006
    Date of Patent: November 30, 2010
    Assignee: Intel Corporation
    Inventors: Michael E. Kounavis, Arun Raghunath
  • Patent number: 7822199
    Abstract: A method and device for performing a cryptographic operation by a device controlled by a security application executed outside thereof in which a cryptographic value (y) is produced a calculation comprising at least one multiplication between first and second factors containing a security key (s) associated with the device and a challenge number (c) provided by the security application. The first multiplication factor comprises a determined number of bits (L) in a binary representation and the second factor is constrained in such a way that it comprises, in a binary representation, several bits at 1 with a sequence of at least L?1 bits at 0 between each pair of consecutive bits to 1 while the multiplication is carried out by assembling the binary versions of the first factor shifted according to positions of the bits at 1 of the second factor, respectively.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: October 26, 2010
    Assignee: France Telecom
    Inventors: Marc Girault, David Lefranc
  • Patent number: 7797365
    Abstract: A design structure for a Booth decoder is provided. The Booth decoder may comprise three circuits that run in parallel. A first circuit is used to generate a shift control signal output. A second circuit is used to generate a zero control signal output. A third circuit is used to generate an invert control signal output. The first and second circuits receive the three-bit block as an input and generate their respective outputs based on the setting of each of the bits. The third circuit receives only the most significant bit of the three-bit block as its input and generates an invert signal output based on the setting of the most significant bit. In each of these circuits, the number of complex gates and transistors is minimized thereby reducing gate delay and power consumption in generating the control signals for performing a Booth multiplication operation.
    Type: Grant
    Filed: May 27, 2008
    Date of Patent: September 14, 2010
    Assignee: International Business Machines Corporation
    Inventor: Owen Chiang
  • Patent number: 7797364
    Abstract: A Booth decoder may comprise three circuits that run in parallel. A first circuit is used to generate a shift control signal output. A second circuit is used to generate a zero control signal output. A third circuit is used to generate an invert control signal output. The first and second circuits receive the three-bit block as an input and generate their respective outputs based on the setting of each of the bits. The third circuit receives only the most significant bit of the three-bit block as its input and generates an invert signal output based on the setting of the most significant bit. In each of these circuits, the number of complex gates and transistors is minimized thereby reducing gate delay and power consumption in generating the control signals for performing a Booth multiplication operation.
    Type: Grant
    Filed: June 27, 2006
    Date of Patent: September 14, 2010
    Assignee: International Business Machines Corporation
    Inventor: Owen Chiang
  • Publication number: 20100228806
    Abstract: Digital signal processing (“DSP”) circuit blocks are provided that can more easily work together to perform larger (e.g., more complex and/or more arithmetically precise) DSP operations if desired. These DSP blocks may also include redundancy circuitry that facilitates stitching together multiple such blocks despite an inability to use some block (e.g., because of a circuit defect). Systolic registers may be included at various points in the DSP blocks to facilitate use of the blocks to implement systolic form, finite-impulse-response (“FIR”), digital filters.
    Type: Application
    Filed: March 3, 2010
    Publication date: September 9, 2010
    Inventors: Keone Streicher, Martin Langhammer, Yi-Wen Lin, Wai-Bor Leung, David Lewis, Volker Mauer, Henry Y. Lul, Suleyman Sirri Demirsoy, Hyun Yi
  • Publication number: 20100202605
    Abstract: A method of simplifying a combinational circuit establishes an initial combinational circuit operable to calculate a set of target signals. A quantity of multiplication operations performed in a first portion of the initial combinational circuit is reduced to create a first, simplified combinational circuit. The first portion includes only multiplication operations and addition operations. A quantity of addition operations performed in a second portion of the first, simplified combinational circuit is reduced to create a second, simplified combinational circuit. The second portion includes only addition operations. Also, the second, simplified combinational circuit is operable to calculate the target signals using fewer operations than the initial combinational circuit.
    Type: Application
    Filed: February 9, 2009
    Publication date: August 12, 2010
    Inventors: Rene Caupolican Peralta, Joan Boyar
  • Publication number: 20100205234
    Abstract: A method and apparatus for detecting a signal using a cyclo-stationary characteristic value is provided. A method of detecting a signal using a cyclo-stationary characteristic value includes: calculating cyclo-stationary characteristic values with respect to a cyclic frequency domain of an input signal; multiplying the calculated cyclo-stationary characteristic values with each other; and detecting the signal from the input signal based on the result of the multiplication.
    Type: Application
    Filed: August 21, 2008
    Publication date: August 12, 2010
    Inventors: Sunmin Lim, Sang-Won Kim, Changhyun Park, Myung Sun Song, Gwangzeen Ko, Chang-Joo Kim
  • Publication number: 20100198894
    Abstract: A digital signal processor is provided having an instruction set with an exponential function that uses a reduced look-up table. The disclosed digital signal processor evaluates an exponential function for an input value, x, by decomposing the input value, x, to an integer part, N, a first fractional part, q1, larger than a specified value, x0, and a second fractional part, q2, smaller than the specified value, x0; computing 2q2 using a polynomial approximation, such as a cubic approximation; obtaining 2q1 from a look-up table; and evaluating the exponential function for the input value, x, by multiplying 2q2, 2q1 and 2N together. Look-up table entries have a fewer number of bits than a number of bits in the input value, x.
    Type: Application
    Filed: January 30, 2009
    Publication date: August 5, 2010
    Inventors: Kameran Azadet, Jian-Guo Chen, Samer Hijazi, Joseph Williams
  • Publication number: 20100198893
    Abstract: A digital signal processor is provided having an instruction set with an xK function that uses a reduced look-up table. The disclosed digital signal processor evaluates an xK function for an input value, x, by computing Log(x) in hardware; multiplying the Log(x) value by K; and determining the xK function by applying an exponential function in hardware to a result of the multiplying step. One or more of the computation of Log(x) and the exponential function employ at least one look-up table having entries with a fewer number of bits than a number of bits in the input value, x.
    Type: Application
    Filed: January 30, 2009
    Publication date: August 5, 2010
    Inventors: Kameran Azadet, Jian-Guo Chen, Samer Hijazi, Joseph Williams
  • Patent number: 7769781
    Abstract: A method for calculating numerical values in a manner which can be interpreted as encoding places in a hierarchy, and are in a format convenient for storage and retrieval on computer systems. The numerical values are calculated by associating paths in a hierarchy with sub-sequences of terms of a mathematical series where an ordering of the sub-sequences according to the occurrence of the first terms of the sub-sequences in the mathematical series is the same as an ordering of the magnitude of the sums of the terms of the sub-sequences. Said numerical values can be conveniently stored as integer or floating-point data types commonly used in computer systems and as such assigned to appropriate data elements in a data structure which defines serial relationships between the items it stores. Thus this invention enables sequential data structures such as arrays, linked lists and databases to store and retrieve tree structure data efficiently.
    Type: Grant
    Filed: May 23, 2007
    Date of Patent: August 3, 2010
    Inventor: Stephen Edward Huntley
  • Publication number: 20100191788
    Abstract: A digital system has a memory configured to hold operands and a multiply-shift unit coupled to the memory and configured to receive a first operand and a second operand from the memory in parallel, wherein the first operand includes a concatenated encoded shift amount. The multiply-shift unit includes a multiplier configured to receive the first operand after being separated from the concatenated encoded shift amount and to form a quotient from the two operands. A shifter is coupled to receive the quotient and to shift the quotient by an amount indicated by the encoded shift amount and to thereby form a shifted quotient on an output of the multiply-shift unit.
    Type: Application
    Filed: June 28, 2009
    Publication date: July 29, 2010
    Inventor: Laurent Le-Faucheur
  • Patent number: 7765252
    Abstract: A technology generally related to large-scale computations employed in the fields of cryptography and data security system employing a new and improved variant of the Karatsuba multiplication approach. The variant of the Karatsuba multiplication approach being utilized to minimize the number of coefficient multiplications needed to multiple two polynomials of degree four.
    Type: Grant
    Filed: March 19, 2004
    Date of Patent: July 27, 2010
    Assignee: Microsoft Corporation
    Inventor: Peter L. Montgomery
  • Publication number: 20100125620
    Abstract: A device and methods are disclosed for communicating an unrounded result from one arithmetic calculation for use in a second, subsequent calculation. For example, an unrounded result of a first calculation can be forwarded to provide a multiplier, a multiplicand or an addend operand for the subsequent operation. The operand can be forwarded to the input of the same fused multiply addition module (FMAM) that supplied the result, or to another FMAM, and do so without regard to the precision of the forwarded operand, the precision of the subsequent operation, or the native precision of the FMAM.
    Type: Application
    Filed: November 20, 2008
    Publication date: May 20, 2010
    Applicant: ADVANCED MICRO DEVICES, INC.
    Inventors: David S. Oliver, Debjit Das Sarma, Scott Hilker
  • Publication number: 20100125621
    Abstract: An arithmetic processing unit is disclosed that can perform multiply operations, addition operations, or a combination thereof. The arithmetic processing unit can operate in two modes. The first mode supports one single, double, or extended-precision computation, and the second mode supports two simultaneous single-precision computations using the same exponent and mantissa datapaths.
    Type: Application
    Filed: November 20, 2008
    Publication date: May 20, 2010
    Applicant: Advanced Micro Devices, Inc.
    Inventors: David S. Oliver, Debjit Das Sarma, Scott Hilker
  • Patent number: 7720901
    Abstract: Methods and apparatus are provided for implementing circuitry operable to perform barrel shifting, multiplication, and rotation operations in hard coded logic on a programmable chip. A hard coded multiplier is augmented using multiplexer circuitry, a logical operation, and a bypassable 2^N functional block. Based on control signals, the multiplexer circuitry can be used to select a rotation, multiplication, or barrel shifted output. Multiplexer control signals also provide sign information associated with operands passed to the multiplier. A single augmented multiplier can perform barrel shifting, rotation, or multiplication operations. Inputs of a multiplier can also be selectively grounded to allow the multiplier to perform logic operations.
    Type: Grant
    Filed: May 26, 2006
    Date of Patent: May 18, 2010
    Assignee: Altera Corporation
    Inventors: James L. Ball, James R. Lawson
  • Patent number: 7716266
    Abstract: A method and system for performing a binary mode and hexadecimal mode Multiply-Add floating point operation in a floating point arithmetic unit according to a formula A*C+B, wherein A, B and C operands each have a fraction and an exponent part expA, expB and expC and the exponent of the product A*C is calculated and compared to the exponent of the addend under inclusion of an exponent bias value dedicated to use unsigned biased exponents, wherein the comparison yields a shift amount used for aligning the addend with the product operand, wherein a shift amount calculation provides a common value CV for both binary and hexadecimal according to the formula (expA+expC?expB+CV).
    Type: Grant
    Filed: January 26, 2006
    Date of Patent: May 11, 2010
    Assignee: International Business Machines Corporation
    Inventors: Son Dao Trong, Juergen Haess, Klaus Michael Kroener, Eric M. Schwarz
  • Publication number: 20100088357
    Abstract: Various embodiments of the present invention provide systems and methods for estimating signal and noise powers in a received signal set. For example, one embodiment of the present invention provides a method for determining signal power and noise power. The method uses a storage medium that includes a Na×Nw data pattern. The Na×Nw data pattern includes Na bits repeated Nw times. Both Na and Nw are each greater than one. The methods further include performing an initial read of the Na×Nw data pattern, which is stored to a first register.
    Type: Application
    Filed: October 8, 2008
    Publication date: April 8, 2010
    Inventors: George Mathew, Yuan Xing Lee, Hongwei Song, David L. Parker, Scott M. Dziak
  • Patent number: 7689641
    Abstract: Method, apparatus, and program means for performing a packed multiply high with round and shift operation. The method of one embodiment comprises receiving a first operand having a first set of L data elements. A second operand having a second set of L data elements is received. L pairs of data elements are multiplied together to generate a set of L products. Each of the L pairs includes a first data element from the first set of L data element and a second data element from a corresponding data element position of the second set of L data elements. Each of the L products are rounded to generate L rounded values. Each of said L rounded values are scaled to generate L scaled values. Each of the L scaled values are truncated for storage at a destination. Each truncated value is to be stored at a data element position corresponding to its pair of data elements.
    Type: Grant
    Filed: June 30, 2003
    Date of Patent: March 30, 2010
    Assignee: Intel Corporation
    Inventors: James C. Abel, Derin C. Walters, Jonathan J. Tyler
  • Patent number: 7680872
    Abstract: An apparatus comprising an address generation circuit, a lookup table, a multiplexer and an output circuit. The address generation circuit may be configured to generate a series of addresses. The lookup table may be configured to generate one or more coefficients in response to the addresses. The multiplexer circuit may be configured to generate one or more shifted values in response to (i) the coefficients and (ii) the one or more operands. The output circuit may be configured to generate an output signal by combining one or more component values in response to said shifted values. The coefficients are grouped as one over power of 2 components into mutually exclusive groups.
    Type: Grant
    Filed: January 11, 2005
    Date of Patent: March 16, 2010
    Assignee: VIA Telecom Co., Ltd.
    Inventor: Alon Saado
  • Publication number: 20100063986
    Abstract: In a computing device that calculates a square of an element in a finite field, a vector representation of the element in the finite field is accepted. The vector representation includes a plurality of elements. The computing device performs a multiplication operation on a base field using the accepted elements, and obtains a multiplication value. The multiplication operation is determined by a condition under which the element in the finite field is placed in an algebraic torus. The computing device performs an addition and subtraction operation using the obtained multiplication value and the accepted elements, and obtains a calculation result of the square of the element. The addition and subtraction operation is determined by the condition. The computing device then outputs the calculation result.
    Type: Application
    Filed: February 26, 2009
    Publication date: March 11, 2010
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Tomoko YONEMURA, Hirofumi MURATANI, Atsushi SHIMBO, Kenji OHKUMA, Taichi ISOGAI, Yuichi KOMANO, Kenichiro FURUTA, Yoshikazu HANATANI
  • Patent number: 7668896
    Abstract: The first and second n-bit significands are multiplied producing a pair of 2n-bit vectors, and half adder logic produces a corresponding plurality of carry and sum bits. A product exponent is checked for correspondence with a predetermined exponent value. A sum operation generates a first result equivalent to the addition of the pair of 2n-bit vectors. First adder logic uses corresponding m carry and sum bits, the least significant of them carry bits being replaced with the increment value prior to the first adder logic performing the first sum operation. Second adder logic performs a second sum operation and uses the corresponding m?1 carry and sum bits replacing the least significant m?1 carry bits with the rounding increment value prior to the second adder logic second sum operation. The n-bit result is derived from either the first rounded result, the second rounded result or a predetermined result value.
    Type: Grant
    Filed: March 17, 2005
    Date of Patent: February 23, 2010
    Assignee: ARM Limited
    Inventors: David Raymond Lutz, Christopher Neal Hinds
  • Publication number: 20100023569
    Abstract: A method of computing arithmetic operations more efficiently than the conventional Arithmetic Logic Unit (ALU) is disclosed. By encoding both operands from Binary Coded Decimal (BCD) codes (0000, to 1001) into decimal digits (0 to 9), inputting them in the GerTh's™ look-up tables, which are made of an array of AND gates, the invention finds the answer more efficiently. This method finds the result in fewer steps than a traditional ALU by reducing the repetitive calculation steps and logic gates required. And this new method makes the unsolvable computerized binary floating-point multiplications and divisions back to the solvable GerTh's computerized decimal digits' (0-9) elementary arithmetic operations.
    Type: Application
    Filed: July 22, 2008
    Publication date: January 28, 2010
    Applicant: DAW SHIEN SCIENTIFIC RESEARCH & DEVELOPMENT, INC.
    Inventors: James Shihfu Shiao, Albert Shihyung Shiao
  • Patent number: 7650374
    Abstract: Multiple-precision hybrid multiplication is a technique that takes advantage of row-wise multiplication and column-wise multiplication. To generate a product for multiple-precision operands, partial products of the multiple-precision operands are accumulated in accordance with a hybrid of column-wise multiplication and row-wise multiplication. The partial products accumulated are of partial rows. The partiality of the row-wise partial products is defined by a parameter.
    Type: Grant
    Filed: November 23, 2004
    Date of Patent: January 19, 2010
    Assignee: Sun Microsystems, Inc.
    Inventors: Nils Gura, Lawrence A. Spracklen
  • Publication number: 20100011047
    Abstract: A system, method, and apparatus for performing hardware-based cryptographic operations are disclosed. The apparatus can include an encryption device with a hardware accelerator having an accumulator, a multiplier circuit, an adder circuit, and a state machine. The state machine can control successive operation of the hardware accelerator to carry out a rapid, multiplier-based reduction of a large integer by a prime modulus value. Optionally, the hardware accelerator can include a programmable logic device such as a field-programmable gate array with one or more dedicated multiple-accumulate blocks.
    Type: Application
    Filed: July 7, 2009
    Publication date: January 14, 2010
    Applicant: ViaSat, Inc.
    Inventors: David Jackson, John Andolina
  • Patent number: 7640286
    Abstract: A data processing apparatus and method are provided for multiplying first and second n-bit significands of first and second floating point operands to produce an n-bit result. The data processing apparatus comprises multiplier logic for multiplying the first and second n-bit significands to produce a pair of 2n-bit vectors, and sum logic operable to perform a sum operation to add a first set of bits of each of the pair of 2n-bits vectors. Sticky determination logic is also provided for determining from a second set of bits of each of the pair of 2n-bit vectors a sticky value, and selector logic is then used to derive the n-bit result from the output of the sum logic with reference to the sticky value.
    Type: Grant
    Filed: March 11, 2005
    Date of Patent: December 29, 2009
    Assignee: ARM Limited
    Inventors: David Raymond Lutz, Christopher Neal Hinds
  • Patent number: 7620677
    Abstract: Provided are a simplified 4:2 carry save adder (CSA) cell and a 4:2 carry save adding method. The 4:2 CSA cell is formed of an odd detector and first through sixth switches through logic optimization. The odd detector generates an XOR of the first through fourth input signals, outputs the XOR as an odd signal, generates an XOR of the first and second input signals, and outputs the XOR as a first XOR signal. The first switch outputs the third input signal as a carry output signal in response to the first XOR signal. The second switch outputs the first input signal as the carry output signal in response to an inverted first XOR signal. The third switch outputs the carry input signal as a carry signal in response to the odd signal. The fourth switch outputs the fourth input signal as the carry signal in response to an inverted odd signal. The fifth switch outputs an inverted carry input signal as a sum signal in response to the odd signal.
    Type: Grant
    Filed: January 10, 2005
    Date of Patent: November 17, 2009
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Yo-han Kwon
  • Publication number: 20090254333
    Abstract: Apparatus for emulating the operation of an LFSR having at least one of one or more inputs and one or more outputs, wherein the state of the LFSR can be described by a state vector having sections describing the input or inputs, if any, of the LFSR, the contents of the LFSR and the output or outputs, if any, of the LFSR, wherein the state vector can be multiplied by a time shift matrix to time shift the state specified by the vector and wherein the apparatus comprises means for multiplying a first instance of the state vector by the matrix to produce a second instance of the state vector that is time shifted relative to the first instance and wherein one or both of the input and output sections of the state vector are dimensioned to accommodate, respectively, inputs that arrive at different times during the time shift specified by the matrix and outputs that are produced at different times during the time shift.
    Type: Application
    Filed: July 16, 2007
    Publication date: October 8, 2009
    Applicant: MSTAR SEMICONDUCTOR, INC.
    Inventor: Daniel Edward Alt
  • Publication number: 20090248769
    Abstract: A multiply and accumulate engine may implement a digital filter. In some embodiments, the number of coefficients that are stored may be equal to only half of the number of filter taps that are implemented. This may be done by doing multiplications operand by operand within two data registers in a first direction and then shifting directions so that the first operand in a first register is multiplied by the last operand in another register. In some embodiments, the multiply and accumulate engine may be implemented as a two cycle engine wherein in the first stage, multiply and accumulate operations are implemented and then stored into a register. In a second stage and a second cycle, the results stored in the register are further accumulated.
    Type: Application
    Filed: March 26, 2008
    Publication date: October 1, 2009
    Inventor: Teck-Kuen Chua
  • Patent number: 7595659
    Abstract: A logic cell array having a number of logic cells and a segmented bus system for logic cell communication, the bus system including different segment lines having shorter and longer segments for connecting two points in order to be able to minimize the number of bus elements traversed between separate communication start and end points.
    Type: Grant
    Filed: October 8, 2001
    Date of Patent: September 29, 2009
    Assignee: Pact XPP Technologies AG
    Inventors: Martin Vorbach, Frank May, Dirk Reichardt, Frank Lier, Gerd Ehlers, Armin Nückel, Volker Baumgarte, Prashant Rao, Jens Oertel
  • Publication number: 20090228538
    Abstract: Conventional multi-input multiplication and addition circuit having fixed multipliers has problems in that when the number of inputs increases, the number of partial product generator circuits would increase, and also the number of stages of the addition blocks would increase. In order to solve the above-described problems, it is constructed such that there are provided a multi-input encoder (11) which comprises a plurality of encoder parts (11a) each of which accomplishes a function corresponding to generation of a partial product in multiplication, and which also has a plurality of outputs which correspond to the multi-bit output of the respective encoder parts, and a multi-input adder circuit (12) which adds the plural outputs from the multi-input encoder (11).
    Type: Application
    Filed: October 24, 2006
    Publication date: September 10, 2009
    Inventors: Kouichi Nagano, Hiroyuki Nakahira
  • Patent number: 7587443
    Abstract: A digital signal processor architecture allows the digital signal processor to be used efficiently for multiplying words which are longer than the word length for which the architecture is primarily designed. The multiplication unit has a register file which is adapted to store data words of a first length, and a multiplier which is adapted to multiply together data words of a second length, the second length being twice the first length. In a first mode, the architecture multiplies data words of the first length, by extending them to the second length. In a second mode, the architecture multiplies data words of the second length, by retrieving each of the data words in two parts, each part being of the first length.
    Type: Grant
    Filed: March 23, 2004
    Date of Patent: September 8, 2009
    Assignee: Altera Corporation
    Inventor: Martin Langhammer
  • Patent number: 7565391
    Abstract: A multiplying system for binary digits. The digits are multiplied in a rectangular memory array, where the digits are placed along the edges, and intersections between 1's form blocks of 1's in the memory array. The blocks of 1's are evaluated based on a weighting assigned to positions within the memory, either directly, or by reducing each block to a reduced block representation. The system can be used for multiplications, partial multiplications, and divisions, as well as applications thereof.
    Type: Grant
    Filed: December 17, 2004
    Date of Patent: July 21, 2009
    Assignee: The Regents of the University of California
    Inventor: Te Chiang Hu
  • Patent number: 7564971
    Abstract: A signal processing apparatus for performing modular multiplication for use in a signal processing system includes a first logic for outputting a signed multiplicand by selectively performing a one's complementary operation on a multiplicand according to a Booth conversion result of a multiplier in modular multiplication; a second logic for outputting a modulus which is signed in the modular multiplication based on a carry input value Carry-in of a current clock, determined from a carry value cin for correction of a previous clock, and on a sign bit of the multiplicand; and a third logic for receiving the signed multiplicand and the signed modulus, and calculating a result value of the modular multiplication by iteratively performing a full addition operation on a carry value C and a sum value S of the full addition operation, found at the previous clock. The present invention provides a high-speed modular multiplication apparatus with fewer gates and reduced power consumption.
    Type: Grant
    Filed: March 12, 2004
    Date of Patent: July 21, 2009
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Kyung-Hee Lee, Bum-Jin Im, Mi-Suk Huh
  • Patent number: 7562106
    Abstract: Apparatus and method for performing multi-value arithmetic operations are disclosed. Multi-value signals can be added, subtracted and multiplied using a first truth table to generate a residue and a second truth table to generate a carry. Additionally, method and apparatus to efficiently perform the function a0b1+a1b0 on multi-value signals are disclosed. Also an efficient method of processing large binary signals is disclosed.
    Type: Grant
    Filed: December 20, 2004
    Date of Patent: July 14, 2009
    Assignee: Ternarylogic LLC
    Inventor: Peter Lablans