Round Off Or Truncation Patents (Class 708/551)
-
Patent number: 12198072Abstract: Disclosed are techniques for generating features to train a predictive model to predict a customer lifetime value or churn rate. In one embodiment, a method is disclosed comprising receiving a record that includes a plurality of fields and selecting a value associated with a selected field in the plurality of fields. The method then queries a lookup table comprising a mapping of values to aggregated statistics using the value and receives an aggregated statistic based on the querying. Next, the method generates a feature vector by annotating the record with the aggregated statistic. The method uses this feature vector as an input to a predictive model.Type: GrantFiled: December 20, 2023Date of Patent: January 14, 2025Assignee: AMPERITY, INC.Inventors: Yan Yan, Aria Haghighi, Nicholas Resnick, Andrew Lim
-
Patent number: 11698772Abstract: An instruction is executed in round-for-reround mode wherein the permissible resultant value that is closest to and no greater in magnitude than the infinitely precise result is selected. If the selected value is not exact and the units digit of the selected value is either 0 or 5, then the digit is incremented by one and the selected value is delivered. In all other cases, the selected value is delivered.Type: GrantFiled: September 21, 2020Date of Patent: July 11, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Eric Mark Schwarz, Martin Stanley Schmookler
-
Patent number: 11531727Abstract: Some embodiments provide a method for a circuit that executes a neural network including multiple nodes. The method loads a set of weight values for a node into a set of weight value buffers, a first set of bits of each input value of a set of input values for the node into a first set of input value buffers, and a second set of bits of each of the input values into a second set of input value buffers. The method computes a first dot product of the weight values and the first set of bits of each input value and a second dot product of the weight values and the second set of bits of each input value. The method shifts the second dot product by a particular number of bits and adds the first dot product with the bit-shifted second dot product to compute a dot product for the node.Type: GrantFiled: December 6, 2018Date of Patent: December 20, 2022Assignee: PERCEIVE CORPORATIONInventors: Jung Ko, Kenneth Duong, Steven L. Teig
-
Patent number: 11416736Abstract: Systems and methods are related to improving throughput of neural networks in integrated circuits by combining values in operands to increase compute density. A system includes an integrated circuit (IC) having multiplier circuitry. The IC receives a first value and a second value in a first operand. The IC performs a multiplication operation, via the multiplier circuitry, on the first operand and a second operand to produce a first multiplied product based at least in part on the first value and a second multiplied product based at least in part on the second value.Type: GrantFiled: December 27, 2017Date of Patent: August 16, 2022Assignee: Intel CorporationInventors: Kevin Nealis, Randy Huang
-
Patent number: 11182666Abstract: In one example, an integrated circuit includes a first circuit, a second circuit, a third circuit, and a fourth circuit. The first circuit is configured to receive an input value and generate a first intermediate value based on a first probability density distribution associated with the input value. The second circuit comprises a set of multiplexer circuits configured to select, from a first set of candidate values and based on the first intermediate value, a first product of the first intermediate value and a weight value. The third circuit is configured to generate a second intermediate value based on a sum of the first product and a second product received from another circuit. The fourth circuit is configured to generate an output value based on the second intermediate value and a second probability density distribution associated with the second intermediate value.Type: GrantFiled: November 7, 2017Date of Patent: November 23, 2021Assignee: Amazon Technologies, Inc.Inventors: Taylor Phebus, Asif Khan
-
Patent number: 11003446Abstract: Adder trees may be constructed for efficient packing of arithmetic operators into an integrated circuit. The operands of the trees may be truncated to pack an integer number of nodes per logic array block. As a result, arithmetic operations may pack more efficiently onto the integrated circuit while providing increased precision and performance.Type: GrantFiled: December 14, 2017Date of Patent: May 11, 2021Assignee: Intel CorporationInventors: Martin Langhammer, Gregg William Baeckler, Bogdan Pasca
-
Patent number: 10761805Abstract: The present embodiments relate to performing reduced-precision floating-point arithmetic operations using specialized processing blocks with higher-precision floating-point arithmetic circuitry. A specialized processing block may receive four floating-point numbers that represent two single-precision floating-point numbers, each separated into an LSB portion and an MSB portion, or four half-precision floating-point numbers. A first partial product generator may generate a first partial product of first and second input signals, while a second partial product generator may generate a second partial product of third and fourth input signals.Type: GrantFiled: September 26, 2018Date of Patent: September 1, 2020Assignee: Altera CorporationInventor: Martin Langhammer
-
Patent number: 9702960Abstract: A method for determining a FDOA of a pulsed waveform received by two sensors includes obtaining a respective plurality of in-phase and quadrature-phase (IQ) samples indicative of a pulse envelope of the received pulsed waveform. The method includes determining a TDOA responsive to a leading edge of a pulse of the pulsed waveform and obtaining a first cross correlation of IQ samples at a delay (dc) closest to the TDOA, and respective second and third cross correlations at least one additional delay (dc+1 and dc?1) on either side of the closest delay. The method includes refining the approximation of the TDOA according to an interpolation of amplitudes of the cross-correlation and determining a respective rate of change of cross-correlation phase (??). The method includes approximating a straight line fit to the rates of change of cross-correlation phase (d??/dt), the slope of the straight line representative of the FDOA.Type: GrantFiled: July 19, 2013Date of Patent: July 11, 2017Assignee: Raytheon CompanyInventors: John T. Broad, Lee M. Savage
-
Patent number: 9535659Abstract: Embodiments relate to a hardware circuit that is operable as a fixed point adder and a checksum adder. An aspect includes a driving of a multifunction compression tree disposed on a circuit path based on a control bit to execute one of first and second schemes of vector input addition and a driving of a multifunction adder disposed on the circuit path based on the control bit to perform the one of the first and second schemes of vector input addition.Type: GrantFiled: March 10, 2016Date of Patent: January 3, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: James R. Cuffney, John G. Rell, Jr., Eric M. Schwarz, Patrick M. West, Jr.
-
Patent number: 9436434Abstract: Embodiments relate to a hardware circuit that is operable as a fixed point adder and a checksum adder. An aspect includes a driving of a multifunction compression tree disposed on a circuit path based on a control bit to execute one of first and second schemes of vector input addition and a driving of a multifunction adder disposed on the circuit path based on the control bit to perform the one of the first and second schemes of vector input addition.Type: GrantFiled: March 14, 2014Date of Patent: September 6, 2016Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: James R. Cuffney, John G. Rell, Jr., Eric M. Schwarz, Patrick M. West, Jr.
-
Patent number: 9286267Abstract: In one embodiment, the present invention includes a method for receiving a rounding instruction and an immediate value in a processor, determining if a rounding mode override indicator of the immediate value is active, and if so executing a rounding operation on a source operand in a floating point unit of the processor responsive to the rounding instruction and according to a rounding mode set forth in the immediate operand. Other embodiments are described and claimed.Type: GrantFiled: March 11, 2013Date of Patent: March 15, 2016Assignee: Intel CorporationInventors: Ronen Zohar, Shane Story
-
Patent number: 9244653Abstract: A floating point value can represent a number or something that is not a number (NaN). A floating point value that is a NaN having data field that stores information, such as a propagation count that indicates the number of times a NaN value has been propagated through instructions. A NaN evaluation instruction can determine whether one or more operands is a NaN operand of a particular type, and if so can generate a result that is a NaN of a different type. An exception can be generated based upon the NaN of the different type being provided as a resultant.Type: GrantFiled: March 15, 2013Date of Patent: January 26, 2016Assignee: FREESCALE SEMICONDUCTOR, INC.Inventor: William C. Moyer
-
Patent number: 9223751Abstract: In one embodiment, the present invention includes a method for receiving a rounding instruction and an immediate value in a processor, determining if a rounding mode override indicator of the immediate value is active, and if so executing a rounding operation on a source operand in a floating point unit of the processor responsive to the rounding instruction and according to a rounding mode set forth in the immediate operand. Other embodiments are described and claimed.Type: GrantFiled: September 22, 2006Date of Patent: December 29, 2015Assignee: Intel CorporationInventors: Ronen Zohar, Shane Story
-
Patent number: 9213524Abstract: A floating-point value can represent a number or something that is not a number (NaN). A floating-point value that is a NaN includes a portion that stores information about the source operands of the instruction.Type: GrantFiled: September 30, 2013Date of Patent: December 15, 2015Assignee: FREESCALE SEMICONDUCTOR, INC.Inventor: William C. Moyer
-
Patent number: 9104479Abstract: Processing circuitry is provided to perform an operation FRINT for rounding a floating-point value to an integral floating-point value. Control circuitry controls the processing circuitry to perform the FRINT operation in response to an FRINT instruction. The processing circuitry includes shifting circuitry for generating a rounding value by shifting a base value, adding circuitry for adding the rounding value to the significand of the floating-point value to generate a sum value, mask generating circuitry for generating a mask for clearing fractional-valued bits of the sum value, and masking circuitry for applying the mask to the sum value to generate the integral floating-point value.Type: GrantFiled: December 7, 2011Date of Patent: August 11, 2015Assignee: ARM LimitedInventors: David Raymond Lutz, Neil Burgess, Sabrina Marie Romero
-
Patent number: 8972472Abstract: A system and method for unbiased rounding away from, or toward, zero by truncating N bits from a M bit input number to provide a M?N bit number, and adding the equivalent value of ‘½’ to the M?N bit number unless the input number is negative, or positive, respectively, and the N truncated bits represent exactly ½. The method for rounding away from zero may include outputting a (M?N) bit truncated number if the M-bit input number is negative and the sequence of N truncated bits comprises a most significant bit of 1, followed by zeros; and otherwise, computing and outputting a sum of (a) a number that has an equivalent value of one followed by (N?1) replicas of zero, the one provided by applying a logical operation on the most significant bit of the sequence of truncated bits and (b) the (M?N) bit truncated number.Type: GrantFiled: September 17, 2008Date of Patent: March 3, 2015Assignee: Densbits Technologies Ltd.Inventors: Ofir Avraham Kanter, Ilan Bar
-
Publication number: 20150039665Abstract: A processing apparatus supports a narrowing-and-rounding arithmetic operation which generates, in response to two operands each comprising at least one W-bit data element, a result value comprising at least one X-bit result data element, with each X-bit result data element representing a sum or difference of corresponding W-bit data elements of the two operands rounded to an X-bit value (W>X). The arithmetic operation is implemented using a number of N-bit additions (N<W), with carry values from a first stage of N-bit additions being added at a second stage of N-bit additions for adding a rounding value to the result of the first stage additions. This technique reduces the amount of time required for performing the narrowing-and-rounding arithmetic operation.Type: ApplicationFiled: July 31, 2013Publication date: February 5, 2015Applicant: Arm LimitedInventors: Neil BURGESS, David Raymond LUTZ
-
Patent number: 8862652Abstract: A method is provided for deriving an RTL a logic circuit performing a multiplication as the sum of addends operation with a desired rounding position. In this, an error requirement to meet for the design rounding position is derived. For each of the CCT and the VCT implementation a number columns to discard is derived and a constant to include in the sum addends. For an LMS implementation, a number of columns to discard is derived. After discarding the columns and including the constants as appropriate, an RTL representation of the sum of addends operation is derived for each of the CCT, VCT and LMS implementations and a logic circuit synthesized for each of these. The logic circuit which gives the best implementation is selected for manufacture.Type: GrantFiled: June 29, 2012Date of Patent: October 14, 2014Assignee: Imagination Technologies, LimitedInventor: Theo Alan Drane
-
Publication number: 20140181170Abstract: According to one embodiment, an arithmetic circuit includes follows. The arithmetic unit performs an arithmetic operation including addition and multiplication to generate a first value of (n+m) bits. The rounding preprocessor performs an OR operation on lower (m?k) bits of the first value to generate a second value of 1 bit. The register stores a third value of (n+k+1) bits obtained by concatenating upper (n+k) bits of the first value and the second value. The rounding postprocessor calculates a carry bit value of 1 bit from a most significant bit of the third value and lower (k+1) bits of the third value, and adds the carry bit value to upper n bits of the third value.Type: ApplicationFiled: December 24, 2013Publication date: June 26, 2014Applicant: KABUSHIKI KAISHA TOSHIBAInventor: Koichiro BAN
-
Patent number: 8639738Abstract: A low-error reduced-width multiplier is provided by the present invention. The multiplier can dynamically compensate the truncation error. The compensation value is derived by the dependencies among the multiplier partial products, and thus, can be analyzed according to the multiplication type and the multiplier input statistics.Type: GrantFiled: February 28, 2011Date of Patent: January 28, 2014Assignee: National Chiao Tung UniversityInventors: Yen-Chin Liao, Hsie-Chia Chang
-
Patent number: 8615543Abstract: Saturation and rounding capabilities are implemented in MAC blocks to provide rounded and saturated outputs of multipliers and of add-subtract-accumulate circuitrs implemented using DSP. These features support any suitable format of value representation, including the x.15 format. Circuitry within the multipliers and the add-subtract-accumulate circuits implement the rounding and saturation features of the present invention.Type: GrantFiled: June 22, 2011Date of Patent: December 24, 2013Assignee: Altera CorporationInventors: Leon Zheng, Martin Langhammer, Steven Perry, Paul Metzgen, Nitin Prasad, William Hwang
-
Patent number: 8537047Abstract: The invention relates to the digital signal requantization, at a given quantization step size, of a first word received in a first period of time and encoded in a first number of bits, into a second word, with a quantization error equal to a third number. A sequence of third words is outputted, equal to the second word, with the sequence subdivided into a first group comprising a number of third words that is equal to the third number and a second group of third words. Before outputting them, the correction means adds a least significant bit to the third words of the first group and adds or subtracts least significant bits to or from the third words of the second group, such that the sum of the least significant bits added to and subtracted from the second group is zero.Type: GrantFiled: July 26, 2010Date of Patent: September 17, 2013Assignee: ST-Ericsson SAInventor: Sébastien Cliquennois
-
Patent number: 8495124Abstract: A decimal multiplication mechanism for fixed and floating point computation in a computer having a coefficient mechanism without resulting leading zero detection (LZD) and process which assumes that the final product will be M+N digits in length and performs all calculations based on this assumption. Least significant digits that would be truncated are no longer stored, but retained as sticky information which is used to finalize the result product. Once the computation of the product is complete, a final check based on the examination of key bits observed during partial product accumulation is used to determine if the final product is truly M+N digits in length, or M+N?1 digits. If the latter is true, then corrective final product shifting is employed to obtain the proper result. This eliminates the need for dedicated leading zero detection hardware used to determine the number of significant digits in the final product.Type: GrantFiled: June 23, 2010Date of Patent: July 23, 2013Assignee: International Business Machines CorporationInventors: Steven R. Carlough, Adam B. Collura, Michael Kroener, Silvia Melitta Mueller
-
Patent number: 8443027Abstract: A method, computer-readable medium, and an apparatus for implementing a floating point weighted average function. The method includes receiving an input containing 2N input values, 2N weights, and an opcode, where N is a positive integer number and each of the input values corresponds to one of the weights. Furthermore, the method also includes using existing dot product circuit function to generate 2N addends by multiplying each of the input values with the corresponding weight. In addition, the method includes generating a sum value by adding the 2N addends, where the sum value includes an exponent value, and generating the weighted average value based on the sum value by decreasing the exponent value by N. In this fashion, the same circuit area may be used to carry out both dot product and weighted average calculations, leading to greater circuit area savings and performance advantages.Type: GrantFiled: September 26, 2007Date of Patent: May 14, 2013Assignee: International Business Machines CorporationInventors: Adam James Muff, Matthew Ray Tubbs
-
Patent number: 8407271Abstract: An apparatus and method for computing a rounded floating point number. A floating point unit (FPU) receives an instruction to round a floating point number to a nearest integral value and retrieves a binary source operand having an exponent of a fixed first number of bits and a mantissa of a fixed second number of bits. If the unbiased exponent value is greater than or equal to zero and less than the fixed second number, the FPU generates a mask having N consecutive ‘1’ bits beginning with the least significant bit and whose remaining bits have a value of ‘0’, where N is equal to the fixed second number minus the unbiased exponent value. The FPU computes a bitwise OR of the source operand with the mask, increments the result if the instruction is to round up, and computes a bitwise AND of the result with the inverse of the mask.Type: GrantFiled: August 28, 2009Date of Patent: March 26, 2013Assignee: Advanced Micro Devices, Inc.Inventors: Kevin Hurd, Daryl Lieu, Kelvin Goveas, Scott Hilker
-
Patent number: 8370226Abstract: A technique for performing a financial calculation is described. In this calculation technique, initial financial values are rounded based on a rounding criterion, and a total financial value is calculated by summing the rounded financial values. Based on the rounded financial values, associated rounding error values are computed. These rounding error values are then summed to determine a total error value. Moreover, the total error value is rounded based on the rounding criterion, and the resulting rounded total error value is used to correct a rounding error in the total financial value.Type: GrantFiled: April 19, 2010Date of Patent: February 5, 2013Assignee: Intuit Inc.Inventor: Patanjali Bhatt
-
Publication number: 20130007085Abstract: A method is provided for deriving an RTL a logic circuit performing a multiplication as the sum of addends operation with a desired rounding position. In this, an error requirement to meet for the design rounding position is derived. For each of the CCT and the VCT implementation a number columns to discard is derived and a constant to include in the sum addends. For an LMS implementation, a number of columns to discard is derived. After discarding the columns and including the constants as appropriate, an RTL representation of the sum of addends operation is derived for each of the CCT, VCT and LMS implementations and a logic circuit synthesized for each of these. The logic circuit which gives the best implementation is selected for manufacture.Type: ApplicationFiled: June 29, 2012Publication date: January 3, 2013Applicant: Imagination Technologies, Ltd.Inventor: Theo Alan Drane
-
Publication number: 20120311008Abstract: Pricing values may be automatically computed by converting a base price with a predefined price ending based on predetermined rounding rules. A base price may be adjusted employing a rounding syntax and two pricing points, one for a rounding lower limit the other for rounding upper limit. Based on a comparison of a portion of the price computed with the rounding syntax, the adjusted (or sales) price may be computed reflecting a desired pricing strategy such as a psychological pricing strategy.Type: ApplicationFiled: June 1, 2011Publication date: December 6, 2012Applicant: MICROSOFT CORPORATIONInventor: Jakob Hall
-
Patent number: 8266198Abstract: A specialized processing block for a programmable logic device includes circuitry for performing multiplications and sums thereof, as well as circuitry for rounding the result. The rounding circuitry can selectably perform round-to-nearest and round-to-nearest-even operations. In addition, the bit position at which rounding occurs is preferably selectable. The specialized processing block preferably also includes saturation circuitry to prevent overflows and underflows, and the bit position at which saturation occurs also preferably is selectable. The selectability of both the rounding and saturation positions provides control of the output data word width. The rounding and saturation circuitry may be selectably located in different positions based on timing needs. Similarly, rounding may be speeded up using a look-ahead mode in which both rounded and unrounded results are computed in parallel, with the rounding logic selecting between those results.Type: GrantFiled: June 5, 2006Date of Patent: September 11, 2012Assignee: Altera CorporationInventors: Kwan Yee Martin Lee, Martin Langhammer, Yi-Wen Lin, Triet M. Nguyen
-
Publication number: 20120215825Abstract: Techniques are disclosed that involve the multiplication of values. For instance, a plurality of partial products may be calculated from a first operand and a second operand. This calculating bypasses calculating partial products having corresponding shift values that are less than a shift threshold value. These partial products are summed to produce a summed product. In turn, the summed product is truncated into a final product having a final precision. This final precision may be a shared precision employed by multiple processing units (e.g., algorithmic units in a graphics or display processing pipeline).Type: ApplicationFiled: February 22, 2011Publication date: August 23, 2012Inventor: Abhay M. Mavalankar
-
Patent number: 8214419Abstract: Methods and apparatus are provided for implementing an efficient saturating multiplier associated with addition and subtraction logic. The result of the multiplier is saturated before accumulating. The result of the multiplier can be stored in a result register in unsaturated form. The output of the result register can then be saturated and provided to addition and subtraction logic to allow efficient implementation of a saturating multiplier.Type: GrantFiled: December 30, 2008Date of Patent: July 3, 2012Assignee: Altera CorporationInventor: Paul Metzgen
-
Patent number: 8095586Abstract: Methods and arrangements to correct for double rounding errors when rounding floating point numbers to nearest away are described. Embodiments include transformations, code, state machines or other logic to perform a floating point operation on one or more floating point numbers of precision P1 in base b, producing positive result res0 of precision greater than precision P1; rounding positive result res0 to precision P1 to the nearest away, producing positive result res1; and rounding the result res1 to precision P2 to the nearest away, where P2 is narrower than P1, producing result res2. The embodiments may also include correcting res2 for double rounding errors. The correcting may include determining that res1 is midway between two consecutive floating point numbers of precision P2, the larger being res2, determining that rounding res0 to produce res1 involved rounding up, and decrementing the significand of res2 to obtain the corrected result.Type: GrantFiled: December 31, 2007Date of Patent: January 10, 2012Assignee: Intel CorporationInventor: Marius Cornea-Hasegan
-
Patent number: 8095587Abstract: An arithmetic unit comprising: an encoding circuit arranged to receive first and second operands each having a bit length of m bits and to generate therefrom a number n of partial products of bit length of 2m bits or less; an addition circuit having 2m columns each having n inputs, wherein bits of said partial products are applied to said inputs for combining said partial products into a result leaving certain of said inputs unused; and a rounding bit generator connected to supply a rounding bit to at least one of said unused inputs in one of said in columns at a bit position to cause said result to be rounded.Type: GrantFiled: June 30, 2006Date of Patent: January 10, 2012Assignee: STMicroelectronics (Research & Development) Ltd.Inventors: Tariq Kurd, Mark O. Homewood
-
Publication number: 20110320512Abstract: A decimal multiplication mechanism for fixed and floating point computation in a computer having a coefficient mechanism without resulting leading zero detection (LZD) and process which assumes that the final product will be M+N digits in length and performs all calculations based on this assumption. Least significant digits that would be truncated are no longer stored, but retained as sticky information which is used to finalize the result product. Once the computation of the product is complete, a final check based on the examination of key bits observed during partial product accumulation is used to determine if the final product is truly M+N digits in length, or M+N?1 digits. If the latter is true, then corrective final product shifting is employed to obtain the proper result. This eliminates the need for dedicated leading zero detection hardware used to determine the number of significant digits in the final product.Type: ApplicationFiled: June 23, 2010Publication date: December 29, 2011Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Steven R. Carlough, Adam B. Collura, Michael Kroener, Silvia Melitta Mueller
-
Patent number: 8069199Abstract: Methods and arrangements to correct for double rounding errors when rounding floating point numbers to nearest even are described. Embodiments include transformations, code, state machines or other logic to perform a floating point operation on one or more floating point numbers of precision P1 in base b, producing positive result res0 of precision greater than precision P1; rounding positive result res0 to precision P1 to the nearest even, producing positive result res1; and rounding the result res1 to precision P2 to the nearest even, where P2 is narrower than P1, producing result res2. The embodiments may also include correcting res2 for double rounding errors. The correcting may include determining that res1 is midway between two consecutive floating point numbers of precision P1, the larger (smaller) being res2, determining that rounding res0 to produce res1 involved rounding up (down), and decrementing (incrementing) the significand of res2 to obtain the corrected result res2?.Type: GrantFiled: December 31, 2007Date of Patent: November 29, 2011Assignee: Intel CorporationInventor: Marius Cornea-Hasegan
-
Publication number: 20110270901Abstract: An FFT algorithm that splits a large bit width waveform into two parts, making it possible to conduct the FFT with much lower logic resource consumption is disclosed. The waveform is split into its most significant bits and its least significant bits through division in the form of a bit shift. Each partial signal is then put through an FFT algorithm. The MSB FFT output is then right bit shifted. The two partial FFT's are summed to create a single output that is largely equivalent to an FFT of the original waveform. Rounding distortion is reduced by overlapping the MSB and LSB partial signals.Type: ApplicationFiled: April 30, 2010Publication date: November 3, 2011Applicant: SRC, INC.Inventors: Kristen L. Dobart, Michael T. Addario
-
Publication number: 20110225224Abstract: Efficient computation of complex multiplication results and very efficient fast Fourier transforms (FFTs) are provided. A parallel array VLIW digital signal processor is employed along with specialized complex multiplication instructions and communication operations between the processing elements which are overlapped with computation to provide very high performance operation. Successive iterations of a loop of tightly packed VLIWs are used allowing the complex multiplication pipeline hardware to be efficiently used. In addition, efficient techniques for supporting combined multiply accumulate operations are described.Type: ApplicationFiled: May 26, 2011Publication date: September 15, 2011Applicant: ALTERA CORPORATIONInventors: Nikos P. Pitsianis, Gerald G. Pechanek, Ricardo E. Rodriguez
-
Patent number: 8005884Abstract: A system and method for efficient floating-point rounding in computer systems. A computer system may include at least one floating-point unit for floating-point arithmetic operations such as addition, subtraction, multiplication, division and square root. For the division operation, the constraints for the remainder may be relaxed in order to reduce the area for look-up tables. An extra internal precision bit may not be used. Only one quotient may be calculated, rather than two, further reducing needed hardware to perform the rounding. Comparison logic may be required that may add a couple of cycles to the rounding computation beyond the calculation of the remainder. However, the extra latency is much smaller than a second floating-point multiply accumulate latency.Type: GrantFiled: October 9, 2007Date of Patent: August 23, 2011Assignee: Advanced Micro Devices, Inc.Inventors: Alexandru Fit-Florea, Debjit Das-Sarma
-
Patent number: 8005885Abstract: A processor, an instruction set architecture, an instruction, a computer readable medium and a method for implementing optimal per-instruction encoding of rounding control to emulate directed rounding are disclosed. In one embodiment, an apparatus designed to perform directed rounding includes an instruction decoder configured to decode an instruction, which includes a rounding control information to calculate a result boundary. The apparatus also includes a directed rounding emulator configured to adjust the result boundary to form an adjusted result boundary as a function of the rounding control bit. The adjusted result boundary establishes an endpoint for an interval that includes a result. In one embodiment, the directed round emulator is further configured to emulate a round-to-negative infinity rounding mode and a round-to-positive infinity rounding mode based on at least the single rounding control bit.Type: GrantFiled: October 14, 2005Date of Patent: August 23, 2011Assignee: Nvidia CorporationInventor: Nicholas Patrick Wilt
-
Publication number: 20110185000Abstract: A low-error reduced-width multiplier is provided by the present invention. The multiplier can dynamically compensate the truncation error. The compensation value is derived by the dependencies among the multiplier partial products, and thus, can be analyzed according to the multiplication type and the multiplier input statistics.Type: ApplicationFiled: February 28, 2011Publication date: July 28, 2011Applicant: National Chiao Tung UniversityInventors: Yen-Chin Liao, Hsie-Chia Chang
-
Patent number: 7949701Abstract: A method and system to perform shifting and rounding operations within a microprocessor, such as, for example, a digital signal processor, during execution of a single instruction are described. An instruction to shift and round data within a source register unit of a register file structure is received within a processing unit. The instruction includes a shifting bit value indicating the bit amount for a right shift operation and is subsequently executed to shift data within the source register unit to the right by an encoded bit value, calculated by subtracting a single bit from the shifting bit value contained within the instruction. A predetermined bit extension is further inserted within the vacated bit positions adjacent to the shifted data. Subsequently, an addition operation is performed on the shifted data and a unitary integer value is added to the shifted data to obtain resulting data.Type: GrantFiled: August 2, 2006Date of Patent: May 24, 2011Assignee: QUALCOMM IncorporatedInventors: Lucian Codrescu, Erich Plondke, Mao Zeng
-
Patent number: 7948267Abstract: A specialized processing block for a configurable integrated circuit device includes circuitry for performing multiplications and sums thereof, as well as circuitry for rounding the result. The rounding circuitry reuses an adder that is also available, in other configurations, for accumulation of the result. Rounding is performed by adding a constant to the result and then truncating at the bit position at which rounding is desired. The constant may be entered by a user, or may be derived based on a desired rounding method from mask data entered by the user to identify the rounding bit position.Type: GrantFiled: February 9, 2010Date of Patent: May 24, 2011Assignee: Altera CorporationInventors: Volker Mauer, Martin Langhammer
-
Patent number: 7912888Abstract: A computing device has a rounding processor that inputs therein a set of plural (K) input data IN1 through INK comprising z bits. The rounding processor selects an ensured bit field depending upon the state of usage of each of specific areas A of upper z/2 bits of the 32-bit input data IN1 through INK and rounds the corresponding input data to z/2. As a result of rounding processing, shift information SHIFT of lower (16?n) bits of each discarded non-specific area B is stored in a memory area. D10-1 through D10-K of the rounded respective 16 bits are subjected to multiplication by a multiplier. A digit adjuster shifts multiplication results to the left on the basis of the shift information SHIFT respectively stored in the memory areas to adjust digits.Type: GrantFiled: March 15, 2007Date of Patent: March 22, 2011Assignee: Oki Semiconductor Co., Ltd.Inventor: Wataru Uchida
-
Patent number: 7853636Abstract: An integrated circuit (IC) for convergent rounding including: an adder circuit configured to produce a summation; a comparison circuit configured to bitwise compare the summation with an input pattern, bitwise mask the comparison using a mask, and combine the masked comparison to produce a comparison bit; and rounding circuitry for rounding the summation based at least in part on the comparison bit.Type: GrantFiled: May 12, 2006Date of Patent: December 14, 2010Assignee: Xilinx, Inc.Inventors: Bernard J. New, Jennifer Wong, James M. Simkins, Alvin Y. Ching, John M. Thendean, Anna Wing Wah Wong, Vasisht Mantra Vadi
-
Publication number: 20100306292Abstract: A processor may have at least one multiplier unit which can be controlled to operate in a signed, an unsigned, or a mixed sign mode; a multiplier unit mode decoder coupled with the multiplier unit which receives location information of a first and second operands, wherein the multiplier mode decoder controls the multiplier unit when in the mixed sign mode depending on the location information to operate in a signed mode, an unsigned mode, or a combined signed/unsigned mode.Type: ApplicationFiled: May 7, 2010Publication date: December 2, 2010Inventors: Michael I. Catherwood, Settu Duraisamy
-
Patent number: 7822799Abstract: Adder/rounder circuitry for use in a programmable logic device computes a rounded sum quickly, and ideally within one clock cycle. The rounding position is selectable within a range of bit positions. In an input stage, for each bit position in that range, bits from both addends and a rounding bit are processed, while for each bit position outside that range only bits from both addends are processed. The input stage processing aligns its output in a common format for bits within and outside the range. The input processing may include 3:2 compression for bit positions within the range and 2:2 compression for bit positions outside the range, so that further processing is performed for all bit positions on a sum vector and a carry vector. Computation of the sum proceeds substantially simultaneously with and without the rounding input, and rounding logic makes a selection later in the computation.Type: GrantFiled: June 26, 2006Date of Patent: October 26, 2010Assignee: Altera CorporationInventors: Martin Langhammer, Triet M. Nguyen, Yi-Wen Lin
-
Publication number: 20100260429Abstract: A signal processing apparatus according to an embodiment of the present invention includes: a compression processing unit that performs compression processing on n-bit data; a bit-number conversion unit that converts m-bit input image data into n-bit data (where n<m) by performing round-up or round-down processes on the lower (m?n) bits of the m-bit input image data, and feeds the obtained n-bit data to the compression processing unit; and a conversion processing control unit that selects either one of the round-up process and the round-down process to be performed on each datum of the n-bit data in accordance with a predefined rule on the basis of the position of a frame to which the datum belongs and the position in the frame at which the datum is located, and instructs the bit-number conversion unit to perform the selected round-up process or round-down process.Type: ApplicationFiled: March 31, 2010Publication date: October 14, 2010Applicant: Sony CorporationInventor: Tsutomu Ichinose
-
Patent number: 7765221Abstract: Methods and apparatus, including computer systems and program products, for normalizing computer-represented collections of objects. A first minimum value can be normalized based on a second minimum value of a universal set object that corresponds to the first set object. The second minimum value is both a minimum value supported by a data type (e.g., 1-byte integer) and a minimum value defined to be in the universal set object (e.g., 0 for a universal set of all natural numbers). Similarly, a first maximum value can be normalized based on a second maximum value of the universal set object where the second maximum value is both a maximum value supported by a data type and in the universal set object. Intervals can be normalized, which can involve replacing half-open intervals with equivalent half-closed intervals. Also, a consecutively ordered, uninterrupted, sequence of values of a set object can be normalized.Type: GrantFiled: December 20, 2005Date of Patent: July 27, 2010Assignee: SAP AGInventor: Peter K. Zimmerer
-
Patent number: RE43145Abstract: A processor which executes positive conversion processing, which converts coded data into uncoded data, and saturation calculation processing, which rounds a value to an appropriate number of bits, at high speed. When a positive conversion saturation calculation instruction “MCSST D1” is decoded, the sum-product result register 6 outputs its held value to the path P1. The comparator 22 compares the magnitude of the held value of the sum-product result register 6 with the coded 32-bit integer “0x0000_00FF”. The polarity judging unit 23 judges whether the eighth bit of the value held by the sum-product result register 6 is “ON”. The multiplexer 24 outputs one of the maximum value “0x0000_00FF” generated by the constant generator 21, the zero value “0x0000_0000” generated by the zero generator 25, and the held value of the sum-product result register 6 to the data bus 18.Type: GrantFiled: December 21, 2004Date of Patent: January 24, 2012Assignee: Panasonic CorporationInventors: Toru Morikawa, Nobuo Higaki, Akira Miyoshi, Keizo Sumida
-
Patent number: RE43729Abstract: A processor which executes positive conversion processing, which converts coded data into uncoded data, and saturation calculation processing, which rounds a value to an appropriate number of bits, at high speed. When a positive conversion saturation calculation instruction “MCSST D1” is decoded, the sum-product result register 6 outputs its held value to the path P1. The comparator 22 compares the magnitude of the held value of the sum-product result register 6 with the coded 32-bit integer “0x0000_00FF”. The polarity judging unit 23 judges whether the eighth bit of the value held by the sum-product result register 6 is “ON”. The multiplexer 24 outputs one of the maximum value “0x0000_00FF” generated by the constant generator 21, the zero value “0x0000_0000” generated by the zero generator 25, and the held value of the sum-product result register 6 to the data bus 18.Type: GrantFiled: April 22, 2011Date of Patent: October 9, 2012Assignee: Panasonic CorporationInventors: Toru Morikawa, Nobuo Higaki, Akira Miyoshi, Keizo Sumida