Variable Length Or Precision Patents (Class 708/513)
-
Patent number: 11461095Abstract: The present disclosure relates to a method of storing, by a load and store circuit or other processing means, a variable precision floating point value to a memory address of a memory, the method comprising: reducing the bit length of the variable precision floating point value to no more than a size limit, and storing the variable precision floating point value to one of a plurality of storage zones in the memory, each of the plurality of storage zones having a storage space equal to or greater than the size limit (MBB).Type: GrantFiled: March 6, 2020Date of Patent: October 4, 2022Assignees: Commissariat à l'Energie Atomique et aux Energies Alternatives, Institut National des Sciences Appliquées de LyonInventors: Andrea Bocco, Florent Dupont De Dinechin, Yves Durand
-
Patent number: 11442728Abstract: A sequence alignment method that may be performed by a vector processor is may include loading a sequence that is an instance of vector data including a plurality of elements, dividing the sequence into two groups, aligning respective elements of the groups to generate a sequence of sorted elements according to a single instruction multiple data mode, and iteratively performing an alignment operation based on a determination that each group in the sequence of sorted elements includes more than one element of the plurality of elements. Each iteration may include dividing each group to form new groups and aligning respective elements of each pair of adjacent new groups to generate a new sequence of sorted elements. The new sequence of a current iteration of the alignment operation may be transmitted as a data output, based on a determination that each new group does not include more than one element.Type: GrantFiled: June 20, 2019Date of Patent: September 13, 2022Assignee: Samsung Electronics Co., Ltd.Inventors: Hyun Pil Kim, Hyun Woo Sim, Seong Woo Ahn
-
Patent number: 11372643Abstract: Disclosed embodiments relate to systems and methods for performing instructions to convert to 16-bit floating-point format. In one example, a processor includes fetch circuitry to fetch an instruction having fields to specify an opcode and locations of a first source vector comprising N single-precision elements, and a destination vector comprising at least N 16-bit floating-point elements, the opcode to indicate execution circuitry is to convert each of the elements of the specified source vector to 16-bit floating-point, the conversion to include truncation and rounding, as necessary, and to store each converted element into a corresponding location of the specified destination vector, decode circuitry to decode the fetched instruction, and execution circuitry to respond to the decoded instruction as specified by the opcode.Type: GrantFiled: November 9, 2018Date of Patent: June 28, 2022Assignee: Intel CorporationInventors: Alexander F. Heinecke, Robert Valentine, Mark J. Charney, Raanan Sade, Menachem Adelman, Zeev Sperber, Amit Gradstein, Simon Rubanovich
-
Patent number: 11169776Abstract: Systems, apparatuses and methods may provide for technology that in response to an identification that one or more hardware units are to execute on a first type of data format, decomposes a first original floating point number to a plurality of first segmented floating point numbers that are to be equivalent to the first original floating point number. The technology may further in response to the identification, decompose a second original floating point number to a plurality of second segmented floating point numbers that are to be equivalent to the second original floating point number. The technology may further execute a multiplication operation on the first and second segmented floating point numbers to multiply the first segmented floating point numbers with the second segmented floating point numbers.Type: GrantFiled: June 28, 2019Date of Patent: November 9, 2021Assignee: Intel CorporationInventors: Nitin N. Garegrat, Maciej Urbanski, Michael Rotzin, Brian J. Hickmann, Valentina Popescu
-
Patent number: 11068265Abstract: A sequence alignment method that may be performed by a vector processor is may include loading a sequence that is an instance of vector data including a plurality of elements, dividing the sequence into two groups, aligning respective elements of the groups to generate a sequence of sorted elements according to a single instruction multiple data mode, and iteratively performing an alignment operation based on a determination that each group in the sequence of sorted elements includes more than one element of the plurality of elements. Each iteration may include dividing each group to form new groups and aligning respective elements of each pair of adjacent new groups to generate a new sequence of sorted elements. The new sequence of a current iteration of the alignment operation may be transmitted as a data output, based on a determination that each new group does not include more than one element.Type: GrantFiled: June 20, 2019Date of Patent: July 20, 2021Assignee: Samsung Electronics Co., Ltd.Inventors: Hyun Pil Kim, Hyun Woo Sim, Seong Woo Ahn
-
Patent number: 11068263Abstract: Disclosed embodiments relate to systems and methods for performing instructions to convert to 16-bit floating-point format. In one example, a processor includes fetch circuitry to fetch an instruction having fields to specify an opcode and locations of a first source vector comprising N single-precision elements, and a destination vector comprising at least N 16-bit floating-point elements, the opcode to indicate execution circuitry is to convert each of the elements of the specified source vector to 16-bit floating-point, the conversion to include truncation and rounding, as necessary, and to store each converted element into a corresponding location of the specified destination vector, decode circuitry to decode the fetched instruction, and execution circuitry to respond to the decoded instruction as specified by the opcode.Type: GrantFiled: December 23, 2020Date of Patent: July 20, 2021Assignee: Intel CorporationInventors: Alexander F. Heinecke, Robert Valentine, Mark J. Charney, Raanan Sade, Menachem Adelman, Zeev Sperber, Amit Gradstein, Simon Rubanovich
-
Patent number: 11068262Abstract: Disclosed embodiments relate to systems and methods for performing instructions to convert to 16-bit floating-point format. In one example, a processor includes fetch circuitry to fetch an instruction having fields to specify an opcode and locations of a first source vector comprising N single-precision elements, and a destination vector comprising at least N 16-bit floating-point elements, the opcode to indicate execution circuitry is to convert each of the elements of the specified source vector to 16-bit floating-point, the conversion to include truncation and rounding, as necessary, and to store each converted element into a corresponding location of the specified destination vector, decode circuitry to decode the fetched instruction, and execution circuitry to respond to the decoded instruction as specified by the opcode.Type: GrantFiled: December 23, 2020Date of Patent: July 20, 2021Assignee: Intel CorporationInventors: Alexander F. Heinecke, Robert Valentine, Mark J. Charney, Raanan Sade, Menachem Adelman, Zeev Sperber, Amit Gradstein, Simon Rubanovich
-
Patent number: 10877764Abstract: A vector processor includes: a temporary storage device configured to retain a plurality of elements representing data used at the time of performing an operation appropriate for an instruction; a data type determining part configured to determine what data type the elements retained by the temporary storage device are to be handled as among predetermined data types, in accordance with the instruction; and an output destination deciding part configured to decide an output destination of each of the elements stored by the temporary storage device, based on the result of determination by the data type determining part. The vector processor is configured to output each of the elements to the output destination decided by the output destination deciding part, thereby performing the operation.Type: GrantFiled: March 5, 2019Date of Patent: December 29, 2020Assignee: NEC CORPORATIONInventor: Kei Kimoto
-
Patent number: 10853080Abstract: A processor including a physical register file with multiple physical registers, mapping logic, and a merge system. The mapping logic maps up to a first maximum number of the physical registers for each architectural register specified in received program instructions and stores corresponding mappings in a rename table. The merge system generates a merge instruction for each architectural register that needs to be merged, inserts each generated merge instruction into the program instructions to provide a modified set of instructions, and that issues the modified set of instructions in consecutive issue cycles based on a take rule. In one embodiment, the first maximum number may be two.Type: GrantFiled: November 13, 2017Date of Patent: December 1, 2020Assignee: SHANGHAI ZHAOXIN SEMICONDUCTOR CO., LTD.Inventors: Xiaolong Fei, Mengchen Yang
-
Patent number: 10114613Abstract: A computing system includes computational memory and digital combinational circuitry operatively coupled with the computational memory. The computational memory is configured to perform computations at a prescribed precision. The digital combinational circuitry is configured to increase the precision of the computations performed by the computational memory. The computational memory and the digital combinational circuitry may be configured to iteratively perform a computation to a predefined precision. The computational memory may include circuitry configured to perform analog computation using values stored in the computational memory, and the digital combinational circuitry may include a central processing unit, a graphics processing unit and/or application specific circuitry. The computational memory may include an array of resistive memory elements having resistance or conductance values stored therein, the respective resistance or conductance values being programmable.Type: GrantFiled: September 7, 2016Date of Patent: October 30, 2018Assignee: International Business Machines CorporationInventors: Konstantinos Bekas, Alessandro Curioni, Evangelos S. Eleftheriou, Manuel Le Gallo-Bourdeau, Abu Sebastian, Tomas Tuma
-
Patent number: 10042646Abstract: A processor including a physical register file, a rename table, mapping logic, size tracking logic, and merge logic. The rename table maps an architectural register with a larger index and a smaller index. The mapping logic detects a partial write instruction that specifies an architectural register that is already identified by an entry of the rename table mapped to a second physical register allocated for a larger write operation, and includes an index for the allocated register for the partial write instruction into the smaller index location of the entry. The size tracking logic provides a merge indication for the partial write instruction if the write size of the previous write instruction is larger. The merge logic merges the result of the partial write instruction with the second physical register during retirement of the partial write instruction.Type: GrantFiled: August 25, 2016Date of Patent: August 7, 2018Assignee: VIA ALLIANCE SEMICONDUCTOR CO., LTD.Inventor: Xiaolong Fei
-
Patent number: 9589496Abstract: A method of accumulating data by a processor in a nonvolatile memory to track use of a device. The method includes: retrieving by the processor a next datum for accumulation into a first accumulation stored in the memory, the next datum representing a next use of the device; generating by the processor a next dither offset; adding by the processor the next dither offset to the next datum to produce a first sum; dividing by the processor the first sum by a scale factor to produce a quantized datum; and adding by the processor the quantized datum to the first accumulation. The first accumulation tracks the use of the device.Type: GrantFiled: August 1, 2014Date of Patent: March 7, 2017Assignee: Samsung Display Co., Ltd.Inventor: Ning Lu
-
Patent number: 9405505Abstract: A control device includes: a first conversion unit that converts floating point data generated by an operation of a floating point operation command into a numeric string in first format; and a second conversion unit that converts the numeric string in first format into a numeric string in second format. A character string data generation unit generates a character string data including the numeric strings in first format and in second format and outputs the character string data to an external device or an external storage medium.Type: GrantFiled: October 7, 2014Date of Patent: August 2, 2016Assignee: FANUC CorporationInventor: Noritake Nagashima
-
Patent number: 9189200Abstract: A specialized processing block in a programmable integrated circuit device is configurable to perform floating-point arithmetic operations at selectable different precisions. The specialized processing block includes a plurality of different respective types of floating-point arithmetic operator structures. For each respective type of floating-point arithmetic operator structure, respective control circuitry for partitions the respective type of floating-point arithmetic operator structure to select between a first precision for which the respective type of floating-point arithmetic operator structure is not partitioned, and at least a second precision, less than the first precision, for which the respective type of floating-point arithmetic operator structure is partitioned into at least two smaller ones of the respective type of floating-point arithmetic operator structure.Type: GrantFiled: March 14, 2013Date of Patent: November 17, 2015Assignee: Altera CorporationInventor: Martin Langhammer
-
Patent number: 8959131Abstract: Apparatus for processing data includes processing circuitry 16, 18, 20, 22, 24, 26 and decoder circuitry 14 for decoding program instructions. The program instructions decoded include a floating point pre-conversion instruction which performs round-to-nearest ties to even rounding upon the mantissa field of an input floating number to generate an output floating point number with the same mantissa length but with the mantissa rounded to a position corresponding to a shorter mantissa field. The output mantissa field includes a suffix of zero values concatenated the rounded value. The decoder for circuitry 14 is also responsive to an integer pre-conversion instruction to quantize and input integer value using round-to-nearest ties to even rounding to form an output integer operand with a number of significant bits matched to the mantissa size of a floating point number to which the integer is later to be converted using an integer-to-floating point conversion instruction.Type: GrantFiled: September 22, 2011Date of Patent: February 17, 2015Assignee: ARM LimitedInventors: Jorn Nystad, Andreas Due Engh-Halstvedt, Simon Alex Charles
-
Publication number: 20140195580Abstract: A method of an aspect includes receiving a floating point round-off amount determination instruction. The instruction indicates a source of one or more floating point data elements, indicates a number of fraction bits after a radix point, and indicates a destination storage location. A result including one or more result floating point data elements is stored in the destination storage location in response to the floating point round-off amount determination instruction. Each of the one or more result floating point data elements includes a difference between a corresponding floating point data element of the source in a corresponding position, and a rounded version of the corresponding floating point data element of the source that has been rounded to the indicated number of the fraction bits. Other methods, apparatus, systems, and instructions are disclosed.Type: ApplicationFiled: December 30, 2011Publication date: July 10, 2014Inventors: Cristina S. Anderson, Bret L. Toll, Robert Valentine, Simon Rubanovich, Amit Gradsieien
-
Patent number: 8706790Abstract: The resources needed—particularly in a programmable device—when carrying out a mixed-precision multiplication-based floating-point operation (i.e., multiplication or division) is reduced by maintaining the mantissas of the operands in their native precisions instead of promoting the lower-precision number to the higher precision. Exponents and other elements can be handled by the higher-precision logic as they do not consume significant resources.Type: GrantFiled: March 3, 2009Date of Patent: April 22, 2014Assignee: Altera CorporationInventor: Martin Langhammer
-
Patent number: 8595279Abstract: A method and apparatus for performing a floating-point operation with a floating-point processor having a given precision is disclosed. A subprecision for the floating-point operation on one or more floating-point numbers is selected. The selection of the subprecision results in one or more excess bits for each of the one or more floating-point numbers. Power may be removed from one or more components in the floating-point processor that would otherwise be used to store or process the one or more excess bits, and the floating-point operation is performed with power removed from the one or more components.Type: GrantFiled: February 27, 2006Date of Patent: November 26, 2013Assignee: QUALCOMM IncorporatedInventor: Kenneth Alan Dockser
-
Patent number: 8484266Abstract: An embedded control system capable of ensuring precision in arithmetic with data in the floating-point format and also avoiding a shortage of the storage area of a memory is provided. According to an embedded control system in the present invention, when discrete data in the floating-point format is stored in a read-only memory, the discrete data in the floating-point format is converted into data in a significand-reduced floating-point format before being stored. Here, a significand-reduced floating-point number is a number obtained by deleting low-order bits of the significand of a floating-point number. Further, an interpolation search is performed using discrete data, the discrete data in the significand-reduced floating-point format stored in the read-only memory is brought back to the discrete data in the floating-point format before an interpolation search being performed.Type: GrantFiled: February 19, 2009Date of Patent: July 9, 2013Assignee: Hitachi, Ltd.Inventors: Shinya Fujimoto, Keiichiro Ohkawa
-
Patent number: 8468191Abstract: Systems and methods for multi-precision computation are disclosed. One embodiment of the present invention includes a plurality of multiply-add units (MADDs) configured to perform one or more single precision operations and an arrangement generator to generate one or more mantissa arrangements using a plurality of double precision numbers. Each MADD is configured to receive and load said mantissa arrangements from the arrangement generator. The MADDs compute a result of a multi-precision computation using the mantissa arrangements. In an embodiment, the MADDs are configured to simultaneously perform operations that include, single precision operations, double-precision additions and double-precision multiply and additions.Type: GrantFiled: June 10, 2010Date of Patent: June 18, 2013Assignee: Advanced Micro Devices, Inc.Inventors: Michael J. Mantor, Jeffrey T. Brady, Daniel B. Clifton, Christopher Spencer
-
Patent number: 8463834Abstract: A floating point multiplier includes a data path in which a plurality of partial products are calculated and then reduced to a first partial product and a second partial product. Shift amount determining circuitry 100 analyzes the exponents of the input operands A and B as well as counting the leading zeros in the fractional portions of these operands to determine an amount of left shift or right shift to be applied by shifting circuitry 200, 202 within the multiplier data path. This shift amount is applied so as to align the partial products so that when they are added they will produce the result C without requiring this to be further shifted. Furthermore, shifting the partial products to the correct alignment in this way in advance of adding these partial products permits injection rounding combined with the adding of the partial products to be employed for cases including subnormal values.Type: GrantFiled: November 3, 2009Date of Patent: June 11, 2013Assignee: ARM LimitedInventor: David Raymond Lutz
-
Patent number: 8421794Abstract: The disclosure describes an adaptive multi-shader within a processor that uses one or more high-precision arithmetic logic units (ALUs) and low-precision ALUs to process data based on the type of the data. Upon receiving a stream of data, the adaptive multi-shader first determines the type of the data. For example, the adaptive multi-shader may determine whether the data is suitable for high-precision processing or low-precision processing. The adaptive multi-shader then processes the data using the high-precision ALUs when the data is suitable for high-precision processing, and processes the data using the high-precision ALUs and the low-precision ALUs when the data is suitable for low-precision processing. The adaptive multi-shader may substantially reduce power consumption and silicon size of the processor by implementing the low-precision ALUs while maintaining the ability to process data using high-precision processing by implementing the high-precision ALUs.Type: GrantFiled: March 23, 2007Date of Patent: April 16, 2013Assignee: QUALCOMM IncorporatedInventors: Yun Du, Guofang Jiao, Chun Yu
-
Patent number: 8412760Abstract: A floating point execution unit is capable of selectively repurposing a subset of the significand bits in a floating point value for use as additional exponent bits to dynamically provide an extended range for floating point calculations. A significand field of a floating point operand may be considered to include first and second portions, with the first portion capable of being concatenated with the second portion to represent the significand for a floating point value, or, to provide an extended range, being concatenated with the exponent field of the floating point operand to represent the exponent for a floating point value.Type: GrantFiled: July 22, 2008Date of Patent: April 2, 2013Assignee: International Business Machines CorporationInventors: Mark J Hickey, Adam J Muff, Matthew R Tubbs, Charles D Wait
-
Patent number: 8412761Abstract: A single-precision floating-point data storing method for use in a processor including a register, which has a size that can store double-precision floating-point data, for storing double-precision floating-point data and single-precision floating-point data comprises writing input single-precision floating-point data to the high-order half of the register, and writing all zeros to the low-order half of the register if a single-precision floating-point data process is specified.Type: GrantFiled: November 24, 2008Date of Patent: April 2, 2013Assignee: Fujitsu LimitedInventor: Toshio Yoshida
-
Patent number: 8411854Abstract: A method of generating a private key for use in an authentication protocol comprises, at a client: receiving a user specific identifier; converting the identifier through a one-way function to a string of a pre-determined length; and mapping said string to a permutation ?priv of a pre-determined order, said permutation being operable with a first graph G1 to generate a second graph G2=?priv(G1).Type: GrantFiled: December 10, 2008Date of Patent: April 2, 2013Assignee: National University of Ireland, GalwayInventors: Slawomir Grzonkowski, Wojciech Zaremba
-
Patent number: 8392490Abstract: A method, computer program product and a system for identifying decimal floating point addition operations that guarantee operand alignment and do not require alignment, normalization or rounding are provided. The method includes: receiving an instruction to perform an addition of a first operand and a second operand; extracting a first exponent (EXP) and a first most significant digit (MSD) from the first operand; extracting a second EXP and a second MSD from the second operand; and determining whether alignment between the first operand and the second operand is guaranteed, based on the first EXP, the first MSD, the second EXP and the second MSD.Type: GrantFiled: February 18, 2008Date of Patent: March 5, 2013Assignee: International Business Machines CorporationInventors: Adam B. Collura, Steven R. Carlough, Wen He Li, Eric M. Schwarz
-
Patent number: 8024394Abstract: Included are embodiments of a Multiply-Accumulate Unit to process multiple format floating point operands. For short format operands, embodiments of the Multiply Accumulate Unit are configured to process data with twice the throughput as long and mixed format data. At least one embodiment can include a short exponent calculation component configured to receive short format data, a long exponent calculation component configured to receive long format data, and a mixed exponent calculation component configured to receive short exponent data, the mixed exponent calculation component further configured to received long format data. Embodiments also include a mantissa datapath configured for implementation to accommodate processing of long, mixed, and short floating point operands.Type: GrantFiled: February 6, 2007Date of Patent: September 20, 2011Assignee: Via Technologies, Inc.Inventors: Boris Prokopenko, Timour Paltashev, Derek Gladding
-
Patent number: 7877431Abstract: Systems and methods for encoding floating point numbers. A system can include encoding logic which encodes invalid floating point representations as valid data. Decoding logic can be used to recognize the invalid floating point representations and map can provide the invalid floating point representations to valid data values. The decoding logic then can provide the valid data values so that operations on the valid data values can be performed in accordance with instructions received from an associated program.Type: GrantFiled: September 7, 2005Date of Patent: January 25, 2011Assignee: Research In Motion LimitedInventors: John F.A. Dahms, David P. Yach
-
Patent number: 7860914Abstract: In this invention we describe a new type of computer—infinity computer—that is able to operate with infinite, infinitesimal, and finite numbers in such a way that it becomes possible to execute the usual arithmetical operations with all of them. For the new computer it is shown how the memory for storage of these members is organized and how the new arithmetic logic unit (NALU) executing arithmetical operations with them works.Type: GrantFiled: September 8, 2006Date of Patent: December 28, 2010Inventor: Sergeev Yaroslav
-
Patent number: 7716267Abstract: Decimal calculation apparatus, which performs multidigit decimal calculation with the number of calculation digits set in calculation instruction, comprises multidigit memory section capable of storing values with greater numbers of digits than the number of digits of a predetermined digit unit in plurality of memory areas; calculation-instruction memory section which stores calculation instruction having the number of calculation digits and type of calculation set therein; and decimal calculation section which performs decimal calculation of sequentially calculating numerical values of corresponding digit units respectively stored in plurality of memory areas of the multidigit memory section, digit unit by digit unit in the number of calculation digits set in calculation instruction stored in calculation-instruction memory section, in decimal calculation according to type of calculation set in calculation instruction stored in calculation-instruction memory section, and sequentially writing calculation resulType: GrantFiled: April 19, 2005Date of Patent: May 11, 2010Assignee: Casio Computer Co., Ltd.Inventors: Hisashi Ito, Tetsuichi Nakae
-
Patent number: 7668897Abstract: Within a processor 2 providing single instruction multiple data (SIMD) type operation, single data processing instructions can serve to control processing logic 4, 6, 8, 10 to perform SIMD-type processing operations upon multiple independent input values to generate multiple independent result values having a greater data width than the corresponding input values. A repartitioner (FIG. 5) in the form of appropriately controlled multiplexers serves to partition these result data values into high order bit portions and low order bit portions that are stored into separate registers 38, 40. The required SIMD width preserved result values can be read from the desired high order 38 result register or low order result register 40 without further processing being required. Furthermore, the preservation of the full result facilitates improvements in accuracy, such as over extended accumulate operations and the like.Type: GrantFiled: June 16, 2003Date of Patent: February 23, 2010Assignee: ARM LimitedInventor: Daniel Kershaw
-
Publication number: 20100023568Abstract: A floating point execution unit is capable of selectively repurposing a subset of the significand bits in a floating point value for use as additional exponent bits to dynamically provide an extended range for floating point calculations. A significand field of a floating point operand may be considered to include first and second portions, with the first portion capable of being concatenated with the second portion to represent the significand for a floating point value, or, to provide an extended range, being concatenated with the exponent field of the floating point operand to represent the exponent for a floating point value.Type: ApplicationFiled: July 22, 2008Publication date: January 28, 2010Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Mark J. Hickey, Adam J. Muff, Matthew R. Tubbs, Charles D. Wait
-
Publication number: 20090240757Abstract: A single-precision floating-point data storing method for use in a processor including a register, which has a size that can store double-precision floating-point data, for storing double-precision floating-point data and single-precision floating-point data comprises writing input single-precision floating-point data to the high-order half of the register, and writing all zeros to the low-order half of the register if a single-precision floating-point data process is specified.Type: ApplicationFiled: November 24, 2008Publication date: September 24, 2009Applicant: FUJITSU LIMITEDInventor: Toshio YOSHIDA
-
Patent number: 7483936Abstract: A calculating unit including a number of bit slices which is less than the number of positions of the operand to be processed. Each bit slice has a logic element and a communication bus between the logic element and the plurality of register cells. The register cells are connected in parallel with respect to the slice-internal communication bus and are controlled by a controller so that only one register cell of the plurality of register cells is coupled to the communication bus at a time.Type: GrantFiled: March 14, 2005Date of Patent: January 27, 2009Assignee: Infineon Technologies AGInventors: Norbert Janssen, Tanja Roemer, Holger Sedlak
-
Patent number: 7472051Abstract: A microcontroller comprising a central processing unit and a further fault processing unit suitable for performing validation of operations of said central processing unit. The further fault processing unit is external and different with respect to said central processing unit and said further fault processing unit comprises at least a module for performing validation of operations of said central processing unit and one or more modules suitable for performing validation of operations of other functional parts of said microcontroller. Validation of operations of said central processing unit is performed by using one or more of the following fault tolerance techniques: data shadowing; code&flow signature; data processing legality check; addressing legality check; ALU concurrent integrity checking; concurrent mode/interrupt check.Type: GrantFiled: July 9, 2004Date of Patent: December 30, 2008Assignee: Yogitech SpaInventors: Riccardo Mariani, Silvano Motto, Monia Chiavacci
-
Publication number: 20080235316Abstract: The disclosure describes an adaptive multi-shader within a processor that uses one or more high-precision arithmetic logic units (ALUs) and low-precision ALUs to process data based on the type of the data. Upon receiving a stream of data, the adaptive multi-shader first determines the type of the data. For example, the adaptive multi-shader may determine whether the data is suitable for high-precision processing or low-precision processing. The adaptive multi-shader then processes the data using the high-precision ALUs when the data is suitable for high-precision processing, and processes the data using the high-precision ALUs and the low-precision ALUs when the data is suitable for low-precision processing. The adaptive multi-shader may substantially reduce power consumption and silicon size of the processor by implementing the low-precision ALUs while maintaining the ability to process data using high-precision processing by implementing the high-precision ALUs.Type: ApplicationFiled: March 23, 2007Publication date: September 25, 2008Inventors: Yun Du, Guofang Jiao, Chun Yu
-
Patent number: 7397399Abstract: The present invention concerns a method for transcoding a N bits word into a M bits word with M<N. The invention is applicable in various fields and more particularly in the display field. The method comprises the following steps:-breaking down the N bits word into an exponent part and a mantissa part having each a size which varies according to the value of said N bits word, the size of the mantissa part increasing with the value of said N bits word, and -encoding the exponent part of the N bits word into a variable number of bits A and removing, if need be, least significant bits of the mantissa part in order to obtain a mantissa with a variable number of bits B, with A+B=M.Type: GrantFiled: December 14, 2004Date of Patent: July 8, 2008Assignee: Thomson LicensingInventors: Cédric Thebault, Carlos Correa, Sébastien Weitbruch
-
Patent number: 7047271Abstract: In one embodiment, a digital signal processor (DSP) processes both n-bit data and (n/2)-bit data. The DSP includes multiple processing paths. A first processing path processes n-bit data. A second processing path is processes (n/2)-bit data. The multiple processing paths may be established using multiple components or may share components. When the processing paths share components, only one of the processing paths may be used at a time.Type: GrantFiled: April 20, 2004Date of Patent: May 16, 2006Assignees: Intel Corporation, Analog Devices, Inc.Inventors: Bradley C. Aldrich, Jose Fridman, Paul Meyer, Gang Liang
-
Patent number: 7043518Abstract: A multiply accumulate unit (“MAC”) that performs operations on packed integer data. In one embodiment, the MAC receives 2 32-bit data words which, depending on the specified mode of operation, each contain either four 8-bit operands, two 16-bit operands, or one 32-bit operand. Depending on the mode of operation, the MAC performs either sixteen 8×8 operations, four 16×16 operations, or one 32×32 operation. Results may be individually retrieved from registers and the corresponding accumulator cleared after the read cycle. In addition, the accumulators may be globally initialized. Two results from the 8×8 operations may be packed into a single 32-bit register. The MAC may also shift and saturate the products as required.Type: GrantFiled: February 9, 2004Date of Patent: May 9, 2006Assignee: Cradle Technologies, Inc.Inventors: Moshe B. Simon, Erik P. Machnicki, David A. Harrison, Rakesh K. Singh
-
Patent number: 7039906Abstract: A compiler for data processing outputs lower-level code for packing multiple signed data elements per register into a processor's registers using the rules set forth herein, and when executed, the code simultaneously operates on the elements in a register in a single cycle using the same operand. The elements can be independent of each other as defined by compiler directives, and the sizes of the elements in a register can differ from each other. Moreover, a relatively large element can be split across multiple registers. In an exemplary application, a data stream representing two images can be simultaneously processed using the same number of registers as have been required to process a single image. Or, a single image can be processed approaching N-times faster, where N is the number of elements per register. In any case, the present invention results in a significant increase in processing efficiency.Type: GrantFiled: October 20, 2000Date of Patent: May 2, 2006Assignee: International Business Machines CorporationInventors: Jennifer Q. Trelewicz, Michael Thomas Brady, Joan Laverne Mitchell
-
Patent number: 6854001Abstract: A computing device (40) comprises an electrical circuit and a software application. A display screen (138) and an input device (140) are electrically coupled to the electrical circuit. The software application provides instructions to determine the number of significant figures for a number entered via the input device, and simultaneously display on the display screen the entered number along with the number of significant figures for the entered number, and/or the software application provides instructions to calculate a floating point answer for a mathematical operation entered for one or more numbers entered into the computing device, round the floating point answer to the proper precision or to the proper number of significant figures, determine the number of significant figures for the rounded answer, and simultaneously display on the display the rounded answer and its number of significant figures.Type: GrantFiled: December 11, 2001Date of Patent: February 8, 2005Assignee: Texas Instruments IncorporatedInventors: John C. Good, Shawn Prestridge
-
Publication number: 20040205096Abstract: A programmable processor and system for improving the performance of processors by incorporating an execution unit operable to decode and execute single instructions specifying three registers each containing a plurality of data elements, the execution unit operable to multiply the first and second registers and add the third register to produce a catenated result containing a plurality of data elements. Additional instructions provide group floating-point subtract, add, multiply, set less, and set greater equal operations. The set less and set greater equal operations produce alternatively zero or an identity element for each element of a catenated result, the result facilitating alternative selection of individual data elements using bitwise Boolean operations and without requiring conditional branch operations.Type: ApplicationFiled: January 16, 2004Publication date: October 14, 2004Applicant: MICROUNITY SYSTEMS ENGINEERING, INC.Inventors: Craig Hansen, John Moussouris
-
Patent number: 6681236Abstract: The process for performing operations with a variable arithmetic does not call for any shifting of the data in the different registers that come into play in the operation. The input registers can have empty parts which are completed by appropriate bit sequences to ensure a propagation of a possible outgoing carry over in order to recover that carry over from a result register.Type: GrantFiled: December 15, 2000Date of Patent: January 20, 2004Assignee: STMicroelectronics S.A.Inventors: David Jacquet, Pascal Fouilleul
-
Patent number: 6629231Abstract: There is disclosed a pipelined floating point unit comprising: a) a first plurality of pipelined functional units for processing operands conforming to a single instruction-multiple data stream (SIMD) instruction set architecture (ISA); b) a second plurality of pipelined functional units for processing operands conforming to a scalar instruction set architecture (ISA); and c) a first format fault detection circuit associated with at least one of the first plurality of pipelined functional units for determining whether a first operand is a denormal number and, in response to the determination, generating a first fault signal. The first fault signal causes a number conversion circuit associated with the pipelined floating point unit to modify a significand and an exponent of at least one operand in a data register associated with the pipelined floating point unit to thereby convert the at least one operand to a denormal number.Type: GrantFiled: January 4, 2000Date of Patent: September 30, 2003Assignee: National Semiconductor CorporationInventor: Jeffrey A. Lohman
-
Publication number: 20030172100Abstract: Logic for calculating a sum of numbers using an overflow counter in an environment exceeded by the numbers in bit-size accesses a least significant portion of a first number, accesses a least significant portion of a second number, and adds the least significant portion of the first number to the least significant portion of the second number. The resulting sum includes a first intermediate number. If a carry is generated by the addition of the least significant portion of the first number to the least significant portion of the second number, the logic increments an overflow counter to record the generated carry. The logic accesses least significant portions of the remaining numbers, adds the least significant portions to the first intermediate number, and increments the overflow counter each time a carry is generated to record the generated carry. After each of the least significant portions has been added to the first intermediate number, the logic stores the first intermediate number.Type: ApplicationFiled: January 22, 2003Publication date: September 11, 2003Inventor: Alexander Tessarolo
-
Publication number: 20030163500Abstract: Logic for performing 64-bit scaled sum-of-product operations in a 32-bit environment accesses a first 32-bit number, a second 32-bit number, and a shift number in a first operation. The logic multiplies the first 32-bit number by the second 32-bit number. The resulting product includes a first 64-bit number that includes a most and least significant 32-bit portions. The logic right-shifts the least significant 32-bit portion of the first 64-bit number according to the shift number, accesses a least significant 32-bit portion of a second 64-bit number, and adds the right-shifted least significant 32-bit portion of the first 64-bit number to the least significant 32-bit portion of the second 64-bit number. The resulting sum includes a least significant 32-bit portion of a final result of a 64-bit scaled sum-of-product operation and a carry bit. In a second operation, the logic multiplies the first 32-bit number by the second 32-bit number. The resulting product includes the first 64-bit number.Type: ApplicationFiled: January 22, 2003Publication date: August 28, 2003Inventor: Alexander Tessarolo
-
Publication number: 20030163499Abstract: In one embodiment of the present invention, logic for limiting the value of a 64-bit number to a maximum limit in a 32-bit environment allocates one or more bit flags in a first operation. The logic accesses a most significant 32-bit portion of a first 64-bit number including a first value. The logic accesses a most significant 32-bit portion of a second 64-bit number including a maximum limit and a second value. The logic compares the first value with the second value and, if the first value is greater than the second value, sets the bit flags accordingly and changes the most significant 32-bit portion of the first 64-bit number to match the same of the second 64-bit number. If the first value is equal to the second value, the logic sets the one or more allocated bit flags accordingly. In a second operation following the first operation, the logic accesses the bit flags.Type: ApplicationFiled: January 22, 2003Publication date: August 28, 2003Inventors: Alexander Tessarolo, Karthikeyan Rajan Madathil, G. Subash Chandar
-
Patent number: 6564238Abstract: A digital signal processing system performs different word-length arithmetic operations (e.g., 24-bit arithmetic and 16-bit arithmetic) using the same hardware. The digital signal processing system includes internal buses, a host processor coupled to the internal buses, a DSP coprocessor coupled to the internal buses for performing a digital signal processing under control of the host processor, and first and second data memories coupled to the internal buses. The DSP coprocessor further includes a random access memory (RAM) pointer for generating addresses to access the first and second data memories, a multiply and accumulate (MAC) unit for performing a multiply and accumulate operation, an arithmetic unit, a shift and exponent unit for shifting operands and for evaluating exponents, and a local decoder for decoding DSP commands from the host processor and controlling the RAM pointer, the arithmetic unit, and the shift and exponent unit.Type: GrantFiled: February 2, 2000Date of Patent: May 13, 2003Assignee: Samsung Electronics Co., Ltd.Inventors: Hong-Kyu Kim, Yong-Chun Kim
-
Patent number: 6557096Abstract: A signal processor with an instruction set architecture (ISA) for flexible data typing, permutation, and type matching of operands. The signal processor includes a data typer and aligner to support flexible data typing, permutation and type matching of operands of the instruction set architecture. The data typer and aligner is selectively configued to align and select one of more sets of data bits from one or more data buses as operands for functional blocks of the signal processor in response to fields of an instruction.Type: GrantFiled: August 31, 2000Date of Patent: April 29, 2003Assignee: Intel CorporationInventors: Kumar Ganapathy, Ruban Kanapathipillai
-
Patent number: 6523057Abstract: A high-speed, wide dynamic range, digital accumulator includes a first adder stage in which an input addend is added to a value of a least significant part of an output of an accumulator from a preceding clock period. The accumulator also includes at least one second stage having incrementer/decrementer means for performing an increment, decrement or identity operation on a most significant part of the output of the accumulator. The incrementer/decrementer means includes logic means for triggering the increment, a decrement or identity operation on the most significant part of the accumulator output based on a decision made on results obtained at the previous clock period.Type: GrantFiled: May 7, 1999Date of Patent: February 18, 2003Assignee: STMicroelectronics S.r.l.Inventors: Pierandrea Savo, Luigi Zangrandi, Stefano Marchese