Decoding Instruction To Accommodate Variable Length Instruction Or Operand Patents (Class 712/210)

Central processor for memory tag

Patent number: 8856497

Abstract: This invention provides a processor 200, especially for use as the central processing unit of a memory tag 1200. The processor 200 has a minimal footprint in Silicon or other suitable material. It also is driven by the data that it receives. The processor includes a plurality 206, 212, 214, 224 of registers configured to receive in parallel data that are input to the processor, and to process in parallel the received data, and a micro sequencer and instruction decoder module 202 adapted to select two or more of the plurality of registers to receive the data that are input to the processor, and to control the processing of the received data by the end or more selected registers. A memory 1200 device utilizing such a processor, and a method of processing instructions are also provided.

Type: Grant

Filed: December 6, 2005

Date of Patent: October 7, 2014

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Weng Wah Loh, Fraser John Dickin, Thomas Rathbone
PROCESSORS, METHODS, AND SYSTEMS TO IMPLEMENT PARTIAL REGISTER ACCESSES WITH MASKED FULL REGISTER ACCESSES

Publication number: 20140297994

Abstract: A method includes receiving a packed data instruction indicating a first narrower source packed data operand and a narrower destination operand. The instruction is mapped to a masked packed data operation indicating a first wider source packed data operand that is wider than and includes the first narrower source operand, and indicating a wider destination operand that is wider than and includes the narrower destination operand. A packed data operation mask is generated that includes a mask element for each corresponding result data element of a packed data result to be stored by the masked packed data operation. All mask elements that correspond to result data elements to be stored by the masked operation that would not be stored by the packed data instruction are masking out. The masked operation is performed using the packed data operation mask. The packed data result is stored in the wider destination operand.

Type: Application

Filed: March 30, 2013

Publication date: October 2, 2014

Inventors: Edward T. Grochowski, Seyed Yahya Sotoudeh, Buford M. Guy
System using a unique marker with each software code-block

Patent number: 8850410

Abstract: A system and method for improving software maintainability, performance, and/or security by associating a unique marker to each software code-block; the system comprising of a plurality of processors, a plurality of code-blocks, and a marker associated with each code-block. The system may also include a special hardware register (code-block marker hardware register) in each processor for identifying the markers of the code-blocks executed by the processor, without changing any of the plurality of code-blocks.

Type: Grant

Filed: January 29, 2010

Date of Patent: September 30, 2014

Assignee: International Business Machines Corporation

Inventors: Ramanjaneya S. Burugula, Joefon Jann, Pratap C. Pattnaik
Systems, Apparatuses,and Methods for Zeroing of Bits in a Data Element

Publication number: 20140281400

Abstract: Embodiments of systems, methods and apparatuses for execution a NAME instruction are described. The execution of a VPBZHI causes, on a per data element basis of a second source, a zeroing of bits higher (more significant) than a starting point in the data element. The starting point is defined by the contents of a data element in a first source. The resultant data elements are stored in a corresponding data element position of a destination.

Type: Application

Filed: March 15, 2013

Publication date: September 18, 2014

Inventors: Elmoustapha Ould-Ahmed-Vall, Robert Valentine
Systems, Apparatuses, and Methods for Determining a Trailing Least Significant Masking Bit of a Writemask Register

Publication number: 20140281401

Abstract: The execution of a KZBTZ finds a trailing least significant zero bit position in an first input mask and sets an output mask to have the values of the first input mask, but with all bit positions closer to the most significant bit position than the trailing least significant zero bit position in an first input mask set to zero. In some embodiments, a second input mask is used as a writemask such that bit positions of the first input mask are not considered in the trailing least significant zero bit position calculation depending upon a corresponding bit position in the second input mask.

Type: Application

Filed: March 15, 2013

Publication date: September 18, 2014

Inventors: Christopher J. Hughes, Mark J. Charney, Jesus Corbal, Milind B. Girkar, Elmoustapha Ould-Ahmed_Vall, Bret L. Toll, Robert Valentine
Prefix accumulation for efficient processing of instructions with multiple prefix bytes

Patent number: 8838938

Abstract: In a microprocessor that has an instruction set architecture in which the instructions may include a variable number of prefix bytes, an apparatus for efficiently extracting instructions from a stream of undifferentiated instruction bytes. Decode logic determines which byte is an opcode byte for each instruction of a plurality of instructions within the stream of undifferentiated instruction bytes. The opcode byte is the first non-prefix byte of the instruction. The decode logic accumulates prefix information onto the opcode byte of the instruction for each instruction of the plurality of instructions. A queue holds the stream of undifferentiated instruction bytes and the accumulated prefix information. Extraction logic extracts the plurality of instructions from the queue in one clock cycle independent of the number of prefix bytes included in each of the plurality of instructions.

Type: Grant

Filed: October 1, 2009

Date of Patent: September 16, 2014

Assignee: Via Technologies, Inc.

Inventors: Thomas C. McDonald, John L. Duncan
Length determination of instruction code with address form field and escape opcode value by evaluating portions other than instruction specific opcode

Patent number: 8793470

Abstract: A method, apparatus and system are disclosed for decoding an instruction in a variable-length instruction set. The instruction is one of a set of new types of instructions that uses a new escape code value, which is two bytes in length, to indicate that a third opcode byte includes the instruction-specific opcode for a new instruction. The new instructions are defined such the length of each instruction in the opcode map for one of the new escape opcode values may be determined using the same set of inputs, where each of the inputs is relevant to determining the length of each instruction in the new opcode map. For at least one embodiment, the length of one of the new instructions is determined without evaluating the instruction-specific opcode.

Type: Grant

Filed: March 15, 2013

Date of Patent: July 29, 2014

Assignee: Intel Corporation

Inventors: James S. Coke, Peter J. Ruscito, Masood Tahir, David B. Jackson, Ves A. Naydenov, Scott D. Rodgers, Bret L. Toll, Frank Binns
SYSTEM AND METHOD FOR PERFORMING A SHUFFLE INSTRUCTION

Publication number: 20140189311

Abstract: An apparatus and method for performing a shuffle operation on packed data using computer-implemented steps is described. In one embodiment, a first packed data operand having at least two data elements is accessed. A second packed data operand having at least two data elements is accessed. One of the data elements in the first packed data operand is shuffled into a lower destination field of a destination register, and one of the data elements in the second packed data operand is shuffled into an upper destination field of the destination register.

Type: Application

Filed: December 31, 2012

Publication date: July 3, 2014

Inventors: Patrice Roussel, Srinivas Chennupaty, Michael Cranford, Mohammad Abdallah, James Coke, Katherine Kong
System and apparatus for group floating-point inflate and deflate operations

Patent number: 8769248

Abstract: Systems and apparatuses are presented relating a programmable processor comprising an execution unit that is operable to decode and execute instructions received from an instruction path and partition data stored in registers in the register file into multiple data elements, the execution unit capable of executing group data handling operations that re-arrange data elements in different ways in response to data handling instructions, the execution unit further capable of executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results.

Type: Grant

Filed: June 11, 2012

Date of Patent: July 1, 2014

Assignee: Microunity Systems Engineering, Inc.

Inventors: Craig Hansen, John Moussouris, Alexia Massalin
RISC processor with instruction executing on different size operand and prefix instruction identifying select flag update for respective size

Patent number: 8762689

Abstract: A RISC data processor in which the number of flags generated by each instruction is increased so that a decrease of flag-generating instructions exceeds an increase of flag-using instructions in quantity, thereby achieving the decrease in instructions. An instruction for generating flags according to operands' data sizes is defined, and an instruction set handled by the RISC data processor includes an instruction capable of executing an operation on operands in more than one data size. An identical operation process is conducted on the small-size operand and on low-order bits of the large-size operand, and flags are generated capable of coping with the respective data sizes regardless of the data size of each operand subjected to the operation. Thus, a reduction in instruction code space of the RISC data processor can be achieved.

Type: Grant

Filed: September 6, 2012

Date of Patent: June 24, 2014

Assignee: Renesas Electronics Corporation

Inventor: Fumio Arakawa
Compressed instruction format

Patent number: 8756403

Abstract: A technique for decoding an instruction in a variable-length instruction set. In one embodiment, an instruction encoding is described, in which legacy, present, and future instruction set extensions are supported, and increased functionality is provided, without expanding the code size and, in some cases, reducing the code size.

Type: Grant

Filed: March 15, 2013

Date of Patent: June 17, 2014

Assignee: Intel Corporation

Inventors: Robert Valentine, Doron Orenstein, Brett L. Toll
Serial memory interface for extended address space

Patent number: 8677100

Abstract: An integrated circuit memory device has a memory array and control logic with at least a first addressing mode in which the instruction includes a first instruction code and an address of a first length; and a second addressing mode in which the instruction includes the first instruction code and an address of a second length. The first length of the address is different from the second length of the address.

Type: Grant

Filed: June 10, 2010

Date of Patent: March 18, 2014

Assignee: Macronix International Co., Ltd.

Inventors: Yulan Kuo, Kuen-Long Chang, Chun-Hsiung Hung
Cracking destructively overlapping operands in variable length instructions

Patent number: 8645669

Abstract: A method, information processing system, and computer program product manage computer executable instructions. At least one machine instruction for execution is received. The at least one machine instruction is analyzed. The machine instruction is identified as a predefined instruction for storing a variable length first operand in a memory location. Responsive to this identification and based on fields of the machine instruction, a relative location of a variable length second operand of the instruction with location of the first operand is determined. Responsive to the relative location having the predefined relationship, a first cracking operation is performed. The first cracking operation cracks the instruction into a first set of micro-ops (Uops) to be executed in parallel. The first set of Uops is for storing a first plurality of first blocks in the first operand. Each of said first block to be stored are identical. The first set Uops are executed.

Type: Grant

Filed: May 5, 2010

Date of Patent: February 4, 2014

Assignee: International Business Machines Corporation

Inventors: Khary J. Alexander, Fadi Busaba, Brian Curran, Bruce Giamei, Christian Jacobi
Apparatus and method for marking start and end bytes of instructions in a stream of instruction bytes in a microprocessor having an instruction set architecture in which instructions may include a length-modifying prefix

Patent number: 8612727

Abstract: An apparatus in a microprocessor that has an instruction set architecture in which instructions may include a length-modifying prefix used to select an address/operand size other than a default address/operand size, wherein the apparatus marks the start byte and the end byte of each instruction in a stream of instruction bytes. Decode logic decodes each instruction byte of a predetermined number of instruction bytes to determine whether the instruction byte specifies a length-modifying prefix and generates a start mark and an end mark for each of the instruction bytes based on an address/operand size. Operand/address size logic provides the default operand/address size to the decode logic to use to generate the start and end marks during a first clock cycle during which the decode logic decodes the predetermined number of instruction bytes.

Type: Grant

Filed: October 1, 2009

Date of Patent: December 17, 2013

Assignee: VIA Technologies, Inc.

Inventors: Thomas C. McDonald, John L. Duncan
Odd and even start bit vectors

Patent number: 8589661

Abstract: A method and apparatus are presented for processing a stream of information, including preprocessing the stream, which includes partitioning the stream into packets of interest; determining boundaries for the packets of interest, wherein a packet boundary is either a start location or an end location for a packet; and making a record of the packet boundaries by setting a hint bit in a hint bit vector, a location of the hint bit within the hint bit vector corresponding to a position of the packet in the stream. The hint bit vector is split into two or more vectors, where the hint bits are assigned to one of the vectors on an alternating basis. The packets of interest are processed corresponding to the hint bits assigned to each vector in parallel over multiple clock cycles, wherein an original order of the packets of interest is maintained in the stream.

Type: Grant

Filed: December 7, 2010

Date of Patent: November 19, 2013

Assignee: Advanced Micro Devices, Inc.

Inventors: Mike Butler, Donald A. Priore, Steven Beigelmacher
REGISTER FILE POWER SAVINGS

Publication number: 20130290681

Abstract: A system and method for efficiently reducing the power consumption of register file accesses. A processor is operable to execute instructions with two or more data types, each with an associated size and alignment. Data operands for a first data type use operand sizes equal to an entire width of a physical register within a physical register file. Data operands for a second data type use operand sizes less than an entire width of a physical register. Accesses of the physical register file for operands associated with a non-full-width data type do not access a full width of the physical registers. A given numerical value may be bypassed for the portion of the physical register that is not accessed.

Type: Application

Filed: April 30, 2012

Publication date: October 31, 2013

Inventors: James B. Keller, John H. Mylius, Conrado Blasco-Allue, Gerard R. Williams, III, Sandeep Gupta
COMPRESSED INSTRUCTION FORMAT

Publication number: 20130290682

Abstract: A technique for decoding an instruction in a variable-length instruction set. In one embodiment, an instruction encoding is described, in which legacy, present, and future instruction set extensions are supported, and increased functionality is provided, without expanding the code size and, in some cases, reducing the code size.

Type: Application

Filed: March 15, 2013

Publication date: October 31, 2013

Inventors: Robert Valentine, Doron Orenstein, Brett L. Toll
Lane crossing instruction selecting operand data bits conveyed from register via direct path and lane crossing path for execution

Patent number: 8560811

Abstract: The present invention provides a method and apparatus for handling lane-crossing instructions in an execution pipeline. One embodiment of the method includes conveying bits of an instruction from a register to an execution stage in a pipeline along a first data path that includes a lane crossing stage configured to change a first mapping of the register to the execution stage to a second mapping. The method also includes concurrently conveying the bits along a second data path from the register to the execution stage that bypasses the lane crossing stage. The method further includes selecting the first or second data path to provide the bits to the execution stage.

Type: Grant

Filed: August 5, 2010

Date of Patent: October 15, 2013

Assignee: Advanced Micro Devices, Inc.

Inventor: John M. King
Execution of variable width vector processing instructions

Patent number: 8555034

Abstract: A processing unit executes a vector width instruction in a program and the processing unit obtains and supplies the width of an appropriate vector register that will be used to process variable vector processing instructions. Then, when the processing unit executes variable vector processing instructions in the program, the processing unit processes the variable vector processing instructions using the appropriate vector register with the instructions having the same width as the appropriate vector register. The width that the processing unit obtains may be less than an actual width of the appropriate vector register and may set by the processing unit. In this way, many different vector widths can be supported using a single set of instructions for vector processing. New instructions are not required if vector widths are changed and processing units having vector registers of differing widths do not require different code.

Type: Grant

Filed: December 15, 2009

Date of Patent: October 8, 2013

Assignee: Oracle America, Inc.

Inventor: Peter Carl Damron
Reusing expression graphs in computer programming languages

Patent number: 8549464

Abstract: A reusable expression graph system and method that generates reusable expression graphs that can be used with potentially different input parameters in order to achieve computational efficiency and ease of programming. Reusable expression graph mitigate the need to rebuild an expression for each new value. This is achieved in part by creating a node called a “parameter node.” The parameter node acts as a generic placeholder for a leaf node in the expression graph. In addition, the parameter node acts as a proxy for a bindable term of the leaf node, and the bindable term can be either a value or one or more additional expressions. The parameter node then is bound to the bindable term and the expression is evaluated with that bindable term instead of the placeholder. The parameter node created by embodiments of the reusable expression graph system and method works across many different programming languages.

Type: Grant

Filed: November 22, 2010

Date of Patent: October 1, 2013

Assignee: Microsoft Corporation

Inventors: Barry Clayton Bond, Vivian Sewelson, Daniel Johannes Pieter Leijin, Lubomir Boyanov Litchev
Data processing circuit

Patent number: 8543736

Abstract: A data processing circuit is disclosed in the present invention. The data processing circuit includes a decoder and a number of N-stage circuits. The circuits receive input data from at least a memory and separate the input data into N stages. The circuit process and store the N input data simultaneously to decrease the time of data processing in the data processing circuit.

Type: Grant

Filed: September 8, 2010

Date of Patent: September 24, 2013

Assignee: Etron Technology, Inc.

Inventors: Chien-Chou Chen, Ming-Sung Huang, Wen Min Lu
DATA PROCESSOR TO PROCESS DATA

Publication number: 20130246749

Abstract: A data processor includes a first register file including registers, a second register file including resisters, a number of which is larger than that of the registers of the first register file, an instruction decoder and an operation unit. The instruction decoder decodes an instruction described in first and second instruction formats. The first instruction format includes a first register-addressing field for designating the first register file. The second instruction format includes a second register-addressing field for designating the second register file, a size of which is larger than that of the first register-addressing field. The operation unit executes an instruction described in the first and second instruction formats using operand data stored in the first and second register files, respectively, based on the instruction decoder, and executes operations in parallel, a number of which is determined by a certain field included in the second instruction format.

Type: Application

Filed: April 12, 2013

Publication date: September 19, 2013

Applicant: PANASONIC CORPORATION

Inventors: Takeshi KISHIDA, Masaitsu NAKAJIMA
Apparatus for efficiently determining instruction length instruction within a stream of x86 instruction bytes

Patent number: 8533434

Abstract: An apparatus efficiently determines the length of an instruction within a stream of instruction bytes processed by a microprocessor having a variable instruction length instruction set architecture. The apparatus includes combinatorial logic associated with each instruction byte of the stream, each configured to receive the associated instruction byte and the next instruction byte of the stream and to generate in response thereto a first length, a second length, and a select control. A multiplexor associated with each of the combinatorial logic selects and outputs one of the following inputs based on the select control received from the combinatorial logic: a zero input and the second length received from the combinatorial logic associated with each of the next three instruction bytes of the stream. An adder associated with each of the combinatorial logic and multiplexor adds the first length and the output of the multiplexor to generate the length of the instruction.

Type: Grant

Filed: October 1, 2009

Date of Patent: September 10, 2013

Assignee: VIA Technologies, Inc.

Inventors: John L. Duncan, Thomas C. McDonald
INSTRUCTION SET EXTENSION USING 3-BYTE ESCAPE OPCODE

Publication number: 20130219152

Abstract: A method, apparatus and system are disclosed for decoding an instruction in a variable-length instruction set. The instruction is one of a set of new types of instructions that uses a new escape code value, which is two bytes in length, to indicate that a third opcode byte includes the instruction-specific opcode for a new instruction. The new instructions are defined such the length of each instruction in the opcode map for one of the new escape opcode values may be determined using the same set of inputs, where each of the inputs is relevant to determining the length of each instruction in the new opcode map. For at least one embodiment, the length of one of the new instructions is determined without evaluating the instruction-specific opcode.

Type: Application

Filed: March 15, 2013

Publication date: August 22, 2013

Inventors: James S. Coke, Peter J. Ruscito, Masood Tahir, David B. Jackson, Ves A. Naydenov, Scott D. Rodgers, Bret L. Toll, Frank Binns
Central processing unit and microcontroller

Patent number: 8516225

Abstract: A program data area 38 storing program data is provided in an internal memory unit that a control circuit 31 of a CPU 3 can directly red from. The program data is constituted by instructions each comprising an instruction information part and an operand (i.e., a complementary information part) for use in execution of this instruction information part. The program data area 38 comprises a plurality of 24-bit data areas each having an address indicative thereof. One instruction is stored in one data area such that the instruction information part resides at the beginning of the data area.

Type: Grant

Filed: November 22, 2011

Date of Patent: August 20, 2013

Inventors: Koichi Kitagishi, Masami Fukushima
In-Lane Vector Shuffle Instructions

Publication number: 20130212360

Abstract: In-lane vector shuffle operations are described. In one embodiment a shuffle instruction specifies a field of per-lane control bits, a source operand and a destination operand, these operands having corresponding lanes, each lane divided into corresponding portions of multiple data elements. Sets of data elements are selected from corresponding portions of every lane of the source operand according to per-lane control bits. Elements of these sets are copied to specified fields in corresponding portions of every lane of the destination operand. Another embodiment of the shuffle instruction also specifies a second source operand, all operands having corresponding lanes divided into multiple data elements. A set selected according to per-lane control bits contains data elements from every lane portion of a first source operand and data elements from every corresponding lane portion of the second source operand. Set elements are copied to specified fields in every lane of the destination operand.

Type: Application

Filed: March 15, 2013

Publication date: August 15, 2013

Inventors: Zeev Sperber, Robert Valentine, Benny Eitan, Doron Orenstein
Compressed instruction format

Patent number: 8504802

Abstract: A system, techniques and apparatus are described for decoding an instruction in an a variable-length instruction set. An instruction encoding is described, in which legacy, present, and future instruction set extensions are supported, and increased functionality is provided, without expanding the code size and, in some cases, reducing the code size.

Type: Grant

Filed: September 7, 2012

Date of Patent: August 6, 2013

Assignee: Intel Corporation

Inventors: Robert Valentine, Doron Orenstien, Bret Toll
Instruction length based cracking for instruction of variable length storage operands

Patent number: 8495341

Abstract: A method, information processing system, and computer program product manage variable operand length instructions. At least one variable operand length instruction is received. The at least one variable operand length instruction is analyzed. A length of at least one operand in the variable operand length instruction is identified based on the analyzing. The at least one variable operand length instruction is organized into a set of unit of operations. The set of unit of operations are executed. The executing increases one or more performance metrics of the at least one variable operand length instruction.

Type: Grant

Filed: February 17, 2010

Date of Patent: July 23, 2013

Assignee: International Business Machines Corporation

Inventors: Fadi Busaba, Brian Curran, Bruce Giamei, Christian Jacobi, Wen Li
Data packet arithmetic logic devices and methods

Patent number: 8473719

Abstract: New instruction definitions for a packet add (PADD) operation and for a single instruction multiple add (SMAD) operation are disclosed. In addition, a new dedicated PADD logic device that performs the PADD operation in about one to two processor clock cycles is disclosed. Also, a new dedicated SMAD logic device that performs a single instruction multiple data add (SMAD) operation in about one to two clock cycles is disclosed.

Type: Grant

Filed: October 31, 2006

Date of Patent: June 25, 2013

Assignee: Intel Corporation

Inventors: Corey Gee, Bapiraju Vinnakota, Saleem Mohammadali, Carl A. Alberola
Indirect indexing instructions

Patent number: 8452945

Abstract: A data processor includes an instruction decoder, an execution unit, a general-purpose register file, and an index-register file. The instruction set for the data processor includes indirect-indexing instructions to facilitate table lookups. When executing such an instruction, the execution unit reads an index stored at an index-register location specified by the instruction. The index refers to a general-purpose register location, which is then read and copied to a general-purpose register location as specified by the instruction. The disclosed execution unit includes four functional units, each with two read ports and a write port so that eight table lookups can be performed in parallel.

Type: Grant

Filed: September 17, 2002

Date of Patent: May 28, 2013

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Dale Morris
Apparatus and method for marking start and end bytes of instructions in a stream of instruction bytes in a microprocessor having an instruction set architecture in which instructions may include a length-modifying prefix

Patent number: 8443172

Abstract: An apparatus in a microprocessor that has an instruction set architecture in which instructions may include a length-modifying prefix used to select an address/operand size other than a default address/operand size, wherein the apparatus marks the start byte and the end byte of each instruction in a stream of instruction bytes. Decode logic decodes each instruction byte of a predetermined number of instruction bytes to determine whether the instruction byte specifies a length-modifying prefix and generates a start mark and an end mark for each of the instruction bytes based on an address/operand size. Operand/address size logic provides the default operand/address size to the decode logic to use to generate the start and end marks during a first clock cycle during which the decode logic decodes the predetermined number of instruction bytes.

Type: Grant

Filed: October 1, 2009

Date of Patent: May 14, 2013

Assignee: VIA Technologies, Inc.

Inventors: Thomas C. McDonald, John L. Duncan
Method for instructing a data processor to process data

Patent number: 8443173

Abstract: A data processor which executes instructions described in first and second instruction formats. The first instruction format defines a register-addressing field of a predetermined size, while the second instruction format defines a register-addressing field of a size larger than that of the register-addressing field defined by the first instruction format. The data processor includes: instruction-type identifier, responsive to an instruction, for identifying the received instruction as being described in the first or second instruction format by the instruction itself; a first register file including a plurality of registers; and a second register file also including a plurality of registers, the number of the registers included in the second register file being larger than that of the registers included in the first register file.

Type: Grant

Filed: June 3, 2011

Date of Patent: May 14, 2013

Assignee: Panasonic Corporation

Inventors: Takeshi Kishida, Masaitsu Nakajima
Method and Apparatus for Packing Packed Data

Publication number: 20130117539

Abstract: An apparatus includes an instruction decoder, first and second source registers and a circuit coupled to the decoder to receive packed data from the source registers and to unpack the packed data responsive to an unpack instruction received by the decoder. A first packed data element and a third packed data element are received from the first source register. A second packed data element and a fourth packed data element are received from the second source register. The circuit copies the packed data elements into a destination register resulting with the second packed data element adjacent to the first packed data element, the third packed data element adjacent to the second packed data element, and the fourth packed data element adjacent to the third packed data element.

Type: Application

Filed: December 29, 2012

Publication date: May 9, 2013

Inventors: Alexander Peleg, Yaakov Yaari, Millind Mittal, Larry M. Mennemeier, Benny Eitan
METHOD AND APPARATUS FOR UNPACKING PACKED DATA

Publication number: 20130117540

Abstract: An apparatus includes an instruction decoder, first and second source registers and a circuit coupled to the decoder to receive packed data from the source registers and to unpack the packed data responsive to an unpack instruction received by the decoder. A first packed data element and a third packed data element are received from the first source register. A second packed data element and a fourth packed data element are received from the second source register. The circuit copies the packed data elements into a destination register resulting with the second packed data element adjacent to the first packed data element, the third packed data element adjacent to the second packed data element, and the fourth packed data element adjacent to the third packed data element.

Type: Application

Filed: December 29, 2012

Publication date: May 9, 2013

Inventors: Alexander Peleg, Yaakov Yaari, Millind Mittal, Larry M. Mennemeier, Benny Eitan
Instruction extraction through prefix accumulation

Patent number: 8438367

Abstract: An apparatus has a queue, each entry stores a different line of a stream of instruction bytes and accumulated prefix information associated with each instruction byte.

Type: Grant

Filed: October 1, 2009

Date of Patent: May 7, 2013

Assignee: VIA Technologies, Inc.

Inventors: Thomas C. McDonald, John L. Duncan
Flag generation and use in processor with same processing for operation on small size operand as low order bits portion of operation on large size operand

Patent number: 8402254

Abstract: The RISC data processor is based on the idea that in case that there are many flag-generating instructions, the number of flags generated by each instruction is increased so that a decrease of flag-generating instructions exceeds an increase of flag-using instructions in quantity, thereby achieving the decrease in instructions. With the data processor, an instruction for generating flags according to operands' data sizes is defined. To an instruction set handled by the RISC data processor, an instruction capable of executing an operation on operand in more than one data size, which performs a process identical to an operation process conducted on the small-size operand on low-order bits of the large-size operand, and generates flags capable of coping with the respective data sizes regardless of the data size of each operand subjected to the operation is added. Thus, the reduction in instruction code space of the RISC data processor tight in instruction code space can be achieved.

Type: Grant

Filed: February 11, 2009

Date of Patent: March 19, 2013

Assignee: Renesas Electronics Corporation

Inventor: Fumio Arakawa
Determining length of instruction with address form field exclusive of evaluating instruction specific opcode in three byte escape opcode

Patent number: 8402252

Abstract: A method, apparatus and system are disclosed for decoding an instruction in a variable-length instruction set. The instruction is one of a set of new types of instructions that uses a new escape code value, which is two bytes in length, to indicate that a third opcode byte includes the instruction-specific opcode for a new instruction. The new instructions are defined such the length of each instruction in the opcode map for one of the new escape opcode values may be determined using the same set of inputs, where each of the inputs is relevant to determining the length of each instruction in the new opcode map. For at least one embodiment, the length of one of the new instructions is determined without evaluating the instruction-specific opcode.

Type: Grant

Filed: March 10, 2012

Date of Patent: March 19, 2013

Assignee: Intel Corporation

Inventors: James S. Coke, Peter J. Ruscito, Masood Tahir, David B. Jackson, Ves A. Naydenov, Scott D. Rodgers, Bret L. Toll, Frank Binns
Bitstream Buffer Manipulation With A SIMD Merge Instruction

Publication number: 20130061024

Abstract: Method, apparatus, and program means for performing bitstream buffer manipulation with a SIMD merge instruction. The method of one embodiment comprises determining whether any unprocessed data bits for a partial variable length symbol exist in a first data block is made. A shift merge operation is performed to merge the unprocessed data bits from the first data block with a second data block. A merged data block is formed. A merged variable length symbol comprised of the unprocessed data bits and a plurality of data bits from the second data block is extracted from the merged data block.

Type: Application

Filed: November 5, 2012

Publication date: March 7, 2013

Inventors: Yen-Kueng Chen, William W. Macy, JR., Matthew Holliman, Eric L. Debes, Minerva M. Yeung
Bitstream Buffer Manipulation With A SIMD Merge Instruction

Publication number: 20130061025

Abstract: Method, apparatus, and program means for performing bitstream buffer manipulation with a SIMD merge instruction. The method of one embodiment comprises determining whether any unprocessed data bits for a partial variable length symbol exist in a first data block is made. A shift merge operation is performed to merge the unprocessed data bits from the first data block with a second data block. A merged data block is formed. A merged variable length symbol comprised of the unprocessed data bits and a plurality of data bits from the second data block is extracted from the merged data block.

Type: Application

Filed: November 5, 2012

Publication date: March 7, 2013

Inventors: Yen-Kueng Chen, William W. Macy, Matthew Holliman, Eric L. Debes, Minerva M. Yeung
Microprocessor and method for register addressing therein

Patent number: 8364934

Abstract: A microprocessor architecture comprising a microprocessor operably coupled to a plurality of registers and arranged to execute at least one instruction. The microprocessor is arranged to determine a class of data operand. The at least one instruction comprises one or more codes in a register specifier that indicates whether relative addressing or absolute addressing is used in accessing a register. In this manner, absolute and relative register addressing is supported within a single instruction word.

Type: Grant

Filed: July 11, 2006

Date of Patent: January 29, 2013

Assignee: Freescale Semiconductor, Inc.

Inventor: Martin Raubuch
SIMD SIGN OPERATION

Publication number: 20120331272

Abstract: Method, apparatus, and program means for nonlinear filtering and deblocking applications utilizing SIMD sign and absolute value operations. The method of one embodiment comprises receiving first data for a first block and second data for a second block. The first data and said second data are comprised of a plurality of rows and columns of pixel data. A block boundary between the first block and the second block is characterized. A correction factor for a deblocking algorithm is calculated with a first instruction for a sign operation that multiplies and with a second instruction for an absolute value operation. Data for pixels located along said block boundary between the first and second block are corrected.

Type: Application

Filed: September 4, 2012

Publication date: December 27, 2012

Inventors: William W. Macy, JR., Huy V. Nguyen
Early release of cache data with start/end marks when instructions are only partially present

Patent number: 8335910

Abstract: An apparatus extracts instructions from a stream of undifferentiated instruction bytes in a microprocessor having an instruction set architecture in which the instructions are variable length. Decoders generate an associated start/end mark for each instruction byte of a line from a first queue of entries each storing a line of instruction bytes. A second queue has entries each storing a line received from the first queue along with the associated start/end marks.

Type: Grant

Filed: October 1, 2009

Date of Patent: December 18, 2012

Assignee: VIA Technologies, Inc.

Inventor: Thomas C. McDonald
Architecture and implementation method of programmable arithmetic controller for cryptographic applications

Patent number: 8302083

Abstract: An architecture includes a controller. The controller is configured to receive a microprogram. The microprogram is configured for performing at least one of hierarchical or a sequence of polynomial computations. The architecture also includes an arithmetic logic unit (ALU) communicably coupled to the controller. The ALU is controlled by the controller. Additionally, the microprogram is compiled prior to execution by the controller, the microprogram is compiled into a plurality of binary tables, and the microprogram is programmed in a command language in which each command includes a first portion for indicating at least one of a command or data transferred to the ALU, and a second portion for including a control command to the controller. The architecture and implementation of the programmable controller may be for cryptographic applications, including those related to public key cryptography.

Type: Grant

Filed: January 23, 2009

Date of Patent: October 30, 2012

Assignee: LSI Corporation

Inventors: Anatoli A. Bolotov, Mikhail I. Grinchuk, Lav Ivanovic, Alexei Galatenko
IMPLEMENTING INSTRUCTION SET ARCHITECTURES WITH NON-CONTIGUOUS REGISTER FILE SPECIFIERS

Publication number: 20120265967

Abstract: There are provided methods and computer program products for implementing instruction set architectures with non-contiguous register file specifiers. A method for processing instruction code includes processing an instruction of an instruction set using a non-contiguous register specifier of a non-contiguous register specification. The instruction includes the non-contiguous register specifier.

Type: Application

Filed: March 21, 2012

Publication date: October 18, 2012

Applicant: International Business Machines Corporation

Inventors: Michael Karl Gschwind, Robert K. Montoye, Brett Olsson, John-David Wellman
MIXED SIZE DATA PROCESSING OPERATION

Publication number: 20120233444

Abstract: A data processing system 2 includes a processor core 4 and a memory 6. The processor core 4 includes processing circuitry 12, 14, 16, 18, 26 controlled by control signals generated by decoder circuitry 24 which decodes program instructions. The program instructions include mixed operand size instructions (either load/store instructions or arithmetic instructions) which have a first input operand of a first operand size and a second input operand of a second input operand size where the second operand size is smaller than the first operand size. The processing performed first converts the second operand so as to have the first operand size. The processing then generates a third operand using as inputs the first operand of the first operand size and the second operand now converted to have the first operand size.

Type: Application

Filed: January 19, 2012

Publication date: September 13, 2012

Inventors: Nigel John Stephens, David James Seal
SEGMENTAL ALLOCATION METHOD OF EXPANDING RISC PROCESSOR REGISTER

Publication number: 20120210100

Abstract: A segmental allocation method of expanding RISC processor register includes the steps of a) setting an instruction format of the RISC processor, the destination register field being set having 6 bits to correspond to 64 registers and at least one source register field having at least 4 bits to correspond to at least 16 registers; b) providing two solutions to the problem resulting from that the instruction format in the step a) goes beyond range under some circumstances; and c) setting a register segment allocation algorithm having the steps of c1) providing and grouping a plurality of pseudo registers; c2) prioritizing the pseudo registers in each of the groups; c3) combining the groups pursuant to the priorities thereof; and c4) locating the physical register of lowest computational cost.

Type: Application

Filed: September 9, 2011

Publication date: August 16, 2012

Inventors: Rong-Guey CHANG, Yuan-Shin HWANG, Chia-Hsien SU
INSTRUCTION SET EXTENSION USING 3-BYTE ESCAPE OPCODE

Publication number: 20120173852

Abstract: A method, apparatus and system are disclosed for decoding an instruction in a variable-length instruction set. The instruction is one of a set of new types of instructions that uses a new escape code value, which is two bytes in length, to indicate that a third opcode byte includes the instruction-specific opcode for a new instruction. The new instructions are defined such the length of each instruction in the opcode map for one of the new escape opcode values may be determined using the same set of inputs, where each of the inputs is relevant to determining the length of each instruction in the new opcode map. For at least one embodiment, the length of one of the new instructions is determined without evaluating the instruction-specific opcode.

Type: Application

Filed: March 10, 2012

Publication date: July 5, 2012

Inventors: James S. Coke, Peter J. Ruscito, Masood Tahir, David B. Jackson, Ves A. Naydenov, Scott D. Rodgers, Bret L. Toll, Frank Binns
Handling Media Streams In A Programmable Bit Processor

Publication number: 20120159128

Abstract: In one embodiment, the present invention is directed to a bit processor that includes an execution unit to, responsive to an instruction for access of data of a first bit width, access data of a second bit width, the second bit width having a different number of bits than the first bit width when some of the data accessed includes non-stream data. Other embodiments are described and claimed.

Type: Application

Filed: December 16, 2010

Publication date: June 21, 2012

Inventors: Raul Gutierrez, Suryaprasad Kareenahalli, Daniel Nemiroff, Balaji Vembu
PROGRAMMABLE LOGIC ARRAY AND READ-ONLY MEMORY AREA REDUCTION USING CONTEXT-SENSITIVE LOGIC FOR DATA SPACE MANIPULATION

Publication number: 20120159129

Abstract: A computer, circuit, and computer-readable medium are disclosed. In one embodiment, the processor includes an instruction decoder unit that can decode a macro instruction into at least one micro-operation with a set of data fields. The resulting micro-operation has at least one data field that is in a compressed form. The instruction decoder unit has storage that can store the micro-operation with the compressed-form data field. The instruction decoder unit also has extraction logic that is capable of extracting the compressed-form data field into an uncompressed-form data field. After extraction, the instruction decoder unit also can send the micro-operation with the extracted uncompressed-form data field to an execution unit. The computer also includes an execution unit capable of executing the sent micro-operation.

Type: Application

Filed: December 17, 2010

Publication date: June 21, 2012

Inventors: Kameswar Subramaniam, Anthony Wojciechowski, Jonathan D. Combs
Apparatus and method for scheduling threads in multi-threading processors

Patent number: 8205204

Abstract: An multi-threading processor is provided. The multi-threading processor includes a first instruction fetch unit to receive a first thread and a second instruction fetch unit to receive a second thread. A multi-thread scheduler coupled to the instruction fetch units and a execution unit. The multi-thread scheduler determines the width of the execution unit and the execution unit executes the threads accordingly.

Type: Grant

Filed: January 23, 2009

Date of Patent: June 19, 2012

Assignee: Intel Corporation

Inventors: Ken Shoemaker, Sailesh Kottapalli, Kin-Kee Sit

prev 1 2 3 4 5 6 7 … next