Patents Examined by George Giroux

Anticipated prefetching for a parent core in a multi-core chip

Patent number: 9792120

Abstract: Embodiments relate to prefetching data on a chip having a scout core and a parent core coupled to the scout core. The method includes determining that a program executed by the parent core requires content stored in a location remote from the parent core. The method includes sending a fetch table address determined by the parent core to the scout core. The method includes accessing a fetch table that is indicated by the fetch table address by the scout core. The fetch table indicates how many of pieces of content are to be fetched by the scout core and a location of the pieces of content. The method includes based on the fetch table indicating, fetching the pieces of content by the scout core. The method includes returning the fetched pieces of content to the parent core.

Type: Grant

Filed: March 5, 2013

Date of Patent: October 17, 2017

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Brian R. Prasky, Fadi Y. Busaba, Steven R. Carlough, Christopher A. Krygowski, Chung-lung K. Shum
Stack pointer and memory access alignment control

Patent number: 9760374

Abstract: A data processing system 2 includes a stack pointer register 26, 28, 30, 32 storing a stack pointer value for use in stack access operations to a stack data store 44, 46, 48, 50. Stack alignment checking circuitry 36 which is selectively disabled may be provided to check memory address alignment of the stack pointer value associated with a stack memory access. The action of the stack alignment checking circuitry 36 is independent of any further other alignment checking performed in respect of all memory accesses. Thus, general alignment checking circuitry 38 may be provided and independently selectively disabled in respect of any memory access.

Type: Grant

Filed: June 28, 2011

Date of Patent: September 12, 2017

Assignee: ARM Limited

Inventor: Richard Roy Grisenthwaite
Multiprocessor messaging system

Patent number: 9760526

Abstract: A multiprocessor system includes a first microprocessor and a second microprocessor. An external memory system is coupled to the first and second microprocessors and is configured to receive and temporarily store messages transferred between the first and second microprocessors. A first signaling pathway may be configured to send message transmission coordination signals from the first microprocessor to the second microprocessor. A second signaling pathway may be configured to send message transmission coordination signals from the second microprocessor to the first microprocessor. The first signaling pathway may be independent of the second signaling pathway. The first signaling pathway may be coupled to at least two flag registers associated with the second microprocessor. The second signaling pathway may be coupled to at least two flag registers associated with the first microprocessor.

Type: Grant

Filed: September 30, 2011

Date of Patent: September 12, 2017

Assignee: EMC IP Holdings Company LLC

Inventor: Paul A. Shubel
Register files for storing data operated on by instructions of multiple widths

Patent number: 9760375

Abstract: A processor core includes even and odd execution slices each having a register file. The slices are each configured to perform operations specified in a first set of instructions on data from its respective register file, and together configured to perform operations specified in a second set of instructions on data stored across both register files. During utilization, the processor receives a first instruction of the first set specifying an operation, a target register, and a source register. Next, a second instruction upon which content of the source register depends is identified as being of the second set. In response, the first instruction is dispatched to the even slice. In accordance with the operation specified in the first instruction, the even slice uses content of the source register in its register file to produce a result. Copies of the result are written to the target register in both register files.

Type: Grant

Filed: September 9, 2014

Date of Patent: September 12, 2017

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Maarten J. Boersma, Markus Kaltenbach, David Lang, Jentje Leenstra
Register files for storing data operated on by instructions of multiple widths

Patent number: 9740486

Abstract: A processor core includes even and odd execution slices each having a register file. The slices are each configured to perform operations specified in a first set of instructions on data from its respective register file, and together configured to perform operations specified in a second set of instructions on data stored across both register files. During utilization, the processor receives a first instruction of the first set specifying an operation, a target register, and a source register. Next, a second instruction upon which content of the source register depends is identified as being of the second set. In response, the first instruction is dispatched to the even slice. In accordance with the operation specified in the first instruction, the even slice uses content of the source register in its register file to produce a result. Copies of the result are written to the target register in both register files.

Type: Grant

Filed: December 18, 2014

Date of Patent: August 22, 2017

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Maarten J. Boersma, Markus Kaltenbach, David Lang, Jentje Leenstra
Instruction and logic to monitor loop trip count and remove loop optimizations

Patent number: 9715388

Abstract: Logic and instruction to monitor loop trip count are disclosed. Loop trip count information of a loop may be stored in a dedicated hardware buffer. Average loop trip count of the loop may be calculated based on the stored loop trip count information. Based on the average trip count, loop optimizations may be removed from the loop. The stored loop trip count information may include an identifier identifying the loop, a total loop trip count of the loop, and an exit count of the loop.

Type: Grant

Filed: March 30, 2012

Date of Patent: July 25, 2017

Assignee: Intel Corporation

Inventors: Jaewoong Chung, Hyunchul Park, Hongbo Rong, Cheng Wang, Youfeng Wu
Efficient hardware instructions for processing bit vectors for single instruction multiple data processors

Patent number: 9697174

Abstract: A method and apparatus for efficiently processing data in various formats in a single instruction multiple data (“SIMD”) architecture is presented. Specifically, a method to unpack a fixed-width bit values in a bit stream to a fixed width byte stream in a SIMD architecture is presented. A method to unpack variable-length byte packed values in a byte stream in a SIMD architecture is presented. A method to decompress a run length encoded compressed bit-vector in a SIMD architecture is presented. A method to return the offset of each bit set to one in a bit-vector in a SIMD architecture is presented. A method to fetch bits from a bit-vector at specified offsets relative to a base in a SIMD architecture is presented. A method to compare values stored in two SIMD registers is presented.

Type: Grant

Filed: September 10, 2013

Date of Patent: July 4, 2017

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Amit Ganesh, Shasank K. Chavan, Vineet Marwah, Jesse Kamp, Anindya C. Patthak, Michael J. Gleeson, Allison L. Holloway, Roger Macnicol
Vector instructions to enable efficient synchronization and parallel reduction operations

Patent number: 9678750

Abstract: In one embodiment, a processor may include a vector unit to perform operations on multiple data elements responsive to a single instruction, and a control unit coupled to the vector unit to provide the data elements to the vector unit, where the control unit is to enable an atomic vector operation to be performed on at least some of the data elements responsive to a first vector instruction to be executed under a first mask and a second vector instruction to be executed under a second mask. Other embodiments are described and claimed.

Type: Grant

Filed: March 12, 2013

Date of Patent: June 13, 2017

Assignee: Intel Corporation

Inventors: Mikhail Smelyanskiy, Victor Lee, Christopher Hughes, Daehyun Kim, Yen-Kuang Chen, Changkyu Kim, Jatin Chhugani, Anthony D. Nguyen, Sanjeev Kumar
Scheduling apparatus and method of dynamically setting the size of a rotating register

Patent number: 9678752

Abstract: A scheduling apparatus for dynamically setting a size of a rotating register of a local register file during runtime ids provided. The scheduling apparatus may include a determiner configured to determine whether a non-rotating register of a central register file is sufficient to schedule a program loop; a selector configured to select at least one local register file to which a needed non-rotating register is allocated in response to a determination that the non-rotating register of a central register file has a size which is sufficient to loop a program loop; a scheduler configured to schedule a non-rotating register of the at least one selected local register file.

Type: Grant

Filed: April 22, 2014

Date of Patent: June 13, 2017

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Tai-Song Jin
Processing quantum information

Patent number: 9663358

Abstract: A quantum information processor can include a control system and a system of processor nodes. Each of the processor nodes can include multiple qubits and an actuator. The control system can manipulate the qubits of multiple processor nodes based on cross-node quantum interactions between the qubits. In some instances, the control system may perform multi-qubit quantum gates on qubits of different processor nodes based on the cross-node quantum interactions. Within each processor node, the qubits interact with the actuator by an intra-node quantum coupling. Between processor nodes, the actuators interact with each other by an inter-node quantum coupling. The cross-node quantum interaction can be produced by non-commutivity of the intra-node quantum couplings and the inter-node quantum couplings. In some instances, the qubits can be manipulated by applying a control sequence that produces an interaction frame where the cross-node quantum interaction dominates the time evolution of the system.

Type: Grant

Filed: June 26, 2012

Date of Patent: May 30, 2017

Assignee: Quantum Valley Investment Fund LP

Inventors: David G. Cory, Troy W. Borneman, Christopher E. Granade
Banked physical register data flow architecture in out-of-order processors

Patent number: 9652246

Abstract: In a method of executing instructions in a processing system, respective global age tags are assigned to each of the one or more instructions fetched for processing by the processing system. Each global age tag indicates an age of the corresponding instruction in the processing system. Respective physical registers in a physical register file are allocated to each destination logical register referenced by each instruction. The respective global age tags are written to the in respective physical registers allocated to the destination logical registers of the instructions. The instructions are executed by the processing system. At least some of the instructions are executed in an order different from a program order of the instructions.

Type: Grant

Filed: December 20, 2013

Date of Patent: May 16, 2017

Assignee: Marvell International Ltd.

Inventors: Kit Sang Tam, Winston Lee
Indirect function call instructions in a synchronous parallel thread processor

Patent number: 9639365

Abstract: An indirect branch instruction takes an address register as an argument in order to provide indirect function call capability for single-instruction multiple-thread (SIMT) processor architectures. The indirect branch instruction is used to implement indirect function calls, virtual function calls, and switch statements to improve processing performance compared with using sequential chains of tests and branches.

Type: Grant

Filed: November 12, 2012

Date of Patent: May 2, 2017

Assignee: NVIDIA Corporation

Inventors: Brett W. Coon, John R. Nickolls, Lars Nyland, Peter C. Mills, John Erik Lindholm
IT instruction pre-decode

Patent number: 9626185

Abstract: Various techniques for processing and pre-decoding branches within an IT instruction block. Instructions are fetched and cached in an instruction cache, and pre-decode bits are generated to indicate the presence of an IT instruction and the likely boundaries of the IT instruction block. If an unconditional branch is detected within the likely boundaries of an IT instruction block, the unconditional branch is treated as if it were a conditional branch. The unconditional branch is sent to the branch direction predictor and the predictor generates a branch direction prediction for the unconditional branch.

Type: Grant

Filed: February 22, 2013

Date of Patent: April 18, 2017

Assignee: Apple Inc.

Inventors: Shyam Sundar, Ian D. Kountanis, Conrado Blasco-Allue, Gerard R. Williams, III, Wei-Han Lien, Ramesh B. Gunna
Transactional memory system supporting unbroken suspended execution

Patent number: 9626187

Abstract: Mechanisms are provided, in a data processing system having a processor and a transactional memory, for executing a transaction in the data processing system. These mechanisms execute a transaction comprising one or more instructions that modify at least a portion of the transactional memory. The transaction is suspended in response to a transaction suspend instruction being executed by the processor. A suspended block of code is executed in a non-transactional manner while the transaction is suspended. A determination is made as to whether an interrupt occurs while the transaction is suspended. In response to an interrupt occurring while the transaction is suspended, a transaction abort operation is delayed until after the transaction suspension is discontinued.

Type: Grant

Filed: May 27, 2010

Date of Patent: April 18, 2017

Assignee: International Business Machines Corporation

Inventors: Harold W. Cain, III, Bradly G. Frey, Benjamin Herrenschmidt, Hung Q. Le, Cathy May, Maged M. Michael, Jose E. Moreira, Priya A. Nagpurkar, Naresh Nayar, Randal C. Swanberg
MFENCE and LFENCE micro-architectural implementation method and system

Patent number: 9612835

Abstract: A system and method for fencing memory accesses. Memory loads can be fenced, or all memory access can be fenced. The system receives a fencing instruction that separates memory access instructions into older accesses and newer accesses. A buffer within the memory ordering unit is allocated to the instruction. The access instructions newer than the fencing instruction are stalled. The older access instructions are gradually retired. When all older memory accesses are retired, the fencing instruction is dispatched from the buffer.

Type: Grant

Filed: September 14, 2012

Date of Patent: April 4, 2017

Assignee: Intel Corporation

Inventors: Salvador Palanca, Stephen A. Fischer, Subramaniam Maiyuran, Shekoufeh Qawami
Method and apparatus for obtaining a call stack to an event of interest and analyzing the same

Patent number: 9582275

Abstract: In one embodiment, a processor includes a performance monitor including a last branch record (LBR) stack to store a call stack to an event of interest, where the call stack is collected responsive to a trigger for the event. The processor further includes logic to control the LBR stack to operate in a call stack mode such that an entry to a call instruction for a leaf function is cleared on return from the leaf function. Other embodiments are described and claimed.

Type: Grant

Filed: May 31, 2011

Date of Patent: February 28, 2017

Assignee: Intel Corporation

Inventors: Michael W. Chynoweth, Peggy J. Irelan, Matthew C. Merten, Seung-Woo Kim, Laura A. Knauth, Stanislav Bratanov
Apparatus and methods for in-application programming of flash-based programable logic devices

Patent number: 9582266

Abstract: An architecture disposed in an integrated circuit for in-application programming of flash-based programmable logic devices includes a processor coupled to a processor system bus. An I/O peripheral is coupled to the processor over the system bus and is also coupled to an off-chip data source. A programmable logic device fabric includes flash-based programmable devices. A program controller is coupled to the flash-based programmable devices. An in-application programming controller is coupled to the program controller and is coupled to the processor over the system bus.

Type: Grant

Filed: February 28, 2011

Date of Patent: February 28, 2017

Assignee: Microsemi SoC Corporation

Inventors: Venkatesh Narayanan, Kenneth R. Irving, Ming-Hoe Kiu
System and method for managing code isolation

Patent number: 9582302

Abstract: A computing system is configured to use a trampoline to isolate sensitive code in a virtual environment and in other applications. An import table may describe the entry points of a privileged code module or driver that generates privileged code. A system and method loads a shadow kernel to facilitate isolating the linkage between drivers and the rest of the system. The shadow kernel may be a copy of the operating system kernel that does not have the same integral position in the operation of the computing device. The shadow kernel may be used as a template for creating a jump table to redirect more critical privileged resource access requests from specially loaded kernel mode drivers to the PVM. All requests may pass through the PVM, which redirects non-critical functions to the original kernel. Multiple copies of a given driver or code module may be loaded in a given session.

Type: Grant

Filed: March 7, 2013

Date of Patent: February 28, 2017

Assignee: Citrix Systems, Inc.

Inventors: Michael Larkin, James Yarbrough, Yashabh Sethi
Zero cycle move

Patent number: 9575754

Abstract: A system and method for reducing the latency of data move operations. A register rename unit within a processor determines whether a decoded move instruction is eligible for a zero cycle move operation. If so, control logic assigns a physical register identifier associated with a source operand of the move instruction to the destination operand of the move instruction. Additionally, the register rename unit marks the given move instruction to prevent it from proceeding in the processor pipeline. Further maintenance of the particular physical register identifier may be done by the register rename unit during commit of the given move instruction.

Type: Grant

Filed: April 16, 2012

Date of Patent: February 21, 2017

Assignee: Apple Inc.

Inventors: James B. Keller, John H. Mylius, Conrado Blasco-Allue, Gerard R. Williams, III, Suparn Vats
Dynamic propagation with iterative pipeline processing

Patent number: 9529599

Abstract: Systems, apparatuses, methods, and software for processing data in pipeline architectures are provided herein. In one example, a pipeline architecture is presented. The pipeline architecture includes a plurality of processing stages, linked in series, that iteratively process data as the data propagates through the plurality of processing stages. The pipeline architecture includes at least one other processing stage linked in series with and preceded by the plurality of processing stages and configured to iteratively process the data a number of times based at least on an iteration count comprising how many times the data was iteratively processed as the data propagated through the plurality of processing stages.

Type: Grant

Filed: February 12, 2013

Date of Patent: December 27, 2016

Inventor: William Erik Anderson

prev … 7 8 9 10 11 12 13 14 15 … next