Patents by Inventor Srikanth T. Srinivasan

Srikanth T. Srinivasan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System, apparatus and method for program order queue (POQ) to manage data dependencies in processor having multiple instruction queues

Patent number: 11243775

Abstract: In one embodiment, an apparatus includes: a plurality of registers; a first instruction queue to store first instructions; a second instruction queue to store second instructions; a program order queue having a plurality of portions each associated with one of the plurality of registers, each of the portions having entries to store a state of an instruction, the state comprising an encoding of a use of the register by the instruction and a source instruction queue for the instruction; and a dispatcher to dispatch for execution the first and second instructions from the first and second instruction queues based at least in part on information stored in the program order queue, to manage instruction dependencies between the first instructions and the second instructions. Other embodiments are described and claimed.

Type: Grant

Filed: March 26, 2019

Date of Patent: February 8, 2022

Assignee: Intel Corporation

Inventors: Andrey Ayupov, Srikanth T. Srinivasan, Jonathan D. Pearce, David B. Sheffield
System, apparatus and method for symbolic store address generation for data-parallel processor

Patent number: 11188341

Abstract: In one embodiment, an apparatus includes: a plurality of execution lanes to perform parallel execution of instructions; and a unified symbolic store address buffer coupled to the plurality of execution lanes, the unified symbolic store address buffer comprising a plurality of entries each to store a symbolic store address for a store instruction to be executed by at least some of the plurality of execution lanes. Other embodiments are described and claimed.

Type: Grant

Filed: March 26, 2019

Date of Patent: November 30, 2021

Assignee: Intel Corporation

Inventors: Jeffrey J. Cook, Srikanth T. Srinivasan, Jonathan D. Pearce, David B. Sheffield
Gather-scatter cache architecture having plurality of tag and data banks and arbiter for single program multiple data (SPMD) processor

Patent number: 10896141

Abstract: In one embodiment, a cache memory includes: a plurality of data banks, each of the plurality of data banks having a plurality of entries each to store a portion of a cache line distributed across the plurality of data banks; and a plurality of tag banks decoupled from the plurality of data banks, wherein a tag for a cache line is to be assigned to one of the plurality of tag banks. Other embodiments are described and claimed.

Type: Grant

Filed: March 26, 2019

Date of Patent: January 19, 2021

Assignee: Intel Corporation

Inventors: Jeffrey J. Cook, Jonathan D. Pearce, Srikanth T. Srinivasan, Rishiraj A. Bheda, David B. Sheffield, Abhijit Davare, Anton Alexandrovich Sorokin
Gather-Scatter Cache Architecture For Single Program Multiple Data (SPMD) Processor

Publication number: 20200310992

Abstract: In one embodiment, a cache memory includes: a plurality of data banks, each of the plurality of data banks having a plurality of entries each to store a portion of a cache line distributed across the plurality of data banks; and a plurality of tag banks decoupled from the plurality of data banks, wherein a tag for a cache line is to be assigned to one of the plurality of tag banks. Other embodiments are described and claimed.

Type: Application

Filed: March 26, 2019

Publication date: October 1, 2020

Inventors: Jeffrey J. Cook, Jonathan D. Pearce, Srikanth T. Srinivasan, Rishiraj A. Bheda, David B. Sheffield, Abhijit Davare, Anton Alexandrovich Sorokin
System, Apparatus And Method For Program Order Queue (POQ) To Manage Data Dependencies In Processor Having Multiple Instruction Queues

Publication number: 20200310815

Abstract: In one embodiment, an apparatus includes: a plurality of registers; a first instruction queue to store first instructions; a second instruction queue to store second instructions; a program order queue having a plurality of portions each associated with one of the plurality of registers, each of the portions having entries to store a state of an instruction, the state comprising an encoding of a use of the register by the instruction and a source instruction queue for the instruction; and a dispatcher to dispatch for execution the first and second instructions from the first and second instruction queues based at least in part on information stored in the program order queue, to manage instruction dependencies between the first instructions and the second instructions. Other embodiments are described and claimed.

Type: Application

Filed: March 26, 2019

Publication date: October 1, 2020

Inventors: Andrey Ayupov, Srikanth T. Srinivasan, Jonathan D. Pearce, David B. Sheffield
System, Apparatus And Method For Symbolic Store Address Generation For Data-Parallel Processor

Publication number: 20200310817

Abstract: In one embodiment, an apparatus includes: a plurality of execution lanes to perform parallel execution of instructions; and a unified symbolic store address buffer coupled to the plurality of execution lanes, the unified symbolic store address buffer comprising a plurality of entries each to store a symbolic store address for a store instruction to be executed by at least some of the plurality of execution lanes. Other embodiments are described and claimed.

Type: Application

Filed: March 26, 2019

Publication date: October 1, 2020

Inventors: Jeffrey J. Cook, Srikanth T. Srinivasan, Jonathan D. Pearce, David B. Sheffield
Methods and systems for performing a calculation across a memory array

Patent number: 10418098

Abstract: Devices for computing the sum of multiple Vector-Vector Dot-Products (VVDP) or multiple partial sums of VVDP can include a resistive memory array and a reduction circuit. The reduction circuit can be configured to determine a sum of a selected one or more of a plurality of bit lines of the resistive memory array. A VVDP reduction can be determined from the sum of the selected one or more of the plurality of bit lines.

Type: Grant

Filed: December 28, 2017

Date of Patent: September 17, 2019

Assignee: Intel Corporation

Inventors: Srikanth T. Srinivasan, Shigeki Tomishima
Scheduler implementing dependency matrix having restricted entries

Patent number: 10235180

Abstract: A scheduler implementing a dependency matrix having restricted entries is disclosed. A processing device of the disclosure includes a decode unit to decode an instruction and a scheduler communicably coupled to the decode unit. In one embodiment, the scheduler is configured to receive the decoded instruction, determine that the decoded instruction qualifies for allocation as a restricted reservation station (RS) entry type in a dependency matrix maintained by the scheduler, identify RS entries in the dependency matrix that are free for allocation, allocate one of the identified free RS entries with information of the decoded instruction in the dependency matrix, and update a row of the dependency matrix corresponding to the claimed RS entry with source dependency information of the decoded instruction.

Type: Grant

Filed: December 21, 2012

Date of Patent: March 19, 2019

Assignee: Intel Corporation

Inventors: Srikanth T. Srinivasan, Matthew C. Merten, Bambang Sutanto, Rahul R. Kulkarni, Justin M. Deinlein, James D. Hadley
PRE-POST RETIRE HYBRID HARDWARE LOCK ELISION (HLE) SCHEME

Publication number: 20190065160

Abstract: A method and apparatus for hybrid pre and post-retire tentative access tracking is herein described. Access tracking is often performed during execution of critical sections, which may be defined by traditional locks or transactional memory instructions. Pre-retire accesses to memory are performed to update tracking information for access during execution of a critical section. However, post-retire updates to tracking information are performed for subsequent consecutive critical section accesses in a pipeline when a previous end critical section operation is retired.

Type: Application

Filed: November 7, 2007

Publication date: February 28, 2019

Inventors: Haitham Akkary, Shlomo Raikin, Ravi Rajwar, Gad Sheaffer, Srikanth T. Srinivasan
METHODS AND SYSTEMS FOR PERFORMING A CALCULATION ACROSS A MEMORY ARRAY

Publication number: 20190043572

Abstract: Devices for computing the sum of multiple Vector-Vector Dot-Products (VVDP) or multiple partial sums of VVDP can include a resistive memory array and a reduction circuit. The reduction circuit can be configured to determine a sum of a selected one or more of a plurality of bit lines of the resistive memory array. A VVDP reduction can be determined from the sum of the selected one or more of the plurality of bit lines.

Type: Application

Filed: December 28, 2017

Publication date: February 7, 2019

Applicant: Intel Corporation

Inventors: SRIKANTH T. SRINIVASAN, SHIGEKI TOMISHIMA
Method and apparatus for implementing dynamic portbinding within a reservation station

Patent number: 9904553

Abstract: A processor and method are described for scheduling operations for execution within a reservation station. For example, a method in accordance with one embodiment of the invention includes the operations of: classifying a plurality of operations based on the execution ports usable to execute those operations; allocating the plurality of operations into groups within a reservation station based on the classification, wherein each group is serviced by one or more execution ports corresponding to the classification, and wherein two or more entries within a group share a common read port and a common write port; dynamically scheduling two or more operations in a group for concurrent execution based on the ports capable of executing those operations and a relative age of the operations.

Type: Grant

Filed: May 19, 2016

Date of Patent: February 27, 2018

Assignee: INTEL CORPORATION

Inventors: Bambang Sutanto, Srikanth T. Srinivasan, Matthew C. Merten, Chia Yin Kevin Lai, Ammon J Christiansen, Justin M. Deinlein
POST-RETIRE SCHEME FOR TRACKING TENTATIVE ACCESSES DURING TRANSACTIONAL EXECUTION

Publication number: 20180011748

Abstract: A method and apparatus for post-retire transaction access tracking is herein described. Load and store buffers are capable of storing senior entries. In the load buffer a first access is scheduled based on a load buffer entry. Tracking information associated with the load is stored in a filter field in the load buffer entry. Upon retirement, the load buffer entry is marked as a senior load entry. A scheduler schedules a post-retire access to update transaction tracking information, if the filter field does not represent that the tracking information has already been updated during a pendency of the transaction. Before evicting a line in a cache, the load buffer is snooped to ensure no load accessed the line to be evicted.

Type: Application

Filed: September 8, 2017

Publication date: January 11, 2018

Inventors: Haitham Akkary, Ravi Rajwar, Srikanth T. Srinivasan
Post-retire scheme for tracking tentative accesses during transactional execution

Patent number: 9798590

Abstract: A method and apparatus for post-retire transaction access tracking is herein described. Load and store buffers are capable of storing senior entries. In the load buffer a first access is scheduled based on a load buffer entry. Tracking information associated with the load is stored in a filter field in the load buffer entry. Upon retirement, the load buffer entry is marked as a senior load entry. A scheduler schedules a post-retire access to update transaction tracking information, if the filter field does not represent that the tracking information has already been updated during a pendency of the transaction. Before evicting a line in a cache, the load buffer is snooped to ensure no load accessed the line to be evicted.

Type: Grant

Filed: September 7, 2006

Date of Patent: October 24, 2017

Assignee: Intel Corporation

Inventors: Haitham Akkary, Ravi Rajwar, Srikanth T. Srinivasan
Instruction and logic for non-blocking register reclamation

Patent number: 9652236

Abstract: A processor includes a logic to execute a first instruction and a second instruction. The first instruction is ordered before the second instruction. Each instruction references a respective logical register assigned to a respective physical register. The processor also includes logic to reassign a physical register of the second instruction to another logical register before retirement of the first instruction.

Type: Grant

Filed: December 23, 2013

Date of Patent: May 16, 2017

Assignee: Intel Corporation

Inventors: Srikanth T. Srinivasan, Mark J. Dechene, Yury N. Ilin, Justin M. Deinlein, Christine E. Wang, Matthew C. Merten
METHOD AND APPARATUS FOR IMPLEMENTING DYNAMIC PORTBINDING WITHIN A RESERVATION STATION

Publication number: 20170024213

Abstract: A processor and method are described for scheduling operations for execution within a reservation station. For example, a method in accordance with one embodiment of the invention includes the operations of: classifying a plurality of operations based on the execution ports usable to execute those operations; allocating the plurality of operations into groups within a reservation station based on the classification, wherein each group is serviced by one or more execution ports corresponding to the classification, and wherein two or more entries within a group share a common read port and a common write port; dynamically scheduling two or more operations in a group for concurrent execution based on the ports capable of executing those operations and a relative age of the operations.

Type: Application

Filed: May 19, 2016

Publication date: January 26, 2017

Inventors: Bambang SUTANTO, Srikanth T. SRINIVASAN, Matthew C. MERTEN, Chia Yin Kevin LAI, Ammon J. CHRISTIANSEN, Justin M. DEINLEIN
Two level re-order buffer

Patent number: 9495159

Abstract: In response to detecting one or more conditions are met, a checkpoint of a current state of a thread may be created. One or more incomplete instructions may be moved from a first level of a re-order buffer to a second level of the re-order buffer. Each incomplete instruction may be currently executing or awaiting execution.

Type: Grant

Filed: September 27, 2013

Date of Patent: November 15, 2016

Assignee: Intel Corporation

Inventors: Mark J. Dechene, Srikanth T. Srinivasan, Matthew C. Merten, Tong Li, Christine E. Wang
Method and apparatus for implementing dynamic portbinding within a reservation station

Patent number: 9372698

Abstract: A processor and method are described for scheduling operations for execution within a reservation station. For example, a method in accordance with one embodiment of the invention includes the operations of: classifying a plurality of operations based on the execution ports usable to execute those operations; allocating the plurality of operations into groups within a reservation station based on the classification, wherein each group is serviced by one or more execution ports corresponding to the classification, and wherein two or more entries within a group share a common read port and a common write port; dynamically scheduling two or more operations in a group for concurrent execution based on the ports capable of executing those operations and a relative age of the operations.

Type: Grant

Filed: June 29, 2013

Date of Patent: June 21, 2016

Assignee: INTEL CORPORATION

Inventors: Bambang Sutanto, Srikanth T. Srinivasan, Matthew C. Merten, Chia Yin Kevin Lai, Ammon J Christiansen, Justin M Deinlein
Enhanced loop streaming detector to drive logic optimization

Patent number: 9354875

Abstract: An enhanced loop streaming detection mechanism is provided in a processor to reduce power consumption. The processor includes a decoder to decode instructions in a loop into micro-operations, and a loop streaming detector to detect the presence of the loop in the micro-operations. The processor also includes a loop characteristic tracker unit to identify hardware components downstream from the decoder that are not to be used by the micro-operations in the loop, and to disable the identified hardware components. The processor also includes execution circuitry to execute the micro-operations in the loop with the identified hardware components disabled.

Type: Grant

Filed: December 27, 2012

Date of Patent: May 31, 2016

Assignee: Intel Corporation

Inventors: Matthew C. Merten, Justin M. Deinlein, Yury N. Ilin, Alexandre J. Farcy, Tong Li, Srikanth T. Srinivasan
Critical section detection and prediction mechanism for hardware lock elision

Patent number: 9262173

Abstract: A method and apparatus for detecting lock instructions and lock release instruction, as well as predicting critical sections is herein described. A lock instruction is detected with detection logic, which potentially resides in decode logic. A lock instruction entry associated with the lock instruction is stored/created. Address locations and values to be written to those address location of subsequent potential lock release instruction are compared to the address loaded from by the lock instruction and the value load by the lock instruction. If the addresses and values match, it is determined that the lock release instruction matches the lock instruction. A prediction entry stores a reference to the lock instruction, such as a last instruction pointer (LIP), and an associated value to represent the lock instruction is to be elided upon subsequent detection, if it is determined that the lock release instruction matches the lock instruction.

Type: Grant

Filed: January 13, 2012

Date of Patent: February 16, 2016

Assignee: Intel Corporation

Inventors: Haitham Akkary, Ravi Rajwar, Srikanth T. Srinivasan
Store address prediction for memory disambiguation in a processing device

Patent number: 9244827

Abstract: A processing device implementing store address prediction for memory disambiguation in a processing device is disclosed. A processing device of the disclosure includes a store address predictor to predict an address for store operations that store data to a memory hierarchy. The processing device further includes a store buffer for buffering the store operations prior to completion, the store buffer to comprise the predicted address for each of the store operations. The processing device further includes a load buffer to buffer a load operation, the load operation to reference the store buffer to, based on the predicted addresses, determine whether to speculatively execute ahead of each store operation and to determine whether to speculatively forward data from one of the store operations.

Type: Grant

Filed: September 25, 2013

Date of Patent: January 26, 2016

Assignee: Intel Corporation

Inventors: Ho-Seop Kim, Robert S. Chappell, Choon Y. Soo, Srikanth T. Srinivasan

1 2 3 next