Patents by Inventor Rama S. Gopal

Rama S. Gopal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Memory load to load fusing

Patent number: 10956155

Abstract: A system and a method to cascade execution of instructions in a load-store unit (LSU) of a central processing unit (CPU) to reduce latency associated with the instructions. First data stored in a cache is read by the LSU in response to a first memory load instruction of two immediately consecutive memory load instructions. Alignment, sign extension and/or endian operations are performed on the first data read from the cache in response to the first memory load instruction, and, in parallel, a memory-load address-forwarded result is selected based on a corrected alignment of the first data read in response to the first memory load instruction to provide a next address for a second of the two immediately consecutive memory load instructions. Second data stored in the cache is read by the LSU in response to the second memory load instruction based on the selected memory-load address-forwarded result.

Type: Grant

Filed: May 23, 2019

Date of Patent: March 23, 2021

Inventors: Paul E. Kitchin, Rama S. Gopal, Karthik Sundaram
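
As a rough illustration of the cascaded path described in the abstract above, the following Python sketch models two back-to-back loads where the second load's address comes straight from the data returned by the first. The 16-byte line size, the little-endian layout, the 8-byte pointer width, and every function name are assumptions made for this example; none of them come from the patent. Software runs the two steps one after the other, whereas the abstract describes them as happening in parallel.

```python
# Toy model only: names, sizes, and byte order are assumptions for illustration.
CACHE_LINE_BYTES = 16


def read_line(cache, addr):
    """Return the cache line that contains addr."""
    base = addr - (addr % CACHE_LINE_BYTES)
    return cache[base:base + CACHE_LINE_BYTES]


def align_and_extend(line, offset, size, signed):
    """Full result path: alignment, endian handling, and sign extension."""
    return int.from_bytes(line[offset:offset + size], "little", signed=signed)


def forward_address(line, offset, ptr_bytes=8):
    """Forwarding path: pick the pointer-sized field at the corrected alignment."""
    return int.from_bytes(line[offset:offset + ptr_bytes], "little", signed=False)


def fused_loads(cache, addr1):
    """Run two consecutive loads, the second addressed by the first's data."""
    line1 = read_line(cache, addr1)
    off1 = addr1 % CACHE_LINE_BYTES
    # Modeled sequentially here; the abstract describes these two as parallel.
    result1 = align_and_extend(line1, off1, 8, signed=False)
    addr2 = forward_address(line1, off1)
    line2 = read_line(cache, addr2)
    result2 = align_and_extend(line2, addr2 % CACHE_LINE_BYTES, 8, signed=False)
    return result1, result2


# Usage: address 8 holds a pointer to address 32, which holds 0x1234.
cache = bytearray(64)
cache[8:16] = (32).to_bytes(8, "little")
cache[32:40] = (0x1234).to_bytes(8, "little")
first, second = fused_loads(cache, 8)
```

The point of the sketch is only that the second read does not wait for the fully formatted result of the first; it uses a separately selected address field.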

MEMORY LOAD TO LOAD FUSING

Publication number: 20190278603

Abstract: A system and a method to cascade execution of instructions in a load-store unit (LSU) of a central processing unit (CPU) to reduce latency associated with the instructions. First data stored in a cache is read by the LSU in response to a first memory load instruction of two immediately consecutive memory load instructions. Alignment, sign extension and/or endian operations are performed on the first data read from the cache in response to the first memory load instruction, and, in parallel, a memory-load address-forwarded result is selected based on a corrected alignment of the first data read in response to the first memory load instruction to provide a next address for a second of the two immediately consecutive memory load instructions. Second data stored in the cache is read by the LSU in response to the second memory load instruction based on the selected memory-load address-forwarded result.

Type: Application

Filed: May 23, 2019

Publication date: September 12, 2019

Inventors: Paul E. KITCHIN, Rama S. GOPAL, Karthik SUNDARAM

Memory load to load fusing

Patent number: 10372452

Abstract: A system and a method to cascade execution of instructions in a load-store unit (LSU) of a central processing unit (CPU) to reduce latency associated with the instructions. First data stored in a cache is read by the LSU in response to a first memory load instruction of two immediately consecutive memory load instructions. Alignment, sign extension and/or endian operations are performed on the first data read from the cache in response to the first memory load instruction, and, in parallel, a memory-load address-forwarded result is selected based on a corrected alignment of the first data read in response to the first memory load instruction to provide a next address for a second of the two immediately consecutive memory load instructions. Second data stored in the cache is read by the LSU in response to the second memory load instruction based on the selected memory-load address-forwarded result.

Type: Grant

Filed: June 6, 2017

Date of Patent: August 6, 2019

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Paul E. Kitchin, Rama S. Gopal, Karthik Sundaram

Memory load and arithmetic load unit (ALU) fusing

Patent number: 10275217

Abstract: According to one general aspect, a load unit may include a load circuit configured to load at least one piece of data from a memory. The load unit may include an alignment circuit configured to align the data to generate aligned data. The load unit may also include a mathematical operation execution circuit configured to generate a resultant of a predetermined mathematical operation with the at least one piece of data as an operand. The load unit is configured to, if an active instruction is associated with the predetermined mathematical operation, bypass the alignment circuit and input the piece of data directly to the mathematical operation execution circuit.

Type: Grant

Filed: June 2, 2017

Date of Patent: April 30, 2019

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Rama S. Gopal, Paul E. Kitchin, Karthik Sundaram
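
The bypass described in the abstract above can be pictured with a small Python sketch. This is a toy model under stated assumptions: the "predetermined mathematical operation" is taken to be a byte-wise sum purely for illustration, the alignment step is modeled as a little-endian conversion, and all names are hypothetical rather than drawn from the patent.

```python
# Toy model only: the chosen operation and all names are illustrative assumptions.

def load_circuit(memory, addr, size):
    """Fetch size raw bytes starting at addr."""
    return memory[addr:addr + size]


def alignment_circuit(raw):
    """Normal path: turn the raw bytes into an aligned integer value."""
    return int.from_bytes(raw, "little")


def math_circuit(raw):
    """Hypothetical predetermined operation: sum of the loaded bytes."""
    return sum(raw)


def load_unit(memory, addr, size, fused_math):
    """Load data; skip alignment when the instruction uses the fused operation."""
    raw = load_circuit(memory, addr, size)
    if fused_math:
        # Bypass: raw data goes directly to the math circuit.
        return math_circuit(raw)
    return alignment_circuit(raw)


# Usage: the same four bytes take either the aligned path or the bypass path.
memory = bytes([1, 2, 3, 4])
plain = load_unit(memory, 0, 4, fused_math=False)
fused = load_unit(memory, 0, 4, fused_math=True)
```

The design choice being illustrated is the branch in `load_unit`: an instruction tagged for the fused operation never passes through the alignment step.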

MEMORY LOAD TO LOAD FUSING

Publication number: 20180267800

Abstract: A system and a method to cascade execution of instructions in a load-store unit (LSU) of a central processing unit (CPU) to reduce latency associated with the instructions. First data stored in a cache is read by the LSU in response to a first memory load instruction of two immediately consecutive memory load instructions. Alignment, sign extension and/or endian operations are performed on the first data read from the cache in response to the first memory load instruction, and, in parallel, a memory-load address-forwarded result is selected based on a corrected alignment of the first data read in response to the first memory load instruction to provide a next address for a second of the two immediately consecutive memory load instructions. Second data stored in the cache is read by the LSU in response to the second memory load instruction based on the selected memory-load address-forwarded result.

Type: Application

Filed: June 6, 2017

Publication date: September 20, 2018

Inventors: Paul E. KITCHIN, Rama S. GOPAL, Karthik SUNDARAM

MEMORY LOAD AND ARITHMETIC LOAD UNIT (ALU) FUSING

Publication number: 20180267775

Abstract: According to one general aspect, a load unit may include a load circuit configured to load at least one piece of data from a memory. The load unit may include an alignment circuit configured to align the data to generate aligned data. The load unit may also include a mathematical operation execution circuit configured to generate a resultant of a predetermined mathematical operation with the at least one piece of data as an operand. The load unit is configured to, if an active instruction is associated with the predetermined mathematical operation, bypass the alignment circuit and input the piece of data directly to the mathematical operation execution circuit.

Type: Application

Filed: June 2, 2017

Publication date: September 20, 2018

Inventors: Rama S. GOPAL, Paul E. KITCHIN, Karthik SUNDARAM

Multi-core shared page miss handler

Patent number: 9921967

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Grant

Filed: July 26, 2011

Date of Patent: March 20, 2018

Assignee: Intel Corporation

Inventors: Christopher D. Bryant, Rama S. Gopal
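
The flow in the abstract above (collect requests, arbitrate, walk the page table, then return an address or signal a fault) can be sketched in Python as follows. This is a minimal model under several assumptions: arbitration is first-come, first-served, the page table is a flat dictionary standing in for a real multi-level walk, pages are 4096 bytes, and the class and method names are hypothetical.

```python
from collections import deque

# Toy model only: FIFO arbitration, a flat page table, and all names are assumptions.
PAGE_SIZE = 4096


class SharedPageMissHandler:
    def __init__(self, page_table):
        # Maps virtual page number -> physical frame number.
        self.page_table = page_table
        self.pending = deque()

    def request(self, device_id, virtual_addr):
        """Record a page miss request from one requesting device."""
        self.pending.append((device_id, virtual_addr))

    def arbitrate(self):
        """Pick the next request to service (oldest first in this sketch)."""
        return self.pending.popleft()

    def service_one(self):
        """Translate one request; report a fault if no mapping exists."""
        device_id, virtual_addr = self.arbitrate()
        page_number, offset = divmod(virtual_addr, PAGE_SIZE)
        frame = self.page_table.get(page_number)
        if frame is None:
            return device_id, "fault", None
        return device_id, "ok", frame * PAGE_SIZE + offset


# Usage: two cores share one handler; only virtual page 1 is mapped.
handler = SharedPageMissHandler({1: 7})
handler.request("core0", PAGE_SIZE + 5)
handler.request("core1", 3 * PAGE_SIZE)
first_reply = handler.service_one()
second_reply = handler.service_one()
```

Each reply carries the requesting device's identifier so the result, or the fault, can be routed back to the device that raised the miss.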

Multi-core shared page miss handler

Patent number: 9921968

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Grant

Filed: December 31, 2016

Date of Patent: March 20, 2018

Assignee: Intel Corporation

Inventors: Christopher D. Bryant, Rama S. Gopal

Multi-core shared page miss handler

Patent number: 9892056

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Grant

Filed: December 31, 2016

Date of Patent: February 13, 2018

Assignee: Intel Corporation

Inventors: Christopher D. Bryant, Rama S. Gopal

Multi-core shared page miss handler

Patent number: 9892059

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Grant

Filed: December 31, 2016

Date of Patent: February 13, 2018

Assignee: Intel Corporation

Inventors: Christopher D. Bryant, Rama S. Gopal

MULTI-CORE SHARED PAGE MISS HANDLER

Publication number: 20170147506

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Application

Filed: December 31, 2016

Publication date: May 25, 2017

Inventors: CHRISTOPHER D. BRYANT, RAMA S. GOPAL

MULTI-CORE SHARED PAGE MISS HANDLER

Publication number: 20170116134

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Application

Filed: December 31, 2016

Publication date: April 27, 2017

Inventors: CHRISTOPHER D. BRYANT, RAMA S. GOPAL

MULTI-CORE SHARED PAGE MISS HANDLER

Publication number: 20170109293

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Application

Filed: December 31, 2016

Publication date: April 20, 2017

Inventors: CHRISTOPHER D. BRYANT, RAMA S. GOPAL

MULTI-CORE SHARED PAGE MISS HANDLER

Publication number: 20130031332

Abstract: Methods and apparatus are disclosed for using a shared page miss handler device to satisfy page miss requests of a plurality of devices in a multi-core system. One embodiment of such a method comprises receiving one or more page miss requests from one or more respective requesting devices of the plurality of devices in the multi-core system, and arbitrating to identify a first page miss request of the one or more requesting devices. A page table walk is performed to generate a physical address responsive to the first page miss request. Then the physical address is sent to the corresponding requesting device, or a fault is signaled to an operating system for the corresponding requesting device responsive to the first page miss request.

Type: Application

Filed: July 26, 2011

Publication date: January 31, 2013

Inventors: Christopher D. Bryant, Rama S. Gopal

Cache memory subsystem including a fixed latency R/W pipeline

Patent number: 7251710

Abstract: A cache memory subsystem including a fixed latency read/write pipeline. The cache memory subsystem includes a cache storage which may be configured to store a plurality of cache lines of data. The cache memory subsystem further includes a scheduler which may be configured to schedule reads and writes of information associated with the cache storage using a fixed latency pipeline. In response to scheduling a read request, the scheduler may be further configured to cause an associated write to occur a fixed number of cycles after the scheduling of the read request.

Type: Grant

Filed: January 12, 2004

Date of Patent: July 31, 2007

Assignee: Advanced Micro Devices, Inc.

Inventors: Roger D. Isaac, Mitchell Alsup, Rama S. Gopal, James K. Pickett, Michael A. Filippo
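
The scheduling rule in the abstract above, where each scheduled read causes an associated write a fixed number of cycles later, can be sketched in Python. This is only an illustrative model: the delay value, the cycle-indexed slot table, and all names are assumptions, and the sketch does not model conflicts between operations landing in the same cycle.

```python
# Toy model only: the delay value and all names are illustrative assumptions.

class FixedLatencyScheduler:
    def __init__(self, write_delay=3):
        self.write_delay = write_delay
        self.cycle = 0
        self.slots = {}  # cycle number -> list of (operation, tag) pairs

    def schedule_read(self, tag):
        """Book a read now and its associated write a fixed delay later."""
        self.slots.setdefault(self.cycle, []).append(("read", tag))
        write_cycle = self.cycle + self.write_delay
        self.slots.setdefault(write_cycle, []).append(("write", tag))

    def tick(self):
        """Return the operations for the current cycle, then advance one cycle."""
        operations = self.slots.pop(self.cycle, [])
        self.cycle += 1
        return operations


# Usage: reads scheduled in cycles 0 and 1 produce writes in cycles 2 and 3.
scheduler = FixedLatencyScheduler(write_delay=2)
scheduler.schedule_read("A")
cycle0 = scheduler.tick()
scheduler.schedule_read("B")
cycle1 = scheduler.tick()
cycle2 = scheduler.tick()
cycle3 = scheduler.tick()
```

Because the write slot is fixed at scheduling time, the scheduler knows every future pipeline occupancy without tracking the progress of individual requests.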

Load store unit with replay mechanism

Patent number: 7165167

Abstract: A microprocessor may include a scheduler configured to issue operations and a load store unit configured to execute memory operations issued by the scheduler. The load store unit is configured to store information identifying memory operations issued to the load store unit. In response to detection of incorrect data speculation for one of the issued memory operations, the load store unit is configured to replay at least one of the issued memory operations by providing an indication to the scheduler. The scheduler is configured to responsively reissue the memory operations identified by the load store unit.

Type: Grant

Filed: June 10, 2003

Date of Patent: January 16, 2007

Assignee: Advanced Micro Devices, Inc.

Inventors: Michael A. Filippo, James K. Pickett, Benjamin T. Sander, Rama S. Gopal

Load store unit with replay mechanism

Publication number: 20040255101

Abstract: A microprocessor may include a scheduler configured to issue operations and a load store unit configured to execute memory operations issued by the scheduler. The load store unit is configured to store information identifying memory operations issued to the load store unit. In response to detection of incorrect data speculation for one of the issued memory operations, the load store unit is configured to replay at least one of the issued memory operations by providing an indication to the scheduler. The scheduler is configured to responsively reissue the memory operations identified by the load store unit.

Type: Application

Filed: June 10, 2003

Publication date: December 16, 2004

Applicant: Advanced Micro Devices, Inc.

Inventors: Michael A. Filippo, James K. Pickett, Benjamin T. Sander, Rama S. Gopal
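
The interaction between scheduler and load store unit described in the abstract above can be sketched in Python. This is a toy model: the policy of replaying the misspeculated operation together with every operation issued after it is an assumption chosen for the example (the abstract only says "at least one" operation is replayed), and the class and method names are hypothetical.

```python
# Toy model only: the replay policy and all names are illustrative assumptions.

class Scheduler:
    def __init__(self):
        self.issue_log = []  # every operation issued, including reissues

    def issue(self, operation, load_store_unit):
        """Issue one memory operation to the load store unit."""
        self.issue_log.append(operation)
        load_store_unit.accept(operation)

    def replay(self, operations, load_store_unit):
        """Reissue the operations the load store unit identified."""
        for operation in operations:
            self.issue(operation, load_store_unit)


class LoadStoreUnit:
    def __init__(self):
        self.outstanding = []  # operations issued to this unit, in order

    def accept(self, operation):
        self.outstanding.append(operation)

    def report_misspeculation(self, bad_operation, scheduler):
        """Tell the scheduler which operations to replay."""
        index = self.outstanding.index(bad_operation)
        # Assumed policy: replay the bad operation and everything younger.
        to_replay = self.outstanding[index:]
        del self.outstanding[index:]
        scheduler.replay(to_replay, self)
        return to_replay


# Usage: three loads are issued, then the second is found to be misspeculated.
scheduler = Scheduler()
lsu = LoadStoreUnit()
for name in ["ld1", "ld2", "ld3"]:
    scheduler.issue(name, lsu)
replayed = lsu.report_misspeculation("ld2", scheduler)
```

The key point mirrored from the abstract is that the load store unit keeps its own record of issued operations, so it can name the ones to replay instead of the scheduler having to discover them.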