Patents by Inventor Zehra Sura
Zehra Sura has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11568235
Abstract: Embodiments for implementing mixed precision learning for neural networks by a processor. A neural network may be replicated into a plurality of replicated instances, and each of the plurality of replicated instances differs in the precision used for representing and determining parameters of the neural network. Data instances may be routed to one or more of the plurality of replicated instances for processing according to a data pre-processing operation.
Type: Grant
Filed: November 19, 2018
Date of Patent: January 31, 2023
Assignee: International Business Machines Corporation
Inventors: Zehra Sura, Parijat Dube, Bishwaranjan Bhattacharjee, Tong Chen
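The precision-replication and routing scheme this abstract describes might be sketched as follows. This is a toy sketch: the fixed-point `quantize` helper, the two replica precisions, and the magnitude-based routing rule are all illustrative assumptions, not details from the patent.

```python
def quantize(x, fraction_bits):
    # Crude fixed-point rounding, standing in for a lower-precision format.
    scale = 1 << fraction_bits
    return round(x * scale) / scale

class Replica:
    """One copy of the network, differing only in the precision it uses."""
    def __init__(self, weights, fraction_bits):
        self.fraction_bits = fraction_bits
        self.weights = [quantize(w, fraction_bits) for w in weights]

    def forward(self, inputs):
        # A single dot product, computed entirely at this replica's precision.
        acc = 0.0
        for w, x in zip(self.weights, inputs):
            acc = quantize(acc + w * quantize(x, self.fraction_bits),
                           self.fraction_bits)
        return acc

def route(data_instance, replicas):
    # Hypothetical pre-processing rule: small-magnitude inputs go to the
    # low-precision replica; everything else goes to the high-precision one.
    if max(abs(x) for x in data_instance) < 1.0:
        return replicas["low"]
    return replicas["high"]

weights = [0.3, -1.7, 0.52]
replicas = {"low": Replica(weights, 4), "high": Replica(weights, 20)}
chosen = route([0.1, 0.2, 0.3], replicas)
```

Any pre-processing operation that inspects the data instance could stand in for `route`; the essential point is that every replica shares the same logical model while the arithmetic precision differs.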
-
Patent number: 11403213
Abstract: A method for transparently moving a block of memory with respect to an application using the block of memory includes inserting, by a compiler, into an application that includes a memory allocation call, instructions for transparently moving the block of memory. The instructions include obtaining a first pointer returned by a memory allocator, where the first pointer points to an internal data structure, the internal data structure includes a read-write lock and a second pointer, and the second pointer points to an actual memory block. The instructions further include acquiring a read lock on the read-write lock in the internal data structure before the first pointer is used by the application, obtaining the second pointer to the actual memory block, and dereferencing the second pointer to access the actual memory block for the application data.
Type: Grant
Filed: June 28, 2019
Date of Patent: August 2, 2022
Assignee: International Business Machines Corporation
Inventors: Wenqi Cao, Arun Iyengar, Gong Su, Zehra Sura, Qi Zhang
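The double-indirection idea in the abstract above can be sketched in Python. All names here are assumed for illustration, and a plain mutex stands in for the patent's read-write lock, since the Python standard library has no reader-writer lock.

```python
import threading

class MovableBlock:
    """Stand-in for the patent's internal data structure: a lock plus a
    'second pointer' to the actual memory block. The application keeps only
    the 'first pointer' (this handle), so the runtime can relocate the
    underlying buffer without the application noticing.
    """
    def __init__(self, size):
        self.lock = threading.Lock()   # stands in for the read-write lock
        self.block = bytearray(size)   # the "second pointer" target

    def read(self, offset):
        # What the compiler-inserted instructions do before each access:
        # take the lock, follow the second pointer, access the real block.
        with self.lock:
            return self.block[offset]

    def write(self, offset, value):
        with self.lock:
            self.block[offset] = value

    def move(self):
        # The mover takes the lock exclusively, copies the data into a fresh
        # buffer, and swaps the second pointer; outstanding first pointers
        # remain valid because they never reference the buffer directly.
        with self.lock:
            self.block = bytearray(self.block)

handle = MovableBlock(8)
handle.write(3, 42)
handle.move()
value_after_move = handle.read(3)
```

The design point is that the extra indirection lets `move` run at any time: readers serialize on the lock and always follow the current second pointer, so the application never observes a stale address.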
-
Publication number: 20200409833
Abstract: A method for transparently moving a block of memory with respect to an application using the block of memory includes inserting, by a compiler, into an application that includes a memory allocation call, instructions for transparently moving the block of memory. The instructions include obtaining a first pointer returned by a memory allocator, where the first pointer points to an internal data structure, the internal data structure includes a read-write lock and a second pointer, and the second pointer points to an actual memory block. The instructions further include acquiring a read lock on the read-write lock in the internal data structure before the first pointer is used by the application, obtaining the second pointer to the actual memory block, and dereferencing the second pointer to access the actual memory block for the application data.
Type: Application
Filed: June 28, 2019
Publication date: December 31, 2020
Inventors: Wenqi Cao, Arun Iyengar, Gong Su, Zehra Sura, Qi Zhang
-
Partial synchronization between compute tasks based on threshold specification in a computing system
Patent number: 10824481
Abstract: Embodiments for implementing partial synchronization between compute processes based on threshold specification in a computing environment. One or more compute processes may be synchronized in one of a plurality of types of computing platforms using a barrier having a barrier release condition based on a threshold of one or more measures. The barrier is defined according to one or more parameters. The one or more compute processes may be released via the barrier upon exceeding the threshold of the one or more measures.
Type: Grant
Filed: November 13, 2018
Date of Patent: November 3, 2020
Assignee: International Business Machines Corporation
Inventors: Zehra Sura, Li Zhang, Ashish Kundu, Ravi Nair
-
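A minimal sketch of the threshold-based barrier release described in the entry above, using an arrival count as the measure. This is only one reading of the abstract: the abstract allows any "one or more measures", and the class and parameter names here are assumptions.

```python
import threading

class ThresholdBarrier:
    """Barrier whose release condition is a threshold on a measure rather
    than full arrival of all participants; the measure here is a simple
    arrival count, chosen for illustration."""
    def __init__(self, threshold):
        self.threshold = threshold
        self.measure = 0
        self.cond = threading.Condition()

    def arrive(self, contribution=1):
        with self.cond:
            self.measure += contribution
            if self.measure >= self.threshold:
                self.cond.notify_all()   # release condition met
            else:
                # Block until enough other tasks have contributed; tasks
                # arriving after the threshold pass straight through.
                self.cond.wait_for(lambda: self.measure >= self.threshold)

barrier = ThresholdBarrier(threshold=3)
released = []
workers = [
    threading.Thread(target=lambda i=i: (barrier.arrive(), released.append(i)))
    for i in range(3)
]
for t in workers:
    t.start()
for t in workers:
    t.join()
```

Unlike a conventional barrier, stragglers beyond the threshold never wait, which is what makes the synchronization "partial".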
Publication number: 20200160169
Abstract: Embodiments for implementing mixed precision learning for neural networks by a processor. A neural network may be replicated into a plurality of replicated instances, and each of the plurality of replicated instances differs in the precision used for representing and determining parameters of the neural network. Data instances may be routed to one or more of the plurality of replicated instances for processing according to a data pre-processing operation.
Type: Application
Filed: November 19, 2018
Publication date: May 21, 2020
Applicant: International Business Machines Corporation
Inventors: Zehra Sura, Parijat Dube, Bishwaranjan Bhattacharjee, Tong Chen
-
Partial synchronization between compute tasks based on threshold specification in a computing system
Publication number: 20200151028
Abstract: Embodiments for implementing partial synchronization between compute processes based on threshold specification in a computing environment. One or more compute processes may be synchronized in one of a plurality of types of computing platforms using a barrier having a barrier release condition based on a threshold of one or more measures. The barrier is defined according to one or more parameters. The one or more compute processes may be released via the barrier upon exceeding the threshold of the one or more measures.
Type: Application
Filed: November 13, 2018
Publication date: May 14, 2020
Applicant: International Business Machines Corporation
Inventors: Zehra Sura, Li Zhang, Ashish Kundu, Ravi Nair
-
Patent number: 8930921
Abstract: According to one embodiment of the present invention, a computer system is provided that includes a main processor and first and second active memory devices. The computer system is configured to perform a method including receiving an executable module generated by a compiler, wherein the executable module includes a code section identified as executable by a first processing element in the first active memory device and a second processing element in the second active memory device. The method includes copying the code section to memory in the first device based on the code section being executable on the first device, copying the code section from the first active memory device to an instruction buffer of the first processing element, and copying the code section from the first device to the second device based on the code section being executable on the second device.
Type: Grant
Filed: November 20, 2012
Date of Patent: January 6, 2015
Assignee: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
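The copy chain in the abstract above might be modeled as below. The device and buffer structures are assumptions made for illustration; the patent itself does not name them.

```python
class ProcessingElement:
    """Processing element inside an active memory device (name assumed)."""
    def __init__(self):
        self.instruction_buffer = None

class ActiveMemoryDevice:
    def __init__(self):
        self.memory = {}                    # device-local memory
        self.element = ProcessingElement()

def load_code_section(section_id, code, first, second):
    # Mirror the copy chain in the abstract: the section is copied into the
    # first device's memory, then from that memory into the first element's
    # instruction buffer, then device-to-device so the second element can
    # execute the same section.
    first.memory[section_id] = code
    first.element.instruction_buffer = first.memory[section_id]
    second.memory[section_id] = first.memory[section_id]
    second.element.instruction_buffer = second.memory[section_id]

dev_a, dev_b = ActiveMemoryDevice(), ActiveMemoryDevice()
load_code_section("kernel0", b"\x90\x90", dev_a, dev_b)
```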
-
Patent number: 8914778
Abstract: According to one embodiment, a method for a compiler to produce an executable module to be executed by a computer system including a main processor and active memory devices includes dividing source code into code sections, identifying a first code section to be executed by the active memory devices, wherein the first code section is one of the code sections, and identifying data structures that are used by the first code section. The method also includes classifying the data structures based on pre-defined attributes, formulating, by the compiler, a storage mapping plan for the data structures based on the classifying, and generating, by the compiler, mapping code that implements the storage mapping plan, wherein the mapping code is part of the executable module and maps storing of the data structures to storage locations in the active memory devices.
Type: Grant
Filed: November 5, 2012
Date of Patent: December 16, 2014
Assignee: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
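A storage mapping plan like the one formulated above could look like this sketch. The abstract only says data structures are classified on pre-defined attributes, so the attributes and placement rules here (sharing, access pattern, size) are illustrative guesses rather than the patent's actual policy.

```python
def classify(ds):
    # Hypothetical attribute-based rules mapping a data structure to a
    # storage location; these thresholds and categories are assumptions.
    if ds["shared"]:
        return "main_memory"             # keep host-visible data with the CPU
    if ds["access"] == "streaming":
        return "active_memory_device"    # stream near the processing element
    return "active_memory_device" if ds["size"] > 4096 else "main_memory"

def storage_mapping_plan(data_structures):
    # One placement decision per data structure; the compiler would then
    # emit mapping code realizing this plan inside the executable module.
    return {ds["name"]: classify(ds) for ds in data_structures}

plan = storage_mapping_plan([
    {"name": "lookup_table", "size": 64,      "access": "random",    "shared": True},
    {"name": "input_stream", "size": 1 << 20, "access": "streaming", "shared": False},
])
```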
-
Patent number: 8914779
Abstract: According to one embodiment, a system including a compiler to produce an executable module to be executed by a computer system including a main processor and active memory devices is provided. The system is configured to perform a method including dividing source code into code sections, identifying a first code section to be executed by the active memory devices, and identifying data structures that are used by the first code section. The method also includes classifying the data structures based on pre-defined attributes, formulating, by the compiler, a storage mapping plan for the data structures based on the classifying, and generating, by the compiler, mapping code that implements the storage mapping plan, wherein the mapping code is part of the executable module and maps storing of the data structures to storage locations in the active memory devices.
Type: Grant
Filed: November 26, 2012
Date of Patent: December 16, 2014
Assignee: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
-
Patent number: 8863099
Abstract: According to one embodiment of the present invention, a method for operation of a computer system including a main processor and first and second active memory devices includes receiving an executable module generated by a compiler, wherein the executable module includes a code section identified as executable by a first processing element in the first active memory device and a second processing element in the second active memory device. The method further includes copying the code section to memory in the first device based on the code section being executable on the first device, copying the code section from the memory in the first active memory device to an instruction buffer of the first processing element, and copying the code section from the memory in the first device to the second device based on the code section being executable on the second device.
Type: Grant
Filed: November 5, 2012
Date of Patent: October 14, 2014
Assignee: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
-
Publication number: 20140129787
Abstract: According to one embodiment, a method for a compiler to produce an executable module to be executed by a computer system including a main processor and active memory devices includes dividing source code into code sections, identifying a first code section to be executed by the active memory devices, wherein the first code section is one of the code sections, and identifying data structures that are used by the first code section. The method also includes classifying the data structures based on pre-defined attributes, formulating, by the compiler, a storage mapping plan for the data structures based on the classifying, and generating, by the compiler, mapping code that implements the storage mapping plan, wherein the mapping code is part of the executable module and maps storing of the data structures to storage locations in the active memory devices.
Type: Application
Filed: November 5, 2012
Publication date: May 8, 2014
Applicant: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
-
Publication number: 20140130023
Abstract: According to one embodiment of the present invention, a computer system is provided that includes a main processor and first and second active memory devices. The computer system is configured to perform a method including receiving an executable module generated by a compiler, wherein the executable module includes a code section identified as executable by a first processing element in the first active memory device and a second processing element in the second active memory device. The method includes copying the code section to memory in the first device based on the code section being executable on the first device, copying the code section from the first active memory device to an instruction buffer of the first processing element, and copying the code section from the first device to the second device based on the code section being executable on the second device.
Type: Application
Filed: November 20, 2012
Publication date: May 8, 2014
Applicant: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
-
Publication number: 20140130022
Abstract: According to one embodiment of the present invention, a method for operation of a computer system including a main processor and first and second active memory devices includes receiving an executable module generated by a compiler, wherein the executable module includes a code section identified as executable by a first processing element in the first active memory device and a second processing element in the second active memory device. The method further includes copying the code section to memory in the first device based on the code section being executable on the first device, copying the code section from the memory in the first active memory device to an instruction buffer of the first processing element, and copying the code section from the memory in the first device to the second device based on the code section being executable on the second device.
Type: Application
Filed: November 5, 2012
Publication date: May 8, 2014
Applicant: International Business Machines Corporation
Inventors: Tong Chen, John K. O'Brien, Zehra Sura
-
Publication number: 20110283067
Abstract: Target memory hierarchy specification in a multi-core computer processing system is provided, including a system for implementing prefetch instructions. The system includes a first core processor, a dedicated cache corresponding to the first core processor, and a second core processor. The second core processor includes instructions for executing a prefetch instruction that specifies a memory location and the dedicated local cache corresponding to the first core processor. Executing the prefetch instruction includes retrieving data from the memory location and storing the retrieved data in the dedicated local cache corresponding to the first core processor.
Type: Application
Filed: May 11, 2010
Publication date: November 17, 2011
Applicant: International Business Machines Corporation
Inventors: Tong Chen, Yaoqing Gao, Kevin K. O'Brien, Zehra Sura, Lixin Zhang
-
Patent number: 7836256
Abstract: One embodiment of the present method and apparatus for application-specific dynamic cache placement includes grouping sets of data in a cache memory system into two or more virtual partitions and processing a load/store instruction in accordance with the virtual partitions, where the load/store instruction specifies at least one of the virtual partitions to which the load/store instruction is assigned.
Type: Grant
Filed: June 30, 2008
Date of Patent: November 16, 2010
Assignee: International Business Machines Corporation
Inventors: Krishnan Kunjunny Kailas, Rajiv Alazhath Ravindran, Zehra Sura
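The virtual-partition idea above can be illustrated with a small model in which each access names its partition and indexes only within that partition's cache sets. The partition names and sizes are assumptions for the sketch, not values from the patent.

```python
class PartitionedCache:
    """Cache whose sets are grouped into named virtual partitions; every
    load/store carries the partition it is assigned to."""
    def __init__(self, num_sets, partitions):
        self.sets = {i: {} for i in range(num_sets)}
        self.partitions = partitions   # name -> list of set indices

    def access(self, address, partition):
        # Index only within the named partition's sets, so traffic assigned
        # to one partition can never evict lines placed in another.
        indices = self.partitions[partition]
        idx = indices[address % len(indices)]
        self.sets[idx][address] = True   # install the line in that set
        return idx

# Hypothetical split: 2 sets for streaming data, 6 for data with reuse.
cache = PartitionedCache(8, {"streaming": [0, 1], "reuse": [2, 3, 4, 5, 6, 7]})
idx = cache.access(0x1234, "streaming")
```

Confining streaming accesses to a small partition is a typical motivation: a large scan then cannot flush the reuse partition.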
-
Patent number: 7502890
Abstract: One embodiment of the present method and apparatus for dynamic priority-based cache replacement includes selectively assigning relative priority values to at least a subset of data items in the cache memory system, fetching a new data item to load into the cache memory system, the data item being associated with a priority value, and selecting an existing data item from the cache memory system to replace with the new data item, in accordance with the relative priority values and the priority value of the new data item.
Type: Grant
Filed: July 7, 2006
Date of Patent: March 10, 2009
Assignee: International Business Machines Corporation
Inventors: Krishnan Kunjunny Kailas, Rajiv Alazhath Ravindran, Zehra Sura
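One plausible reading of the replacement policy above is sketched here: the victim is the resident item with the lowest priority, and it is replaced only when the incoming item outranks it. The tie-breaking rule and class names are assumptions.

```python
class PriorityCache:
    """Replacement driven by relative priorities rather than recency."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.items = {}   # key -> (priority, value)

    def insert(self, key, priority, value):
        if key in self.items or len(self.items) < self.capacity:
            self.items[key] = (priority, value)
            return True
        # Select the existing data item to replace per the relative priorities.
        victim = min(self.items, key=lambda k: self.items[k][0])
        if self.items[victim][0] < priority:
            del self.items[victim]
            self.items[key] = (priority, value)
            return True
        return False   # the new item loses the comparison and is not cached

cache = PriorityCache(capacity=2)
cache.insert("a", priority=5, value="hot")
cache.insert("b", priority=1, value="cold")
cache.insert("c", priority=3, value="warm")   # evicts "b", since 1 < 3
```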
-
Publication number: 20080270705
Abstract: One embodiment of the present method and apparatus for application-specific dynamic cache placement includes grouping sets of data in a cache memory system into two or more virtual partitions and processing a load/store instruction in accordance with the virtual partitions, where the load/store instruction specifies at least one of the virtual partitions to which the load/store instruction is assigned.
Type: Application
Filed: June 30, 2008
Publication date: October 30, 2008
Inventors: Krishnan Kunjunny Kailas, Rajiv Alazhath Ravindran, Zehra Sura
-
Publication number: 20080010413
Abstract: One embodiment of the present method and apparatus for application-specific dynamic cache placement includes grouping sets of data in a cache memory system into two or more virtual partitions and processing a load/store instruction in accordance with the virtual partitions, where the load/store instruction specifies at least one of the virtual partitions to which the load/store instruction is assigned.
Type: Application
Filed: July 7, 2006
Publication date: January 10, 2008
Inventors: Krishnan Kunjunny Kailas, Rajiv Alazhath Ravindran, Zehra Sura
-
Publication number: 20080010414
Abstract: One embodiment of the present method and apparatus for dynamic priority-based cache replacement includes selectively assigning relative priority values to at least a subset of data items in the cache memory system, fetching a new data item to load into the cache memory system, the data item being associated with a priority value, and selecting an existing data item from the cache memory system to replace with the new data item, in accordance with the relative priority values and the priority value of the new data item.
Type: Application
Filed: July 7, 2006
Publication date: January 10, 2008
Inventors: Krishnan Kunjunny Kailas, Rajiv Alazhath Ravindran, Zehra Sura
-
Publication number: 20070261042
Abstract: A compiler-implemented software cache apparatus and method in which non-aliased explicitly fetched data are excluded are provided. With the mechanisms of the illustrative embodiments, a compiler uses a forward data flow analysis to prove that there is no alias between the cached data and explicitly fetched data. Explicitly fetched data that have no alias in the cached data are excluded from the software cache. Explicitly fetched data that have aliases in the cached data are allowed to be stored in the software cache. In this way, there is no runtime overhead to maintain the correctness of two copies of the data. Moreover, the number of software cache lines that must be protected from eviction is decreased. This leads to a decrease in the number of computation cycles required by the cache miss handler when evicting cache lines during cache miss handling.
Type: Application
Filed: April 14, 2006
Publication date: November 8, 2007
Inventors: Tong Chen, John O'Brien, Kathryn O'Brien, Byoungro So, Zehra Sura, Tao Zhang
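The exclusion decision described in this last abstract can be sketched as a placement pass. The real compiler runs a forward data-flow analysis; the `may_alias` check below, based on overlapping base symbols, is a deliberately simplified stand-in, and all names are assumed.

```python
def may_alias(fetch_bases, cached_bases):
    # Simplified alias test: two references may alias only if their base
    # symbols overlap. A real compiler would prove this with data-flow facts.
    return bool(set(fetch_bases) & set(cached_bases))

def place_explicit_fetches(fetches, cached_bases):
    # Fetches proven alias-free bypass the software cache entirely, so no
    # runtime work is needed to keep two copies of the data consistent.
    placement = {}
    for name, bases in fetches.items():
        if may_alias(bases, cached_bases):
            placement[name] = "software_cache"   # alias possible: keep cached
        else:
            placement[name] = "bypass"           # excluded from the cache
    return placement

placement = place_explicit_fetches(
    {"dma_get_a": ["A"], "dma_get_b": ["B", "C"]},
    cached_bases=["C", "D"],
)
```

Every fetch routed to `"bypass"` is one fewer line the miss handler must protect from eviction, which is where the cycle savings in the abstract come from.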