Patents by Inventor Ron E. Liu

Ron E. Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Data driven parallel sorting system and method

Patent number: 9990412

Abstract: A data driven parallel sorting method includes distributing input data records to n partitions one by one in a circular manner. Each partition corresponds to a parallel sorting process with an allocated memory chunk sized to store m data records. The method also includes sorting, in parallel, current data records in respective memory chunks in respective partitions. The method also includes in response to distribution of data records of ⌊m/n⌋ rounds, circularly controlling one of the n partitions, and writing data records that have been sorted in the memory chunk of the partition into a mass storage as an ordered data chunk, and emptying the memory chunk. The method also includes in response to all data records being distributed, writing data chunks that have been sorted in respective memory chunks into the mass storage, and performing a merge sort on all ordered data chunks in the mass storage.
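The distribution and flush schedule described in this abstract can be illustrated with a small sequential simulation. This is only a sketch under stated assumptions, not the patented implementation: the function name `data_driven_sort` is hypothetical, the parallel sorting processes are replaced by a plain `sorted()` call at flush time, an in-memory list stands in for mass storage, and a "round" is taken to mean one record delivered to each of the n partitions.

```python
import heapq
from typing import Iterable, List


def data_driven_sort(records: Iterable[int], n: int, m: int) -> List[int]:
    """Sequentially simulate round-robin distribution with staggered flushes.

    n: number of partitions; m: capacity (in records) of each memory chunk.
    All names here are illustrative assumptions, not taken from the patent.
    """
    if n < 1 or m < n:
        raise ValueError("need n >= 1 and m >= n so that floor(m/n) >= 1")

    rounds_per_flush = m // n                        # floor(m/n)
    chunks: List[List[int]] = [[] for _ in range(n)]  # one memory chunk per partition
    storage: List[List[int]] = []                     # stands in for mass storage
    next_to_flush = 0                                 # partition flushed next, circularly
    distributed = 0

    for record in records:
        chunks[distributed % n].append(record)        # one by one, in a circular manner
        distributed += 1
        # Every floor(m/n) rounds (n records per round), flush one partition.
        if distributed % (n * rounds_per_flush) == 0:
            chunk = chunks[next_to_flush]
            storage.append(sorted(chunk))             # write an ordered data chunk
            chunk.clear()                             # empty the memory chunk
            next_to_flush = (next_to_flush + 1) % n

    # All records distributed: write out whatever remains in each chunk.
    for chunk in chunks:
        if chunk:
            storage.append(sorted(chunk))

    # Merge all ordered chunks into the final sorted output.
    return list(heapq.merge(*storage))
```

Under these assumptions each partition is flushed once every n flushes and receives ⌊m/n⌋ records between consecutive flushes, so it never holds more than n·⌊m/n⌋ ≤ m records, and the flushes are staggered across partitions rather than all happening at once.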

Type: Grant

Filed: April 28, 2014

Date of Patent: June 5, 2018

Assignee: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Dong J. Wei, Xin Ying Yang

Dynamic node group allocation

Patent number: 9762672

Abstract: Provided are techniques for improving data locality for parallel applications running in a big data distributed file system with a dynamic node group. In response to a consumer job starting to read one or more files in a big data distributed file system having multiple nodes, node group information for the one or more files to be read is retrieved, wherein the node group information identifies nodes from the multiple nodes on which a producer job wrote the one or more files, and the consumer job is assigned to the nodes identified by the node group information to allow for local reading of the one or more files by the consumer job.
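The lookup-and-assign step in this abstract can be sketched as follows. This is a hedged illustration only: `NodeGroupRegistry`, `record_write`, and `assign_consumer` are hypothetical names, node group information is held in a plain in-memory dictionary, and the fallback to all nodes when no group is recorded is an assumption that the abstract does not state.

```python
from typing import Dict, Iterable, List, Set


class NodeGroupRegistry:
    """Hypothetical store of node group information, keyed by file path."""

    def __init__(self) -> None:
        self._groups: Dict[str, Set[str]] = {}

    def record_write(self, path: str, node: str) -> None:
        # Called on behalf of a producer job: remember where the file was written.
        self._groups.setdefault(path, set()).add(node)

    def nodes_for(self, paths: Iterable[str]) -> List[str]:
        # Union of the node groups of every file the consumer will read.
        nodes: Set[str] = set()
        for path in paths:
            nodes |= self._groups.get(path, set())
        return sorted(nodes)


def assign_consumer(registry: NodeGroupRegistry,
                    paths: Iterable[str],
                    all_nodes: Iterable[str]) -> List[str]:
    """Pick the nodes a consumer job should run on to read its files locally."""
    group = registry.nodes_for(paths)
    # Assumption: with no recorded node group, fall back to every node.
    return group if group else sorted(all_nodes)
```

For example, if a producer wrote `/data/a` on `node1` and `node2`, a consumer reading `/data/a` would be assigned to those two nodes rather than to the whole cluster.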

Type: Grant

Filed: June 15, 2015

Date of Patent: September 12, 2017

Assignee: International Business Machines Corporation

Inventors: Krishna K. Bonagiri, Eric A. Jacobson, Yong Li, Ron E. Liu, Xiaoyan Pu

Multiple stage workload management system

Patent number: 9733984

Abstract: Provided are techniques for multiple stage workload management. A staging queue and a run queue are provided. A workload is received. In response to determining that application resources are not available and that the workload has not been previously semi-started, the workload is added to the staging queue. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that run resources are available, the workload is started. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that the run resources are not available, the workload is added to the run queue.
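The three cases listed in this abstract can be written out as a small decision routine. This is a sketch that follows the wording of the abstract literally; the class and method names (`WorkloadManager`, `submit`, `Outcome`) are hypothetical, and the case in which application resources are available is not described in the abstract, so it is reported as `NOT_COVERED` rather than guessed at.

```python
from collections import deque
from dataclasses import dataclass
from enum import Enum
from typing import Deque


class Outcome(Enum):
    STAGED = "added to staging queue"
    STARTED = "started"
    QUEUED_TO_RUN = "added to run queue"
    NOT_COVERED = "case not described in the abstract"


@dataclass
class Workload:
    name: str
    semi_started: bool = False


class WorkloadManager:
    """Hypothetical manager holding the staging queue and the run queue."""

    def __init__(self) -> None:
        self.staging_queue: Deque[Workload] = deque()
        self.run_queue: Deque[Workload] = deque()

    def submit(self, workload: Workload,
               app_resources_available: bool,
               run_resources_available: bool) -> Outcome:
        if app_resources_available:
            # The abstract only covers the "not available" branch.
            return Outcome.NOT_COVERED
        if not workload.semi_started:
            self.staging_queue.append(workload)
            return Outcome.STAGED
        if run_resources_available:
            return Outcome.STARTED
        self.run_queue.append(workload)
        return Outcome.QUEUED_TO_RUN
```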

Type: Grant

Filed: March 2, 2016

Date of Patent: August 15, 2017

Assignee: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Mi W. Shum, Chun H. Sun, DongJie Wei

Sharing a partitioned data set across parallel applications

Patent number: 9652308

Abstract: Provided are techniques for sharing a partitioned data set across parallel applications. Under control of a producing application, a partitioned data set is generated; a descriptor that describes the partitioned data set is generated; and the descriptor is registered in a registry. Under control of a consuming application, the registry is accessed to obtain the descriptor of the partitioned data set; and the descriptor is used to determine how to process the partitioned data set.
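The producer and consumer roles in this abstract can be sketched with an in-memory registry. Everything concrete here is an assumption made for illustration: the descriptor fields (`name`, `key`, `num_partitions`), the modulo partitioning on an integer key, and the two possible consumer decisions are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Descriptor:
    """Hypothetical description of how a data set was partitioned."""
    name: str
    key: str
    num_partitions: int


class Registry:
    def __init__(self) -> None:
        self._descriptors: Dict[str, Descriptor] = {}

    def register(self, descriptor: Descriptor) -> None:
        self._descriptors[descriptor.name] = descriptor

    def lookup(self, name: str) -> Descriptor:
        return self._descriptors[name]


def produce(registry: Registry, name: str, key: str,
            rows: List[dict], num_partitions: int) -> List[List[dict]]:
    """Producer: partition rows on an integer key and register a descriptor."""
    partitions: List[List[dict]] = [[] for _ in range(num_partitions)]
    for row in rows:
        partitions[row[key] % num_partitions].append(row)
    registry.register(Descriptor(name, key, num_partitions))
    return partitions


def plan_consume(registry: Registry, name: str,
                 wanted_key: str, workers: int) -> str:
    """Consumer: use the descriptor to decide how to process the data set."""
    descriptor = registry.lookup(name)
    if descriptor.key == wanted_key and descriptor.num_partitions == workers:
        return "read partitions as-is"
    return "repartition before processing"
```

In this sketch the consumer never inspects the data itself to learn its layout; it relies only on what the producer registered.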

Type: Grant

Filed: September 5, 2014

Date of Patent: May 16, 2017

Assignee: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Xiaoyan Pu

Apparatus and method for identifying performance bottlenecks in pipeline parallel processing environment

Patent number: 9575916

Abstract: A system identifies a performance bottleneck status in a parallel data processing environment by examining data flow associated with the parallel data processing environment to identify at least one operator, where an operator type is associated with at least one operator, at least one buffer, and a relationship that the buffer has with the operator, where the relationship is associated with the operator type. The system monitors the buffer to determine a buffer status associated with the buffer. The system applies a set of rules to identify an operator bottleneck status associated with the operator. The set of rules is applied to the operator, based on the operator type, the buffer status, and relationship that the buffer has with the operator. The system then determines a performance bottleneck status associated with the parallel data processing environment, based on the operator bottleneck status.
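The rule-based check in this abstract can be illustrated with a toy model. The abstract does not say what the rules are, so the rules below are assumptions chosen only to show the shape of the approach: a transform operator is flagged when its input buffer is full while its output buffer is empty, a source when its output buffer is empty, and a sink when its input buffer is full. All type and function names are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class BufferStatus(Enum):
    EMPTY = "empty"
    NORMAL = "normal"
    FULL = "full"


class OperatorType(Enum):
    SOURCE = "source"        # has only an output buffer
    TRANSFORM = "transform"  # has an input and an output buffer
    SINK = "sink"            # has only an input buffer


@dataclass
class Operator:
    name: str
    op_type: OperatorType
    input_status: Optional[BufferStatus] = None
    output_status: Optional[BufferStatus] = None


def is_bottleneck(op: Operator) -> bool:
    """Apply the (assumed) rule that matches the operator's type."""
    if op.op_type is OperatorType.SOURCE:
        return op.output_status is BufferStatus.EMPTY
    if op.op_type is OperatorType.SINK:
        return op.input_status is BufferStatus.FULL
    return (op.input_status is BufferStatus.FULL
            and op.output_status is BufferStatus.EMPTY)


def pipeline_bottlenecks(operators: List[Operator]) -> List[str]:
    """Derive the pipeline-level status from the per-operator statuses."""
    return [op.name for op in operators if is_bottleneck(op)]
```

The point of the sketch is that which buffers are examined, and how their status is interpreted, depends on the operator type and on each buffer's relationship to the operator, which is the structure the abstract describes.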

Type: Grant

Filed: January 6, 2014

Date of Patent: February 21, 2017

Assignee: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, DongJie Wei, Xin Ying Yang

Sharing a partitioned data set across parallel applications

Patent number: 9542246

Abstract: Provided are techniques for sharing a partitioned data set across parallel applications. Under control of a producing application, a partitioned data set is generated; a descriptor that describes the partitioned data set is generated; and the descriptor is registered in a registry. Under control of a consuming application, the registry is accessed to obtain the descriptor of the partitioned data set; and the descriptor is used to determine how to process the partitioned data set.

Type: Grant

Filed: May 20, 2015

Date of Patent: January 10, 2017

Assignee: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Xiaoyan Pu

DYNAMIC NODE GROUP ALLOCATION

Publication number: 20160366224

Abstract: Provided are techniques for improving data locality for parallel applications running in a big data distributed file system with a dynamic node group. In response to a consumer job starting to read one or more files in a big data distributed file system having multiple nodes, node group information for the one or more files to be read is retrieved, wherein the node group information identifies nodes from the multiple nodes on which a producer job wrote the one or more files, and the consumer job is assigned to the nodes identified by the node group information to allow for local reading of the one or more files by the consumer job.

Type: Application

Filed: June 15, 2015

Publication date: December 15, 2016

Inventors: Krishna K. Bonagiri, Eric A. Jacobson, Yong Li, Ron E. Liu, Xiaoyan Pu

MULTIPLE STAGE WORKLOAD MANAGEMENT SYSTEM

Publication number: 20160179578

Abstract: Provided are techniques for multiple stage workload management. A staging queue and a run queue are provided. A workload is received. In response to determining that application resources are not available and that the workload has not been previously semi-started, the workload is added to the staging queue. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that run resources are available, the workload is started. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that the run resources are not available, the workload is added to the run queue.

Type: Application

Filed: March 2, 2016

Publication date: June 23, 2016

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Mi W. Shum, Chun H. Sun, DongJie Wei

Multiple stage workload management system

Patent number: 9304816

Abstract: Provided are techniques for multiple stage workload management. A staging queue and a run queue are provided. A workload is received. In response to determining that application resources are not available and that the workload has not been previously semi-started, the workload is added to the staging queue. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that run resources are available, the workload is started. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that the run resources are not available, the workload is added to the run queue.

Type: Grant

Filed: August 5, 2013

Date of Patent: April 5, 2016

Assignee: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Mi W. Shum, Chun H. Sun, DongJie Wei

SHARING A PARTITIONED DATA SET ACROSS PARALLEL APPLICATIONS

Publication number: 20160070608

Abstract: Provided are techniques for sharing a partitioned data set across parallel applications. Under control of a producing application, a partitioned data set is generated; a descriptor that describes the partitioned data set is generated; and the descriptor is registered in a registry. Under control of a consuming application, the registry is accessed to obtain the descriptor of the partitioned data set; and the descriptor is used to determine how to process the partitioned data set.

Type: Application

Filed: May 20, 2015

Publication date: March 10, 2016

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Xiaoyan Pu

SHARING A PARTITIONED DATA SET ACROSS PARALLEL APPLICATIONS

Publication number: 20160070607

Abstract: Provided are techniques for sharing a partitioned data set across parallel applications. Under control of a producing application, a partitioned data set is generated; a descriptor that describes the partitioned data set is generated; and the descriptor is registered in a registry. Under control of a consuming application, the registry is accessed to obtain the descriptor of the partitioned data set; and the descriptor is used to determine how to process the partitioned data set.

Type: Application

Filed: September 5, 2014

Publication date: March 10, 2016

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Xiaoyan Pu

APPARATUS AND METHOD FOR IDENTIFYING PERFORMANCE BOTTLENECKS IN PIPELINE PARALLEL PROCESSING ENVIRONMENT

Publication number: 20150193368

Abstract: A system identifies a performance bottleneck status in a parallel data processing environment by examining data flow associated with the parallel data processing environment to identify at least one operator, where an operator type is associated with at least one operator, at least one buffer, and a relationship that the buffer has with the operator, where the relationship is associated with the operator type. The system monitors the buffer to determine a buffer status associated with the buffer. The system applies a set of rules to identify an operator bottleneck status associated with the operator. The set of rules is applied to the operator, based on the operator type, the buffer status, and relationship that the buffer has with the operator. The system then determines a performance bottleneck status associated with the parallel data processing environment, based on the operator bottleneck status.

Type: Application

Filed: January 6, 2014

Publication date: July 9, 2015

Applicant: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, DongJie Wei, Xin Y. Yang

MULTIPLE STAGE WORKLOAD MANAGEMENT SYSTEM

Publication number: 20150040133

Abstract: Provided are techniques for multiple stage workload management. A staging queue and a run queue are provided. A workload is received. In response to determining that application resources are not available and that the workload has not been previously semi-started, the workload is added to the staging queue. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that run resources are available, the workload is started. In response to determining that the application resources are not available and that the workload has been semi-started, and, in response to determining that the run resources are not available, the workload is added to the run queue.

Type: Application

Filed: August 5, 2013

Publication date: February 5, 2015

Applicant: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Sriram K. Padmanabhan, Mi W. Shum, Chun H. Sun, DongJie Wei

Data Driven Parallel Sorting System and Method

Publication number: 20140324890

Abstract: A data driven parallel sorting method includes distributing input data records to n partitions one by one in a circular manner. Each partition corresponds to a parallel sorting process with an allocated memory chunk sized to store m data records. The method also includes sorting, in parallel, current data records in respective memory chunks in respective partitions. The method also includes in response to distribution of data records of ⌊m/n⌋ rounds, circularly controlling one of the n partitions, and writing data records that have been sorted in the memory chunk of the partition into a mass storage as an ordered data chunk, and emptying the memory chunk. The method also includes in response to all data records being distributed, writing data chunks that have been sorted in respective memory chunks into the mass storage, and performing a merge sort on all ordered data chunks in the mass storage.

Type: Application

Filed: April 28, 2014

Publication date: October 30, 2014

Applicant: International Business Machines Corporation

Inventors: Brian K. Caufield, Ron E. Liu, Dong J. Wei, Xin Y. Yang