Patents by Inventor Berthold Reinwald

Berthold Reinwald has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Pipelined approach to fused kernels for optimization of machine learning workloads on graphical processing units

Patent number: 9972063

Abstract: A method for optimization of machine learning (ML) workloads on a graphics processor unit (GPU). The method includes identifying a computation having a generic pattern commonly observed in ML processes. An optimized fused GPU kernel is employed to exploit temporal locality for inherent data-flow dependencies in the identified computation. Hierarchical aggregation spanning a memory hierarchy of the GPU for processing for the identified computation is performed. GPU kernel launch parameters are estimated following an analytical model that maximizes thread occupancy and minimizes atomic writes to GPU global memory.

Type: Grant

Filed: July 30, 2015

Date of Patent: May 15, 2018

Assignee: International Business Machines Corporation

Inventors: Arash Ashari, Matthias Boehm, Keith W. Campbell, Alexandre Evfimievski, John D. Keenleyside, Berthold Reinwald, Shirish Tatikonda
DYNAMIC RECOMPILATION TECHNIQUES FOR MACHINE LEARNING PROGRAMS

Publication number: 20170228222

Abstract: The embodiments described herein relate to recompiling an execution plan of a machine-learning program during runtime. An execution plan of a machine-learning program is compiled. In response to identifying a directed acyclic graph of high-level operations (HOP DAG) for recompilation during runtime, the execution plan is dynamically recompiled. The dynamic recompilation includes updating statistics and dynamically rewriting one or more operators of the identified HOP DAG, recomputing memory estimates of operators of the rewritten HOP DAG based on the updated statistics and rewritten operators, constructing a directed acyclic graph of low-level operations (LOP DAG) corresponding to the rewritten HOP DAG based in part on the recomputed memory estimates, and generating runtime instructions based on the LOP DAG.

Type: Application

Filed: April 28, 2017

Publication date: August 10, 2017

Applicant: International Business Machines Corporation

Inventors: Matthias Boehm, Berthold Reinwald, Shirish Tatikonda
Dynamic recompilation techniques for machine learning programs

Patent number: 9715373

Abstract: The embodiments described herein relate to recompiling an execution plan of a machine-learning program during runtime. An execution plan of a machine-learning program is compiled. In response to identifying a directed acyclic graph of high-level operations (HOP DAG) for recompilation during runtime, the execution plan is dynamically recompiled. The dynamic recompilation includes updating statistics and dynamically rewriting one or more operators of the identified HOP DAG, recomputing memory estimates of operators of the rewritten HOP DAG based on the updated statistics and rewritten operators, constructing a directed acyclic graph of low-level operations (LOP DAG) corresponding to the rewritten HOP DAG based in part on the recomputed memory estimates, and generating runtime instructions based on the LOP DAG.

Type: Grant

Filed: December 18, 2015

Date of Patent: July 25, 2017

Assignee: International Business Machines Corporation

Inventors: Matthias Boehm, Berthold Reinwald, Shirish Tatikonda
DYNAMIC RECOMPILATION TECHNIQUES FOR MACHINE LEARNING PROGRAMS

Publication number: 20170177312

Abstract: The embodiments described herein relate to recompiling an execution plan of a machine-learning program during runtime. An execution plan of a machine-learning program is compiled. In response to identifying a directed acyclic graph of high-level operations (HOP DAG) for recompilation during runtime, the execution plan is dynamically recompiled. The dynamic recompilation includes updating statistics and dynamically rewriting one or more operators of the identified HOP DAG, recomputing memory estimates of operators of the rewritten HOP DAG based on the updated statistics and rewritten operators, constructing a directed acyclic graph of low-level operations (LOP DAG) corresponding to the rewritten HOP DAG based in part on the recomputed memory estimates, and generating runtime instructions based on the LOP DAG.

Type: Application

Filed: December 18, 2015

Publication date: June 22, 2017

Applicant: International Business Machines Corporation

Inventors: Matthias Boehm, Berthold Reinwald, Shirish Tatikonda
R-language integration with a declarative machine learning language

Patent number: 9684493

Abstract: In a method for analyzing a large data set using a statistical computing environment language operation, a processor generates code from the statistical computing environment language operation that can be understood by a software system for processing machine learning algorithms in a MapReduce environment. A processor transfers the code to the software system for processing machine learning algorithms in a MapReduce environment. A processor invokes execution of the code with the software system for processing machine learning algorithms in a MapReduce environment.

Type: Grant

Filed: June 2, 2014

Date of Patent: June 20, 2017

Assignee: International Business Machines Corporation

Inventors: Matthias Boehm, Douglas R. Burdick, Stefan Burnicki, Berthold Reinwald, Shirish Tatikonda
GLOBAL DATA FLOW OPTIMIZATION FOR MACHINE LEARNING PROGRAMS

Publication number: 20170147943

Abstract: A method for global data flow optimization for machine learning (ML) programs. The method includes receiving, by a storage device, an initial plan for an ML program. A processor builds a nested global data flow graph representation using the initial plan. Operator directed acyclic graphs (DAGs) are connected using crossblock operators according to inter-block data dependencies. The initial plan for the ML program is re-written resulting in an optimized plan for the ML program with respect to its global data flow properties. The re-writing includes re-writes of: configuration dataflow properties, operator selection and structural changes.

Type: Application

Filed: November 23, 2015

Publication date: May 25, 2017

Inventors: Matthias Boehm, Mathias Peters, Berthold Reinwald, Shirish Tatikonda
SINGLE-PASS DISTRIBUTED SAMPLING FROM BLOCK-PARTITIONED MATRICES

Publication number: 20170147673

Abstract: A computer-implemented method is provided that includes identifying an input dataset formatted as an input matrix, the input matrix including a plurality of rows and a plurality of columns. The computer-implemented method also includes dividing the input matrix into a plurality of input matrix blocks. Further, the computer-implemented method includes distributing the input matrix blocks to a plurality of different machines across a distributed filesystem, and sampling, by at least two of the different machines in parallel, at least two of the input matrix blocks. Finally, the computer-implemented method includes generating at least one sample matrix based on the sampling of the at least two of the input matrix blocks.

Type: Application

Filed: November 20, 2015

Publication date: May 25, 2017

Inventors: Douglas R. Burdick, Alexandre V. Evfimievski, Berthold Reinwald, Sebastian Schelter
Sparsity-driven matrix representation to optimize operational and storage efficiency

Patent number: 9652374

Abstract: Embodiments of the invention relate to sparsity-driven matrix representation. In one embodiment, a sparsity of a matrix is determined and the sparsity is compared to a threshold. Computer memory is allocated to store the matrix in a first data structure format based on the sparsity being greater than the threshold. Computer memory is allocated to store the matrix in a second data structure format based on the sparsity not being greater than the threshold.

Type: Grant

Filed: August 31, 2016

Date of Patent: May 16, 2017

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Berthold Reinwald, Shirish Tatikonda, Yuanyuan Tian
PIPELINED APPROACH TO FUSED KERNELS FOR OPTIMIZATION OF MACHINE LEARNING WORKLOADS ON GRAPHICAL PROCESSING UNITS

Publication number: 20170032487

Abstract: A method for optimization of machine learning (ML) workloads on a graphics processor unit (GPU). The method includes identifying a computation having a generic pattern commonly observed in ML processes. An optimized fused GPU kernel is employed to exploit temporal locality for inherent data-flow dependencies in the identified computation. Hierarchical aggregation spanning a memory hierarchy of the GPU for processing for the identified computation is performed. GPU kernel launch parameters are estimated following an analytical model that maximizes thread occupancy and minimizes atomic writes to GPU global memory.

Type: Application

Filed: July 30, 2015

Publication date: February 2, 2017

Inventors: Arash Ashari, Matthias Boehm, Keith W. Campbell, Alexandre Evfimievski, John D. Keenleyside, Berthold Reinwald, Shirish Tatikonda
SPARSITY-DRIVEN MATRIX REPRESENTATION TO OPTIMIZE OPERATIONAL AND STORAGE EFFICIENCY

Publication number: 20160364327

Abstract: Embodiments of the invention relate to sparsity-driven matrix representation. In one embodiment, a sparsity of a matrix is determined and the sparsity is compared to a threshold. Computer memory is allocated to store the matrix in a first data structure format based on the sparsity being greater than the threshold.

Type: Application

Filed: August 31, 2016

Publication date: December 15, 2016

Inventors: Berthold Reinwald, Shirish Tatikonda, Yuanyuan Tian
Placing a database

Patent number: 9483503

Abstract: A method and system for placing database. The method includes: receiving a request of creating a new database; determining whether there is a need to migrate current database among current virtual machines based on resource demand and free resource in the current virtual machines; determining database placement plan based on the resource demand, migration strategy and migration cost associated with the migration strategy in response to whether there is a need to migrate the database; and executing the database placement plan. The invention can help a database service provider to optimize database layout in database provision through database migration.

Type: Grant

Filed: May 24, 2013

Date of Patent: November 1, 2016

Assignee: International Business Machines Corporation

Inventors: Jie Qiu, Berthold Reinwald, Qi Rong Wang, Tao Yu, Lei Zhi
Sparsity-driven matrix representation to optimize operational and storage efficiency

Patent number: 9454472

Abstract: Embodiments of the invention relate to sparsity-driven matrix representation. In one embodiment, a sparsity of a matrix is determined and the sparsity is compared to a threshold. Computer memory is allocated to store the matrix in a first data structure format based on the sparsity being greater than the threshold. Computer memory is allocated to store the matrix in a second data structure format based on the sparsity not being greater than the threshold.

Type: Grant

Filed: April 15, 2016

Date of Patent: September 27, 2016

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Berthold Reinwald, Shirish Tatikonda, Yuanyuan Tian
SPARSITY-DRIVEN MATRIX REPRESENTATION TO OPTIMIZE OPERATIONAL AND STORAGE EFFICIENCY

Publication number: 20160217066

Abstract: Embodiments of the invention relate to sparsity-driven matrix representation. In one embodiment, a sparsity of a matrix is determined and the sparsity is compared to a threshold. Computer memory is allocated to store the matrix in a first data structure format based on the sparsity being greater than the threshold.

Type: Application

Filed: April 15, 2016

Publication date: July 28, 2016

Inventors: Berthold Reinwald, Shirish Tatikonda, Yuanyuan Tian
Sparsity-driven matrix representation to optimize operational and storage efficiency

Patent number: 9396164

Abstract: Embodiments of the invention relate to sparsity-driven matrix representation. In one embodiment, a sparsity of a matrix is determined and the sparsity is compared to a threshold. Computer memory is allocated to store the matrix in a first data structure format based on the sparsity being greater than the threshold. Computer memory is allocated to store the matrix in a second data structure format based on the sparsity not being greater than the threshold.

Type: Grant

Filed: October 21, 2013

Date of Patent: July 19, 2016

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Berthold Reinwald, Shirish Tatikonda, Yuanyuan Tian
HYBRID PARALLELIZATION STRATEGIES FOR MACHINE LEARNING PROGRAMS ON TOP OF MAPREDUCE

Publication number: 20160124730

Abstract: Parallel execution of machine learning programs is provided. Program code is received. The program code contains at least one parallel for statement having a plurality of iterations. A parallel execution plan is determined for the program code. According to the parallel execution plan, the plurality of iterations is partitioned into a plurality of tasks. Each task comprises at least one iteration. The iterations of each task are independent. Data required by the plurality of tasks is determined. An access pattern by the plurality of tasks of the data is determined. The data is partitioned based on the access pattern.

Type: Application

Filed: January 12, 2016

Publication date: May 5, 2016

Inventors: Matthias Boehm, Doughlas Burdick, Berthold Reinwald, Prithviraj Sen, Shirish Tatikonda, Yuanyuan Tian, Shivakumar Vaithyanathan
Hybrid parallelization strategies for machine learning programs on top of MapReduce

Patent number: 9286044

Abstract: Hybrid parallelization strategies for machine learning programs on top of MapReduce are provided. In one embodiment, a method of and computer program product for parallel execution of machine learning programs are provided. Program code is received. The program code contains at least one parallel for statement having a plurality of iterations. A parallel execution plan is determined for the program code. According to the parallel execution plan, the plurality of iterations is partitioned into a plurality of tasks. Each task comprises at least one iteration. The iterations of each task are independent.

Type: Grant

Filed: June 27, 2014

Date of Patent: March 15, 2016

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Matthias Boehm, Douglas Burdick, Berthold Reinwald, Prithviraj Sen, Shirish Tatikonda, Yuanyuan Tian, Shivakumar Vaithyanathan
SCALING DYNAMIC AUTHORITY-BASED SEARCH USING MATERIALIZED SUBGRAPHS

Publication number: 20160012059

Abstract: A method that includes generating, in a query pre-processor, a set of pre-computed materialized sub-graphs by executing a pre-processing dynamic random-walk based search for a bin of terms. The method also includes receiving, in a query processor, a search query having at least one search query term. In response to receiving the search query, the method includes accessing the set of pre-computed materialized sub-graphs. The accessing includes accessing a text index based on the search query term to retrieve a corresponding term group identifier and accessing the corresponding pre-computed materialized sub-graph based on the term group identifier. The method also includes executing a dynamic random-walk based search on only the corresponding pre-computed materialized sub-graph and based on the executing, retrieving nodes in the dataset and transmitting the nodes as results of the query.

Type: Application

Filed: September 21, 2015

Publication date: January 14, 2016

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Berthold Reinwald
HYBRID PARALLELIZATION STRATEGIES FOR MACHINE LEARNING PROGRAMS ON TOP OF MAPREDUCE

Publication number: 20150378696

Abstract: Hybrid parallelization strategies for machine learning programs on top of MapReduce are provided. In one embodiment, a method of and computer program product for parallel execution of machine learning programs are provided. Program code is received. The program code contains at least one parallel for statement having a plurality of iterations. A parallel execution plan is determined for the program code. According to the parallel execution plan, the plurality of iterations is partitioned into a plurality of tasks. Each task comprises at least one iteration. The iterations of each task are independent.

Type: Application

Filed: June 27, 2014

Publication date: December 31, 2015

Inventors: Matthias Boehm, Douglas Burdick, Berthold Reinwald, Prithviraj Sen, Shirish Tatikonda, Yuanyuan Tian, Shivakumar Vaithyanathan
R-LANGUAGE INTEGRATION WITH A DECLARATIVE MACHINE LEARNING LANGUAGE

Publication number: 20150347101

Abstract: In a method for analyzing a large data set using a statistical computing environment language operation, a processor generates code from the statistical computing environment language operation that can be understood by a software system for processing machine learning algorithms in a MapReduce environment. A processor transfers the code to the software system for processing machine learning algorithms in a MapReduce environment. A processor invokes execution of the code with the software system for processing machine learning algorithms in a MapReduce environment.

Type: Application

Filed: June 2, 2014

Publication date: December 3, 2015

Applicant: International Business Machines Corporation

Inventors: Matthias Boehm, Douglas R. Burdick, Stefan Burnicki, Berthold Reinwald, Shirish Tatikonda
Scaling dynamic authority-based search using materialized subgraphs

Patent number: 9171077

Abstract: According to one embodiment of the present invention, a method for processing a query is provided. The method includes generating a set of pre-computed materialized sub-graphs from a dataset and receiving a search query having one or more search query terms. A particular one of the pre-computed materialized sub-graphs is accessed and a dynamic authority-based keyword search is executed on the particular one of the pre-computed materialized sub-graphs. Nodes in the dataset are then retrieved based on the executing, and a response to the search query is provided which includes the retrieved nodes.

Type: Grant

Filed: February 27, 2009

Date of Patent: October 27, 2015

Assignee: International Business Machines Corporation

Inventors: Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Berthold Reinwald

prev 1 2 3 4 next