Patents by Inventor Yefim Shuf

Yefim Shuf has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multi-dimension variable predictive modeling for analysis acceleration

Patent number: 10452793

Abstract: In one example, a method for evaluating a system includes obtaining a model of the system that defines a boundary between at least one failure region and a non-failure region for a performance indicator with respect to at least one variable of the system. In one embodiment, obtaining the model involves constructing a new model; however, in other embodiments, obtaining the model involves accepting or retrieving a pre-constructed model is input. The method further includes obtaining importance samples for the at least one variable that are biased to the at least one failure region, and calculating indicator values for the performance indicator by applying the importance samples to the model.

Type: Grant

Filed: April 8, 2016

Date of Patent: October 22, 2019

Assignee: International Business Machines Corporation

Inventors: Rajiv V. Joshi, Yefim Shuf, Jonathan Sloan
Generating synthetic data

Patent number: 10171311

Abstract: A method of generating synthetic data from a model of a dataset comprises the steps of receiving a model of a dataset, extracting information from the received model, constructing a database view from the extracted information, receiving a query to the constructed database view, and generating synthetic data from the constructed database view according to the received query.

Type: Grant

Filed: October 17, 2013

Date of Patent: January 1, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Matthew C. Harvey, Niall F. McCarroll, Yefim Shuf
Method for sorting data

Patent number: 10089379

Abstract: A sequence of one or more input objects are sorted by identifying a property that is exhibited by a sequence of one or more input objects, determining whether each input object from the sequence of one or more input objects exhibits the property, storing each of the one or more input objects into a buffer, wherein an input object is stored in a first buffer if it exhibits the property and an input object is stored in a second buffer if it does not exhibit the property, sorting each of the one or more input objects in each buffer, and merging the one or more input objects in each buffer into a sequence of one or more input objects.

Type: Grant

Filed: August 18, 2008

Date of Patent: October 2, 2018

Assignee: International Business Machines Corporation

Inventors: Hong Min, Yefim Shuf, Hubertus Franke, Allan Benjamin Lebovitz, Christopher John Crone, Akiko Hoshikawa
Big data assistant

Patent number: 9940113

Abstract: Embodiments disclosed herein generally relate to techniques for managing processing of data in computing systems. One embodiment provides a method which includes receiving a set of inputs to apply to at least a first big data definition. Each big data definition encapsulates information specified in the set of inputs into a set of reusable software components. The method also includes generating, based on the reusable software components, a first executable code component for the at least first big data definition. The method further includes deploying the first executable code component to an executing application without interrupting services provided by the executing application to which the first executable code component is deployed.

Type: Grant

Filed: September 9, 2015

Date of Patent: April 10, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James D. Johnston, Ruiping Li, Sachin P. Sant, Yefim Shuf
BIG DATA ASSISTANT

Publication number: 20170068525

Abstract: Embodiments disclosed herein generally relate to techniques for managing processing of data in computing systems. One embodiment provides a method which includes receiving a set of inputs to apply to at least a first big data definition. Each big data definition encapsulates information specified in the set of inputs into a set of reusable software components. The method also includes generating, based on the reusable software components, a first executable code component for the at least first big data definition. The method further includes deploying the first executable code component to an executing application without interrupting services provided by the executing application to which the first executable code component is deployed.

Type: Application

Filed: September 9, 2015

Publication date: March 9, 2017

Inventors: James D. JOHNSTON, Ruiping Li, Sachin P. Sant, Yefim Shuf
Optimization of mixed database workload scheduling and concurrency control by mining data dependency relationships via lock tracking

Patent number: 9436721

Abstract: The present disclosure relates generally to the field of determining data access patterns associated with different workloads. In various examples, determining data access patterns associated with different workloads may be implemented in the form of systems, methods and/or algorithms. The present disclosure also relates generally to the field of scheduling workloads (and/or recommending scheduling for workloads) based upon the determined data access patterns associated with the different workloads. In various examples, scheduling workloads (and/or recommending scheduling for workloads) may be implemented in the form of systems, methods and/or algorithms.

Type: Grant

Filed: February 28, 2014

Date of Patent: September 6, 2016

Assignee: International Business Machines Corporation

Inventors: Donna N. Dillenberger, Hubertus Franke, Hong Min, Yefim Shuf
MULTI-DIMENSION VARIABLE PREDICTIVE MODELING FOR ANALYSIS ACCELERATION

Publication number: 20160224705

Abstract: In one example, a method for evaluating a system includes obtaining a model of the system that defines a boundary between at least one failure region and a non-failure region for a performance indicator with respect to at least one variable of the system. In one embodiment, obtaining the model involves constructing a new model; however, in other embodiments, obtaining the model involves accepting or retrieving a pre-constructed model is input. The method further includes obtaining importance samples for the at least one variable that are biased to the at least one failure region, and calculating indicator values for the performance indicator by applying the importance samples to the model.

Type: Application

Filed: April 8, 2016

Publication date: August 4, 2016

Inventors: Rajiv V. Joshi, Yefim Shuf, Jonathan Sloan
System and method for an efficient query sort of a data stream with duplicate key values

Patent number: 9235622

Abstract: An apparatus and method for efficiently performing a query sort on a data set with duplicate key values is disclosed. The method includes identifying unique key values for a key in a data set after determining that a number of duplicate key values for the key exceed a predefined threshold. The method also includes recording an association of each unique key value with a record in the data set and sorting unique key values. The method further includes storing the unique key values in a sorted order, wherein each unique key value is associated with an appropriate record in the data set.

Type: Grant

Filed: February 27, 2009

Date of Patent: January 12, 2016

Assignee: International Business Machines Corporation

Inventors: Yefim Shuf, Hong Min, Hubertus Franke, Allan B. Lebovitz
OPTIMIZATION OF MIXED DATABASE WORKLOAD SCHEDULING AND CONCURRENCY CONTROL BY MINING DATA DEPENDENCY RELATIONSHIPS VIA LOCK TRACING

Publication number: 20150248452

Abstract: The present disclosure relates generally to the field of determining data access patterns associated with different workloads. In various examples, determining data access patterns associated with different workloads may be implemented in the form of systems, methods and/or algorithms. The present disclosure also relates generally to the field of scheduling workloads (and/or recommending scheduling for workloads) based upon the determined data access patterns associated with the different workloads. In various examples, scheduling workloads (and/or recommending scheduling for workloads) may be implemented in the form of systems, methods and/or algorithms.

Type: Application

Filed: February 28, 2014

Publication date: September 3, 2015

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Donna N. Dillenberger, Hubertus Franke, Hong Min, Yefim Shuf
GENERATING SYNTHETIC DATA

Publication number: 20140115007

Abstract: A method of generating synthetic data from a model of a dataset comprises the steps of receiving a model of a dataset, extracting information from the received model, constructing a database view from the extracted information, receiving a query to the constructed database view, and generating synthetic data from the constructed database view according to the received query.

Type: Application

Filed: October 17, 2013

Publication date: April 24, 2014

Applicant: International Business Machines Corporation

Inventors: Matthew C. Harvey, Niall F. McCarroll, Yefim Shuf
Generating materialized query table candidates

Patent number: 8620899

Abstract: Techniques for generating a set of one or more materialized query table (MQT) candidates for a workload are provided. The techniques include receiving a workload, wherein the workload comprises a set of one or more queries, generating one or more best matching MQTs (BMQTs) based on one or more query blocks of the one or more queries by removing syntax that is not qualified for a MQT re-write, determining one or more frequently used multi-joins in the workload, using the one or more BMQTs and the one or more frequently used multi-joins to generate a set of one or more workload MQTs (WMQTs), and grouping one or more WMQTs and one or more BMQTs into one or more groups to merge into a set of a smaller number of MQTs and to cover the workload.

Type: Grant

Filed: February 9, 2010

Date of Patent: December 31, 2013

Assignee: International Business Machines Corporation

Inventors: Dongsheng Chen, Hong Min, Terence P. Purcell, Yefim Shuf, Xiaobo Wang, Zhongliang Zhang
Database table look-up

Patent number: 8359316

Abstract: Techniques for database table look-up are provided. The techniques include storing one or more column attributes of a database table in a data structure, wherein the data structure also comprises a record identification (RID) column of a table, one or more predicate columns corresponding to the RID column, and a sequence number column that is associated with one or more updated records, generating a key using one or more portions from one or more of the one or more predicate columns, using the key to partition the data structure, wherein partitioning the data structure comprises partitioning the one or more predicate columns for evaluation, and evaluating the one or more predicate columns against the data structure for each matching predicate column-data structure partition.

Type: Grant

Filed: March 1, 2010

Date of Patent: January 22, 2013

Assignee: International Business Machines Corporation

Inventors: Hubertus Franke, You-Chin Fuh, Hong Min, Terence P. Purcell, Yefim Shuf
Cache line replacement monitoring and profiling

Patent number: 8190824

Abstract: Systems and methods for cache replacement monitoring (CRM) are provided. The system includes a monitored cache comprising a monitored cache line set, the monitored cache line set comprising at least one cache line capable of holding data of a monitored address; and a CRM mechanism operatively associated with the monitored cache. The CRM mechanism collects CRM information for the monitored address. The method includes the steps of collecting CRM information for a monitored address in a monitored cache; and recording the CRM information for the monitored address, when at least one of (1) the monitored address is cached in the monitored cache, (2) the monitored address is replaced in the monitored cache, (3) any cache line in a cache line set corresponding to the monitored address is cached in the monitored cache, and (4) any cache line in a cache line set corresponding to the monitored address is replaced in the monitored cache.

Type: Grant

Filed: June 9, 2008

Date of Patent: May 29, 2012

Assignee: International Business Machines Corporation

Inventors: Xiaowei Shen, Yefim Shuf, Peter F. Sweeney
System and method for managing computer operations according to stored user preferences

Patent number: 8140979

Abstract: A system and method for managing computer operations according to stored user preferences is disclosed. The system includes a calendar module for scheduling an event and centrally storing the user preferences regarding the computer operations during the event, and a controller module for retrieving the user preferences centrally stored in the calendar module and centrally controlling the computer operations according to the user preferences during the event.

Type: Grant

Filed: August 15, 2008

Date of Patent: March 20, 2012

Assignee: International Business Machines Corporation

Inventors: Yefim Shuf, Alex Zlatsin, Dimitri Kanevsky, Genady Grabarnik
Method, apparatus, and computer program product for adaptive query parallelism partitioning with look-ahead probing and feedback

Patent number: 8140522

Abstract: A database query is partitioned into an initial partition including a plurality of parallel groups, and is executed, via an execution plan, based on the initial partition. A sampling subset of data is identified from the plurality of parallel groups. Substantially in parallel with the executing of the query, the execution plan is executed on the sampling subset of data as a sampling thread. The execution plan is modified based on feedback from the execution of the execution plan on the sampling subset of data.

Type: Grant

Filed: August 12, 2008

Date of Patent: March 20, 2012

Assignee: International Business Machines Corporation

Inventors: Hong Min, Yefim Shuf, Terence Patrick Purcell, You-Chin Fuh, Chunfeng Pei, Ou Jin
Storage allocation

Patent number: 8122219

Abstract: Techniques for storage allocation of a data record are provided. The techniques include attempting to identify a first location for storing a data record, wherein the data record comprises one or more data record attributes, if the first location is identified, selecting the first location for storing the data record, and if the first location is not identified, identifying a second location for storing the data record using a cost penalty function and selecting the second location for storing the data record based on the cost penalty function.

Type: Grant

Filed: July 22, 2009

Date of Patent: February 21, 2012

Assignee: International Business Machines Corporation

Inventors: Yefim Shuf, Hong Min, Hubertus Franke, Ulrich Alfons Finkler
Database Table Look-up

Publication number: 20110213775

Abstract: Techniques for database table look-up are provided. The techniques include storing one or more column attributes of a database table in a data structure, wherein the data structure also comprises a record identification (RID) column of a table, one or more predicate columns corresponding to the RID column, and a sequence number column that is associated with one or more updated records, generating a key using one or more portions from one or more of the one or more predicate columns, using the key to partition the data structure, wherein partitioning the data structure comprises partitioning the one or more predicate columns for evaluation, and evaluating the one or more predicate columns against the data structure for each matching predicate column-data structure partition.

Type: Application

Filed: March 1, 2010

Publication date: September 1, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Hubertus Franke, You-Chin Fuh, Hong Min, Terence P. Purcell, Yefim Shuf
Generating Materialized Query Table Candidates

Publication number: 20110196857

Abstract: Techniques for generating a set of one or more materialized query table (MQT) candidates for a workload are provided. The techniques include receiving a workload, wherein the workload comprises a set of one or more queries, generating one or more best matching MQTs (BMQTs) based on one or more query blocks of the one or more queries by removing syntax that is not qualified for a MQT re-write, determining one or more frequently used multi-joins in the workload, using the one or more BMQTs and the one or more frequently used multi-joins to generate a set of one or more workload MQTs (WMQTs), and grouping one or more WMQTs and one or more BMQTs into one or more groups to merge into a set of a smaller number of MQTs and to cover the workload.

Type: Application

Filed: February 9, 2010

Publication date: August 11, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Dong Sheng Chen, Hong Min, Terence P. Purcell, Yefim Shuf, Xiao Bo Wang, Zhong Liang Zhang
Method for partitioning a query

Patent number: 7930294

Abstract: Techniques for partitioning a query are provided. The techniques include establishing one or more criterion for partitioning a query, wherein the query comprises one or more tables, materializing a first of the one or more tables, partitioning the first of the one or more tables until the one or more criterion have been satisfied, and partitioning and joining a remainder of the one or more tables of the query.

Type: Grant

Filed: August 12, 2008

Date of Patent: April 19, 2011

Assignee: International Business Machines Corporation

Inventors: Yefim Shuf, Hong Min, Terence Patrick Purcell, Ou Jin, Fen-Ling Lin, Brian Thinh-Vinh Tran, Patrick Dooling Bossman
STORAGE ALLOCATION

Publication number: 20110022815

Abstract: Techniques for storage allocation of a data record are provided. The techniques include attempting to identify a first location for storing a data record, wherein the data record comprises one or more data record attributes, if the first location is identified, selecting the first location for storing the data record, and if the first location is not identified, identifying a second location for storing the data record using a cost penalty function and selecting the second location for storing the data record based on the cost penalty function.

Type: Application

Filed: July 22, 2009

Publication date: January 27, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Yefim Shuf, Hong Min, Hubertus Franke, Ulrich Alfons Finkler

1 2 next