Patents by Inventor Yuan-Chi Chang

Yuan-Chi Chang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Resource exploitation management system, method and program product

Patent number: 11163751

Abstract: A resource exploitation management system, method and a computer program product therefor. A description of new geological evidence for a geological resource is received, e.g., as one or more triples describing the evidence. Keywords in the description are matched against keywords in representations in a geological resource database. Geological relations are inferred from the descriptions and matched against predefined geological relations from the geological resource database. Consistent triple matches are merged with the geological resource database. The confidence level for merged matches is updated in the geological resource database.

Type: Grant

Filed: January 17, 2019

Date of Patent: November 2, 2021

Assignee: International Business Machines Corporation

Inventors: Sonia Mariette Embid Droz, Cristina Ibanez-Llano, Giorgio De Paola, Ruben Rodriguez Torrado, Akiko Suzuki, Mustafa Canim, Yuan-Chi Chang, Robert Farrell, Sharon M. Trewin
Pre-filtering of join execution over multi-column range summaries and other synopses

Patent number: 11126503

Abstract: Techniques are provided for pre-filtering of join execution over multi-column range summaries and other synopses. An exemplary method comprises maintaining a synopsis for a plurality of data tables, wherein a given synopsis summarizes a set of records in a corresponding data table; and, in response to a request for a join operation for a set of the data tables: joining the synopses associated with the set of data tables to generate a joined synopsis; for joined records in the joined synopsis, obtaining corresponding records from the set of data tables as candidate records; and joining the candidate records. Two or more of the set of data tables can be distributed across a plurality of nodes and the synopses can be replicated and/or broadcasted across the plurality of nodes. Incremental updates to broadcasted and/or replicated synopses are optionally provided to at least one node.

Type: Grant

Filed: August 10, 2016

Date of Patent: September 21, 2021

Assignee: International Business Machines Corporation

Inventors: Yuan Chi Chang, Timothy Ray Malkemus, Mohammad Sadoghi Hamedani
Testing and modifying calendar and event sensitive timer series data analytics

Patent number: 11099979

Abstract: A mechanism is provided to identify wall-clock time reference dependency in one or more software components of a data analytics solution. The data analytics solution is decomposed into a set of software components. A first software component of the set of software components is deployed to a first computer server and the remaining software components are deployed to a second computer server. A system clock time on the first computer server is changed to differ from the system clock of the second computer server. Based on executing a test on the data analytics solution, a determination is made of whether the first software component, is wall-clock time independent. Responsive to the test of the of the software component failing indicating that the wall-clock time of the software component is dependent of the system clock time difference, the software component is recorded as wall-clock time dependent and an administrator is notified.

Type: Grant

Filed: October 31, 2019

Date of Patent: August 24, 2021

Assignee: International Business Machines Corporation

Inventors: Yuan-Chi Chang, Long Vu, Timothy R. Dinger, Venkata N. Pavuluri, Lingtao Cao
Scenario planning and management

Patent number: 11030561

Abstract: Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.

Type: Grant

Filed: December 14, 2017

Date of Patent: June 8, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Yuan-Chi Chang, Mark D. Feblowitz, Nagui Halim, Stuart S. Horn, Edward J. Pring, Anton V. Riabov, Edward W. Shay, Shirin Sohrabi Araghi, Deepak S. Turaga, Octavian Udrea, Fang Yuan, Peter Zimmer
Scenario planning and risk management

Patent number: 11023840

Abstract: Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.

Type: Grant

Filed: January 27, 2017

Date of Patent: June 1, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Yuan-Chi Chang, Mark D. Feblowitz, Nagui Halim, Stuart S. Horn, Edward J. Pring, Anton V. Riabov, Edward W. Shay, Shirin Sohrabi Araghi, Deepak S. Turaga, Octavian Udrea, Fang Yuan, Peter Zimmer
Context Driven Model Selection

Publication number: 20210158084

Abstract: Mechanism are provided to select a machine learning model from an analytics model library based on ingested data. One or more pieces of clarified data are fused to provide time-correlated data tuples of data streams. One or more features are extracted from the time-correlated data tuples and scored based on a set of predetermined rules thereby generating discriminative scoring of trigger data. Utilizing the discriminative scoring of the trigger data, trigger data of a current analytics model being utilized by the data processing and one or more new analytics models from the analytics model library are scored. Responsive to the scoring of the trigger data indicating a selection of a different analytics model from the analytics model library, the current analytics model is replaced with a selected analytics model from the analytics model library such that the data processing system executes the selected analytics model.

Type: Application

Filed: November 22, 2019

Publication date: May 27, 2021

Inventors: Timothy R. Dinger, Yuan-Chi Chang, Long Vu, Venkata N. Pavuluri, Lingtao Cao
Data Loss Machine Learning Model Update

Publication number: 20210142211

Abstract: A mechanism is provided for implementing a model update mechanism to update new models in real time while avoiding data loss and system downtime. Responsive to receiving a request to update a scorer model currently being executed by an existing worker thread in the data processing system, the model update mechanism initializing a new worker thread. The model update mechanism loads an updated scorer model into the new worker thread and initializes a state transfer from the existing worker thread to the new worker thread. The model update mechanism executes the updated scorer model such that the updated scorer model scores the input data. The model update mechanism then outputs a prediction based on the updated scorer model processing of the input data.

Type: Application

Filed: November 12, 2019

Publication date: May 13, 2021

Inventors: Long Vu, Yuan-Chi Chang, Timothy R. Dinger, Venkata N. Pavuluri, Lingtao Cao
AUTOMATED DATA AND LABEL CREATION FOR SUPERVISED MACHINE LEARNING REGRESSION TESTING

Publication number: 20210142222

Abstract: Split an input dataset into training and test datasets; the former includes a plurality of data examples, each represented as a feature vector, and having an associated true label. Split the training dataset into a plurality of training data subsets; for each, train a corresponding machine learning model to obtain a plurality of such models, and apply same to the test dataset to obtain a plurality of predicted labels and prediction scores. For each of the plurality of examples, compute an agreement metric based on a corresponding one of the associated true labels; corresponding ones of the predicted labels; and corresponding ones of the prediction scores. Based on the computed metric, select, for at least some of the true label values, appropriate ones of the data examples to be added to a regression set. Add the appropriate ones of the data examples from the test dataset to the regression set.

Type: Application

Filed: November 13, 2019

Publication date: May 13, 2021

Inventors: Yuan-Chi Chang, Deepak Srinivas Turaga, Long Vu, Venkata Nagaraju Pavuluri, Saket Sathe, Rodrigue Ngueyep Tzoumpe
Testing and Modifying Calendar and Event Sensitive Timer Series Data Analytics

Publication number: 20210133090

Abstract: A mechanism is provided to identify wall-clock time reference dependency in one or more software components of a data analytics solution. The data analytics solution is decomposed into a set of software components. A first software component of the set of software components is deployed to a first computer server and the remaining software components are deployed to a second computer server. A system clock time on the first computer server is changed to differ from the system clock of the second computer server. Based on executing a test on the data analytics solution, a determination is made of whether the first software component, is wall-clock time independent. Responsive to the test of the of the software component failing indicating that the wall-clock time of the software component is dependent of the system clock time difference, the software component is recorded as wall-clock time dependent and an administrator is notified.

Type: Application

Filed: October 31, 2019

Publication date: May 6, 2021

Inventors: Yuan-Chi Chang, Long Vu, Timothy R. Dinger, Venkata N. Pavuluri, Lingtao Cao
ENHANCED ENSEMBLE MODEL DIVERSITY AND LEARNING

Publication number: 20200327456

Abstract: Embodiments for implementing enhanced ensemble model diversity and learning by a processor. One or more data sets may be created by combining one or more clusters of data points of a minority class with selected data points of a majority class. One or more ensemble models may be created from the one or more data sets using a supervised machine learning operation. An occurrence of an event may be predicted using the one or more ensemble models.

Type: Application

Filed: April 11, 2019

Publication date: October 15, 2020

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Saket SATHE, Deepak TURAGA, Charu AGGARWAL, Raju PAVULURI, Yuan-Chi CHANG
RESOURCE EXPLOITATION MANAGEMENT SYSTEM, METHOD AND PROGRAM PRODUCT

Publication number: 20200233851

Abstract: A resource exploitation management system, method and a computer program product therefor. A description of new geological evidence for a geological resource is received, e.g., as one or more triples describing the evidence. Keywords in the description are matched against keywords in representations in a geological resource database. Geological relations are inferred from the descriptions and matched against predefined geological relations from the geological resource database. Consistent triple matches are merged with the geological resource database. The confidence level for merged matches is updated in the geological resource database.

Type: Application

Filed: January 17, 2019

Publication date: July 23, 2020

Applicants: REPSOL, S. A., International Business Machines Corporation

Inventors: Sonia Mariette Embid Droz, Cristina Ibanez-Llano, Giorgio De Paola, Ruben Rodriguez Torrado, Akiko Suzuki, Mustafa Canim, Yuan-Chi Chang, Robert Farrell, Sharon M. Trewin
INDEX MAINTENANCE BASED ON A COMPARISON OF REBUILD VS. UPDATE

Publication number: 20200142879

Abstract: A method, system and computer program product for index maintenance in a computer system comprising a plurality of nodes, a database, and an index to the database. In one embodiment, the method comprises, for a defined period of time, building a snapshot of selected change requests received by the nodes to change the database. After this defined period of time, a selection is made, based on specified criteria, whether to rebuild a new index to the database, or to add entries to a current index. When the selection is to rebuild a new index, the new index is rebuilt based on data in the database and in the change requests in the snapshot. When the selection is to add entries to a current index, entries are added to the current index based on data in the database and in the change requests in the snapshot.

Type: Application

Filed: January 8, 2020

Publication date: May 7, 2020

Inventors: Yuan-chi Chang, Jason Crawford, Liana L. Fong, Wei Tan
Index maintenance based on a comparison of rebuild vs. update

Patent number: 10579608

Abstract: A method, system and computer program product for index maintenance in a computer system comprising a plurality of nodes, a database, and an index to the database. In one embodiment, the method comprises, for a defined period of time, building a snapshot of selected change requests received by the nodes to change the database. After this defined period of time, a selection is made, based on specified criteria, whether to rebuild a new index to the database, or to add entries to a current index. When the selection is to rebuild a new index, the new index is rebuilt based on data in the database and in the change requests in the snapshot. When the selection is to add entries to a current index, entries are added to the current index based on data in the database and in the change requests in the snapshot.

Type: Grant

Filed: June 7, 2018

Date of Patent: March 3, 2020

Assignee: International Business Machines Corporation

Inventors: Yuan-chi Chang, Jason Crawford, Liana L. Fong, Wei Tan
Methods and apparatus for incremental frequent subgraph mining on dynamic graphs

Patent number: 10409828

Abstract: Methods and apparatus are provided for incremental frequent subgraph mining on dynamic graphs. An exemplary subgraph mining method comprises maintaining a set of embeddings comprising matching embeddings of a given subgraph in an input graph; maintaining a first fringe set of subgraphs comprising subgraphs substantially on a fringe of frequent subgraphs in the input graph that satisfy a predefined support threshold; maintaining a second fringe set of subgraphs comprising subgraphs substantially on a fringe of infrequent subgraphs in the input graph that do not satisfy the predefined support threshold; for an edge addition, checking a support of the subgraphs in the second fringe set based on the set of the embeddings and searching for new embeddings created by the edge addition; and for an edge deletion, removing obsolete embeddings that comprise the deleted edge from the first fringe set based on the set of embeddings.

Type: Grant

Filed: July 29, 2016

Date of Patent: September 10, 2019

Assignee: International Business Machines Corporation

Inventors: Ehab Abdelhamid Mohammed Abdelhamid, Bishwaranjan Bhattacharjee, Mustafa Canim, Yuan Chi Chang, Mohammad Sadoghi Hamedani
Data skipping and compression through partitioning of data

Patent number: 10289707

Abstract: Conventionally, in addition to indexing, a synopsis of a base table of a database is used to skip and compress data. However, scanning of the entire synopsis for all queries is required, which takes a long time when the synopsis gets significantly big in a large data warehouse. A method for efficient data skipping and compression through vertical partitioning of data is provided to eliminate the cost of synopsis storage overhead while enabling the synopsis search functionality.

Type: Grant

Filed: August 10, 2015

Date of Patent: May 14, 2019

Assignee: International Business Machines Corporation

Inventors: Yuan-Chi Chang, Timothy R. Malkemus, Mohammad Sadoghi Hamedani
Analytics integration workbench within a comprehensive framework for composing and executing analytics applications in business level languages

Patent number: 10242406

Abstract: Systems, methods and articles of manufacture are disclosed for building and executing analytics solutions. Such a solution may provide a comprehensive analytics solution (e.g., a risk assessment, fraud detection solution, dynamic operational risk evaluations, regulatory compliance assessments, etc.). The analytics solution may perform an analytics task using operational data distributed across a variety of independently created and governed data repositories in different departments of an organization. A framework is disclosed which allows a user (e.g., a risk analyst) to compose analytical tools that can access data from a variety of sources (both internal and external to an enterprise) and perform a variety of analytic functions.

Type: Grant

Filed: September 14, 2009

Date of Patent: March 26, 2019

Assignee: International Business Machines Corporation

Inventors: Manoj Kumar, Yuan-Chi Chang, Chung-Sheng Li
Analytics information directories within a comprehensive framework for composing and executing analytics applications in business level languages

Patent number: 10127299

Abstract: Systems, methods and articles of manufacture are disclosed for building and executing analytics solutions. Such a solution may provide a comprehensive analytics solution (e.g., a risk assessment, fraud detection solution, dynamic operational risk evaluations, regulatory compliance assessments, etc.). The analytics solution may perform an analytics task using operational data distributed across a variety of independently created and governed data repositories in different departments of an organization. A framework is disclosed which allows a user (e.g., a risk analyst) to compose analytical tools that can access data from a variety of sources (both internal and external to an enterprise) and perform a variety of analytic functions.

Type: Grant

Filed: September 14, 2009

Date of Patent: November 13, 2018

Assignee: International Business Machines Corporation

Inventors: Manoj Kumar, Yuan Chi Chang, Chung-Sheng Li
INDEX MAINTENANCE BASED ON A COMPARISON OF REBUILD VS. UPDATE

Publication number: 20180285400

Abstract: A method, system and computer program product for index maintenance in a computer system comprising a plurality of nodes, a database, and an index to the database. In one embodiment, the method comprises, for a defined period of time, building a snapshot of selected change requests received by the nodes to change the database. After this defined period of time, a selection is made, based on specified criteria, whether to rebuild a new index to the database, or to add entries to a current index. When the selection is to rebuild a new index, the new index is rebuilt based on data in the database and in the change requests in the snapshot. When the selection is to add entries to a current index, entries are added to the current index based on data in the database and in the change requests in the snapshot.

Type: Application

Filed: June 7, 2018

Publication date: October 4, 2018

Inventors: Yuan-chi Chang, Jason Crawford, Liana L. Fong, Wei Tan
Parallel top-K simple shortest paths discovery

Patent number: 10050866

Abstract: A method for searching the top-K simple shortest paths between a specified source node and a specified target node in a graph, with graph data partitioned and distributed across a plurality of computing servers, the method including a parallel path search initialized from either one or both of the source and target nodes and traversing the graph by building likely path sequences for a match. Each computing server determines and forwards a path sequence as discovery progresses until the top-K paths are discovered.

Type: Grant

Filed: December 3, 2015

Date of Patent: August 14, 2018

Assignee: International Business Machines Corporation

Inventors: Yuan-Chi Chang, Mustafa Canim
SCENARIO PLANNING AND RISK MANAGEMENT

Publication number: 20180218300

Abstract: Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.

Type: Application

Filed: December 14, 2017

Publication date: August 2, 2018

Inventors: Yuan-Chi Chang, Mark D. Feblowitz, Nagui Halim, Stuart S. Horn, Edward J. Pring, Anton V. Riabov, Edward W. Shay, Shirin Sohrabi Araghi, Deepak S. Turaga, Octavian Udrea, Fang Yuan, Peter Zimmer

prev 1 2 3 4 5 6 … next